Average Reward Reinforcement Learning: Foundations, Algorithms, and Empirical Results

Sridhar Mahadevan

Publication year
2007
Citations
23

Keywords

Reinforcement learning, Computer science, Convergence (economics), Temporal difference learning, Asynchronous communication, Metric (unit), Artificial intelligence, Learning automata, Performance metric, Dynamic programming
