首页 /研究 /Average reward reinforcement learning: Foundations, algorithms, and empirical results
LEARNING

Average reward reinforcement learning: Foundations, algorithms, and empirical results

Sridhar Mahadevan

发表年份
1996
引用次数
401
访问权限
开放获取

关键词

Reinforcement learningComputer scienceConvergence (economics)Temporal difference learningArtificial intelligenceMachine learningLearning automataMetric (unit)Performance metricAsynchronous communication

相关论文

查看 LEARNING 分类全部论文