首页 /研究 /Learning rate free reinforcement learning for real-time motion control using a value-gradient based policy
LEARNING

Learning rate free reinforcement learning for real-time motion control using a value-gradient based policy

J.C. Van Rooijen, I. Grondman, Robert Babuška

发表年份
2014
引用次数
12

关键词

Reinforcement learningComputer scienceTemporal difference learningQ-learningTask (project management)Bellman equationController (irrigation)Inverted pendulumControl theory (sociology)Artificial intelligence

相关论文

查看 LEARNING 分类全部论文