首页 /研究 /Using Reward-weighted Regression for Reinforcement Learning of Task Space Control

LEARNING

Using Reward-weighted Regression for Reinforcement Learning of Task Space Control

Jan Peters, Stefan Schaal

发表年份: 2007
引用次数: 15

摘要

Many robot control problems of practical importance, including task or operational space control, can be reformulated as immediate reward reinforcement learning problems. However, few of the known optimization or reinforcement learning algorithms can be used in online learning control for robots, as they are either prohibitively slow, do not scale to interesting domains of complex robots, or require trying out policies generated by random search, which are infeasible for a physical system. Using a generalization of the EM-base reinforcement learning framework suggested by Dayan & Hinton, we reduce the problem of learning with immediate rewards to a reward-weighted regression problem with an adaptive, integrated reward transformation for faster convergence. The resulting algorithm is efficient, learns smoothly without dangerous jumps in solution space, and works well in applications of complex high degree-of-freedom robots

关键词

Reinforcement learningComputer scienceRobotGeneralizationTask (project management)Artificial intelligenceConvergence (economics)Robot learningMachine learningControl (management)

Using Reward-weighted Regression for Reinforcement Learning of Task Space Control

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory