首页 /研究 /Combining Local and Global Direct Derivative-Free Optimization for Reinforcement Learning
LEARNING

Combining Local and Global Direct Derivative-Free Optimization for Reinforcement Learning

Matteo Leonetti, Petar Kormushev, Simone Sagratella

发表年份
2012
引用次数
17
访问权限
开放获取

摘要

Abstract We consider the problem of optimization in policy space for reinforcement learning. While a plethora of methods have been applied to this problem, only a narrow category of them proved feasible in robotics. We consider the peculiar characteristics of reinforcement learning in robotics, and devise a combination of two algorithms from the literature of derivative-free optimization. The proposed combination is well suited for robotics, as it involves both off-line learning in simulation and on-line learning in the real environment. We demonstrate our approach on a real-world task, where an Autonomous Underwater Vehicle has to survey a target area under potentially unknown environment conditions. We start from a given controller, which can perform the task under foreseeable conditions, and make it adaptive to the actual environment.

关键词

Reinforcement learningRoboticsArtificial intelligenceComputer scienceTask (project management)Robot learningController (irrigation)Machine learningRobotMobile robot

相关论文

查看 LEARNING 分类全部论文