Home /Research /Combining Local and Global Direct Derivative-Free Optimization for Reinforcement Learning
LEARNING

Combining Local and Global Direct Derivative-Free Optimization for Reinforcement Learning

Matteo Leonetti, Petar Kormushev, Simone Sagratella

Year
2012
Citations
17
Access
Open access

Abstract

Abstract We consider the problem of optimization in policy space for reinforcement learning. While a plethora of methods have been applied to this problem, only a narrow category of them proved feasible in robotics. We consider the peculiar characteristics of reinforcement learning in robotics, and devise a combination of two algorithms from the literature of derivative-free optimization. The proposed combination is well suited for robotics, as it involves both off-line learning in simulation and on-line learning in the real environment. We demonstrate our approach on a real-world task, where an Autonomous Underwater Vehicle has to survey a target area under potentially unknown environment conditions. We start from a given controller, which can perform the task under foreseeable conditions, and make it adaptive to the actual environment.

Keywords

Reinforcement learningRoboticsArtificial intelligenceComputer scienceTask (project management)Robot learningController (irrigation)Machine learningRobotMobile robot

Related papers

Browse all LEARNING papers