Home /Research /Combining Local and Global Direct Derivative-Free Optimization for Reinforcement Learning

LEARNING

Combining Local and Global Direct Derivative-Free Optimization for Reinforcement Learning

Matteo Leonetti, Petar Kormushev, Simone Sagratella

Year: 2012
Citations: 17
Access: Open access

Abstract

Abstract We consider the problem of optimization in policy space for reinforcement learning. While a plethora of methods have been applied to this problem, only a narrow category of them proved feasible in robotics. We consider the peculiar characteristics of reinforcement learning in robotics, and devise a combination of two algorithms from the literature of derivative-free optimization. The proposed combination is well suited for robotics, as it involves both off-line learning in simulation and on-line learning in the real environment. We demonstrate our approach on a real-world task, where an Autonomous Underwater Vehicle has to survey a target area under potentially unknown environment conditions. We start from a given controller, which can perform the task under foreseeable conditions, and make it adaptive to the actual environment.

Keywords

Reinforcement learningRoboticsArtificial intelligenceComputer scienceTask (project management)Robot learningController (irrigation)Machine learningRobotMobile robot

Combining Local and Global Direct Derivative-Free Optimization for Reinforcement Learning

Abstract

Keywords

Related papers

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory