首页 /研究 /Combining Local and Global Direct Derivative-Free Optimization for Reinforcement Learning

LEARNING

Combining Local and Global Direct Derivative-Free Optimization for Reinforcement Learning

Matteo Leonetti, Petar Kormushev, Simone Sagratella

发表年份: 2012
引用次数: 17
访问权限: 开放获取

摘要

Abstract We consider the problem of optimization in policy space for reinforcement learning. While a plethora of methods have been applied to this problem, only a narrow category of them proved feasible in robotics. We consider the peculiar characteristics of reinforcement learning in robotics, and devise a combination of two algorithms from the literature of derivative-free optimization. The proposed combination is well suited for robotics, as it involves both off-line learning in simulation and on-line learning in the real environment. We demonstrate our approach on a real-world task, where an Autonomous Underwater Vehicle has to survey a target area under potentially unknown environment conditions. We start from a given controller, which can perform the task under foreseeable conditions, and make it adaptive to the actual environment.

关键词

Reinforcement learningRoboticsArtificial intelligenceComputer scienceTask (project management)Robot learningController (irrigation)Machine learningRobotMobile robot

Combining Local and Global Direct Derivative-Free Optimization for Reinforcement Learning

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory