An extended policy gradient algorithm for robot task learning

Andrea Cherubini, Elisa Giannone, Luca Iocchi, P. E. Palamara

发表年份: 2007
引用次数: 4

摘要

In real-world robotic applications, many factors, both at low-level (e.g., vision and motion control parameters) and at high-level (e.g., the behaviors) determine the quality of the robot performance. Thus, for many tasks, robots require fine tuning of the parameters, in the implementation of behaviors and basic control actions, as well as in strategic decisional processes. In recent years, machine learning techniques have been used to find optimal parameter sets for different behaviors. However, a drawback of learning techniques is time consumption: in practical applications, methods designed for physical robots must be effective with small amounts of data. In this paper, we present a method for concurrent learning of best strategy and optimal parameters, by extending the policy gradient reinforcement learning algorithm. The results of our experimental work in a simulated environment and on a real robot show a very high convergence rate.

关键词

Reinforcement learningComputer scienceRobotTask (project management)Convergence (economics)Control (management)Robot learningArtificial intelligenceRobot controlQuality (philosophy)

An extended policy gradient algorithm for robot task learning

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory