Home /Research /Autonomous Reinforcement Learning with Experience Replay for Humanoid Gait Optimization

LOCOMOTION

Autonomous Reinforcement Learning with Experience Replay for Humanoid Gait Optimization

Paweł Wawrzyński

Year: 2012
Citations: 9

Abstract

This paper demonstrates application of Reinforcement Learning to optimization of control of a complex system in realistic setting that requires efficiency and autonomy of the learning algorithm. Namely, Actor-Critic with experience replay (which addresses efficiency), and the Fixed Point method for step-size estimation (which addresses autonomy) is applied here to approximately optimize humanoid robot gait. With complex dynamics and tens of continuous state and action variables, humanoid gait optimization represents a challenge for analytical synthesis of control. The presented algorithm learns a nimble gait within 80 minutes of training.

Keywords

Computer scienceReinforcement learningHumanoid robotGaitArtificial intelligenceControl (management)AutonomyRobotHuman–computer interactionPhysical medicine and rehabilitation

Autonomous Reinforcement Learning with Experience Replay for Humanoid Gait Optimization

Abstract

Keywords

Related papers

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory