Home /Research /Learning CPG sensory feedback with policy gradient for biped locomotion for a full-body humanoid

LOCOMOTION

Learning CPG sensory feedback with policy gradient for biped locomotion for a full-body humanoid

Gen Endo, Jun Morimoto, Takamitsu Matsubara, Jun Nakanishi, Gordon Cheng

Year: 2005
Citations: 34

Abstract

This paper describes a learning framework for a central pattern generator based biped locomotion controller using a policy gradient method. Our goals in this study are to achieve biped walking with a 3D hardware humanoid, and to develop an efficient learning algorithm with CPG by reducing the dimensionality of the state space used for learning. We demonstrate that an appropriate feed-back controller can be acquired within a thousand trials by numerical simulations and the obtained controller in numerical simulation achieves stable walking with a physical robot in the real world. Numerical simulations and hardware experiments evaluated walking velocity and stability. Furthermore, we present the possibility of an additional online learning using a hardware robot to improve the controller within 200 iterations.

Keywords

Humanoid robotController (irrigation)Computer scienceControl theory (sociology)Stability (learning theory)Central pattern generatorRobotBiped robotReinforcement learningSimulation

Learning CPG sensory feedback with policy gradient for biped locomotion for a full-body humanoid

Abstract

Keywords

Related papers

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory