
LOCOMOTION

Learning CPG sensory feedback with policy gradient for biped locomotion for a full-body humanoid

Gen Endo, Jun Morimoto, Takamitsu Matsubara, Jun Nakanishi, Gordon Cheng

Year of publication: 2005
Citations: 34

Abstract

This paper describes a learning framework for a central pattern generator (CPG) based biped locomotion controller using a policy gradient method. Our goals in this study are to achieve biped walking with a 3D hardware humanoid, and to develop an efficient learning algorithm with CPG by reducing the dimensionality of the state space used for learning. We demonstrate that an appropriate feedback controller can be acquired within a thousand trials in numerical simulation, and that the controller obtained in simulation achieves stable walking with a physical robot in the real world. Walking velocity and stability were evaluated in both numerical simulations and hardware experiments. Furthermore, we present the possibility of additional online learning using a hardware robot to improve the controller within 200 iterations.

Keywords

Humanoid robot, Controller (irrigation), Computer science, Control theory (sociology), Stability (learning theory), Central pattern generator, Robot, Biped robot, Reinforcement learning, Simulation

