首页 /研究 /Learning CPG-based biped locomotion with a policy gradient method
LOCOMOTION

Learning CPG-based biped locomotion with a policy gradient method

Takamitsu Matsubara, Jun Morimoto, Jun Nakanishi, Masa-aki Sato, Kenji Doya

发表年份
2006
引用次数
10

摘要

Recently, CPG-based controllers have been widely explored to achieve robust biped locomotion. However, this approach has difficulties in tuning open parameters in the controller. In this paper, we present a learning framework for CPG-based biped locomotion with a policy gradient method. We demonstrate that appropriate sensory feedback in the CPG-based control architecture can be acquired using the proposed method within a thousand trials by numerical simulations. We analyze linear stability of a periodic orbit of the acquired biped walking considering a return map. Furthermore, we apply the learned controllers in numerical simulations to our physical 5-link robot in order to empirically evaluate the effectiveness of the proposed framework. Experimental results suggest the robustness of the acquired controllers against environmental changes and variations in the mass properties of the robot

关键词

Robustness (evolution)Control theory (sociology)Computer scienceBiped robotRobotInverted pendulumController (irrigation)Control engineeringNonlinear systemArtificial intelligence

相关论文

查看 LOCOMOTION 分类全部论文