首页 /研究 /Poincaré-Map-Based Reinforcement Learning For Biped Walking
LEARNING

Poincaré-Map-Based Reinforcement Learning For Biped Walking

Jun Morimoto, Jun Nakanishi, Gen Endo, Gordon Cheng, Christopher G. Atkeson, Garth Zeglin

发表年份
2006
引用次数
60

摘要

We propose a model-based reinforcement learning algorithm for biped walking in which the robot learns to appropriately modulate an observed walking pattern. Via-points are detected from the observed walking trajectories using the minimum jerk criterion. The learning algorithm modulates the via-points as control actions to improve walking trajectories. This decision is based on a learned model of the Poincaré map of the periodic walking pattern. The model maps from a state in the single support phase and the control actions to a state in the next single support phase. We applied this approach to both a simulated robot model and an actual biped robot. We show that successful walking policies are acquired.

关键词

Reinforcement learningRobotBiped robotComputer scienceJerkControl theory (sociology)State (computer science)Artificial intelligenceTrajectoryControl (management)

相关论文

查看 LEARNING 分类全部论文