Poincaré-Map-Based Reinforcement Learning For Biped Walking

Jun Morimoto, Jun Nakanishi, Gen Endo, Gordon Cheng, Christopher G. Atkeson, Garth Zeglin

发表年份: 2006
引用次数: 60

摘要

We propose a model-based reinforcement learning algorithm for biped walking in which the robot learns to appropriately modulate an observed walking pattern. Via-points are detected from the observed walking trajectories using the minimum jerk criterion. The learning algorithm modulates the via-points as control actions to improve walking trajectories. This decision is based on a learned model of the Poincaré map of the periodic walking pattern. The model maps from a state in the single support phase and the control actions to a state in the next single support phase. We applied this approach to both a simulated robot model and an actual biped robot. We show that successful walking policies are acquired.

关键词

Reinforcement learningRobotBiped robotComputer scienceJerkControl theory (sociology)State (computer science)Artificial intelligenceTrajectoryControl (management)

Poincaré-Map-Based Reinforcement Learning For Biped Walking

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory