首页 /研究 /Bipedal walking energy minimization by reinforcement learning with evolving policy parameterization

LOCOMOTION

Bipedal walking energy minimization by reinforcement learning with evolving policy parameterization

Petar Kormushev, Barkan Uğurlu, Sylvain Calinon, Nikos G. Tsagarakis, Darwin G. Caldwell

发表年份: 2011
引用次数: 12

摘要

We present a learning-based approach for minimizing the electric energy consumption during walking of a passively-compliant bipedal robot. The energy consumption is reduced by learning a varying-height center-of-mass trajectory which uses efficiently the robot's passive compliance. To do this, we propose a reinforcement learning method which evolves the policy parameterization dynamically during the learning process and thus manages to find better policies faster than by using fixed parameterization. The method is first tested on a function approximation task, and then applied to the humanoid robot COMAN where it achieves significant energy reduction.

关键词

Reinforcement learningHumanoid robotTrajectoryComputer scienceEnergy minimizationEnergy consumptionRobotMinificationTask (project management)Energy (signal processing)

Bipedal walking energy minimization by reinforcement learning with evolving policy parameterization

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory