Reinforcement learning of dynamic motor sequence: learning to stand up

Jun Morimoto, Kenji Doya

发表年份: 2002
引用次数: 68

摘要

We propose a learning method for implementing human-like sequential movements in robots. As an example of dynamic sequential movement, we consider the "stand-up" task for a two-joint, three-link robot. In contrast to the case of steady walking or standing, the desired trajectory for such a transient behavior is very difficult to derive. The goal of the task is to find a path that links a lying state to an upright state under the constraints of the system dynamics. The geometry of the robot is such that there is no static solution; the robot has to stand up dynamically utilizing the momentum of its body. We use reinforcement learning, in particular, a continuous time and state temporal difference (TD) learning method. For successful results, we use 1) an efficient method of value function approximation in a high-dimensional state space, and 2) a hierarchical architecture which divides a large state space into a few smaller pieces.

关键词

Reinforcement learningRobotTemporal difference learningComputer scienceTrajectoryState spaceQ-learningTask (project management)Sequence (biology)Artificial intelligence

Reinforcement learning of dynamic motor sequence: learning to stand up

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory