首页 /研究 /Combining model-based policy search with online model learning for control of physical humanoids

LEARNING

Combining model-based policy search with online model learning for control of physical humanoids

Igor Mordatch, Nikhil Mishra, Clemens Eppner, Pieter Abbeel

发表年份: 2016
引用次数: 53

摘要

We present an automatic method for interactive control of physical humanoid robots based on high-level tasks that does not require manual specification of motion trajectories or specially-designed control policies. The method is based on the combination of a model-based policy that is trained off-line in simulation and sends high-level commands to a model-free controller that executes these commands on the physical robot. This low-level controller simultaneously learns and adapts a local model of dynamics on-line and computes optimal controls under the learned model. The high-level policy is trained using a combination of trajectory optimization and neural network learning, while considering physical limitations such as limited sensors and communication delays. The entire system runs in real-time on the robot's computer and uses only on-board sensors. We demonstrate successful policy execution on a range of tasks such as leaning, hand reaching, and robust balancing behaviors atop a tilting base on the physical robot and in simulation.

关键词

Computer scienceControl (management)Artificial intelligenceMachine learning

Combining model-based policy search with online model learning for control of physical humanoids

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory