首页 /研究 /Combining model-based policy search with online model learning for control of physical humanoids
LEARNING

Combining model-based policy search with online model learning for control of physical humanoids

Igor Mordatch, Nikhil Mishra, Clemens Eppner, Pieter Abbeel

发表年份
2016
引用次数
53

摘要

We present an automatic method for interactive control of physical humanoid robots based on high-level tasks that does not require manual specification of motion trajectories or specially-designed control policies. The method is based on the combination of a model-based policy that is trained off-line in simulation and sends high-level commands to a model-free controller that executes these commands on the physical robot. This low-level controller simultaneously learns and adapts a local model of dynamics on-line and computes optimal controls under the learned model. The high-level policy is trained using a combination of trajectory optimization and neural network learning, while considering physical limitations such as limited sensors and communication delays. The entire system runs in real-time on the robot's computer and uses only on-board sensors. We demonstrate successful policy execution on a range of tasks such as leaning, hand reaching, and robust balancing behaviors atop a tilting base on the physical robot and in simulation.

关键词

Computer scienceControl (management)Artificial intelligenceMachine learning

相关论文

查看 LEARNING 分类全部论文