Home /Research /Model-based imitation learning by probabilistic trajectory matching
LEARNING

Model-based imitation learning by probabilistic trajectory matching

Péter Englert, Alexandros Paraschos, Jan Peters, Marc Peter Deisenroth

Year
2013
Citations
49

Abstract

One of the most elegant ways of teaching new skills to robots is to provide demonstrations of a task and let the robot imitate this behavior. Such imitation learning is a non-trivial task: Different anatomies of robot and teacher, and reduced robustness towards changes in the control task are two major difficulties in imitation learning. We present an imitation-learning approach to efficiently learn a task from expert demonstrations. Instead of finding policies indirectly, either via state-action mappings (behavioral cloning), or cost function learning (inverse reinforcement learning), our goal is to find policies directly such that predicted trajectories match observed ones. To achieve this aim, we model the trajectory of the teacher and the predicted robot trajectory by means of probability distributions. We match these distributions by minimizing their Kullback-Leibler divergence. In this paper, we propose to learn probabilistic forward models to compute a probability distribution over trajectories. We compare our approach to model-based reinforcement learning methods with hand-crafted cost functions. Finally, we evaluate our method with experiments on a real compliant robot.

Keywords

Computer scienceRobotProbabilistic logicArtificial intelligenceReinforcement learningTrajectoryRobustness (evolution)Machine learningDivergence (linguistics)

Related papers

Browse all LEARNING papers