Home /Research /Path integral learning of multidimensional movement trajectories
LEARNING

Path integral learning of multidimensional movement trajectories

João André, Cristina P. Santos, Lino Costa

Year
2013
Citations
3
Access
Open access

Abstract

This paper explores the use of Path Integral Methods, particularly several variants of the recent Path Integral Policy Improvement (PI2) algorithm in multidimensional movement parametrized policy learning. We rely on Dynamic Movement Primitives (DMPs) to codify discrete and rhythmic trajectories, and apply the PI2-CMA and PIBB methods in the learning of optimal policy parameters, according to different cost functions that inherently encode movement objectives. Additionally we merge both of these variants and propose the PIBB-CMA algorithm, comparing all of them with the vanilla version of PI2. From the obtained results we conclude that PIBB-CMA surpasses all other methods in terms of convergence speed and iterative final cost, which leads to an increased interest in its application to more complex robotic problems.

Keywords

Merge (version control)Computer scienceConvergence (economics)Path (computing)ENCODEPath integral formulationArtificial intelligenceMovement (music)Mathematical optimizationMathematics

Related papers

Browse all LEARNING papers