Direct Dynamic Retargeting for Humanoid Imitation Learning from Videos
Constant Roux, Ludovic De Matteïs, Armand Jordana, Valentin Guillet, Nicolas Mansard, Olivier Stasse, Philippe Souères
2026
Abstract
Imitation Learning from monocular video demonstrations provides a scalable approach for teaching complex skills to humanoid robots. However, translating human motion to humanoids requires overcoming significant morphological mismatches. Standard approaches rely on Geometric Retargeting or Indirect Dynamic Retargeting pipelines. We identify that these intermediate kinematic projections introduce a geometric bias, restricting the search space and yielding suboptimal dynamic behaviors. In this paper, we propose Direct Dynamic Retargeting (DDR), a novel single-stage framework that generates high-fidelity, dynamically feasible trajectories directly from expert videos. By formulating the problem in the task space and leveraging a sampling-based Model Predictive Control solver within a physics simulator, DDR natively optimizes over complex contact sequences while mitigating input drift. Our experiments demonstrate that bypassing the geometric bias allows DDR to outperform state-of-the-art baselines in demonstration tracking accuracy. Furthermore, we establish that providing such physically viable references to RL agents accelerates training convergence and enhances the final execution of agile and balancing behaviors. Source code will be made publicly available.
Keywords
Related papers
Point Tracking Improves World Action Models
Jiarui Guan, Wenshuai Zhao, Yue Pei +3 more
2026
Any2Any: Efficient Cross-Embodiment Transfer for Humanoid Whole-Body Tracking
Ming Yang, Tao Yu, Feng Li +1 more
2026
Vision-Based Agile Landing on Turbulent Waters
Dimosthenis Angelis, Leonard Bauersfeld, Davide Scaramuzza +1 more
2026
How Many Training Samples Are Needed for the Inverse Kinematics Solutions by Artificial Neural Networks
Dong-Won Lim
2026