One-Shot Imitation Under Mismatched Execution

Kushal Kedia, Prithwish Dan, Angela Chao, M. Di Pace, Sanjiban Choudhury

Year: 2025
Citations: 1

Abstract

Human demonstrations as prompts are a power-ful way to program robots to do long-horizon manipulation tasks. However, translating these demonstrations into robot-executable actions presents significant challenges due to execution mismatches in movement styles and physical capabilities. Existing methods for human-robot translation either depend on paired data, which is infeasible to scale, or rely heavily on frame-level visual similarities that often break down in practice. To address these challenges, we propose RHyME, a novel framework that automatically pairs human and robot trajectories using sequence-level optimal transport cost functions. Given long-horizon robot demonstrations, RHyME synthesizes semantically equivalent human videos by retrieving and composing short-horizon human clips. This approach facilitates effective policy training without the need for paired data. RHyME successfully imitates a range of cross-embodiment demonstrators, both in simulation and with a real human hand, achieving over 50% increase in task success compared to previous methods. We release our code and datasets at this website.

Keywords

Shot (pellet)ImitationComputer scienceComputer securityArtificial intelligencePsychologyNeuroscienceMaterials science

One-Shot Imitation Under Mismatched Execution

Abstract

Keywords

Related papers

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory