首页 /研究 /Learning aggressive animal locomotion skills for quadrupedal robots solely from monocular videos

LOCOMOTION

Learning aggressive animal locomotion skills for quadrupedal robots solely from monocular videos

Zhao Liu, Zeren Luo, Yimin Han, Jiahui Zhang, Yuanhao Chen, Yunhui Liu, Peng Lu

发表年份: 2025
引用次数: 3
访问权限: 开放获取

摘要

The quest for agile quadrupedal robots is limited by handcrafted reward design in reinforcement learning. While animal motion capture provides 3D references, its cost prohibits scaling. Video learning provides an efficient alternative yet suffers from 2D limitations and joint tracking failures during explosive motions. We address this with a novel video-based framework. First, robust 2D pose estimation constructs a skeleton graph model, enabling Kalman-filter-based joint position fusion. Next, a spatial-temporal graph convolution network aggregates spatial pose features via graph convolutions and temporal dynamics through dilated convolutions, recovering 3D joint trajectories. These trajectories are mapped to the robot’s joint space to formulate generative imitation learning. Real-robot deployment demonstrates successful learning of complex motions: gallop (high-speed), tripod (fault-tolerant), bipedal (quadrupedally challenging), and backflip. The proposed framework significantly advances robotic locomotion capabilities.

关键词

RobotGraphMonocularHumanoid robotExploitQuadrupedalismMotion captureReinforcement learning

Learning aggressive animal locomotion skills for quadrupedal robots solely from monocular videos

摘要

关键词

相关论文

Artificial intelligence: a modern approach

Self-Organizing Maps

Vision meets robotics: The KITTI dataset

Probabilistic robotics