PACE: Physics Augmentation for Coordinated End-to-end Reinforcement Learning toward Versatile Humanoid Table Tennis
Muqun Hu, Wenxi Chen, Wenjing Li, Falak Mandali, Zijian He, Renhong Zhang, Praveen Krisna, Katherine Christian, Leo Benaharon, Dizhi Ma, Karthik Ramani, Yan Gu
- Year
- 2025
- Access
- Open access
Abstract
Humanoid table tennis (TT) demands rapid perception, proactive whole-body motion, and agile footwork under strict timing--capabilities that remain difficult for end-to-end control policies. We propose a reinforcement learning (RL) framework that maps ball-position observations directly to whole-body joint commands for both arm striking and leg locomotion, strengthened by predictive signals and dense, physics-guided rewards. A lightweight learned predictor, fed with recent ball positions, estimates future ball states and augments the policy's observations for proactive decision-making. During training, a physics-based predictor supplies precise future states to construct dense, informative rewards that lead to effective exploration. The resulting policy attains strong performance across varied serve ranges (hit rate$\geq$96% and success rate$\geq$92%) in simulations. Ablation studies confirm that both the learned predictor and the predictive reward design are critical for end-to-end learning. Deployed zero-shot on a physical Booster T1 humanoid with 23 revolute joints, the policy produces coordinated lateral and forward-backward footwork with accurate, fast returns, suggesting a practical path toward versatile, competitive humanoid TT. We have open-sourced our RL training code at: https://github.com/purdue-tracelab/TTRL-ICRA2026
Keywords
Related papers
Trajectory tracking control for 6WID/4WIS UGV via nonlinear sliding mode-model predictive control with adaptive following steering and dynamic-static constraints
Shengyang Lu, Guanpeng Chen, Lijing Zhao +2 more
Robotics and Autonomous Systems · 2026
Bioinspired underwater robotics: Advances across the materials, design, control, and applications
Dilip Muchhala, Pramod Kumar Maurya, Adarsh Raut +3 more
Robotics and Autonomous Systems · 2026
Modeling and control of a rigid–soft hybrid-link humanoid robot
Zewen He, Taiki Ishigaki, Ko Yamamoto
Robotics and Autonomous Systems · 2026
Artificial pushing adaptive coordinated control for the human-exoskeleton-walker system
Xinhao Zhang, Chen Yang, Chaobin Zou +4 more
Robotics and Autonomous Systems · 2026