Enabling Efficient, Reliable Real-World Reinforcement Learning with Approximate Physics-Based Models
Tyler Westenbroek, Jacob Levy, David Fridovich-Keil
- Year
- 2023
- Access
- Open access
Abstract
We focus on developing efficient and reliable policy optimization strategies for robot learning with real-world data. In recent years, policy gradient methods have emerged as a promising paradigm for training control policies in simulation. However, these approaches often remain too data inefficient or unreliable to train on real robotic hardware. In this paper we introduce a novel policy gradient-based policy optimization framework which systematically leverages a (possibly highly simplified) first-principles model and enables learning precise control policies with limited amounts of real-world data. Our approach $1)$ uses the derivatives of the model to produce sample-efficient estimates of the policy gradient and $2)$ uses the model to design a low-level tracking controller, which is embedded in the policy class. Theoretical analysis provides insight into how the presence of this feedback controller overcomes key limitations of stand-alone policy gradient methods, while hardware experiments with a small car and quadruped demonstrate that our approach can learn precise control strategies reliably and with only minutes of real-world data.
Keywords
Related papers
Trajectory tracking control for 6WID/4WIS UGV via nonlinear sliding mode-model predictive control with adaptive following steering and dynamic-static constraints
Shengyang Lu, Guanpeng Chen, Lijing Zhao +2 more
Robotics and Autonomous Systems · 2026
Bioinspired underwater robotics: Advances across the materials, design, control, and applications
Dilip Muchhala, Pramod Kumar Maurya, Adarsh Raut +3 more
Robotics and Autonomous Systems · 2026
Modeling and control of a rigid–soft hybrid-link humanoid robot
Zewen He, Taiki Ishigaki, Ko Yamamoto
Robotics and Autonomous Systems · 2026
Artificial pushing adaptive coordinated control for the human-exoskeleton-walker system
Xinhao Zhang, Chen Yang, Chaobin Zou +4 more
Robotics and Autonomous Systems · 2026