Simultaneously Learning Vision and Feature-Based Control Policies for Real-World Ball-In-A-Cup
Devin Schwab, Jost Tobias Springenberg, Murilo Fernandes Martins, Michael Neunert, Thomas Lampe, Abbas Abdolmaleki, Tim Hertweck, Roland Hafner, Francesco Nori, Martin Riedmiller
- Year
- 2019
- Citations
- 15
- Access
- Open access
Abstract
We present a method for fast training of vision based control policies on real robots.The key idea behind our method is to perform multi-task Reinforcement Learning with auxiliary tasks that differ not only in the reward to be optimized but also in the state-space in which they operate.In particular, we allow auxiliary task policies to utilize task features that are available only at training-time.This allows for fast learning of auxiliary policies, which subsequently generate good data for training the main, vision-based control policies.This method can be seen as an extension of the Scheduled Auxiliary Control (SAC-X) framework.We demonstrate the efficacy of our method by using both a simulated and real-world Ball-in-a-Cup game controlled by a robot arm.In simulation, our approach leads to significant learning speed-ups when compared to standard SAC-X.On the real robot we show that the task can be learned from-scratch, i.e., with no transfer from simulation and no imitation learning.
Keywords
Related papers
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
Artificial intelligence: a modern approach
1995
Fractional Differential Equations
Igor Podlubný
2025
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991