首页 /研究 /Simultaneously Learning Vision and Feature-Based Control Policies for Real-World Ball-In-A-Cup

LEARNING

Simultaneously Learning Vision and Feature-Based Control Policies for Real-World Ball-In-A-Cup

Devin Schwab, Jost Tobias Springenberg, Murilo Fernandes Martins, Michael Neunert, Thomas Lampe, Abbas Abdolmaleki, Tim Hertweck, Roland Hafner, Francesco Nori, Martin Riedmiller

发表年份: 2019
引用次数: 15
访问权限: 开放获取

摘要

We present a method for fast training of vision based control policies on real robots.The key idea behind our method is to perform multi-task Reinforcement Learning with auxiliary tasks that differ not only in the reward to be optimized but also in the state-space in which they operate.In particular, we allow auxiliary task policies to utilize task features that are available only at training-time.This allows for fast learning of auxiliary policies, which subsequently generate good data for training the main, vision-based control policies.This method can be seen as an extension of the Scheduled Auxiliary Control (SAC-X) framework.We demonstrate the efficacy of our method by using both a simulated and real-world Ball-in-a-Cup game controlled by a robot arm.In simulation, our approach leads to significant learning speed-ups when compared to standard SAC-X.On the real robot we show that the task can be learned from-scratch, i.e., with no transfer from simulation and no imitation learning.

关键词

Artificial intelligenceBall (mathematics)Computer scienceComputer visionFeature (linguistics)Feature extractionPattern recognition (psychology)Mathematics

Simultaneously Learning Vision and Feature-Based Control Policies for Real-World Ball-In-A-Cup

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Fractional Differential Equations

Applied Nonlinear Control