Home /Research /Robust Robotic Control from Pixels using Contrastive Recurrent State-Space Models

LEARNING

Robust Robotic Control from Pixels using Contrastive Recurrent State-Space Models

Nitish Srivastava, Walter Talbott, Martin Bertran Lopez, Shuangfei Zhai, Josh Susskind

Year: 2021
Access: Open access

Abstract

Modeling the world can benefit robot learning by providing a rich training signal for shaping an agent's latent state space. However, learning world models in unconstrained environments over high-dimensional observation spaces such as images is challenging. One source of difficulty is the presence of irrelevant but hard-to-model background distractions, and unimportant visual details of task-relevant entities. We address this issue by learning a recurrent latent dynamics model which contrastively predicts the next observation. This simple model leads to surprisingly robust robotic control even with simultaneous camera, background, and color distractions. We outperform alternatives such as bisimulation methods which impose state-similarity measures derived from divergence in future reward or future optimal actions. We obtain state-of-the-art results on the Distracting Control Suite, a challenging benchmark for pixel-based robotic control.

Keywords

cs.LGcs.AIcs.RO

Robust Robotic Control from Pixels using Contrastive Recurrent State-Space Models

Abstract

Keywords

Related papers

Parallel Differentiable Reachability for Learning and Planning with Certified Neural Dynamics and Controllers

Artificial Intelligence enhanced smart welding islands: Foundation models revolutionizing manufacturing

A deep reinforcement learning and a dynamic graph neural network-based scheduling agent to control a multi-task robot

LLM Agent-driven Automated DFA Assessment with Fine-tuning and AAS-based RAG