
PERCEPTION

Pre-training of Deep RL Agents for Improved Learning under Domain Randomization

Artemij Amiranashvili, Max Argus, Lukás Hermann, Wolfram Burgard, Thomas Brox

Year published: 2021
Citations: 2
Access: Open access

Abstract

Visual domain randomization in simulated environments is a widely used method to transfer policies trained in simulation to real robots. However, domain randomization and augmentation hamper the training of a policy. As reinforcement learning struggles with a noisy training signal, this additional nuisance can drastically impede training. For difficult tasks, it can even result in complete failure to learn. To overcome this problem, we propose to pre-train a perception encoder that already provides an embedding invariant to the randomization. We demonstrate that this yields consistently improved results on a randomized version of DeepMind Control Suite tasks and on a stacking environment with arbitrary backgrounds, with zero-shot transfer to a physical robot.
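
To make the idea of an embedding that is invariant to the randomization more concrete, the following is a minimal sketch and not the authors' implementation. It assumes a hypothetical color randomization (`randomize`), a toy linear encoder (`encode`) standing in for a convolutional perception network, and an illustrative objective (`invariance_loss`) that measures how far apart the embeddings of two randomized views of the same image are. All names, shapes, and ranges are assumptions chosen for illustration.

```python
import numpy as np


def randomize(image, rng):
    """Hypothetical visual randomization: random per-channel gain and offset."""
    gain = rng.uniform(0.5, 1.5, size=(1, 1, 3))
    offset = rng.uniform(-0.2, 0.2, size=(1, 1, 3))
    return np.clip(image * gain + offset, 0.0, 1.0)


def encode(image, weights):
    """Toy linear encoder (assumption); a real encoder would be a conv net."""
    return image.reshape(-1) @ weights


def invariance_loss(image, weights, rng):
    """Mean squared distance between embeddings of two randomized views."""
    z_a = encode(randomize(image, rng), weights)
    z_b = encode(randomize(image, rng), weights)
    return float(np.mean((z_a - z_b) ** 2))


rng = np.random.default_rng(0)
image = rng.uniform(size=(8, 8, 3))                    # stand-in observation
weights = rng.normal(scale=0.1, size=(8 * 8 * 3, 16))  # 16-dim embedding
loss = invariance_loss(image, weights, rng)
```

A loss near zero would indicate that the encoder maps differently randomized views of the same scene to nearly the same embedding. On its own, this objective is also minimized by a constant embedding, so a practical pre-training setup would need to pair it with a target that preserves task-relevant information.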

Keywords

Randomization; Domain (mathematical analysis); Computer science; Training (meteorology); Artificial intelligence; Mathematics; Medicine; Geography; Randomized controlled trial; Internal medicine
