Task-Relevant Adversarial Imitation Learning

Konrad Zolna, Scott Reed, Alexander Novikov, Sergio Gomez Colmenarejo, David Budden, Serkan Cabi, Misha Denil, Nando de Freitas, Ziyu Wang

发表年份: 2019
访问权限: 开放获取

摘要

We show that a critical vulnerability in adversarial imitation is the tendency of discriminator networks to learn spurious associations between visual features and expert labels. When the discriminator focuses on task-irrelevant features, it does not provide an informative reward signal, leading to poor task performance. We analyze this problem in detail and propose a solution that outperforms standard Generative Adversarial Imitation Learning (GAIL). Our proposed method, Task-Relevant Adversarial Imitation Learning (TRAIL), uses constrained discriminator optimization to learn informative rewards. In comprehensive experiments, we show that TRAIL can solve challenging robotic manipulation tasks from pixels by imitating human operators without access to any task rewards, and clearly outperforms comparable baseline imitation agents, including those trained via behaviour cloning and conventional GAIL.

关键词

cs.LGcs.AIcs.ROstat.ML

Task-Relevant Adversarial Imitation Learning

摘要

关键词

相关论文

面向大型复杂构件的移动机器人辅助磨削技术综述

基于物理信息与机器学习的五轴铣削TC4钛合金刀具磨损融合预测模型

通过新型压电主动阻尼刀柄提升机器人铣削质量

一种利用磁致非线性宽带多向被动减振器抑制机器人铣削低频颤振的新方法