首页 /研究 /Addressing Sample Complexity in Visual Tasks Using HER and Hallucinatory\n GANs
LEARNING

Addressing Sample Complexity in Visual Tasks Using HER and Hallucinatory\n GANs

Himanshu Sahni, Toby Buckley, Pieter Abbeel, Ilya Kuzovkin

发表年份
2019
引用次数
7
访问权限
开放获取

摘要

Reinforcement Learning (RL) algorithms typically require millions of\nenvironment interactions to learn successful policies in sparse reward\nsettings. Hindsight Experience Replay (HER) was introduced as a technique to\nincrease sample efficiency by reimagining unsuccessful trajectories as\nsuccessful ones by altering the originally intended goals. However, it cannot\nbe directly applied to visual environments where goal states are often\ncharacterized by the presence of distinct visual features. In this work, we\nshow how visual trajectories can be hallucinated to appear successful by\naltering agent observations using a generative model trained on relatively few\nsnapshots of the goal. We then use this model in combination with HER to train\nRL agents in visual settings. We validate our approach on 3D navigation tasks\nand a simulated robotics application and show marked improvement over baselines\nderived from previous work.\n

关键词

HallucinatingHindsight biasComputer scienceArtificial intelligenceSample (material)Sample complexityReinforcement learningGenerative modelRoboticsMachine learning

相关论文

查看 LEARNING 分类全部论文