首页 /研究 /Creating Multimodal Interactive Agents with Imitation and\n Self-Supervised Learning
LEARNING

Creating Multimodal Interactive Agents with Imitation and\n Self-Supervised Learning

DeepMind Interactive Agents Team, Josh Abramson, Arun Ahuja, Arthur Brussee, Federico Carnevale, Mary Cassin, Felix R. Fischer, Petko Georgiev, Alex Goldin, Mansi Gupta, Tim Harley, Felix Hill, Peter C. Humphreys, Alden Hung, Jessica Landon, Timothy Lillicrap, Hamza Merzić, Alistair Muldal, Adam Santoro, Guy Scully

发表年份
2021
引用次数
32
访问权限
开放获取

摘要

A common vision from science fiction is that robots will one day inhabit our\nphysical spaces, sense the world as we do, assist our physical labours, and\ncommunicate with us through natural language. Here we study how to design\nartificial agents that can interact naturally with humans using the\nsimplification of a virtual environment. We show that imitation learning of\nhuman-human interactions in a simulated world, in conjunction with\nself-supervised learning, is sufficient to produce a multimodal interactive\nagent, which we call MIA, that successfully interacts with non-adversarial\nhumans 75% of the time. We further identify architectural and algorithmic\ntechniques that improve performance, such as hierarchical action selection.\nAltogether, our results demonstrate that imitation of multi-modal, real-time\nhuman behaviour may provide a straightforward and surprisingly effective means\nof imbuing agents with a rich behavioural prior from which agents might then be\nfine-tuned for specific purposes, thus laying a foundation for training capable\nagents for interactive robots or digital assistants. A video of MIA's behaviour\nmay be found at https://youtu.be/ZFgRhviF7mY\n

关键词

Computer scienceImitationHuman–computer interactionAction (physics)Artificial intelligenceRobotSelection (genetic algorithm)Machine learningPsychology

相关论文

查看 LEARNING 分类全部论文