Inverse Reinforcement Learning with Natural Language Goals

Li Zhou, Kevin Small

Publication year: 2021
Citations: 21
Access: Open access

Abstract

Humans generally use natural language to communicate task requirements to each other. Ideally, natural language should also be usable for communicating goals to autonomous machines (e.g., robots) to minimize friction in task specification. However, understanding and mapping natural language goals to sequences of states and actions is challenging. Specifically, existing work along these lines has encountered difficulty in generalizing learned policies to new natural language goals and environments. In this paper, we propose a novel adversarial inverse reinforcement learning algorithm to learn a language-conditioned policy and reward function. To improve generalization of the learned policy and reward function, we use a variational goal generator to relabel trajectories and sample diverse goals during training. Our algorithm outperforms multiple baselines by a large margin on a vision-based natural language instruction following dataset (Room-2-Room), demonstrating a promising advance in enabling the use of natural language instructions in specifying agent goals.

Keywords

Computer science, Natural language, Margin (machine learning), Reinforcement learning, Task (project management), Generalization, Usable, Artificial intelligence, Function (biology), Adversarial system
