首页 /研究 /Keeping Humans in the Loop: Teaching via Feedback in Continuous Action Space Environments

LEARNING

Keeping Humans in the Loop: Teaching via Feedback in Continuous Action Space Environments

Isaac Sheidlower, Allison Moore, Elaine Schaertl Short

发表年份: 2022
引用次数: 8

摘要

Interactive Reinforcement Learning (IntRL) allows human teachers to accelerate the learning process of Reinforcement Learning (RL) robots. However, IntRL has largely been limited to tasks with discrete-action spaces in which actions are relatively slow. This limits IntRL's application to more complicated and challenging robotic tasks, the very tasks that modern RL is particularly well-suited for. We seek to bridge this gap by presenting Continuous Action-space Interactive Reinforcement learning (CAIR): the first continuous action-space IntRL algorithm that is capable of using teacher feedback to out-perform state-of-the-art RL algorithms in those tasks. CAIR combines policies learned from the environment and the teacher into a single policy that proportionally weights the two policies based on their agreement. This allows a CAIR agent to learn a relatively stable policy despite potentially noisy or coarse teacher feedback. We validate our approach in two simulated robotics tasks with easy-to-design and - understand heuristic oracle teachers. Furthermore, we validate our approach in a human subjects study through Amazon Mechanical Turk and show CAIR out-performs the prior state-of-the-art in Interactive RL.

关键词

Reinforcement learningComputer scienceOracleAction (physics)Artificial intelligenceState spaceRobotSpace (punctuation)Process (computing)Robotics

Keeping Humans in the Loop: Teaching via Feedback in Continuous Action Space Environments

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory