首页 /研究 /Learning from Unreliable Human Action Advice in Interactive Reinforcement Learning

LEARNING

Learning from Unreliable Human Action Advice in Interactive Reinforcement Learning

Lisa Scherf, Cigdem Turan, Dorothea Koert

发表年份: 2022
引用次数: 4

摘要

Interactive Reinforcement Learning (IRL) uses human input to improve learning speed and enable learning in more complex environments. Human action advice is here one of the input channels preferred by human users. However, many existing IRL approaches do not explicitly consider the possibility of inaccurate human action advice. Moreover, most approaches that account for inaccurate advice compute trust in human action advice independent of a state. This can lead to problems in practical cases, where human input might be inaccurate only in some states while it is still useful in others. To this end, we propose a novel algorithm that can handle state-dependent unreliable human action advice in IRL. Here, we combine three potential indicator signals for unreliable advice, i.e. consistency of advice, retrospective optimality of advice, and behavioral cues that hint at human uncertainty. We evaluate our method in a simulated gridworld and in robotic sorting tasks with 28 subjects. We show that our method outperforms a state-independent baseline and analyze occurrences of behavioral cues related to unreliable advice.

关键词

Advice (programming)Computer scienceReinforcement learningAction (physics)Consistency (knowledge bases)Artificial intelligenceMachine learningHuman–computer interaction

Learning from Unreliable Human Action Advice in Interactive Reinforcement Learning

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory