首页 /研究 /Learning from Unreliable Human Action Advice in Interactive Reinforcement Learning
LEARNING

Learning from Unreliable Human Action Advice in Interactive Reinforcement Learning

Lisa Scherf, Cigdem Turan, Dorothea Koert

发表年份
2022
引用次数
4

摘要

Interactive Reinforcement Learning (IRL) uses human input to improve learning speed and enable learning in more complex environments. Human action advice is here one of the input channels preferred by human users. However, many existing IRL approaches do not explicitly consider the possibility of inaccurate human action advice. Moreover, most approaches that account for inaccurate advice compute trust in human action advice independent of a state. This can lead to problems in practical cases, where human input might be inaccurate only in some states while it is still useful in others. To this end, we propose a novel algorithm that can handle state-dependent unreliable human action advice in IRL. Here, we combine three potential indicator signals for unreliable advice, i.e. consistency of advice, retrospective optimality of advice, and behavioral cues that hint at human uncertainty. We evaluate our method in a simulated gridworld and in robotic sorting tasks with 28 subjects. We show that our method outperforms a state-independent baseline and analyze occurrences of behavioral cues related to unreliable advice.

关键词

Advice (programming)Computer scienceReinforcement learningAction (physics)Consistency (knowledge bases)Artificial intelligenceMachine learningHuman–computer interaction

相关论文

查看 LEARNING 分类全部论文