Home /Research /Reinforcement learning with human teachers: evidence of feedback and guidance with implications for learning performance
LEARNING

Reinforcement learning with human teachers: evidence of feedback and guidance with implications for learning performance

Andrea L. Thomaz, Cynthia Breazeal

Year
2006
Citations
252

Abstract

As robots become a mass consumer product, they will need to learn new skills by interacting with typical hu-man users. Past approaches have adapted reinforcement learning (RL) to accept a human reward signal; how-ever, we question the implicit assumption that people shall only want to give the learner feedback on its past actions. We present findings from a human user study showing that people use the reward signal not only to provide feedback about past actions, but also to pro-vide future directed rewards to guide subsequent ac-tions. Given this, we made specific modifications to the simulated RL robot to incorporate guidance. We then analyze and evaluate its learning performance in a second user study, and we report significant improve-ments on several measures. This work demonstrates the importance of understanding the human-teacher/robot-learner system as a whole in order to design algorithms that support how people want to teach while simultane-ously improving the robot’s learning performance.

Keywords

Reinforcement learningComputer scienceRobotHuman–computer interactionError-driven learningRobot learningSIGNAL (programming language)Artificial intelligenceProduct (mathematics)Work (physics)

Related papers

Browse all LEARNING papers