Reinforcement learning with human teachers: evidence of feedback and guidance with implications for learning performance
Andrea L. Thomaz, Cynthia Breazeal
- Year: 2006
- Citations: 252
Abstract
As robots become a mass consumer product, they will need to learn new skills by interacting with typical human users. Past approaches have adapted reinforcement learning (RL) to accept a human reward signal; however, we question the implicit assumption that people will only want to give the learner feedback on its past actions. We present findings from a human user study showing that people use the reward signal not only to provide feedback about past actions, but also to provide future directed rewards to guide subsequent actions. Given this, we made specific modifications to the simulated RL robot to incorporate guidance. We then analyze and evaluate its learning performance in a second user study, and we report significant improvements on several measures. This work demonstrates the importance of understanding the human-teacher/robot-learner system as a whole in order to design algorithms that support how people want to teach while simultaneously improving the robot’s learning performance.
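The distinction the abstract draws between feedback on past actions and guidance for upcoming ones can be illustrated with a small sketch. It is not the authors' implementation: it assumes a plain tabular Q-learning learner, and the names `choose_action`, `q_update`, and the `guided` parameter are hypothetical, introduced only for illustration. Human feedback enters as the reward in the update, while guidance narrows the set of actions considered at the next step.

```python
import random
from collections import defaultdict


def choose_action(q, state, actions, guided=None, epsilon=0.1, rng=random):
    """Epsilon-greedy selection (assumed policy, not taken from the paper).

    If the teacher supplied guidance, only actions in `guided` are
    considered; if guidance is absent or matches no available action,
    all actions remain candidates.
    """
    candidates = [a for a in actions if guided and a in guided] or list(actions)
    if rng.random() < epsilon:
        return rng.choice(candidates)
    return max(candidates, key=lambda a: q[(state, a)])


def q_update(q, state, action, reward, next_state, actions, alpha=0.5, gamma=0.9):
    """Standard Q-learning update; `reward` stands in for the human's feedback signal."""
    best_next = max(q[(next_state, a)] for a in actions)
    q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])


# Hypothetical two-action example.
actions = ["left", "right"]
q = defaultdict(float)

# Feedback: the teacher rewards the action the learner just took.
q_update(q, "s0", "left", reward=1.0, next_state="s1", actions=actions)

# Guidance: the teacher points at "right" before the learner acts.
unguided = choose_action(q, "s0", actions, epsilon=0.0)
guided = choose_action(q, "s0", actions, guided={"right"}, epsilon=0.0)
```

In this sketch guidance only biases action selection and never changes the value estimates directly, which is one possible way to keep the two channels separate; the paper's own modifications may differ.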