Object-Focused Advice in Reinforcement Learning
Samantha Krening, Brent Harrison, Karen M. Feigh, Charles L. Isbell, Andrea L. Thomaz
- 发表年份
- 2016
- 引用次数
- 3
摘要
In order for robots and intelligent agents to interact with and learn from people with no machine-learning expertise, robots should be able to learn from natural human instruction. Many human explanations consist of simple sentences without state information, yet most machine learning techniques that incorporate human guidance cannot use non-specific explanations. This work aims to learn policies from a few sentences that aren't state specific. The proposed Object-focused advice links an object to an action, and allows a person to generalize over an object's state space. To evaluate this technique, agents were trained using Object-focused advice collected from participants in an experiment in the Mario Bros. domain. The results show that Object-focused advice performs better than when no advice is given, the agent can learn where to apply the advice in the state space, and the agent can recover from adversarial advice. Also, including warnings of what not do to in addition to advice of what actions to take improves performance.
关键词
相关论文
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
Artificial intelligence: a modern approach
1995
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002