Efficient Model Learning from Joint-Action Demonstrations for Human-Robot Collaborative Tasks
Ramya Ramakrishnan, Keren Gu
- 发表年份
- 2016
- 引用次数
- 183
摘要
We present a framework for automatically learning human user models from joint-action demonstrations that enables a robot to compute a robust policy for a collaborative task with a human. First, the demonstrated action sequences are clustered into different human types using an unsuper-vised learning algorithm. A reward function is then learned for each type through the employment of an inverse rein-forcement learning algorithm. The learned model is then in-corporated into a mixed-observability Markov decision pro-cess (MOMDP) formulation, wherein the human type is a partially observable variable. With this framework, we can infer online the human type of a new user that was not in-cluded in the training set, and can compute a policy for the robot that will be aligned to the preference of this user. In a human subject experiment (n = 30), participants agreed more strongly that the robot anticipated their actions when working with a robot incorporating the proposed framework (p < 0.01), compared to manually annotating robot actions. In trials where participants faced difficulty annotating the robot actions to complete the task, the proposed framework significantly improved team efficiency (p < 0.01). The robot incorporating the framework was also found to be more re-sponsive to human actions compared to policies computed using a hand-coded reward function by a domain expert (p < 0.01). These results indicate that learning human user models from joint-action demonstrations and encoding them in a MOMDP formalism can support effective teaming in human-robot collaborative tasks. 1.
关键词
相关论文
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
Artificial intelligence: a modern approach
1995
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002