Learning of gestures by imitation using a monocular vision system on a humanoid robot
Elaheh Sabbaghi, Mohsen Bahrami, Saeed Shiry Ghidary
- 发表年份
- 2014
- 引用次数
- 11
摘要
In this paper, we present a vision-based imitation learning system that enables the humanoid robot to recognize and learn upper body gestures using only single camera mounted on the robot head, without forcing demonstrator (human/robot) to wear a certain color or patches. Insufficient information in monocular images and the complicated nature of human motion, make the 3D human pose reconstruction challenging. In the motion capture system, a learning-based method is used to recover 3D pose of upper body (shoulder and elbow joint angles) from monocular images by direct nonlinear regression against shape descriptor vectors extracted from image silhouettes. The goal of proposed imitation learning system is generalization over multiple demonstrations and learning them. To achieve this goal, a set of joint angle trajectories used for training, first project to a latent space with PCA and then align temporally using DTW. Obtained signals are modeled using Gaussian distributions (GMM) and then generalized using Gaussian mixture regression. This system is evaluated qualitatively on human3.6m dataset and quantitatively on sequence of images captured by robot camera. The system is also implemented and tested on the Nao humanoid robot. Experimental results show that the proposed system is capable to effectively perceive, recognize and learn demonstrated gestures in real scenarios.
关键词
相关论文
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
Artificial intelligence: a modern approach
1995
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002