首页 /研究 /Humanoid action imitation learning via boosting sample DQN in virtual demonstrator environment
LEARNING

Humanoid action imitation learning via boosting sample DQN in virtual demonstrator environment

Zhou Rong, Zhisheng Zhang, Kunyyu Peng, Yang Mi, Xiangsheng Huang

发表年份
2016
引用次数
2

摘要

With the growth of modern industrial automation, autonomous-learning applied in the field of robot has aroused considerable attentions of researchers. However, those existing learning methods typically require mass among of training set, increasing the difficulty of collecting samples which is time-consuming, while the validity of samples might be divergent greatly, and thus the training efficiency is limited. Simultaneously, the reinforcement learning used in the system was based on the hypothesis that each action in the sequence contribute equally to the consequence, which is not corresponding to the common rules. In this paper, we propose a method, boosting sample DQN, to optimize the validity of training sample set. Inspired by boosting method, by extracting samples from replay memory hierarchically based on statistical results, the efficiency of network training is improved. Our algorithm, which has a small count of parameters, has been transplanted to the dual-arm robot system successfully. This approach learns a set of trajectories for the action of reaching and grabbing target objects using real-time models obtained by interactively wearable sensing equipment. And also, solution was proposed to distinguish weights of different actions. Our method has proved to be adaptive in learning complicated tasks, including grabbing bottle within its scope, as we presented in the paper.

关键词

Computer scienceBoosting (machine learning)Artificial intelligenceMachine learningRobotReinforcement learningHumanoid robotEstimatorMathematics

相关论文

查看 LEARNING 分类全部论文