首页 /研究 /Robot Imitation Learning From Image-Only Observation Without Real-World Interaction
LEARNING

Robot Imitation Learning From Image-Only Observation Without Real-World Interaction

Xuanhui Xu, Mingyu You, Hongjun Zhou, Zhifeng Qian, Bin He

发表年份
2022
引用次数
10

摘要

Learning from observation (LfO) prompts the robot to imitate actions from experts' states via deep reinforcement learning (RL), achieving satisfactory results in simulation environments through hundreds of thousands of robot–environment interactions. While in the real world, constrained by the expensive and potentially dangerous interaction between the real robot and the environment, LfO is still challenging to be popularized. Therefore, reducing the number of interactions in LfO during training in the real world remains a hot research topic. Although significant progress has been made, the interaction is still inevitable. Hence, this article proposes the LION net ( <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">Learning from Image-only Observation net</i> ), which learns action from image-only demonstrations, e.g., a video of a human demonstrating a task, and reduces the number of the robot–environment interaction to zero in the real world. It is expected to be a realistic solution for real-world robot LfO. The LION net comprises two modules: 1) a domain transfer module that bridges the gap between the simulator and the real world and an RL-based control module that utilizes images as input to learning a task. The LION net affords the robot to imitate an action from the image-only human demonstration in the simulator and perform the learned action in the real world without additional training. The proposed method is evaluated on three real-life tasks: 1) pouring water, 2) washing cup, and 3) stacking cube, deployed to a real-world robot, demonstrating state-of-the-art performance.

关键词

RobotComputer scienceArtificial intelligenceTask (project management)Reinforcement learningAction (physics)ImitationComputer visionHuman–computer interactionSimulation

相关论文

查看 LEARNING 分类全部论文