Modular deep Q networks for sim-to-real transfer of visuo-motor policies

Fangyi Zhang, Jürgen Leitner, Michael Milford, Peter Corke

发表年份: 2017
引用次数: 9
访问权限: 开放获取

摘要

While deep learning has had significant successes in computer vision thanks to the abundance of visual data, collecting sufficiently large real-world datasets for robot learning can be costly. To increase the practicality of these techniques on real robots, we propose a modular deep reinforcement learning method capable of transferring models trained in simulation to a real-world robotic task. We introduce a bottleneck between perception and control, enabling the networks to be trained independently, but then merged and fine-tuned in an end-to-end manner to further improve hand-eye coordination. On a canonical, planar visually-guided robot reaching task a fine-tuned accuracy of 1.6 pixels is achieved, a significant improvement over naive transfer (17.5 pixels), showing the potential for more complicated and broader applications. Our method provides a technique for more efficient learning and transfer of visuo-motor policies for real robotic systems without relying entirely on large real-world robot datasets.

关键词

BottleneckComputer scienceRobotModular designArtificial intelligenceTask (project management)Reinforcement learningTransfer of learningPixelDeep learning

Modular deep Q networks for sim-to-real transfer of visuo-motor policies

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory