Visually Grounded Language Learning for Robot Navigation

Emre Ünal, Ozan Arkan Can, Y. Yemez

Year published: 2019
Citations: 2

Abstract

We present an end-to-end deep learning model for robot navigation from raw visual pixel input and natural-language text instructions. The proposed model is an LSTM-based sequence-to-sequence neural network architecture with attention, trained on instruction-perception data samples collected in a synthetic environment. We conduct experiments on the SAIL dataset, which we reconstruct in 3D to generate the 2D images associated with the data. Our experiments show that the performance of our model is on par with the state of the art, even though it learns navigational language through end-to-end training from raw visual data.
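
The abstract names attention as part of the sequence-to-sequence architecture but does not say which form is used. As an illustration only, the sketch below assumes plain dot-product attention over encoded instruction tokens; the function names, the 2-dimensional vectors, and the 3-token instruction are hypothetical and not taken from the paper.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attend(decoder_state, encoder_states):
    """Dot-product attention (an assumed form, not confirmed by the paper).

    Scores each encoded instruction token by its dot product with the
    current decoder state, normalizes the scores with softmax, and
    returns the weights together with the weighted sum of the encodings.
    """
    scores = [sum(d * e for d, e in zip(decoder_state, h)) for h in encoder_states]
    weights = softmax(scores)
    dim = len(decoder_state)
    context = [sum(w * h[i] for w, h in zip(weights, encoder_states)) for i in range(dim)]
    return weights, context

# Hypothetical encodings for a 3-token instruction, 2 dimensions each.
encoder_states = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
decoder_state = [2.0, 0.0]
weights, context = attend(decoder_state, encoder_states)
```

In a full model of this kind, the context vector would typically be combined with the visual features and the decoder LSTM state before the next action is predicted; how the paper does this is not stated in the abstract.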

Keywords

Computer science, Artificial intelligence, Robot, Sequence (biology), Deep learning, Computer vision, Perception, Raw data, Natural language, Artificial neural network
