Home /Research /Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions

PERCEPTION

Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions

Jing Gu, Eliana Stefani, Qi Wu, Jesse Thomason, Xin Eric Wang

Year: 2022
Access: Open access

Abstract

A long-term goal of AI research is to build intelligent agents that can communicate with humans in natural language, perceive the environment, and perform real-world tasks. Vision-and-Language Navigation (VLN) is a fundamental and interdisciplinary research topic towards this goal, and receives increasing attention from natural language processing, computer vision, robotics, and machine learning communities. In this paper, we review contemporary studies in the emerging field of VLN, covering tasks, evaluation metrics, methods, etc. Through structured analysis of current progress and challenges, we highlight the limitations of current VLN and opportunities for future work. This paper serves as a thorough reference for the VLN research community.

Keywords

cs.CVcs.AIcs.CLcs.LG

Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions

Abstract

Keywords

Related papers

Artificial intelligence: a modern approach

Are we ready for autonomous driving? The KITTI vision benchmark suite

TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems

Vision meets robotics: The KITTI dataset