首页 /研究 /ContextualNet: Exploiting Contextual Information Using LSTMs to Improve Image-Based Localization
LEARNING

ContextualNet: Exploiting Contextual Information Using LSTMs to Improve Image-Based Localization

Mitesh Patel, Brendan Emery, Yanying Chen

发表年份
2018
引用次数
15

摘要

Convolutional Neural Networks (CNN) have successfully been utilized for localization using a single monocular image [1]. Most of the work to date has either focused on reducing the dimensionality of data for better learning of parameters during training or on developing different variations of CNN models to improve pose estimation. Many of the best performing works solely consider the content in a single image, while the context from historical images is ignored. In this paper, we propose a combined CNN-LSTM which is capable of incorporating contextual information from historical images to better estimate the current pose. Experimental results achieved using a dataset collected in an indoor office space improved the overall system results to 0.8 m & 2.5° at the third quartile of the cumulative distribution as compared with 1.5 m & 3.0° achieved by PoseNet [1]. Furthermore, we demonstrate how the temporal information exploited by the CNN-LSTM model assists in localizing the robot in situations where image content does not have sufficient features.

关键词

Computer scienceConvolutional neural networkArtificial intelligenceMonocularContext (archaeology)Image (mathematics)RobotPattern recognition (psychology)Curse of dimensionalityComputer vision

相关论文

查看 LEARNING 分类全部论文