首页 /研究 /ContextualNet: Exploiting Contextual Information Using LSTMs to Improve Image-Based Localization

LEARNING

ContextualNet: Exploiting Contextual Information Using LSTMs to Improve Image-Based Localization

Mitesh Patel, Brendan Emery, Yanying Chen

发表年份: 2018
引用次数: 15

摘要

Convolutional Neural Networks (CNN) have successfully been utilized for localization using a single monocular image [1]. Most of the work to date has either focused on reducing the dimensionality of data for better learning of parameters during training or on developing different variations of CNN models to improve pose estimation. Many of the best performing works solely consider the content in a single image, while the context from historical images is ignored. In this paper, we propose a combined CNN-LSTM which is capable of incorporating contextual information from historical images to better estimate the current pose. Experimental results achieved using a dataset collected in an indoor office space improved the overall system results to 0.8 m & 2.5° at the third quartile of the cumulative distribution as compared with 1.5 m & 3.0° achieved by PoseNet [1]. Furthermore, we demonstrate how the temporal information exploited by the CNN-LSTM model assists in localizing the robot in situations where image content does not have sufficient features.

关键词

Computer scienceConvolutional neural networkArtificial intelligenceMonocularContext (archaeology)Image (mathematics)RobotPattern recognition (psychology)Curse of dimensionalityComputer vision

ContextualNet: Exploiting Contextual Information Using LSTMs to Improve Image-Based Localization

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory