首页 /研究 /Learning to Fuse Multiscale Features for Visual Place Recognition
LEARNING

Learning to Fuse Multiscale Features for Visual Place Recognition

Jun Mao, Xiaofeng He, Lilian Zhang, Liao Wu, Michael Milford

发表年份
2018
引用次数
22
访问权限
开放获取

摘要

Efficient and robust visual place recognition is of great importance to autonomous mobile robots. Recent work has shown that features learned from convolutional neural networks achieve impressed performance with efficient feature size, where most of them are pooled or aggregated from a convolutional feature map. However, convolutional filters only capture the appearance of their perceptive fields, which lack the considerations on how to combine the multiscale appearance for place recognition. In this paper, we propose a novel method to build a multiscale feature pyramid and present two approaches to use the pyramid to augment the place recognition capability. The first approach fuses the pyramid to obtain a new feature map, which has an awareness of both the local and semi-global appearance, and the second approach learns an attention model from the feature pyramid to weight the spatial grids on the original feature map. Both approaches combine the multiscale features in the pyramid to suppress the confusing local features while tackling the problem in two different ways. Extensive experiments have been conducted on benchmark datasets with varying degrees of appearance and viewpoint variations. The results show that the proposed approaches achieve superior performance over the networks without the multiscale feature fusion and the multiscale attention components. Analyses on the performance of using different feature pyramids are also provided.

关键词

Computer scienceArtificial intelligenceFuse (electrical)Pyramid (geometry)Feature (linguistics)Benchmark (surveying)Convolutional neural networkPattern recognition (psychology)Feature extractionFeature learning

相关论文

查看 LEARNING 分类全部论文