Mining Visual Phrases for Visual Robot Localization

Kanji Tanaka, Yuuto Chokushi, Masatoshi Ando

Year: 2016
Citations: 2

Abstract

We propose a discriminative and compact scene descriptor for single-view place recognition that facilitates long-term visual SLAM in familiar, semi-dynamic, and partially changing environments. In contrast to popular bag-of-words scene descriptors, which rely on a library of vector quantized visual features, our proposed scene descriptor is based on a library of raw image data (such as an available visual experience, images shared by other colleague robots, and publicly available image data on the Web) and directly mine it to find visual phrases (VPs) that discriminatively and compactly explain an input query/database image. Our mining approach is motivated by recent success achieved in the field of common pattern discovery – specifically mining of common visual patterns among scenes – and requires only a single library of raw images that can be acquired at different times or on different days. Experimental results show that, although our scene descriptor is significantly more compact than conventional descriptors, its recognition performance is relatively high.

Keywords

Computer scienceArtificial intelligenceDiscriminative modelComputer visionRobotPattern recognition (psychology)Image (mathematics)Field (mathematics)Support vector machine

Mining Visual Phrases for Visual Robot Localization

Abstract

Keywords

Related papers

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory