首页 /研究 /Multimodal region-consistent saliency based on foreground and background priors for indoor scene

PERCEPTION

Multimodal region-consistent saliency based on foreground and background priors for indoor scene

Jianhua Zhang, Qingwei Wang, Yanzhu Zhao, Shengyong Chen

发表年份: 2016
引用次数: 2

摘要

Visual saliency is a very important feature for object detection in a complex scene. However, image-based saliency is influenced by clutter background and similar objects in indoor scenes, and pixel-based saliency cannot provide consistent saliency to a whole object. Therefore, in this paper, we propose a novel method that computes visual saliency maps from multimodal data obtained from indoor scenes, whilst keeping region consistency. Multimodal data from a scene are first obtained by an RGB+D camera. This scene is then segmented into over-segments by a self-adapting approach to combine its colour image and depth map. Based on these over-segments, we develop two cues as domain knowledge to improve the final saliency map, including focus regions obtained from colour images, and planar background structures obtained from point cloud data. Thus, our saliency map is generated by compounding the information of the colour data, the depth data and the point cloud data in a scene. In the experiments, we extensively compare the proposed method with state-of-the-art methods, and we also apply the proposed method to a real robot system to detect objects of interest. The experimental results show that the proposed method outperforms other methods in terms of precisions and recall rates.

关键词

Artificial intelligenceComputer scienceComputer visionPoint cloudRGB color modelClutterKadir–Brady saliency detectorFeature (linguistics)Object (grammar)Pixel

Multimodal region-consistent saliency based on foreground and background priors for indoor scene

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory