首页 /研究 /Multimodal saliency-based attention for object-based scene analysis
PERCEPTION

Multimodal saliency-based attention for object-based scene analysis

Boris Schauerte, Benjamin Kühn, Kristian Kroschel, Rainer Stiefelhagen

发表年份
2011
引用次数
34

摘要

Multimodal attention is a key requirement for humanoid robots in order to navigate in complex environments and act as social, cognitive human partners. To this end, robots have to incorporate attention mechanisms that focus the processing on the potentially most relevant stimuli while controlling the sensor orientation to improve the perception of these stimuli. In this paper, we present our implementation of audio-visual saliency-based attention that we integrated in a system for knowledge-driven audio-visual scene analysis and object-based world modeling. For this purpose, we introduce a novel isophote-based method for proto-object segmentation of saliency maps, a surprise-based auditory saliency definition, and a parametric 3-D model for multimodal saliency fusion. The applicability of the proposed system is demonstrated in a series of experiments.

关键词

Computer scienceArtificial intelligenceSurpriseComputer visionSegmentationOrientation (vector space)Object (grammar)PerceptionFocus (optics)Robot

相关论文

查看 PERCEPTION 分类全部论文