首页 /研究 /Focusing computational visual attention in multi-modal human-robot interaction

HRI

Focusing computational visual attention in multi-modal human-robot interaction

Boris Schauerte, Gernot A. Fink

发表年份: 2010
引用次数: 46

摘要

Identifying verbally and non-verbally referred-to objects is an important aspect of human-robot interaction. Most importantly, it is essential to achieve a joint focus of attention and, thus, a natural interaction behavior. In this contribution, we introduce a saliency-based model that reflects how multi-modal referring acts influence the visual search, i.e. the task to find a specific object in a scene. Therefore, we combine positional information obtained from pointing gestures with contextual knowledge about the visual appearance of the referred-to object obtained from language. The available information is then integrated into a biologically-motivated saliency model that forms the basis for visual search. We prove the feasibility of the proposed approach by presenting the results of an experimental evaluation.

关键词

Computer scienceArtificial intelligenceFocus (optics)Object (grammar)GestureTask (project management)ModalRobotHuman–computer interactionHuman–robot interaction

Focusing computational visual attention in multi-modal human-robot interaction

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory