首页 /研究 /Focusing computational visual attention in multi-modal human-robot interaction
HRI

Focusing computational visual attention in multi-modal human-robot interaction

Boris Schauerte, Gernot A. Fink

发表年份
2010
引用次数
46

摘要

Identifying verbally and non-verbally referred-to objects is an important aspect of human-robot interaction. Most importantly, it is essential to achieve a joint focus of attention and, thus, a natural interaction behavior. In this contribution, we introduce a saliency-based model that reflects how multi-modal referring acts influence the visual search, i.e. the task to find a specific object in a scene. Therefore, we combine positional information obtained from pointing gestures with contextual knowledge about the visual appearance of the referred-to object obtained from language. The available information is then integrated into a biologically-motivated saliency model that forms the basis for visual search. We prove the feasibility of the proposed approach by presenting the results of an experimental evaluation.

关键词

Computer scienceArtificial intelligenceFocus (optics)Object (grammar)GestureTask (project management)ModalRobotHuman–computer interactionHuman–robot interaction

相关论文

查看 HRI 分类全部论文