An audio-visual solution to sound source localization and tracking with applications to HRI

BM Emery, Mohsen Jadidi, Keisuke Nakamura, Jaime Valls Miró

Publication year
2016
Citations
2
Access
Open access

Abstract

Robot audition is an emerging and growing field within the robotics community and is necessary for natural Human-Robot Interaction (HRI). In this paper, we propose a framework that integrates advances from Simultaneous Localization And Mapping (SLAM), bearing-only target tracking, and robot audition techniques into a unified system for sound source identification, localization, and tracking. Indoors, acoustic observations are often highly noisy and corrupted by reverberation, robot ego-motion, and background noise, and they may also be discontinuous. Therefore, in everyday interaction scenarios, the system must accommodate outliers, perform robust data association, and appropriately manage the landmarks, i.e. the sound sources. We solve the robot self-localization and environment representation problems using an RGB-D SLAM algorithm, and sound source localization and tracking using recursive Bayesian estimation in the form of the extended Kalman filter with unknown data associations and an unknown number of landmarks. The experimental results show that the proposed system performs well in a medium-sized cluttered indoor environment.
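As a rough illustration of the bearing-only estimation step named in the abstract, the sketch below shows one textbook extended Kalman filter update for a single static 2D sound source, with a chi-square gate that rejects outlier bearings. It is not the authors' implementation: the function names, the 2D simplification, the assumption that the robot pose is already known (for example from SLAM), and the gate value are all assumptions made for this example. The full system described above also has to handle data association across several sources and an unknown number of landmarks, which this sketch does not attempt.

```python
import math


def wrap_angle(a):
    """Wrap an angle in radians to the interval [-pi, pi)."""
    return (a + math.pi) % (2.0 * math.pi) - math.pi


def bearing_update(mean, cov, pose, z, r_var, gate=3.84):
    """One EKF update of a 2D landmark from a single bearing measurement.

    mean  : (mx, my) current landmark position estimate
    cov   : 2x2 symmetric covariance as nested lists
    pose  : (x, y, theta) robot pose, assumed known here
    z     : measured bearing to the source in the robot frame (radians)
    r_var : bearing measurement noise variance (radians squared)
    gate  : squared Mahalanobis threshold; 3.84 is the 95% chi-square
            quantile for one degree of freedom (hypothetical choice)

    Returns (new_mean, new_cov, accepted).
    """
    mx, my = mean
    x, y, theta = pose
    dx, dy = mx - x, my - y
    q = dx * dx + dy * dy

    # Predicted bearing and its Jacobian with respect to (mx, my).
    z_hat = wrap_angle(math.atan2(dy, dx) - theta)
    h = (-dy / q, dx / q)

    # P H^T (a 2-vector) and the scalar innovation variance S.
    ph = (cov[0][0] * h[0] + cov[0][1] * h[1],
          cov[1][0] * h[0] + cov[1][1] * h[1])
    s = h[0] * ph[0] + h[1] * ph[1] + r_var

    # Innovation, wrapped so that bearings near +/-pi compare correctly.
    nu = wrap_angle(z - z_hat)

    # Gate: reject the measurement if it is statistically implausible.
    if nu * nu / s > gate:
        return mean, cov, False

    # Kalman gain, then mean and covariance update: P <- P - K (H P).
    # H P equals (P H^T)^T because P is symmetric.
    k = (ph[0] / s, ph[1] / s)
    new_mean = (mx + k[0] * nu, my + k[1] * nu)
    new_cov = [[cov[i][j] - k[i] * ph[j] for j in range(2)]
               for i in range(2)]
    return new_mean, new_cov, True
```

A single bearing only constrains the source position across the line of sight, so the uncertainty along the line of sight stays unchanged until the robot observes the source from a different viewpoint.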

Keywords

Sound (geography), Computer science, Audio visual, Computer vision, Tracking (education), Artificial intelligence, Communication, Acoustics, Multimedia, Psychology
