Audio-visual data fusion for tracking the direction of multiple speakers

Quang H. Nguyen, JongSuk Choi

发表年份: 2010
引用次数: 3

摘要

This paper presents a multi-speakers tracking algorithm using audio-visual data fusion. The audio information is the direction of speakers and the visual information is the direction of detected faces. These observations are used as inputs of the tracking algorithm, which employed the framework of particle filter. For multi-target tracking, we present a flexible data association and data fusion, which can deal with the appearance or absent of any information during tracking process. The experimental results on data collected from a robot platform in a conventional office room confirm a potential application for human-robot interaction.

关键词

Computer scienceComputer visionTracking (education)Artificial intelligenceSensor fusionAudio visualParticle filterFusionData associationProcess (computing)

Audio-visual data fusion for tracking the direction of multiple speakers

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory