首页 /研究 /Talking about 3D scenes: integration of image and speech understanding in a hybrid distributed system
PERCEPTION

Talking about 3D scenes: integration of image and speech understanding in a hybrid distributed system

Gudrun Socher, Gerhard Sagerer, Franz Kümmert, T. Fuhr

发表年份
2002
引用次数
8

摘要

We present a hybrid system that integrates speech and image understanding. Given spoken references, it is able to identify objects of a 3D scene perceived via a stereo camera. Central to our approach is the extraction of qualitative object features and spatial scene properties from acoustic and visual data. The interaction of the understanding processes is performed using a procedural semantic network that interfaces with signal recognition and reconstruction modules, thus integrating semantic, neural and Bayesian networks and Hidden Markov Models. 1. INTRODUCTION Man-Machine-Interaction in real environments is one of the greatest challenges for a number of scientific fields related to Computer Vision, Speech Understanding, and Robotics. At the University of Bielefeld the joint project "Situated Artificial Communicators" has been established with the goal to develop an integrated system where visual, linguistic, senso-motoric, and cognitive abilities interact. The system plays the rol...

关键词

Computer scienceArtificial intelligenceHidden Markov modelVisualizationComputer visionFeature extractionObject (grammar)Bayesian networkSemantics (computer science)Artificial neural network

相关论文

查看 PERCEPTION 分类全部论文