
PERCEPTION

Speech synchronization between speech and lip shape movements for service robotics applications

Ren C. Luo, Huang Chien-Chieh, Shu-Ruei Chang, Yi-Jeng Tsai

Year of publication: 2011
Citations: 2

Abstract

Synchronization between speech and mouth shape draws on technologies such as computer vision, speech synthesis, and speech recognition. We present a method to synchronize images with speech, using Microsoft's Speech Application Programming Interface (SAPI) as the speech synthesis tool. Speech animation has two components: the speech and the image. The speech output is produced by Text-to-Speech (TTS), and the viseme images are generated with the FaceGen Modeller software, which takes three key pictures as input to calibrate and generate the face model. A viseme event handler written in C# links each viseme to its mouth-shape image; the images are loaded sequentially so that each viseme is matched to the correct image in turn. Speech synthesis is mainly used in assistive devices, e.g. screen readers for people with visual impairment, and a person who cannot speak can use this technology to talk to others. In recent years, speech synthesis has been applied extensively in service robotics and in entertainment productions such as language learning, education, video games, animations, and music videos.
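The viseme-to-image matching described in the abstract can be illustrated with a minimal sketch. It is not the authors' C# handler and does not call SAPI; it only simulates the idea in Python. The `VisemeEvent` record, the `viseme_NN.png` file naming, and the helper names are all hypothetical, chosen for illustration. The one detail taken from SAPI 5 is that viseme IDs run from 0 to 21, with 0 denoting silence.

```python
from dataclasses import dataclass

# SAPI 5 defines 22 viseme IDs, 0..21, where 0 is silence.
NUM_VISEMES = 22


@dataclass(frozen=True)
class VisemeEvent:
    """Hypothetical stand-in for the data a TTS viseme event carries."""
    viseme_id: int          # which mouth shape the engine reached
    audio_position_ms: int  # offset into the synthesized audio
    duration_ms: int        # how long the viseme lasts


def image_for_viseme(viseme_id: int) -> str:
    """Map a viseme ID to an image file (the naming scheme is an assumption)."""
    if not 0 <= viseme_id < NUM_VISEMES:
        raise ValueError(f"viseme id out of range: {viseme_id}")
    return f"viseme_{viseme_id:02d}.png"


def build_schedule(events):
    """Return (start_ms, image) pairs ordered by audio position."""
    ordered = sorted(events, key=lambda e: e.audio_position_ms)
    return [(e.audio_position_ms, image_for_viseme(e.viseme_id)) for e in ordered]


def image_at(schedule, t_ms: int) -> str:
    """Pick the image to display at playback time t_ms."""
    current = image_for_viseme(0)  # show the silence shape before any event
    for start_ms, image in schedule:
        if start_ms > t_ms:
            break
        current = image
    return current


if __name__ == "__main__":
    # Simulated events; a real system would receive these from the TTS engine.
    events = [
        VisemeEvent(viseme_id=0, audio_position_ms=0, duration_ms=50),
        VisemeEvent(viseme_id=12, audio_position_ms=50, duration_ms=80),
        VisemeEvent(viseme_id=4, audio_position_ms=130, duration_ms=90),
    ]
    schedule = build_schedule(events)
    for t in (0, 60, 200):
        print(t, image_at(schedule, t))
```

In a live system, the handler would swap the displayed image as each event arrives rather than precomputing a schedule; the schedule form is used here only because it is easy to inspect.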

Keywords

Viseme; Speech synthesis; Computer science; Speech recognition; Software; Synchronization (alternating current); Speech analytics; Speech technology; Service (business); Computer facial animation
