首页 /研究 /A Deconvolutive Neural Network for Speech Classification With Applications to Home Service Robot
LEARNING

A Deconvolutive Neural Network for Speech Classification With Applications to Home Service Robot

Donglin Wang, Henry Leung, Ajeesh P. Kurian, Ho‐Sub Yoon

发表年份
2010
引用次数
29

摘要

Reverberation deteriorates the quality and intelligibility of speech, leading to the poor performance of classification systems. Room reverberation parameters depend on the location of the speaker and the microphone and the room geometry. For mobile robots, the reverberation is constantly changing due to the relative movement of the speaker and the robot. This can affect the spectral properties of the signal and therefore, the classification accuracy. The contribution of this paper is a new network architecture, which uses neural network constant modulus algorithm (NNCMA) based equalizer followed by a multi-layer preceptron (MLP) classifier. NNCMA is an MLP which is trained with a cost function similar to constant modulus algorithm (CMA). With this two-stage structure, the classifier does not have to consider the time-varying nature of the reverberation. The proposed algorithm is applied to speech samples collected by the home service robot WEVER-R2 for speaker classification in a typical home or office environment. We use them for gender classification application. The proposed neural network was found to have 83.73% of classification accuracy for age classification and 88.91% of classification accuracy for gender classification, while the standard MLP had a classification accuracy of 71.43% and 72.29%, respectively.

关键词

ReverberationArtificial neural networkComputer scienceSpeech recognitionClassifier (UML)MicrophoneRobotArtificial intelligencePattern recognition (psychology)Mel-frequency cepstrum

相关论文

查看 LEARNING 分类全部论文