Improved Emotion Recognition With a Novel Speaker-Independent Feature

Eun Ho Kim, Kyung Hak Hyun, Soo Hyun Kim, Yoon Keun Kwak

发表年份: 2009
引用次数: 80

摘要

Emotion recognition is one of the latest challenges in human-robot interaction. This paper describes the realization of emotional interaction for a Thinking Robot, focusing on speech emotion recognition. In general, speaker-independent systems show a lower accuracy rate compared with speaker-dependent systems, as emotional feature values depend on the speaker and their gender. However, speaker-independent systems are required for commercial applications. In this paper, a novel speaker-independent feature, the ratio of a spectral flatness measure to a spectral center (RSS), with a small variation in speakers when constructing a speaker-independent system is proposed. Gender and emotion are hierarchically classified by using the proposed feature (RSS), pitch, energy, and the mel frequency cepstral coefficients. An average recognition rate of 57.2% (plusmn 5.7%) at a 90% confidence interval is achieved with the proposed system in the speaker-independent mode.

关键词

Speech recognitionSpeaker recognitionMel-frequency cepstrumComputer scienceFeature (linguistics)Emotion recognitionRSSRealization (probability)SpectrogramFeature extraction

Improved Emotion Recognition With a Novel Speaker-Independent Feature

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory