首页 /研究 /Which French speech recognition system for assistant robots?
LEARNING

Which French speech recognition system for assistant robots?

Wiam Fadel, Imane Araf, Toumi Bouchentouf, Pierre-André Buvet, François Bourzeix, Omar Bourja

发表年份
2022
引用次数
6

摘要

Artificial intelligence-based speech recognition systems are already available and capable of recognizing the French language. Still, it is quite time-consuming to compare which one will be effective for an assistant robot. The study aims to select the best French-language speech recognition system with the least error in a real environment. In this paper, we present related works on how an Automatic Speech Recognition (ASR) system works, the models used by each of its components, several open-source French datasets, and the frequently used evaluation techniques. Next, we compare deep learning-based speech recognition APIs and pre-trained models for French on two different datasets using the Word Error Rate (WER) metric. The experimental results reveal that Google's Speech-to-Text API outperforms the other systems, namely VOSK API, Wav2vec 2.0, QuartzNet, and Speech Brain's Convolutional, Recurrent, and Fully-connected Networks (CRDNN) model.

关键词

Computer scienceSpeech recognitionWord error rateMetric (unit)Artificial intelligenceAcoustic modelLanguage modelSpeaker recognitionNatural language processingWord (group theory)

相关论文

查看 LEARNING 分类全部论文