Home /Research /Which French speech recognition system for assistant robots?
LEARNING

Which French speech recognition system for assistant robots?

Wiam Fadel, Imane Araf, Toumi Bouchentouf, Pierre-André Buvet, François Bourzeix, Omar Bourja

Year
2022
Citations
6

Abstract

Artificial intelligence-based speech recognition systems are already available and capable of recognizing the French language. Still, it is quite time-consuming to compare which one will be effective for an assistant robot. The study aims to select the best French-language speech recognition system with the least error in a real environment. In this paper, we present related works on how an Automatic Speech Recognition (ASR) system works, the models used by each of its components, several open-source French datasets, and the frequently used evaluation techniques. Next, we compare deep learning-based speech recognition APIs and pre-trained models for French on two different datasets using the Word Error Rate (WER) metric. The experimental results reveal that Google's Speech-to-Text API outperforms the other systems, namely VOSK API, Wav2vec 2.0, QuartzNet, and Speech Brain's Convolutional, Recurrent, and Fully-connected Networks (CRDNN) model.

Keywords

Computer scienceSpeech recognitionWord error rateMetric (unit)Artificial intelligenceAcoustic modelLanguage modelSpeaker recognitionNatural language processingWord (group theory)

Related papers

Browse all LEARNING papers