Which French speech recognition system for assistant robots?
Wiam Fadel, Imane Araf, Toumi Bouchentouf, Pierre-André Buvet, François Bourzeix, Omar Bourja
- Year
- 2022
- Citations
- 6
Abstract
Artificial intelligence-based speech recognition systems are already available and capable of recognizing the French language. Still, it is quite time-consuming to compare which one will be effective for an assistant robot. The study aims to select the best French-language speech recognition system with the least error in a real environment. In this paper, we present related works on how an Automatic Speech Recognition (ASR) system works, the models used by each of its components, several open-source French datasets, and the frequently used evaluation techniques. Next, we compare deep learning-based speech recognition APIs and pre-trained models for French on two different datasets using the Word Error Rate (WER) metric. The experimental results reveal that Google's Speech-to-Text API outperforms the other systems, namely VOSK API, Wav2vec 2.0, QuartzNet, and Speech Brain's Convolutional, Recurrent, and Fully-connected Networks (CRDNN) model.
Keywords
Related papers
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
Artificial intelligence: a modern approach
1995
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002