Noisy Speech Recognition Based on Combined Audio-Visual Classifiers

Authors: Juan Carlos Gómez, Marianela Parodi, Gonzalo D. Sad, Lucas D. Terissi
Year of publication: 2015
Subject:
Source: Lecture Notes in Computer Science ISBN: 9783319148984
MPRSS
DOI: 10.1007/978-3-319-14899-1_5
Description: This paper proposes an isolated-word speech recognition system based on audio-visual features. To improve recognition under different noise conditions, the system combines three classifiers based on audio, visual, and audio-visual information, respectively. Its performance is evaluated on two isolated-word audio-visual databases: a public one and one compiled by the authors. Experimental results show that the structure of the proposed system leads to significant improvements in recognition rates over a wide range of signal-to-noise ratios.
Database: OpenAIRE