Class Confusability Reduction in Audio-Visual Speech Recognition Using Random Forests

Autor: Gonzalo D. Sad, Juan Carlos Gómez, Lucas D. Terissi
Rok vydání: 2018
Předmět:
Zdroj: Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications ISBN: 9783319751924
CIARP
DOI: 10.1007/978-3-319-75193-1_70
Popis: This paper presents an audio-visual speech classification system based on Random Forests classifiers, aiming to reduce the intra-class misclassification problems, which is a very usual situation, specially in speech recognition tasks. A novel training procedure is proposed, introducing the concept of Complementary Random Forests (CRF) classifiers. Experimental results over three audio-visual databases, show that a good performance is achieved with the proposed system for the different types of input information considered, viz., audio-only information, video-only information and fused audio-video information. In addition, these results also indicate that the proposed method performs satisfactorily over the three databases using the same configuration parameters.
Databáze: OpenAIRE