Theoretical learning guarantees applied to acoustic modeling

Autor: Christopher D. Shulby, Martha D. Ferreira, Rodrigo F. de Mello, Sandra M. Aluisio
Jazyk: angličtina
Rok vydání: 2019
Předmět:
Zdroj: Journal of the Brazilian Computer Society, Vol 25, Iss 1, Pp 1-12 (2019)
Druh dokumentu: article
ISSN: 0104-6500
1678-4804
DOI: 10.1186/s13173-018-0081-3
Popis: Abstract In low-resource scenarios, for example, small datasets or a lack in computational resources available, state-of-the-art deep learning methods for speech recognition have been known to fail. It is possible to achieve more robust models if care is taken to ensure the learning guarantees provided by the statistical learning theory. This work presents a shallow and hybrid approach using a convolutional neural network feature extractor fed into a hierarchical tree of support vector machines for classification. Here, we show that gross errors present even in state-of-the-art systems can be avoided and that an accurate acoustic model can be built in a hierarchical fashion. Furthermore, we present proof that our algorithm does adhere to the learning guarantees provided by the statistical learning theory. The acoustic model produced in this work outperforms traditional hidden Markov models, and the hierarchical support vector machine tree outperforms a multi-class multilayer perceptron classifier using the same features. More importantly, we isolate the performance of the acoustic model and provide results on both the frame and phoneme level, considering the true robustness of the model. We show that even with a small amount of data, accurate and robust recognition rates can be obtained.
Databáze: Directory of Open Access Journals