Articulatory feature classification using convolutional neural networks

Autor:	Danny Merkx, Odette Scharenborg
Jazyk:	angličtina
Rok vydání:	2018
Předmět:	Feature (computer vision) Computer science business.industry Pattern recognition Artificial intelligence business Convolutional neural network
Zdroj:	Proceedings of Interspeech 2018 INTERSPEECH
Popis:	The ultimate goal of our research is to improve an existing speech-based computational model of human speech recognition on the task of simulating the role of fine-grained phonetic information in human speech processing. As part of this work we are investigating articulatory feature classifiers that are able to create reliable and accurate transcriptions of the articulatory behaviour encoded in the acoustic speech signal. Articulatory feature (AF) modelling of speech has received a considerable amount of attention in automatic speech recognition research. Different approaches have been used to build AF classifiers, most notably multi-layer perceptrons. Recently, deep neural networks have been applied to the task of AF classification. This paper aims to improve AF classification by investigating two different approaches: 1) investigating the usefulness of a deep Convolutional neural network (CNN) for AF classification; 2) integrating the Mel filtering operation into the CNN architecture. The results showed a remarkable improvement in classification accuracy of the CNNs over state-of-the-art AF classification results for Dutch, most notably in the minority classes. Integrating the Mel filtering operation into the CNN architecture did not further improve classification performance.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::1b3e8ca8394228d23d2f5324b78ec57b https://hdl.handle.net/21.11116/0000-000B-639A-821.11116/0000-000B-639C-6 Zobrazit plný text záznamu