Age and Gender Recognition from Speech Using Deep Neural Networks
Authors: | Héctor A. Sánchez-Hevia, Manuel Rosa-Zurera, Manuel Utrilla-Manso, Roberto Gil-Pita |
---|---|
Publication year: | 2020 |
Subject: | |
Source: | Advances in Intelligent Systems and Computing ISBN: 9783030625788 WAF |
DOI: | 10.1007/978-3-030-62579-5_23 |
Description: | This paper deals with joint gender identification and age group classification from speech, aimed at improving the functionality of Interactive Voice Response systems. Deep Neural Networks are used because they have recently demonstrated strong discriminative and representation capabilities across a wide range of applications, including speech processing problems based on feature extraction and selection. A comparative study of neural network architectures and sizes is presented to assess how performance depends on the network architecture and the number of free parameters. The classification framework was trained and evaluated using Mozilla’s ‘Common Voice’ dataset, an open, crowdsourced speech corpus. The results are promising, with the best systems achieving a gender identification error lower than 2% and an age group classification error lower than 20%. |
Database: | OpenAIRE |
External link: |