Emotional Variability Analysis Based I-Vector for Speaker Verification in Under-Stress Conditions

Author:	Barlian Henryranu Prasetio, Koichi Tanno, Hiroki Tamura
Language:	English
Publication year:	2020
Subject:	Speech production; Channel (digital image); Computer Networks and Communications; Computer science; Speech recognition; Word error rate; lcsh:TK7800-8360; 02 engineering and technology; stress speech; 030507 speech-language pathology & audiology; 03 medical and health sciences; eigenemotion; Component (UML); Stress (linguistics); 0202 electrical engineering electronic engineering information engineering; Electrical and Electronic Engineering; Representation (mathematics); speaker verification; i-vector technique; lcsh:Electronics; joint factor analysis; emotional conditions; Hardware and Architecture; Control and Systems Engineering; Signal Processing; 020201 artificial intelligence & image processing; 0305 other medical science; Focus (optics); Subspace topology
Source:	Electronics, Vol 9, Iss 9, p 1420 (2020)
ISSN:	2079-9292
Description:	Emotional conditions cause changes in the speech production system, which produce differences in acoustic characteristics compared with neutral conditions. The presence of emotion degrades the performance of a speaker verification system. In this paper, we propose a speaker modeling method that accommodates the presence of emotion in speech segments by extracting a compact speaker representation. The speaker model is estimated by following a procedure similar to the i-vector technique, but it treats the emotional effect as the channel variability component. We name this method emotional variability analysis (EVA). EVA represents the emotion subspace separately from the speaker subspace, as in the joint factor analysis (JFA) model. The effectiveness of the proposed system is evaluated by comparing it with the standard i-vector system on the speaker verification task of the Speech Under Simulated and Actual Stress (SUSAS) dataset, using three different scoring methods. The evaluation focuses on the equal error rate (EER). In addition, we conducted an ablation study for a more comprehensive analysis of the EVA-based i-vector. Based on the experimental results, the proposed system outperformed the standard i-vector system and achieved state-of-the-art results in the verification task for speakers under stress.
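The abstract reports results as equal error rate (EER), the operating point at which the false acceptance rate and the false rejection rate coincide. The following is a minimal sketch of how an EER could be approximated from verification scores, assuming higher scores indicate a more likely target speaker. The function name `equal_error_rate` and the example scores are hypothetical illustrations and are not taken from the paper or from the SUSAS dataset.

```python
import numpy as np

def equal_error_rate(target_scores, nontarget_scores):
    """Approximate the EER by sweeping a threshold over all observed scores.

    Assumption: a trial is accepted when its score is >= the threshold.
    Returns (eer, threshold) at the point where FAR and FRR are closest.
    """
    target = np.asarray(target_scores, dtype=float)
    nontarget = np.asarray(nontarget_scores, dtype=float)

    # Candidate thresholds: every score seen in either trial set.
    thresholds = np.sort(np.concatenate([target, nontarget]))

    # False rejection rate: fraction of target trials scored below the threshold.
    frr = np.array([(target < t).mean() for t in thresholds])
    # False acceptance rate: fraction of non-target trials at or above it.
    far = np.array([(nontarget >= t).mean() for t in thresholds])

    # Pick the threshold where the two error rates are closest,
    # and report their average as the EER estimate.
    idx = int(np.argmin(np.abs(far - frr)))
    return (far[idx] + frr[idx]) / 2.0, thresholds[idx]

# Made-up scores for illustration only.
eer, threshold = equal_error_rate([0.6, 0.4, 0.8, 0.9], [0.1, 0.5, 0.3, 0.2])
print(f"EER = {eer:.2%} at threshold {threshold}")
```

With few trials the two error curves may never cross exactly, which is why the sketch averages them at the closest point; evaluation toolkits typically interpolate instead.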
Database:	OpenAIRE
External link:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::b79214a5e5fb799b2ae906b9ee8568a1 https://www.mdpi.com/2079-9292/9/9/1420