A study on the roles of total variability space and session variability modeling in speaker recognition
Autor: | Jean-François Bonastre, Achintya Kumar Sarkar, Driss Matrouf |
---|---|
Rok vydání: | 2015 |
Předmět: |
Linguistics and Language
business.industry Computer science Speech recognition 020206 networking & telecommunications Pattern recognition 02 engineering and technology Space (commercial competition) Mixture model Speaker recognition Language and Linguistics Session (web analytics) Human-Computer Interaction Speaker diarisation 030507 speech-language pathology & audiology 03 medical and health sciences 0202 electrical engineering electronic engineering information engineering Maximum a posteriori estimation NIST Segmentation Computer Vision and Pattern Recognition Artificial intelligence 0305 other medical science business Software |
Zdroj: | International Journal of Speech Technology. 19:111-120 |
ISSN: | 1572-8110 1381-2416 |
DOI: | 10.1007/s10772-015-9324-2 |
Popis: | Speaker verification (SV) using i-vector concept becomes state-of-the-art. In this technique, speakers are projected onto the total variability space and represented by vectors called i-vectors. During testing, the i-vectors of the test speech segment and claimant are conditioned to compensate for the session variability before scoring. So, i-vector system can be viewed as two processing blocks: one is total variability space and the other is post-processing module. Several questions arise, such as, (i) which part of the i-vector system plays a major role in speaker verification: total variability space or post-processing task; (ii) is the post-processing module intrinsic to the total variability space? The motivation of this paper is to partially answer these questions by proposing several simpler speaker characterization systems for speaker verification, where speakers are represented by their speaker characterization vectors (SCVs). The SCVs are obtained by uniform segmentation of the speakers gaussian mixture models (GMMs)- and maximum likelihood linear regression (MLLR) super-vectors. We consider two adaptation approaches for GMM super-vector: one is maximum a posteriori and other is MLLR. Similarly to the i-vector, SCVs are post-processed for session variability compensation during testing. The proposed system shows promising performance when compared to the classical i-vector system which indicates that the post-processing task plays an major role in i-vector based SV system and is not intrinsic to the total variability space. All experimental results are shown on NIST 2008 SRE core condition. |
Databáze: | OpenAIRE |
Externí odkaz: |