Vietnamese Speaker Authentication Using Deep Models
Autor: | Son T. Nguyen, Viet Dac Lai, Quyen Dam-Ba, Anh Nguyen-Xuan, Cuong Pham |
---|---|
Rok vydání: | 2018 |
Předmět: |
Authentication
Biometrics Computer science Speech recognition Word error rate 020207 software engineering 02 engineering and technology Mixture model Residual Speaker recognition ComputingMethodologies_PATTERNRECOGNITION Feature (computer vision) 0202 electrical engineering electronic engineering information engineering 020201 artificial intelligence & image processing Mel-frequency cepstrum |
Zdroj: | SoICT |
DOI: | 10.1145/3287921.3287954 |
Popis: | Speaker Authentication is the identification of a user from voice biometrics and has a wide range of applications such as banking security, human computer interaction and ambient authentication. In this work, we investigate the effectiveness of acoustic features such as Mel-frequency cepstral coefficients (MFCC), Gammatone frequency cepstral coefficients (GFCC), and Linear Predictive Codes (LPC) extracted from audio streams for constructing feature spectral images. In addition, we propose to use the deep Residual Network models for user verification from feature spectrum images. We evaluate our proposed method under two settings over the dataset collected from 20 Vietnamese speakers. The results, with the Equal Error rate of around 4%, have demonstrated that the feasibility of Vietnamese speaker authentication by using deep Residual Network models trained with GFCC spectral feature images. |
Databáze: | OpenAIRE |
Externí odkaz: |