Vietnamese Speaker Authentication Using Deep Models

Autor: Son T. Nguyen, Viet Dac Lai, Quyen Dam-Ba, Anh Nguyen-Xuan, Cuong Pham
Rok vydání: 2018
Předmět:
Zdroj: SoICT
DOI: 10.1145/3287921.3287954
Popis: Speaker Authentication is the identification of a user from voice biometrics and has a wide range of applications such as banking security, human computer interaction and ambient authentication. In this work, we investigate the effectiveness of acoustic features such as Mel-frequency cepstral coefficients (MFCC), Gammatone frequency cepstral coefficients (GFCC), and Linear Predictive Codes (LPC) extracted from audio streams for constructing feature spectral images. In addition, we propose to use the deep Residual Network models for user verification from feature spectrum images. We evaluate our proposed method under two settings over the dataset collected from 20 Vietnamese speakers. The results, with the Equal Error rate of around 4%, have demonstrated that the feasibility of Vietnamese speaker authentication by using deep Residual Network models trained with GFCC spectral feature images.
Databáze: OpenAIRE