Speaker Recognition Using Wavelet Cepstral Coefficient, I-Vector, and Cosine Distance Scoring and Its Application for Forensics
Autor: | Lei Lei, She Kun |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2016 |
Předmět: | |
Zdroj: | Journal of Electrical and Computer Engineering, Vol 2016 (2016) |
Druh dokumentu: | article |
ISSN: | 2090-0147 2090-0155 |
DOI: | 10.1155/2016/4908412 |
Popis: | An important application of speaker recognition is forensics. However, the accuracy of speaker recognition in forensic cases often drops off rapidly because of the ill effect of ambient noise, variable channel, different duration of speech data, and so on. Therefore, finding a robust speaker recognition model is very important for forensics. This paper builds a new speaker recognition model based on wavelet cepstral coefficient (WCC), i-vector, and cosine distance scoring (CDS). This model firstly uses the WCC to transform the speech into spectral feature vecors and then uses those spectral feature vectors to train the i-vectors that represent the speeches having different durations. CDS is used to compare the i-vectors to give out the evidence. Moreover, linear discriminant analysis (LDA) and the within-class covariance normalization (WCNN) are added to the CDS algorithm to deal with the channel variability problem. Finally, the likelihood ratio estimates the strength of the evidence. We use the TIMIT database to evaluate the performance of the proposed model. The experimental results show that the proposed model can effectively solve the troubles of forensic scenario, but the time cost of the method is high. |
Databáze: | Directory of Open Access Journals |
Externí odkaz: |