Using Approximate Entropy as a speech quality measure for a speaker recognition system
Author: | John F. Doherty, Richard A. Metzger, David M. Jenkins |
---|---|
Publication year: | 2016 |
Subject: | Audio mining; Voice activity detection; business.industry; Computer science; Speech recognition; 0206 medical engineering; Speech coding; Acoustic model; Pattern recognition; 02 engineering and technology; Speaker recognition; Speech processing; Linear predictive coding; 020601 biomedical engineering; 01 natural sciences; 010305 fluids & plasmas; Speaker diarisation; 0103 physical sciences; Artificial intelligence; business |
Source: | CISS |
DOI: | 10.1109/ciss.2016.7460517 |
Description: | In this paper, we show that Approximate Entropy (ApEn) can be used to detect high-quality speech frames in an otherwise distorted speech signal. By exploiting the quasi-periodicity of speech, ApEn detects small aberrations in speech frames that would otherwise degrade the performance of an automatic speaker recognition (ASR) system. In addition, we obtain the statistics of ApEn values representative of clean speech and propose threshold bounds that maximize recognition rates. Our simulation results show that ApEn outperforms other popular voice activity detector (VAD) algorithms in discerning clean speech from noisy speech. This ability to detect clean speech allows a speaker recognition system to reach a recognition rate close to 87%, nearly the performance of the system when no noise is present. |
Database: | OpenAIRE |
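The description states that ApEn separates clean, quasi-periodic speech frames from distorted ones and that threshold bounds on ApEn select the frames passed to the recognizer. The following is a minimal sketch of that idea, not the authors' implementation. It uses the standard ApEn definition (template matching under the Chebyshev distance, self-matches included). The embedding dimension `m=2`, the tolerance `r_factor=0.2` (a fraction of the frame's standard deviation), the frame length, and the `lower`/`upper` bounds are all assumptions chosen for illustration; the record does not give the values used in the paper, and `select_frames` is a hypothetical helper name.

```python
import math


def _phi(u, m, r):
    """Mean log-fraction of length-m templates lying within r of each template."""
    n = len(u) - m + 1
    templates = [u[i:i + m] for i in range(n)]
    total = 0.0
    for a in templates:
        # Chebyshev distance; the self-match keeps the count at least 1.
        matches = sum(
            1 for b in templates
            if max(abs(x - y) for x, y in zip(a, b)) <= r
        )
        total += math.log(matches / n)
    return total / n


def approximate_entropy(u, m=2, r_factor=0.2):
    """ApEn(m, r) of one frame, with r = r_factor * std(frame) (assumed)."""
    n = len(u)
    if n < m + 2:
        raise ValueError("frame too short for the chosen m")
    mean = sum(u) / n
    std = math.sqrt(sum((x - mean) ** 2 for x in u) / n)
    r = r_factor * std
    return _phi(u, m, r) - _phi(u, m + 1, r)


def select_frames(signal, frame_len, lower, upper):
    """Start indices of non-overlapping frames whose ApEn lies in [lower, upper].

    The bounds are hypothetical placeholders; in practice they would be
    estimated from ApEn statistics of clean speech.
    """
    kept = []
    for start in range(0, len(signal) - frame_len + 1, frame_len):
        frame = signal[start:start + frame_len]
        if lower <= approximate_entropy(frame) <= upper:
            kept.append(start)
    return kept
```

A regular signal such as a sampled sinusoid yields a lower ApEn than white noise of the same length, which is the property a threshold test relies on. This direct implementation is quadratic in the frame length, so it suits short frames only.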