Approximate Entropy and Empirical Mode Decomposition for Improved Speaker Recognition
Autor: | John F. Doherty, Richard A. Metzger, Donald L. Hall, David M. Jenkins |
---|---|
Rok vydání: | 2020 |
Předmět: |
021110 strategic
defence & security studies Computer science Noise (signal processing) Speech recognition 0211 other engineering and technologies 02 engineering and technology Speech processing Speaker recognition Signal Approximate entropy Hilbert–Huang transform 030507 speech-language pathology & audiology 03 medical and health sciences Computer Science::Sound General Earth and Planetary Sciences Speaker identification 0305 other medical science General Environmental Science |
Zdroj: | Advances in Data Science and Adaptive Analysis. 12:2050011 |
ISSN: | 2424-9238 2424-922X |
DOI: | 10.1142/s2424922x20500114 |
Popis: | When processing real-world recordings of speech, it is highly probable noise will be present at some instance in the signal. Compounding this problem is the situation when the noise occurs in short, impulsive bursts at random intervals. Traditional signal processing methods used to detect speech rely on the spectral energy of the incoming signal to make a determination whether or not a segment of the signal contains speech. However when noise is present, this simple energy detection is prone to falsely flagging noise as speech. This paper will demonstrate an alternative way of processing a noisy speech signal utilizing a combination of information theoretic and signal processing principles to differentiate speech segments from noise. The utilization of this preprocessing technique will allow a speaker recognition system to train statistical speaker model using noise-corrupted speech files, and construct models statistically similar to those constructed from noise-free data. This preprocessing method will be shown to outperform traditional spectrum-based methods for both low-entropy and high-entropy noise in low signal-to-noise ratio environments, with a reduction in the feature space distortion when measured using the Cauchy–Schwarz (CS) distance metric. |
Databáze: | OpenAIRE |
Externí odkaz: |