Improved Signal/Pause Segmentation Algorithm Based on the Probability Density Function of Background Noise and Empirical Mode Decomposition
Author: | Alexander Yu. Tychkov, Alan K. Alimuradov, Pyotr P. Churakov |
---|---|
Year of publication: | 2021 |
Subject: | Mahalanobis distance; Computer science; 0206 medical engineering; Mode (statistics); Probability density function; 02 engineering and technology; 020601 biomedical engineering; Hilbert–Huang transform; Speech segmentation; Background noise; 03 medical and health sciences; 0302 clinical medicine; Segmentation; Algorithm; 030217 neurology & neurosurgery; Energy (signal processing) |
Source: | 2021 IEEE Conference of Russian Young Researchers in Electrical and Electronic Engineering (ElConRus). |
Description: | Segmentation into informative regions is an important stage in speech pre-processing. The quality of segmentation affects the performance of almost all known speech technology applications (speech recognition, speaker identification, speech-to-text conversion, etc.). The article presents an improved speech/pause segmentation algorithm. The original algorithm uses the probability density function of background noise and analyzes the one-dimensional Mahalanobis distance of the discrete samples of the investigated speech signal. The improvement consists in splitting the speech into fragments and decomposing each fragment into empirical modes, so that the one-dimensional Mahalanobis distance of the discrete samples is then analyzed for each mode separately. The improved algorithm was compared with the original algorithm and with well-known segmentation methods based on zero-crossing rate and short-time energy. According to the results, the improved algorithm provides the best detection of the start and end boundaries of informative speech sections, with errors of the first and second kind of 4.5767 % and 1.421 %, respectively. |
Database: | OpenAIRE |
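
The quantities named in the description can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract gives no thresholds, frame sizes, or noise model, so this sketch assumes that the first `noise_len` samples contain only background noise, that the noise is summarized by its mean and standard deviation, and that a frame counts as signal when a chosen fraction of its samples lies far from the noise mean. The empirical mode decomposition step is omitted; under the approach described above, the same distance test would be applied to each mode of each fragment. The names `segment`, `demo_signal`, `threshold`, and `min_fraction` are hypothetical.

```python
import math
import random
import statistics


def mahalanobis_1d(samples, noise_mean, noise_std):
    """One-dimensional Mahalanobis distance |x - mu| / sigma per sample."""
    return [abs(x - noise_mean) / noise_std for x in samples]


def short_time_energy(frame):
    """Mean squared amplitude of one frame (baseline feature)."""
    return sum(x * x for x in frame) / len(frame)


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ (baseline feature)."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def segment(signal, noise_len, frame_len, threshold=3.0, min_fraction=0.2):
    """Label each non-overlapping frame True (signal) or False (pause).

    Assumptions (not taken from the paper): the first `noise_len` samples
    are noise only, and a frame is signal when at least `min_fraction` of
    its samples are more than `threshold` noise standard deviations away
    from the noise mean.
    """
    noise = signal[:noise_len]
    mu = statistics.fmean(noise)
    sigma = statistics.pstdev(noise)
    if sigma == 0:
        raise ValueError("noise segment has zero variance")
    labels = []
    for start in range(0, len(signal) - frame_len + 1, frame_len):
        frame = signal[start:start + frame_len]
        distances = mahalanobis_1d(frame, mu, sigma)
        active = sum(1 for d in distances if d > threshold)
        labels.append(active / frame_len >= min_fraction)
    return labels


def demo_signal(seed=0):
    """Synthetic test signal: noise, then a 200 Hz tone in noise, then noise."""
    rng = random.Random(seed)
    pause = lambda n: [rng.gauss(0.0, 0.01) for _ in range(n)]
    tone = [0.5 * math.sin(2 * math.pi * 200 * n / 8000) + rng.gauss(0.0, 0.01)
            for n in range(800)]
    return pause(800) + tone + pause(800)


if __name__ == "__main__":
    sig = demo_signal()
    print(segment(sig, noise_len=800, frame_len=200))
```

On the synthetic signal, the frames covering the tone should be labeled as signal and the surrounding noise-only frames as pause. `short_time_energy` and `zero_crossing_rate` are included only to show the baseline features that the description compares against.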