Autor: |
Zheng Pei, Bing Luo, Li Xu, Da Li Hu |
Rok vydání: |
2011 |
Předmět: |
|
Zdroj: |
Applied Mechanics and Materials. :461-464 |
ISSN: |
1662-7482 |
DOI: |
10.4028/www.scientific.net/amm.128-129.461 |
Popis: |
In this paper, we proposed left-right hidden Markov models (HMMs) combination with k-means threshold of Likelihood ratio test (LRT) to identify the start and end of the speech. This method builds two models of non-speech and speech but not two states, i.e. each model could conclude several states. In the experiments we present the Voice Activity Detection (VAD) results between two states hidden semi-Markov model (HSMM) and proposed algorithm. We also compare accuracy and robust between the k-means threshold and the adaptive threshold in high signal to noise rate in the background noise. It presents that k-means threshold is more effective than the adaptive threshold and the proposed method also make a better performance than two states HSMM based VAD, especially in the low signal-to-noise ratio (SNR) environment. |
Databáze: |
OpenAIRE |
Externí odkaz: |
|