Derivative of instantaneous frequency for voice activity detection using phase-based approach
Autor: | Nguyen Binh Thien, Takahiro Fukumori, Takanobu Nishiura, Yukoh Wakabayashi |
---|---|
Rok vydání: | 2019 |
Předmět: |
Signal processing
Voice activity detection Computer science Phase (waves) 020206 networking & telecommunications 02 engineering and technology White noise Instantaneous phase Signal-to-noise ratio Amplitude Computer Science::Sound 0202 electrical engineering electronic engineering information engineering Spectrogram 020201 artificial intelligence & image processing Algorithm |
Zdroj: | APSIPA |
DOI: | 10.1109/apsipaasc47483.2019.9023241 |
Popis: | In this paper, we consider the use of the phase spectrum in speech signal analysis. In particular, a phase-based voice activity detection (VAD) by using the derivative of instantaneous frequency is proposed. Preliminary experiments reveal that the distribution of this feature can indicate the presence or absence of speech. The performance of the proposed method is evaluated in comparison with the conventional amplitude-based method. In addition, we consider a combination of the amplitude-based and phase-based methods in a simple manner to demonstrate the complementarity of both spectra. The experimental results confirm that the phase information can be used to detect voice activity with at least 62% accuracy. The proposed method shows better performance compared to the conventional amplitude-based method in the case when a speech signal was corrupted by white noise at low signal-to-noise ratio (SNR). A combination of two methods achieves even higher performance than each of them separately, in limited conditions. |
Databáze: | OpenAIRE |
Externí odkaz: |