Fully complex deep neural network for phase-incorporating monaural source separation
Autor: | Shu-Fan Wang, Chung-Hsien Wu, Yuan-Shan Lee, Jia-Ching Wang, Chien-Yao Wang |
---|---|
Rok vydání: | 2017 |
Předmět: |
Artificial neural network
Computer science business.industry Phase (waves) Short-time Fourier transform 020206 networking & telecommunications Pattern recognition 02 engineering and technology Signal Spectral line Time–frequency analysis 030507 speech-language pathology & audiology 03 medical and health sciences symbols.namesake Nonlinear system Fourier transform 0202 electrical engineering electronic engineering information engineering Source separation symbols Artificial intelligence 0305 other medical science business |
Zdroj: | ICASSP |
DOI: | 10.1109/icassp.2017.7952162 |
Popis: | Deep neural network (DNN) have become a popular means of separating a target source from a mixed signal. Most of DNN-based methods modify only the magnitude spectrum of the mixture. The phase spectrum is left unchanged, which is inherent in the short-time Fourier transform (STFT) coefficients of the input signal. However, recent studies have revealed that incorporating phase information can improve the quality of separated sources. To estimate simultaneously the magnitude and the phase of STFT coefficients, this work paper developed a fully complex-valued deep neural network (FCDNN) that learns the nonlinear mapping from complex-valued STFT coefficients of a mixture to sources. In addition, to reinforce the sparsity of the estimated spectra, a sparse penalty term is incorporated into the objective function of the FCDNN. Finally, the proposed method is applied to singing source separation. Experimental results indicate that the proposed method outperforms the state-of-the-art DNN-based methods. |
Databáze: | OpenAIRE |
Externí odkaz: |