Multichannel audio declipping
Autor: | Alexey Ozerov, Cagdas Bilen, Patrick Pérez |
---|---|
Přispěvatelé: | Technicolor R & I [Cesson Sévigné], Technicolor, ANR-14-CE27-0002,MAD,Inpainting de données audio manquantes(2014), Ozerov, Alexey, Appel à projets générique - Inpainting de données audio manquantes - - MAD2014 - ANR-14-CE27-0002 - Appel à projets générique - VALID |
Jazyk: | angličtina |
Rok vydání: | 2016 |
Předmět: |
Channel (digital image)
[INFO.INFO-TS] Computer Science [cs]/Signal and Image Processing Speech recognition audio declipping Model parameters generalized expectation-maximization 02 engineering and technology Data_CODINGANDINFORMATIONTHEORY Set (abstract data type) 030507 speech-language pathology & audiology 03 medical and health sciences [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing multichannel audio Computer Science::Multimedia 0202 electrical engineering electronic engineering information engineering Nonnegative tensor factorization Mathematics full-rank spatial model business.industry nonnegative tensor factorization 020206 networking & telecommunications Pattern recognition Covariance Power (physics) Sound recording and reproduction Computer Science::Sound Spectrogram Artificial intelligence 0305 other medical science business |
Zdroj: | IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'16) IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'16), Mar 2016, Shanghai, China ICASSP |
Popis: | International audience; Audio declipping consists in recovering so-called clipped audio samples that are set to a maximum / minimum threshold. Many different approaches were proposed to solve this problem in case of single-channel (mono) recordings. However, while most of audio recordings are multichannel nowadays, there is no method designed specifically for multichannel audio declipping, where the inter-channel correlations may be efficiently exploited for a better declipping result. In this work we propose for the first time such a multichannel audio declipping method. Our method is based on representing a multichannel audio recording as a convolutive mixture of several audio sources, and on modeling the source power spectrograms and mixing filters by nonnegative tensor factorization model and full-rank covariance matrices, respectively. A generalized expectation-maximization algorithm is proposed to estimate model parameters. It is shown experimentally that the proposed multichannel audio de-clipping algorithm outperforms in average and in most cases a state-of-the-art single-channel declipping algorithm applied to each channel independently. |
Databáze: | OpenAIRE |
Externí odkaz: |