ICASSP 2021 Acoustic Echo Cancellation Challenge: Datasets, Testing Framework, and Results
Autor: | Sriram Srinivasan, Hannes Gamper, Kusha Sridhar, Sebastian Braun, Tanel Parnamaa, Robert Aichner, Ross Cutler, Ando Saabas, Markus Loide |
---|---|
Rok vydání: | 2020 |
Předmět: |
FOS: Computer and information sciences
Reverberation Sound (cs.SD) business.industry Computer science Mean opinion score Speech recognition Deep learning Echo (computing) Double-talk Computer Science - Sound Speech enhancement Background noise Audio and Speech Processing (eess.AS) FOS: Electrical engineering electronic engineering information engineering Artificial intelligence business PESQ Electrical Engineering and Systems Science - Audio and Speech Processing |
Zdroj: | ICASSP |
DOI: | 10.48550/arxiv.2009.04972 |
Popis: | The ICASSP 2021 Acoustic Echo Cancellation Challenge is intended to stimulate research in the area of acoustic echo cancellation (AEC), which is an important part of speech enhancement and still a top issue in audio communication and conferencing systems. Many recent AEC studies report good performance on synthetic datasets where the train and test samples come from the same underlying distribution. However, the AEC performance often degrades significantly on real recordings. Also, most of the conventional objective metrics such as echo return loss enhancement (ERLE) and perceptual evaluation of speech quality (PESQ) do not correlate well with subjective speech quality tests in the presence of background noise and reverberation found in realistic environments. In this challenge, we open source two large datasets to train AEC models under both single talk and double talk scenarios. These datasets consist of recordings from more than 2,500 real audio devices and human speakers in real environments, as well as a synthetic dataset. We open source two large test sets, and we open source an online subjective test framework for researchers to quickly test their results. The winners of this challenge will be selected based on the average Mean Opinion Score (MOS) achieved across all different single talk and double talk scenarios. |
Databáze: | OpenAIRE |
Externí odkaz: |