Unified speech and audio coding scheme for high quality at low bitrates
Authors: | R. Geiger, Nikolaus Rettelbach, Gerald Schuller, Roch Lefebvre, Bernhard Grill, Jeremie Lecomte, Stefan Bayer, Johannes Hilpert, Guillaume Fuchs, Markus Multrus, B. Bessette, Redwan Salami, Max Neuendorf, Philippe Gournay |
---|---|
Year of publication: | 2009 |
Subject: | Voice activity detection; Computer science; Speech recognition; Quantization (signal processing); Speech coding; Acoustic model; Data_CODINGANDINFORMATIONTHEORY; Enhanced Variable Rate Codec; Linear predictive coding; Adaptive Multi-Rate audio codec; Codec2; Audio codec; Frequency domain; Bit rate; Extended Adaptive Multi-Rate – Wideband Codec; Active listening; Transform coding |
Source: | ICASSP |
DOI: | 10.1109/icassp.2009.4959505 |
Description: | Traditionally, speech coding and audio coding were separate worlds. Based on different technical approaches and different assumptions about the source signal, neither of the two coding schemes could efficiently represent both speech and music at low bitrates. This paper presents a unified speech and audio codec, which efficiently combines techniques from both worlds. This results in a codec that exhibits consistently high quality for speech, music and mixed audio content. The paper gives an overview of the codec architecture and presents results of formal listening tests comparing this new codec with HE-AAC(v2) and AMR-WB+. This new codec forms the basis of the reference model in the ongoing MPEG standardization activity for Unified Speech and Audio Coding. |
Database: | OpenAIRE |