Unified speech and audio coding scheme for high quality at low bitrates

Autor:	R. Geiger, Nikolaus Rettelbach, Gerald Schuller, Roch Lefebvre, Grill Bernhard, Jeremie Lecomte, Stefan Bayer, Johannes Hilpert, Guillaume Fuchs, Markus Multrus, B. Bessette, Redwan Salami, Max Neuendorf, Philippe Gournay
Rok vydání:	2009
Předmět:	Voice activity detection Computer science Speech recognition Quantization (signal processing) Speech coding Acoustic model Data_CODINGANDINFORMATIONTHEORY Enhanced Variable Rate Codec Linear predictive coding Adaptive Multi-Rate audio codec Codec2 Audio codec Frequency domain Bit rate Extended Adaptive Multi-Rate – Wideband Codec Active listening Transform coding
Zdroj:	ICASSP
DOI:	10.1109/icassp.2009.4959505
Popis:	Traditionally, speech coding and audio coding were separate worlds. Based on different technical approaches and different assumptions about the source signal, neither of the two coding schemes could efficiently represent both speech and music at low bitrates. This paper presents a unified speech and audio codec, which efficiently combines techniques from both worlds. This results in a codec that exhibits consistently high quality for speech, music and mixed audio content. The paper gives an overview of the codec architecture and presents results of formal listening tests comparing this new codec with HE-AAC(v2) and AMR-WB+. This new codec forms the basis of the reference model in the ongoing MPEG standardization activity for Unified Speech and Audio Coding.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::342bc0af757039b73e2e2feb0d4c81a7 https://doi.org/10.1109/icassp.2009.4959505 Zobrazit plný text záznamu