Variable bit-rate CELP coding of speech with phonetic classification

Author:	Allen Gersho, E. Paksoy, Krishnaswamy Srinivasan
Publication year:	2010
Subject:	Code-excited linear prediction; Background noise; Voice activity detection; Computer science; Speech recognition; Segmentation; Electrical and Electronic Engineering; Subjective quality; Variable bitrate; Harmonic Vector Excitation Coding; Coding (social sciences)
Source:	European Transactions on Telecommunications. 5:591-602
ISSN:	1124-318X
DOI:	10.1002/ett.4460050510
Description:	A variable bit-rate speech coder intended for digital cellular applications is described. A voice activity detection algorithm is used to distinguish active speech from background noise. Each frame of active speech is further classified into one of three phonetic categories: voiced, unvoiced, and onset. Each input frame is assigned one of five bit rates according to voice activity and phonetic classification and is coded using an analysis-by-synthesis algorithm tailored to the needs of its class. The resulting coder, called Variable Rate Phonetic Segmentation, produces good quality speech at an average bit rate below 3 kbit/s when operating with a voice activity factor of 0.5. Informal subjective quality assessment for speech in clean and noisy backgrounds indicates a performance that is comparable to the TIA standard QCELP algorithm while operating at a 25% to 40% lower average bit rate.
Database:	OpenAIRE
External link:	https://explore.openaire.eu/search/publication?articleId=doi_________::79ef2d5a400db60bdb4efc7eb0ecb7b8 https://doi.org/10.1002/ett.4460050510