Transform Representation of the Spectra of Acoustic Speech Segments with Applications, Part 2: Speech Analysis, Synthesis and Coding

Autor: Algazi, Ralph, Cadwell, C., Irvine, D., Ready, M., Brown, K., Chung, S.
Jazyk: angličtina
Rok vydání: 1993
Předmět:
Zdroj: Algazi, Ralph; Cadwell, C.; Irvine, D.; Ready, M.; Brown, K.; & Chung, S.(1993). Transform Representation of the Spectra of Acoustic Speech Segments with Applications, Part 2: Speech Analysis, Synthesis and Coding. IEEE Transaction on Speech and Audio Processing, 1. UC Davis: Institute for Data Analysis and Visualization. Retrieved from: http://www.escholarship.org/uc/item/6qk673dj
Popis: In Part I of this paper, we introduced a new approach to the representation of the speech spectral envelope which makes use of the Karhunen-Loeve transformation of acoustic subword segments. This new signal-dependent representation captures, with a few KL vectors and transform coefficients, the perceptually and phonetically important structure of the spectral envelope. In this second part, we study the application of this new representation to the analysis, synthesis, and coding of speech. We propose simple quantization and coding strategies for the KL representation vectors as well as for the resulting transform coefficients. The resulting technique is a variable rate encoding scheme which achieves good speech quality at an average rate of 3.5 kilobits per second.
Databáze: OpenAIRE