Models of speech production for speech analysis and synthesis

Autor:	M. M. Sondhi
Rok vydání:	1990
Předmět:	Speech production Voice activity detection Acoustics and Ultrasonics Arts and Humanities (miscellaneous) Computer science Acoustics Speech recognition Speech coding Linear predictive coding Speech processing Utterance
Zdroj:	The Journal of the Acoustical Society of America. 87:S14-S14
ISSN:	0001-4966
DOI:	10.1121/1.2028027
Popis:	Coding (i.e., analysis and resynthesis) of speech signals on the basis of physiological models of speech production mechanisms has received considerable attention during the past several years. One of the reasons for this renewed interest is that the known coding algorithms give unacceptably poor quality of speech at low bit rates (e.g., at 2400 bits/s). A coding scheme that mimics human speech production may have advantages at such low bit rates. Basically, three aspects of speech production need to be modeled. First, the geometry of the vocal and nasal tracts needs to be parametrized. Second, a model must be selected to describe wave propagation in the tract. Finally, the sound sources (vocal cords and turbulent airflow) and their interactions with the tract must be modeled. In this talk the current models being used for each of these categories will be briefly described, and the way in which these models are employed for coding will also be described. Also, the use of such models for text‐to‐speech synthesis will be mentioned. Finally, some examples of speech produced by such models, both from text as well as by analysis and resynthesis of a given speech utterance will be played.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::7e2076f04d89c52855e80ca1ae77c75f https://doi.org/10.1121/1.2028027 Zobrazit plný text záznamu