Effect of speaking rate and contrastive stress on formant dynamics and vowel perception

Author:	Michel Pitermann
Contributors:	Laboratoire Parole et Langage (LPL), Aix Marseille Université (AMU)-Centre National de la Recherche Scientifique (CNRS)
Language:	English
Publication year:	2000
Subject:	Speech production; Speech perception; Acoustics and Ultrasonics; Speech recognition; formant modeling; 01 natural sciences; Models, Biological; static; 030507 speech-language pathology & audiology; 03 medical and health sciences; Arts and Humanities (miscellaneous); Phonetics; kinematics; Vowel; 0103 physical sciences; Stress (linguistics); Humans; Speech; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; contrastive stress; 010301 acoustics; speaking rate; Mathematics; dynamics; [SCCO.LING]Cognitive science/Linguistics; Linguistics; Formant; Mid vowel; 0305 other medical science; Sentence
Source:	Journal of the Acoustical Society of America, Acoustical Society of America, 2000, 107, pp. 3425-3437
ISSN:	0001-4966, 1520-8524
Description:	Vowel formants play an important role in speech theories and applications; however, the same formant values measured for the steady-state part of a vowel can correspond to different vowel categories. Experimental evidence indicates that dynamic information can also contribute to vowel characterization. Hence, dynamically modeling formant transitions may lead to quantitatively testable predictions in vowel categorization. Because the articulatory strategy used to manage different speaking rates and contrastive stress may depend on speaker and situation, the parameter values of a dynamic formant model may vary with speaking rate and stress. In most experiments speaking rate is rarely controlled, only two or three rates are tested, and most corpora contain just a few repetitions of each item. As a consequence, the dependence of dynamic models on those factors is difficult to gauge. This article presents a study of 2300 [iai] or [iEi] stimuli produced by two speakers at nine or ten speaking rates in a carrier sentence for two contrastive stress patterns. The corpus was perceptually evaluated by naive listeners. Formant frequencies were measured during the steady-state parts of the stimuli, and the formant transitions were dynamically and kinematically modeled. The results indicate that (1) the corpus was characterized by a contextual assimilation instead of a centralization effect; (2) dynamic or kinematic modeling was equivalent as far as the analysis of the model parameters was concerned; (3) the dependence of the model parameter estimates on speaking rate and stress suggests that the formant transitions were sharper for high speaking rate, but no consistent trend was found for contrastive stress; (4) the formant frequencies measured in the steady-state parts of the vowels were sufficient to explain the perceptual results while the dynamic parameters of the models were not.
Database:	OpenAIRE
External link:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::7783be86dc7b1f1b4ba399fac9277c32 https://hal.science/hal-00131702/file/2467.pdf