Phonetic accommodation to natural and synthetic voices : Behavior of groups and individuals in speech shadowing
Autor: | Ingmar Steiner, Iona Gessinger, Eran Raveh, Bernd Möbius |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2022 |
Předmět: |
Linguistics and Language
Speech recognition Phonetic accommodation Realization (linguistics) 02 engineering and technology 01 natural sciences Language and Linguistics Speech shadowing 0103 physical sciences 0202 electrical engineering electronic engineering information engineering Natural (music) 010301 acoustics Divergence (linguistics) Human-human interaction Communication 020206 networking & telecommunications German Human-computer interaction Computer Science Applications Feature (linguistics) Variation (linguistics) Modeling and Simulation Synthetic speech Schwa Computer Vision and Pattern Recognition Psychology Software Epenthesis |
DOI: | 10.22028/d291-38285 |
Popis: | The present study investigates whether native speakers of German phonetically accommodate to natural and synthetic voices in a shadowing experiment. We aim to determine whether this phenomenon, which is frequently found in HHI, also occurs in HCI involving synthetic speech. The examined features pertain to different phonetic domains: allophonic variation, schwa epenthesis, realization of pitch accents, word-based temporal structure and distribution of spectral energy. On the individual level, we found that the participants converged to varying subsets of the examined features, while they maintained their baseline behavior in other cases or, in rare instances, even diverged from the model voices. This shows that accommodation with respect to one particular feature may not predict the behavior with respect to another feature. On the group level, the participants of the natural condition converged to all features under examination, however very subtly so for schwa epenthesis. The synthetic voices, while partly reducing the strength of effects found for the natural voices, triggered accommodating behavior as well. The predominant pattern for all voice types was convergence during the interaction followed by divergence after the interaction. |
Databáze: | OpenAIRE |
Externí odkaz: |