Speech Synthesis for the Generation of Artificial Personality
Autor: | Matthew P. Aylett, Alessandro Vinciarelli, Mirjam Wester |
---|---|
Rok vydání: | 2020 |
Předmět: |
Personality
Automatic Personality Perception Automatic Personality Recognition Automatic Personality Synthesis media_common.quotation_subject 020206 networking & telecommunications Conscientiousness Speech synthesis 02 engineering and technology computer.software_genre Neuroticism Human-Computer Interaction 030507 speech-language pathology & audiology 03 medical and health sciences Naturalness Perception 0202 electrical engineering electronic engineering information engineering Openness to experience Personality Big Five personality traits 0305 other medical science Psychology computer Software media_common Cognitive psychology |
Zdroj: | Aylett, M, Vinciarelli, A & Wester, M 2017, ' Speech Synthesis for the Generation of Artificial Personality ', IEEE Transactions on Affective Computing . https://doi.org/10.1109/TAFFC.2017.2763134 |
ISSN: | 2371-9850 |
DOI: | 10.1109/taffc.2017.2763134 |
Popis: | A synthetic voice personifies the system using it. In this work we examine the impact text content, voice quality and synthesis system have on the perceived personality of two synthetic voices. Subjects rated synthetic utterances based on the Big-Five personality traits and naturalness. The naturalness rating of synthesis output did not correlate significantly with any Big-Five characteristic except for a marginal correlation with openness. Although text content is dominant in personality judgments, results showed that voice quality change implemented using a unit selection synthesis system significantly affected the perception of the Big-Five, for example tense voice being associated with being disagreeable and lax voice with lower conscientiousness. In addition a comparison between a parametric implementation and unit selection implementation of the same voices showed that parametric voices were rated as significantly less neurotic than both the text alone and the unit selection system, while the unit selection was rated as more open than both the text alone and the parametric system. The results have implications for synthesis voice and system type selection for applications such as personal assistants and embodied conversational agents where developing an emotional relationship with the user, or developing a branding experience is important. |
Databáze: | OpenAIRE |
Externí odkaz: |