Speech Synthesis for the Generation of Artificial Personality

Autor:	Matthew P. Aylett, Alessandro Vinciarelli, Mirjam Wester
Rok vydání:	2020
Předmět:	Personality Automatic Personality Perception Automatic Personality Recognition Automatic Personality Synthesis media_common.quotation_subject 020206 networking & telecommunications Conscientiousness Speech synthesis 02 engineering and technology computer.software_genre Neuroticism Human-Computer Interaction 030507 speech-language pathology & audiology 03 medical and health sciences Naturalness Perception 0202 electrical engineering electronic engineering information engineering Openness to experience Personality Big Five personality traits 0305 other medical science Psychology computer Software media_common Cognitive psychology
Zdroj:	Aylett, M, Vinciarelli, A & Wester, M 2017, ' Speech Synthesis for the Generation of Artificial Personality ', IEEE Transactions on Affective Computing . https://doi.org/10.1109/TAFFC.2017.2763134
ISSN:	2371-9850
DOI:	10.1109/taffc.2017.2763134
Popis:	A synthetic voice personifies the system using it. In this work we examine the impact text content, voice quality and synthesis system have on the perceived personality of two synthetic voices. Subjects rated synthetic utterances based on the Big-Five personality traits and naturalness. The naturalness rating of synthesis output did not correlate significantly with any Big-Five characteristic except for a marginal correlation with openness. Although text content is dominant in personality judgments, results showed that voice quality change implemented using a unit selection synthesis system significantly affected the perception of the Big-Five, for example tense voice being associated with being disagreeable and lax voice with lower conscientiousness. In addition a comparison between a parametric implementation and unit selection implementation of the same voices showed that parametric voices were rated as significantly less neurotic than both the text alone and the unit selection system, while the unit selection was rated as more open than both the text alone and the parametric system. The results have implications for synthesis voice and system type selection for applications such as personal assistants and embodied conversational agents where developing an emotional relationship with the user, or developing a branding experience is important.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::17a46f3ff8acf1dec24d754a3897c1c8 https://doi.org/10.1109/taffc.2017.2763134 Zobrazit plný text záznamu