The semantic space for emotional speech and the influence of different methods for prosody isolation on its perception
Autor: | Susana Castillo, Douglas W. Cunningham, Martin Schorradt |
---|---|
Rok vydání: | 2018 |
Předmět: |
Facial expression
Phrase Speech recognition media_common.quotation_subject 05 social sciences Semantics 050105 experimental psychology 03 medical and health sciences 0302 clinical medicine Perception 0501 psychology and cognitive sciences Semantic differential Prosody Psychology 030217 neurology & neurosurgery Sentence Gesture media_common |
Zdroj: | SAP |
Popis: | Normally, when people talk to other people, they communicate not only using specific words, but also with intentional changes in their voice melody, facial expressions, and gestures. Not only is human communication inherently multimodal, it is also multi-layered. That is, it conveys more than simple semantic information, but also passes on a wide variety of social, emotional, and functional (e.g., conversation control) information. Previous work has examined the perception of socio-emotional information conveyed by words and facial expressions. Here, we build on that work and examine the perception of socio-emotional information based solely on prosody (e.g., speech melody, rate, tempo, intensity). To examine the perception of affective prosody, it is necessary to remove all semantics from the speech signal - without changing the prosody! In this paper, we compare several different state-of-the-art methods for removing semantics. We started by recording an audio database containing a German sentence spoken by 11 people in 62 different emotional states. We then removed or masked the semantics using three different techniques. We also recorded the same 62 states for a pseudo-language phrase. Each of these five sets of stimuli were subjected to a semantic differential rating task to derive and compare the semantic spaces for emotions. The results show that each of the methods successfully removed the semantic component, but also changed the perception of the emotional content. Interestingly, the pseudo-word stimuli diverged most from the normal sentences. Furthermore, although each of the filters affected the perception of the sentence in some manner, they did so in different ways. |
Databáze: | OpenAIRE |
Externí odkaz: |