Influence of TTS systems performance on reaction times in people with aphasia
Autor: | Ineke van der Meulen, Giorgia Cistola, Mireia Farrús, Alex Peiró-Lilja, Guillermo Cámbara |
---|---|
Přispěvatelé: | Rehabilitation Medicine |
Jazyk: | angličtina |
Rok vydání: | 2021 |
Předmět: |
medicine.medical_specialty
Technology jitter reading impairments QH301-705.5 media_common.quotation_subject QC1-999 Speech synthesis Intelligibility (communication) Audiology computer.software_genre Voice analysis Naturalness Reading (process) Aphasia medicine General Materials Science Active listening Speech processing systems Biology (General) Instrumentation QD1-999 Human voice media_common Comprensió Fluid Flow and Transfer Processes intelligibility Process Chemistry and Technology Physics General Engineering shimmer Engineering (General). Civil engineering (General) aphasia Computer Science Applications Chemistry naturalness text-to-speech systems Processament de la parla medicine.symptom TA1-2040 Psychology Comprehension computer Afàsia |
Zdroj: | Applied Sciences, Vol 11, Iss 11320, p 11320 (2021) Dipòsit Digital de la UB Universidad de Barcelona Applied Sciences (Switzerland), 11(23):11320. Multidisciplinary Digital Publishing Institute (MDPI) Applied Sciences; Volume 11; Issue 23; Pages: 11320 |
ISSN: | 2076-3417 |
Popis: | Text-to-speech (TTS) systems provide fundamental reading support for people with aphasia and reading difficulties. However, artificial voices are more difficult to process than natural voices. The current study is an extended analysis of the results of a clinical experiment investigating which, among three artificial voices and a digitised human voice, is more suitable for people with aphasia and reading impairments. Such results show that the voice synthesised with Ogmios TTS, a concatenative speech synthesis system, caused significantly slower reaction times than the other three voices used in the experiment. The present study explores whether and what voice quality metrics are linked to delayed reaction times. For this purpose, the voices were analysed using an automatic assessment of intelligibility, naturalness, and jitter and shimmer voice quality parameters. This analysis revealed that Ogmios TTS, in general, performed worse than the other voices in all parameters. These observations could explain the significantly delayed reaction times in people with aphasia and reading impairments when listening to Ogmios TTS and could open up consideration about which TTS to choose for compensative devices for these patients based on the voice analysis of these parameters. |
Databáze: | OpenAIRE |
Externí odkaz: |