Automatic Evaluation of Speech Intelligibility Based on i-vectors in the Context of Head and Neck Cancers
Author: | Corinne Fredouille, Virginie Woisard, Alain Ghio, Imed Laaridh, Muriel Lalain |
---|---|
Contributors: | Laboratoire Informatique d'Avignon (LIA), Avignon Université (AU)-Centre d'Enseignement et de Recherche en Informatique - CERI, Laboratoire Parole et Langage (LPL), Aix Marseille Université (AMU)-Centre National de la Recherche Scientifique (CNRS), INCA |
Language: | English |
Year of publication: | 2018 |
Subject: |
Computer science
media_common.quotation_subject Speech recognition speech disorders Troubles de la parole 02 engineering and technology Intelligibility (communication) head and neck cancers Cancer ORL 030507 speech-language pathology & audiology 03 medical and health sciences Perception 0202 electrical engineering electronic engineering information engineering Traitement automatique de la parole [INFO]Computer Science [cs] [SHS.LANGUE]Humanities and Social Sciences/Linguistics Head and neck media_common Speech Acoustics intelligibility Phonétique clinique Intelligibilité automatic speech processing Parole 020201 artificial intelligence & image processing 0305 other medical science [SDV.MHEP]Life Sciences [q-bio]/Human health and pathology i-vectors |
Source: | Interspeech 2018, Sep 2018, Hyderabad, India, pp. 2943-2947, ⟨10.21437/interspeech.2018-1266⟩ |
DOI: | 10.21437/interspeech.2018-1266 |
Description: | In the context of disordered speech, and despite its well-known subjectivity, perceptual evaluation is still the method most commonly used in clinical practice to assess the intelligibility of patients' speech productions. However, thanks to increasing computing power, automatic speech processing systems have reached a much wider range of users and application areas, including medical practice. In this paper, we evaluate an automatic approach for predicting the speech intelligibility of cancer patients, based on representing the speech acoustics in the total variability subspace of the i-vector paradigm. Experimental evaluations of the proposed predictive approach show a very high correlation with perceptual intelligibility when applied to the French speech corpus C2SI (r=0.84). They also demonstrate the robustness of the approach when only a limited amount of disordered speech is available per patient, which may lead to redesigned and lighter test protocols than those usually used in the evaluation of disordered speech. |
Database: | OpenAIRE |