Automatic speech analysis for detecting cognitive decline of older adults

Autor:	Lihe Huang, Hao Yang, Yiran Che, Jingjing Yang
Jazyk:	angličtina
Rok vydání:	2024
Předmět:	cognitive decline natural language processing machine learning automatic speech recognition language aging Public aspects of medicine RA1-1270
Zdroj:	Frontiers in Public Health, Vol 12 (2024)
Druh dokumentu:	article
ISSN:	2296-2565
DOI:	10.3389/fpubh.2024.1417966
Popis:	BackgroundSpeech analysis has been expected to help as a screening tool for early detection of Alzheimer’s disease (AD) and mild-cognitively impairment (MCI). Acoustic features and linguistic features are usually used in speech analysis. However, no studies have yet determined which type of features provides better screening effectiveness, especially in the large aging population of China.ObjectiveFirstly, to compare the screening effectiveness of acoustic features, linguistic features, and their combination using the same dataset. Secondly, to develop Chinese automated diagnosis model using self-collected natural discourse data obtained from native Chinese speakers.MethodsA total of 92 participants from communities in Shanghai, completed MoCA-B and a picture description task based on the Cookie Theft under the guidance of trained operators, and were divided into three groups including AD, MCI, and heathy control (HC) based on their MoCA-B score. Acoustic features (Pitches, Jitter, Shimmer, MFCCs, Formants) and linguistic features (part-of-speech, type-token ratio, information words, information units) are extracted. The machine algorithms used in this study included logistic regression, random forest (RF), support vector machines (SVM), Gaussian Naive Bayesian (GNB), and k-Nearest neighbor (kNN). The validation accuracies of the same ML model using acoustic features, linguistic features, and their combination were compared.ResultsThe accuracy with linguistic features is generally higher than acoustic features in training. The highest accuracy to differentiate HC and AD is 80.77% achieved by SVM, based on all the features extracted from the speech data, while the highest accuracy to differentiate HC and AD or MCI is 80.43% achieved by RF, based only on linguistic features.ConclusionOur results suggest the utility and validity of linguistic features in the automated diagnosis of cognitive impairment, and validated the applicability of automated diagnosis for Chinese language data.
Databáze:	Directory of Open Access Journals
Externí odkaz:	https://doaj.org/article/0f7fc2d1f2d54d44b9f823faecc5e3dd Zobrazit plný text záznamu View record in DOAJ