Fusing voice and query data for non-invasive detection of laryngeal disorders

Autor: Jonas Minelga, Adas Gelzinis, Virgilijus Uloza, Evaldas Vaiciukynas, Magnus Hållander, Evaldas Padervinskis, Marija Bacauskiene, Antanas Verikas
Rok vydání: 2015
Předmět:
Zdroj: Expert Systems with Applications. 42:8445-8453
ISSN: 0957-4174
DOI: 10.1016/j.eswa.2015.07.001
Popis: Voice and query data are explored for the task of laryngeal disorders detection.Decision-level fusion by complete-case analysis is compared to imputation strategies.Query data outperform voice, fusion after iterative model-based imputation - the best.Human readable rules were extracted from the query data using affinity analysis. Topic of this study is exploration and fusion of non-invasive measurements for an accurate detection of pathological larynx. Measurements for human subject encompass answers to items of a specific survey and information extracted by the openSMILE toolkit from several audio recordings of sustained phonation (vowel /a/). Clinical diagnosis, assigned by medical specialist, is a target attribute distinguishing subject as healthy or pathological. Random forest (RF) is used here as a base-learner and also as a meta-learner for decision-level fusion. 5 RF classifiers, built separately on 3 variants of audio recording data (raw and after two types of voice activity detection) and 2 variants of questionnaire (with 9 and 26 questions) data, are fused selectively by finding out the best combination of all possible. Before fusion, due to presence of missing values in query modalities, several imputation techniques were evaluated besides the complete-case analysis by listwise deletion. Out-of-bag equal error rate (EER) was found to be higher for audio data and lower for query, but each variant was outperformed by the decision-level fusion. Fusion after listwise deletion provided EER of 4.84%, meanwhile imputation was found to improve detection slightly and helped to achieve EER of 4.55%. Variable importance, as permutation-based mean decrease in RF accuracy, was reported for query and audio data. Finally, regarding the noteworthy performance of the query data, 22 association rules (11 healthy + 11 pathological) using 17 questions were obtained for comprehensible insights.
Databáze: OpenAIRE