Fusing voice and query data for non-invasive detection of laryngeal disorders
Autor: | Jonas Minelga, Adas Gelzinis, Virgilijus Uloza, Evaldas Vaiciukynas, Magnus Hållander, Evaldas Padervinskis, Marija Bacauskiene, Antanas Verikas |
---|---|
Rok vydání: | 2015 |
Předmět: |
Voice activity detection
Association rule learning Computer science business.industry Listwise deletion Speech recognition General Engineering Word error rate Pattern recognition Missing data Ensemble learning Computer Science Applications Random forest Artificial Intelligence Imputation (statistics) Artificial intelligence business |
Zdroj: | Expert Systems with Applications. 42:8445-8453 |
ISSN: | 0957-4174 |
DOI: | 10.1016/j.eswa.2015.07.001 |
Popis: | Voice and query data are explored for the task of laryngeal disorders detection.Decision-level fusion by complete-case analysis is compared to imputation strategies.Query data outperform voice, fusion after iterative model-based imputation - the best.Human readable rules were extracted from the query data using affinity analysis. Topic of this study is exploration and fusion of non-invasive measurements for an accurate detection of pathological larynx. Measurements for human subject encompass answers to items of a specific survey and information extracted by the openSMILE toolkit from several audio recordings of sustained phonation (vowel /a/). Clinical diagnosis, assigned by medical specialist, is a target attribute distinguishing subject as healthy or pathological. Random forest (RF) is used here as a base-learner and also as a meta-learner for decision-level fusion. 5 RF classifiers, built separately on 3 variants of audio recording data (raw and after two types of voice activity detection) and 2 variants of questionnaire (with 9 and 26 questions) data, are fused selectively by finding out the best combination of all possible. Before fusion, due to presence of missing values in query modalities, several imputation techniques were evaluated besides the complete-case analysis by listwise deletion. Out-of-bag equal error rate (EER) was found to be higher for audio data and lower for query, but each variant was outperformed by the decision-level fusion. Fusion after listwise deletion provided EER of 4.84%, meanwhile imputation was found to improve detection slightly and helped to achieve EER of 4.55%. Variable importance, as permutation-based mean decrease in RF accuracy, was reported for query and audio data. Finally, regarding the noteworthy performance of the query data, 22 association rules (11 healthy + 11 pathological) using 17 questions were obtained for comprehensible insights. |
Databáze: | OpenAIRE |
Externí odkaz: |