VERONICA: Visual Analytics for Identifying Feature Groups in Disease Classification

Autor: Neda Rostamzadeh, Sheikh S. Abdullah, Kamran Sedig, Amit X. Garg, Eric McArthur
Jazyk: angličtina
Rok vydání: 2021
Předmět:
Zdroj: Information, Vol 12, Iss 9, p 344 (2021)
Druh dokumentu: article
ISSN: 12090344
2078-2489
DOI: 10.3390/info12090344
Popis: The use of data analysis techniques in electronic health records (EHRs) offers great promise in improving predictive risk modeling. Although useful, these analysis techniques often suffer from a lack of interpretability and transparency, especially when the data is high-dimensional. The emergence of a type of computational system known as visual analytics has the potential to address these issues by integrating data analysis techniques with interactive visualizations. This paper introduces a visual analytics system called VERONICA that utilizes the natural classification of features in EHRs to identify the group of features with the strongest predictive power. VERONICA incorporates a representative set of supervised machine learning techniques—namely, classification and regression tree, C5.0, random forest, support vector machines, and naive Bayes to support users in developing predictive models using EHRs. It then makes the analytics results accessible through an interactive visual interface. By integrating different sampling strategies, analytics algorithms, visualization techniques, and human-data interaction, VERONICA assists users in comparing prediction models in a systematic way. To demonstrate the usefulness and utility of our proposed system, we use the clinical dataset stored at ICES to identify the best representative feature groups in detecting patients who are at high risk of developing acute kidney injury.
Databáze: Directory of Open Access Journals
Nepřihlášeným uživatelům se plný text nezobrazuje