CLEMI-Imputation Evaluation

Autor:	Anthony Chapman, George M. Coghill, Wei Pang
Rok vydání:	2018
Předmět:	Computer science business.industry Machine learning computer.software_genre Missing data 01 natural sciences 010104 statistics & probability 03 medical and health sciences 0302 clinical medicine Software 030212 general & internal medicine Imputation (statistics) Artificial intelligence 0101 mathematics Cluster analysis business computer Test data
Zdroj:	SACI
DOI:	10.1109/saci.2018.8440981
Popis:	Missing data is challenging enough without the added complexities posed by a lack of research in evaluating imputation. Not only could we potentially increase the impact and validity of studies from many different sectors (research, public and private), we also believe that by creating evaluation software, more researchers may be willing to use and justify using imputation methods. This paper aims to encourage further research for efficient imputation evaluation by defining a framework which could be used to optimise the way we impute datasets prior to data analysis. We propose a framework which uses a prototypical approach to create testing data and machine learning methods to create a new metric for evaluation. Preliminary results are presented which show how, for our dataset, records with less than 40% missingness could be used for analysis, increasing the amount of available data.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::95b9d664d0ba56587f1faeabe2234ae3 https://doi.org/10.1109/saci.2018.8440981 Zobrazit plný text záznamu