CLEMI-Imputation Evaluation
Autor: | Anthony Chapman, George M. Coghill, Wei Pang |
---|---|
Rok vydání: | 2018 |
Předmět: |
Computer science
business.industry Machine learning computer.software_genre Missing data 01 natural sciences 010104 statistics & probability 03 medical and health sciences 0302 clinical medicine Software 030212 general & internal medicine Imputation (statistics) Artificial intelligence 0101 mathematics Cluster analysis business computer Test data |
Zdroj: | SACI |
DOI: | 10.1109/saci.2018.8440981 |
Popis: | Missing data is challenging enough without the added complexities posed by a lack of research in evaluating imputation. Not only could we potentially increase the impact and validity of studies from many different sectors (research, public and private), we also believe that by creating evaluation software, more researchers may be willing to use and justify using imputation methods. This paper aims to encourage further research for efficient imputation evaluation by defining a framework which could be used to optimise the way we impute datasets prior to data analysis. We propose a framework which uses a prototypical approach to create testing data and machine learning methods to create a new metric for evaluation. Preliminary results are presented which show how, for our dataset, records with less than 40% missingness could be used for analysis, increasing the amount of available data. |
Databáze: | OpenAIRE |
Externí odkaz: |