Predicting complications of diabetes mellitus using advanced machine learning algorithms

Autor:	Daniel Polimac, Wilson Diaz, Branimir Ljubic, Marija Stanojevic, Ameen Abdel Hai, Martin Pavlovski, Zoran Obradovic
Jazyk:	angličtina
Rok vydání:	2020
Předmět:	030209 endocrinology & metabolism Health Informatics 030204 cardiovascular system & hematology Machine learning computer.software_genre Research and Applications Risk Assessment Diabetes Complications 03 medical and health sciences 0302 clinical medicine Deep Learning Diabetes mellitus Medicine Humans Healthcare Cost and Utilization Project business.industry Deep learning Decision Trees Electronic medical record Type 2 Diabetes Mellitus medicine.disease Prognosis Random forest Recurrent neural network Diabetes Mellitus Type 2 Multilayer perceptron Artificial intelligence Neural Networks Computer business computer Algorithms
Zdroj:	J Am Med Inform Assoc
Popis:	ObjectiveWe sought to predict if patients with type 2 diabetes mellitus (DM2) would develop 10 selected complications. Accurate prediction of complications could help with more targeted measures that would prevent or slow down their development.Materials and MethodsExperiments were conducted on the Healthcare Cost and Utilization Project State Inpatient Databases of California for the period of 2003 to 2011. Recurrent neural network (RNN) long short-term memory (LSTM) and RNN gated recurrent unit (GRU) deep learning methods were designed and compared with random forest and multilayer perceptron traditional models. Prediction accuracy of selected complications were compared on 3 settings corresponding to minimum number of hospitalizations between diabetes diagnosis and the diagnosis of complications.ResultsThe diagnosis domain was used for experiments. The best results were achieved with RNN GRU model, followed by RNN LSTM model. The prediction accuracy achieved with RNN GRU model was between 73% (myocardial infarction) and 83% (chronic ischemic heart disease), while accuracy of traditional models was between 66% – 76%.DiscussionThe number of hospitalizations was an important factor for the prediction accuracy. Experiments with 4 hospitalizations achieved significantly better accuracy than with 2 hospitalizations. To achieve improved accuracy deep learning models required training on at least 1000 patients and accuracy significantly dropped if training datasets contained 500 patients. The prediction accuracy of complications decreases over time period. Considering individual complications, the best accuracy was achieved on depressive disorder and chronic ischemic heart disease.ConclusionsThe RNN GRU model was the best choice for electronic medical record type of data, based on the achieved results.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::3b6d7858f6c02d55afa2f899e9cb8b1a https://europepmc.org/articles/PMC7647294/ Zobrazit plný text záznamu