Comparing Different Machine Learning Techniques in Predicting Diabetes on Early Stage

Autor: Shweta Yadu, Rashmi Chandra, Vivek Kumar Sinha
Jazyk: angličtina
Rok vydání: 2024
Předmět:
Zdroj: Engineering Proceedings, Vol 62, Iss 1, p 20 (2024)
Druh dokumentu: article
ISSN: 2673-4591
DOI: 10.3390/engproc2024062020
Popis: One of the diseases that is constantly spreading and is estimated to cause a significant number of deaths worldwide is diabetes mellitus. It is determined by the quantity of a blood sugar molecule made from glucose. The possibility of this disease has been predicted using a variety of methods. To forecast diabetes at an early stage, adequate and clear data on diabetic individuals are needed. In this study, 520 records from a hospital in Bangladesh with 16 different characteristic numbers were used to make predictions. At UCI, this dataset is accessible to everyone. We used Random Forest, Ada Booster, KNN, and Bagging algorithms after feature selection. Through 10-fold cross-validation, it was discovered that the Random Forest method had the best test accuracy, scoring 97.03% correctly and 95.03% correctly.
Databáze: Directory of Open Access Journals