RFFE - Random Forest Fuzzy Entropy for the classification of Diabetes Mellitus.
Autor: | Usha Ruby A; School of Computing Science and Engineering Department, VIT Bhopal University, Bhopal-Indore Highway, Kothrikalan, Sehore, Madhya Pradesh-466114, India., George Chellin Chandran J; School of Computing Science and Engineering Department, VIT Bhopal University, Bhopal-Indore Highway, Kothrikalan, Sehore, Madhya Pradesh-466114, India., Swasthika Jain TJ; Department of Computer Science and Engineering, GITAM School of Technology, Nagadenehalli, Doddaballapura, Karnataka-561203, India., Chaithanya BN; Department of Computer Science and Engineering, GITAM School of Technology, Nagadenehalli, Doddaballapura, Karnataka-561203, India., Patil R; Department of Computer Science and Engineering, GITAM School of Technology, Nagadenehalli, Doddaballapura, Karnataka-561203, India. |
---|---|
Jazyk: | angličtina |
Zdroj: | AIMS public health [AIMS Public Health] 2023 May 23; Vol. 10 (2), pp. 422-442. Date of Electronic Publication: 2023 May 23 (Print Publication: 2023). |
DOI: | 10.3934/publichealth.2023030 |
Abstrakt: | Diabetes is a category of metabolic disease commonly known as a chronic illness. It causes the body to generate less insulin and raises blood sugar levels, leading to various issues and disrupting the functioning of organs, including the retinal, kidney and nerves. To prevent this, people with chronic illnesses require lifetime access to treatment. As a result, early diabetes detection is essential and might save many lives. Diagnosis of people at high risk of developing diabetes is utilized for preventing the disease in various aspects. This article presents a chronic illness prediction prototype based on a person's risk feature data to provide an early prediction for diabetes with Fuzzy Entropy random vectors that regulate the development of each tree in the Random Forest. The proposed prototype consists of data imputation, data sampling, feature selection, and various techniques to predict the disease, such as Fuzzy Entropy, Synthetic Minority Oversampling Technique (SMOTE), Convolutional Neural Network (CNN) with Stochastic Gradient Descent with Momentum (SGDM), Support Vector Machines (SVM), Classification and Regression Tree (CART), K-Nearest Neighbor (KNN), and Naïve Bayes (NB). This study uses the existing Pima Indian Diabetes (PID) dataset for diabetic disease prediction. The predictions' true/false positive/negative rate is investigated using the confusion matrix and the receiver operating characteristic area under the curve (ROCAUC). Findings on a PID dataset are compared with machine learning algorithms revealing that the proposed Random Forest Fuzzy Entropy (RFFE) is a valuable approach for diabetes prediction, with an accuracy of 98 percent. Competing Interests: Conflict of interest: The authors declare no conflict of interest. (© 2023 the Author(s), licensee AIMS Press.) |
Databáze: | MEDLINE |
Externí odkaz: |