Implementation of Logistic Regression on Diabetic Dataset using Train-Test-Split, K-Fold and Stratified K-Fold Approach

Autor: Bhagat, Meenu, Bakariya, Brijesh
Zdroj: National Academy Science Letters; 20240101, Issue: Preprints p1-4, 4p
Abstrakt: Diabetes is a chronic metabolic disorder causing high blood sugars, that further severely affect body parts like the heart, liver, kidneys, lungs, eyes, nerves, blood vessels etc. There are three types of diabetes- Type-1 Diabetes, Type-2 Diabetes, and Gestational Diabetes. In Type-1, body of the patient fails to produce insulin. In Type-2 diabetes, cells of the body fails to respond to insulin effectively. Gestational diabetes occurs during pregnancy. There are many approaches used to analyse this disease. We have used the Machine learning approach for analysing diabetes. We have used 768 records from “pima diabetes dataset”. In this paper, we have used Logistic regression with Train Test Split, K-Fold cross-validation and Stratified K-Fold approach.
Databáze: Supplemental Index