Predicting Obesity Risk Through Lifestyle Habits: A Comparative Analysis of Machine Learning Models

Author: Wang Xiaotian
Language: English, French
Year of publication: 2024
Subject:
Source: E3S Web of Conferences, Vol 553, p 05037 (2024)
Document type: article
ISSN: 2267-1242
DOI: 10.1051/e3sconf/202455305037
Description: This paper addresses the escalating global concern of obesity, emphasizing the importance of identifying high-risk individuals so that targeted intervention strategies can be deployed. The study uses the University of California, Irvine (UCI) Machine Learning Repository dataset of 2,111 subjects from diverse regions, with obesity levels classified according to the Mexican Normativity, which closely aligns with Centers for Disease Control and Prevention (CDC) standards. The primary objective was to assess how well a range of machine learning models predict obesity levels from lifestyle habits alone, excluding direct parameters such as height and weight. The models analyzed were an enhanced logistic regression model, LogitBoost, Random Forests, XGBoost, Support Vector Machines (SVM), Naive Bayes classifiers, and K-Nearest Neighbors (KNN). Using cross-validation, the study ranked the factors contributing to obesity, identifying variables such as ‘Consumption of food between meals’ and ‘Obesity among family members’ as major contributors. The results indicate that although LogitBoost performed best among the boosting algorithms, it performed slightly below the traditional methods. By emphasizing lifestyle predictors and excluding direct height and weight variables, the study underscores the need for targeted, personalized intervention strategies in managing the global obesity epidemic.
Database: Directory of Open Access Journals