Machine Learning-Based Obesity Classification: A Comparative Study Using Self-Reported Survey Data and Ensemble Learning Models

Gregorius Airlangga

doi:10.37012/jtik.v11i1.2585

Authors

Gregorius Airlangga Atma Jaya Catholic University of Indonesia, Indonesia

DOI:

https://doi.org/10.37012/jtik.v11i1.2585

Abstract

Obesity is one of the most pressing global health challenges of the 21st century: its prevalence is rising at an alarming rate, and it contributes to an increased risk of cardiovascular disease, diabetes, and other metabolic disorders. Traditional assessment methods, such as BMI-based classification, often fail to incorporate lifestyle and behavioral factors, which limits their predictive capability. This study explores the use of machine learning for obesity classification based on self-reported survey data collected from individuals in Mexico, Peru, and Colombia. The dataset comprises 2111 instances with 17 attributes, covering demographic characteristics, eating habits, and physical activity levels. Eight machine learning models (Logistic Regression, Random Forest, Gradient Boosting, Support Vector Machine (SVM), Decision Tree, K-Nearest Neighbors, Naïve Bayes, and AdaBoost) were evaluated using 10-fold cross-validation. Gradient Boosting achieved the highest accuracy, 96.49%, followed by Random Forest and SVM, demonstrating the effectiveness of ensemble learning techniques in capturing complex feature interactions. In contrast, Naïve Bayes and AdaBoost exhibited the lowest classification performance, which the study attributes to the strong feature-independence assumption of Naïve Bayes and the sensitivity of AdaBoost to noisy data. The findings highlight the potential of machine learning in obesity classification and underscore the need for advanced predictive models to enhance public health monitoring and intervention strategies.
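The 10-fold cross-validation protocol mentioned in the abstract can be illustrated with a minimal sketch. This is not the author's code and does not use the study's dataset: the three-class toy data, the helper names (k_fold_indices, knn_predict, cross_val_accuracy, make_toy_data), and the choice of k = 3 are all assumptions made for illustration. A small K-Nearest Neighbors classifier stands in for the eight models, and the folds are shuffled but not stratified.

```python
import random
from collections import Counter


def k_fold_indices(n_samples, n_folds=10, seed=0):
    """Shuffle sample indices and deal them into n_folds nearly equal folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::n_folds] for i in range(n_folds)]


def knn_predict(train_X, train_y, x, k=3):
    """Return the majority label among the k nearest training points."""
    distances = sorted(
        (sum((a - b) ** 2 for a, b in zip(row, x)), label)
        for row, label in zip(train_X, train_y)
    )
    votes = Counter(label for _, label in distances[:k])
    return votes.most_common(1)[0][0]


def cross_val_accuracy(X, y, n_folds=10, k=3, seed=0):
    """Hold out each fold once, train on the rest, and record its accuracy."""
    scores = []
    for held_out in k_fold_indices(len(X), n_folds, seed):
        held = set(held_out)
        train_X = [X[i] for i in range(len(X)) if i not in held]
        train_y = [y[i] for i in range(len(X)) if i not in held]
        correct = sum(
            knn_predict(train_X, train_y, X[i], k) == y[i] for i in held_out
        )
        scores.append(correct / len(held_out))
    return scores


def make_toy_data(n_per_class=50, seed=0):
    """Hypothetical stand-in data: three Gaussian blobs, one per class."""
    rng = random.Random(seed)
    X, y = [], []
    for label, (cx, cy) in enumerate([(0.0, 0.0), (5.0, 5.0), (10.0, 0.0)]):
        for _ in range(n_per_class):
            X.append((rng.gauss(cx, 1.0), rng.gauss(cy, 1.0)))
            y.append(label)
    return X, y


X, y = make_toy_data()
scores = cross_val_accuracy(X, y)
mean_accuracy = sum(scores) / len(scores)
print(f"folds: {len(scores)}, mean accuracy: {mean_accuracy:.4f}")
```

Averaging the per-fold accuracies gives a single figure of the kind reported in the abstract. With a library such as scikit-learn, the same loop would typically be replaced by a built-in cross-validation helper and a stratified splitter.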

Published

2025-03-25

Issue

Vol. 11 No. 1 (2025): Jurnal Teknologi Informatika dan Komputer

Section

Articles

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Jurnal Teknologi Informatika dan Komputer allows readers to read, download, copy, distribute, print, search, or link to the full texts of its articles and to use them for any other lawful purpose. The journal allows the author(s) to hold the copyright and to retain publishing rights without restrictions. Authors are allowed to archive their submitted article in an open access repository. Authors are also allowed to archive the final published article in an open access repository with an acknowledgment of its initial publication in this journal.

Jurnal Teknologi Informatika dan Komputer is licensed under a Creative Commons Attribution 4.0 International License.
