Machine Learning Approach: A Comparative Analysis of Classifiers in Predicting Obesity Type

Authors

  • Jeffrey Tedjasulaksana Bina Nusantara University
  • Ferry Jaya Dinata Bina Nusantara University
  • Rafael Krisnadi Bina Nusantara University
  • Matthew S.W. Reksosamudro Bina Nusantara University
  • Wilbert Wen Bina Nusantara University
  • Muhammad Fadlan Hidayat Bina Nusantara University

DOI:

https://doi.org/10.21512/emacsjournal.v8i1.15268

Keywords:

health, machine learning, Neural Network, hyperparameter tuning, AI application

Abstract

Obesity is a growing global public health concern that increases the risk of chronic diseases and significantly affects quality of life. Traditional diagnostic methods such as Body Mass Index (BMI) have limitations in accurately representing body fat distribution and individual health conditions. This study aims to comparatively evaluate the performance of various machine learning and neural network models in predicting obesity levels using a multiclass classification approach. The dataset consists of 2,111 observations with 12 predictor variables and seven obesity categories, obtained from a publicly available source. Data preprocessing included duplicate removal, outlier handling using the interquartile range method, feature scaling, and categorical encoding, followed by a 60:20:20 train–validation–test split. Several classifiers were implemented, including Logistic Regression, Support Vector Classifier, Random Forest, Extra Trees, Gradient Boosting-based models (XGBoost and LightGBM), Multilayer Perceptron, K-Nearest Neighbors, and TabNet. Model performance was evaluated using macro-average F1-score and confusion matrix analysis. The results indicate that LightGBM achieved the highest predictive performance with an F1-score of 0.96, demonstrating strong generalization across obesity categories. XGBoost and Random Forest also showed strong performance, while Support Vector Classifier exhibited consistent results across training, validation, and cross-validation. These findings suggest that ensemble-based models are highly effective for obesity classification, while model selection should consider accuracy, interpretability, and computational constraints.

Dimensions

Author Biographies

Ferry Jaya Dinata, Bina Nusantara University

Computer Science Department, School of Computer Science

Rafael Krisnadi, Bina Nusantara University

Computer Science Department, School of Computer Science

Matthew S.W. Reksosamudro, Bina Nusantara University

Computer Science Department, School of Computer Science

Wilbert Wen, Bina Nusantara University

Computer Science Department, School of Computer Science

Muhammad Fadlan Hidayat, Bina Nusantara University

Computer Science Department, School of Computer Science

References

Aditya Mahindru, Pradeep Patil, Varun Agrawal. (2023). Role of Physical Activity on Mental Health and Well-Being: A Review, 1-7, DOI: 10.7759/cureus.33475

Airlangga, G. (2025). A comparative analysis of machine learning models for obesity prediction. Jurnal Informatika Ekonomi Bisnis, 7(1). https://doi.org/10.37034/infeb.v7i1.1089

Ali, A., Wei, Y., Tyson, J., Akerman, H., Jackson, A. I. R., Lane, R., Spencer, D., & White, N. M. (2024). Enhancing the response of a wearable sensor for improved respiratory rate (RR) monitoring. IEEE Access. https://doi.org/10.1109/ACCESS.2024.3509676

Alimbayeva, Z., Alimbayev, C., Ozhikenov, K., Bayanbay, N., & Ozhikenova, A. (2024). Wearable ECG Device and Machine Learning for Heart Monitoring. Sensors 2024, Vol. 24, Page 4201, 24(13), 4201. https://doi.org/10.3390/S24134201

Angela K. Fitch, Harold E. Bays. (2022). Obesity definition, diagnosis, bias, standard operating procedures (SOPs), and telehealth: An Obesity Medicine Association (OMA) Clinical Practice Statement (CPS) 2022, Obesity Pillars, Volume 1, 100004, ISSN 2667-3681, https://doi.org/10.1016/j.obpill.2021.100004

Arik, S. O., & Pfister, T. (2019). TabNet: Attentive interpretable tabular learning. arXiv. https://doi.org/10.48550/arXiv.1908.07442

Ba, N., Yue, W., Cao, C., Wu, W., & Cheng, P. (2024). Advances in Wearable Smart Chemical Sensors for Health Monitoring. Applied Sciences 2024, Vol. 14, Page 11199, 14(23), 11199. https://doi.org/10.3390/APP142311199

Bae, J. P., Nelson, D. R., Boye, K. S., & Mather, K. J. (2025). Prevalence of complications and comorbidities associated with obesity: A health insurance claims analysis. BMC Public Health, 25(1). https://doi.org/10.1186/s12889-024-21061-z

Chauhan, P., & Srivastava, S. (2025). Comparative analysis of deep learning and machine learning techniques for obesity classification. Atlantis Press.

Dirik, M. (2023). Application of machine learning techniques for obesity prediction: A comparative study. Journal of Complexity in Health Sciences, 6(2), 16–34. https://doi.org/10.21595/chs.2023.23193

Dumakude, A., & Ezugwu, A. E. (2023). Automated COVID-19 detection with convolutional neural networks. Scientific Reports, 13(1). https://doi.org/10.1038/s41598-023-37743-4

Erik Hemmingsson, Paulina Nowicka, Stanley Ulijaszek,Thorkild I. A. Sørensen. (2022). The social origins of obesity within and across generations, 1-11. https://doi.org/10.1111/obr.13514

Fruh, S. M. (2017). Obesity. Journal of the American Association of Nurse Practitioners, 29(S1), S3–S14. https://doi.org/10.1002/2327-6924.12510

Ge, Y., Wang, Q., Wang, L., Wu, H., Peng, C., Wang, J., Xu, Y., Xiong, G., Zhang, Y., & Yi, Y. (2019). Predicting post-stroke pneumonia using deep neural network approaches. International Journal of Medical Informatics, 132, 103986. https://doi.org/10.1016/j.ijmedinf.2019.103986

Habehh, H., & Gohel, S. (2021). Machine learning in healthcare. Current Genomics, 22(4), 291–300. https://doi.org/10.2174/1389202922666210705124359

Ikharo, B. A., & Aliu, D. (2023). Challenges Associated with Wearable Internet-of-Things (IoTs) Monitoring Systems for E-Health. FUOYE Journal of Engineering and Technology, 8(4). https://doi.org/10.46792/FUOYEJET.V8I4.1099

Jakachira, R., Jakachira, R., Yan, W., Burrow, J. A., Toussaint, K. C., & Toussaint, K. C. (2024). Dual-Wavelength, Polarization-Sensitive Wearable Photoplethysmographic Sensor on Diverse Skin Tones. Frontiers in Optics + Laser Science 2024 (FiO, LS) (2024), Paper JW4A.65, JW4A.65. https://doi.org/10.1364/FIO.2024.JW4A.65

Jeon, J., Lee, S., & Oh, C. (2022). Age-specific risk factors for the prediction of obesity using a machine learning approach. Research Square Platform LLC. https://doi.org/10.21203/rs.3.rs-1515734/v1

Joyce, D., De Brún, A., Symmons, S. M., Fox, R., & McAuliffe, E. (2023). Remote patient monitoring for COVID-19 patients: comparisons and framework for reporting. BMC Health Services Research, 23(1), 1–11. https://doi.org/10.1186/S12913-023-09526-0/TABLES/3

Kajzar, M. (2024). Wearable Devices for Training and Patient Monitoring: A Comprehensive Review. Quality in Sport, 29, 55667. https://doi.org/10.12775/QS.2024.29.55667

Kuhn, M., & Johnson, K. (2013). Applied predictive modeling. Springer New York. https://doi.org/10.1007/978-1-4614-6849-3

Kumar, G., & Alqahtani, H. (2022). Deep learning-based cancer detection-recent developments, trend and challenges. Computer Modeling in Engineering & Sciences, 130(3), 1271–1307. https://doi.org/10.32604/cmes.2022.018418

Li, Z., Li, H., & Gianchandani, Y. B. (2024). A Disposable Sensor for PM2.5 and PM10 Based on Wireless Magnetoelastic Resonators. Proceedings of IEEE Sensors. https://doi.org/10.1109/SENSORS60989.2024.10785131

Lin, W., Shi, S., Huang, H., Wen, J., & Chen, G. (2023). Predicting risk of obesity in overweight adults using interpretable machine learning algorithms. Frontiers in Endocrinology, 14. https://doi.org/10.3389/fendo.2023.1292167

Linh, V. T. N., Han, S., Koh, E., Kim, S., Jung, H. S., & Koo, J. (2025). Advances in wearable electronics for monitoring human organs: Bridging external and internal health assessments. Biomaterials, 314, 122865. https://doi.org/10.1016/J.BIOMATERIALS.2024.122865

Mahmood Safaei, Elankovan A. Sundararajan, Maha Driss, Wadii Boulila, Azrulhizam Shapi'i. (2021). A systematic literature review on obesity: Understanding the causes & consequences of obesity and reviewing various machine learning approaches used to predict obesity, Computers in Biology and Medicine, Volume 136, 104754, ISSN 0010-4825, https://doi.org/10.1016/j.compbiomed.2021.104754

Mousavi, S., Bieber, K., Zirpel, H., Vorobyev, A., Olbrich, H., Papara, C., De Luca, D. A., Thaci, D., Schmidt, E., Riemekasten, G., Lamprecht, P., Laudes, M., Kridin, K., & Ludwig, R. J. (2025). Large-scale analysis highlights obesity as a risk factor for chronic, non-communicable inflammatory diseases. Frontiers in Endocrinology, 16. https://doi.org/10.3389/fendo.2025.1516433

Nuttall, F. Q. (2015). Body mass index. Nutrition Today, 50(3), 117–128. https://doi.org/10.1097/nt.0000000000000092

Poirier, P., Eckel, R. H., Lavie, C. J., Van Gaal, L. F., Ross, R., Després, J. P., & Sharma, A. M. (2006). Obesity and cardiovascular disease: A scientific statement from the American Heart Association. Circulation, 113(6), 898–918 https://doi.org/10.1161/CIRCULATIONAHA.106.171016.

Poliban. (2025). Radial basis function model for obesity classification (PDF). Eltikom Poliban.

Safaei, M., Sundararajan, E. A., Driss, M., Boulila, W., & Shapi’i, A. (2021). A systematic literature review on obesity: Understanding the causes & consequences of obesity and reviewing various machine learning approaches used to predict obesity. Computers in Biology and Medicine, 136, 104754. https://doi.org/10.1016/j.compbiomed.2021.104754

Shivani Aggarwal, Kavita Pandey. (2023). Early identification of PCOS with commonly known diseases: Obesity, diabetes, high blood pressure and heart disease using machine learning techniques, Expert Systems with Applications, Volume 217, 119532, ISSN 0957-4174, https://doi.org/10.1016/j.eswa.2023.119532

Wu, Y., Li, D., & Vermund, S. H. (2024). Advantages and limitations of the body mass index (BMI) to assess adult obesity. International Journal of Environmental Research and Public Health, 21(6), 757. https://doi.org/10.3390/ijerph21060757

Xiaoyi Shi, Yuxin Zheng, Haiwen Cui, Yuxi Zhang, Menghui Jiang. (2022). Exposure to outdoor and indoor air pollution and risk of overweight and obesity across different life periods: A review, Ecotoxicology and Environmental Safety, Volume 242, 113893, ISSN 0147-6513, https://doi.org/10.1016/j.ecoenv.2022.113893

Zeedhan, M., Mohamed Ziham, M. M., Abdul Razick, M. S., & Ul Amin, N. (2025). Predicting obesity classification using k-nearest neighbors: A data science approach in Python. Preprints.

Zhang, Y., Wang, H., Cui, J., He, T., Qiu, G., Xu, Y., & Zhang, J. (2024). An ultraviolet photodetector based on conductive hydrogenated TiO2 film prepared by radio frequency atmospheric pressure plasma. Journal of Physics D: Applied Physics, 57(38), 385201. https://doi.org/10.1088/1361-6463/AD584B

Downloads

Published

2026-03-11

How to Cite

Tedjasulaksana, J., Dinata, F. J., Krisnadi, R., Reksosamudro, M. S., Wen, W., & Hidayat, M. F. (2026). Machine Learning Approach: A Comparative Analysis of Classifiers in Predicting Obesity Type. Engineering, MAthematics and Computer Science Journal (EMACS), 8(1), 19–25. https://doi.org/10.21512/emacsjournal.v8i1.15268
Abstract 18  .
PDF downloaded 19  .