Machine Learning Approach: A Comparative Analysis of Classifiers in Predicting Obesity Type
DOI:
https://doi.org/10.21512/emacsjournal.v8i1.15268Keywords:
health, machine learning, Neural Network, hyperparameter tuning, AI applicationAbstract
Obesity is a growing global public health concern that increases the risk of chronic diseases and significantly affects quality of life. Traditional diagnostic methods such as Body Mass Index (BMI) have limitations in accurately representing body fat distribution and individual health conditions. This study aims to comparatively evaluate the performance of various machine learning and neural network models in predicting obesity levels using a multiclass classification approach. The dataset consists of 2,111 observations with 12 predictor variables and seven obesity categories, obtained from a publicly available source. Data preprocessing included duplicate removal, outlier handling using the interquartile range method, feature scaling, and categorical encoding, followed by a 60:20:20 train–validation–test split. Several classifiers were implemented, including Logistic Regression, Support Vector Classifier, Random Forest, Extra Trees, Gradient Boosting-based models (XGBoost and LightGBM), Multilayer Perceptron, K-Nearest Neighbors, and TabNet. Model performance was evaluated using macro-average F1-score and confusion matrix analysis. The results indicate that LightGBM achieved the highest predictive performance with an F1-score of 0.96, demonstrating strong generalization across obesity categories. XGBoost and Random Forest also showed strong performance, while Support Vector Classifier exhibited consistent results across training, validation, and cross-validation. These findings suggest that ensemble-based models are highly effective for obesity classification, while model selection should consider accuracy, interpretability, and computational constraints.
References
Aditya Mahindru, Pradeep Patil, Varun Agrawal. (2023). Role of Physical Activity on Mental Health and Well-Being: A Review, 1-7, DOI: 10.7759/cureus.33475
Airlangga, G. (2025). A comparative analysis of machine learning models for obesity prediction. Jurnal Informatika Ekonomi Bisnis, 7(1). https://doi.org/10.37034/infeb.v7i1.1089
Ali, A., Wei, Y., Tyson, J., Akerman, H., Jackson, A. I. R., Lane, R., Spencer, D., & White, N. M. (2024). Enhancing the response of a wearable sensor for improved respiratory rate (RR) monitoring. IEEE Access. https://doi.org/10.1109/ACCESS.2024.3509676
Alimbayeva, Z., Alimbayev, C., Ozhikenov, K., Bayanbay, N., & Ozhikenova, A. (2024). Wearable ECG Device and Machine Learning for Heart Monitoring. Sensors 2024, Vol. 24, Page 4201, 24(13), 4201. https://doi.org/10.3390/S24134201
Angela K. Fitch, Harold E. Bays. (2022). Obesity definition, diagnosis, bias, standard operating procedures (SOPs), and telehealth: An Obesity Medicine Association (OMA) Clinical Practice Statement (CPS) 2022, Obesity Pillars, Volume 1, 100004, ISSN 2667-3681, https://doi.org/10.1016/j.obpill.2021.100004
Arik, S. O., & Pfister, T. (2019). TabNet: Attentive interpretable tabular learning. arXiv. https://doi.org/10.48550/arXiv.1908.07442
Ba, N., Yue, W., Cao, C., Wu, W., & Cheng, P. (2024). Advances in Wearable Smart Chemical Sensors for Health Monitoring. Applied Sciences 2024, Vol. 14, Page 11199, 14(23), 11199. https://doi.org/10.3390/APP142311199
Bae, J. P., Nelson, D. R., Boye, K. S., & Mather, K. J. (2025). Prevalence of complications and comorbidities associated with obesity: A health insurance claims analysis. BMC Public Health, 25(1). https://doi.org/10.1186/s12889-024-21061-z
Chauhan, P., & Srivastava, S. (2025). Comparative analysis of deep learning and machine learning techniques for obesity classification. Atlantis Press.
Dirik, M. (2023). Application of machine learning techniques for obesity prediction: A comparative study. Journal of Complexity in Health Sciences, 6(2), 16–34. https://doi.org/10.21595/chs.2023.23193
Dumakude, A., & Ezugwu, A. E. (2023). Automated COVID-19 detection with convolutional neural networks. Scientific Reports, 13(1). https://doi.org/10.1038/s41598-023-37743-4
Erik Hemmingsson, Paulina Nowicka, Stanley Ulijaszek,Thorkild I. A. Sørensen. (2022). The social origins of obesity within and across generations, 1-11. https://doi.org/10.1111/obr.13514
Fruh, S. M. (2017). Obesity. Journal of the American Association of Nurse Practitioners, 29(S1), S3–S14. https://doi.org/10.1002/2327-6924.12510
Ge, Y., Wang, Q., Wang, L., Wu, H., Peng, C., Wang, J., Xu, Y., Xiong, G., Zhang, Y., & Yi, Y. (2019). Predicting post-stroke pneumonia using deep neural network approaches. International Journal of Medical Informatics, 132, 103986. https://doi.org/10.1016/j.ijmedinf.2019.103986
Habehh, H., & Gohel, S. (2021). Machine learning in healthcare. Current Genomics, 22(4), 291–300. https://doi.org/10.2174/1389202922666210705124359
Ikharo, B. A., & Aliu, D. (2023). Challenges Associated with Wearable Internet-of-Things (IoTs) Monitoring Systems for E-Health. FUOYE Journal of Engineering and Technology, 8(4). https://doi.org/10.46792/FUOYEJET.V8I4.1099
Jakachira, R., Jakachira, R., Yan, W., Burrow, J. A., Toussaint, K. C., & Toussaint, K. C. (2024). Dual-Wavelength, Polarization-Sensitive Wearable Photoplethysmographic Sensor on Diverse Skin Tones. Frontiers in Optics + Laser Science 2024 (FiO, LS) (2024), Paper JW4A.65, JW4A.65. https://doi.org/10.1364/FIO.2024.JW4A.65
Jeon, J., Lee, S., & Oh, C. (2022). Age-specific risk factors for the prediction of obesity using a machine learning approach. Research Square Platform LLC. https://doi.org/10.21203/rs.3.rs-1515734/v1
Joyce, D., De Brún, A., Symmons, S. M., Fox, R., & McAuliffe, E. (2023). Remote patient monitoring for COVID-19 patients: comparisons and framework for reporting. BMC Health Services Research, 23(1), 1–11. https://doi.org/10.1186/S12913-023-09526-0/TABLES/3
Kajzar, M. (2024). Wearable Devices for Training and Patient Monitoring: A Comprehensive Review. Quality in Sport, 29, 55667. https://doi.org/10.12775/QS.2024.29.55667
Kuhn, M., & Johnson, K. (2013). Applied predictive modeling. Springer New York. https://doi.org/10.1007/978-1-4614-6849-3
Kumar, G., & Alqahtani, H. (2022). Deep learning-based cancer detection-recent developments, trend and challenges. Computer Modeling in Engineering & Sciences, 130(3), 1271–1307. https://doi.org/10.32604/cmes.2022.018418
Li, Z., Li, H., & Gianchandani, Y. B. (2024). A Disposable Sensor for PM2.5 and PM10 Based on Wireless Magnetoelastic Resonators. Proceedings of IEEE Sensors. https://doi.org/10.1109/SENSORS60989.2024.10785131
Lin, W., Shi, S., Huang, H., Wen, J., & Chen, G. (2023). Predicting risk of obesity in overweight adults using interpretable machine learning algorithms. Frontiers in Endocrinology, 14. https://doi.org/10.3389/fendo.2023.1292167
Linh, V. T. N., Han, S., Koh, E., Kim, S., Jung, H. S., & Koo, J. (2025). Advances in wearable electronics for monitoring human organs: Bridging external and internal health assessments. Biomaterials, 314, 122865. https://doi.org/10.1016/J.BIOMATERIALS.2024.122865
Mahmood Safaei, Elankovan A. Sundararajan, Maha Driss, Wadii Boulila, Azrulhizam Shapi'i. (2021). A systematic literature review on obesity: Understanding the causes & consequences of obesity and reviewing various machine learning approaches used to predict obesity, Computers in Biology and Medicine, Volume 136, 104754, ISSN 0010-4825, https://doi.org/10.1016/j.compbiomed.2021.104754
Mousavi, S., Bieber, K., Zirpel, H., Vorobyev, A., Olbrich, H., Papara, C., De Luca, D. A., Thaci, D., Schmidt, E., Riemekasten, G., Lamprecht, P., Laudes, M., Kridin, K., & Ludwig, R. J. (2025). Large-scale analysis highlights obesity as a risk factor for chronic, non-communicable inflammatory diseases. Frontiers in Endocrinology, 16. https://doi.org/10.3389/fendo.2025.1516433
Nuttall, F. Q. (2015). Body mass index. Nutrition Today, 50(3), 117–128. https://doi.org/10.1097/nt.0000000000000092
Poirier, P., Eckel, R. H., Lavie, C. J., Van Gaal, L. F., Ross, R., Després, J. P., & Sharma, A. M. (2006). Obesity and cardiovascular disease: A scientific statement from the American Heart Association. Circulation, 113(6), 898–918 https://doi.org/10.1161/CIRCULATIONAHA.106.171016.
Poliban. (2025). Radial basis function model for obesity classification (PDF). Eltikom Poliban.
Safaei, M., Sundararajan, E. A., Driss, M., Boulila, W., & Shapi’i, A. (2021). A systematic literature review on obesity: Understanding the causes & consequences of obesity and reviewing various machine learning approaches used to predict obesity. Computers in Biology and Medicine, 136, 104754. https://doi.org/10.1016/j.compbiomed.2021.104754
Shivani Aggarwal, Kavita Pandey. (2023). Early identification of PCOS with commonly known diseases: Obesity, diabetes, high blood pressure and heart disease using machine learning techniques, Expert Systems with Applications, Volume 217, 119532, ISSN 0957-4174, https://doi.org/10.1016/j.eswa.2023.119532
Wu, Y., Li, D., & Vermund, S. H. (2024). Advantages and limitations of the body mass index (BMI) to assess adult obesity. International Journal of Environmental Research and Public Health, 21(6), 757. https://doi.org/10.3390/ijerph21060757
Xiaoyi Shi, Yuxin Zheng, Haiwen Cui, Yuxi Zhang, Menghui Jiang. (2022). Exposure to outdoor and indoor air pollution and risk of overweight and obesity across different life periods: A review, Ecotoxicology and Environmental Safety, Volume 242, 113893, ISSN 0147-6513, https://doi.org/10.1016/j.ecoenv.2022.113893
Zeedhan, M., Mohamed Ziham, M. M., Abdul Razick, M. S., & Ul Amin, N. (2025). Predicting obesity classification using k-nearest neighbors: A data science approach in Python. Preprints.
Zhang, Y., Wang, H., Cui, J., He, T., Qiu, G., Xu, Y., & Zhang, J. (2024). An ultraviolet photodetector based on conductive hydrogenated TiO2 film prepared by radio frequency atmospheric pressure plasma. Journal of Physics D: Applied Physics, 57(38), 385201. https://doi.org/10.1088/1361-6463/AD584B
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Jeffrey Tedjasulaksana, Ferry Jaya Dinata, Rafael Krisnadi, Matthew S.W. Reksosamudro, Wilbert Wen, Muhammad Fadlan Hidayat

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License - Share Alike that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.
USER RIGHTS
All articles published Open Access will be immediately and permanently free for everyone to read and download. We are continuously working with our author communities to select the best choice of license options, currently being defined for this journal as follows: Creative Commons Attribution-Share Alike (CC BY-SA)