Predictive Modeling of Jakarta's Social Cohesion: GBDT Leads Comparative Analysis

Authors

  • Muhammad Rizki Nur Majiid Bina Nusantara University
  • Karli Eka Setiawan Bina Nusantara University

DOI:

https://doi.org/10.21512/emacsjournal.v5i3.10602

Keywords:

Social Cohesion, Machine Learning, Urban Assessment, GBDT, Predictive Accuracy

Abstract

In this study, we address the challenge of predicting the Social Cohesion Index in Jakarta through a comprehensive analysis of machine learning models. Finding the most accurate and effective predictive model for this crucial urban evaluation task is the primary goal of our research. We use a variety of machine learning algorithms, comparing their performance using metrics like Mean Absolute Error (MAE), Root Mean Square Error (RMSE), Mean Absolute Percentage Error (MAPE), and computational cost. These algorithms include Gradient Boosted Decision Trees (GBDT), Polynomial Regression, Random Forest, Support Vector Machine (SVM), and Multi-Layer Perceptron (MLP). It should be noted that GBDT stands out as a top performer, regularly displaying outstanding accuracy with a competitive MAE of 0.692, RMSE of 0.887, and MAPE of 25.59%. The computational efficiency of GBDT is also impressive, with predictions taking only 0.05 seconds. These results underscore the potential of GBDT as a practical and precise tool for real-time assessments of social cohesion in large urban environments like Jakarta. The findings offer a data-driven way to guide policy decisions and community development activities, with important implications for urban planning and governance. Overall, this research emphasizes the promise of GBDT in boosting social cohesion evaluation approaches and increases our understanding of the application of machine learning in addressing complex urban difficulties.

 

Dimensions

Plum Analytics

Author Biographies

Muhammad Rizki Nur Majiid, Bina Nusantara University

Computer Science of Department, School of Computer Science

Karli Eka Setiawan, Bina Nusantara University

Computer Science of Department, School of Computer Science

 

References

Amir, S., Hidayana, I., Rahvenia, Z., & Haydar, S. (2023). Dataset on factors associated with social cohesion of urban life in Jakarta. Data in Brief, 49, 109339. https://doi.org/10.1016/j.dib.2023.109339

Andariesta, D. T., & Wasesa, M. (2022). Machine learning models for predicting international tourist arrivals in Indonesia during the COVID-19 pandemic: a multisource Internet data approach. Journal of Tourism Futures, 1–17. https://doi.org/10.1108/JTF-10-2021-0239

Jewett, R. L., Mah, S. M., Howell, N., & Larsen, M. M. (2021). Social Cohesion and Community Resilience During COVID-19 and Pandemics: A Rapid Scoping Review to Inform the United Nations Research Roadmap for COVID-19 Recovery. International Journal of Health Services, 51(3), 325–336. https://doi.org/10.1177/0020731421997092

Narayan, V., & Daniel, A. K. (2022). Energy Efficient Protocol for Lifetime Prediction of Wireless Sensor Network using Multivariate Polynomial Regression Model. Journal of Scientific and Industrial Research, 81(12), 1297–1309. https://doi.org/10.56042/jsir.v81i12.54908

Ouma, Y. O., Okuku, C. O., & Njau, E. N. (2020). Use of Artificial Neural Networks and Multiple Linear Regression Model for the Prediction of Dissolved Oxygen in Rivers: Case Study of Hydrographic Basin of River Nyando, Kenya. Complexity, 2020. https://doi.org/10.1155/2020/9570789

Parbat, D., & Chakraborty, M. (2020). A python based support vector regression model for prediction of COVID19 cases in India. Chaos, Solitons and Fractals, 138, 109942. https://doi.org/10.1016/j.chaos.2020.109942

Pekel, E. (2020). Estimation of soil moisture using decision tree regression. Theoretical and Applied Climatology, 139(3–4). https://doi.org/10.1007/s00704-019-03048-8

Rybak, A. (2023). Survey mode and nonresponse bias: A meta-Analysis based on the data from the international social survey programme waves 1996 2018 and the European social survey rounds 1 to 9. PLoS ONE, 18(3 March). https://doi.org/10.1371/journal.pone.0283092

Sarker, I. H. (2021). Machine Learning: Algorithms, Real-World Applications and Research Directions. SN Computer Science, 2(3), 1–21. https://doi.org/10.1007/s42979-021-00592-x

Steiner, A., Woolvin, M., & Skerratt, S. (2018). Measuring community resilience: Developing and applying a “hybrid evaluation” approach. Community Development Journal, 53(1). https://doi.org/10.1093/cdj/bsw017

Tzenios, N. (2020). Examining the Impact of EdTech Integration on Academic Performance Using Random Forest Regression. ResearchBerg Review of Science and Technology, 3(1), 94–106. https://researchberg.com/index.php/rrst/article/view/84

Viljanen, M., Meijerink, L., Zwakhals, L., & van de Kassteele, J. (2022). A machine learning approach to small area estimation: predicting the health, housing and well-being of the population of Netherlands. International Journal of Health Geographics, 21(1), 1–18. https://doi.org/10.1186/s12942-022-00304-5

Zhang, Z., & Jung, C. (2021). GBDT-MO: Gradient-Boosted Decision Trees for Multiple Outputs. IEEE Transactions on Neural Networks and Learning Systems, 32(7), 3156–3167. https://doi.org/10.1109/TNNLS.2020.3009776

Downloads

Published

2023-09-30

Issue

Section

Articles
Abstract 220  .
PDF downloaded 166  .