Inception-ResNet-V2 The U-Net Encoder for Road Segmentation using Sentinel 2A

Bayu Yanuargi; ema utami

doi:10.21512/comtech.v16i2.12089

Authors

Bayu Yanuargi Universitas Amikom Yogyakarta
ema utami Universitas Amikom Yogyakarta

DOI:

https://doi.org/10.21512/comtech.v16i2.12089

Keywords:

Convolutional Neural Network, U-Net, Inception ResNet V2, Encoder, Deep Learning, Computer Vision, Segmentation

Abstract

Updating road network maps is essential for transportation services, as incomplete or inaccurate maps can lead to inefficiencies and diminish service quality. The online transportation industry generates vast amounts of GPS data as drivers navigate, which is valuable for mapping road networks and improving traffic management. However, since drivers do not cover all roads, satellite imagery plays a crucial role in identifying areas that are not mapped. By combining GPS data as labels with satellite imagery, the extraction of new road networks becomes more accurate. This research employs a deep learning Convolutional Neural Network (CNN) with the U-Net architecture for road segmentation, allowing for the identification of new paths. Two different encoders are tested in this research: Inception-ResNet-V2 and a pure U-Net encoder. The Inception-ResNet-V2 encoder achieves an accuracy of 91.3%, while the pure U-Net encoder achieves 90.7%. In terms of Dice Loss, the models record values of 0.051 and 0.08, respectively. The research highlights the effectiveness of different U-Net encoders in road network segmentation. With high accuracy and low Dice Loss, this approach provides a reliable method for automatically updating road maps. It has potential applications in navigation systems, urban planning, and AI-driven intelligent transportation systems.

Dimensions

References

Baek, W. K., Lee, M.-J., & Jung, H.-S. (2024). Land cover classification from RGB and NIR satellite images using modified U-Net Model. IEEE Access, 12, 69445–69455. https://doi.org/10.1109/ACCESS.2024.3401416

Chen, B., Ding, C., Ren, W., & Xu, G. (2021). Automatically tracking road centerlines from low-frequency GPS trajectory data. ISPRS International Journal of Geo-Information, 10(3). https://doi.org/10.3390/ijgi10030122

Joubert, N., Reid, T. G. R., & Noble, F. (2020). Developments in modern GNSS and its impact on autonomous vehicle architectures. 2020 IEEE Intelligent Vehicles Symposium (IV), 2029–2036. https://doi.org/10.1109/IV47402.2020.9304840

Kamalov, F. (2020). Kernel density estimation based sampling for imbalanced class distribution. Information Sciences, 512, 1192–1201. https://doi.org/https://doi.org/10.1016/j.ins.2019.10.017

Liu, Y., Jia, R., Ye, J., & Qu, X. (2022). How machine learning informs ride-hailing services: A survey. Communications in Transportation Research, 2, 100075. https://doi.org/https://doi.org/10.1016/j.commtr.2022.100075

Luthfil, P., Program Doktor, H., Wasanta, T., & Santosa, W. (2021). Pengaruh indeks infrastruktur jalan terhadap indikator ekonomi di Indonesia. Jurnal HPJI (Himpunan Pengembangan Jalan Indonesia), 7(2), 143-152.

Pinuji, S., Savitri, A. I., Noormasari, M., Wijaya, D. K., & Kurniawan, A. (2019). Efektivitas data spasial peta Rupa Bumi Indonesia (RBI) dan Openstreetmap dalam pengambilan keputusan menggunakan Inasafe. Jurnal Dialog Penanggulangan Bencana, 10(1), 22-29. https://api.semanticscholar.org/CorpusID:238144783

Pommé, L.-E., Bourqui, R., Giot, R., & Auber, D. (2022). Relative Confusion Matrix: Efficient comparison of decision models. 2022 26th International Conference Information Visualisation (IV), 98–103. https://doi.org/10.1109/IV56949.2022.00025

Ramba, L. S. (2020). Design of a voice controlled home automation system using Deep Learning Convolutional Neural Network (DL-CNN). Telekontran : Jurnal Ilmiah Telekomunikasi, Kendali Dan Elektronika Terapan, 8(1), 57–73. https://doi.org/10.34010/telekontran.v8i1.3078

Sun, T., Di, Z., Che, P., Liu, C., & Wang, Y. (2019). Leveraging crowdsourced GPS data for road extraction from aerial imagery. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 7501–7510. https://doi.org/10.1109/CVPR.2019.00769

Tuli, T. B., Kohl, L., Chala, S. A., Manns, M., & Ansari, F. (2021). Knowledge-based digital twin for predicting interactions in human-robot collaboration. 2021 26th IEEE International Conference on Emerging Technologies and Factory Automation (ETFA ), 1–8. https://doi.org/10.1109/ETFA45728.2021.9613342

Wang, J., Liu, Y., & Chang, J. (2022). An improved model for Kernel Density estimation based on Quadtree and Quasi-Interpolation. Mathematics, 10(14). https://doi.org/10.3390/math10142402

Wang, R., Cai, M., Xia, Z., & Zhou, Z. (2023). Remote sensing image road segmentation method integrating CNN-Transformer and UNet. IEEE Access, 11, 144446–144455. https://doi.org/10.1109/ACCESS.2023.3344797

Wei, Y., Zhang, K., & Ji, S. (2019). Road network extraction from satellite images using CNN based segmentation and tracing. IGARSS 2019 - 2019 IEEE International Geoscience and Remote Sensing Symposium, 3923–3926. https://doi.org/10.1109/IGARSS.2019.8898565

Xu, Z., Yin, Y., Dai, C., Huang, X., Kudali, R., Foflia, J., Wang, G., & Zimmermann, R. (2020). Grab-Posisi-L: A labelled GPS trajectory dataset for map matching in Southeast Asia. Proceedings of the 28th International Conference on Advances in Geographic Information Systems, 171–174. https://doi.org/10.1145/3397536.3422218

Zhang, J., Hu, Q., Li, J., & Ai, M. (2021). Learning from GPS trajectories of floating car for CNN-based urban road extraction with high-resolution satellite imagery. IEEE Transactions on Geoscience and Remote Sensing, 59(3), 1836–1847. https://doi.org/10.1109/TGRS.2020.3003425

Zhang, P., He, H., Wang, Y., Liu, Y., Lin, H., Guo, L., & Yang, W. (2022). 3D urban buildings extraction based on airborne LiDAR and Photogrammetric Point Cloud Fusion according to U-Net deep learning model segmentation. IEEE Access, 10, 20889–20897. https://doi.org/10.1109/ACCESS.2022.3152744

Zhao, R., Qian, B., Zhang, X., Li, Y., Wei, R., Liu, Y., & Pan, Y. (2020). Rethinking Dice Loss for Medical Image Segmentation. 2020 IEEE International Conference on Data Mining (ICDM), 851–860. https://doi.org/10.1109/ICDM50108.2020.00094