Semantic Segmentation for Aerial Images: A Literature Review

Authors

  • Yongki Christian Sanjaya Bina Nusantara University
  • Alexander Agung Santoso Gunawan Bina Nusantara University
  • Edy Irwansyah Bina Nusantara University

Keywords:

Semantic Image Segmentation, Computer Vision

Abstract

Semantic image segmentation is one of the fundamental applications of computer vision which can also be called pixel-level classification. Semantic image segmentation is the process of understanding the role of each pixel in an image. Over time, the model for completing Semantic Image Segmentation has developed very rapidly. Due to this rapid growth, many models related to Semantic Image Segmentation have been produced and have also been used or applied in many domains such as medical areas and intelligent transportation. Therefore, our motivation in making this paper is to contribute to the world of research by conducting a review of Semantic Image Segmentation which aims to provide a big picture related to the latest developments related to Semantic Image Segmentation. In addition, we also provide the results of performance measurements on each of the Semantic Image Segmentation methods that we discussed using the Intersectionover-Union (IoU) method. After that, we provide a comparison for each semantic image segmentation model that we discuss using the results of the IoU and then provide conclusions related to a model that has good performance. We hope this review paper can facilitate researchers in understanding the development of Semantic Image Segmentation in a shorter time, simplify understanding of the latest advancements in Semantic Image Segmentation, and can also be used as a reference for developing new Semantic Image Segmentation models in the future

Author Biographies

Yongki Christian Sanjaya, Bina Nusantara University

Computer Science Department, School of Computer Science

Alexander Agung Santoso Gunawan, Bina Nusantara University

Computer Science Department, School of Computer Science

Edy Irwansyah, Bina Nusantara University

Computer Science Department, School of Computer Science

References

J. Long, E. Shelhamer and T. Darrell, "Fully Convolutional Networks for Semantic Segmentation," 2015.

X. Liu, Z. Deng and Y. Yang, "Recent progress in semantic image segmentation," 2018.

N. Moon, E. Bullitt, K. v. Leemput and G. Gerig, "Automatic Brain and Tumor Segmentation," in In 2002 International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2002.

G.-Q. Wei, K. Arbter and G. Hirzinger, "Automatic tracking of laparoscopic instruments by color coding," in In 1997 International Conference on Medical Robotics and Computer-Assisted Surgery (MRCAS), 1997.

S. Maldonado-Bascon, S. Lafuente-Arroyo, P. Gil-Jimenez, H. Gomez-Moreno and F. Lopez-Ferreras, "Road-Sign Detection and Recognition Based on Support Vector Machines," IEEE Transactions on Intelligent Transportation Systems, pp. 264-278, 2007.

A. Cohen, E. Rivlin, I. Shimshoni and E. Sabo, "Memory based active contour algorithm using pixel-level classified images for colon crypt segmentation," Computerized Medical Imaging and Graphics, pp. 150-164, 2015.

C. Huang, L. S. Davis and J. R. G. Townshend, "An assessment of support vector machines for land cover classification," International Journal of Remote Sensing, pp. 725-749, 2002.

M. A. Rahman and Y. Wang, "Optimizing Intersection-Over-Union in Deep Neural Networks for Image Segmentation," in In 2016 International Symposium on Visual Computing (ISVC), 2016.

Y. Yuan, X. Chen and J. Wang, "Object-Contextual Representations for Semantic Segmentation," 2019.

J. Fu, J. Liu, H. Tian, Y. Li, Y. Bao, Z. Fang and H. Lu, "Dual Attention Network for Scene Segmentation," 2019.

P. Wang, P. Chen, Y. Yuan, D. Liu, Z. Huang, X. Hou and G. Cottrell, "Understanding Convolution for Semantic Segmentation," 2018.

H. Zhao, J. Shi, X. Qi, X. Wang and J. Jia, "Pyramid Scene Parsing Network," 2017.

J. Zhuang, J. Yang, L. Gu and N. C. Dvornek, "ShelfNet for Fast Semantic Segmentation," 2019.

Y. Zhu, K. Sapra, F. A. Reda, K. J. Shih, S. Newsam, A. Tao and B. Catanzaro, "Improving Semantic Segmentation via Video Propagation and Label Relaxation," 2019.

L.-C. Chen, Y. Zhu, G. Papandreou, F. Schroff and H. Adam, "Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation," 2018.

S. R. Bulò, L. Porzi and P. Kontschieder, "In-Place Activated BatchNorm for Memory-Optimized Training of DNNs," 2018.

C. Yu, J. Wang, C. Gao, G. Yu, C. Shen and N. Sang, "Context Prior for Scene Segmentation," 2020.

C. Yu, J. Wang, C. Peng, C. Gao, G. Yu and N. Sang, "BiSeNet: Bilateral Segmentation Network for Real-time Semantic Segmentation," in In 2018 The European Conference on Computer Vision (ECCV), 2018.

L.-C. Chen, G. Papandreou, I. Kokkinos, K. Murphy and A. L. Yuille, "DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs," 2017.

Z. Wu, C. Shen and A. v. d. Hengel, "Bridging Category-level and Instance-level Semantic Image Segmentation," 2016.

M. Cordts, M. Omran, S. Ramos, T. Rehfeld, M. Enzweiler, R. Benenson, U. Franke, S. Roth and B. Schiele, "The Cityscapes Dataset for Semantic Urban Scene Understanding," 2016.

B. Zhou, H. Zhao, X. Puig, S. Fidler, A. Barriuso and A. Torralba, "Scene Parsing through ADE20K Dataset," in 2017 Conference on Computer Vision and Pattern Recognition (CVPR), 2017.

K. Gong, X. Liang, D. Zhang, X. Shen and L. Lin, "Look into Person: Self-supervised Structure-sensitive Learning and A New Benchmark for Human Parsing," in 2017 Conference on Computer Vision and Pattern Recognition (CVPR), 2017.

H. Caesar, J. Uijlings and V. Ferrari, "COCO-Stuff: Thing and Stuff Classes in Context," 2018.

Z. Zhong, Z. Q. Lin, R. Bidart, X. Hu, I. B. Daya, Z. Li, W.-S. Zheng, J. Li and A. Wong, "Squeeze-and-Attention Networks for Semantic Segmentation," 2020.

S. Zheng, S. Jayasumana, B. Romera-Paredes, V. Vineet, Z. Su, D. Du, C. Huang and P. H. S. Torr, "Conditional Random Fields as Recurrent Neural Networks," 2016.

M. Yang, K. Yu, C. Zhang, Z. Li and K. Yang, "DenseASPP for Semantic Segmentation in Street Scenes," in 2018 Conference on Computer Vision and Pattern Recognition (CVPR), 2018.

Z. Wei, Y. Sun, J. Wang, H. Lai and S. Liu, "Learning Adaptive Receptive Fields for Deep Image Parsing Network," in 2017 Conference on Computer Vision and Pattern Recognition (CVPR), 2017.

W. Wang, R. Yu, Q. Huang and U. Neumann, "SGPN: Similarity Group Proposal Network for 3D Point Cloud Instance," 2019.

G. Wang, P. Luo, L. Lin and X. Wang, "Learning Object Interactions and Descriptions for Semantic Image Segmentation," in 2017 Conference on Computer Vision and Pattern Recognition (CVPR), 2017.

T. Takikawa, D. Acuna, V. Jampani and S. Fidler, "Gated-SCNN: Gated Shape CNNs for Semantic Segmentation," 2019.

T. Pohlen, A. Hermans, M. Mathias and B. Leibe, "Full-Resolution Residual Networks for Semantic Segmentation in Street Scenes," in 2017 Conference on Computer Vision and Pattern Recognition (CVPR), 2017.

S. Mohajerani and P. Saeedi, "Cloud-Net+: A Cloud Segmentation CNN for Landsat 8 Remote Sensing Imagery Optimized with Filtered Jaccard Loss Function," 2020.

P. Luo, G. Wang, L. Lin and X. Wang, "Deep Dual Learning for Semantic Image Segmentation," in 2017 International Conference on Computer Vision (ICCV), 2017.

P. Li, Y. Xu, Y. Wei and Y. Yang, "Self-Correction for Human Parsing," 2019.

S. Kong and C. Fowlkes, "Recurrent Scene Parsing with Perspective Understanding in the Loop," 2017.

D. Dai and L. V. Gool, "Dark Model Adaptation: Semantic Image Segmentation from Daytime to Nighttime," 2018.

S. Choi, J. T. Kim and J. Choo, "Cars Can’t Fly up in the Sky: Improving Urban-Scene Segmentation via Height-driven Attention Networks," 2020.

S. R. Bulò, G. Neuhold and P. Kontschieder, "Loss Max-Pooling for Semantic Image Segmentation," 2017.

S. M. Azimi, C. Henry, L. Sommer, A. Schumann and E. Vig, "SkyScapes – Fine-Grained Semantic Understanding of Aerial Scenes," in 2019 International Conference on Computer Vision (ICCV), 2019.

Z. Wu, C. Shen and A. v. d. Hengel, "Bridging Category-level and Instance-level Semantic Image Segmentation," 2016.

L.-C. Chen, Y. Zhu, G. Papandreou, F. Schroff and H. Adam, "Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation," 2018.

Downloads

Published

2020-10-01

Issue

Section

Articles