Semantic Segmentation for Aerial Images: A Literature Review
DOI:
https://doi.org/10.21512/emacsjournal.v2i3.6737Keywords:
Semantic Image Segmentation, Computer VisionAbstract
Semantic image segmentation is one of the fundamental applications of computer vision which can also be called pixel-level classification. Semantic image segmentation is the process of understanding the role of each pixel in an image. Over time, the model for completing Semantic Image Segmentation has developed very rapidly. Due to this rapid growth, many models related to Semantic Image Segmentation have been produced and have also been used or applied in many domains such as medical areas and intelligent transportation. Therefore, our motivation in making this paper is to contribute to the world of research by conducting a review of Semantic Image Segmentation which aims to provide a big picture related to the latest developments related to Semantic Image Segmentation. In addition, we also provide the results of performance measurements on each of the Semantic Image Segmentation methods that we discussed using the Intersectionover-Union (IoU) method. After that, we provide a comparison for each semantic image segmentation model that we discuss using the results of the IoU and then provide conclusions related to a model that has good performance. We hope this review paper can facilitate researchers in understanding the development of Semantic Image Segmentation in a shorter time, simplify understanding of the latest advancements in Semantic Image Segmentation, and can also be used as a reference for developing new Semantic Image Segmentation models in the futurePlum Analytics
References
J. Long, E. Shelhamer and T. Darrell, "Fully Convolutional Networks for Semantic Segmentation," 2015.
X. Liu, Z. Deng and Y. Yang, "Recent progress in semantic image segmentation," 2018.
N. Moon, E. Bullitt, K. v. Leemput and G. Gerig, "Automatic Brain and Tumor Segmentation," in In 2002 International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), 2002.
G.-Q. Wei, K. Arbter and G. Hirzinger, "Automatic tracking of laparoscopic instruments by color coding," in In 1997 International Conference on Medical Robotics and Computer-Assisted Surgery (MRCAS), 1997.
S. Maldonado-Bascon, S. Lafuente-Arroyo, P. Gil-Jimenez, H. Gomez-Moreno and F. Lopez-Ferreras, "Road-Sign Detection and Recognition Based on Support Vector Machines," IEEE Transactions on Intelligent Transportation Systems, pp. 264-278, 2007.
A. Cohen, E. Rivlin, I. Shimshoni and E. Sabo, "Memory based active contour algorithm using pixel-level classified images for colon crypt segmentation," Computerized Medical Imaging and Graphics, pp. 150-164, 2015.
C. Huang, L. S. Davis and J. R. G. Townshend, "An assessment of support vector machines for land cover classification," International Journal of Remote Sensing, pp. 725-749, 2002.
M. A. Rahman and Y. Wang, "Optimizing Intersection-Over-Union in Deep Neural Networks for Image Segmentation," in In 2016 International Symposium on Visual Computing (ISVC), 2016.
Y. Yuan, X. Chen and J. Wang, "Object-Contextual Representations for Semantic Segmentation," 2019.
J. Fu, J. Liu, H. Tian, Y. Li, Y. Bao, Z. Fang and H. Lu, "Dual Attention Network for Scene Segmentation," 2019.
P. Wang, P. Chen, Y. Yuan, D. Liu, Z. Huang, X. Hou and G. Cottrell, "Understanding Convolution for Semantic Segmentation," 2018.
H. Zhao, J. Shi, X. Qi, X. Wang and J. Jia, "Pyramid Scene Parsing Network," 2017.
J. Zhuang, J. Yang, L. Gu and N. C. Dvornek, "ShelfNet for Fast Semantic Segmentation," 2019.
Y. Zhu, K. Sapra, F. A. Reda, K. J. Shih, S. Newsam, A. Tao and B. Catanzaro, "Improving Semantic Segmentation via Video Propagation and Label Relaxation," 2019.
L.-C. Chen, Y. Zhu, G. Papandreou, F. Schroff and H. Adam, "Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation," 2018.
S. R. Bulò, L. Porzi and P. Kontschieder, "In-Place Activated BatchNorm for Memory-Optimized Training of DNNs," 2018.
C. Yu, J. Wang, C. Gao, G. Yu, C. Shen and N. Sang, "Context Prior for Scene Segmentation," 2020.
C. Yu, J. Wang, C. Peng, C. Gao, G. Yu and N. Sang, "BiSeNet: Bilateral Segmentation Network for Real-time Semantic Segmentation," in In 2018 The European Conference on Computer Vision (ECCV), 2018.
L.-C. Chen, G. Papandreou, I. Kokkinos, K. Murphy and A. L. Yuille, "DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs," 2017.
Z. Wu, C. Shen and A. v. d. Hengel, "Bridging Category-level and Instance-level Semantic Image Segmentation," 2016.
M. Cordts, M. Omran, S. Ramos, T. Rehfeld, M. Enzweiler, R. Benenson, U. Franke, S. Roth and B. Schiele, "The Cityscapes Dataset for Semantic Urban Scene Understanding," 2016.
B. Zhou, H. Zhao, X. Puig, S. Fidler, A. Barriuso and A. Torralba, "Scene Parsing through ADE20K Dataset," in 2017 Conference on Computer Vision and Pattern Recognition (CVPR), 2017.
K. Gong, X. Liang, D. Zhang, X. Shen and L. Lin, "Look into Person: Self-supervised Structure-sensitive Learning and A New Benchmark for Human Parsing," in 2017 Conference on Computer Vision and Pattern Recognition (CVPR), 2017.
H. Caesar, J. Uijlings and V. Ferrari, "COCO-Stuff: Thing and Stuff Classes in Context," 2018.
Z. Zhong, Z. Q. Lin, R. Bidart, X. Hu, I. B. Daya, Z. Li, W.-S. Zheng, J. Li and A. Wong, "Squeeze-and-Attention Networks for Semantic Segmentation," 2020.
S. Zheng, S. Jayasumana, B. Romera-Paredes, V. Vineet, Z. Su, D. Du, C. Huang and P. H. S. Torr, "Conditional Random Fields as Recurrent Neural Networks," 2016.
M. Yang, K. Yu, C. Zhang, Z. Li and K. Yang, "DenseASPP for Semantic Segmentation in Street Scenes," in 2018 Conference on Computer Vision and Pattern Recognition (CVPR), 2018.
Z. Wei, Y. Sun, J. Wang, H. Lai and S. Liu, "Learning Adaptive Receptive Fields for Deep Image Parsing Network," in 2017 Conference on Computer Vision and Pattern Recognition (CVPR), 2017.
W. Wang, R. Yu, Q. Huang and U. Neumann, "SGPN: Similarity Group Proposal Network for 3D Point Cloud Instance," 2019.
G. Wang, P. Luo, L. Lin and X. Wang, "Learning Object Interactions and Descriptions for Semantic Image Segmentation," in 2017 Conference on Computer Vision and Pattern Recognition (CVPR), 2017.
T. Takikawa, D. Acuna, V. Jampani and S. Fidler, "Gated-SCNN: Gated Shape CNNs for Semantic Segmentation," 2019.
T. Pohlen, A. Hermans, M. Mathias and B. Leibe, "Full-Resolution Residual Networks for Semantic Segmentation in Street Scenes," in 2017 Conference on Computer Vision and Pattern Recognition (CVPR), 2017.
S. Mohajerani and P. Saeedi, "Cloud-Net+: A Cloud Segmentation CNN for Landsat 8 Remote Sensing Imagery Optimized with Filtered Jaccard Loss Function," 2020.
P. Luo, G. Wang, L. Lin and X. Wang, "Deep Dual Learning for Semantic Image Segmentation," in 2017 International Conference on Computer Vision (ICCV), 2017.
P. Li, Y. Xu, Y. Wei and Y. Yang, "Self-Correction for Human Parsing," 2019.
S. Kong and C. Fowlkes, "Recurrent Scene Parsing with Perspective Understanding in the Loop," 2017.
D. Dai and L. V. Gool, "Dark Model Adaptation: Semantic Image Segmentation from Daytime to Nighttime," 2018.
S. Choi, J. T. Kim and J. Choo, "Cars Can’t Fly up in the Sky: Improving Urban-Scene Segmentation via Height-driven Attention Networks," 2020.
S. R. Bulò, G. Neuhold and P. Kontschieder, "Loss Max-Pooling for Semantic Image Segmentation," 2017.
S. M. Azimi, C. Henry, L. Sommer, A. Schumann and E. Vig, "SkyScapes – Fine-Grained Semantic Understanding of Aerial Scenes," in 2019 International Conference on Computer Vision (ICCV), 2019.
Z. Wu, C. Shen and A. v. d. Hengel, "Bridging Category-level and Instance-level Semantic Image Segmentation," 2016.
L.-C. Chen, Y. Zhu, G. Papandreou, F. Schroff and H. Adam, "Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation," 2018.
Downloads
Published
Issue
Section
License
Authors who publish with this journal agree to the following terms:
a. Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License - Share Alike that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
b. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
c. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.
USER RIGHTS
All articles published Open Access will be immediately and permanently free for everyone to read and download. We are continuously working with our author communities to select the best choice of license options, currently being defined for this journal as follows: Creative Commons Attribution-Share Alike (CC BY-SA)