Observing Pre-Trained Convolutional Neural Network (CNN) Layers as Feature Extractor for Detecting Bias in Image Classification Data

Amadea Claire Isabel Ardison; Mikhaya Josheba Rumondang Hutagalung; Reynaldi Chernando; Tjeng Wawan Cenggoro

doi:10.21512/commit.v16i2.8144

Authors

Amadea Claire Isabel Ardison, Bina Nusantara University
Mikhaya Josheba Rumondang Hutagalung, Bina Nusantara University
Reynaldi Chernando, Bina Nusantara University
Tjeng Wawan Cenggoro, Bina Nusantara University

Keywords:

Pre-Trained Convolutional Neural Network (CNN), Features Extractor, Data Bias, Image Classification

Abstract

Detecting bias in data is crucial because biased data can cause serious problems when developing an AI algorithm. The research proposes a novel study design for detecting bias in image classification data by using pre-trained Convolutional Neural Network (CNN) layers as a feature extractor. The research uses three datasets with varying degrees of complexity (low, medium, and high): Modified National Institute of Standards and Technology (MNIST) Digits, batik collections (Parang, Megamendung, and Kawung), and Canadian Institute for Advanced Research (CIFAR-10). The researchers first build a baseline workflow and then substitute a step in its feature extraction with a convolution that uses the first pre-trained CNN layer and each of its kernels. The effect of each substitution is evaluated using accuracy. By observing the effect of individual kernels, the research can better make sense of what happens inside a CNN layer. The research finds that image color is an essential factor when working with CNNs. Furthermore, the proposed study design can detect bias in image classification data when that bias is related to the color of the image. Detecting this bias early helps developers improve AI algorithms.
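
The per-kernel idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the two hand-made kernels stand in for pre-trained first-layer weights, the nearest-centroid classifier stands in for whatever classifier the baseline workflow uses, and the toy dataset (class 0 is red, class 1 is blue) is a hypothetical example of color-related bias. All function names are assumptions made for illustration.

```python
import random

def conv2d_single_kernel(image, kernel):
    """Valid cross-correlation of a C x H x W image with one C x kh x kw kernel."""
    height, width = len(image[0]), len(image[0][0])
    kh, kw = len(kernel[0]), len(kernel[0][0])
    fmap = []
    for i in range(height - kh + 1):
        row = []
        for j in range(width - kw + 1):
            total = 0.0
            for c in range(len(kernel)):
                for di in range(kh):
                    for dj in range(kw):
                        total += image[c][i + di][j + dj] * kernel[c][di][dj]
            row.append(total)
        fmap.append(row)
    return fmap

def extract_features(image, kernel):
    """Flatten the feature map produced by a single kernel into a vector."""
    return [v for row in conv2d_single_kernel(image, kernel) for v in row]

def nearest_centroid_accuracy(train, test):
    """Accuracy of a nearest-centroid classifier (assumed stand-in classifier)."""
    sums, counts = {}, {}
    for x, y in train:
        if y not in sums:
            sums[y], counts[y] = [0.0] * len(x), 0
        sums[y] = [a + b for a, b in zip(sums[y], x)]
        counts[y] += 1
    centroids = {y: [v / counts[y] for v in s] for y, s in sums.items()}

    def predict(x):
        return min(centroids,
                   key=lambda y: sum((a - b) ** 2 for a, b in zip(x, centroids[y])))

    return sum(predict(x) == y for x, y in test) / len(test)

def per_kernel_accuracy(kernels, train_set, test_set):
    """Use each kernel alone as the feature extractor and record the accuracy."""
    results = []
    for kernel in kernels:
        train = [(extract_features(img, kernel), y) for img, y in train_set]
        test = [(extract_features(img, kernel), y) for img, y in test_set]
        results.append(nearest_centroid_accuracy(train, test))
    return results

def make_toy_dataset(n_per_class, rng, size=6):
    """Toy RGB data with a deliberate color bias: class 0 is red, class 1 is blue."""
    def noise():
        return [[rng.uniform(0.0, 0.2) for _ in range(size)] for _ in range(size)]

    def bright():
        return [[0.8 + rng.uniform(0.0, 0.2) for _ in range(size)] for _ in range(size)]

    data = []
    for _ in range(n_per_class):
        data.append(([bright(), noise(), noise()], 0))  # channels: R, G, B
        data.append(([noise(), noise(), bright()], 1))
    return data

def channel_kernel(channel, k=3):
    """Hypothetical kernel that sums a k x k window of one color channel only."""
    return [[[1.0 if c == channel else 0.0] * k for _ in range(k)] for c in range(3)]

rng = random.Random(0)
train_set = make_toy_dataset(20, rng)
test_set = make_toy_dataset(10, rng)
kernels = [channel_kernel(0), channel_kernel(1)]  # red-only, green-only
accuracies = per_kernel_accuracy(kernels, train_set, test_set)
for name, acc in zip(["red-only kernel", "green-only kernel"], accuracies):
    print(f"{name}: accuracy = {acc:.2f}")
```

In this toy setting, a kernel that responds to the red channel separates the classes on its own, while a kernel that responds only to the uninformative green channel carries no class signal. A large accuracy gap between kernels of this kind is the sort of evidence that would suggest the labels are correlated with color.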

Author Biographies

Amadea Claire Isabel Ardison, Bina Nusantara University

Computer Science Department, School of Computer Science

Mikhaya Josheba Rumondang Hutagalung, Bina Nusantara University

Computer Science Department, School of Computer Science

Reynaldi Chernando, Bina Nusantara University

Computer Science Department, School of Computer Science

Tjeng Wawan Cenggoro, Bina Nusantara University

Computer Science Department, School of Computer Science

Bioinformatics and Data Science Research Center
