Fish Classification System Using YOLOv3-ResNet18 Model for Mobile Phones

Authors

  • Suryadiputra Liawatimena, Bina Nusantara University
  • Edi Abdurachman, Bina Nusantara University
  • Agung Trisetyarso, Bina Nusantara University
  • Antoni Wibowo, Bina Nusantara University
  • Muhamad Keenan Ario, Bina Nusantara University
  • Ivan Sebastian Edbert, Bina Nusantara University

DOI:

https://doi.org/10.21512/commit.v17i1.8107

Keywords:

Fish Classification System, YOLOv3-ResNet18 Model, Mobile Phone

Abstract

Every country in the world must report its annual fish production to the Food and Agriculture Organization of the United Nations (FAO). In 2018, Indonesia was among the top five fish-producing countries globally, with 8 million tons. Despite this ranking, Indonesian fisheries are dominated by traditional and small-scale industries. Hence, a computer-vision-based solution is needed to help detect and classify the fish caught every year. The research presents a method to detect and classify fish on mobile devices using the YOLOv3 model combined with ResNet18 as its backbone. The experimental dataset consists of 4,000 images of four fish species, gathered by scraping the Internet and by photographing fish at local markets and harbors. Two models are used for comparison: SSD-VGG and an auto-generated Huawei ExeML model. The results show that the YOLOv3-ResNet18 model achieves 98.45% accuracy in training and 98.15% in evaluation. When tested on mobile devices, the model runs at an inference time of 2,115 ms on a Huawei P40 and 3,571 ms on a Realme 7. It can be concluded that the research presents a smaller model that is suitable for mobile devices while maintaining good accuracy and precision.
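The paper's key architectural move is swapping YOLOv3's Darknet-53 backbone for the much smaller ResNet18 to shrink the model for mobile deployment. As a rough, non-authoritative sketch of that pairing (not the authors' code), the PyTorch snippet below attaches a single-scale YOLOv3-style prediction head to torchvision's ResNet18; the four-class output follows the abstract's dataset, while the anchor count, 416x416 input, and single detection scale are illustrative assumptions.

    # Sketch only: a single-scale YOLOv3-style head on a ResNet18 backbone.
    # Assumptions (not from the paper): 3 anchors, 416x416 input, PyTorch.
    import torch
    import torch.nn as nn
    from torchvision.models import resnet18

    NUM_CLASSES = 4   # four fish species, per the abstract
    NUM_ANCHORS = 3   # YOLOv3 default: 3 anchors per detection scale

    class YOLOv3ResNet18(nn.Module):
        def __init__(self):
            super().__init__()
            backbone = resnet18(weights=None)
            # Keep the convolutional stages; drop avgpool and the fc classifier.
            self.features = nn.Sequential(*list(backbone.children())[:-2])
            # Per anchor: 4 box offsets + 1 objectness score + class scores.
            self.head = nn.Conv2d(512, NUM_ANCHORS * (5 + NUM_CLASSES), 1)

        def forward(self, x):
            # 416x416 input -> 13x13 grid after ResNet18's 32x downsampling.
            return self.head(self.features(x))

    model = YOLOv3ResNet18()
    out = model(torch.randn(1, 3, 416, 416))
    print(out.shape)  # torch.Size([1, 27, 13, 13])

A full YOLOv3 adds two finer detection scales and the YOLO loss on top of this; the single 13x13 grid here only illustrates how a classification backbone is reused as a detector's feature extractor.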


Author Biographies

Suryadiputra Liawatimena, Bina Nusantara University

Computer Science Department, BINUS Graduate Program – Doctor of Computer Science

Computer Science Department, BINUS Graduate Program – Master of Computer Science

Computer Engineering Department, Faculty of Engineering

Edi Abdurachman, Bina Nusantara University

Computer Science Department, BINUS Graduate Program – Doctor of Computer Science

Agung Trisetyarso, Bina Nusantara University

Computer Science Department, BINUS Graduate Program – Doctor of Computer Science

Antoni Wibowo, Bina Nusantara University

Computer Science Department, BINUS Graduate Program – Doctor of Computer Science

Muhamad Keenan Ario, Bina Nusantara University

Computer Science Department, BINUS Graduate Program – Master of Computer Science

Ivan Sebastian Edbert, Bina Nusantara University

Computer Science Department, BINUS Graduate Program – Master of Computer Science

References

Food and Agriculture Organization of the United Nations, The state of world fisheries and aquaculture 2020: Sustainability in action. Food and Agriculture Organization of the United Nations, 2020.

K. Kusdiantoro, A. Fahrudin, S. H. Wisudo, and B. Juanda, “Kinerja pembangunan perikanan tangkap di Indonesia [The performance of capture fisheries development in Indonesia],” Buletin Ilmiah Marina Sosial Ekonomi Kelautan dan Perikanan, vol. 5, no. 2, pp. 69–84, 2019.

J. Lee and W. Hwang, “Cloud-based facial expression recognition system for customer satisfaction in distribution sectors,” ICIC Express Letters, Part B: Applications, vol. 11, no. 2, pp. 173–179, 2020.

B. W. Yoon, E. Genc, O. F. Ince, and M. E. Yildirim, “Human activity recognition using inter-joint feature fusion with SVD,” ICIC Express Letters, Part B: Applications, vol. 12, no. 3, pp. 215–221, 2021.

R. Yamashita, M. Nishio, R. K. G. Do, and K. Togashi, “Convolutional neural networks: An overview and application in radiology,” Insights into Imaging, vol. 9, pp. 611–629, 2018.

D. I. Patrício and R. Rieder, “Computer vision and artificial intelligence in precision agriculture for grain crops: A systematic review,” Computers and Electronics in Agriculture, vol. 153, pp. 69–81, 2018.

Y. D. Zhang, Z. Dong, X. Chen, W. Jia, S. Du, K. Muhammad, and S. H. Wang, “Image based fruit category classification by 13-layer deep convolutional neural network and data augmentation,” Multimedia Tools and Applications, vol. 78, pp. 3613–3632, 2019.

R. Girshick, J. Donahue, T. Darrell, and J. Malik, “Rich feature hierarchies for accurate object detection and semantic segmentation,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2014, pp. 580–587.

R. Girshick, “Fast R-CNN,” in IEEE International Conference on Computer Vision (ICCV), Santiago, Chile, Dec. 7–13, 2015, pp. 1440–1448.

S. Ren, K. He, R. Girshick, and J. Sun, “Faster R-CNN: Towards real-time object detection with region proposal networks,” in Advances in Neural Information Processing Systems, Montreal, Canada, Dec. 7–12, 2015, pp. 91–99.

W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C. Y. Fu, and A. C. Berg, “SSD: Single shot multibox detector,” in Computer Vision–ECCV 2016. Amsterdam, The Netherlands: Springer, Oct. 11–14, 2016, pp. 21–37.

J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, “You only look once: Unified, real-time object detection,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016, pp. 779–788.

H. Yang, P. Liu, Y. Hu, and J. Fu, “Research on underwater object recognition based on YOLOv3,” Microsystem Technologies, vol. 27, pp. 1837–1844, 2021.

K. Cai, X. Miao, W. Wang, H. Pang, Y. Liu, and J. Song, “A modified YOLOv3 model for fish detection based on MobileNetv1 as backbone,” Aquacultural Engineering, vol. 91, pp. 1–9, 2020.

K. Raza and H. Song, “Fast and accurate fish detection design with improved YOLO-v3 model and transfer learning,” International Journal of Advanced Computer Science and Applications, vol. 11, no. 2, pp. 7–16, 2020.

A. Wong, M. Famuori, M. J. Shafiee, F. Li, B. Chwyl, and J. Chung, “YOLO nano: A highly compact you only look once convolutional neural network for object detection,” in 2019 Fifth Workshop on Energy Efficient Machine Learning and Cognitive Computing-NeurIPS Edition (EMC2-NIPS). IEEE, 2019, pp. 22–25.

R. Huang, J. Pedoeem, and C. Chen, “YOLO-LITE: A real-time object detection algorithm optimized for non-GPU computers,” in 2018 IEEE International Conference on Big Data (Big Data). Seattle, USA: IEEE, Dec. 10–13, 2018, pp. 2503–2510.

H. Zhao, Y. Zhou, L. Zhang, Y. Peng, X. Hu, H. Peng, and X. Cai, “Mixed YOLOv3-LITE: A lightweight real-time object detection method,” Sensors, vol. 20, no. 7, pp. 1–18, 2020.

P. Adarsh, P. Rathi, and M. Kumar, “YOLO v3-Tiny: Object detection and recognition using one stage improved model,” in 2020 6th International Conference on Advanced Computing and Communication Systems (ICACCS). Coimbatore, India: IEEE, March 6–7, 2020, pp. 687–694.

K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, Nevada, United States, June 26–July 1, 2016, pp. 770–778.

W. Y. Ong, C. W. Too, and K. C. Khor, “Transfer learning on Inception ResNet V2 for expiry reminder: A mobile application development,” in International Conference on Mobile Web and Intelligent Information Systems. Virtual (Online): Springer, Aug. 23–25, 2021, pp. 149–160.

Z. Wu, C. Shen, and A. Van Den Hengel, “Wider or deeper: Revisiting the ResNet model for visual recognition,” Pattern Recognition, vol. 90, pp. 119–133, 2019.

I. Martinez-Alpiste, G. Golcarenarenji, Q. Wang, and J. M. Alcaraz-Calero, “Smartphone-based real-time object recognition architecture for portable and constrained systems,” Journal of Real-Time Image Processing, vol. 19, no. 1, pp. 103–115, 2022.

I. Martinez-Alpiste, P. Casaseca-de-la Higuera, J. Alcaraz-Calero, C. Grecos, and Q. Wang, “Benchmarking machine-learning-based object detection on a UAV and mobile platform,” in 2019 IEEE Wireless Communications and Networking Conference (WCNC). Marrakesh, Morocco: IEEE, April 15–18, 2019, pp. 1–6.

L. Tobías, A. Ducournau, F. Rousseau, G. Mercier, and R. Fablet, “Convolutional neural networks for object recognition on mobile devices: A case study,” in 2016 23rd International Conference on Pattern Recognition (ICPR). Cancun, Mexico: IEEE, Dec. 4–8, 2016, pp. 3530–3535.

R. Kostoeva, R. Upadhyay, Y. Sapar, and A. Zakhor, “Indoor 3D interactive asset detection using a smartphone,” The International Archives of Photogrammetry, Remote Sensing and Spatial Information Sciences, vol. 42, pp. 811–817, 2019.

J. Redmon and A. Farhadi, “YOLOv3: An incremental improvement,” 2018. [Online]. Available: https://arxiv.org/abs/1804.02767

J. Redmon and A. Farhadi, “YOLO9000: Better, faster, stronger,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017, pp. 7263–7271.

Y. Song, Q. K. Pan, L. Gao, and B. Zhang, “Improved non-maximum suppression for object detection using harmony search algorithm,” Applied Soft Computing, vol. 81, pp. 1–13, 2019.

G. Wu and Y. Li, “Non-maximum suppression for object detection based on the chaotic whale optimization algorithm,” Journal of Visual Communication and Image Representation, vol. 74, pp. 1–8, 2021.

J. Johnson, “CNN benchmarks,” 2017. [Online]. Available: https://github.com/jcjohnson/cnn-benchmarks#readme

P. D. Hung and N. N. Kien, “SSD-Mobilenet implementation for classifying fish species,” in Intelligent Computing and Optimization: Proceedings of the 2nd International Conference on Intelligent Computing and Optimization 2019 (ICO 2019). Springer, 2020, pp. 399–408.

HUAWEI CLOUD, “Introduction to ExeML,” 2023. [Online]. Available: https://support.huaweicloud.com/intl/en-us/exemlug-modelarts/modelarts_21_0001.html

Published

2023-03-17