Javanese Document Image Recognition Using Multiclass Support Vector Machine

Authors

  • Yuna Sugianela Institut Teknologi Sepuluh Nopember Surabaya
  • Nanik Suciati Institut Teknologi Sepuluh Nopember Surabaya

DOI:

https://doi.org/10.21512/commit.v13i1.5330

Keywords:

Javanese Script, Recognition, Classification, Multiclass Support Vector Machine, One Against One Strategy

Abstract

Some ancient documents in Indonesia are written in the Javanese script. Those documents contain the knowledge of history and culture of Indonesia, especially about Java. However, only a few people understand the Javanese script. Thus, the automation system is needed to translate the document written in the Javanese script. In this study, the researchers use the classification method to recognize the Javanese script written in the document. The method used is the Multiclass Support Vector Machine (SVM) using One Against One (OAO) strategy. The researchers use seven variations of Javanese script from the different document for this study. There are 31 classes and 182 data for training and testing data. The result shows good performance in the evaluation. The recognition system successfully resolves the problem of color variation from the dataset. The accuracy of the study is 81.3%.

Dimensions

Plum Analytics

References

A. R. Widiarti, A. Harjoko, and S. Hartati, “Preprocessing model of manuscripts in Javanese characters,” Journal of Signal and Information Processing, vol. 5, no. 04, pp. 112–122, 2014.

M. A. Wibowo, M. Soleh, W. Pradani, A. N. Hidayanto, and A. M. Arymurthy, “Handwritten Javanese character recognition using descriminative deep learning technique,” in 2017 2nd International conferences on Information Technology, Information Systems and Electrical Engineering (ICITISEE). Yogyakarta, Indonesia: IEEE, Nov. 1–2 2017, pp. 325–330.

A. R. Widiarti and P. N. Wastu, “Javanese character recognition using hidden Markov model,” International Journal of Computer, Electrical, Automation, Control and Information Engineering, vol. 3, no. 9, pp. 2201–2204, 2009.

A. R. Widiarti, A. Harjoko, Marsono, and S. Hartati, “The model and implementation of Javanese script image transliteration,” in 2017 International Conference on Soft Computing, Intelligent System and Information Technology (ICSIIT). Denpasar, Indonesia: IEEE, Sept. 26–29 2017, pp. 51–57.

Tim Ahli Bahasa Jawa, Pedoman penulisan Aksara Jawa. Yogyakarta: Yayasan Pustaka Nusatama, 2002.

A. M. Sulaiman, “Hanacaraka: Aksara Jawa yang mulai ditinggalkan,” Institut Seni Indonesia, Tech. Rep., 2011. [Online]. Available: https://bit.ly/2IpgEa0

A. R. Widiarti, A. Harjoko, and S. Hartati, “Line segmentation of Javanese image of manuscripts in Javanese scripts,” International Journal of Engineering Innovations and Research (IJEIR), vol. 2, pp. 239–244, 2013.

A. Tikader and N. Puhan, “Histogram of Oriented Gradients for English-Bengali script recognition,” in International Conference for Convergence for Technology-2014. Pune, India: IEEE, April 6–8 2014, pp. 1–5.

A. S. Nugroho, A. B. Witarto, and D. Handoko. (2003) Support Vector Machine – Teori dan aplikasinya dalam bioinformatika. [Online]. Available: http://www.asnugroho.net/papers/ikcsvm.pdf

C. Cortes and V. Vapnik, “Support-vector networks,” Machine Learning, vol. 20, no. 3, pp. 273–297, 1995.

T. Zhang, “An introduction to Support Vector Machines and other kernel-based learning methods,” AI Magazine, vol. 22, no. 2, pp. 103–104, 2001.

H. C. S. Ningrum, “Perbandingan metode Support Vector Machine (SVM) Linear, Radial Basis Function (RBF), dan Polinomial Kernel dalam klasifikasi bidang studi lanjut pilihan alumni UII,” 2018. [Online]. Available: https://dspace.uii.ac.id/handle/123456789/7791

C. J. Burges, “A tutorial on Support Vector Machines for pattern recognition,” Data Mining and Knowledge Discovery, vol. 2, no. 2, pp. 121–167, 1998.

T. Joachims, Advances in kernel methods - Support vector learning. Cambridge, Massachusetts: The MIT Press, 1998, ch. Making large-scale SVM learning practical.

B. Aisen. (2006) A comparison of multiclass SVM methods. [Online]. Available: https://courses.media.mit.edu/2006fall/mas622j/Projects/aisen-project/

J. Milgram, M. Cheriet, and R. Sabourin, ““One Against One” or “One Against All”: Which one is better for handwriting recognition with SVMs?” in Tenth International Workshop on Frontiers in Handwriting Recognition. La Baule, France: Suvisoft, Oct. 23–26 2006.

N. Dalal and B. Triggs, “Histograms of Oriented Gradients for human detection,” in International Conference on Computer Vision & Pattern Recognition (CVPR’05), vol. 1. San Diego, CA, USA: IEEE Computer Society, June 20–25 2005, pp. 886–893.

M. N. Fuad and N. Suciati, “Klasifikasi multilabel motif citra batik menggunakan boosted random ferns,” JUTI: Jurnal Ilmiah Teknologi Informasi, vol. 16, no. 1, pp. 79–89, 2018.

Y. Sugianela and N. Suciati, “Ekstraksi fitur pada pengenalan karakter Aksara Jawa berbasis Histogram of Oriented Gradient,” JUTI: Jurnal Ilmiah Teknologi Informasi, vol. 17, no. 1, pp. 64–72, 2019.

Y. Sugianela, Q. L. Sutino, and D. Herumurti, “EEG classification for epilepsy based on wavelet packet decomposition and random forest,” Jurnal Ilmu Komputer dan Informasi, vol. 11, no. 1, pp. 27–33, 2018.

Downloads

Published

2019-05-31
Abstract 1256  .
PDF downloaded 501  .