Javanese Document Image Recognition Using Multiclass Support Vector Machine
DOI:
https://doi.org/10.21512/commit.v13i1.5330Keywords:
Javanese Script, Recognition, Classification, Multiclass Support Vector Machine, One Against One StrategyAbstract
Some ancient documents in Indonesia are written in the Javanese script. Those documents contain the knowledge of history and culture of Indonesia, especially about Java. However, only a few people understand the Javanese script. Thus, the automation system is needed to translate the document written in the Javanese script. In this study, the researchers use the classification method to recognize the Javanese script written in the document. The method used is the Multiclass Support Vector Machine (SVM) using One Against One (OAO) strategy. The researchers use seven variations of Javanese script from the different document for this study. There are 31 classes and 182 data for training and testing data. The result shows good performance in the evaluation. The recognition system successfully resolves the problem of color variation from the dataset. The accuracy of the study is 81.3%.
Plum Analytics
References
A. R. Widiarti, A. Harjoko, and S. Hartati, “Preprocessing model of manuscripts in Javanese characters,” Journal of Signal and Information Processing, vol. 5, no. 04, pp. 112–122, 2014.
M. A. Wibowo, M. Soleh, W. Pradani, A. N. Hidayanto, and A. M. Arymurthy, “Handwritten Javanese character recognition using descriminative deep learning technique,” in 2017 2nd International conferences on Information Technology, Information Systems and Electrical Engineering (ICITISEE). Yogyakarta, Indonesia: IEEE, Nov. 1–2 2017, pp. 325–330.
A. R. Widiarti and P. N. Wastu, “Javanese character recognition using hidden Markov model,” International Journal of Computer, Electrical, Automation, Control and Information Engineering, vol. 3, no. 9, pp. 2201–2204, 2009.
A. R. Widiarti, A. Harjoko, Marsono, and S. Hartati, “The model and implementation of Javanese script image transliteration,” in 2017 International Conference on Soft Computing, Intelligent System and Information Technology (ICSIIT). Denpasar, Indonesia: IEEE, Sept. 26–29 2017, pp. 51–57.
Tim Ahli Bahasa Jawa, Pedoman penulisan Aksara Jawa. Yogyakarta: Yayasan Pustaka Nusatama, 2002.
A. M. Sulaiman, “Hanacaraka: Aksara Jawa yang mulai ditinggalkan,” Institut Seni Indonesia, Tech. Rep., 2011. [Online]. Available: https://bit.ly/2IpgEa0
A. R. Widiarti, A. Harjoko, and S. Hartati, “Line segmentation of Javanese image of manuscripts in Javanese scripts,” International Journal of Engineering Innovations and Research (IJEIR), vol. 2, pp. 239–244, 2013.
A. Tikader and N. Puhan, “Histogram of Oriented Gradients for English-Bengali script recognition,” in International Conference for Convergence for Technology-2014. Pune, India: IEEE, April 6–8 2014, pp. 1–5.
A. S. Nugroho, A. B. Witarto, and D. Handoko. (2003) Support Vector Machine – Teori dan aplikasinya dalam bioinformatika. [Online]. Available: http://www.asnugroho.net/papers/ikcsvm.pdf
C. Cortes and V. Vapnik, “Support-vector networks,” Machine Learning, vol. 20, no. 3, pp. 273–297, 1995.
T. Zhang, “An introduction to Support Vector Machines and other kernel-based learning methods,” AI Magazine, vol. 22, no. 2, pp. 103–104, 2001.
H. C. S. Ningrum, “Perbandingan metode Support Vector Machine (SVM) Linear, Radial Basis Function (RBF), dan Polinomial Kernel dalam klasifikasi bidang studi lanjut pilihan alumni UII,” 2018. [Online]. Available: https://dspace.uii.ac.id/handle/123456789/7791
C. J. Burges, “A tutorial on Support Vector Machines for pattern recognition,” Data Mining and Knowledge Discovery, vol. 2, no. 2, pp. 121–167, 1998.
T. Joachims, Advances in kernel methods - Support vector learning. Cambridge, Massachusetts: The MIT Press, 1998, ch. Making large-scale SVM learning practical.
B. Aisen. (2006) A comparison of multiclass SVM methods. [Online]. Available: https://courses.media.mit.edu/2006fall/mas622j/Projects/aisen-project/
J. Milgram, M. Cheriet, and R. Sabourin, ““One Against One” or “One Against All”: Which one is better for handwriting recognition with SVMs?” in Tenth International Workshop on Frontiers in Handwriting Recognition. La Baule, France: Suvisoft, Oct. 23–26 2006.
N. Dalal and B. Triggs, “Histograms of Oriented Gradients for human detection,” in International Conference on Computer Vision & Pattern Recognition (CVPR’05), vol. 1. San Diego, CA, USA: IEEE Computer Society, June 20–25 2005, pp. 886–893.
M. N. Fuad and N. Suciati, “Klasifikasi multilabel motif citra batik menggunakan boosted random ferns,” JUTI: Jurnal Ilmiah Teknologi Informasi, vol. 16, no. 1, pp. 79–89, 2018.
Y. Sugianela and N. Suciati, “Ekstraksi fitur pada pengenalan karakter Aksara Jawa berbasis Histogram of Oriented Gradient,” JUTI: Jurnal Ilmiah Teknologi Informasi, vol. 17, no. 1, pp. 64–72, 2019.
Y. Sugianela, Q. L. Sutino, and D. Herumurti, “EEG classification for epilepsy based on wavelet packet decomposition and random forest,” Jurnal Ilmu Komputer dan Informasi, vol. 11, no. 1, pp. 27–33, 2018.
Downloads
Published
Issue
Section
License
Authors who publish with this journal agree to the following terms:
a. Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License - Share Alike that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
b. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
c. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.
USER RIGHTS
All articles published Open Access will be immediately and permanently free for everyone to read and download. We are continuously working with our author communities to select the best choice of license options, currently being defined for this journal as follows: Creative Commons Attribution-Share Alike (CC BY-SA)