Analysis And Voice Recognition In Indonesian Language Using MFCC And SVM Method

Harvianto Harvianto; Livia Ashianti; Jupiter Jupiter; Suhandi Junaedi

doi:10.21512/comtech.v7i2.2252

Analysis And Voice Recognition In Indonesian Language Using MFCC And SVM Method

Authors

Harvianto Harvianto Bina Nusantara University
Livia Ashianti Bina Nusantara University
Jupiter Jupiter Bina Nusantara University
Suhandi Junaedi Bina Nusantara University

DOI:

https://doi.org/10.21512/comtech.v7i2.2252

Keywords:

voice recognition, MFCC, SVM, cross validation

Abstract

Voice recognition technology is one of biometric technology. Sound is a unique part of the human being which made an individual can be easily distinguished one from another. Voice can also provide information such as gender, emotion, and identity of the speaker. This research will record human voices that pronounce digits between 0 and 9 with and without noise. Features of this sound recording will be extracted using Mel Frequency Cepstral Coefficient (MFCC). Mean, standard deviation, max, min, and the combination of them will be used to construct the feature vectors. This feature vectors then will be classified using Support Vector Machine (SVM). There will be two classification models. The first one is based on the speaker and the other one based on the digits pronounced. The classification model then will be validated by performing 10-fold cross-validation.The best average accuracy from two classification model is 91.83%. This result achieved using Mean + Standard deviation + Min + Max as features.

Dimensions

Plum Analytics

References

Fokoue, E., & Ma, Z. (2013). Speaker Gender Recognition via MFCCs and SVMs. RIT Scholar Works.

Gracieth, B., Washington, S., & Filho, O. (2014). Classification of Pattern using Support Vector Machines: An Application for Automatic. The Eighth International Conference on Advanced Engineering Computing and Applications in Sciences. Rome, Italy: IARIA.

Lindasalwa, M., Mumtaj, B., & Elamvazuthi, I. (2010). Voice Recognition Algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques. Journal of Computing, 2, 138-143.

Lockwood, P., & Boudy, J. (1992). Experiments with a Nonlinear Spectral Subtractor (NSS), Hidden Markov Models and the projection, for robust speech recognition in cars. Journal Speech Communication - Eurospeech '91, 11(2-3), 215-228.

Payam, R., Lei, T., & Huan, L. (2009). Cross-Validation. In Encyclopedia of Database Systems. Putra, D., & Resmawan, A. (2011). Verifikasi Biometrika Suara Menggunakan Metode MFCC dan

DTW. Lontar Komputer, 2, 8-21.

Rosenberg, A., Lee, C. H., & Soong, F. (1994). Cepstral channel normalization techniques for HMMbased speaker verification. Proc. Int. Conf. on Spoken Language Processing, 1835-1838.

Yee, C. S., & Ahmad, A. M. (2008). Malay language text-independent speaker verification using NNMLP classifier with MFCC. Electronic Design, 2008. ICED 2008. International Conference.

Penang: IEEE.

Downloads

Published

2016-06-01

How to Cite

Harvianto, H., Ashianti, L., Jupiter, J., & Junaedi, S. (2016). Analysis And Voice Recognition In Indonesian Language Using MFCC And SVM Method. ComTech: Computer, Mathematics and Engineering Applications, 7(2), 131–139. https://doi.org/10.21512/comtech.v7i2.2252

Download Citation

Issue

Vol. 7 No. 2 (2016): ComTech

Section

Articles

License

Authors who publish with this journal agree to the following terms:
a. Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under aÂ Creative Commons Attribution LicenseÂ - Share AlikeÂ that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.

b. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.

c. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.

Â USER RIGHTS

Â All articles published Open Access will be immediately and permanently free for everyone to read and download.Â We are continuously working with our author communities to select the best choice of license options, currently being defined for this journal as follows:

â€¢ Creative Commons Attribution-Share alike (CC BY-SA)