Tweets Emotions Analysis of Community Activities Restriction as COVID-19 Policy in Indonesia Using Support Vector Machine

Abi Nizar Sutranggono; Elly Matul Imah

doi:10.21512/commit.v17i1.8189

Authors

Abi Nizar Sutranggono Universitas Negeri Surabaya
Elly Matul Imah Universitas Negeri Surabaya

DOI:

https://doi.org/10.21512/commit.v17i1.8189

Keywords:

Tweets, Emotions Analysis, COVID-19 Policy, Support Vector Machine

Abstract

With the rising number of COVID-19 cases in Indonesia, the government has implemented the Imposition of Restrictions on Emergency Community Activities (Pemberlakuan Pembatasan Kegiatan Masyarakat - PPKM) as Indonesiaâ€™s COVID-19 policy. Several controversies and protests have colored the implementation of this emergency policy. Some netizens on Twitter voice their opinions about the policy in their tweets. Emotions in tweets can be recognized through text-based emotion detection or emotion analysis. However, textbased emotion detection is a challenging task. One of the main issues in classifying text with a machine learningbased approach deals with the feature dimensions. As a result, appropriate methods for accurately identifying emotion based on the text are required. The research studies an emotions analysis task on Indonesiansâ€™ PPKMrelated tweets to understand their emotional state while implementing the PPKM. The machine learning classification algorithms used are Support Vector Machine (SVM) and random forest. The total number of tweets is 4,401. The results show that SVM with linear kernel function combined with the TF-IDF and Chi-Square methods outperforms other classifiers with an accuracy of 0.7528. The accuracy value is higher than those obtained by previous studies. Moreover, the results of the emotion classification on PPKM tweets reveal that most Indonesians are unhappy with the implementation of the PPKM policy.

Dimensions

Plum Analytics

Author Biographies

Abi Nizar Sutranggono, Universitas Negeri Surabaya

Mathematic Department

Elly Matul Imah, Universitas Negeri Surabaya

Data Science Department

References

D. Cucinotta and M. Vanelli, â€œWHO declares COVID-19 a pandemic,â€ Acta Bio Medica: Atenei Parmensis, vol. 91, no. 1, pp. 157â€“160, 2020.

Menteri Dalam Negeri Republik Indonesia, â€œInstruksi Menteri Dalam Negeri Nomor 15 Tahun 2021,â€ 2021. [Online]. Available: https: //bit.ly/3lZlTmK

Z. Drus and H. Khalid, â€œSentiment analysis in social media and its application: Systematic literature review,â€ Procedia Computer Science, vol. 161, pp. 707â€“714, 2019.

P. Monachesi and S. Witteborn, â€œBuilding the sustainable city through Twitter: Creative skilled migrants and innovative technology use,â€ Telematics and Informatics, vol. 58, pp. 1â€“10, 2021.

K. Garcia and L. Berton, â€œTopic detection and sentiment analysis in Twitter content related to COVID-19 from Brazil and the USA,â€ Applied Soft Computing, vol. 101, pp. 1â€“15, 2021.

M. Baali and N. Ghneim, â€œEmotion analysis of Arabic tweets using deep learning approach,â€ Journal of Big Data, vol. 6, no. 1, pp. 1â€“12, 2019.

S. Kusal, S. Patil, K. Kotecha, R. Aluvalu, and V. Varadarajan, â€œAI based emotion detection for textual big data: Techniques and contribution,â€ Big Data and Cognitive Computing, vol. 5, no. 3, pp. 1â€“45, 2021.

F. A. Acheampong, C. Wenyu, and H. Nunoo-Mensah, â€œText-based emotion detection: Advances, challenges, and opportunities,â€ Engineering Reports, vol. 2, no. 7, pp. 1â€“24, 2020.

M. Z. Asghar, F. Subhan, M. Imran, F. M. Kundi, S. Shamshirband, A. Mosavi, P. Csiba, and A. R. VÂ´arkonyi-KÂ´oczy, â€œPerformance evaluation of supervised machine learning techniques for efficient detection of emotions from online content,â€ Preprint, pp. 1â€“30, 2019.

P. Nandwani and R. Verma, â€œA review on sentiment analysis and emotion detection from text,â€ Social Network Analysis and Mining, vol. 11, no. 1, pp. 1â€“19, 2021.

R. Jayakrishnan, G. N. Gopal, and M. S. Santhikrishna, â€œMulti-class emotion detection and annotation in Malayalam novels,â€ in 2018 International Conference on Computer Communication and Informatics (ICCCI). IEEE, 2018, pp. 1â€“5.

P. Saigal and V. Khanna, â€œMulti-category news classification using support vector machine based classifiers,â€ SN Applied Sciences, vol. 2, no. 3, pp. 1â€“12, 2020.

L. De Bruyne, O. De Clercq, and V. Hoste, â€œLt3 at semeval-2018 task 1: A classifier chain to detect emotions in tweets,â€ in Proceedings of the 12th International Workshop on Semantic Evaluation, 2018, pp. 123â€“127.

M. Suhasini and B. Srinivasu, â€œEmotion detection framework for Twitter data using supervised classifiers,â€ in Data engineering and communication technology. Advances in intelligent systems and computing. Springer, 2020.

J. Carrillo-de Albornoz, J. Rodriguez Vidal, and L. Plaza, â€œFeature engineering for sentiment analysis in e-health forums,â€ PLOS ONE, vol. 13, no. 11, pp. 1â€“25, 2018.

A. Jaffe, Y. Kluger, O. Lindenbaum, J. Patsenker, E. Peterfreund, and S. Steinerberger, â€œThe spectral underpinning of word2vec,â€ Frontiers in Applied Mathematics and Statistics, vol. 6, pp. 1â€“10, 2020.

N. Kulkarni, R. Vaidya, and M. Bhate, â€œA comparative study of word embedding techniques to extract features from text,â€ Turkish Journal of Computer and Mathematics Education (TURCOMAT), vol. 12, no. 12, pp. 3550â€“3557, 2021.

O. Kwon, D. Kim, S. R. Lee, J. Choi, and S. Lee, â€œHandling out-of-vocabulary problem in hangeul word embeddings,â€ in Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021, pp. 3213â€“3221.

M. Chiny, M. Chihab, O. Bencharef, and Y. Chihab, â€œLSTM, VADER and TF-IDF based hybrid sentiment analysis model,â€ International Journal of Advanced Computer Science and Applications, vol. 12, no. 7, pp. 265â€“275, 2021.

D. Endalie and T. Tegegne, â€œDesigning a hybrid dimension reduction for improving the performance of Amharic news document classification,â€ PLOS ONE, vol. 16, no. 5, pp. 1â€“14, 2021.

D. A. Eisa, A. I. Taloba, and S. S. I. Ismail, â€œA comparative study on using principle component analysis with different text classifiers,â€ International Journal of Computer Applications, vol. 180, no. 31, pp. 1â€“6, 2018.

U. A. M. E. Ali, M. A. Hossain, and M. R. Islam, â€œAnalysis of PCA based feature extraction methods for classification of hyperspectral image,â€ in 2019 2nd International Conference on Innovation in Engineering and Technology (ICIET). IEEE, 2019, pp. 1â€“6.

M. Alassaf and A. M. Qamar, â€œImproving sentiment analysis of Arabic tweets by One-Way ANOVA,â€ Journal of King Saud University-Computer and Information Sciences, vol. 34, no. 6, pp. 2849â€“2859, 2022.

S. Bahassine, A. Madani, M. Al-Sarem, and M. Kissi, â€œFeature selection using an improved Chi-square for Arabic text classification,â€ Journal of King Saud University-Computer and Information Sciences, vol. 32, no. 2, pp. 225â€“231, 2020.

L. Singh, S. Singh, and N. Aggarwal, â€œTwo-stage text feature selection method for human emotion recognition,â€ in Proceedings of 2nd International Conference on Communication, Computing and Networking: ICCCN 2018, NITTTR Chandigarh, India. Springer, 2018, pp. 531â€“538.

C. H. Cheng and H. H. Chen, â€œSentimental text mining based on an additional features method for text classification,â€ PLOS ONE, vol. 14, no. 6, pp. 1â€“17, 2019.

A. R. Murthy and K. A. Kumar, â€œA review of different approaches for detecting emotion from text,â€ in IOP Conference Series: Materials Science and Engineering, vol. 1110. IOP Publishing, 2021, pp. 1â€“23.

N. Alswaidan and M. E. B. Menai, â€œA survey of state-of-the-art approaches for emotion recognition in text,â€ Knowledge and Information Systems, vol. 62, pp. 2937â€“2987, 2020.

M. S. Saputri, R. Mahendra, and M. Adriani, â€œEmotion classification on Indonesian Twitter dataset,â€ in 2018 International Conference on Asian Language Processing (IALP). IEEE, 2018, pp. 90â€“95.

E. Cambria, A. Livingstone, and A. Hussain, â€œThe hourglass of emotions,â€ in Cognitive behavioural systems. Lecture notes in computer science. Springer, 2012.

P. R. Shaver, U. Murdaya, and R. C. Fraley, â€œStructure of the Indonesian emotion lexicon,â€ Asian Journal of Social Psychology, vol. 4, no. 3, pp. 201â€“224, 2001.

I. Dongo, Y. Cadinale, A. Aguilera, F. MartÂ´Ä±nez, Y. Quintero, and S. Barrios, â€œWeb scraping versus Twitter API: A comparison for a credibility analysis,â€ in Proceedings of the 22nd International Conference on Information Integration and Web-Based Applications & Services, 2020, pp. 263â€“273.

K. M. Ridhwan and C. A. Hargreaves, â€œLeveraging Twitter data to understand public sentiment for the COVID-19 outbreak in Singapore,â€ International Journal of Information Management Data Insights, vol. 1, no. 2, pp. 1â€“15, 2021.

B. Gaind, V. Syal, and S. Padgalwar, â€œEmotion detection and analysis on social media,â€ 2019. [Online]. Available: http: //arxiv.org/abs/1901.08458

H. T. Duong and T. A. Nguyen-Thi, â€œA review: Preprocessing techniques and data augmentation for sentiment analysis,â€ Computational Social Networks, vol. 8, no. 1, pp. 1â€“16, 2021.

S. Sarica and J. Luo, â€œStopwords in technical language processing,â€ PLOS ONE, vol. 16, no. 8, pp. 1â€“13, 2021.

H. Young, â€œThe digital language divide: How does the language you speak shape your experience of the Internet?â€ [Online]. Available: http: //labs.theguardian.com/digital-language-divide/

R. Chandra and A. Krishna, â€œCOVID-19 sentiment analysis via deep learning during the rise of novel cases,â€ PLOS ONE, vol. 16, no. 8, pp. 1â€“26, 2021.

Y. Li and T. Yang, â€œWord embedding for understanding natural language: A survey,â€ Guide to Big Data Applications, pp. 83â€“104, 2018.

F. K. Khattak, S. Jeblee, C. Pou-Prom, M. Abdalla, C. Meaney, and F. Rudzicz, â€œA survey of word embeddings for clinical text,â€ Journal of Biomedical Informatics, vol. 100, pp. 1â€“18, 2019.

P. Bojanowski, E. Grave, A. Joulin, and T. Mikolov, â€œEnriching word vectors with subword information,â€ Transactions of the Association for Computational Linguistics, vol. 5, pp. 135â€“146, 2017.

A. Madasu and S. Elango, â€œEfficient feature selection techniques for sentiment analysis,â€ Multimedia Tools and Applications, vol. 79, pp. 6313â€“6335, 2020.

E. J. Cushion, J. Warmenhoven, J. S. North, and D. J. Cleather, â€œPrincipal component analysis reveals the proximal to distal pattern in vertical jumping is governed by two functional degrees of freedom,â€ Frontiers in Bioengineering and Biotechnology, vol. 7, pp. 1â€“11, 2019.

K. J. Johnson and R. E. Synovec, â€œPattern recognition of jet fuels: Comprehensive GCÃ— GC with ANOVA-based feature selection and principal component analysis,â€ Chemometrics and Intelligent Laboratory Systems, vol. 60, no. 1-2, pp. 225â€“237, 2002.

G. Forman, â€œAn extensive empirical study of feature selection metrics for text classification,â€ Journal of Machine Learning Research, vol. 3, pp. 1289â€“1305, 2003.

S. Chatterjee, K. Chakrabarti, A. Garain, F. Schwenker, and R. Sarkar, â€œJUMRv1: A sentiment analysis dataset for movie recommendation,â€ Applied Sciences, vol. 11, no. 20, pp. 1â€“24, 2021.

E. M. Imah, F. Al Afif, M. I. Fanany, W. Jatmiko, and T. Basaruddin, â€œA comparative study on Daubechies Wavelet Transformation, Kernel PCA and PCA as feature extractors for arrhythmia detection using SVM,â€ in TENCON 2011-2011 IEEE Region 10 Conference. IEEE, 2011, pp. 5â€“9.

L. Breiman, â€œRandom forests,â€ Machine Learning, vol. 45, pp. 5â€“32, 2001.

V. Y. Kullarni and P. K. Sinha, â€œRandom forest classifier: a survey and future research directions,â€ International Journal of Advanced Computing, vol. 36, no. 1, pp. 1144â€“1156, 2013.

M. Allouch, A. Azaria, R. Azoulay, E. Ben-Izchak, M. Zwilling, and D. A. Zachor, â€œAutomatic detection of insulting sentences in conversation,â€ in 2018 IEEE International Conference on the Science of Electrical Engineering in Israel (ICSEE). IEEE, 2018, pp. 1â€“4.

M. Z. Ali, K. Javed, E. U. Haq, and A. Tariq, â€œSentiment and emotion classification of epidemic related bilingual data from social media,â€ 2021. [Online]. Available: http://arxiv.org/abs/2105.01468

K. Sailunaz and R. Alhajj, â€œEmotion and sentiment analysis from twitter text,â€ Journal of Computational Science, vol. 36, pp. 1â€“18, 2019.