Tweets Emotions Analysis of Community Activities Restriction as COVID-19 Policy in Indonesia Using Support Vector Machine

Authors

  • Abi Nizar Sutranggono Universitas Negeri Surabaya
  • Elly Matul Imah Universitas Negeri Surabaya

DOI:

https://doi.org/10.21512/commit.v17i1.8189

Keywords:

Tweets, Emotions Analysis, COVID-19 Policy, Support Vector Machine

Abstract

With the rising number of COVID-19 cases in Indonesia, the government has implemented the Imposition of Restrictions on Emergency Community Activities (Pemberlakuan Pembatasan Kegiatan Masyarakat - PPKM) as Indonesia’s COVID-19 policy. Several controversies and protests have colored the implementation of this emergency policy. Some netizens on Twitter voice their opinions about the policy in their tweets. Emotions in tweets can be recognized through text-based emotion detection or emotion analysis. However, textbased emotion detection is a challenging task. One of the main issues in classifying text with a machine learningbased approach deals with the feature dimensions. As a result, appropriate methods for accurately identifying emotion based on the text are required. The research studies an emotions analysis task on Indonesians’ PPKMrelated tweets to understand their emotional state while implementing the PPKM. The machine learning classification algorithms used are Support Vector Machine (SVM) and random forest. The total number of tweets is 4,401. The results show that SVM with linear kernel function combined with the TF-IDF and Chi-Square methods outperforms other classifiers with an accuracy of 0.7528. The accuracy value is higher than those obtained by previous studies. Moreover, the results of the emotion classification on PPKM tweets reveal that most Indonesians are unhappy with the implementation of the PPKM policy.

Dimensions

Plum Analytics

Author Biographies

Abi Nizar Sutranggono, Universitas Negeri Surabaya

Mathematic Department

Elly Matul Imah, Universitas Negeri Surabaya

Data Science Department

References

D. Cucinotta and M. Vanelli, “WHO declares COVID-19 a pandemic,” Acta Bio Medica: Atenei Parmensis, vol. 91, no. 1, pp. 157–160, 2020.

Menteri Dalam Negeri Republik Indonesia, “Instruksi Menteri Dalam Negeri Nomor 15 Tahun 2021,” 2021. [Online]. Available: https: //bit.ly/3lZlTmK

Z. Drus and H. Khalid, “Sentiment analysis in social media and its application: Systematic literature review,” Procedia Computer Science, vol. 161, pp. 707–714, 2019.

P. Monachesi and S. Witteborn, “Building the sustainable city through Twitter: Creative skilled migrants and innovative technology use,” Telematics and Informatics, vol. 58, pp. 1–10, 2021.

K. Garcia and L. Berton, “Topic detection and sentiment analysis in Twitter content related to COVID-19 from Brazil and the USA,” Applied Soft Computing, vol. 101, pp. 1–15, 2021.

M. Baali and N. Ghneim, “Emotion analysis of Arabic tweets using deep learning approach,” Journal of Big Data, vol. 6, no. 1, pp. 1–12, 2019.

S. Kusal, S. Patil, K. Kotecha, R. Aluvalu, and V. Varadarajan, “AI based emotion detection for textual big data: Techniques and contribution,” Big Data and Cognitive Computing, vol. 5, no. 3, pp. 1–45, 2021.

F. A. Acheampong, C. Wenyu, and H. Nunoo-Mensah, “Text-based emotion detection: Advances, challenges, and opportunities,” Engineering Reports, vol. 2, no. 7, pp. 1–24, 2020.

M. Z. Asghar, F. Subhan, M. Imran, F. M. Kundi, S. Shamshirband, A. Mosavi, P. Csiba, and A. R. V´arkonyi-K´oczy, “Performance evaluation of supervised machine learning techniques for efficient detection of emotions from online content,” Preprint, pp. 1–30, 2019.

P. Nandwani and R. Verma, “A review on sentiment analysis and emotion detection from text,” Social Network Analysis and Mining, vol. 11, no. 1, pp. 1–19, 2021.

R. Jayakrishnan, G. N. Gopal, and M. S. Santhikrishna, “Multi-class emotion detection and annotation in Malayalam novels,” in 2018 International Conference on Computer Communication and Informatics (ICCCI). IEEE, 2018, pp. 1–5.

P. Saigal and V. Khanna, “Multi-category news classification using support vector machine based classifiers,” SN Applied Sciences, vol. 2, no. 3, pp. 1–12, 2020.

L. De Bruyne, O. De Clercq, and V. Hoste, “Lt3 at semeval-2018 task 1: A classifier chain to detect emotions in tweets,” in Proceedings of the 12th International Workshop on Semantic Evaluation, 2018, pp. 123–127.

M. Suhasini and B. Srinivasu, “Emotion detection framework for Twitter data using supervised classifiers,” in Data engineering and communication technology. Advances in intelligent systems and computing. Springer, 2020.

J. Carrillo-de Albornoz, J. Rodriguez Vidal, and L. Plaza, “Feature engineering for sentiment analysis in e-health forums,” PLOS ONE, vol. 13, no. 11, pp. 1–25, 2018.

A. Jaffe, Y. Kluger, O. Lindenbaum, J. Patsenker, E. Peterfreund, and S. Steinerberger, “The spectral underpinning of word2vec,” Frontiers in Applied Mathematics and Statistics, vol. 6, pp. 1–10, 2020.

N. Kulkarni, R. Vaidya, and M. Bhate, “A comparative study of word embedding techniques to extract features from text,” Turkish Journal of Computer and Mathematics Education (TURCOMAT), vol. 12, no. 12, pp. 3550–3557, 2021.

O. Kwon, D. Kim, S. R. Lee, J. Choi, and S. Lee, “Handling out-of-vocabulary problem in hangeul word embeddings,” in Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021, pp. 3213–3221.

M. Chiny, M. Chihab, O. Bencharef, and Y. Chihab, “LSTM, VADER and TF-IDF based hybrid sentiment analysis model,” International Journal of Advanced Computer Science and Applications, vol. 12, no. 7, pp. 265–275, 2021.

D. Endalie and T. Tegegne, “Designing a hybrid dimension reduction for improving the performance of Amharic news document classification,” PLOS ONE, vol. 16, no. 5, pp. 1–14, 2021.

D. A. Eisa, A. I. Taloba, and S. S. I. Ismail, “A comparative study on using principle component analysis with different text classifiers,” International Journal of Computer Applications, vol. 180, no. 31, pp. 1–6, 2018.

U. A. M. E. Ali, M. A. Hossain, and M. R. Islam, “Analysis of PCA based feature extraction methods for classification of hyperspectral image,” in 2019 2nd International Conference on Innovation in Engineering and Technology (ICIET). IEEE, 2019, pp. 1–6.

M. Alassaf and A. M. Qamar, “Improving sentiment analysis of Arabic tweets by One-Way ANOVA,” Journal of King Saud University-Computer and Information Sciences, vol. 34, no. 6, pp. 2849–2859, 2022.

S. Bahassine, A. Madani, M. Al-Sarem, and M. Kissi, “Feature selection using an improved Chi-square for Arabic text classification,” Journal of King Saud University-Computer and Information Sciences, vol. 32, no. 2, pp. 225–231, 2020.

L. Singh, S. Singh, and N. Aggarwal, “Two-stage text feature selection method for human emotion recognition,” in Proceedings of 2nd International Conference on Communication, Computing and Networking: ICCCN 2018, NITTTR Chandigarh, India. Springer, 2018, pp. 531–538.

C. H. Cheng and H. H. Chen, “Sentimental text mining based on an additional features method for text classification,” PLOS ONE, vol. 14, no. 6, pp. 1–17, 2019.

A. R. Murthy and K. A. Kumar, “A review of different approaches for detecting emotion from text,” in IOP Conference Series: Materials Science and Engineering, vol. 1110. IOP Publishing, 2021, pp. 1–23.

N. Alswaidan and M. E. B. Menai, “A survey of state-of-the-art approaches for emotion recognition in text,” Knowledge and Information Systems, vol. 62, pp. 2937–2987, 2020.

M. S. Saputri, R. Mahendra, and M. Adriani, “Emotion classification on Indonesian Twitter dataset,” in 2018 International Conference on Asian Language Processing (IALP). IEEE, 2018, pp. 90–95.

E. Cambria, A. Livingstone, and A. Hussain, “The hourglass of emotions,” in Cognitive behavioural systems. Lecture notes in computer science. Springer, 2012.

P. R. Shaver, U. Murdaya, and R. C. Fraley, “Structure of the Indonesian emotion lexicon,” Asian Journal of Social Psychology, vol. 4, no. 3, pp. 201–224, 2001.

I. Dongo, Y. Cadinale, A. Aguilera, F. Mart´ınez, Y. Quintero, and S. Barrios, “Web scraping versus Twitter API: A comparison for a credibility analysis,” in Proceedings of the 22nd International Conference on Information Integration and Web-Based Applications & Services, 2020, pp. 263–273.

K. M. Ridhwan and C. A. Hargreaves, “Leveraging Twitter data to understand public sentiment for the COVID-19 outbreak in Singapore,” International Journal of Information Management Data Insights, vol. 1, no. 2, pp. 1–15, 2021.

B. Gaind, V. Syal, and S. Padgalwar, “Emotion detection and analysis on social media,” 2019. [Online]. Available: http: //arxiv.org/abs/1901.08458

H. T. Duong and T. A. Nguyen-Thi, “A review: Preprocessing techniques and data augmentation for sentiment analysis,” Computational Social Networks, vol. 8, no. 1, pp. 1–16, 2021.

S. Sarica and J. Luo, “Stopwords in technical language processing,” PLOS ONE, vol. 16, no. 8, pp. 1–13, 2021.

H. Young, “The digital language divide: How does the language you speak shape your experience of the Internet?” [Online]. Available: http: //labs.theguardian.com/digital-language-divide/

R. Chandra and A. Krishna, “COVID-19 sentiment analysis via deep learning during the rise of novel cases,” PLOS ONE, vol. 16, no. 8, pp. 1–26, 2021.

Y. Li and T. Yang, “Word embedding for understanding natural language: A survey,” Guide to Big Data Applications, pp. 83–104, 2018.

F. K. Khattak, S. Jeblee, C. Pou-Prom, M. Abdalla, C. Meaney, and F. Rudzicz, “A survey of word embeddings for clinical text,” Journal of Biomedical Informatics, vol. 100, pp. 1–18, 2019.

P. Bojanowski, E. Grave, A. Joulin, and T. Mikolov, “Enriching word vectors with subword information,” Transactions of the Association for Computational Linguistics, vol. 5, pp. 135–146, 2017.

A. Madasu and S. Elango, “Efficient feature selection techniques for sentiment analysis,” Multimedia Tools and Applications, vol. 79, pp. 6313–6335, 2020.

E. J. Cushion, J. Warmenhoven, J. S. North, and D. J. Cleather, “Principal component analysis reveals the proximal to distal pattern in vertical jumping is governed by two functional degrees of freedom,” Frontiers in Bioengineering and Biotechnology, vol. 7, pp. 1–11, 2019.

K. J. Johnson and R. E. Synovec, “Pattern recognition of jet fuels: Comprehensive GC× GC with ANOVA-based feature selection and principal component analysis,” Chemometrics and Intelligent Laboratory Systems, vol. 60, no. 1-2, pp. 225–237, 2002.

G. Forman, “An extensive empirical study of feature selection metrics for text classification,” Journal of Machine Learning Research, vol. 3, pp. 1289–1305, 2003.

S. Chatterjee, K. Chakrabarti, A. Garain, F. Schwenker, and R. Sarkar, “JUMRv1: A sentiment analysis dataset for movie recommendation,” Applied Sciences, vol. 11, no. 20, pp. 1–24, 2021.

E. M. Imah, F. Al Afif, M. I. Fanany, W. Jatmiko, and T. Basaruddin, “A comparative study on Daubechies Wavelet Transformation, Kernel PCA and PCA as feature extractors for arrhythmia detection using SVM,” in TENCON 2011-2011 IEEE Region 10 Conference. IEEE, 2011, pp. 5–9.

L. Breiman, “Random forests,” Machine Learning, vol. 45, pp. 5–32, 2001.

V. Y. Kullarni and P. K. Sinha, “Random forest classifier: a survey and future research directions,” International Journal of Advanced Computing, vol. 36, no. 1, pp. 1144–1156, 2013.

M. Allouch, A. Azaria, R. Azoulay, E. Ben-Izchak, M. Zwilling, and D. A. Zachor, “Automatic detection of insulting sentences in conversation,” in 2018 IEEE International Conference on the Science of Electrical Engineering in Israel (ICSEE). IEEE, 2018, pp. 1–4.

M. Z. Ali, K. Javed, E. U. Haq, and A. Tariq, “Sentiment and emotion classification of epidemic related bilingual data from social media,” 2021. [Online]. Available: http://arxiv.org/abs/2105.01468

K. Sailunaz and R. Alhajj, “Emotion and sentiment analysis from twitter text,” Journal of Computational Science, vol. 36, pp. 1–18, 2019.

Downloads

Published

2023-03-17
Abstract 779  .
PDF downloaded 894  .