Tweets Emotions Analysis of Community Activities Restriction as COVID-19 Policy in Indonesia Using Support Vector Machine
DOI:
https://doi.org/10.21512/commit.v17i1.8189Keywords:
Tweets, Emotions Analysis, COVID-19 Policy, Support Vector MachineAbstract
With the rising number of COVID-19 cases in Indonesia, the government has implemented the Imposition of Restrictions on Emergency Community Activities (Pemberlakuan Pembatasan Kegiatan Masyarakat - PPKM) as Indonesia’s COVID-19 policy. Several controversies and protests have colored the implementation of this emergency policy. Some netizens on Twitter voice their opinions about the policy in their tweets. Emotions in tweets can be recognized through text-based emotion detection or emotion analysis. However, textbased emotion detection is a challenging task. One of the main issues in classifying text with a machine learningbased approach deals with the feature dimensions. As a result, appropriate methods for accurately identifying emotion based on the text are required. The research studies an emotions analysis task on Indonesians’ PPKMrelated tweets to understand their emotional state while implementing the PPKM. The machine learning classification algorithms used are Support Vector Machine (SVM) and random forest. The total number of tweets is 4,401. The results show that SVM with linear kernel function combined with the TF-IDF and Chi-Square methods outperforms other classifiers with an accuracy of 0.7528. The accuracy value is higher than those obtained by previous studies. Moreover, the results of the emotion classification on PPKM tweets reveal that most Indonesians are unhappy with the implementation of the PPKM policy.
Plum Analytics
References
D. Cucinotta and M. Vanelli, “WHO declares COVID-19 a pandemic,” Acta Bio Medica: Atenei Parmensis, vol. 91, no. 1, pp. 157–160, 2020.
Menteri Dalam Negeri Republik Indonesia, “Instruksi Menteri Dalam Negeri Nomor 15 Tahun 2021,” 2021. [Online]. Available: https: //bit.ly/3lZlTmK
Z. Drus and H. Khalid, “Sentiment analysis in social media and its application: Systematic literature review,” Procedia Computer Science, vol. 161, pp. 707–714, 2019.
P. Monachesi and S. Witteborn, “Building the sustainable city through Twitter: Creative skilled migrants and innovative technology use,” Telematics and Informatics, vol. 58, pp. 1–10, 2021.
K. Garcia and L. Berton, “Topic detection and sentiment analysis in Twitter content related to COVID-19 from Brazil and the USA,” Applied Soft Computing, vol. 101, pp. 1–15, 2021.
M. Baali and N. Ghneim, “Emotion analysis of Arabic tweets using deep learning approach,” Journal of Big Data, vol. 6, no. 1, pp. 1–12, 2019.
S. Kusal, S. Patil, K. Kotecha, R. Aluvalu, and V. Varadarajan, “AI based emotion detection for textual big data: Techniques and contribution,” Big Data and Cognitive Computing, vol. 5, no. 3, pp. 1–45, 2021.
F. A. Acheampong, C. Wenyu, and H. Nunoo-Mensah, “Text-based emotion detection: Advances, challenges, and opportunities,” Engineering Reports, vol. 2, no. 7, pp. 1–24, 2020.
M. Z. Asghar, F. Subhan, M. Imran, F. M. Kundi, S. Shamshirband, A. Mosavi, P. Csiba, and A. R. V´arkonyi-K´oczy, “Performance evaluation of supervised machine learning techniques for efficient detection of emotions from online content,” Preprint, pp. 1–30, 2019.
P. Nandwani and R. Verma, “A review on sentiment analysis and emotion detection from text,” Social Network Analysis and Mining, vol. 11, no. 1, pp. 1–19, 2021.
R. Jayakrishnan, G. N. Gopal, and M. S. Santhikrishna, “Multi-class emotion detection and annotation in Malayalam novels,” in 2018 International Conference on Computer Communication and Informatics (ICCCI). IEEE, 2018, pp. 1–5.
P. Saigal and V. Khanna, “Multi-category news classification using support vector machine based classifiers,” SN Applied Sciences, vol. 2, no. 3, pp. 1–12, 2020.
L. De Bruyne, O. De Clercq, and V. Hoste, “Lt3 at semeval-2018 task 1: A classifier chain to detect emotions in tweets,” in Proceedings of the 12th International Workshop on Semantic Evaluation, 2018, pp. 123–127.
M. Suhasini and B. Srinivasu, “Emotion detection framework for Twitter data using supervised classifiers,” in Data engineering and communication technology. Advances in intelligent systems and computing. Springer, 2020.
J. Carrillo-de Albornoz, J. Rodriguez Vidal, and L. Plaza, “Feature engineering for sentiment analysis in e-health forums,” PLOS ONE, vol. 13, no. 11, pp. 1–25, 2018.
A. Jaffe, Y. Kluger, O. Lindenbaum, J. Patsenker, E. Peterfreund, and S. Steinerberger, “The spectral underpinning of word2vec,” Frontiers in Applied Mathematics and Statistics, vol. 6, pp. 1–10, 2020.
N. Kulkarni, R. Vaidya, and M. Bhate, “A comparative study of word embedding techniques to extract features from text,” Turkish Journal of Computer and Mathematics Education (TURCOMAT), vol. 12, no. 12, pp. 3550–3557, 2021.
O. Kwon, D. Kim, S. R. Lee, J. Choi, and S. Lee, “Handling out-of-vocabulary problem in hangeul word embeddings,” in Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021, pp. 3213–3221.
M. Chiny, M. Chihab, O. Bencharef, and Y. Chihab, “LSTM, VADER and TF-IDF based hybrid sentiment analysis model,” International Journal of Advanced Computer Science and Applications, vol. 12, no. 7, pp. 265–275, 2021.
D. Endalie and T. Tegegne, “Designing a hybrid dimension reduction for improving the performance of Amharic news document classification,” PLOS ONE, vol. 16, no. 5, pp. 1–14, 2021.
D. A. Eisa, A. I. Taloba, and S. S. I. Ismail, “A comparative study on using principle component analysis with different text classifiers,” International Journal of Computer Applications, vol. 180, no. 31, pp. 1–6, 2018.
U. A. M. E. Ali, M. A. Hossain, and M. R. Islam, “Analysis of PCA based feature extraction methods for classification of hyperspectral image,” in 2019 2nd International Conference on Innovation in Engineering and Technology (ICIET). IEEE, 2019, pp. 1–6.
M. Alassaf and A. M. Qamar, “Improving sentiment analysis of Arabic tweets by One-Way ANOVA,” Journal of King Saud University-Computer and Information Sciences, vol. 34, no. 6, pp. 2849–2859, 2022.
S. Bahassine, A. Madani, M. Al-Sarem, and M. Kissi, “Feature selection using an improved Chi-square for Arabic text classification,” Journal of King Saud University-Computer and Information Sciences, vol. 32, no. 2, pp. 225–231, 2020.
L. Singh, S. Singh, and N. Aggarwal, “Two-stage text feature selection method for human emotion recognition,” in Proceedings of 2nd International Conference on Communication, Computing and Networking: ICCCN 2018, NITTTR Chandigarh, India. Springer, 2018, pp. 531–538.
C. H. Cheng and H. H. Chen, “Sentimental text mining based on an additional features method for text classification,” PLOS ONE, vol. 14, no. 6, pp. 1–17, 2019.
A. R. Murthy and K. A. Kumar, “A review of different approaches for detecting emotion from text,” in IOP Conference Series: Materials Science and Engineering, vol. 1110. IOP Publishing, 2021, pp. 1–23.
N. Alswaidan and M. E. B. Menai, “A survey of state-of-the-art approaches for emotion recognition in text,” Knowledge and Information Systems, vol. 62, pp. 2937–2987, 2020.
M. S. Saputri, R. Mahendra, and M. Adriani, “Emotion classification on Indonesian Twitter dataset,” in 2018 International Conference on Asian Language Processing (IALP). IEEE, 2018, pp. 90–95.
E. Cambria, A. Livingstone, and A. Hussain, “The hourglass of emotions,” in Cognitive behavioural systems. Lecture notes in computer science. Springer, 2012.
P. R. Shaver, U. Murdaya, and R. C. Fraley, “Structure of the Indonesian emotion lexicon,” Asian Journal of Social Psychology, vol. 4, no. 3, pp. 201–224, 2001.
I. Dongo, Y. Cadinale, A. Aguilera, F. Mart´ınez, Y. Quintero, and S. Barrios, “Web scraping versus Twitter API: A comparison for a credibility analysis,” in Proceedings of the 22nd International Conference on Information Integration and Web-Based Applications & Services, 2020, pp. 263–273.
K. M. Ridhwan and C. A. Hargreaves, “Leveraging Twitter data to understand public sentiment for the COVID-19 outbreak in Singapore,” International Journal of Information Management Data Insights, vol. 1, no. 2, pp. 1–15, 2021.
B. Gaind, V. Syal, and S. Padgalwar, “Emotion detection and analysis on social media,” 2019. [Online]. Available: http: //arxiv.org/abs/1901.08458
H. T. Duong and T. A. Nguyen-Thi, “A review: Preprocessing techniques and data augmentation for sentiment analysis,” Computational Social Networks, vol. 8, no. 1, pp. 1–16, 2021.
S. Sarica and J. Luo, “Stopwords in technical language processing,” PLOS ONE, vol. 16, no. 8, pp. 1–13, 2021.
H. Young, “The digital language divide: How does the language you speak shape your experience of the Internet?” [Online]. Available: http: //labs.theguardian.com/digital-language-divide/
R. Chandra and A. Krishna, “COVID-19 sentiment analysis via deep learning during the rise of novel cases,” PLOS ONE, vol. 16, no. 8, pp. 1–26, 2021.
Y. Li and T. Yang, “Word embedding for understanding natural language: A survey,” Guide to Big Data Applications, pp. 83–104, 2018.
F. K. Khattak, S. Jeblee, C. Pou-Prom, M. Abdalla, C. Meaney, and F. Rudzicz, “A survey of word embeddings for clinical text,” Journal of Biomedical Informatics, vol. 100, pp. 1–18, 2019.
P. Bojanowski, E. Grave, A. Joulin, and T. Mikolov, “Enriching word vectors with subword information,” Transactions of the Association for Computational Linguistics, vol. 5, pp. 135–146, 2017.
A. Madasu and S. Elango, “Efficient feature selection techniques for sentiment analysis,” Multimedia Tools and Applications, vol. 79, pp. 6313–6335, 2020.
E. J. Cushion, J. Warmenhoven, J. S. North, and D. J. Cleather, “Principal component analysis reveals the proximal to distal pattern in vertical jumping is governed by two functional degrees of freedom,” Frontiers in Bioengineering and Biotechnology, vol. 7, pp. 1–11, 2019.
K. J. Johnson and R. E. Synovec, “Pattern recognition of jet fuels: Comprehensive GC× GC with ANOVA-based feature selection and principal component analysis,” Chemometrics and Intelligent Laboratory Systems, vol. 60, no. 1-2, pp. 225–237, 2002.
G. Forman, “An extensive empirical study of feature selection metrics for text classification,” Journal of Machine Learning Research, vol. 3, pp. 1289–1305, 2003.
S. Chatterjee, K. Chakrabarti, A. Garain, F. Schwenker, and R. Sarkar, “JUMRv1: A sentiment analysis dataset for movie recommendation,” Applied Sciences, vol. 11, no. 20, pp. 1–24, 2021.
E. M. Imah, F. Al Afif, M. I. Fanany, W. Jatmiko, and T. Basaruddin, “A comparative study on Daubechies Wavelet Transformation, Kernel PCA and PCA as feature extractors for arrhythmia detection using SVM,” in TENCON 2011-2011 IEEE Region 10 Conference. IEEE, 2011, pp. 5–9.
L. Breiman, “Random forests,” Machine Learning, vol. 45, pp. 5–32, 2001.
V. Y. Kullarni and P. K. Sinha, “Random forest classifier: a survey and future research directions,” International Journal of Advanced Computing, vol. 36, no. 1, pp. 1144–1156, 2013.
M. Allouch, A. Azaria, R. Azoulay, E. Ben-Izchak, M. Zwilling, and D. A. Zachor, “Automatic detection of insulting sentences in conversation,” in 2018 IEEE International Conference on the Science of Electrical Engineering in Israel (ICSEE). IEEE, 2018, pp. 1–4.
M. Z. Ali, K. Javed, E. U. Haq, and A. Tariq, “Sentiment and emotion classification of epidemic related bilingual data from social media,” 2021. [Online]. Available: http://arxiv.org/abs/2105.01468
K. Sailunaz and R. Alhajj, “Emotion and sentiment analysis from twitter text,” Journal of Computational Science, vol. 36, pp. 1–18, 2019.
Downloads
Published
Issue
Section
License
Copyright (c) 2023 Abi Nizar Sutranggono, Elly Matul Imah
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors who publish with this journal agree to the following terms:
a. Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License - Share Alike that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
b. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
c. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.
USER RIGHTS
All articles published Open Access will be immediately and permanently free for everyone to read and download. We are continuously working with our author communities to select the best choice of license options, currently being defined for this journal as follows: Creative Commons Attribution-Share Alike (CC BY-SA)