Emotion Intensity Value Prediction with Machine Learning Approach on Twitter

Rindy Claudia Setiawan; Andry Chowanda

doi:10.21512/commit.v17i2.8503

Authors

Rindy Claudia Setiawan Bina Nusantara University
Andry Chowanda Bina Nusantara University

DOI:

https://doi.org/10.21512/commit.v17i2.8503

Keywords:

Emotion Intensity, Value Prediction, Machine Learning Approach, Twitter

Abstract

Recognizing the intensity of the emotions is a paramount task for an affective system. By recognizing the intensity of the emotions, the system can have better human-computer interaction. The research explores several machine learning approaches with several different feature extraction method combinations to solve the emotion intensity prediction task while also analyzing and comparing it with several previous related papers. The research uses the dataset provided through theWASSA 2017 and SemEval 2018 competition. The dataset utilizes four of the eight basic emotions that Plutchik defines (anger, fear, joy, and sadness). The total data result in 19,736 rows of entry, with a total of 10,715 (54.3%) for training, 1,811 (9.17%) for validation, and 7,210 (36.53%) for testing. Three feature extraction methods are used and compared: N-gram, TFIDF, and Bag-of-Words. Meanwhile, machine learning algorithms are Linear Regression, Ridge Regression, KNearest Neighbor for Regression, Regression Tree, and Support Vector Regression (SVR). The results show that SVR with TF-IDF features has the best result of all attempted experiments, with a Pearson correlation score of 0.755 for all data and 0.647 for gold labels data. The final model also accepts newly seen data and displays the corresponding emotion label and intensity.

Dimensions

Plum Analytics

Author Biographies

Rindy Claudia Setiawan, Bina Nusantara University

Computer Science Department, BINUS Graduate Program â€“ Master of Computer Science

Andry Chowanda, Bina Nusantara University

Computer Science Department, School of Computer Science

References

Y. Wang, W. Song, W. Tao, A. Liotta, D. Yang, X. Li, S. Gao, Y. Sun, W. Ge, W. Zhang, and W. Zhang, â€œA systematic review on affective computing: Emotion models, databases, and recent advances,â€ Information Fusion, vol. 83, pp. 19â€“52, 2022.

N. Mejbri, F. Essalmi, M. Jemni, and B. A. Alyoubi, â€œTrends in the use of affective computing in e-learning environments,â€ Education and Information Technologies, vol. 27, pp. 3867â€“3889, 2022.

S. B. I. Badia, L. V. Quintero, M. S. Cameirao, A. Chirico, S. Triberti, P. Cipresso, and A. Gaggioli, â€œToward emotionally adaptive virtual reality for mental health applications,â€ IEEE Journal of Biomedical and Health Informatics, vol. 23, no. 5, pp. 1877â€“1887, 2018.

M. Heo and K. J. Lee, â€œChatbot as a new business communication tool: The case of naver talktalk,â€ Business Communication Research and Practice, vol. 1, no. 1, pp. 41â€“45, 2018.

A. Kumar and A. Jaiswal, â€œSystematic literature review of sentiment analysis on twitter using soft computing techniques,â€ Concurrency and Computation: Practice and Experience, vol. 32, no. 1, 2020.

S. Mohammad and F. Bravo-Marquez, â€œEmotion intensities in Tweets,â€ in Proceedings of the 6th Joint Conference on Lexical and Computational Semantics (SEM 2017). Vancouver, Canada: Association for Computational Linguistics, 2017, pp. 65â€“77.

â€”â€”, â€œWASSA-2017 shared task on emotion intensity,â€ in Proceedings of the 8th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis. Copenhagen, Denmark: Association for Computational Linguistics, 2017, pp. 34â€“49.

S. Mohammad, F. Bravo-Marquez, M. Salameh, and S. Kiritchenko, â€œSemEval-2018 task 1: Affect in Tweets,â€ in Proceedings of the 12th International Workshop on Semantic Evaluation. New Orleans, Louisiana: Association for Computational Linguistics, 2018, pp. 1â€“17.

P. Goel, D. Kulshreshtha, P. Jain, and K. K. Shukla, â€œPrayas at EmoInt 2017: An ensemble of deep neural architectures for emotion intensity prediction in Tweets,â€ in Proceedings of the 8th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis. Copenhagen, Denmark: Association for Computational Linguistics, 2017, pp. 58â€“65.

V. Duppada and S. Hiray, â€œSeernet at EmoInt-2017: Tweet emotion intensity estimator,â€ in Proceedings of the 8th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis. Copenhagen, Denmark: Association for Computational Linguistics, 2017, pp. 205â€“211.

V. Andryushechkin, I. Wood, and J. Oâ€™Neill, â€œNUIG at EmoInt-2017: BiLSTM and SVR ensemble to detect emotion intensity,â€ in Proceedings of the 8th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis. Copenhagen, Denmark: Association for Computational Linguistics, 2017, pp. 175â€“179.

J. Devlin, M. W. Chang, K. Lee, and K. Toutanova, â€œBERT: Pre-training of deep bidirectional transformers,â€ in Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Minneapolis, Minnesota: Association for Computational Linguistics, 2019, pp. 4171â€“4186.

M. GarcÂ´Ä±a, S. Maldonado, and C. Vairetti, â€œEfficient n-gram construction for text categorization using feature selection techniques,â€ Intelligent Data Analysis, vol. 25, no. 3, pp. 509â€“525, 2021.

Z. Jiang, B. Gao, Y. He, Y. Han, P. Doyle, and Q. Zhu, â€œText classification using novel term weighting scheme-based improved TF-IDF for Internet media reports,â€ Mathematical Problems in Engineering, vol. 2021, pp. 1â€“30, 2021.

T. O. Olaleye, O. T. Arogundade, A. Abayomi-Alli, and A. K. Adesemowo, â€œAn ensemble predictive analytics of COVID-19 infodemic tweets using bag of words,â€ in Data science for COVID-19. Elsevier, 2021, pp. 365â€“380.

F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, and D. Cournapeau, â€œScikitlearn: Machine learning in Python,â€ The Journal of Machine Learning Research, vol. 12, pp. 2825â€“2830, 2011.

I. Syarif, A. Prugel-Bennett, and G. Wills, â€œSVM parameter optimization using grid search and genetic algorithm to improve classification performance,â€ TELKOMNIKA (Telecommunication Computing Electronics and Control), vol. 14, no. 4, pp. 1502â€“1509, 2016.

M. A. Mohsin and A. Beltiukov, â€œSummarizing emotions from text using Plutchikâ€™s wheel of emotions,â€ in 7th Scientific Conference on Information Technologies for Intelligent Decision Making Support (ITIDS 2019). Ufa, Russia: Atlantis Press, May 28â€“29, 2019, pp. 291â€“294.