Emotion Intensity Value Prediction with Machine Learning Approach on Twitter
DOI:
https://doi.org/10.21512/commit.v17i2.8503Keywords:
Emotion Intensity, Value Prediction, Machine Learning Approach, TwitterAbstract
Recognizing the intensity of the emotions is a paramount task for an affective system. By recognizing the intensity of the emotions, the system can have better human-computer interaction. The research explores several machine learning approaches with several different feature extraction method combinations to solve the emotion intensity prediction task while also analyzing and comparing it with several previous related papers. The research uses the dataset provided through theWASSA 2017 and SemEval 2018 competition. The dataset utilizes four of the eight basic emotions that Plutchik defines (anger, fear, joy, and sadness). The total data result in 19,736 rows of entry, with a total of 10,715 (54.3%) for training, 1,811 (9.17%) for validation, and 7,210 (36.53%) for testing. Three feature extraction methods are used and compared: N-gram, TFIDF, and Bag-of-Words. Meanwhile, machine learning algorithms are Linear Regression, Ridge Regression, KNearest Neighbor for Regression, Regression Tree, and Support Vector Regression (SVR). The results show that SVR with TF-IDF features has the best result of all attempted experiments, with a Pearson correlation score of 0.755 for all data and 0.647 for gold labels data. The final model also accepts newly seen data and displays the corresponding emotion label and intensity.
Plum Analytics
References
Y. Wang, W. Song, W. Tao, A. Liotta, D. Yang, X. Li, S. Gao, Y. Sun, W. Ge, W. Zhang, and W. Zhang, “A systematic review on affective computing: Emotion models, databases, and recent advances,” Information Fusion, vol. 83, pp. 19–52, 2022.
N. Mejbri, F. Essalmi, M. Jemni, and B. A. Alyoubi, “Trends in the use of affective computing in e-learning environments,” Education and Information Technologies, vol. 27, pp. 3867–3889, 2022.
S. B. I. Badia, L. V. Quintero, M. S. Cameirao, A. Chirico, S. Triberti, P. Cipresso, and A. Gaggioli, “Toward emotionally adaptive virtual reality for mental health applications,” IEEE Journal of Biomedical and Health Informatics, vol. 23, no. 5, pp. 1877–1887, 2018.
M. Heo and K. J. Lee, “Chatbot as a new business communication tool: The case of naver talktalk,” Business Communication Research and Practice, vol. 1, no. 1, pp. 41–45, 2018.
A. Kumar and A. Jaiswal, “Systematic literature review of sentiment analysis on twitter using soft computing techniques,” Concurrency and Computation: Practice and Experience, vol. 32, no. 1, 2020.
S. Mohammad and F. Bravo-Marquez, “Emotion intensities in Tweets,” in Proceedings of the 6th Joint Conference on Lexical and Computational Semantics (SEM 2017). Vancouver, Canada: Association for Computational Linguistics, 2017, pp. 65–77.
——, “WASSA-2017 shared task on emotion intensity,” in Proceedings of the 8th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis. Copenhagen, Denmark: Association for Computational Linguistics, 2017, pp. 34–49.
S. Mohammad, F. Bravo-Marquez, M. Salameh, and S. Kiritchenko, “SemEval-2018 task 1: Affect in Tweets,” in Proceedings of the 12th International Workshop on Semantic Evaluation. New Orleans, Louisiana: Association for Computational Linguistics, 2018, pp. 1–17.
P. Goel, D. Kulshreshtha, P. Jain, and K. K. Shukla, “Prayas at EmoInt 2017: An ensemble of deep neural architectures for emotion intensity prediction in Tweets,” in Proceedings of the 8th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis. Copenhagen, Denmark: Association for Computational Linguistics, 2017, pp. 58–65.
V. Duppada and S. Hiray, “Seernet at EmoInt-2017: Tweet emotion intensity estimator,” in Proceedings of the 8th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis. Copenhagen, Denmark: Association for Computational Linguistics, 2017, pp. 205–211.
V. Andryushechkin, I. Wood, and J. O’Neill, “NUIG at EmoInt-2017: BiLSTM and SVR ensemble to detect emotion intensity,” in Proceedings of the 8th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis. Copenhagen, Denmark: Association for Computational Linguistics, 2017, pp. 175–179.
J. Devlin, M. W. Chang, K. Lee, and K. Toutanova, “BERT: Pre-training of deep bidirectional transformers,” in Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Minneapolis, Minnesota: Association for Computational Linguistics, 2019, pp. 4171–4186.
M. Garc´ıa, S. Maldonado, and C. Vairetti, “Efficient n-gram construction for text categorization using feature selection techniques,” Intelligent Data Analysis, vol. 25, no. 3, pp. 509–525, 2021.
Z. Jiang, B. Gao, Y. He, Y. Han, P. Doyle, and Q. Zhu, “Text classification using novel term weighting scheme-based improved TF-IDF for Internet media reports,” Mathematical Problems in Engineering, vol. 2021, pp. 1–30, 2021.
T. O. Olaleye, O. T. Arogundade, A. Abayomi-Alli, and A. K. Adesemowo, “An ensemble predictive analytics of COVID-19 infodemic tweets using bag of words,” in Data science for COVID-19. Elsevier, 2021, pp. 365–380.
F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, and D. Cournapeau, “Scikitlearn: Machine learning in Python,” The Journal of Machine Learning Research, vol. 12, pp. 2825–2830, 2011.
I. Syarif, A. Prugel-Bennett, and G. Wills, “SVM parameter optimization using grid search and genetic algorithm to improve classification performance,” TELKOMNIKA (Telecommunication Computing Electronics and Control), vol. 14, no. 4, pp. 1502–1509, 2016.
M. A. Mohsin and A. Beltiukov, “Summarizing emotions from text using Plutchik’s wheel of emotions,” in 7th Scientific Conference on Information Technologies for Intelligent Decision Making Support (ITIDS 2019). Ufa, Russia: Atlantis Press, May 28–29, 2019, pp. 291–294.
Downloads
Published
Issue
Section
License
Copyright (c) 2023 Rindy Claudia Setiawan, Andry Chowanda
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors who publish with this journal agree to the following terms:
a. Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License - Share Alike that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
b. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
c. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.
USER RIGHTS
All articles published Open Access will be immediately and permanently free for everyone to read and download. We are continuously working with our author communities to select the best choice of license options, currently being defined for this journal as follows: Creative Commons Attribution-Share Alike (CC BY-SA)