Machine Learning Based Emotional Sentiment Analysis of Tweet Data Using a Voting Classifier

Main Article Content

G. Deepika
K. Deepthi Reddy

Abstract

The introduction of social media and microblogging sites to the World Wide Web was a significant advancement. The website functioned as a place for users to express their opinions and feelings on a variety of problems. The Internet has bloomed into a viable platform for online education, information distribution, and the expression of varied opinions since the dawn of the social networking age. Social media networks contain a multitude of sentiment data in the form of tweets, blogs, status updates, articles, and so on. This study takes advantage of Twitter, the most popular microblogging network. Sentiment analysis is used to derive user views and sentiments from Twitter data (tweets). Some of the seven machine learning models presently employed by the existing system to categorise tweets into happy or sad categories include SVM, DTC, NB, RF, GBM, LR, VC (LR+SGD), and VC (LR+SGD). Following a thorough performance comparison, it was discovered that the voting classifier (LR-SGD) in conjunction with the topic-based information content index (TF-IDF) produces the best results, with an F1 score of 81% and an accuracy of 79%. The suggested system consists of combining the LR, RF, NB, and SVM voting classifiers with the TF-IDF model, which produces 94% accuracy, and the COUNT vectorization model, which yields 95% accuracy. The findings might help governments and companies improve the execution of programmes, goods, and events.

Article Details

How to Cite
[1]
G. Deepika and K. Deepthi Reddy, “Machine Learning Based Emotional Sentiment Analysis of Tweet Data Using a Voting Classifier”, Int. J. Comput. Eng. Res. Trends, vol. 9, no. 10, pp. 193–200, Oct. 2022.
Section
Research Articles

References

Bhumika Gupta and Monika Negi, Kanika Vishwakarma, Goldi Rawat, Priyanka Badhani ,“Study of Twitter Sentiment Analysis using Machine Learning Algorithms on Python,”. In International Journal of Computer Applications (0975 – 8887) Volume 165 – No.9, May 2017

Ankita and Nabizath Saleena, ‘‘An Ensemble Classification System for Twitter Sentiment Analysis,’’ in International Conference on Computational Intelligence and Data Science (ICCIDS 2018)

Roza H. Hama Aziz and Nazife Dimililer, ‘‘Twitter Sentiment Analysis using an Ensemble Weighted Majority Vote Classifier, Third International Conference on Advanced Science and Engineering (ICOASE2020)

Chaudhary Jagrit Varshney , Dr. Ashish Sharma and Dhirendra Prasad Yadav, ‘‘Sentiment Analysis using Ensemble Classification Technique,’’ IEEE ,2020

Anam Yousaf , Muhammad Umer , Saima Sadiq , Saleem Ullah, Seyefali Mirjalili(Senior Member, IEEE), Vainhav Rupapara , and Michele Nappi, (Senior Member, IEEE) , “Emotion Recognition by Textual Tweets Classification Using Voting Classifier (LR-SGD)”,IEEE,2021

N. F. F. da Silva, E. R. Hruschka, and E. R. Hruschka, ‘‘Tweet sentiment analysis with classifier ensembles,’’ Decis. Support Syst., vol. 66, pp. 170–179, Oct. 2014.

C. Kariya and P. Khodke, ‘‘Twitter sentiment analysis,’’ in Proc. Int. Conf. Emerg. Technol. (INCET), Jun. 2020, pp. 212–216.

A. Alsaeedi and M. Zubair, ‘‘A study on sentiment analysis techniques of Twitter data,’’ Int. J. Adv. Comput. Sci. Appl., vol. 10, no. 2, pp. 361–374, 2019.

A. Bandhakavi, N. Wiratunga, D. Padmanabhan, and S. Massie, ‘‘Lexicon based feature extraction for emotion text classification,’’ Pattern Recognit. Lett., vol. 93, pp. 133–142, Jul. 2017

H. Hakh, I. Aljarah, and B. Al-Shboul, ‘‘Online social media-based sentiment analysis for us airline companies,’’ in New Trends in Information Technology. Amman, Jordan: Univ. of Jordan, Apr. 2017.

R. Xia, C. Zong, and S. Li, ‘‘Ensemble of feature sets and classification algorithms for sentiment classification,’’ Inf. Sci., vol. 181, no. 6, pp. 1138–1152, Mar. 2011.

M. Umer, S. Sadiq, M. Ahmad, S. Ullah, G. S. Choi, and A. Mehmood, ‘‘A novel stacked CNN for malarial parasite detection in thin blood smear images,’’ IEEE Access, vol. 8, pp. 93782–93792, 2020.