Investigation of Different Machine Learning Algorithms to Determine Human Sentiment Using Twitter Data

Full Text (PDF, 368KB), PP.38-48

Views: 0 Downloads: 0

Author(s)

Golam Mostafa 1,* Ikhtiar Ahmed 2 Masum Shah Junayed 3

1. East West University, Dhaka, Bangladesh

2. Daffodil International University, Dhaka, Bangladesh

3. Bahcesehir University, Istanbul, Turkey

* Corresponding author.

DOI: https://doi.org/10.5815/ijitcs.2021.02.04

Received: 17 Jul. 2020 / Revised: 11 Oct. 2020 / Accepted: 26 Feb. 2021 / Published: 8 Apr. 2021

Index Terms

Sentiment Analysis, Tweet, Twitter, Sentiment, Social Media, Machine learning, Natural Language Processing

Abstract

In recent years, with the advancement of the internet, social media is a promising platform to explore what going on around the world, sharing opinions and personal development. Now, Sentiment analysis, also known as text mining is widely used in the data science sector. It is an analysis of textual data that describes subjective information available in the source and allows an organization to identify the thoughts and feelings of their brand or goods or services while monitoring conversations and reviews online. Sentiment analysis of Twitter data is a very popular research work nowadays. Twitter is that kind of social media where many users express their opinion and feelings through small tweets and different machine learning classifier algorithms can be used to analyze those tweets. In this paper, some selected machine learning classifier algorithms were applied on crawled Twitter data after applying different types of preprocessors and encoding techniques, which ended up with satisfying accuracy. Later a comparison between the achieved accuracies was showed. Experimental evaluations show that the Neural Network Classifier' algorithm provides a remarkable accuracy of 81.33% compared with other classifiers.

Cite This Paper

Golam Mostafa, Ikhtiar Ahmed, Masum Shah Junayed, "Investigation of Different Machine Learning Algorithms to Determine Human Sentiment Using Twitter Data", International Journal of Information Technology and Computer Science(IJITCS), Vol.13, No.2, pp.38-48, 2021. DOI:10.5815/ijitcs.2021.02.04

Reference

[1] A. Tripathy, A. Agrawal, S K. Rath “Classification of Sentimental Reviews Using Machine Learning Techniques”, Procedia Computer Science: 57(2015)821-829, Issue: 2015.
[2] I. Chatruvedi, E. Cambria, Roy E. Welsch, Francisco Herrera “Distinguishing between facts and opinions for sentimental analysis: Survey and Challenges”, Information Fusion: 44(2018)65-77, Issue:2018.
[3] S. Siddharth, R. Darsini, Dr. Sujithra “Sentiment Analysis on Twitter Data Using Machine Learning Algorithms in Python”, Vol.5, Issue: 2 February, 2018. Twitter Uses Statistics https://www.internetlivestats.com/twitter-statistics/, Last Accessed: 08 July, 2020.
[4] Sentiment Analysis https://monkeylearn.com/sentiment-analysis/, Last Accessed: 08 July, 2020.
[5] S. Bhargava, S. Choudhary “BEHAVIORAL ANALYSIS OF DEPRESSED SENTIMENTAL OVER TWITTER: BASED ON SUPERVISED MACHINE LERANING APROACH”, Issue: 2018.
[6] A. Hasan, S. Moin, A. Karim, S. Shamshirband “Machine Learning-Based Sentiment Analysis for Twitter Accounts”, Issue: 27 February, 2018.
[7] A. Alsaeedi, M Z. Khan “A Study on Sentiment Analysis Techniques of Twitter Data”, Vol. 10, Issue: February 2019.
[8] Shuo Xu “Bayesian Naïve Bayes Classifiers to text Classification”, Vol.44, Issue:2018.
[9] J. Samuel, G.C.Md. Nawaz Ali, Md. Mokhlesur Rahman, Ek Esawi and Yana Samuel “COVID-19 Public Sentiment Insight and Machine Learning for Tweets Classification”, Issue: 11 July, 2020.
[10] Types of Classification “https://analyticsindiamag.com/7-types-classification-algorithms/” Last Accessed: 08 July, 2020
[11] Understanding Support Vector Machine (SVM) algorithm from examples “https://www.analyticsvidhya.com/blog/2017/09/understaing-support-vector-machine-example-code/”, Last Accessed: 09 July, 2020.
[12] Classification using neural Networks “https://towardsdatascience.com/classification-using-neural-networks-b8e98f3a904f”, Last Accessed: 09 July, 2020.
[13] Logistic Regression “https://www.statisticssolutions.com/what-is-logistic-regression/”, Last Accessed: 09 July, 2020
[14] Text Analytics Functions “https://www.lexalytics.com/lexablog/text-analytics-functions-explained”, Last Accessed: 10 July, 2020.
[15] Tokenization “https://auricsystems.com/info-pages/tokenize-what-matters”, Last Accessed: 10 July, 2020.
[16] Lemmatization “https://www.machinelearningplus.com/nlp/lemmatization-examples-python/”, Last Accessed: 10 July, 2020
[17] Stemming and Lemmatization “https://nlp.stanford.edu/IR-book/html/htmledition/stemming-and-lemmatization-1.html”, Last Accessed: 10 July, 2020.
[18] Shakeel Ahmad, Muhammad Zubair Asghar, Fahad M. Alotaibi, Irfanullah Awan “Detection and Classification of Social Media-based Extremist Affiliations Using Sentiment Analysis Techniques”, Human-centric Computing and Information Sciences, Issue: 2019.
[19] Machine Learning Techniques “https://www.edureka.co/blog/classification-in-machine-learning/” Last Accessed: 08 July, 2020.