Sentiment Analysis of Social Network Contents using Machine Learning Algorithms: A Review

Mohamed Omar; Ahmed Salah; Mahmoud Mahdy

Authors

Mohamed Omar Department of Computer Science, Faculty of Computer and Informatics, Zagazig University, Zagazig 44519, Egypt
Ahmed Salah Department of Computer Science, Faculty of Computer and Informatics, Zagazig University, Zagazig 44519, Egypt
Mahmoud Mahdy Department of Computer Science, Faculty of Computer and Informatics, Zagazig University, Zagazig 44519, Egypt

Keywords:

Digital Humanities, Classification, Topic Analysis, Sentimentant Analysis, LDA Topic Modeling, TF-IDF

Abstract

The exponential growth of social media spaces has resulted in a previously unimaginable amount of user-generated content, which can be used to identify public opinion, sentiment, and trends. Sentiment analysis is an area of the natural language processing (NLP), data science, and machine learning literature that describes the identification, extraction, and analytical processes of subjective information and emotional tone within a document or text. The incorporation of machine learning algorithms into social networking content can reveal significant information of interest to businesses, political analysis, public health, and research in social sciences. Although text in social media environments has the same functions as other textual forms, the text presents unique challenges associated with its brevity, informality, use of slang and abbreviations, use of emojis, sarcasm, irony, etc. and other challenges associated with the fast-paced, constantly updating nature of social media. These challenges are addressed in a specialized way to use social media texts comparatively with other texts.
The goal of this article is to review the literature associated with social network content sentiment analysis, including the use of machine learning algorithms to classify sentiment. We will emphasize the unique characteristics of Twitter as a social media space, as well as the associated NLP focused preprocessing approaches to address noisy, informal, and platform-specific elements associated with the content. The review will include examples of traditional methods like Bag-of-Words and TF-IDF, features increasingly used into deeper methods like Word Embedding and in using dictionaries like sentiment lexicons and discuss their principles, as well as advantages and disadvantages when using these methods in social media. We will also include several of the most commonly used machine learning models used for sentiment classification. This review will also include traditional models like Naive Bayes, SVMs, and other ensemble methods used, as well as deep learning approaches utilizing CNNs, RNNs, and Transformers. Finally, we're going to discuss the nature and importance of the rigor of evaluation to determine sentiment classification correctness. We will discuss an evaluate using metrics like Accuracy, Precision, Recall, F1-Score, AUC-ROC, Cohen's Kappa and discuss important methods to evaluate classification validity like data spitting, cross-validation, and evaluate imbalanced datasets. We will explore, use, and evaluate of measures for sentiment discernment inherent in social media language, which will assist in an understand of how to manipulate behaviors of the machine learning for social media space, although this is a challenging task and domain.

Downloads

Download data is not yet available.

Sentiment Analysis of Social Network Contents using Machine Learning Algorithms: A Review

Authors

Keywords:

Abstract

Downloads

Downloads

Published

How to Cite

Issue

Section

License

Most read articles by the same author(s)

Make a Submission

Journal Information

Information

Volumes

Info. for Authors and Subscribers

Visitors