Detection of Fake News Text Classification on COVID-19 Using Deep Learning Approaches.

Journal: Computational and mathematical methods in medicine
PMID:

Abstract

A vast amount of data is generated every second for microblogs, content sharing via social media sites, and social networking. Twitter is an essential popular microblog where people voice their opinions about daily issues. Recently, analyzing these opinions is the primary concern of Sentiment analysis or opinion mining. Efficiently capturing, gathering, and analyzing sentiments have been challenging for researchers. To deal with these challenges, in this research work, we propose a highly accurate approach for SA of fake news on COVID-19. The fake news dataset contains fake news on COVID-19; we started by data preprocessing (replace the missing value, noise removal, tokenization, and stemming). We applied a semantic model with term frequency and inverse document frequency weighting for data representation. In the measuring and evaluation step, we applied eight machine-learning algorithms such as Naive Bayesian, Adaboost, -nearest neighbors, random forest, logistic regression, decision tree, neural networks, and support vector machine and four deep learning CNN, LSTM, RNN, and GRU. Afterward, based on the results, we boiled a highly efficient prediction model with python, and we trained and evaluated the classification model according to the performance measures (confusion matrix, classification rate, true positives rate...), then tested the model on a set of unclassified fake news on COVID-19, to predict the sentiment class of each fake news on COVID-19. Obtained results demonstrate a high accuracy compared to the other models. Finally, a set of recommendations is provided with future directions for this research to help researchers select an efficient sentiment analysis model on Twitter data.

Authors

  • Waqas Haider Bangyal
    Department of Computer Science, University of Gujrat, Pakistan.
  • Rukhma Qasim
    Department of Computer Science, University of Gujrat, Pakistan.
  • Najeeb Ur Rehman
    Natural and Medical Sciences Research Center, University of Nizwa, Nizwa, Oman.
  • Zeeshan Ahmad
  • Hafsa Dar
    Department of Software Engineering, University of Gujrat, Pakistan.
  • Laiqa Rukhsar
    Department of Computer Science, University of Gujrat, Pakistan.
  • Zahra Aman
    Department of Computer Science, University of Gujrat, Pakistan.
  • Jamil Ahmad
    College of Software and Convergence Technology, Department of Software, Sejong University, Seoul, Republic of Korea.