Transfer Learning for Sentiment Analysis Using BERT Based Supervised Fine-Tuning.

Journal: Sensors (Basel, Switzerland)
Published Date:

Abstract

The growth of the Internet has expanded the amount of data expressed by users across multiple platforms. The availability of these different worldviews and individuals' emotions empowers sentiment analysis. However, sentiment analysis becomes even more challenging due to a scarcity of standardized labeled data in the Bangla NLP domain. The majority of the existing Bangla research has relied on models of deep learning that significantly focus on context-independent word embeddings, such as Word2Vec, GloVe, and fastText, in which each word has a fixed representation irrespective of its context. Meanwhile, context-based pre-trained language models such as BERT have recently revolutionized the state of natural language processing. In this work, we utilized BERT's transfer learning ability to a deep integrated model CNN-BiLSTM for enhanced performance of decision-making in sentiment analysis. In addition, we also introduced the ability of transfer learning to classical machine learning algorithms for the performance comparison of CNN-BiLSTM. Additionally, we explore various word embedding techniques, such as Word2Vec, GloVe, and fastText, and compare their performance to the BERT transfer learning strategy. As a result, we have shown a state-of-the-art binary classification performance for Bangla sentiment analysis that significantly outperforms all embedding and algorithms.

Authors

  • Nusrat Jahan Prottasha
    Department of Computer Science and Engineering, Daffodil International University, Dhaka 1341, Bangladesh.
  • Abdullah As Sami
    Department of Computer Science & Engineering, Chittagong University of Engineering & Technology, Chattogram 4349, Bangladesh.
  • Md Kowsher
    Department of Computer Science, Stevens Institute of Technology, Hoboken, NJ 07030, USA.
  • Saydul Akbar Murad
    Faculty of Computing, Universiti Malaysia Pahang, Pekan 26600, Malaysia.
  • Anupam Kumar Bairagi
    Computer Science and Engineering Discipline, Khulna University, Khulna 9208, Bangladesh.
  • Mehedi Masud
    Department of Computer Science, Taif University, Taif 21944, Saudi Arabia.
  • Mohammed Baz
    Department of Computer Engineering, College of Computer and Information Technology, Taif University, P.O. Box. 11099, Taif 21994, Saudi Arabia.