Sentiment analysis for deepfake X posts using novel transfer learning based word embedding and hybrid LGR approach.
Journal:
Scientific reports
Published Date:
Aug 3, 2025
Abstract
With the growth of social media, people are sharing more content than ever, including X posts that reflect a variety of emotions and opinions. AI-generated synthetic text, known as deepfake text, is used to imitate human writing to disseminate misleading information and fake news. However, as deepfake technology continues to grow, it becomes harder to accurately understand people's opinions on deepfake posts. Existing sentiment analysis algorithms frequently fail to capture the domain-specific, misleading, and context-sensitive characteristics of deepfake-related content. This study proposes a hybrid deep learning (DL) approach and novel transfer learning (TL)-based feature extraction approach for deepfake posts' sentiment analysis. The transfer learning-based approach combines the strengths of the hybrid DL technique to capture global and local contextual information. In this study, we compare the proposed approach with a range of machine learning algorithms, as well as, DL techniques for validation. Different feature extraction techniques, such as a bag of words (BOW), term frequency-inverse document frequency (TF-IDF), word embedding features, and novel TL features that combine the LSTM and DT, are used to build the models. The ML models are fine-tuned with extensive hyperparameter tuning to enhance performance and efficiency. The sentiment analysis performance of each applied method is validated using the k-fold cross-validation. The experimental results indicate that the proposed LGR (LSTM+GRU+RNN) approach with novel TL features performs well with a 99% accuracy. The proposed approach helps detect and prevent the spread of deepfake content, keeping people and organizations safe from its negative effects. This study covers a crucial gap in evaluating deepfake-specific social media sentiment by providing a comprehensive, scalable mechanism for monitoring and reducing the effect of fake content online.