Explainable hybrid word representations for sentiment analysis of financial news.

Journal: Neural networks : the official journal of the International Neural Network Society
Published Date:

Abstract

Due to the increasing interest of people in the stock and financial market, the sentiment analysis of news and texts related to the sector is of utmost importance. This helps the potential investors in deciding what company to invest in and what are their long-term benefits. However, it is challenging to analyze the sentiments of texts related to the financial domain, given the enormous amount of information available. The existing approaches are unable to capture complex attributes of language such as word usage, including semantics and syntax throughout the context, and polysemy in the context. Further, these approaches failed to interpret the models' predictability, which is obscure to humans. Models' interpretability to justify the predictions has remained largely unexplored and has become important to engender users' trust in the predictions by providing insight into the model prediction. Accordingly, in this paper, we present an explainable hybrid word representation that first augments the data to address the class imbalance issue and then integrates three embeddings to involve polysemy in context, semantics, and syntax in a context. We then fed our proposed word representation to a convolutional neural network (CNN) with attention to capture the sentiment. The experimental results show that our model outperforms several baselines of both classic classifiers and combinations of various word embedding models in the sentiment analysis of financial news. The experimental results also show that the proposed model outperforms several baselines of word embeddings and contextual embeddings when they are separately fed to a neural network model. Further, we show the explainability of the proposed method by presenting the visualization results to explain the reason for a prediction in the sentiment analysis of financial news.

Authors

  • Surabhi Adhikari
    Department of Computer Science and Engineering, Delhi Technological University, New Delhi, India.
  • Surendrabikram Thapa
    Department of Computer Science, Virginia Polytechnic Institute and State University, Blacksburg, VA, USA.
  • Usman Naseem
    School of Computer Science, The University of Sydney, Sydney, Australia. usman.naseem@sydney.edu.au.
  • Hai Ya Lu
    School of Computer Science, University of Technology Sydney, Sydney, Australia.
  • Gnana Bharathy
    School of Computer Science, University of Technology Sydney, Sydney, Australia.
  • Mukesh Prasad
    Centre for Artificial Intelligence, School of Software, Faculty of Engineering and Technology, University of Technology Sydney, Sydney, Australia.