Opinion Mining by Convolutional Neural Networks for Maximizing Discoverability of Nanomaterials.

Journal: Journal of chemical information and modeling
Published Date:

Abstract

The scientific literature contains valuable information that can be used for future applications, but manual analysis presents challenges due to its size and disciplinary boundaries. The prevailing solution involves natural language processing (NLP) techniques such as information retrieval. Nonetheless, existing automated systems primarily provide either statistically based shallow information or deep information without traceability, thereby falling short of delivering high-quality and reliable insights. To address this, we propose an innovative approach of leveraging sentiment information embedded within the literature to track the opinions toward materials. In this study, we integrated material knowledge into text representation and constructed opinion data sets to hierarchically train deep learning models, named as Scientific Sentiment Network (SSNet). SSNet can effectively extract knowledge from the energy material literature and accurately categorize expert opinions into challenges and opportunities (94% and 92% accuracy, respectively). By incorporating sentiment features determined by SSNet, we can predict the ranking of emerging thermoelectric materials with a 70% correlation to experimental outcomes. Furthermore, our model achieves a commendable 68% accuracy in predicting suitable nanomaterials for atomic layer deposition (ALD) over time. These promising results offer a practical framework to extract and synthesize knowledge from the scientific literature, thereby accelerating research in the field of nanomaterials.

Authors

  • Tong Xie
    Department of Food Science and Technology, Shanghai Institute of Technology, Shanghai 201418, China.
  • Yuwei Wan
    Department of Linguistics and Translation, City University of Hong Kong, 83 Tat Chee Ave, Kowloon Tong, Hong Kong.
  • Haoran Wang
    Department of Urology, The Fifth Affiliated Hospital of Guangzhou Medical University, Guangzhou, 510700, China.
  • Ina Østrøm
    School of Photovoltaic and Renewable Energy Engineering, University of New South Wales, Kensington, NSW 2052, Australia.
  • Shaozhou Wang
    School of Photovoltaic and Renewable Energy Engineering, University of New South Wales, Kensington, NSW 2052, Australia.
  • Mingrui He
    School of Photovoltaic and Renewable Energy Engineering, University of New South Wales, Kensington, NSW 2052, Australia.
  • Rong Deng
    Department of Radiology, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, NO. 197 Ruijin Er Road, Shanghai 200025, China.
  • Xinyuan Wu
    School of Photovoltaic and Renewable Energy Engineering, University of New South Wales, Kensington, NSW 2052, Australia.
  • Clara Grazian
    DARE ARC Training Centre in Data Analytics for Resources and Environments, South Eveleigh, NSW 2015, Australia.
  • Chunyu Kit
    Department of Linguistics and Translation, City University of Hong Kong, 83 Tat Chee Ave, Kowloon Tong, Hong Kong.
  • Bram Hoex
    School of Photovoltaic and Renewable Energy Engineering, University of New South Wales, Kensington, NSW 2052, Australia.