An Efficient Deep Learning Framework for Revealing the Evolution of Characterization Methods in Nanoscience.

Journal: Nano-micro letters
Published Date:

Abstract

Text mining has emerged as a powerful strategy for extracting domain knowledge structure from large amounts of text data. To date, most text mining methods are restricted to specific literature information, resulting in incomplete knowledge graphs. Here, we report a method that combines citation analysis with topic modeling to describe the hidden development patterns in the history of science. Leveraging this method, we construct a knowledge graph in the field of Raman spectroscopy. The traditional Latent DirichletAllocation model is chosen as the baseline model for comparison to validate the performance of our model. Our method improves the topic coherence with a minimum growth rate of 100% compared to the traditional text mining method. It outperforms the traditional text mining method on the diversity, and its growth rate ranges from 0 to 126%. The results show the effectiveness of rule-based tokenizer we designed in solving the word tokenizer problem caused by entity naming rules in the field of chemistry. It is versatile in revealing the distribution of topics, establishing the similarity and inheritance relationships, and identifying the important moments in the history of Raman spectroscopy. Our work provides a comprehensive tool for the science of science research and promises to offer new insights into the historical survey and development forecast of a research field.

Authors

  • Hui-Cong Duan
    Institute of Artificial Intelligence, Pen-Tung Sah Institute of Micro-Nano Science and Technology, State Key Laboratory of Physical Chemistry of Solid Surfaces, Xiamen University, Xiamen, 361005, People's Republic of China.
  • Long-Xing Lin
    Institute of Artificial Intelligence, Pen-Tung Sah Institute of Micro-Nano Science and Technology, State Key Laboratory of Physical Chemistry of Solid Surfaces, Xiamen University, Xiamen, 361005, People's Republic of China.
  • Ji-Chun Wang
    Institute of Artificial Intelligence, Pen-Tung Sah Institute of Micro-Nano Science and Technology, State Key Laboratory of Physical Chemistry of Solid Surfaces, Xiamen University, Xiamen, 361005, People's Republic of China.
  • Tong-Ruo Diao
    Institute of Artificial Intelligence, Pen-Tung Sah Institute of Micro-Nano Science and Technology, State Key Laboratory of Physical Chemistry of Solid Surfaces, Xiamen University, Xiamen, 361005, People's Republic of China.
  • Sheng-Jie Qiu
    Institute of Artificial Intelligence, Pen-Tung Sah Institute of Micro-Nano Science and Technology, State Key Laboratory of Physical Chemistry of Solid Surfaces, Xiamen University, Xiamen, 361005, People's Republic of China.
  • Bi-Jun Geng
    Institute of Artificial Intelligence, Pen-Tung Sah Institute of Micro-Nano Science and Technology, State Key Laboratory of Physical Chemistry of Solid Surfaces, Xiamen University, Xiamen, 361005, People's Republic of China.
  • Jia Shi
    School of Mechanical Engineering and Automation, Northeastern University, Shenyang, Liaoning, China.
  • Shu Hu
    School of Public Health, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong S.A.R., China.
  • Yang Yang
    Department of Gastrointestinal Surgery, The Third Hospital of Hebei Medical University, Shijiazhuang, China.

Keywords

No keywords available for this article.