An intent classification method for questions in "Treatise on Febrile diseases" based on TinyBERT-CNN fusion model.

Journal: Computers in biology and medicine
Published Date:

Abstract

"Treatise on Febrile Diseases" is an important classic book in the academic history of Chinese material medica. Based on the knowledge map of traditional Chinese medicine established by the study of "Treatise on Febrile Diseases", a question-answering system of traditional Chinese medicine was established to help people better understand and use traditional Chinese medicine. Intention classification is the basis of the question-answering system of traditional Chinese medicine, but as far as we know, there is no research on question intention classification based on "Treatise on Febrile Diseases". In this paper, the intent classification research is carried out based on the Chinese material medica-related content materials in "Treatise on Febrile Diseases" as data. Most of the existing models perform well on long text classification tasks, with high costs and a lot of memory requirements. However, the intent classification data of this paper has the characteristics of short text, a small amount of data, and unbalanced categories. In response to these problems, this paper proposes a knowledge distillation-based bidirectional Transformer encoder combined with a convolutional neural network model (TinyBERT-CNN), which is used for the task of question intent classification in "Treatise on Febrile Diseases". The model used TinyBERT as an embedding and encoding layer to obtain the global vector information of the text and then completed the intent classification by feeding the encoded feature information into the CNN. The experimental results indicated that the model outperformed other models in terms of accuracy, recall, and F1 values of 96.4%, 95.9%, and 96.2%, respectively. The experimental results prove that the model proposed in this paper can effectively classify the intent of the question sentences in "Treatise on Febrile Diseases", and provide technical support for the question-answering system of "Treatise on Febrile Diseases" later.

Authors

  • Helong Yu
    Smart Agriculture Research Institute, Jilin Agricultural University, Changchun, China.
  • Chunliu Liu
    College of Information Technology, Jilin Agricultural University, Changchun, 130118, China. Electronic address: chunliul@163.com.
  • Lina Zhang
    Intensive Care Unit, XiangYa Hospital, Central South University, Changsha, China.
  • Chengwen Wu
    Department of Computer Science and Artificial Intelligence, Wenzhou University, Wenzhou, 325035, China. Electronic address: jsj_wcw@wzu.edu.cn.
  • Guoxi Liang
    Department of Artificial Intelligence, Wenzhou Polytechnic, Wenzhou, 325035, China. Electronic address: guoxiliang2017@gmail.com.
  • José Escorcia-Gutierrez
    Department of Computational Science and Electronics, Universidad de la Costa, CUC, Barranquilla, 080002, Colombia. Electronic address: jescorci56@cuc.edu.co.
  • Osama A Ghoneim
    Faculty of Computers and Information, Tanta University, Egypt. Electronic address: OsamaGhoneim@ics.tanta.edu.eg.