Chinese Clinical Named Entity Recognition With Segmentation Synonym Sentence Synthesis Mechanism: Algorithm Development and Validation.

Journal: JMIR medical informatics
Published Date:

Abstract

BACKGROUND: Clinical named entity recognition (CNER) is a fundamental task in natural language processing used to extract named entities from electronic medical record texts. In recent years, with the continuous development of machine learning, deep learning models have replaced traditional machine learning and template-based methods, becoming widely applied in the CNER field. However, due to the complexity of clinical texts, the diversity and large quantity of named entity types, and the unclear boundaries between different entities, existing advanced methods rely to some extent on annotated databases and the scale of embedded dictionaries.

Authors

  • Jian Tang
    Department of Decision Sciences HEC, Université de Montréal, Montreal, Québec, Canada.
  • Zikun Huang
    School of Science and Technology, Guilin University, Guilin, China.
  • Hongzhen Xu
    Department of Pharmacy, People's Hospital of Guilin, 12 Wenming Road, Guilin, 541000, China, 86 18978320258.
  • Hao Zhang
    College of Mechanical and Electrical Engineering, Henan Agricultural University, Zhengzhou, 450002, China.
  • Hailing Huang
    Department of Pharmacy, People's Hospital of Guilin, 12 Wenming Road, Guilin, 541000, China, 86 18978320258.
  • Minqiong Tang
    Department of Pharmacy, People's Hospital of Guilin, 12 Wenming Road, Guilin, 541000, China, 86 18978320258.
  • Pengsheng Luo
    Department of Pharmacy, People's Hospital of Guilin, 12 Wenming Road, Guilin, 541000, China, 86 18978320258.
  • Dong Qin
    Department of Electrical and Computer Engineering, Iowa State University, 2215 Coover Hall, 2520 Osborn Drive, Ames, 50011-1046, IA, USA. Electronic address: dqin@iastate.edu.