Building a comprehensive syntactic and semantic corpus of Chinese clinical texts.

Journal: Journal of Biomedical Informatics
Published Date:

Abstract

OBJECTIVE: To build a comprehensive corpus of Chinese clinical texts with syntactic and semantic annotations, together with the corresponding annotation guidelines and methods, and to develop tools trained on the annotated corpus, thereby supplying baselines for research on Chinese texts in the clinical domain.

Authors

  • Bin He
    Clinical Translational Medical Center, The Affiliated Dongguan Songshan Lake Central Hospital, Guangdong Medical University, Dongguan, Guangdong, China.
  • Bin Dong
    Ricoh Software Research Center (Beijing), Beijing, China.
  • Yi Guan
    School of Computer Science and Technology, Harbin Institute of Technology, Integrated Laboratory Building 803, Harbin 150001, China. Electronic address: guanyi@hit.edu.cn.
  • Jinfeng Yang
    Electric Power Research Institute of Guangdong Power Grid Corporation, Guangzhou 510080, China.
  • Zhipeng Jiang
    School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China.
  • Qiubin Yu
    Medical Record Room, Second Affiliated Hospital of Harbin Medical University, Harbin 150086, China. Electronic address: yuqiubin6695@163.com.
  • Jianyi Cheng
    School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China.
  • Chunyan Qu
    School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China.