Developing a cardiovascular disease risk factor annotated corpus of Chinese electronic medical records.

Journal: BMC medical informatics and decision making
Published Date:

Abstract

BACKGROUND: Cardiovascular disease (CVD) has become the leading cause of death in China, and most of the cases can be prevented by controlling risk factors. The goal of this study was to build a corpus of CVD risk factor annotations based on Chinese electronic medical records (CEMRs). This corpus is intended to be used to develop a risk factor information extraction system that, in turn, can be applied as a foundation for the further study of the progress of risk factors and CVD.

Authors

  • Jia Su
    Language Technology Research Center, Harbin Institute of Technology, School of Computer Science and Technology, No. 92 West Dazhi Street, Harbin, Heilongjiang, China.
  • Bin He
    Clinical Translational Medical Center, The Affiliated Dongguan Songshan Lake Central Hospital, Guangdong Medical University, Dongguan, Guangdong, China.
  • Yi Guan
    School of Computer Science and Technology, Harbin Institute of Technology, Integrated Laboratory Building 803, Harbin 150001, China. Electronic address: guanyi@hit.edu.cn.
  • Jingchi Jiang
    School of Computer Science and Technology, Harbin Institute of Technology, Integrated Laboratory Building 803, Harbin 150001, China. Electronic address: jiangjingchi0118@163.com.
  • Jinfeng Yang
    Electric Power Research Institute of Guangdong Power Grid Corporation, Guangzhou 510080, China.