PASCAL: a pseudo cascade learning framework for breast cancer treatment entity normalization in Chinese clinical text.

Journal: BMC medical informatics and decision making
Published Date:

Abstract

BACKGROUNDS: Knowledge discovery from breast cancer treatment records has promoted downstream clinical studies such as careflow mining and therapy analysis. However, the clinical treatment text from electronic health data might be recorded by different doctors under their hospital guidelines, making the final data rich in author- and domain-specific idiosyncrasies. Therefore, breast cancer treatment entity normalization becomes an essential task for the above downstream clinical studies. The latest studies have demonstrated the superiority of deep learning methods in named entity normalization tasks. Fundamentally, most existing approaches adopt pipeline implementations that treat it as an independent process after named entity recognition, which can propagate errors to later tasks. In addition, despite its importance in clinical and translational research, few studies directly deal with the normalization task in Chinese clinical text due to the complexity of composition forms.

Authors

  • Yang An
    Laboratory of Behavioral Neuroscience, National Institute on Aging, National Institutes of Health, Baltimore, MD, USA.
  • Jianlin Wang
    First Hospital of Lanzhou University, 1 Donggang W Rd, Chengguan District, Lanzhou, Gansu, 730000, China.
  • Liang Zhang
  • Hanyu Zhao
    Dalian University, No.10 Xuefu Street, Economic and Technological Development Zone, Dalian, Liaoning, 116622, China.
  • Zhan Gao
    State Key Laboratory of Cardiovascular Disease, Fuwai Hospital, National Center for Cardiovascular Diseases, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 100037, China.
  • Haitao Huang
    The People's Hospital of Liaoning Province, No.33 Shenhe District, Shenyang, Liaoning, 110016, China.
  • Zhenguang Du
    The People's hospital of Liaoning Province, Liaoning 110016, PR China.
  • Zengtao Jiao
    AI Lab, Yidu Cloud, No.35 of Huayuan North Road, Haidian District, Beijing, 100191, China.
  • Jun Yan
    Department of Statistics, University of Connecticut, Storrs, CT 06269, USA.
  • Xiaopeng Wei
    Key Lab of Advanced Design and Intelligent Computing (Ministry of Education), Dalian University, Dalian, China.
  • Bo Jin
    HBISolutions Inc., Palo Alto, CA 94301, USA.