Efficient and Accurate Extracting of Unstructured EHRs on Cancer Therapy Responses for the Development of RECIST Natural Language Processing Tools: Part I, the Corpus.

Journal: JCO clinical cancer informatics
Published Date:

Abstract

PURPOSE: Electronic health records (EHRs) are created primarily for nonresearch purposes; thus, the amounts of data are enormous, and the data are crude, heterogeneous, incomplete, and largely unstructured, presenting challenges to effective analyses for timely, reliable results. Particularly, research dealing with clinical notes relevant to patient care and outcome is seldom conducted, due to the complexity of data extraction and accurate annotation in the past. RECIST is a set of widely accepted research criteria to evaluate tumor response in patients undergoing antineoplastic therapy. The aim for this study was to identify textual sources for RECIST information in EHRs and to develop a corpus of pharmacotherapy and response entities for development of natural language processing tools.

Authors

  • Yalun Li
    Department of Health Sciences Research, Mayo Clinic, Scottsdale, AZ.
  • Yung-Hung Luo
    Department of Chest Medicine, Taipei Veterans General Hospital, Taipei City, Taiwan.
  • Jason A Wampfler
    Division of Biomedical Statistics and Informatics, Department of Health Science Research, Mayo Clinic, Rochester, MN.
  • Samuel M Rubinstein
    Department of Medicine, Division of Hematology/Oncology, Vanderbilt University, Nashville, TN.
  • Firat Tiryaki
    School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA.
  • Kumar Ashok
    Department of Health Sciences Research, Mayo Clinic, Scottsdale, AZ.
  • Jeremy L Warner
    Department of Medicine, Brown University, Providence, RI, 02912, United States.
  • Hua Xu
    Department of Urology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China.
  • Ping Yang
    Key Laboratory of Grain and Oil Processing and Food Safety of Sichuan Province, College of Food and Bioengineering, Xihua University Chengdu 610039 China xingyage1@163.com.