Decoding Recurrence in Early-Stage and Locoregionally Advanced Non-Small Cell Lung Cancer: Insights From Electronic Health Records and Natural Language Processing.

Journal: JCO Clinical Cancer Informatics
PMID:

Abstract

PURPOSE: Recurrences after curative resection in early-stage and locoregionally advanced non-small cell lung cancer (NSCLC) are common, necessitating a nuanced understanding of associated risk factors. This study aimed to establish a natural language processing (NLP) system to efficiently curate recurrence data in NSCLC and analyze risk factors longitudinally.

Authors

  • Kyeryoung Lee
    IMO Health, Inc., Rosemont, IL 60018, United States.
  • Zongzhi Liu
    GeneDx (Sema4), Stamford, CT.
  • Qing Huang
    Department of Environmental Health and Occupational Medicine, West China School of Public Health, Sichuan University, Chengdu 610041, China.
  • David Corrigan
    GeneDx (Sema4), Stamford, CT.
  • Iftekhar Kalsekar
    Epidemiology, Medical Devices, Johnson & Johnson, New Brunswick, NJ, USA.
  • Tomi Jun
    GeneDx (Sema4), Stamford, CT.
  • Gustavo Stolovitzky
    Thomas J. Watson Research Center, IBM, Yorktown Heights, NY, USA.
  • William K Oh
    GeneDx (Sema4), Stamford, CT.
  • Ravi Rajaram
    Department of Thoracic and Cardiovascular Surgery, The University of Texas MD Anderson Cancer Center, Houston, TX.
  • Xiaoyan Wang
    Key Laboratory of Systems Biomedicine (Ministry of Education), Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, Shanghai, China.