Narrative Feature or Structured Feature? A Study of Large Language Models to Identify Cancer Patients at Risk of Heart Failure.

Journal: AMIA ... Annual Symposium proceedings. AMIA Symposium
Published Date:

Abstract

Cancer treatments are known to introduce cardiotoxicity, negatively impacting outcomes and survivorship. Identifying cancer patients at risk of heart failure (HF) is critical to improving cancer treatment outcomes and safety. This study examined machine learning (ML) models to identify cancer patients at risk of HF using electronic health records (EHRs), including traditional ML, Time-Aware long short-term memory (T-LSTM), and large language models (LLMs) using novel narrative features derived from the structured medical codes. We identified a cancer cohort of 12,806 patients from the University of Florida Health, diagnosed with lung, breast, and colorectal cancers, among which 1,602 individuals developed HF after cancer. The LLM, GatorTron-3.9B, achieved the best F1 scores, outperforming the traditional support vector machines by 39%, the T-LSTM deep learning model by 7%, and a widely used transformer model, BERT, by 5.6%. The analysis shows that the proposed narrative features remarkably increased feature density and improved performance.

Authors

  • Ziyi Chen
    Department of Neurology, The First Affiliated Hospital, Sun Yat-sen University, Guangzhou, 510080, Guangdong Province, China.
  • Mengyuan Zhang
    The Department of Radiology, The First Affiliated Hospital of Zhengzhou University, No.1, Eastern Jianshe Road, Zhengzhou 450052, Henan Province, China (K.Q., D.Y., Y.Z., M.Z., W.Z., J.Z., B.Y., J.G., J.L.).
  • Mustafa Mohammed Ahmed
    Division of Cardiovascular Medicine, Department of Medicine, College of Medicine, University of Florida, Gainesville, FL, USA.
  • Yi Guo
    Department of Respiratory and Critical Care Medicine, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China.
  • Thomas J George
    Division of Hematology & Oncology, Department of Medicine, College of Medicine.
  • Jiang Bian
    Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, Florida, United States of America.
  • Yonghui Wu
    Department of Health Outcomes and Biomedical Informatics.