Optimizing Large Language Models for Discharge Prediction: Best Practices in Leveraging Electronic Health Record Audit Logs.

Journal: AMIA ... Annual Symposium proceedings. AMIA Symposium
Published Date:

Abstract

Electronic Health Record (EHR) audit log data are increasingly utilized for clinical tasks, from workflow modeling to predictive analyses of discharge events, adverse kidney outcomes, and hospital readmissions. These data encapsulate user-EHR interactions, reflecting both healthcare professionals' behavior and patients' health statuses. To harness this temporal information effectively, this study explores the application of Large Language Models (LLMs) in leveraging audit log data for clinical prediction tasks, specifically focusing on discharge predictions. Utilizing a year's worth of EHR data from Vanderbilt University Medical Center, we fine-tuned LLMs with randomly selected 10,000 training examples. Our findings reveal that LLaMA-2 70B, with an AUROC of 0.80 [0.77-0.82], outperforms both GPT-4 128K in a zero-shot, with an AUROC of 0.68 [0.65-0.71], and DeBERTa, with an AUROC of 0.78 [0.75-0.82]. Among various serialization methods, the first-occurrence approach-wherein only the initial appearance of each event in a sequence is retained-shows superior performance. Furthermore, for the fine-tuned LLaMA-2 70B, logit outputs yield a higher AUROC of 0.80 [0.77-0.82] compared to text outputs, with an AUROC of 0.69 [0.67-0.72]. This study underscores the potential of fine-tuned LLMs, particularly when combined with strategic sequence serialization, in advancing clinical prediction tasks.

Authors

  • Xinmeng Zhang
    Department of Computer Science, Vanderbilt University, Nashville, TN 37212, United States.
  • Chao Yan
    School of Pharmacy, Shanghai Jiao Tong University, Shanghai 200240, China.
  • Yuyang Yang
    Northwestern University, Chicago, IL.
  • Zhuohang Li
    Department of Computer Science, Vanderbilt University, Nashville, TN 37212, United States.
  • Yubo Feng
    Vanderbilt University, Nashville, Tennessee, USA.
  • Bradley A Malin
    Vanderbilt University, Nashville, TN.
  • You Chen
    Dept. of Biomedical Informatics, School of Medicine, Vanderbilt University, Nashville, TN, USA.