Large language models are less effective at clinical prediction tasks than locally trained machine learning models.
Journal:
Journal of the American Medical Informatics Association : JAMIA
Published Date:
May 1, 2025
Abstract
OBJECTIVES: To determine the extent to which current large language models (LLMs) can serve as substitutes for traditional machine learning (ML) as clinical predictors using data from electronic health records (EHRs), we investigated various factors that can impact their adoption, including overall performance, calibration, fairness, and resilience to privacy protections that reduce data fidelity.