In silico perturbations provide multivariate interpretability in predicting post-lung transplant outcomes

Journal: medRxiv
Published Date:

Abstract

Lung transplantation is a life-saving therapy for end-stage lung disease but has the poorest survival among solid organ transplants. We analyzed standardized electronic health record (EHR) data from the United Network for Organ Sharing (UNOS) to predict one-, three-, and five-year survival and favorable long-term outcomes post-lung transplant. We applied two multivariate machine learning approaches, XGBoost or a tabular BERT model called EHRFormer, to data from 43,869 first-time lung transplant recipients (1987–2022). XGBoost and EHRFormer identified features that align closely with established risk factors for worse outcomes such as length of index stay, recipient age, and creatinine at the time of transplant. We developed a simple perturbation method with EHRFormer to probe in silico multivariate interactions between features that influence model prediction. Despite their attention to known risk factors, machine learning applied to EHR data collected by UNOS poorly predict one-, three-, and five-year mortality after lung transplant.

Authors

  • Lucy Luo; Marcin Możejko; Nikolay S. Markov; Alec Peltekian; Suror Mohsin; Mary Carns; Phillip Cooper; Jeffrey Lysne; Anthony Joudi; Alan Betensley; Bradford C. Bemiss; Catherine Myers; Ankit Bharat; Rade Tomic; Ambalavanan Arunachalam; Ewa Szczurek; GR Scott Budinger; Alexander V. Misharin; Mrinalini Venkata Subramani