Deep learning for the dynamic prediction of multivariate longitudinal and survival data.

Journal: Statistics in medicine
Published Date:

Abstract

The joint model for longitudinal and survival data improves time-to-event predictions by including longitudinal outcome variables in addition to baseline covariates. However, in practice, joint models may be limited by parametric assumptions in both the longitudinal and survival submodels. In addition, computational difficulties arise when considering multiple longitudinal outcomes due to the large number of random effects to be integrated out in the full likelihood. In this article, we discuss several recent machine learning methods for incorporating multivariate longitudinal data for time-to-event prediction. The presented methods use functional data analysis or convolutional neural networks to model the longitudinal data, both of which scale well to multiple longitudinal outcomes. In addition, we propose a novel architecture based on the transformer neural network, named TransformerJM, which jointly models longitudinal and time-to-event data. The prognostic abilities of each model are assessed and compared through both simulation and real data analysis on Alzheimer's disease datasets. Specifically, the models were evaluated based on their ability to dynamically update predictions as new longitudinal data becomes available. We showed that TransformerJM improves upon the predictive performance of existing methods across different scenarios.

Authors

  • Jeffrey Lin
    Department of Biostatistics and Data Science, The University of Texas Health Science Center at Houston, Houston, Texas, USA.
  • Sheng Luo
    Department of Biostatistics and Bioinformatics, Duke University Medical Center, Durham, North Carolina, USA.