Comparison of the Cox proportional hazards model and Random Survival Forest algorithm for predicting patient-specific survival probabilities in clinical trial data
Journal:
arXiv
Published Date:
Feb 5, 2025
Abstract
The Cox proportional hazards model is often used for model development in
data from randomized controlled trials (RCT) with time-to-event outcomes.
Random survival forests (RSF) is a machine-learning algorithm known for its
high predictive performance. We conduct a comprehensive neutral comparison
study to compare the predictive performance of Cox regression and RSF in
real-world as well as simulated data. Performance is compared using multiple
performance measures according to recommendations for the comparison of
prognostic prediction models. We found that while the RSF usually outperforms
the Cox model when using the $C$ index, Cox model predictions may be better
calibrated. With respect to overall performance, the Cox model often exceeds
the RSF in nonproportional hazards settings, while otherwise the RSF typically
performs better especially for smaller sample sizes. Overall performance of the
RSF is more affected by higher censoring rates, while overall performance of
the Cox model suffers more from smaller sample sizes.