The Conditional Super Learner.

Journal: IEEE transactions on pattern analysis and machine intelligence
Published Date:

Abstract

Using cross validation to select the best model from a library is standard practice in machine learning. Similarly, meta learning is a widely used technique where models previously developed are combined (mainly linearly) with the expectation of improving performance with respect to individual models. In this article we consider the Conditional Super Learner (CSL), an algorithm that selects the best model candidate from a library of models conditional on the covariates. The CSL expands the idea of using cross validation to select the best model and merges it with meta learning. We propose an optimization algorithm that finds a local minimum to the problem posed and proves that it converges at a rate faster than O(n). We offer empirical evidence that: (1) CSL is an excellent candidate to substitute stacking and (2) CLS is suitable for the analysis of Hierarchical problems. Additionally, implications for global interpretability are emphasized.

Authors

  • Gilmer Valdes
    Department of Radiation Oncology, University of California, San Francisco, California.
  • Yannet Interian
    Data Analytic Program, University of San Francisco, San Francisco, California.
  • Efstathios Gennatas
    Department of Radiation Oncology, University of California San Francisco, San Francisco, CA, USA.
  • Mark van der Laan