How to remove or control confounds in predictive models, with applications to brain biomarkers.

Journal: GigaScience
Published Date:

Abstract

BACKGROUND: With increasing data sizes and more easily available computational methods, neurosciences rely more and more on predictive modeling with machine learning, e.g., to extract disease biomarkers. Yet, a successful prediction may capture a confounding effect correlated with the outcome instead of brain features specific to the outcome of interest. For instance, because patients tend to move more in the scanner than controls, imaging biomarkers of a disease condition may mostly reflect head motion, leading to inefficient use of resources and wrong interpretation of the biomarkers.

Authors

  • Darya Chyzhyk
    Computational Intelligence Group, Universidad del Pais Vasco (UPV/EHU), San Sebastian 20018, Spain.
  • Gael Varoquaux
    Parietal, INRIA, NeuroSpin, bat 145 CEA Saclay, 91191, Gif sur Yvette, France.
  • Michael Milham
    Center for Biomedical Imaging and Neuromodulation, Nathan S. Kline Institute for Psychiatric Research, New York State Office of Mental Health, USA.
  • Bertrand Thirion
    Parietal, Inria, Université Paris-Saclay, Gif-sur-Yvette, France.