Addressing Confounding in Predictive Models with an Application to Neuroimaging.

Journal: The international journal of biostatistics
Published Date:

Abstract

Understanding structural changes in the brain that are caused by a particular disease is a major goal of neuroimaging research. Multivariate pattern analysis (MVPA) comprises a collection of tools that can be used to understand complex disease efxcfects across the brain. We discuss several important issues that must be considered when analyzing data from neuroimaging studies using MVPA. In particular, we focus on the consequences of confounding by non-imaging variables such as age and sex on the results of MVPA. After reviewing current practice to address confounding in neuroimaging studies, we propose an alternative approach based on inverse probability weighting. Although the proposed method is motivated by neuroimaging applications, it is broadly applicable to many problems in machine learning and predictive modeling. We demonstrate the advantages of our approach on simulated and real data examples.

Authors

  • Kristin A Linn
    Department of Biostatistics and Epidemiology, University of Pennsylvania, Philadelphia, PA, USA. Electronic address: klinn@upenn.edu.
  • Bilwaj Gaonkar
    Center for Biomedical Image Computing and Analytics, United States. Electronic address: bilwaj@gmail.com.
  • Jimit Doshi
    Center for Biomedical Image Computing and Analytics (CBICA), University of Pennsylvania, Philadelphia, PA, USA.
  • Christos Davatzikos
    Artificial Intelligence in Biomedical Imaging Laboratory (AIBIL), Center for and Data Science for Integrated Diagnostics (AID), Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.
  • Russell T Shinohara
    Artificial Intelligence in Biomedical Imaging Laboratory (AIBIL), Center for and Data Science for Integrated Diagnostics (AID), Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.