Empirical data drift detection experiments on real-world medical imaging data.

Journal: Nature communications
Published Date:

Abstract

While it is common to monitor deployed clinical artificial intelligence (AI) models for performance degradation, it is less common for the input data to be monitored for data drift - systemic changes to input distributions. However, when real-time evaluation may not be practical (eg., labeling costs) or when gold-labels are automatically generated, we argue that tracking data drift becomes a vital addition for AI deployments. In this work, we perform empirical experiments on real-world medical imaging to evaluate three data drift detection methods' ability to detect data drift caused (a) naturally (emergence of COVID-19 in X-rays) and (b) synthetically. We find that monitoring performance alone is not a good proxy for detecting data drift and that drift-detection heavily depends on sample size and patient features. Our work discusses the need and utility of data drift detection in various scenarios and highlights gaps in knowledge for the practical application of existing methods.

Authors

  • Ali Kore
    Vector Institute, Toronto, Canada.
  • Elyar Abbasi Bavil
    Temerity School of Medicine, University of Toronto, Toronto, Canada.
  • Vallijah Subasri
    Peter Munk Cardiac Center, University Health Network, Toronto, ON, Canada.
  • Moustafa Abdalla
    Computational Statistics & Machine Learning Group, Department of Statistics, University of Oxford, Oxford, UK.
  • Benjamin Fine
    Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, MA, USA (LAC); Division of Pulmonary, Critical Care and Sleep Medicine, Beth Israel Deaconess Medical Center, Boston, MA, USA (LAC); Department of Diagnostic Imaging and Operational Analytics Lab, Trillium Health Partners, Mississauga, ON, Canada (BF); Department of Medical Imaging, University of Toronto, ON, Canada (BF); Departments of Anesthesiology and Neurosurgery and the Center for Advanced Data Analytics, University of Virginia, Charlottesville, USA (DJS).
  • Elham Dolatabadi
  • Mohamed Abdalla
    ICES, Toronto, Canada.