The reliability of a deep learning model in clinical out-of-distribution MRI data: A multicohort study.

Journal: Medical image analysis
Published Date:

Abstract

Deep learning (DL) methods have in recent years yielded impressive results in medical imaging, with the potential to function as a clinical aid to radiologists. However, DL models in medical imaging are often trained on public research cohorts with images acquired with a single scanner or with strict protocol harmonization, which is not representative of a clinical setting. The aim of this study was to investigate how well a DL model performs in unseen clinical datasets (collected with different scanners, protocols and disease populations) and whether more heterogeneous training data improves generalization. In total, 3117 MRI scans of brains from multiple dementia research cohorts and memory clinics, which had been visually rated by a neuroradiologist according to Scheltens' scale of medial temporal atrophy (MTA), were included in this study. By training multiple versions of a convolutional neural network on different subsets of this data to predict MTA ratings, we assessed the impact that including images from a wider distribution during training had on performance in external memory clinic data. Our results showed that our model generalized well to datasets acquired with protocols similar to those of the training data, but performed substantially worse in clinical cohorts with visibly different tissue contrasts in the images. This implies that future DL studies investigating performance in out-of-distribution (OOD) MRI data need to assess multiple external cohorts for reliable results. Further, including data from a wider range of scanners and protocols improved performance in OOD data, which suggests that more heterogeneous training data makes the model generalize better. To conclude, this is the most comprehensive study to date investigating the domain shift in deep learning on MRI data, and we advocate rigorous evaluation of DL models on clinical data before they are certified for deployment.

Authors

  • Gustav Mårtensson
    Division of Clinical Geriatrics, Center for Alzheimer Research, Department of Neurobiology, Care Sciences and Society, Karolinska Institutet, Stockholm, Sweden.
  • Daniel Ferreira
    Division of Clinical Geriatrics, Center for Alzheimer Research, Department of Neurobiology, Care Sciences and Society, Karolinska Institutet, Stockholm, Sweden.
  • Tobias Granberg
    Department of Clinical Neuroscience, Karolinska Institutet, Stockholm, Sweden; Martinos Center for Biomedical Imaging, Massachusetts General Hospital, Boston, USA.
  • Lena Cavallin
    Department of Clinical Neuroscience, Karolinska Institutet, Stockholm, Sweden; Department of Radiology, Karolinska University Hospital, Stockholm, Sweden.
  • Ketil Oppedal
    Centre for Age-Related Medicine, Stavanger University Hospital, Stavanger, Norway; Stavanger Medical Imaging Laboratory (SMIL), Department of Radiology, Stavanger University Hospital, Stavanger, Norway; Department of Electrical Engineering and Computer Science, University of Stavanger, Stavanger, Norway.
  • Alessandro Padovani
    Neurology Unit, Department of Clinical and Experimental Sciences, University of Brescia, Brescia, Italy.
  • Irena Rektorova
    1st Department of Neurology, Medical Faculty, St. Anne's Hospital and CEITEC, Masaryk University, Brno, Czech Republic.
  • Laura Bonanni
    Department of Neuroscience Imaging and Clinical Sciences and CESI, University G d'Annunzio of Chieti-Pescara, Chieti, Italy.
  • Matteo Pardini
    Department of Neuroscience (DINOGMI), University of Genoa and Neurology Clinics, Polyclinic San Martino Hospital, Genoa, Italy.
  • Milica G Kramberger
    Department of Neurology, University Medical Centre Ljubljana, Medical Faculty, University of Ljubljana, Ljubljana, Slovenia.
  • John-Paul Taylor
    Institute of Neuroscience, Campus for Ageing and Vitality, Newcastle University, Newcastle upon Tyne, NE4 5PL, United Kingdom.
  • Jakub Hort
    Memory Clinic, Department of Neurology, Charles University, 2nd Faculty of Medicine and Motol University Hospital, Prague, Czech Republic.
  • Jón Snædal
    Landspitali University Hospital, Reykjavik, Iceland.
  • Jaime Kulisevsky
    Movement Disorders Unit, Neurology Department, Sant Pau Hospital, Barcelona, Spain; Institut d'Investigacions Biomédiques Sant Pau (IIB-Sant Pau), Barcelona, Spain; Centro de Investigación en Red-Enfermedades Neurodegenerativas (CIBERNED), Barcelona, Spain; Universitat Autónoma de Barcelona (U.A.B.), Barcelona, Spain.
  • Frederic Blanc
    Day Hospital of Geriatrics, Memory Resource and Research Centre (CM2R) of Strasbourg, Department of Geriatrics, Hôpitaux Universitaires de Strasbourg, Strasbourg, France; University of Strasbourg and French National Centre for Scientific Research (CNRS), ICube Laboratory and Fédération de Médecine Translationnelle de Strasbourg (FMTS), Team Imagerie Multimodale Intégrative en Santé (IMIS)/ICONE, Strasbourg, France.
  • Angelo Antonini
    Department of Parkinson disease, IRCCS San Camillo, Via Alberoni 70, Venice-Lido, Italy.
  • Patrizia Mecocci
    Institute of Gerontology and Geriatrics, University of Perugia, Perugia, Italy.
  • Bruno Vellas
    UMR INSERM 1027, gerontopole, CHU, University of Toulouse, France.
  • Magda Tsolaki
    Third Department of Neurology, Medical School, Aristotle University of Thessaloniki, Thessaloniki, Greece.
  • Iwona Kłoszewska
    Medical University of Lodz, Lodz, Poland.
  • Hilkka Soininen
    Institute of Clinical Medicine, Neurology, University of Eastern Finland, Kuopio, Finland.
  • Simon Lovestone
    Department of Psychiatry, Warneford Hospital, University of Oxford, Oxford, UK.
  • Andrew Simmons
    King's College London, Institute of Psychiatry, NIHR Biomedical Research Centre for Mental Health at the South London and Maudsley NHS Foundation Trust, London, UK; King's College London, Institute of Psychiatry, NIHR Biomedical Research Unit for Dementia at the South London and Maudsley NHS Foundation Trust, London, UK.
  • Dag Aarsland
    Centre for Age-Related Medicine, Stavanger University Hospital, Stavanger, Norway; Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK.
  • Eric Westman
    Division of Clinical Geriatrics, Center for Alzheimer Research, Department of Neurobiology, Care Sciences and Society, Karolinska Institutet, Stockholm, Sweden.