Eigenrank by committee: Von-Neumann entropy based data subset selection and failure prediction for deep learning based medical image segmentation.

Journal: Medical image analysis
Published Date:

Abstract

Manual delineation of anatomy on existing images is the basis of developing deep learning algorithms for medical image segmentation. However, manual segmentation is tedious. It is also expensive because clinician effort is necessary to ensure correctness of delineation. Consequently most algorithm development is based on a tiny fraction of the vast amount of imaging data collected at a medical center. Thus, selection of a subset of images from hospital databases for manual delineation - so that algorithms trained on such data are accurate and tolerant to variation, becomes an important challenge. We address this challenge using a novel algorithm. The proposed algorithm named 'Eigenrank by Committee' (EBC) first computes the degree of disagreement between segmentations generated by each DL model in a committee. Then, it iteratively adds to the committee, a DL model trained on cases where the disagreement is maximal. The disagreement between segmentations is quantified by the maximum eigenvalue of a Dice coefficient disagreement matrix a measure closely related to the Von Neumann entropy. We use EBC for selecting data subsets for manual labeling from a larger database of spinal canal segmentations as well as intervertebral disk segmentations. U-Nets trained on these subsets are used to generate segmentations on the remaining data. Similar sized data subsets are also randomly sampled from the respective databases, and U-Nets are trained on these random subsets as well. We found that U-Nets trained using data subsets selected by EBC, generate segmentations with higher average Dice coefficients on the rest of the database than U-Nets trained using random sampling (p < 0.05 using t-tests comparing averages). Furthermore, U-Nets trained using data subsets selected by EBC generate segmentations with a distribution of Dice coefficients that demonstrate significantly (p < 0.05 using Bartlett's test) lower variance in comparison to U-Nets trained using random sampling for all datasets. We believe that this lower variance indicates that U-Nets trained with EBC are more robust than U-Nets trained with random sampling.

Authors

  • Bilwaj Gaonkar
    Center for Biomedical Image Computing and Analytics, United States. Electronic address: bilwaj@gmail.com.
  • Joel Beckett
    Department of Neurosurgery, David Geffen School of Medicine, University of California Los Angeles; David Geffen School of Medicine, University of California Los Angeles. Electronic address: JBeckett@mednet.ucla.edu.
  • Mark Attiah
    Department of Neurosurgery, University of California, Los Angeles, United States.
  • Christine Ahn
    Department of Neurosurgery, University of California, Los Angeles, United States.
  • Matthew Edwards
    Department of Neurosurgery, University of California, Los Angeles, United States.
  • Bayard Wilson
    Department of Neurosurgery, University of California, Los Angeles, United States.
  • Azim Laiwalla
    Department of Neurosurgery, University of California, Los Angeles, United States.
  • Banafsheh Salehi
    Department of Radiology, University of California, Los Angeles, United States.
  • Bryan Yoo
    Department of Radiology, University of California, Los Angeles, Los Angeles, CA, USA.
  • Alex A T Bui
  • Luke Macyszyn