Challenges in Building of Deep Learning Models for Glioblastoma Segmentation: Evidence from Clinical Data.

Journal: Studies in health technology and informatics
Published Date:

Abstract

In this article, we compare the performance of a state-of-the-art segmentation network (UNet) on two different glioblastoma (GB) segmentation datasets. Our experiments show that the same training procedure yields almost twice as bad results on the retrospective clinical data compared to the BraTS challenge data (in terms of Dice score). We discuss possible reasons for such an outcome, including inter-rater variability and high variability in magnetic resonance imaging (MRI) scanners and scanner settings. The high performance of segmentation models, demonstrated on preselected imaging data, does not bring the community closer to using these algorithms in clinical settings. We believe that a clinically applicable deep learning architecture requires a shift from unified datasets to heterogeneous data.

Authors

  • Anvar Kurmukov
    Kharkevich Institute for Information Transmission Problems, Moscow, Russia.
  • Aleksandra Dalechina
    N.N. Burdenko National Medical Research Center of Neurosurgery, Moscow, Russia.
  • Talgat Saparov
    Institute for Information Transmission Problems of the Russian Academy of Sciences (Kharkevich Institute), Moscow, Russia.
  • Mikhail Belyaev
    Skolkovo Institute of Science and Technology, Moscow, Russia. Electronic address: m.belyaev@skoltech.ru.
  • Svetlana Zolotova
    N.N. Burdenko National Medical Research Center of Neurosurgery, Moscow, Russia.
  • Andrey Golanov
    N.N. Burdenko National Medical Research Center of Neurosurgery, Moscow, Russia.
  • Anna Nikolaeva
    N.N. Burdenko National Medical Research Center of Neurosurgery, Moscow, Russia.