Summary Report of the SNMMI AI Task Force Radiomics Challenge 2024.

Journal: Journal of nuclear medicine : official publication, Society of Nuclear Medicine
Published Date:

Abstract

In medical imaging, challenges are competitions that aim to provide a fair comparison of different methodologic solutions to a common problem. Challenges typically focus on addressing real-world problems, such as segmentation, detection, and prediction tasks, using various types of medical images and associated data. Here, we describe the organization and results of such a challenge to compare machine-learning models for predicting survival in patients with diffuse large B-cell lymphoma using a baseline F-FDG PET/CT radiomics dataset. This challenge aimed to predict progression-free survival (PFS) in patients with diffuse large B-cell lymphoma, either as a binary outcome (shorter than 2 y versus longer than 2 y) or as a continuous outcome (survival in months). All participants were provided with a radiomic training dataset, including the ground truth survival for designing a predictive model and a radiomic test dataset without ground truth. Figures of merit (FOMs) used to assess model performance were the root-mean-square error for continuous outcomes and the C-index for 1-, 2-, and 3-y PFS binary outcomes. The challenge was endorsed and initiated by the Society of Nuclear Medicine and Molecular Imaging AI Task Force. Nineteen models for predicting PFS as a continuous outcome from 15 teams were received. Among those models, external validation identified 6 models showing similar performance to that of a simple general linear reference model using SUV and total metabolic tumor volumes (TMTV) only. Twelve models for predicting binary outcomes were submitted by 9 teams. External validation showed that 1 model had higher, but nonsignificant, C-index values compared with values obtained by a simple logistic regression model using SUV and TMTV. Some of the radiomic-based machine-learning models developed by participants showed better FOMs than did simple linear or logistic regression models based on SUV and TMTV only, although the differences in observed FOMs were nonsignificant. This suggests that, for the challenge dataset, there was limited or no value seen from the addition of sophisticated radiomic features and use of machine learning when developing models for outcome prediction.

Authors

  • Ronald Boellaard
    Department of Radiology and Nuclear Medicine, Cancer Center Amsterdam, Amsterdam University Medical Centers, Vrije Universiteit, Amsterdam, The Netherlands.
  • Arman Rahmim
  • Jacoba J Eertink
    Department of Radiology and Nuclear Medicine, Amsterdam UMC, Cancer Center Amsterdam, Amsterdam, The Netherlands.
  • Ulrich Duehrsen
    Department of Hematology, West German Cancer Center, University Hospital Essen, University of Duisburg-Essen, Essen, Germany.
  • Lars Kurch
    Clinic and Polyclinic for Nuclear Medicine, Department of Nuclear Medicine, University of Leipzig, Leipzig, Germany.
  • Pieternella J Lugtenburg
    Department of Hematology, Erasmus MC Cancer Institute, University Medical Center Rotterdam, Rotterdam, The Netherlands.
  • Sanne E Wiegers
    Department of Radiology and Nuclear Medicine, Amsterdam UMC, Cancer Center Amsterdam, Amsterdam, The Netherlands.
  • Gerben J C Zwezerijnen
    Department of Radiology and Nuclear Medicine, Amsterdam UMC, Cancer Center Amsterdam, Amsterdam, The Netherlands.
  • Josée M Zijlstra
    Department of Hematology, Amsterdam UMC, Cancer Center Amsterdam, Amsterdam, The Netherlands.
  • Martijn W Heymans
    Department of Epidemiology and Data Science, Amsterdam UMC location Vrije Universiteit Amsterdam, Amsterdam, The Netherlands.
  • Irène Buvat
    Imagerie Moléculaire In Vivo, CEA, Inserm, Univ Paris Sud, CNRS, Université Paris Saclay, Orsay, France.

Keywords

No keywords available for this article.