An evaluation of rectum contours generated by artificial intelligence automatic contouring software using geometry, dosimetry and predicted toxicity.

Journal: Biomedical physics & engineering express
Published Date:

Abstract

Objective This study assesses rectum contours generated using a commercial deep learning auto-contouring model and compares them to clinician contours using geometry, changes in dosimetry and toxicity modelling. Approach This retrospective study involved 308 prostate cancer patients who were treated using 3D-conformal radiotherapy. Computed tomography images were input into Limbus Contour (v1.8.0b3) to generate auto-contour structures for each patient. Auto-contours were not edited after their generation. Rectum auto-contours were compared to clinician contours geometrically and dosimetrically. Dice similarity coefficient (DSC), mean Hausdorff distance (HD) and volume difference were assessed. Dose-volume histogram (DVH) constraints (V41%-V100%) were compared, and a Wilcoxon signed rank test was used to evaluate statistical significance of differences. Toxicity modelling to compare contours was carried out using equivalent uniform dose (EUD) and clinical factors of abdominal surgery and atrial fibrillation. Trained models were tested (80:20) in their prediction of grade 1 late rectal bleeding (ntotal=124) using area-under the receiver operating characteristic curve (AUC). Main results Median DSC (interquartile range (IQR)) was 0.85 (0.09), median HD was 1.38 mm (0.60 mm) and median volume difference was -1.73 cc (14.58 cc). Median DVH differences between contours were found to be small (<1.5%) for all constraints although systematically larger than clinician contours (p<0.05). However, an IQR up to 8.0% was seen for individual patients across all dose constraints. Models using EUD alone derived from clinician or auto-contours had AUCs of 0.60 (0.10) and 0.60 (0.09). AUC for models involving clinical factors and dosimetry was 0.65 (0.09) and 0.66 (0.09) when using clinician contours and auto-contours. Significance Although median DVH metrics were similar, variation for individual patients highlights the importance of clinician review. Rectal bleeding prediction accuracy did not depend on the contour method for this cohort. The auto-contouring model used in this study shows promise in a supervised workflow. .

Authors

  • Owen Mc Laughlin
    Queen's University Belfast, Patrick G. Johnston Centre for Cancer Research, 97 Lisburn Road, Belfast, Northern Ireland, BT9 7AE, UNITED KINGDOM OF GREAT BRITAIN AND NORTHERN IRELAND.
  • Fereshteh Gholami
    Queen's University Belfast, Patrick G Johnston Centre for Cancer Research, 97 Lisburn Rd, Belfast, Northern Ireland, BT9 7AE, UNITED KINGDOM OF GREAT BRITAIN AND NORTHERN IRELAND.
  • Sarah Osman
    Centre for Cancer Research and Cell Biology, Queens' University, Belfast, UK.
  • Joe M O'Sullivan
    Queen's University Belfast, Patrick G. Johnston Centre for Cancer Research, 97 Lisburn Rd, Belfast, Northern Ireland, BT9 7AE, UNITED KINGDOM OF GREAT BRITAIN AND NORTHERN IRELAND.
  • Stephen J McMahon
    Queen's University Belfast, Patrick G. Johnston Centre for Cancer Research, 97 Lisburn Rd, Belfast, Northern Ireland, BT9 7AE, UNITED KINGDOM OF GREAT BRITAIN AND NORTHERN IRELAND.
  • Suneil Jain
    Department of Clinical Oncology, The Northern Ireland Cancer Centre, Belfast City Hospital, Antrim, Northern Ireland, UK.
  • Conor K McGarry
    Queen's University Belfast, The Patrick G Johnston Centre for Cancer Research, 97 Lisburn Rd, Belfast, Northern Ireland, BT9 7AE, UNITED KINGDOM OF GREAT BRITAIN AND NORTHERN IRELAND.

Keywords

No keywords available for this article.