Assessment of Robustness of MRI Radiomic Features in Four Abdominal Organs: Impact of Deep Learning Reconstruction and Segmentation.

Journal: Journal of magnetic resonance imaging : JMRI
Published Date:

Abstract

BACKGROUND: The impact of deep learning (DL) reconstruction and segmentation on MRI radiomics stability has not been fully assessed. PURPOSE: To investigate the effects of acquisition, reconstruction, and segmentation on the reproducibility and variability of radiomic features in abdominal MRI. STUDY TYPE: Prospective. POPULATION: 37 volunteers (22 men; mean age ± standard deviation, 37.4 ± 11.0 years). FIELD STRENGTH/SEQUENCE: 3.0-T; axial turbo spin echo T2-weighted image, and fat-suppressed T2-weighted image using a half-Fourier acquisition single-shot turbo spin echo technique, each acquired four times with conventional or accelerated techniques, reconstructed with standard or DL algorithms. ASSESSMENT: Regions of interest were automatically generated by a DL neural network for liver, spleen, and right and left kidneys, followed by manual correction. We extracted 107 features using PyRadiomics after z-score normalization. STATISTICAL TESTS: The reproducibility between acquisitions, reconstructions, and segmentations was evaluated using intraclass correlation coefficient (ICC) and concordance correlation coefficient (CCC). The variability among the four scans was assessed by coefficient of variation (CV) and quartile coefficient of dispersion (QCD). p < 0.05 was considered significant. RESULTS: The mean ICC (0.518-0.608; 0.606-0.681) and CCC (0.515-0.603; 0.601-0.680) values were low for both manual and automatic segmentation regardless of image acquisition and reconstruction, using conventional acquisition with standard reconstruction as reference. The mean ICC (0.535-0.713) and CCC (0.531-0.714) values were low between manual and automatic segmentation, regardless of image acquisition and reconstruction. The median CV (10.0%-17.5%; 8.9%-15.5%) and QCD (5.3%-8.5%; 5.1%-8.3%) values were moderate but still adequate for both manual and automatic segmentation among different scans. CONCLUSION: Given the substantial impact of accelerated acquisition and DL reconstruction on the robustness of radiomics features in abdominal MRI, caution should be exercised when utilizing images with different acquisition and reconstruction techniques in radiomics analysis. The automatic segmentation cannot replace manual segmentation due to insufficient robustness of radiomics features. EVIDENCE LEVEL: 2. TECHNICAL EFFICACY: Stage 1.

Authors

Keywords

No keywords available for this article.