Interactive Explainable Deep Learning Model for Hepatocellular Carcinoma Diagnosis at Gadoxetic Acid-enhanced MRI: A Retrospective, Multicenter, Diagnostic Study.
Journal:
Radiology. Imaging cancer
Published Date:
May 1, 2025
Abstract
Purpose To develop an artificial intelligence (AI) model based on gadoxetic acid-enhanced MRI to assist radiologists in hepatocellular carcinoma (HCC) diagnosis. Materials and Methods This retrospective study included patients with focal liver lesions (FLLs) who underwent gadoxetic acid-enhanced MRI between January 2015 and December 2021. All hepatic malignancies were diagnosed pathologically, whereas benign lesions were confirmed with pathologic findings or imaging follow-up. Five manually labeled bounding boxes for each FLL obtained from precontrast T1-weighted, T2-weighted, arterial phase, portal venous phase, and hepatobiliary phase images were included. The lesion classifier component, used to distinguish HCC from non-HCC, was trained and externally tested. The feature classifier, based on a post hoc algorithm, inferred the presence of the Liver Imaging Reporting and Data System (LI-RADS) features by analyzing activation patterns of the pretrained lesion classifier. Two radiologists categorized FLLs in the external testing dataset according to LI-RADS criteria. Diagnostic performance of the AI model and the model's impact on reader accuracy were assessed. Results The study included 839 patients (mean age, 51 years ± 12 [SD]; 681 male) with 1023 FLLs (594 HCCs and 429 non-HCCs). The AI model yielded area under the receiver operating characteristic curves of 0.98 and 0.97 in the training set and external testing set, respectively. Compared with LI-RADS category 5, the AI model showed higher sensitivity (91.6% vs 74.8%; < .001) and similar specificity (90.7% vs 96.0%; = .22). The two readers identified more LI-RADS major features and more accurately classified category LR-5 lesions when assisted versus unassisted by AI, with higher sensitivities (reader 1, 85.7% vs 72.3%; < .001; reader 2, 89.1% vs 74.0%; < .001) and the same specificities (reader 1, 93.3% vs reader 2, 94.7%; > .99 for both). Conclusion The AI model accurately diagnosed HCC and improved the radiologists' diagnostic performance. Artificial Intelligence, Deep Learning, MRI, Hepatocellular Carcinoma © RSNA, 2025 See also commentary by Singh et al in this issue.