High Accuracy but Low Explainability: The Challenge of Explainable Artificial Intelligence in Multiple Sclerosis Assessment From Magnetic Resonance Imaging Radiology Reports.

Journal: Seminars in ultrasound, CT, and MR
Published Date:

Abstract

Timely identification of disease progression and/or active lesions in multiple sclerosis (MS) is essential for clinical management. Radiology reports often contain complex language, making consistent interpretation challenging. We developed a natural language processing (NLP)-based tool to assist radiologists in detecting MS-related changes and evaluated its explainability. To assess the performance and interpretability of NLP algorithms in identifying disease progression and/or active lesions in MRI reports of MS patients. A retrospective study included 600 MRI reports labeled for MS progression and/or active lesions (January 2013-July 2022). Five hundred reports were used to fine-tune Robustly Optimized BERT pretraining Approach-based models; 100 served as the test set. A prospective evaluation was conducted on 122 reports. Explainability was assessed using the Local Interpretable Model-Agnostic Explanations tool and radiologist feedback. Retrospective accuracy was 87% for new/enlarged lesions and 96% for active lesions. Prospective accuracy improved to 94.26% and 99.18%, respectively. Local Interpretable Model-Agnostic Explanations-based interpretability yielded radiologist agreement rates of 53.2% (new/enlarged lesions) and 52.5% (active lesions). Our NLP tools demonstrated high accuracy in detecting MS-related MRI findings. However, explainability remains limited, underscoring the need for more intuitive interpretability methods to support clinical integration.

Authors

Keywords

No keywords available for this article.