Artificial Intelligence Enhances Diagnostic Accuracy of Contrast Enemas in Hirschsprung Disease Compared to Clinical Experts.
Journal:
European journal of pediatric surgery : official journal of Austrian Association of Pediatric Surgery ... [et al] = Zeitschrift fur Kinderchirurgie
Published Date:
Jul 15, 2025
Abstract
Contrast enema (CE) is widely used in the evaluation of suspected Hirschsprung disease (HD). Deep learning is a promising tool to standardize image assessment and support clinical decision-making. This study assesses the diagnostic performance of a deep neural network (DNN), with and without clinical data, and compares its interpretation with that of pediatric surgeons and radiologists.
In this retrospective study, 1,471 CE images from patients <15 years were analyzed, with 218 images used for testing. A DNN, pediatric radiologists, and surgeons independently reviewed the testing set, with and without clinical data. Diagnostic performance was assessed using receiver operating characteristic (ROC) and precision-recall (PR) curves, and interobserver agreement was evaluated using Fleiss' kappa. Rectal biopsy served as the reference standard.
The DNN achieved high diagnostic accuracy (area under the receiver operating characteristic curve [AUC-ROC] = 0.87) in CE interpretation, with improved performance when combining anteroposterior and lateral images (AUC-ROC = 0.92). Clinical data integration further enhanced model sensitivity and negative predictive value. The super-surgeon (majority voting of colorectal surgeons) outperformed most individual clinicians (sensitivity 81.8%, specificity 79.1%), while the super-radiologist (majority voting of radiologists) showed moderate accuracy. Interobserver analysis revealed strong agreement between the model and surgeons (Cohen's kappa = 0.73), and overall consistency among experts and the model (Fleiss' kappa = 0.62).
Artificial intelligence-assisted CE interpretation achieved higher specificity than the clinicians, with comparable sensitivity. Its consistent performance and substantial agreement with experts support its potential role in improving CE assessment in HD.
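The "super-reader" majority vote and the agreement statistics mentioned in the abstract can be illustrated with a minimal sketch. This is not the study's code: the function names, the tie-breaking rule, and the toy labels below are all assumptions made for illustration, and the numbers it produces have no relation to the reported results.

```python
from collections import Counter

def majority_vote(readings):
    """Combine per-reader binary calls (1 = HD, 0 = not HD), image by image.

    Assumption: a tie counts as 0; with an odd number of readers no tie occurs.
    """
    # zip(*readings) groups the calls of all readers for each image
    return [1 if 2 * sum(calls) > len(calls) else 0 for calls in zip(*readings)]

def sensitivity_specificity(truth, pred):
    """Sensitivity and specificity against a binary reference standard."""
    pairs = list(zip(truth, pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    return tp / (tp + fn), tn / (tn + fp)

def cohen_kappa(a, b):
    """Cohen's kappa for two raters labelling the same items."""
    n = len(a)
    observed = sum(1 for x, y in zip(a, b) if x == y) / n
    count_a, count_b = Counter(a), Counter(b)
    # Chance agreement from each rater's marginal label frequencies
    expected = sum(count_a[k] * count_b[k] for k in set(a) | set(b)) / n ** 2
    return (observed - expected) / (1 - expected)

# Hypothetical toy data: 8 images, biopsy result as the reference standard
biopsy  = [1, 1, 1, 0, 0, 0, 0, 1]
reader1 = [1, 1, 0, 0, 0, 1, 0, 1]
reader2 = [1, 0, 1, 0, 0, 0, 0, 1]
reader3 = [1, 1, 0, 0, 1, 0, 0, 0]
model   = [1, 1, 1, 0, 0, 1, 0, 1]

super_reader = majority_vote([reader1, reader2, reader3])
sens, spec = sensitivity_specificity(biopsy, super_reader)
kappa = cohen_kappa(super_reader, model)
print(super_reader, sens, spec, round(kappa, 3))
```

Fleiss' kappa generalizes the same chance-corrected idea to more than two raters; it is omitted here to keep the sketch short.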