Longitudinal image-based prediction of surgical intervention in infants with hydronephrosis using deep learning: Is a single ultrasound enough?

Journal: PLOS digital health

Published Date: Aug 4, 2025

Abstract

The potential of deep learning to predict renal obstruction using kidney ultrasound images has been demonstrated. However, these image-based classifiers have incorporated information using only single-visit ultrasounds. Here, we developed machine learning (ML) models incorporating ultrasounds from multiple clinic visits for hydronephrosis to generate a hydronephrosis severity index score to discriminate patients into high versus low risk for needing pyeloplasty and compare these against models trained with single clinic visit data. We included patients followed for hydronephrosis from three institutions. The outcome of interest was low risk versus high risk of obstructive hydronephrosis requiring pyeloplasty. The model was trained on data from Toronto, ON and validated on an internal holdout set, and tested on an internal prospective set and two external institutions. We developed models trained with single ultrasound (single-visit) and multi-visit models using average prediction, convolutional pooling, long-short term memory and temporal shift models. We compared model performance by area under the receiver-operator-characteristic (AUROC) and area under the precision-recall-curve (AUPRC). A total of 794 patients were included (603 SickKids, 102 Stanford, and 89 CHOP) with a pyeloplasty rate of 12%, 5%, and 67%, respectively. There was no significant difference in developing single-visit US models using the first ultrasound vs. the latest ultrasound. Comparing single-visit vs. multi-visit models, all multi-visit models fail to produce AUROC or AUPRC significantly greater than single-visit models. We developed ML models for hydronephrosis that incorporate multi-visit inference across multiple institutions but did not demonstrate superiority over single-visit inference. These results imply that the single-visit models would be sufficient in aiding accurate risk stratification from single, early ultrasound images.

Authors

Adree Khondker

Division of Urology, The Hospital for Sick Children, 555 University Avenue, Toronto, ON, M5G 1X8, Canada.
Stanley Bryan Z Hua

Department of Computer Science, University of Toronto, Toronto, Canada.
Jethro C C Kwong

Faculty of Medicine, University of Toronto, Toronto, ON, Canada.
Kunj Sheth

Division of Pediatric Urology, Stanford Medicine, Palo Alto, California, USA.
Daniel Alvarez
Kyla N Velaer

Stanford Children's Health -- Lucile Packard Children's Hospital, Stanford University, Palo Alto, California, United States of America.
John Weaver

Division of Urology, Children's Hospital of Philadelphia, 3401 Civic Center Blvd, Philadelphia, PA 19104, USA.
Alice Xiang

Division of Urology, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, United States of America.
Gregory E Tasian

Department of Surgery, Division of Pediatric Urology, The Children's Hospital of Philadelphia, Philadelphia, PA 19104, United States; Center for Pediatric Clinical Effectiveness, The Children's Hospital of Philadelphia, Philadelphia, PA 19104, United States; Department of Biostatistics, Epidemiology, and Informatics, The University of Pennsylvania, Philadelphia, PA, 19104, United States.
Armando J Lorenzo

Department of Surgery, The Hospital for Sick Children, Toronto, Ontario, Canada.
Anna Goldenberg

SickKids Research Institute, 686 Bay Street, Toronto, ON M5G 0A4, Canada; Department of Computer Science, University of Toronto, 40 St. George Street, Toronto, ON M5S 2E4, Canada. Electronic address: anna.goldenberg@utoronto.ca.
Mandy Rickard

Department of Surgery, The Hospital for Sick Children, Toronto, Ontario, Canada.
Lauren Erdman

Genetics and Genome Biology Program, The Hospital for Sick Children, Toronto, Ontario, Canada.

Keywords

No keywords available for this article.

External Resources

View on PubMed Access via DOI PubMed (40758672)

Longitudinal image-based prediction of surgical intervention in infants with hydronephrosis using deep learning: Is a single ultrasound enough?

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals

Longitudinal image-based prediction of surgical intervention in infants with hydronephrosis using deep learning: Is a single ultrasound enough?

Abstract

Authors

Keywords

External Resources

Stay Ahead of Medical AI

Popular Topics

Recent Journals