Semi-Supervised Medical Image Segmentation via Knowledge Mining from Large Models
Journal:
arXiv
Published Date:
Mar 10, 2025
Abstract
Large-scale vision models like SAM have extensive visual knowledge, yet their
general nature and computational demands limit their use in specialized tasks
like medical image segmentation. In contrast, task-specific models such as
U-Net++ often underperform due to sparse labeled data. This study introduces a
strategic knowledge mining method that leverages SAM's broad understanding to
boost the performance of small, locally hosted deep learning models.
In our approach, we trained a U-Net++ model on a limited labeled dataset and
extend its capabilities by converting SAM's output infered on unlabeled images
into prompts. This process not only harnesses SAM's generalized visual
knowledge but also iteratively improves SAM's prediction to cater specialized
medical segmentation tasks via U-Net++. The mined knowledge, serving as "pseudo
labels", enriches the training dataset, enabling the fine-tuning of the local
network.
Applied to the Kvasir SEG and COVID-QU-Ex datasets which consist of
gastrointestinal polyp and lung X-ray images respectively, our proposed method
consistently enhanced the segmentation performance on Dice by 3% and 1%
respectively over the baseline U-Net++ model, when the same amount of labelled
data were used during training (75% and 50% of labelled data). Remarkably, our
proposed method surpassed the baseline U-Net++ model even when the latter was
trained exclusively on labeled data (100% of labelled data). These results
underscore the potential of knowledge mining to overcome data limitations in
specialized models by leveraging the broad, albeit general, knowledge of
large-scale models like SAM, all while maintaining operational efficiency
essential for clinical applications.