BiSeg-SAM: Weakly-Supervised Post-Processing Framework for Boosting Binary Segmentation in Segment Anything Models
Journal:
arXiv
Published Date:
Apr 2, 2025
Abstract
Accurate segmentation of polyps and skin lesions is essential for diagnosing
colorectal and skin cancers. While various segmentation methods for polyps and
skin lesions using fully supervised deep learning techniques have been
developed, the pixel-level annotation of medical images by doctors is both
time-consuming and costly. Foundational vision models like the Segment Anything
Model (SAM) have demonstrated superior performance; however, directly applying
SAM to medical segmentation may not yield satisfactory results due to the lack
of domain-specific medical knowledge. In this paper, we propose BiSeg-SAM, a
SAM-guided weakly supervised prompting and boundary refinement network for the
segmentation of polyps and skin lesions. Specifically, we fine-tune SAM
combined with a CNN module to learn local features. We introduce a WeakBox with
two functions: automatically generating box prompts for the SAM model and using
our proposed Multi-choice Mask-to-Box (MM2B) transformation for rough
mask-to-box conversion, addressing the mismatch between coarse labels and
precise predictions. Additionally, we apply scale consistency (SC) loss for
prediction scale alignment. Our DetailRefine module enhances boundary precision
and segmentation accuracy by refining coarse predictions using a limited amount
of ground truth labels. This comprehensive approach enables BiSeg-SAM to
achieve excellent multi-task segmentation performance. Our method demonstrates
significant superiority over state-of-the-art (SOTA) methods when tested on
five polyp datasets and one skin cancer dataset.