Divide and Conquer: Grounding a Bleeding Areas in Gastrointestinal Image with Two-Stage Model
Journal:
arXiv
Published Date:
Dec 21, 2024
Abstract
Accurate detection and segmentation of gastrointestinal bleeding are critical
for diagnosing diseases such as peptic ulcers and colorectal cancer. This study
proposes a two-stage framework that decouples classification and grounding to
address the inherent challenges posed by traditional Multi-Task Learning
models, which jointly optimizes classification and segmentation. Our approach
separates these tasks to achieve targeted optimization for each. The model
first classifies images as bleeding or non-bleeding, thereby isolating
subsequent grounding from inter-task interference and label heterogeneity. To
further enhance performance, we incorporate Stochastic Weight Averaging and
Test-Time Augmentation, which improve model robustness against domain shifts
and annotation inconsistencies. Our method is validated on the Auto-WCEBleedGen
Challenge V2 Challenge dataset and achieving second place. Experimental results
demonstrate significant improvements in classification accuracy and
segmentation precision, especially on sequential datasets with consistent
visual patterns. This study highlights the practical benefits of a two-stage
strategy for medical image analysis and sets a new standard for GI bleeding
detection and segmentation. Our code is publicly available at this GitHub
repository.