SAMA-UNet: Enhancing Medical Image Segmentation with Self-Adaptive Mamba-Like Attention and Causal-Resonance Learning
Journal:
arXiv
Published Date:
May 21, 2025
Abstract
Medical image segmentation plays an important role in various clinical
applications, but existing models often struggle with the computational
inefficiencies and challenges posed by complex medical data. State Space
Sequence Models (SSMs) have demonstrated promise in modeling long-range
dependencies with linear computational complexity, yet their application in
medical image segmentation remains hindered by incompatibilities with image
tokens and autoregressive assumptions. Moreover, it is difficult to achieve a
balance in capturing both local fine-grained information and global semantic
dependencies. To address these challenges, we introduce SAMA-UNet, a novel
architecture for medical image segmentation. A key innovation is the
Self-Adaptive Mamba-like Aggregated Attention (SAMA) block, which integrates
contextual self-attention with dynamic weight modulation to prioritise the most
relevant features based on local and global contexts. This approach reduces
computational complexity and improves the representation of complex image
features across multiple scales. We also suggest the Causal-Resonance
Multi-Scale Module (CR-MSM), which enhances the flow of information between the
encoder and decoder by using causal resonance learning. This mechanism allows
the model to automatically adjust feature resolution and causal dependencies
across scales, leading to better semantic alignment between the low-level and
high-level features in U-shaped architectures. Experiments on MRI, CT, and
endoscopy images show that SAMA-UNet performs better in segmentation accuracy
than current methods using CNN, Transformer, and Mamba. The implementation is
publicly available at GitHub.