MFFBi-Unet: Merging Dynamic Sparse Attention and Multi-scale Feature Fusion for Medical Image Segmentation.

Journal: Interdisciplinary sciences, computational life sciences

Published Date: Jul 29, 2025

Abstract

Advances in deep learning have driven extensive research validating Transformer-based, U-Net-style symmetric encoder-decoder architectures for medical image segmentation. However, because standard attention computes token affinities across all spatial locations, these models incur prohibitive computational complexity and substantial memory demands. Recent work has tried to address these limitations with sparse attention, but existing approaches rely on manually designed, content-agnostic sparsity patterns that model long-range dependencies poorly. We propose MFFBi-Unet, a novel architecture that incorporates dynamic sparse attention through bi-level routing, allocating computation in a content-aware and more adaptable manner. The encoder-decoder module integrates BiFormer to improve semantic feature extraction and support high-fidelity feature map reconstruction. A novel Multi-scale Feature Fusion (MFF) module in the skip connections combines multi-level contextual information with processed multi-scale features. Extensive evaluations on multiple public medical benchmarks show that the method holds consistent, significant advantages. Notably, it achieves statistically significant improvements over state-of-the-art approaches such as MISSFormer, with Dice score gains of 2.02% and 1.28% on the respective benchmarks.
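To make the idea of dynamic sparse attention through bi-level routing concrete, the following is a minimal NumPy sketch, not the authors' implementation. It assumes tokens are already grouped into regions, uses the inputs directly as queries, keys, and values (no learned projections), and omits multi-head splitting and any local-context term. The function name `bi_level_routing_attention` and the `topk` parameter are hypothetical names chosen for illustration.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def bi_level_routing_attention(q, k, v, topk):
    """Illustrative bi-level routing attention (assumed simplification).

    q, k, v: arrays of shape (S, n, d) = (regions, tokens per region, channels).
    topk:    number of key regions each query region may attend to.
    """
    S, n, d = q.shape

    # Level 1 (coarse): region-to-region affinity from mean-pooled queries/keys.
    q_region = q.mean(axis=1)                      # (S, d)
    k_region = k.mean(axis=1)                      # (S, d)
    affinity = q_region @ k_region.T               # (S, S)

    # Keep, for each query region, the indices of its topk most relevant regions.
    routed = np.argsort(-affinity, axis=1)[:, :topk]   # (S, topk)

    # Level 2 (fine): token-to-token attention restricted to the routed regions.
    k_gathered = k[routed].reshape(S, topk * n, d)     # (S, topk*n, d)
    v_gathered = v[routed].reshape(S, topk * n, d)     # (S, topk*n, d)
    scores = q @ k_gathered.transpose(0, 2, 1) / np.sqrt(d)   # (S, n, topk*n)
    out = softmax(scores, axis=-1) @ v_gathered               # (S, n, d)
    return out, routed


# Usage: 4 regions of 3 tokens each, 8 channels, each region routed to 2 regions.
rng = np.random.default_rng(0)
x = rng.standard_normal((4, 3, 8))
out, routed = bi_level_routing_attention(x, x, x, topk=2)
```

In this sketch each query token attends to `topk * n` keys instead of all `S * n`, and the set of attended regions depends on the content of the input rather than on a fixed pattern. Setting `topk` equal to the number of regions recovers ordinary full attention.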

Authors

Baoshan Sun

School of Computer Science and Technology, Tiangong University, Tianjin, 300387, China. sunbaoshan@tiangong.edu.cn.

Chunfei Liu

School of Computer Science and Technology, Tiangong University, Tianjin, 300387, China.

Qiuyan Wang

Center for Genomic and Personalized Medicine, Guangxi Medical University, Nanning, China.

Kaiyu Bi

School of Computer Science and Technology, Tiangong University, Tianjin, 300387, China.

Wenxue Zhang

College of Biomass Science and Engineering, Sichuan University, Chengdu 610065, China; School of Liquor-Brewing Engineering, Sichuan University of Jinjiang College, Meishan 620860, China. Electronic address: foodecoengineering@163.com.

External Resources

PubMed ID: 40730736
