CFM-UNet: coupling local and global feature extraction networks for medical image segmentation.

Journal: Scientific reports
Published Date:

Abstract

In medical image segmentation, traditional CNN-based models excel at extracting local features but have limitations in capturing global features. Conversely, Mamba, a novel network framework, effectively captures long-range feature dependencies and excels in processing linearly arranged image inputs, albeit at the cost of overlooking fine spatial relationships and local pixel interactions. This limitation highlights the need for hybrid approaches that combine the strengths of both architectures. To address this challenge, we propose CNN-Fusion-Mamba-based U-Net (CFM-UNet). The model integrates CNN-based Bottle2neck blocks for local feature extraction and Mamba-based visual state space blocks for global feature extraction. These parallel frameworks perform feature fusion through our designed SEF block, achieving complementary advantages. Experimental results demonstrate that CFM-UNet outperforms other advanced methods in segmenting medical image datasets, including liver organs, liver tumors, spine, and colon polyps, with notable generalization ability in liver organ segmentation. Our code is available at https://github.com/Jiacheng-Han/CFM-UNet .

Authors

  • Ke Niu
    Beijing Information Science and Technology University, Beijing, China. Electronic address: niuke@bistu.edu.cn.
  • Jiacheng Han
    Beijing Information Science and Technology University, Computer School, Beijing, 100000, China.
  • Jiuyun Cai
    Beijing Information Science and Technology University, Computer School, Beijing, 100000, China.