FAFuse: A Four-Axis Fusion framework of CNN and Transformer for medical image segmentation.

Journal: Computers in biology and medicine

Published Date: Oct 13, 2023

Abstract

Medical image segmentation is crucial for accurate diagnosis and treatment in the medical field. In recent years, convolutional neural networks (CNNs) and Transformers have been frequently adopted as network architectures in medical image segmentation. The convolution operation is limited in modeling long-range dependencies because it can only extract local information through the limited receptive field. In comparison, Transformers demonstrate excellent capability in modeling long-range dependencies but are less effective in capturing local information. Hence, effectively modeling long-range dependencies while preserving local information is essential for accurate medical image segmentation. In this paper, we propose a four-axis fusion framework called FAFuse, which can exploit the advantages of CNN and Transformer. As the core component of our FAFuse, a Four-Axis Fusion module (FAF) is proposed to efficiently fuse global and local information. FAF combines Four-Axis attention (height, width, main diagonal, and counter diagonal axial attention), a multi-scale convolution, and a residual structure with a depth-separable convolution and a Hadamard product. Furthermore, we also introduce deep supervision to enhance gradient flow and improve overall performance. Our approach achieves state-of-the-art segmentation accuracy on three publicly available medical image segmentation datasets. The code is available at https://github.com/cczu-xiao/FAFuse.

Authors

Shoukun Xu

Aliyun School of Big Data, Changzhou University, Changzhou, Jiangsu, 213164, China.
Dehao Xiao

Aliyun School of Big Data, Changzhou University, Changzhou, Jiangsu, 213164, China.
Baohua Yuan

Aliyun School of Big Data, Changzhou University, Changzhou, Jiangsu, 213164, China; Jiangsu Engineering Research Center of Digital Twinning Technology for Key Equipment in Petrochemical Process, Changzhou University, Changzhou, Jiangsu, 213164, China. Electronic address: yuanbaohua@cczu.edu.cn.
Yi Liu

Department of Interventional Therapy, Ningbo No. 2 Hospital, Ningbo, China.
Xueyuan Wang

Aliyun School of Big Data, Changzhou University, Changzhou, Jiangsu, 213164, China.
Ning Li

Department of Respiratory and Critical Care Medicine, Center for Respiratory Medicine, the Fourth Affiliated Hospital of School of Medicine, and International School of Medicine, International Institutes of Medicine, Zhejiang University, Yiwu, China.
Lin Shi

Department of Imaging and Interventional Radiology, The Chinese University of Hong Kong, Shatin, China.
Jialu Chen

Aliyun School of Big Data, Changzhou University, Changzhou, Jiangsu, 213164, China.
Ju-Xiao Zhang

College of Information and Mathematics Science, Nanjing Normal University of Special Education, Nanjing 210038, Jiangsu, China.
Yanhao Wang

School of Data Science and Engineering, East China Normal University, Shanghai 200062, China.
Jianfeng Cao

Department of Electrical Engineering, City University of Hong Kong, Hong Kong, 999077, China.
Yeqin Shao
Mingjie Jiang

Department of Electrical Engineering, City University of Hong Kong, 999077, Hong Kong, China.

Keywords

Algorithms Humans Image Processing, Computer-Assisted Neural Networks, Computer

External Resources

View on PubMed Access via DOI PubMed (37852109)

FAFuse: A Four-Axis Fusion framework of CNN and Transformer for medical image segmentation.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals

FAFuse: A Four-Axis Fusion framework of CNN and Transformer for medical image segmentation.

Abstract

Authors

Keywords

External Resources

Don't Miss the Future of Medicine

Popular Topics

Recent Journals