Medical image segmentation model based on triple gate MultiLayer perceptron.

Journal: Scientific reports
Published Date:

Abstract

To alleviate the social contradiction between limited medical resources and increasing medical needs, the medical image-assisted diagnosis based on deep learning has become the research focus in Wise Information Technology of med. Most of the existing medical segmentation models based on Convolution or Transformer have achieved relatively sound effects. However, the Convolution-based model with a limited receptive field cannot establish long-distance dependencies between features as the Network deepens. The Transformer-based model produces large computation overhead and cannot generalize the bias of local features and perceive the position feature of medical images, which are essential in medical image segmentation. To address those issues, we present Triple Gate MultiLayer Perceptron U-Net (TGMLP U-Net), a medical image segmentation model based on MLP, in which we design the Triple Gate MultiLayer Perceptron (TGMLP), composed of three parts. Firstly, considering encoding the position information of features, we propose the Triple MLP module based on MultiLayer Perceptron in this model. It uses linear projection to encode features from the high, wide, and channel dimensions, enabling the model to capture the long-distance dependence of features along the spatial dimension and the precise position information of features in three dimensions with less computational overhead. Then, we design the Local Priors and Global Perceptron module. The Global Perceptron divides the feature map into different partitions and conducts correlation modelling for each partition to establish the global dependency between partitions. The Local Priors uses multi-scale Convolution with high local feature extraction ability to explore further the relationship of context feature information within the structure. At last, we suggest a Gate-controlled Mechanism to effectively solves the problem that the dependence of position embeddings between Patches and within Patches in medical images cannot be well learned due to the relatively small number of samples in medical images segmentation data. Experimental results indicate that the proposed model outperforms other state-of-the-art models in most evaluation indicators, demonstrating its excellent performance in segmenting medical images.

Authors

  • Jingke Yan
    Guilin University of Electronic Technology, School of Marine Engineering, Beihai, 536000, China.
  • Xin Wang
    Key Laboratory of Bio-based Material Science & Technology (Northeast Forestry University), Ministry of Education, Harbin 150040, China.
  • Jingye Cai
    School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu, Sichuan, 610054, China.
  • Qin Qin
    Department of Bioengineering, Institute of Engineering in Medicine, University of California, San Diego, San Diego, CA, United States.
  • Hao Yang
    College of Agricultural Science and Engineering, Hohai University, Nanjing 210098, China.
  • Qin Wang
    Department of Pharmacy, Affiliated Hospital of Nantong University, Nantong, China.
  • Yao Cheng
    Southwest Jiaotong University, State Key Laboratory of Traction Power, Chengdu, 610000, China.
  • Tian Gan
    Guilin University of Electronic Technology, School of Computer Science and Information Security, Guilin, 541004, China.
  • Hua Jiang
    Institute for Emergency and Disaster Medicine, Sichuan Academy of Medical Sciences, Sichuan Provincial People's Hospital, School of Medicine, University of Electronic Science and Technology of China, No. 32, Yi Huan Lu Xi Er Duan, Chengdu, Sichuan Province, China; Sino-Finnish Medical AI Research Center, No. 32, Yi Huan Lu Xi Er Duan, Chengdu, Sichuan Province, China. Electronic address: hua.jiang@traumabank.org.
  • Jianhua Deng
    University of Electronic Science and Technology of China,School of Information and Software Engineering, Chengdu, 610000, China.
  • Bingxu Chen