Dual parallel net: A novel deep learning model for rectal tumor segmentation via CNN and transformer with Gaussian Mixture prior.

Journal: Journal of biomedical informatics
Published Date:

Abstract

Segmentation of rectal cancerous regions from Magnetic Resonance (MR) images can help doctor define the extent of the rectal cancer and judge the severity of rectal cancer, so rectal tumor segmentation is crucial to improve the accuracy of rectal cancer diagnosis. However, accurate segmentation of rectal cancerous regions remains a challenging task due to the shape of rectal tumor has significant variations and the tumor and surrounding tissue are indistinguishable. In addition, in the early research on rectal tumor segmentation, most deep learning methods were based on convolutional neural networks (CNNs), and traditional CNN have small receptive field, which can only capture local information while ignoring the global information of the image. Nevertheless, the global information plays a crucial role in rectal tumor segmentation, so traditional CNN-based methods usually cannot achieve satisfactory segmentation results. In this paper, we propose an encoder-decoder network named Dual Parallel Net (DuPNet), which fuses transformer and classical CNN for capturing both global and local information. Meanwhile, as for capture features at different scales as well as to avoid accuracy loss and parameters reduction, we design a feature adaptive block (FAB) in skip connection between encoder and decoder. Furthermore, in order to utilize the apriori information of rectal tumor shape effectively, we design a Gaussian Mixture prior and embed it in self-attention mechanism of the transformer, leading to robust feature representation and accurate segmentation results. We have performed extensive ablation experiments to verify the effectiveness of our proposed dual parallel encoder, FAB and Gaussian Mixture prior on the dataset from the Shanxi Cancer Hospital. In the experimental comparison with the state-of-the-art methods, our method achieved a Mean Intersection over Union (MIoU) of 89.34% on the test set. In addition to that, we evaluated the generalizability of our method on the dataset from Xinhua Hospital, the promising results verify the superiority of our method.

Authors

  • Huiting Zhang
    College of Data Science, Taiyuan University of Technology, Jinzhong 030600, China; Technology Research Centre of Spatial Information Network Engineering of Shanxi, Jinzhong 030600, China; Key Laboratory of Big Data Fusion Analysis and Application of Shanxi Province, Taiyuan, China, 030024.
  • Xiaotang Yang
    Department of Radiology, Shanxi Province Cancer Hospital, Shanxi Hospital Affiliated to Cancer Hospital, Chinese Academy of Medical Sciences, Cancer Hospital, Affiliated to Shanxi Medical University, Taiyuan, 030013, Shanxi, China. yangxt210@126.com.
  • Dengao Li
    Technology Research Center of Spatial Information Network Engineering of Shanxi, Taiyuan, Shanxi, China.
  • Yanfen Cui
    Department of Radiology, Shanxi Province Cancer Hospital, Shanxi Hospital Affiliated to Cancer Hospital, Chinese Academy of Medical Sciences, Cancer Hospital, Affiliated to Shanxi Medical University, Taiyuan, 030013, Shanxi, China.
  • Jumin Zhao
    College of Information and Computer, Taiyuan University of Technology, Taiyuan, Shanxi, China.
  • Shuang Qiu
    College of Food Science and Nutritional Engineering, China Agricultural University, P.O. Box 40, No. 17 Qinghua East Road, Haidian District Beijing, 100083 People's Republic China.