MD-SGT: Multi-dilation spherical graph transformer for unsupervised medical image registration.

Journal: Computerized medical imaging and graphics : the official journal of the Computerized Medical Imaging Society
PMID:

Abstract

Deformable medical image registration is an essential preprocess step for several clinical applications. Even though the existing convolutional neural network and transformer based methods achieved the promising results, the limited long-range spatial dependence and non-uniform attention span of these models prohibit further improving the registration performance. To deal with this issue, we proposed a multi-dilation spherical graph transformer (MD-SGT), in which the encoder combined the advantages of convolutional and graph transformer blocks to distinguish effectively the differences between the reference and the template images at various scales. Specifically, the features of each voxel were obtained by aggregating the information from its neighbors sampled from different spherical regions with different dilation rates. The implicit convolution inductive bias and long-range uniform attention span induced by such information aggregation manner made the features more representative for registration. Through the qualitative and quantitative comparisons with state-of-the-art methods on two datasets, we demonstrated that combining long-range uniform attention span and inductive bias are beneficial for promoting the image registration performance, with the Dice score, ASD and HD95 being improved at least by 0.5%, 2.2% and 1.1%, respectively.

Authors

  • Kun Tang
    Engineering Research Center of Text Computing & Cognitive Intelligence, Ministry of Education, Key Laboratory of Intelligent Medical Image Analysis and Precise Diagnosis of Guizhou Province, State Key Laboratory of Public Big Data, College of Computer Science and Technology, Guizhou University, Guiyang, China.
  • Lihui Wang
    Shanghai Mental Health Center, Shanghai Jiao Tong University, School of Medicine, Shanghai, China.
  • Xingyu Huang
    School of Computer Science and Technology, Donghua University 201600, Shanghai, China.
  • Xinyu Cheng
    Engineering Research Center of Text Computing & Cognitive Intelligence, Ministry of Education, Key Laboratory of Intelligent Medical Image Analysis and Precise Diagnosis of Guizhou Province, State Key Laboratory of Public Big Data, College of Computer Science and Technology, Guizhou University, Guiyang, China.
  • Yue-Min Zhu
    University Lyon, INSA Lyon, CNRS, INSERM, CREATIS UMR 5220, U1206, F-69621, Lyon, France.