Comparison of Vision Transformers and Convolutional Neural Networks in Medical Image Analysis: A Systematic Review.

Journal: Journal of medical systems
Published Date:

Abstract

In the rapidly evolving field of medical image analysis utilizing artificial intelligence (AI), the selection of appropriate computational models is critical for accurate diagnosis and patient care. This literature review provides a comprehensive comparison of vision transformers (ViTs) and convolutional neural networks (CNNs), the two leading techniques in the field of deep learning in medical imaging. We conducted a survey systematically. Particular attention was given to the robustness, computational efficiency, scalability, and accuracy of these models in handling complex medical datasets. The review incorporates findings from 36 studies and indicates a collective trend that transformer-based models, particularly ViTs, exhibit significant potential in diverse medical imaging tasks, showcasing superior performance when contrasted with conventional CNN models. Additionally, it is evident that pre-training is important for transformer applications. We expect this work to help researchers and practitioners select the most appropriate model for specific medical image analysis tasks, accounting for the current state of the art and future trends in the field.

Authors

  • Satoshi Takahashi
    Department of Neurosurgery, University of Tokyo, Tokyo.
  • Yusuke Sakaguchi
    Department of Inter-Organ Communication Research in Kidney Diseases, Osaka University Graduate School of Medicine, Suita, Japan.
  • Nobuji Kouno
    Department of Surgery, Graduate School of Medicine, Kyoto University.
  • Ken Takasawa
    Division of Molecular Modification and Cancer Biology, National Cancer Center Research Institute, 5-1-1 Tsukiji, Chuo-ku, Tokyo 104-0045, Japan.
  • Kenichi Ishizu
  • Yu Akagi
    Department of Biomedical Informatics, Graduate School of Medicine, The University of Tokyo, 7-3-1 Hongo Bunkyo-ku, Tokyo, 113-8655, Japan.
  • Rina Aoyama
    Showa University Graduate School of Medicine School of Medicine.
  • Naoki Teraya
    Division of Medical AI Research and Development, National Cancer Center Research Institute, 5-1-1 Tsukiji, Chuo-ku, Tokyo, 104-0045, Japan.
  • Amina Bolatkan
    Division of Molecular Modification and Cancer Biology, National Cancer Center Research Institute, 5-1-1 Tsukiji, Chuo-ku, Tokyo 104-0045, Japan.
  • Norio Shinkai
    Cancer Translational Research Team, RIKEN Center for Advanced Intelligence Project, 1-4-1 Nihonbashi, Chuo-ku, Tokyo 103-0027, Japan.
  • Hidenori Machino
    Division of Molecular Modification and Cancer Biology, National Cancer Center Research Institute, 5-1-1 Tsukiji, Chuo-ku, Tokyo 104-0045, Japan.
  • Kazuma Kobayashi
    Division of Molecular Modification and Cancer Biology, National Cancer Center Research Institute.
  • Ken Asada
    Division of Molecular Modification and Cancer Biology, National Cancer Center Research Institute, 5-1-1 Tsukiji, Chuo-ku, Tokyo 104-0045, Japan.
  • Masaaki Komatsu
    Division of Molecular Modification and Cancer Biology, National Cancer Center Research Institute, 5-1-1 Tsukiji, Chuo-ku, Tokyo 104-0045, Japan.
  • Syuzo Kaneko
    Division of Molecular Modification and Cancer Biology, National Cancer Center Research Institute, 5-1-1 Tsukiji, Chuo-ku, Tokyo 104-0045, Japan.
  • Masashi Sugiyama
  • Ryuji Hamamoto
    Division of Molecular Modification and Cancer Biology, National Cancer Center Research Institute, Tokyo, Japan.