ViT-GCN: a novel hybrid model for accurate pneumonia diagnosis from x-ray images.

Journal: Biomedical physics & engineering express

Published Date: Jul 21, 2025

Abstract

This study aims to enhance the accuracy of pneumonia diagnosis from x-ray images by developing a model that integrates Vision Transformer (ViT) and Graph Convolutional Networks (GCN) for improved feature extraction and diagnostic performance. The ViT-GCN model was designed to leverage the strengths of both ViT, which captures global image information by dividing the image into fixed-size patches and processing them in sequence, and GCN, which captures node features and relationships through message passing and aggregation in graph data. A composite loss function combining multivariate cross-entropy, focal loss, and GHM loss was introduced to address dataset imbalance and improve training efficiency on small datasets. The ViT-GCN model demonstrated superior performance, achieving an accuracy of 91.43% on the COVID-19 chest x-ray database, surpassing existing models in diagnostic accuracy for pneumonia. The study highlights the effectiveness of combining ViT and GCN architectures in medical image diagnosis, particularly in addressing challenges related to small datasets. This approach can lead to more accurate and efficient pneumonia diagnoses, especially in resource-constrained settings where small datasets are common.

Authors

Nuo Xu

Information Engineering, Guangdong University of Technology, Guangzhou, China.
Jinran Wu

School of Mathematics and Physics, The University of Queensland, Brisbane, QLD, Australia.
Fengjing Cai

College of Mathematics and Physics, Wenzhou University, Wenzhou, 325035, People's Republic of China. caifj7704@wzu.edu.cn.
Xi'an Li

Ceyear Technologies Co., Ltd, Qingdao 266555, People's Republic of China.
Hong-Bo Xie

Jiangsu Provincial Key Laboratory for Interventional Medical Devices, Huaiyin Institute of Technology, Huaian, Jiangsu Province, 223003, People's Republic of China.

Keywords

Algorithms COVID-19 Databases, Factual Humans Image Processing, Computer-Assisted Neural Networks, Computer Pneumonia Radiographic Image Interpretation, Computer-Assisted SARS-CoV-2

External Resources

View on PubMed Access via DOI PubMed (40614755)

ViT-GCN: a novel hybrid model for accurate pneumonia diagnosis from x-ray images.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals

ViT-GCN: a novel hybrid model for accurate pneumonia diagnosis from x-ray images.

Abstract

Authors

Keywords

External Resources

Stay Ahead of Medical AI

Popular Topics

Recent Journals