Enhanced tuberculosis detection using Vision Transformers and explainable AI with a Grad-CAM approach on chest X-rays.
Journal:
BMC Medical Imaging
PMID:
40128729
Abstract
Tuberculosis (TB), caused by Mycobacterium tuberculosis, remains a leading global health challenge, especially in low-resource settings. Accurate diagnosis from chest X-rays is critical yet difficult because TB manifestations can be subtle, particularly in the early stages. Traditional computational methods, which rely primarily on basic convolutional neural networks (CNNs), often require extensive pre-processing and struggle to generalize across diverse clinical environments. This study introduces a novel Vision Transformer (ViT) model augmented with Gradient-weighted Class Activation Mapping (Grad-CAM) to improve both diagnostic accuracy and interpretability. The ViT model uses self-attention to capture long-range dependencies and complex patterns directly from raw pixel data, while Grad-CAM provides visual explanations of model decisions by highlighting the regions of the X-ray that most influenced the prediction. The model comprises a Conv2D stem for initial feature extraction followed by multiple transformer encoder blocks, which substantially strengthens its ability to learn discriminative features without any pre-processing. On the validation set, the model achieved an accuracy of 0.97, a recall of 0.99, and an F1-score of 0.98 for TB cases. On the test set, it achieved an accuracy of 0.98, a recall of 0.97, and an F1-score of 0.98, outperforming existing methods. The Grad-CAM visualizations not only improve the model's transparency but also help radiologists assess and verify AI-driven diagnoses. These results demonstrate the model's high diagnostic precision and its potential for clinical application in real-world settings, representing a substantial advance in automated TB detection.
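The architecture and Grad-CAM pipeline described above could be realized roughly as in the following PyTorch sketch. This is a minimal illustrative example, not the authors' implementation: the patch size, embedding dimension, encoder depth, number of attention heads, image size, binary TB/normal output, and the choice of the Conv2D stem's feature maps as the Grad-CAM target are all assumptions, and the `ConvStemViT` and `grad_cam` names are hypothetical.

```python
# Illustrative sketch only (assumed hyperparameters, not the paper's code):
# a ViT with a Conv2D stem, with Grad-CAM computed on the stem's feature maps.
import torch
import torch.nn as nn

class ConvStemViT(nn.Module):
    def __init__(self, img_size=224, patch=16, dim=256, depth=6, heads=8, classes=2):
        super().__init__()
        # Conv2D stem: extracts local features and tokenizes the image into patches.
        self.stem = nn.Conv2d(3, dim, kernel_size=patch, stride=patch)
        n_tokens = (img_size // patch) ** 2
        self.cls_token = nn.Parameter(torch.zeros(1, 1, dim))
        self.pos_embed = nn.Parameter(torch.zeros(1, n_tokens + 1, dim))
        encoder_layer = nn.TransformerEncoderLayer(
            d_model=dim, nhead=heads, dim_feedforward=4 * dim,
            batch_first=True, norm_first=True)
        self.encoder = nn.TransformerEncoder(encoder_layer, num_layers=depth)
        self.head = nn.Linear(dim, classes)
        self._feats = None  # stem feature maps cached for Grad-CAM

    def forward(self, x):
        f = self.stem(x)                        # (B, dim, H/p, W/p)
        if f.requires_grad:
            f.retain_grad()                     # keep gradients for Grad-CAM
        self._feats = f
        tokens = f.flatten(2).transpose(1, 2)   # (B, N, dim) patch tokens
        cls = self.cls_token.expand(x.size(0), -1, -1)
        tokens = torch.cat([cls, tokens], dim=1) + self.pos_embed
        out = self.encoder(tokens)              # self-attention over all tokens
        return self.head(out[:, 0])             # classify from the [CLS] token

def grad_cam(model, x, target_class):
    """Grad-CAM on the Conv2D stem: weight each feature map by the
    spatially averaged gradient of the target-class logit."""
    model.eval()
    logits = model(x)
    model.zero_grad()
    logits[0, target_class].backward()
    feats, grads = model._feats, model._feats.grad     # (1, dim, h, w)
    weights = grads.mean(dim=(2, 3), keepdim=True)     # global-average-pool the grads
    cam = torch.relu((weights * feats).sum(dim=1))     # (1, h, w) class activation map
    cam = cam / (cam.max() + 1e-8)                     # normalize to [0, 1]
    return cam.squeeze(0).detach()

# Example usage on a dummy chest X-ray tensor (class 1 assumed to be "TB"):
model = ConvStemViT()
xray = torch.rand(1, 3, 224, 224)
heatmap = grad_cam(model, xray, target_class=1)
print(heatmap.shape)  # 14x14 map, to be upsampled and overlaid on the X-ray
```

In practice the low-resolution heatmap would be upsampled to the input resolution and overlaid on the chest X-ray, giving radiologists a visual indication of which regions drove the TB prediction.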