Prompt-Driven Latent Domain Generalization for Medical Image Classification.

Journal: IEEE transactions on medical imaging

Published Date: Jan 2, 2025

Abstract

Deep learning models for medical image analysis easily suffer from distribution shifts caused by dataset artifact bias, camera variations, differences in the imaging station, etc., leading to unreliable diagnoses in real-world clinical settings. Domain generalization (DG) methods, which aim to train models on multiple domains to perform well on unseen domains, offer a promising direction to solve the problem. However, existing DG methods assume domain labels of each image are available and accurate, which is typically feasible for only a limited number of medical datasets. To address these challenges, we propose a unified DG framework for medical image classification without relying on domain labels, called Prompt-driven Latent Domain Generalization (PLDG). PLDG consists of unsupervised domain discovery and prompt learning. This framework first discovers pseudo domain labels by clustering the bias-associated style features, then leverages collaborative domain prompts to guide a Vision Transformer to learn knowledge from discovered diverse domains. To facilitate cross-domain knowledge learning between different prompts, we introduce a domain prompt generator that enables knowledge sharing between domain prompts and a shared prompt. A domain mixup strategy is additionally employed for more flexible decision margins and mitigates the risk of incorrect domain assignments. Extensive experiments on three medical image classification tasks and one debiasing task demonstrate that our method can achieve comparable or even superior performance than conventional DG algorithms without relying on domain labels. Our code is publicly available at https://github.com/SiyuanYan1/PLDG/tree/main.

Authors

Siyuan Yan

Key Laboratory of Environmental Medicine and Engineering of Ministry of Education, School of Public Health, Southeast University, Nanjing 210009, China.
Zhen Yu
Chi Liu
Lie Ju
Dwarikanath Mahapatra

Department of Computer Science, ETH Zurich, Switzerland. Electronic address: dwarikanath.mahapatra@inf.ethz.ch.
Brigid Betz-Stablein

QIMR Berghofer Medical Research Institute, Cancer and Population Studies, Brisbane, Queensland, Australia.
Victoria Mar

Victorian Melanoma Service, Alfred Hospital, Melbourne, Victoria, Australia.
Monika Janda

Centre for Health Services Research, Faculty of Medicine, The University of Queensland, Brisbane, Queensland, Australia.
Peter Soyer
Zongyuan Ge

AIM for Health Lab, Faculty of IT, Monash University, Clayton, Victoria, Australia; Monash-Airdoc Research Lab, Faculty of IT, Monash University, Clayton, Victoria, Australia.

Keywords

Algorithms Databases, Factual Deep Learning Humans Image Interpretation, Computer-Assisted Image Processing, Computer-Assisted

External Resources

View on PubMed Access via DOI PubMed (39137089)

Prompt-Driven Latent Domain Generalization for Medical Image Classification.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals