Revisiting Invariant Learning for Out-of-Domain Generalization on Multi-Site Mammogram Datasets
Journal:
arXiv
Published Date:
Mar 9, 2025
Abstract
Despite significant progress in robust deep learning techniques for mammogram
breast cancer classification, their reliability in real-world clinical
development settings remains uncertain. The translation of these models to
clinical practice faces challenges due to variations in medical centers,
imaging protocols, and patient populations. To enhance their robustness,
invariant learning methods have been proposed, prioritizing causal factors over
misleading features. However, their effectiveness in clinical development and
impact on mammogram classification require investigation. This paper reassesses
the application of invariant learning for breast cancer risk estimation based
on mammograms. Utilizing diverse multi-site public datasets, it represents the
first study in this area. The objective is to evaluate invariant learning's
benefits in developing robust models. Invariant learning methods, including
Invariant Risk Minimization and Variance Risk Extrapolation, are compared
quantitatively against Empirical Risk Minimization. Evaluation metrics include
accuracy, average precision, and area under the curve. Additionally,
interpretability is examined through class activation maps and visualization of
learned representations. This research examines the advantages, limitations,
and challenges of invariant learning for mammogram classification, guiding
future studies to develop generalized methods for breast cancer prediction on
whole mammograms in out-of-domain scenarios.