Weakly supervised multi-modal contrastive learning framework for predicting the HER2 scores in breast cancer.
Journal:
Computerized Medical Imaging and Graphics: the official journal of the Computerized Medical Imaging Society
PMID:
39919535
Abstract
Human epidermal growth factor receptor 2 (HER2) is an important biomarker for prognosis and for predicting treatment response in breast cancer (BC). HER2 scoring is typically performed by pathologists through microscopic examination of immunohistochemistry (IHC) slides, which is labor-intensive and subject to inter-observer variability. Most existing methods use hand-crafted features or deep learning models on a single modality (hematoxylin and eosin (H&E) or IHC) to predict HER2 scores through supervised or weakly supervised learning. Consequently, complementary information from the two modalities is not effectively integrated into feature learning, even though it could improve HER2 scoring performance. In this paper, we propose a novel weakly supervised multi-modal contrastive learning (WSMCL) framework to predict HER2 scores in BC at the whole slide image (WSI) level. It leverages multi-modal (H&E and IHC) joint learning under the weak supervision of the WSI label to achieve HER2 score prediction. Specifically, patch features are extracted from the H&E and IHC WSIs, and multi-head self-attention (MHSA) is applied to model the global dependencies among the patches within each modality. The patch features with the top-k and bottom-k attention scores produced by MHSA in each modality are selected as candidates for multi-modal joint learning. In particular, a multi-modal attentive contrastive learning (MACL) module is designed to enforce semantic alignment between the candidate features from the two modalities. Extensive experiments demonstrate that the proposed WSMCL achieves better HER2 scoring performance than state-of-the-art methods. The code is available at https://github.com/HFUT-miaLab/WSMCL.
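The abstract describes a three-step pipeline: per-modality MHSA over patch features, selection of the top-k and bottom-k patches by attention score, and contrastive alignment of the selected candidates across modalities. The following is a minimal PyTorch sketch of that pipeline under stated assumptions; the feature dimension, number of heads, value of k, the per-patch scoring head, and the InfoNCE-style objective are illustrative placeholders, not the paper's exact MACL design (see the released code at the URL above for the actual implementation).

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class ModalityAttention(nn.Module):
    """Encodes a bag of patch features with multi-head self-attention (MHSA)
    and produces a per-patch score used for top-k / bottom-k selection.
    The linear scoring head is an assumption for illustration."""

    def __init__(self, dim=512, heads=8):
        super().__init__()
        self.mhsa = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.score = nn.Linear(dim, 1)

    def forward(self, x):                      # x: (1, N, dim) patch features
        h, _ = self.mhsa(x, x, x)              # global dependencies across patches
        s = self.score(h).squeeze(-1)          # (1, N) per-patch attention scores
        return h, s


def select_candidates(h, s, k):
    """Keep the k highest- and k lowest-scoring patch features."""
    top = h[0][s[0].topk(k).indices]           # (k, dim) high-attention patches
    bot = h[0][(-s[0]).topk(k).indices]        # (k, dim) low-attention patches
    return torch.cat([top, bot], dim=0)        # (2k, dim) candidate set


def contrastive_loss(z_he, z_ihc, tau=0.1):
    """InfoNCE-style alignment: the i-th H&E candidate attracts the i-th IHC
    candidate and repels the others; a common stand-in for the MACL objective."""
    z_he, z_ihc = F.normalize(z_he, dim=-1), F.normalize(z_ihc, dim=-1)
    logits = z_he @ z_ihc.t() / tau            # (2k, 2k) similarity matrix
    labels = torch.arange(z_he.size(0))
    return F.cross_entropy(logits, labels)


# Toy forward pass with random stand-ins for extracted patch features.
he_branch, ihc_branch = ModalityAttention(), ModalityAttention()
he_feats = torch.randn(1, 200, 512)            # H&E WSI: 200 patches
ihc_feats = torch.randn(1, 200, 512)           # IHC WSI: 200 patches

h_he, s_he = he_branch(he_feats)
h_ihc, s_ihc = ihc_branch(ihc_feats)
cand_he = select_candidates(h_he, s_he, k=8)
cand_ihc = select_candidates(h_ihc, s_ihc, k=8)
loss = contrastive_loss(cand_he, cand_ihc)     # aligns the two modalities
```

In practice the contrastive term would be combined with a WSI-level classification loss on the HER2 score, since the framework is trained under weak slide-level supervision; that combination is implied by the abstract but not specified there.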