PCLSurv: a prototypical contrastive learning-based multi-omics data integration model for cancer survival prediction.

Journal: Briefings in bioinformatics
PMID:

Abstract

Accurate cancer survival prediction remains a critical challenge in clinical oncology, largely due to the complex and multi-omics nature of cancer data. Existing methods often struggle to capture the comprehensive range of informative features required for precise predictions. Here, we introduce PCLSurv, an innovative deep learning framework designed for cancer survival prediction using multi-omics data. PCLSurv integrates autoencoders to extract omics-specific features and employs sample-level contrastive learning to identify distinct yet complementary characteristics across data views. Then, features are fused via a bilinear fusion module to construct a unified representation. To further enhance the model's capacity to capture high-level semantic relationships, PCLSurv aligns similar samples with shared prototypes while separating unrelated ones via prototypical contrastive learning. As a result, PCLSurv effectively distinguishes patient groups with varying survival outcomes at different semantic similarity levels, providing a robust framework for stratifying patients based on clinical and molecular features. We conduct extensive experiments on 11 cancer datasets. The comparison results confirm the superior performance of PCLSurv over existing alternatives. The source code of PCLSurv is freely available at https://github.com/LiangSDNULab/PCLSurv.

Authors

  • Zhimin Li
    Department of Dermatology, Air Force Medical Center, PLA, Beijing, People's Republic of China.
  • Wenlan Chen
    School of Computer Science and Engineering, Central South University, Changsha 410083, China.
  • Hai Zhong
    The Second Hospital of Shandong University, Cheeloo College of Medicine, Shandong University, Jinan 250000, China. Electronic address: 18753107255@163.com.
  • Cheng Liang