MeD-3D: A Multimodal Deep Learning Framework for Precise Recurrence Prediction in Clear Cell Renal Cell Carcinoma (ccRCC)
Journal:
arXiv
Published Date:
Jul 10, 2025
Abstract
Accurate prediction of recurrence in clear cell renal cell carcinoma (ccRCC)
remains a major clinical challenge due to the disease complex molecular,
pathological, and clinical heterogeneity. Traditional prognostic models, which
rely on single data modalities such as radiology, histopathology, or genomics,
often fail to capture the full spectrum of disease complexity, resulting in
suboptimal predictive accuracy. This study aims to overcome these limitations
by proposing a deep learning (DL) framework that integrates multimodal data,
including CT, MRI, histopathology whole slide images (WSI), clinical data, and
genomic profiles, to improve the prediction of ccRCC recurrence and enhance
clinical decision-making. The proposed framework utilizes a comprehensive
dataset curated from multiple publicly available sources, including TCGA, TCIA,
and CPTAC. To process the diverse modalities, domain-specific models are
employed: CLAM, a ResNet50-based model, is used for histopathology WSIs, while
MeD-3D, a pre-trained 3D-ResNet18 model, processes CT and MRI images. For
structured clinical and genomic data, a multi-layer perceptron (MLP) is used.
These models are designed to extract deep feature embeddings from each
modality, which are then fused through an early and late integration
architecture. This fusion strategy enables the model to combine complementary
information from multiple sources. Additionally, the framework is designed to
handle incomplete data, a common challenge in clinical settings, by enabling
inference even when certain modalities are missing.