Regulators of homologous recombination deficiency identified by machine learning using somatic multi-omics data.

Journal: Life science alliance
Published Date:

Abstract

Homologous recombination deficiency (HRD) is a critical biomarker for guiding targeted therapies, yet the full range of somatic alterations driving HRD across cancers remains incompletely characterized. Here, we present a tumor-agnostic machine learning framework that integrates somatic multi-omics data, including copy-number variations, single-nucleotide variants, DNA methylation, and gene expression from over 8,000 patients in The Cancer Genome Atlas. Using a genome-wide mutational signature-based HRD score as ground truth, our model achieved high predictive performance and leveraged SHAP-based explainability to uncover HRD regulators beyond BRCA1/2 Cross-tumor analysis revealed both shared and cancer type-specific molecular determinants, whereas functional enrichment highlighted key molecular and cellular processes. These findings expand the known repertoire of HRD-associated alterations, provide a resource for mechanistic investigation, and demonstrate the potential of integrative AI approaches to improve patient stratification for HR-targeted therapies across diverse malignancies.

Authors

  • Renan Valieris
    Laboratory of Computational Biology and Bioinformatics, CIPE/A.C.Camargo Cancer Center, São Paulo, São Paulo, 01508-010, Brazil.
  • Lucas Rosa
    Clinical and Functional Genomics Group, A.C. Camargo Cancer Center, São Paulo, Brazil.
  • Luan Martins
    Laboratory of Computational Biology and Bioinformatics, CIPE/A.C.Camargo Cancer Center, São Paulo, São Paulo, 01508-010, Brazil.
  • Alexandre Defelicibus
    Laboratory of Computational Biology and Bioinformatics, CIPE/A.C.Camargo Cancer Center, São Paulo, São Paulo, 01508-010, Brazil.
  • Dirce Maria Carraro
    Clinical and Functional Genomics Group, A.C. Camargo Cancer Center, São Paulo, Brazil.
  • Diana Noronha Nunes
    Division of Cancer Biology, Department of Radiation Oncology, Rutgers New Jersey Medical School, Newark, NJ, USA.
  • Emmanuel Dias-Neto
    Laboratory Medical Genomics, CIPE/A.C.Camargo Cancer Center, São Paulo, São Paulo, 01508-010, Brazil.
  • Rafael Rosales
    Departamento de Computação e Matemática, Universidade de São Paulo, São Paulo, Brazil.
  • Israel Tojal da Silva
    Laboratory of Computational Biology and Bioinformatics, A.C. Camargo Cancer Center, São Paulo, Brazil [email protected].