Uncovering subtype-specific metabolic signatures in breast cancer through multimodal integration, attention-based deep learning, and self-organizing maps.

Journal: Scientific reports
Published Date:

Abstract

This study integrates multimodal metabolomic data from three platforms-LC-MS, GC-MS, and NMR-to systematically identify biomarkers distinguishing breast cancer subtypes. A feedforward attention-based deep learning model effectively selected 99 significant metabolites, outperforming traditional static methods in classification performance and biomarker consistency. By combining data from diverse platforms, the approach captured a comprehensive metabolic profile while maintaining biological relevance. Self-organizing map analysis revealed distinct metabolic signatures for each subtype, highlighting critical pathways. Group 1 (ER/PR-positive, HER2-negative) exhibited elevated serine, tyrosine, and 2-aminoadipic acid levels, indicating enhanced amino acid metabolism supporting nucleotide synthesis and redox balance. Group 3 (triple-negative breast cancer) displayed increased TCA cycle intermediates, such as α-ketoglutarate and malate, reflecting a metabolic shift toward energy production and biosynthesis to sustain aggressive proliferation. In Group 4 (HER2-enriched), elevated phosphatidylcholines and phosphatidylethanolamines suggested upregulated mono-unsaturated phospholipid biosynthesis. The study provides a framework for leveraging multimodal data integration, attention-based feature selection, and self-organizing map analysis to identify biologically meaningful biomarkers.

Authors

  • Parisa Shahnazari
    Laboratory of Complex Biological Systems and Bioinformatics (CBB), Department of Bioinformatics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran, Iran.
  • Kaveh Kavousi
    Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran, Iran.
  • Hamid Reza Khorram Khorshid
    Genetics Research Center, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran.
  • Zarrin Minuchehr
    Department of Systems Biotechnology, National Institute of Genetic Engineering and Biotechnology, Tehran, Iran.
  • Bahram Goliaei
    Institute of Biochemistry and Biophysics, Laboratory of Biophysics and Molecular Biology, Institute of Biochemistry and Biophysics (IBB), University of Tehran, University of Tehran, Tehran, Iran. goliaei@ut.ac.ir.
  • Reza M Salek
    School of Clinical Medicine, University of Cambridge, Cambridge Biomedical Campus, Cambridge, CB2 0SP, UK. rms72@cam.ac.uk.