Exploring graph-based models for predicting active compounds against triple-negative breast cancer.

Journal: Molecular diversity

Published Date: Jul 9, 2025

Abstract

Breast cancer is among the most dominant and rapidly rising cancers, both in India and around the world. Triple-negative breast cancer (TNBC) is one of the most aggressive subtypes of breast cancer, distinguished by the absence of HER2, progesterone, and estrogen receptor expressions. This absence limits treatment options, emphasizing the urgent need to discover or design new drug candidates for TNBC. Integrating artificial intelligence and machine learning in computational modeling, has significantly accelerated the analysis of large-scale biological data and improved the prediction of therapeutic outcomes. In this study, we curated a data set of 756 mutant-type compounds from three cell lines and developed four graph-based models to predict active compounds against TNBC. Validated using stratified nested tenfold cross-validation and optimized with the Optuna framework, the models achieved predictive accuracy with AUC values of 0.65-0.82, with the MPNN model outperforming all the others. Furthermore, key structural fragments associated with cell inhibition and model predictions were identified and interpreted using several explainability techniques. Validation with an external set of FDA-approved drugs demonstrated prediction accuracies ranging from 66% to 97%, highlighting the robustness of the models in identifying compounds with potential inhibitory activity against TNBC cells.

Authors

Hridoy Jyoti Mahanta

Advanced Computation and Data Sciences Division, CSIR- North East Institute of Science and Technology, Jorhat, 785006, Assam, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, 201002, Uttar Pradesh, India.
Amarjeet Boruah

School of Computing Sciences, Assam Kaziranga University, Jorhat, 785006, Assam, India.
Bikram Phukan

Advanced Computation and Data Sciences Division, CSIR-North East Institute of Science and Technology, Jorhat, 785006, Assam, India.
Hillul Chutia

CSIR-North East Institute of Science and Technology, Jorhat 785006, India.
Pankaj Bharali

Centre for Infectious Diseases, CSIR North East Institute of Science and Technology, Jorhat, Assam, 785006, India; Academy of Scientific and Innovation Research (AcSIR), Ghaziabad, Uttar Pradesh, 201002, India.
Selvaraman Nagamani

CSIR-North East Institute of Science and Technology, Jorhat 785006, India.

Keywords

No keywords available for this article.

External Resources

View on PubMed Access via DOI PubMed (40632362)

Exploring graph-based models for predicting active compounds against triple-negative breast cancer.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals

Exploring graph-based models for predicting active compounds against triple-negative breast cancer.

Abstract

Authors

Keywords

External Resources

Stay Ahead of Medical AI

Popular Topics

Recent Journals