Application of machine learning models for property prediction to targeted protein degraders.

Journal: Nature communications
PMID:

Abstract

Machine learning (ML) systems can model quantitative structure-property relationships (QSPR) using existing experimental data and make property predictions for new molecules. With the advent of modalities such as targeted protein degraders (TPD), the applicability of QSPR models is questioned and ML usage in TPD-centric projects remains limited. Herein, ML models are developed and evaluated for TPDs' property predictions, including passive permeability, metabolic clearance, cytochrome P450 inhibition, plasma protein binding, and lipophilicity. Interestingly, performance on TPDs is comparable to that of other modalities. Predictions for glues and heterobifunctionals often yield lower and higher errors, respectively. For permeability, CYP3A4 inhibition, and human and rat microsomal clearance, misclassification errors into high and low risk categories are lower than 4% for glues and 15% for heterobifunctionals. For all modalities, misclassification errors range from 0.8% to 8.1%. Investigated transfer learning strategies improve predictions for heterobifunctionals. This is the first comprehensive evaluation of ML for the prediction of absorption, distribution, metabolism, and excretion (ADME) and physicochemical properties of TPD molecules, including heterobifunctional and molecular glue sub-modalities. Taken together, our investigations show that ML-based QSPR models are applicable to TPDs and support ML usage for TPDs' design, to potentially accelerate drug discovery.

Authors

  • Giulia Peteani
    Novartis Biomedical Research, Novartis Campus, 4002, Basel, Switzerland.
  • Minh Tam Davide Huynh
    Novartis Biomedical Research, Novartis Campus, 4002, Basel, Switzerland.
  • Grégori Gerebtzoff
    Novartis Institutes for BioMedical Research, NIBR Translational Medicine, Modeling and Simulations, Novartis Pharma AG, Novartis Campus, 4056, Basel, Switzerland.
  • Raquel Rodríguez-Pérez
    Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstr. 2, D-53113 Bonn, Germany.