Optimizing drug design by merging generative AI with a physics-based active learning framework.

Journal: Communications chemistry
Published Date:

Abstract

Machine learning is transforming drug discovery, with generative models (GMs) gaining attention for their ability to design molecules with specific properties. However, GMs often struggle with target engagement, synthetic accessibility, or generalization. To address these, we developed a GM workflow integrating a variational autoencoder with two nested active learning cycles. These iteratively refine their predictions using chemoinformatics and molecular modeling predictors. We tested our workflow on two systems, CDK2 and KRAS, successfully generating diverse, drug-like molecules with high predicted affinity and synthesis accessibility. Notably, we generated novel scaffolds distinct from those known for each target. For CDK2, we synthetized 9 molecules yielding 8 with in vitro activity, including one with nanomolar potency. For KRAS, in silico methods validated by CDK2 assays identified 4 molecules with potential activity. These findings showcase our GM workflow's ability to explore novel chemical spaces tailored for specific targets, thereby opening new avenues in drug discovery.

Authors

  • Isaac Filella-Merce
    Barcelona Supercomputing Center (BSC), Barcelona, Spain.
  • Alexis Molina
    Nostrum Biodiscovery S.L., 08029 Barcelona, Spain.
  • Lucía Díaz
    NOSTRUM BIODISCOVERY S.L., E-08029 Barcelona, Spain.
  • Marek Orzechowski
    RA Capital, Boston, MA, USA.
  • Yamina A Berchiche
    Superluminal Medicines, Boston, MA, USA.
  • Yang Ming Zhu
    RA Capital, Boston, MA, USA.
  • Júlia Vilalta-Mor
    Barcelona Supercomputing Center (BSC), Barcelona, Spain.
  • Laura Malo
    Nostrum Biodiscovery, Barcelona, Spain.
  • Ajay S Yekkirala
    RA Capital, Boston, MA, USA. ajay.yekkirala@superluminalrx.com.
  • Soumya Ray
  • Victor Guallar
    NOSTRUM BIODISCOVERY S.L., E-08029 Barcelona, Spain.

Keywords

No keywords available for this article.