De Novo Molecular Design by Combining Deep Autoencoder Recurrent Neural Networks with Generative Topographic Mapping.

Journal: Journal of chemical information and modeling
Published Date:

Abstract

Here we show that Generative Topographic Mapping (GTM) can be used to explore the latent space of the SMILES-based autoencoders and generate focused molecular libraries of interest. We have built a sequence-to-sequence neural network with Bidirectional Long Short-Term Memory layers and trained it on the SMILES strings from ChEMBL23. Very high reconstruction rates of the test set molecules were achieved (>98%), which are comparable to the ones reported in related publications. Using GTM, we have visualized the autoencoder latent space on the two-dimensional topographic map. Targeted map zones can be used for generating novel molecular structures by sampling associated latent space points and decoding them to SMILES. The sampling method based on a genetic algorithm was introduced to optimize compound properties "on the fly". The generated focused molecular libraries were shown to contain original and a priori feasible compounds which, pending actual synthesis and testing, showed encouraging behavior in independent structure-based affinity estimation procedures (pharmacophore matching, docking).

Authors

  • Boris Sattarov
    Laboratory of Chemoinformatics , UMR 7177 University of Strasbourg/CNRS , 4 rue B. Pascal , 67000 Strasbourg , France.
  • Igor I Baskin
    a Faculty of Physics , M.V. Lomonosov Moscow State University , Moscow , Russia.
  • Dragos Horvath
    Laboratoire de Chemoinformatique, UMR 7140, Université de Strasbourg , 1 rue Blaise Pascal, Strasbourg 67000, France.
  • Gilles Marcou
    Laboratoire de Chemoinformatique, UMR 7140, Université de Strasbourg , 1 rue Blaise Pascal, Strasbourg 67000, France.
  • Esben Jannik Bjerrum
    Wildcard Pharmaceutical Consulting, Zeaborg Science Center, Frødings Allé 41 , 2860 Søborg , Denmark.
  • Alexandre Varnek
    Laboratoire de Chemoinformatique, UMR 7140, Université de Strasbourg , 1 rue Blaise Pascal, Strasbourg 67000, France.