3D Molecular Representations Based on the Wave Transform for Convolutional Neural Networks.

Journal: Molecular pharmaceutics
Published Date:

Abstract

Convolutional neural networks (CNN) have been successfully used to handle three-dimensional data and are a natural match for data with spatial structure such as 3D molecular structures. However, a direct 3D representation of a molecule with atoms localized at voxels is too sparse, which leads to poor performance of the CNNs. In this work, we present a novel approach where atoms are extended to fill other nearby voxels with a transformation based on the wave transform. Experimenting on 4.5 million molecules from the Zinc database, we show that our proposed representation leads to better performance of CNN-based autoencoders than either the voxel-based representation or the previously used Gaussian blur of atoms and then successfully apply the new representation to classification tasks such as MACCS fingerprint prediction.

Authors

  • Denis Kuzminykh
    Insilico Medicine , Baltimore , Maryland 21218 , United States.
  • Daniil Polykovskiy
    Insilico Medicine , Baltimore , Maryland 21218 , United States.
  • Artur Kadurin
    Search Department, Mail.Ru Group Ltd., Moscow, Russia.
  • Alexander Zhebrak
    Pharmaceutical Artificial Intelligence Department, Insilico Medicine, Inc., Emerging Technology Centers, Johns Hopkins University at Eastern, Baltimore, Maryland, USA.
  • Ivan Baskov
    Insilico Medicine , Baltimore , Maryland 21218 , United States.
  • Sergey Nikolenko
    Kazan (Volga Region) Federal University, Kazan, Russia.
  • Rim Shayakhmetov
    Insilico Medicine , Baltimore , Maryland 21218 , United States.
  • Alex Zhavoronkov
    Pharmaceutical Artificial Intelligence Department, Insilico Medicine, Inc., Emerging Technology Centers, Johns Hopkins University at Eastern, Baltimore, Maryland, USA.