Molecular graph convolutions: moving beyond fingerprints.

Journal: Journal of computer-aided molecular design
Published Date:

Abstract

Molecular "fingerprints" encoding structural information are the workhorse of cheminformatics and machine learning in drug discovery applications. However, fingerprint representations necessarily emphasize particular aspects of the molecular structure while ignoring others, rather than allowing the model to make data-driven decisions. We describe molecular graph convolutions, a machine learning architecture for learning from undirected graphs, specifically small molecules. Graph convolutions use a simple encoding of the molecular graph-atoms, bonds, distances, etc.-which allows the model to take greater advantage of information in the graph structure. Although graph convolutions do not outperform all fingerprint-based methods, they (along with other graph-based methods) represent a new paradigm in ligand-based virtual screening with exciting opportunities for future improvement.

Authors

  • Steven Kearnes
    Stanford University, 318 Campus Dr. S296, Stanford, CA, 94305, USA. kearnes@stanford.edu.
  • Kevin McCloskey
    Google Inc., 1600 Amphitheatre Pkwy, Mountain View, CA, 94043, USA.
  • Marc Berndl
    Google Inc., 1600 Amphitheatre Pkwy, Mountain View, CA, 94043, USA.
  • Vijay Pande
    Department of Chemistry , Stanford University , Stanford , CA 94305 , USA . Email: pande@stanford.edu.
  • Patrick Riley
    Google Inc., 1600 Amphitheatre Pkwy, Mountain View, CA, 94043, USA.