MolGraph: a Python package for the implementation of molecular graphs and graph neural networks with TensorFlow and Keras.

Journal: Journal of computer-aided molecular design
Published Date:

Abstract

Molecular machine learning (ML) has proven important for tackling various molecular problems, such as predicting molecular properties based on molecular descriptors or fingerprints. Since relatively recently, graph neural network (GNN) algorithms have been implemented for molecular ML, showing comparable or superior performance to descriptor or fingerprint-based approaches. Although various tools and packages exist to apply GNNs in molecular ML, a new GNN package, named MolGraph, was developed in this work with the motivation to create GNN model pipelines highly compatible with the TensorFlow and Keras application programming interface (API). MolGraph also implements a module to accommodate the generation of small molecular graphs, which can be passed to a GNN algorithm to solve a molecular ML problem. To validate the GNNs, benchmarking was conducted using the datasets from MoleculeNet, as well as three chromatographic retention time datasets. The benchmarking results demonstrate that the GNNs performed in line with expectations. Additionally, the GNNs proved useful for molecular identification and improved interpretability of chromatographic retention time data. MolGraph is available at https://github.com/akensert/molgraph . Installation, tutorials and implementation details can be found at  https://molgraph.readthedocs.io/en/latest/ .

Authors

  • Alexander Kensert
    1 Department of Pharmaceutical Biosciences, Uppsala University, Uppsala, Sweden.
  • Gert Desmet
    Department of Chemical Engineering, Vrije Universiteit Brussel, Pleinlaan 2, Brussel 1050, Belgium.
  • Deirdre Cabooter
    Department for Pharmaceutical and Pharmacological Sciences, University of Leuven (KU Leuven), Pharmaceutical Analysis, Herestraat 49, Leuven 3000, Belgium.