Graph Neural Networks for Learning Molecular Excitation Spectra.

Journal: Journal of chemical theory and computation
Published Date:

Abstract

Machine learning (ML) approaches have demonstrated the ability to predict molecular spectra at a fraction of the computational cost of traditional theoretical chemistry methods while maintaining high accuracy. Graph neural networks (GNNs) are particularly promising in this regard, but different types of GNNs have not yet been systematically compared. In this work, we benchmark and analyze five different GNNs for the prediction of excitation spectra from the QM9 dataset of organic molecules. We compare the GNN performance in the obvious runtime measurements, prediction accuracy, and analysis of outliers in the test set. Moreover, through TMAP clustering and statistical analysis, we are able to highlight clear hotspots of high prediction errors as well as optimal spectra prediction for molecules with certain functional groups. This in-depth benchmarking and subsequent analysis protocol lays down a recipe for comparing different ML methods and evaluating dataset quality.

Authors

  • Kanishka Singh
    Helmholtz-Zentrum Berlin für Materialien und Energie GmbH, Hahn-Meitner-Platz 1, Berlin 10409, Germany.
  • Jannes Münchmeyer
    Computer Science Department, Humboldt-Universität zu Berlin, Berlin 10099, Germany.
  • Leon Weber
    Computer Science Department, Humboldt-Universität zu Berlin, Berlin, Germany.
  • Ulf Leser
    Humboldt-Universität zu Berlin, Knowledge Management in Bioinformatics, Berlin, Germany.
  • Annika Bande
    Helmholtz-Zentrum Berlin für Materialien und Energie GmbH, Hahn-Meitner-Platz 1, Berlin 10409, Germany.