Informing geometric deep learning with electronic interactions to accelerate quantum chemistry.

Journal: Proceedings of the National Academy of Sciences of the United States of America
Published Date:

Abstract

Predicting electronic energies, densities, and related chemical properties can facilitate the discovery of novel catalysts, medicines, and battery materials. However, existing machine learning techniques are challenged by the scarcity of training data when exploring unknown chemical spaces. We overcome this barrier by systematically incorporating knowledge of molecular electronic structure into deep learning. By developing a physics-inspired equivariant neural network, we introduce a method to learn molecular representations based on the electronic interactions among atomic orbitals. Our method, OrbNet-Equi, leverages efficient tight-binding simulations and learned mappings to recover high-fidelity physical quantities. OrbNet-Equi accurately models a wide spectrum of target properties while being several orders of magnitude faster than density functional theory. Despite only using training samples collected from readily available small-molecule libraries, OrbNet-Equi outperforms traditional semiempirical and machine learning-based methods on comprehensive downstream benchmarks that encompass diverse main-group chemical processes. Our method also describes interactions in challenging charge-transfer complexes and open-shell systems. We anticipate that the strategy presented here will help to expand opportunities for studies in chemistry and materials science, where the acquisition of experimental or reference training data is costly.

Authors

  • Zhuoran Qiao
    Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA 91125.
  • Anders S Christensen
    Entos, Inc., Los Angeles, CA 90027.
  • Matthew Welborn
    Entos, Inc., Los Angeles, CA 90027.
  • Frederick R Manby
    Entos, Inc., Los Angeles, CA 90027.
  • Anima Anandkumar
    Department of Computing and Mathematical Science, California Institute of Technology, Pasadena, California; NVIDIA Corporation, Santa Clara, California.
  • Thomas F Miller
    Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA 91125.