Big Data Meets Quantum Chemistry Approximations: The Δ-Machine Learning Approach.

Journal: Journal of chemical theory and computation
PMID:

Abstract

Chemically accurate and comprehensive studies of the virtual space of all possible molecules are severely limited by the computational cost of quantum chemistry. We introduce a composite strategy that adds machine learning corrections to computationally inexpensive approximate legacy quantum methods. After training, highly accurate predictions of enthalpies, free energies, entropies, and electron correlation energies are possible, for significantly larger molecular sets than used for training. For thermochemical properties of up to 16k isomers of C7H10O2 we present numerical evidence that chemical accuracy can be reached. We also predict electron correlation energy in post Hartree-Fock methods, at the computational cost of Hartree-Fock, and we establish a qualitative relationship between molecular entropy and electron correlation. The transferability of our approach is demonstrated, using semiempirical quantum chemistry and machine learning models trained on 1 and 10% of 134k organic molecules, to reproduce enthalpies of all remaining molecules at density functional theory level of accuracy.

Authors

  • Raghunathan Ramakrishnan
    Institute of Physical Chemistry and National Center for Computational Design and Discovery of Novel Materials, Department of Chemistry, University of Basel , Klingelbergstraße 80, CH-4056 Basel, Switzerland.
  • Pavlo O Dral
    Max-Planck-Institut für Kohlenforschung, Kaiser-Wilhelm-Platz 1, 45470 Mülheim an der Ruhr, Germany.
  • Matthias Rupp
    Institute of Physical Chemistry and National Center for Computational Design and Discovery of Novel Materials, Department of Chemistry, University of Basel , Klingelbergstraße 80, CH-4056 Basel, Switzerland.
  • O Anatole von Lilienfeld
    Acceleration Consortium and Vector Institute for Artificial Intelligence Toronto ON M5S 1M1 Canada.