Spatially Resolved Uncertainties for Machine Learning Potentials.

Journal: Journal of chemical information and modeling
Published Date:

Abstract

Machine learning potentials have become an essential tool for atomistic simulations, yielding results close to ab initio simulations at a fraction of computational cost. With recent improvements on the achievable accuracies, the focus has now shifted on the data set composition itself. The reliable identification of erroneously predicted configurations to extend a given data set is therefore of high priority. Yet, uncertainty estimation techniques have achieved mixed results for machine learning potentials. Consequently, a general and versatile method to correlate energy or atomic force uncertainties with the model error has remained elusive to date. In the current work, we show that epistemic uncertainty cannot correlate with model error by definition but can be aggregated over groups of atoms to yield a strong correlation. We demonstrate that our method correctly estimates prediction errors both globally per structure and locally resolved per atom. The direct correlation of local uncertainty and local error is used to design an active learning framework based on identifying local subregions of a large simulation cell and performing ab initio calculations only for the subregion subsequently. We successfully utilized this method to perform active learning in the low-data regime for liquid water.

Authors

  • Esther Heid
    Department of Chemical Engineering, Massachusetts Institute of Technology 77 Massachusetts Avenue Cambridge MA 02139 USA whgreen@mit.edu kfjensen@mit.edu.
  • Johannes Schörghuber
    Institute of Materials Chemistry, TU Wien, A-1060 Vienna, Austria.
  • Ralf Wanzenböck
    Institute of Materials Chemistry, TU Wien, A-1060 Vienna, Austria.
  • Georg K H Madsen
    Institute of Materials Chemistry , TU Wien , A-1060 Vienna , Austria.