Combination of explainable machine learning and conceptual density functional theory: applications for the study of key solvation mechanisms.

Journal: Physical chemistry chemical physics : PCCP
Published Date:

Abstract

We present explainable machine learning approaches for the accurate prediction and understanding of solvation free energies, enthalpies, and entropies for different salts in various protic and aprotic solvents. As key input features, we use fundamental contributions from the conceptual density functional theory (DFT) of solutions. The most accurate models with the highest prediction accuracy for the experimental validation data set are decision tree-based approaches such as extreme gradient boosting and extra trees, which highlight the non-linear influence of feature values on target predictions. The detailed assessment of the importance of features in terms of Gini importance criteria as well as Shapley Additive Explanations (SHAP) and permutation and reduction approaches underlines the prominent role of anion and cation solvation effects in combination with fundamental electronic properties of the solvents. These results are reasonably consistent with previous assumptions and provide a solid rationale for more recent theoretical approaches.

Authors

  • I-Ting Ho
    Boehringer Ingelheim Pharma GmbH & Co. KG, Global Computational Biology and Digital Sciences, D-88397 Biberach (Riss), Germany.
  • Milena Matysik
    Boehringer Ingelheim GmbH & Co. KG, IT RDM Services, D-88397 Biberach (Riss), Germany.
  • Liliana Montano Herrera
    Boehringer Ingelheim Pharma GmbH & Co. KG, Global Innovation & Alliance Management, D-88397 Biberach (Riss), Germany.
  • Jiyoung Yang
    Boehringer Ingelheim Pharma GmbH & Co. KG, Analytical Development Biologicals, D-88397 Biberach (Riss), Germany.
  • Ralph Joachim Guderlei
    Boehringer Ingelheim GmbH & Co. KG, IT RDM Services, D-88397 Biberach (Riss), Germany.
  • Michael Laussegger
    Boehringer Ingelheim RCV GmbH & Co. KG, IT RDM Services, A-1121 Vienna, Austria.
  • Bernhard Schrantz
    Boehringer Ingelheim RCV GmbH & Co. KG, IT RDM Services, A-1121 Vienna, Austria.
  • Regine Hammer
    Boehringer Ingelheim GmbH & Co. KG, OneHP Business Unit, D-88397 Biberach (Riss), Germany.
  • Ramón Alain Miranda-Quintana
    Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, FL, 32603, USA.
  • Jens Smiatek
    Boehringer Ingelheim GmbH & Co. KG, Development NCE, D-88397 Biberach (Riss), Germany.