From mundane to surprising nonadditivity: drivers and impact on ML models.

Journal: Journal of computer-aided molecular design
PMID:

Abstract

Nonadditivity (NA) in Structure-Activity and Structure-Property Relationship (SAR) data is a rare but very information rich phenomenon. It can indicate conformational flexibility, structural rearrangements, and errors in assay results and structural assignment. While purely ligand-based conformational causes of NA are rather well understood and mundane, other factors are less so and cause surprising NA that has a huge influence on SAR analysis and ML model performance. We here report a systematic analysis across a wide range of properties (20 on-target biological activities and 4 physicochemical ADME-related properties) to understand the frequency of various different phenomena that may lead to NA. A set of novel descriptors were developed to characterize double transformation cycles and identify trends in NA. Double transformation cycles were classified into "surprising" and "mundane" categories, with the majority being classed as mundane. We also examined commonalities among surprising cycles, finding LogP differences to have the most significant impact on NA. A distinct behavior of NA for on-target sets compared to ADME sets was observed. Finally, we show that machine learning models struggle with highly nonadditive data, indicating that a better understanding of NA is an important future research direction.

Authors

  • Laura Guasch
    Roche Pharma Research and Early Development, Roche Innovation Center Basel, F. Hoffmann- La Roche AG, Basel, 4070, Switzerland. laura.guasch@roche.com.
  • Niels Maeder
    Department of Chemistry and Applied Biosciences, ETH Zurich, Vladimir-Prelog-Weg 2, 8093 Zurich, Switzerland.
  • John G Cumming
    Roche Pharma Research and Early Development, Roche Innovation Center Basel, F. Hoffmann- La Roche AG, Basel, 4070, Switzerland.
  • Christian Kramer
    Institute of General, Inorganic and Theoretical Chemistry, University of Innsbruck , Innrain 82, 6020 Innsbruck, Austria.