Topological regression as an interpretable and efficient tool for quantitative structure-activity relationship modeling.

Journal: Nature communications
PMID:

Abstract

Quantitative structure-activity relationship (QSAR) modeling is a powerful tool for drug discovery, yet the lack of interpretability of commonly used QSAR models hinders their application in molecular design. We propose a similarity-based regression framework, topological regression (TR), that offers a statistically grounded, computationally fast, and interpretable technique to predict drug responses. We compare the predictive performance of TR on 530 ChEMBL human target activity datasets against the predictive performance of deep-learning-based QSAR models. Our results suggest that our sparse TR model can achieve equal, if not better, performance than the deep learning-based QSAR models and provide better intuitive interpretation by extracting an approximate isometry between the chemical space of the drugs and their activity space.

Authors

  • Ruibo Zhang
    Department of Electrical and Computer Engineering, Texas Tech University, 1012 Boston Ave, Lubbock, TX, 79409, USA.
  • Daniel Nolte
    Department of Electrical and Computer Engineering, Texas Tech University, Lubbock, TX, 79409, USA.
  • Cesar Sanchez-Villalobos
    Department of Electrical and Computer Engineering, Texas Tech University, Lubbock, TX, 79409, USA.
  • Souparno Ghosh
    Department of Mathematics and Statistics, Texas Tech University, 1108 Memorial Circle, Lubbock, TX, 79409, USA.
  • Ranadip Pal
    Department of Electrical and Computer Engineering, Texas Tech University, Lubbock, TX, 79409, USA. Ranadip.Pal@ttu.edu.