Advancements in Ligand-Based Virtual Screening through the Synergistic Integration of Graph Neural Networks and Expert-Crafted Descriptors.

Journal: Journal of chemical information and modeling
Published Date:

Abstract

The fusion of traditional chemical descriptors with graph neural networks (GNNs) offers a compelling strategy for enhancing ligand-based virtual screening methodologies. A comprehensive evaluation revealed that the benefits derived from this integrative strategy vary significantly among different GNNs. Specifically, while GCN and SchNet demonstrate pronounced improvements by incorporating descriptors, SphereNet exhibits only marginal enhancement. Intriguingly, despite SphereNet's modest gain, all three models-GCN, SchNet, and SphereNet-achieve comparable performance levels when leveraging this combination strategy. This observation underscores a pivotal insight: sophisticated GNN architectures may be substituted with simpler counterparts without sacrificing efficacy, provided that they are augmented with descriptors. Furthermore, our analysis reveals a set of expert-crafted descriptors' robustness in scaffold-split scenarios, frequently outperforming the combined GNN-descriptor models. Given the critical importance of scaffold splitting in accurately mimicking real-world drug discovery contexts, this finding accentuates an imperative for GNN researchers to innovate models that can adeptly navigate and predict within such frameworks. Our work not only validates the potential of integrating descriptors with GNNs in advancing ligand-based virtual screening but also illuminates pathways for future enhancements in model development and application. Our implementation can be found at https://github.com/meilerlab/gnn-descriptor.

Authors

  • Yunchao Liu
    Department of Computer Science, Vanderbilt University, 1400 18th Ave S, Nashville, TN, 37212, USA.
  • Rocco Moretti
    Department of Chemistry, Vanderbilt University, Nashville, TN, USA.
  • Yu Wang
    Clinical and Technical Support, Philips Healthcare, Shanghai, China.
  • Ha Dong
    Department of Neural Science, Amherst College, 220 South Pleasant Street, Amherst, Massachusetts 01002, United States.
  • Bailu Yan
    Department of Biostatistics, Vanderbilt University, 2201 West End Ave, Nashville, Tennessee 37235, United States.
  • Bobby Bodenheimer
  • Tyler Derr
    Department of Computer Science, Data Science Institute, Vanderbilt University, 2201 West End Ave, Nashville, Tennessee 37235, United States.
  • Jens Meiler
    Department of Chemistry, Vanderbilt University, Nashville, TN, United States.