deepGraphh: AI-driven web service for graph-based quantitative structure-activity relationship analysis.

Journal: Briefings in bioinformatics
Published Date:

Abstract

Artificial intelligence (AI)-based computational techniques allow rapid exploration of the chemical space. However, representation of the compounds into computational-compatible and detailed features is one of the crucial steps for quantitative structure-activity relationship (QSAR) analysis. Recently, graph-based methods are emerging as a powerful alternative to chemistry-restricted fingerprints or descriptors for modeling. Although graph-based modeling offers multiple advantages, its implementation demands in-depth domain knowledge and programming skills. Here we introduce deepGraphh, an end-to-end web service featuring a conglomerate of established graph-based methods for model generation for classification or regression tasks. The graphical user interface of deepGraphh supports highly configurable parameter support for model parameter tuning, model generation, cross-validation and testing of the user-supplied query molecules. deepGraphh supports four widely adopted methods for QSAR analysis, namely, graph convolution network, graph attention network, directed acyclic graph and Attentive FP. Comparative analysis revealed that deepGraphh supported methods are comparable to the descriptors-based machine learning techniques. Finally, we used deepGraphh models to predict the blood-brain barrier permeability of human and microbiome-generated metabolites. In summary, deepGraphh offers a one-stop web service for graph-based methods for chemoinformatics.

Authors

  • Vishakha Gautam
    Department of Computational Biology, Indraprastha Institute of Information Technology-Delhi, Okhla, Phase III, New Delhi, Delhi, India.
  • Rahul Gupta
    National Institute of Technology, Hamirpur, Himachal Pradesh 177005, India.
  • Deepti Gupta
    Department of Computational Biology, Indraprastha Institute of Information Technology-Delhi (IIIT-Delhi), Okhla, Phase III, New Delhi-110020, India.
  • Anubhav Ruhela
    Department of Computational Biology, Indraprastha Institute of Information Technology-Delhi (IIIT-Delhi), Okhla, Phase III, New Delhi-110020, India.
  • Aayushi Mittal
    Department of Computational Biology, Indraprastha Institute of Information Technology-Delhi, Okhla, Phase III, New Delhi, Delhi, India.
  • Sanjay Kumar Mohanty
    Department of Computational Biology, Indraprastha Institute of Information Technology-Delhi, Okhla, Phase III, New Delhi, Delhi, India.
  • Sakshi Arora
    Department of Computational Biology, Indraprastha Institute of Information Technology-Delhi, Okhla, Phase III, New Delhi, Delhi, India.
  • Ria Gupta
    Department of Computational Biology, Indraprastha Institute of Information Technology-Delhi, Okhla, Phase III, New Delhi, Delhi, India.
  • Chandan Saini
    Department of Computational Biology, Indraprastha Institute of Information Technology-Delhi (IIIT-Delhi), Okhla, Phase III, New Delhi-110020, India.
  • Debarka Sengupta
    Indraprastha Institute of Technology Delhi, New Delhi, India.
  • Natarajan Arul Murugan
    Department of Theoretical Chemistry and Biology, School of Chemistry, Biotechnology and Health, KTH Royal Institute of Technology, 10691 Stockholm, Sweden.
  • Gaurav Ahuja
    Department of Computational Biology, Indraprastha Institute of Information Technology - Delhi (IIIT-D), Delhi, India.