Improving target-disease association prediction through a graph neural network with credibility information.

Journal: Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing
Published Date:

Abstract

Identifying effective target-disease associations (TDAs) can alleviate the tremendous cost incurred by clinical failures of drug development. Although many machine learning models have been proposed to predict potential novel TDAs rapidly, their credibility is not guaranteed, thus requiring extensive experimental validation. In addition, it is generally challenging for current models to predict meaningful associations for entities with less information, hence limiting the application potential of these models in guiding future research. Based on recent advances in utilizing graph neural networks to extract features from heterogeneous biological data, we develop CreaTDA, an end-to-end deep learning-based framework that effectively learns latent feature representations of targets and diseases to facilitate TDA prediction. We also propose a novel way of encoding credibility information obtained from literature to enhance the performance of TDA prediction and predict more novel TDAs with real evidence support from previous studies. Compared with state-of-the-art baseline methods, CreaTDA achieves substantially better prediction performance on the whole TDA network and its sparse sub-networks containing the proteins associated with few known diseases. Our results demonstrate that CreaTDA can provide a powerful and helpful tool for identifying novel target-disease associations, thereby facilitating drug discovery.

Authors

  • Chang Liu
    Key Lab of Cell Differentiation and Apoptosis of Ministry of Education, Shanghai Jiao Tong University School of Medicine, Shanghai, China.
  • Cuinan Yu
  • Yipin Lei
    Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing, 100084, China.
  • Kangbo Lyu
  • Tingzhong Tian
    Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing, China.
  • Qianhao Li
    Shanghai Key Laboratory of Multidimensional Information Processing, Department of Electronic Engineering, East China Normal University, Shanghai 200241, China. 51171213058@stu.ecnu.edu.cn.
  • Dan Zhao
    Key Laboratory of Hunan Province for Water Environment and Agriculture Product Safety, College of Chemistry and Chemical Engineering, Central South University, Changsha, 410083, China.
  • Fengfeng Zhou
  • Jianyang Zeng
    Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing 100084, China; MOE Key Laboratory of Bioinformatics, Tsinghua University, Beijing 100084, China. Electronic address: zengjy321@tsinghua.edu.cn.