A microbial knowledge graph-based deep learning model for predicting candidate microbes for target hosts.

Journal: Briefings in bioinformatics
Published Date:

Abstract

Predicting interactions between microbes and hosts plays critical roles in microbiome population genetics and microbial ecology and evolution. How to systematically characterize the sophisticated mechanisms and signal interplay between microbes and hosts is a significant challenge for global health risks. Identifying microbe-host interactions (MHIs) can not only provide helpful insights into their fundamental regulatory mechanisms, but also facilitate the development of targeted therapies for microbial infections. In recent years, computational methods have become an appealing alternative due to the high risk and cost of wet-lab experiments. Therefore, in this study, we utilized rich microbial metagenomic information to construct a novel heterogeneous microbial network (HMN)-based model named KGVHI to predict candidate microbes for target hosts. Specifically, KGVHI first built a HMN by integrating human proteins, viruses and pathogenic bacteria with their biological attributes. Then KGVHI adopted a knowledge graph embedding strategy to capture the global topological structure information of the whole network. A natural language processing algorithm is used to extract the local biological attribute information from the nodes in HMN. Finally, we combined the local and global information and fed it into a blended deep neural network (DNN) for training and prediction. Compared to state-of-the-art methods, the comprehensive experimental results show that our model can obtain excellent results on the corresponding three MHI datasets. Furthermore, we also conducted two pathogenic bacteria case studies to further indicate that KGVHI has excellent predictive capabilities for potential MHI pairs.

Authors

  • Jie Pan
  • Zhen Zhang
    School of Pharmacy, Jining Medical University, Rizhao, Shandong, China.
  • Ying Li
    School of Information Engineering, Chang'an University, Xi'an 710010, China.
  • Jiaoyang Yu
    Key Laboratory of Resources Biology and Biotechnology in Western China, Ministry of Education, Provincial Key Laboratory of Biotechnology of Shaanxi Province, College of Life Sciences, Northwest University, Xi'an 710069, China.
  • Zhuhong You
  • Chenyu Li
    Key Laboratory of Resources Biology and Biotechnology in Western China, Ministry of Education, Provincial Key Laboratory of Biotechnology of Shaanxi Province, College of Life Sciences, Northwest University, Xi'an 710069, China.
  • Shixu Wang
    Department of Endoscopy,National Cancer Center,National Clinical Research Center for Cancer,Cancer Hospital,Chinese Academy of Medical Sciences and Peking Union Medical College,Beijing.
  • Minghui Zhu
    School of Electrical Engineering and Computer Engineering, The Pennsylvania State University, State College, PA, 16802, USA. Electronic address: muz16@psu.edu.
  • Fengzhi Ren
    North China Pharmaceutical Group, Shijiazhuang 050015, Hebei, China.
  • Xuexia Zhang
    North China Pharmaceutical Group, Shijiazhuang 050015, Hebei, China.
  • Yanmei Sun
    Key Laboratory of Resources Biology and Biotechnology in Western China, Ministry of Education, Provincial Key Laboratory of Biotechnology of Shaanxi Province, College of Life Sciences, Northwest University, Xi'an 710069, China.
  • Shiwei Wang
    PTN Graduate Program, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, 100871, PR China.