Graph embeddings on gene ontology annotations for protein-protein interaction prediction.

Journal: BMC bioinformatics
Published Date:

Abstract

BACKGROUND: Protein-protein interaction (PPI) prediction is an important task towards the understanding of many bioinformatics functions and applications, such as predicting protein functions, gene-disease associations and disease-drug associations. However, many previous PPI prediction researches do not consider missing and spurious interactions inherent in PPI networks. To address these two issues, we define two corresponding tasks, namely missing PPI prediction and spurious PPI prediction, and propose a method that employs graph embeddings that learn vector representations from constructed Gene Ontology Annotation (GOA) graphs and then use embedded vectors to achieve the two tasks. Our method leverages on information from both term-term relations among GO terms and term-protein annotations between GO terms and proteins, and preserves properties of both local and global structural information of the GO annotation graph.

Authors

  • Xiaoshi Zhong
    School of Computer Science and Engineering, Nanyang Technological University, Singapore, Singapore. xszhong@ntu.edu.sg.
  • Jagath C Rajapakse
    Bioinformatics Research Center, School of Computer Engineering, Nanyang Technological University, Singapore; Singapore-MIT Alliance, Singapore; Department of Biological Engineering, Massachusetts Institute of Technology, USA.