NegGOA: negative GO annotations selection using ontology structure.

Journal: Bioinformatics (Oxford, England)
Published Date:

Abstract

MOTIVATION: Predicting the biological functions of proteins is one of the key challenges in the post-genomic era. Computational models have demonstrated the utility of applying machine learning methods to predict protein function. Most prediction methods explicitly require a set of negative examples-proteins that are known not carrying out a particular function. However, Gene Ontology (GO) almost always only provides the knowledge that proteins carry out a particular function, and functional annotations of proteins are incomplete. GO structurally organizes more than tens of thousands GO terms and a protein is annotated with several (or dozens) of these terms. For these reasons, the negative examples of a protein can greatly help distinguishing true positive examples of the protein from such a large candidate GO space.

Authors

  • Guangyuan Fu
    College of Computer and Information Science, Southwest University, Chongqing 400715, China.
  • Jun Wang
    Department of Speech, Language, and Hearing Sciences and the Department of Neurology, The University of Texas at Austin, Austin, TX 78712, USA.
  • Bo Yang
    Center for Cognition and Brain Disorders, Hangzhou Normal University, Hangzhou, Zhejiang Province 311121, China.
  • Guoxian Yu
    College of Computer and Information Science, Southwest University, Chongqing 400715, China Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, China.