Accurate prediction of protein-protein interactions by integrating potential evolutionary information embedded in PSSM profile and discriminative vector machine classifier.

Journal: Oncotarget
Published Date:

Abstract

Identification of protein-protein interactions (PPIs) is of critical importance for deciphering the underlying mechanisms of almost all biological processes of cell and providing great insight into the study of human disease. Although much effort has been devoted to identifying PPIs from various organisms, existing high-throughput biological techniques are time-consuming, expensive, and have high false positive and negative results. Thus it is highly urgent to develop in silico methods to predict PPIs efficiently and accurately in this post genomic era. In this article, we report a novel computational model combining our newly developed discriminative vector machine classifier (DVM) and an improved Weber local descriptor (IWLD) for the prediction of PPIs. Two components, differential excitation and orientation, are exploited to build evolutionary features for each protein sequence. The main characteristics of the proposed method lies in introducing an effective feature descriptor IWLD which can capture highly discriminative evolutionary information from position-specific scoring matrixes (PSSM) of protein data, and employing the powerful and robust DVM classifier. When applying the proposed method to Yeast and H. pylori data sets, we obtained excellent prediction accuracies as high as 96.52% and 91.80%, respectively, which are significantly better than the previous methods. Extensive experiments were then performed for predicting cross-species PPIs and the predictive results were also pretty promising. To further validate the performance of the proposed method, we compared it with the state-of-the-art support vector machine (SVM) classifier on Human data set. The experimental results obtained indicate that our method is highly effective for PPIs prediction and can be taken as a supplementary tool for future proteomics research.

Authors

  • Zheng-Wei Li
    School of Computer Science and Technology, China University of Mining and Technology, Xuzhou 21116, China.
  • Zhu-Hong You
    Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi, 830011, China. zhuhongyou@ms.xjb.ac.cn.
  • Xing Chen
    School of Information and Electrical Engineering, China University of Mining and Technology, Xuzhou, 221116, China. xingchen@amss.ac.cn.
  • Li-Ping Li
    Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Science, Urumqi 830011, China.
  • De-Shuang Huang
  • Gui-Ying Yan
    Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, 100010, China.
  • Ru Nie
    School of Computer Science and Technology, China University of Mining and Technology, Xuzhou 221116, China.
  • Yu-An Huang
    Department of Computing, Hong Kong Polytechnic University, Hung Hom, Hong Kong, China.