Deep learning-assisted prediction of protein-protein interactions in Arabidopsis thaliana.

Journal: The Plant journal : for cell and molecular biology
Published Date:

Abstract

Currently, the experimentally identified interactome of Arabidopsis (Arabidopsis thaliana) is still far from complete, suggesting that computational prediction methods can complement experimental techniques. Motivated by the prosperity and success of deep learning algorithms and natural language processing techniques, we introduce an integrative deep learning framework, DeepAraPPI, allowing us to predict protein-protein interactions (PPIs) of Arabidopsis utilizing sequence, domain and Gene Ontology (GO) information. Our current DeepAraPPI comprises: (i) a word2vec encoding-based Siamese recurrent convolutional neural network (RCNN) model; (ii) a Domain2vec encoding-based multiple-layer perceptron (MLP) model; and (iii) a GO2vec encoding-based MLP model. Finally, DeepAraPPI combines the prediction results of the three individual predictors through a logistic regression model. Compiling high-quality positive and negative training and test samples by applying strict filtering strategies, DeepAraPPI shows superior performance compared with existing state-of-the-art Arabidopsis PPI prediction methods. DeepAraPPI also provides better cross-species predictive ability in rice (Oryza sativa) than traditional machine learning methods, although the overall performance in cross-species prediction remains to be improved. DeepAraPPI is freely accessible at http://zzdlab.com/deeparappi/. In the meantime, we have also made the source code and data sets of DeepAraPPI available at https://github.com/zjy1125/DeepAraPPI.

Authors

  • Jingyan Zheng
    State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing, 100193, China.
  • Xiaodi Yang
    Department of Microbiology and Parasitology, Anhui Key Laboratory of Infection and Immunity, Bengbu Medical College, Bengbu 233000, Anhui, China.
  • Yan Huang
    Department of Neurology, University of Texas Health Science Center at Houston, Houston, TX.
  • Shiping Yang
    State Key Laboratory of Agrobiotechnology, College of Biological Sciences , China Agricultural University , Beijing 100193 , China.
  • Stefan Wuchty
    Department of Computer Science, University of Miami, Miami, FL, 33146, USA.
  • Ziding Zhang
    State Key Laboratory of Agrobiotechnology, College of Biological Sciences , China Agricultural University , Beijing 100193 , China.