Prediction of Protein-ATP Binding Residues Based on Ensemble of Deep Convolutional Neural Networks and LightGBM Algorithm.

Journal: International journal of molecular sciences
PMID:

Abstract

Accurately identifying protein-ATP binding residues is important for protein function annotation and drug design. Previous studies have used classic machine-learning algorithms like support vector machine (SVM) and random forest to predict protein-ATP binding residues; however, as new machine-learning techniques are being developed, the prediction performance could be further improved. In this paper, an ensemble predictor that combines deep convolutional neural network and LightGBM with ensemble learning algorithm is proposed. Three subclassifiers have been developed, including a multi-incepResNet-based predictor, a multi-Xception-based predictor, and a LightGBM predictor. The final prediction result is the combination of outputs from three subclassifiers with optimized weight distribution. We examined the performance of our proposed predictor using two datasets: a classic ATP-binding benchmark dataset and a newly proposed ATP-binding dataset. Our predictor achieved area under the curve (AUC) values of 0.925 and 0.902 and Matthews Correlation Coefficient (MCC) values of 0.639 and 0.642, respectively, which are both better than other state-of-art prediction methods.

Authors

  • Jiazhi Song
    College of Computer Science and Technology, Jilin University, No. 2699 Qianjin Street, Changchun 130012, China.
  • Guixia Liu
    Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China . Email: gxliu@ecust.edu.cn ; Email: ytang234@ecust.edu.cn ; ; Tel: +86-21-64250811.
  • Jingqing Jiang
    College of Computer Science and Technology, Inner Mongolia University for Nationalities, No. 536 Huolinhe Street, Tongliao 028000, China.
  • Ping Zhang
    Department of Computer Science and Engineering, The Ohio State University, USA.
  • Yanchun Liang
    * College of Computer Science and Technology, Key Laboratory of Symbolic, Computation and Knowledge, Engineering of Ministry of Education, Jilin University, Changchun 130012, P. R. China.