Prediction of Neuropeptides from Sequence Information Using Ensemble Classifier and Hybrid Features.

Journal: Journal of proteome research
PMID:

Abstract

As hormones in the endocrine system and neurotransmitters in the immune system, neuropeptides (NPs) provide many opportunities for the discovery of new drugs and targets for nervous system disorders. In spite of their importance in the hormonal regulations and immune responses, the bioinformatics predictor for the identification of NPs is lacking. In this study, we develop a predictor for the identification of NPs, named PredNeuroP, based on a two-layer stacking method. In this ensemble predictor, 45 models are introduced as base-learners by combining nine feature descriptors with five machine learning algorithms. Then, we select eight base-learners referring to the sum of accuracy and Pearson correlation coefficient of base-learner pairs on the first-layer learning. On the second-layer learning, the outputs of these advisable base-learners are imported into logistic regression classifier to train the final model, and the outputs are the final predicting results. The accuracy of PredNeuroP is 0.893 and 0.872 on the training and test data sets, respectively. The consistent performance on these data sets approves the practicability of our predictor. Therefore, we expect that PredNeuroP would provide an important advancement in the discovery of NPs as new drugs for the treatment of nervous system disorders. The data sets and Python code are available at https://github.com/xialab-ahu/PredNeuroP.

Authors

  • Yannan Bin
    Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, Institutes of Physical Science and Information Technology, Anhui University, Hefei, Anhui 230601, China.
  • Wei Zhang
    The First Affiliated Hospital of Nanchang University, Nanchang, China.
  • Wending Tang
    Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, Institutes of Physical Science and Information Technology, Anhui University, Hefei, Anhui 230601, China.
  • Ruyu Dai
    Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, Institutes of Physical Science and Information Technology, Anhui University, Hefei, Anhui 230601, China.
  • Menglu Li
    School of Computer Science and Technology, Anhui University, Hefei, Anhui 230601, China.
  • Qizhi Zhu
    Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, Institutes of Physical Science and Information Technology, Anhui University, Hefei, Anhui 230601, China.
  • Junfeng Xia