A Novel Protein Subcellular Localization Method With CNN-XGBoost Model for Alzheimer's Disease.

Journal: Frontiers in genetics
Published Date:

Abstract

The disorder distribution of protein in the compartment or organelle leads to many human diseases, including neurodegenerative diseases such as Alzheimer's disease. The prediction of protein subcellular localization play important roles in the understanding of the mechanism of protein function, pathogenes and disease therapy. This paper proposes a novel subcellular localization method by integrating the Convolutional Neural Network (CNN) and eXtreme Gradient Boosting (XGBoost), where CNN acts as a feature extractor to automatically obtain features from the original sequence information and a XGBoost classifier as a recognizer to identify the protein subcellular localization based on the output of the CNN. Experiments are implemented on three protein datasets. The results prove that the CNN-XGBoost method performs better than the general protein subcellular localization methods.

Authors

  • Long Pang
    Harbin Nebula Bioinformatics Technology Development Co., Ltd., Harbin, China.
  • Junjie Wang
    School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China.
  • Lingling Zhao
    School of Electronic Engineering, Heilongjiang University, Harbin, China.
  • Chunyu Wang
    School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China.
  • Hui Zhan
    School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China.

Keywords

No keywords available for this article.