PredAmyl-MLP: Prediction of Amyloid Proteins Using Multilayer Perceptron.

Journal: Computational and mathematical methods in medicine
PMID:

Abstract

Amyloid is generally an aggregate of insoluble fibrin; its abnormal deposition is the pathogenic mechanism of various diseases, such as Alzheimer's disease and type II diabetes. Therefore, accurately identifying amyloid is necessary to understand its role in pathology. We proposed a machine learning-based prediction model called PredAmyl-MLP, which consists of the following three steps: feature extraction, feature selection, and classification. In the step of feature extraction, seven feature extraction algorithms and different combinations of them are investigated, and the combination of SVMProt-188D and tripeptide composition (TPC) is selected according to the experimental results. In the step of feature selection, maximum relevant maximum distance (MRMD) and binomial distribution (BD) are, respectively, used to remove the redundant or noise features, and the appropriate features are selected according to the experimental results. In the step of classification, we employed multilayer perceptron (MLP) to train the prediction model. The 10-fold cross-validation results show that the overall accuracy of PredAmyl-MLP reached 91.59%, and the performance was better than the existing methods.

Authors

  • Yanjuan Li
    School of Information and Computer Engineering, Northeast Forestry University, Harbin 150040, China. liyanjuan@nefu.edu.cn.
  • Zitong Zhang
    The Second Clinical College, Chongqing Medical University, Chongqing, China.
  • Zhixia Teng
    School of Computer Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang, China.
  • Xiaoyan Liu
    College of Information Technology, Jilin Agricultural University, Changchun, China.