Identification of adaptor proteins using the ANOVA feature selection technique.

Journal: Methods (San Diego, Calif.)
PMID:

Abstract

The adaptor proteins play a crucially important role in regulating lymphocyte activation. Rapid and efficient identification of adaptor proteins is essential for understanding their functions. However, biochemical methods require not only expensive experimental costs, but also long experiment cycles and more personnel. Therefore, a computational method that could accurately identify adaptor proteins is urgently needed. To solve this issue, we developed a classifier that combined the support vector machine (SVM) with the composition of k-Spaced Amino Acid Pairs (CKSAAP) and the amino acid composition (AAC) to identify adaptor proteins. Analysis of variance (ANOVA) was used to select the optimized features which could generate the maximum prediction performance. By examining the proposed model on independent data, we found that the 447 optimized features could achieve an accuracy of 92.39% with an AUC of 0.9766, demonstrating the powerful capabilities of our model. We hope that the proposed model could provide more clues for studying adaptor proteins.

Authors

  • Yu-Hao Wang
    School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, Sichuan, China.
  • Yu-Fei Zhang
    School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, Sichuan, China.
  • Ying Zhang
    Department of Nephrology, Nanchong Central Hospital Affiliated to North Sichuan Medical College, Nanchong, China.
  • Zhi-Feng Gu
    School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, Sichuan, China.
  • Zhao-Yue Zhang
    Key Laboratory for Neuro-Information of Ministry of Education, School of Life Science and Technology, Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China.
  • Hao Lin
    Yangtze Delta Region Institute (Huzhou), University of Electronic Science and Technology of China, Huzhou, Zhejiang, China.
  • Ke-Jun Deng
    Center for Informational Biology, University of Electronic Science and Technology of China Chengdu 611731, China.