A step-by-step classification algorithm of protein secondary structures based on double-layer SVM model.

Journal: Genomics
Published Date:

Abstract

In this paper, a step-by-step classification algorithm based on double-layer SVM model is constructed to predict the secondary structure of proteins. The most important feature of this algorithm is to improve the prediction accuracy of α+β and α/β classes through transforming the prediction of two classes of proteins, α+β and α/β classes, with low accuracy in the past, into the prediction of all-α and all-β classes with high accuracy. A widely-used dataset, 25PDB dataset with sequence similarity lower than 40%, is used to evaluate this method. The results show that this method has good performance, and on the basis of ensuring the accuracy of other three structural classes of proteins, the accuracy of α+β class proteins is improved significantly.

Authors

  • Yongzhen Ge
    School of Mathematical Sciences, Ocean University of China, Qingdao 266100, PR China.
  • Shuo Zhao
    College of Information Science and Engineering, Ocean University of China, Qingdao 266100, PR China.
  • Xiqiang Zhao
    School of Mathematical Sciences, Ocean University of China, Qingdao 266100, PR China. Electronic address: xqzhao62@163.com.