DeepSVM-fold: protein fold recognition by combining support vector machines and pairwise sequence similarity scores generated by deep learning networks.

Journal: Briefings in bioinformatics
Published Date:

Abstract

Protein fold recognition is critical for studying the structures and functions of proteins. The existing protein fold recognition approaches failed to efficiently calculate the pairwise sequence similarity scores of the proteins in the same fold sharing low sequence similarities. Furthermore, the existing feature vectorization strategies are not able to measure the global relationships among proteins from different protein folds. In this article, we proposed a new computational predictor called DeepSVM-fold for protein fold recognition by introducing a new feature vector based on the pairwise sequence similarity scores calculated from the fold-specific features extracted by deep learning networks. The feature vectors are then fed into a support vector machine to construct the predictor. Experimental results on the benchmark dataset (LE) show that DeepSVM-fold obviously outperforms all the other competing methods.

Authors

  • Bin Liu
    Department of Endocrinology, the First Affiliated Hospital of Chongqing Medical University, Chongqing, China; Department of Endocrinology, Neijiang First People's Hospital, Chongqing, China.
  • Chen-Chen Li
    School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen, Guangdong 518055, China.
  • Ke Yan
    Department of Biostatistics, Medical College of Wisconsin, Milwaukee, Wis.