Improving prediction of secondary structure, local backbone angles, and solvent accessible surface area of proteins by iterative deep learning.

Journal: Scientific reports
Published Date:

Abstract

Direct prediction of protein structure from sequence is a challenging problem. An effective approach is to break it up into independent sub-problems. These sub-problems such as prediction of protein secondary structure can then be solved independently. In a previous study, we found that an iterative use of predicted secondary structure and backbone torsion angles can further improve secondary structure and torsion angle prediction. In this study, we expand the iterative features to include solvent accessible surface area and backbone angles and dihedrals based on Cα atoms. By using a deep learning neural network in three iterations, we achieved 82% accuracy for secondary structure prediction, 0.76 for the correlation coefficient between predicted and actual solvent accessible surface area, 19° and 30° for mean absolute errors of backbone φ and ψ angles, respectively, and 8° and 32° for mean absolute errors of Cα-based θ and τ angles, respectively, for an independent test dataset of 1199 proteins. The accuracy of the method is slightly lower for 72 CASP 11 targets but much higher than those of model structures from current state-of-the-art techniques. This suggests the potentially beneficial use of these predicted properties for model assessment and ranking.

Authors

  • Rhys Heffernan
    Signal Processing Laboratory, School of Engineering, Griffith University, Brisbane, Australia.
  • Kuldip Paliwal
    Signal Processing Laboratory, School of Engineering, Griffith University, Brisbane, Australia.
  • James Lyons
    Signal Processing Laboratory, School of Engineering, Griffith University, Brisbane, Australia.
  • Abdollah Dehzangi
    1] Signal Processing Laboratory, School of Engineering, Griffith University, Brisbane, Australia [2] Institute for Integrated and Intelligent Systems, Griffith University, Brisbane, Australia.
  • Alok Sharma
    Center for Integrative Medical Sciences, RIKEN Yokohama, Yokohama, 230-0045, Japan.
  • Jihua Wang
    Shandong Provincial Key Laboratory of Functional Macromolecular Biophysics, Dezhou University, Dezhou, Shandong, China.
  • Abdul Sattar
    1] Institute for Integrated and Intelligent Systems, Griffith University, Brisbane, Australia [2] National ICT Australia (NICTA), Brisbane, Australia.
  • Yuedong Yang
    Institute for Glycomics and School of Information and Communication Technique, Griffith University, Parklands Dr. Southport, QLD 4222, Australia.
  • Yaoqi Zhou
    Institute of Systems and Physical Biology, Shenzhen Bay Laboratory, Shenzhen, Guangdong, 518106, China. Electronic address: zhouyq@szbl.ac.cn.