Protein inter-residue contact and distance prediction by coupling complementary coevolution features with deep residual networks in CASP14.

Journal: Proteins
Published Date:

Abstract

This article reports and analyzes the results of protein contact and distance prediction by our methods in the 14th Critical Assessment of techniques for protein Structure Prediction (CASP14). A new deep learning-based contact/distance predictor was employed based on the ensemble of two complementary coevolution features coupling with deep residual networks. We also improved our multiple sequence alignment (MSA) generation protocol with wholesale meta-genome sequence databases. On 22 CASP14 free modeling (FM) targets, the proposed model achieved a top-L/5 long-range precision of 63.8% and a mean distance bin error of 1.494. Based on the predicted distance potentials, 11 out of 22 FM targets and all of the 14 FM/template-based modeling (TBM) targets have correctly predicted folds (TM-score >0.5), suggesting that our approach can provide reliable distance potentials for ab initio protein folding.

Authors

  • Yang Li
    Occupation of Chinese Center for Disease Control and Prevention, Beijing, China.
  • Chengxin Zhang
    Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan.
  • Wei Zheng
    School of Computer Engineering, Jinling Institute of Technology, Nanjing, 211169, China. zhengwei@jit.edu.cn.
  • Xiaogen Zhou
    Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109.
  • Eric W Bell
    Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, United States of America.
  • Dong-Jun Yu
  • Yang Zhang
    Innovative Institute of Chinese Medicine and Pharmacy, Academy for Interdiscipline, Chengdu University of Traditional Chinese Medicine, Chengdu, China.