An end-to-end framework for the prediction of protein structure and fitness from single sequence.

Journal: Nature communications
Published Date:

Abstract

Significant research progress has been made in the field of protein structure and fitness prediction. Particularly, single-sequence-based structure prediction methods like ESMFold and OmegaFold achieve a balance between inference speed and prediction accuracy, showing promise for many downstream prediction tasks. Here, we propose SPIRED, a single-sequence-based structure prediction model that exhibits comparable performance to the state-of-the-art methods but with approximately 5-fold acceleration in inference and at least one order of magnitude reduction in training consumption. By integrating SPIRED with downstream neural networks, we compose an end-to-end framework named SPIRED-Fitness for the rapid prediction of both protein structure and fitness from single sequence with satisfactory accuracy. Moreover, SPIRED-Stab, the derivative of SPIRED-Fitness, achieves state-of-the-art performance in predicting the mutational effects on protein stability.

Authors

  • Yinghui Chen
    MOE Key Laboratory of Bioinformatics, School of Life Sciences, Tsinghua University, Beijing, China.
  • Yunxin Xu
    MOE Key Laboratory of Bioinformatics, School of Life Sciences, Tsinghua University, Beijing, China.
  • Di Liu
    Laboratory of Nutrition and Functional Food, College of Food Science and Engineering, Jilin University, Changchun, China.
  • Yaoguang Xing
    MOE Key Laboratory of Bioinformatics, School of Life Sciences, Tsinghua University, Beijing, China.
  • Haipeng Gong
    School of Life Science, Tsinghua University, Beijing 100084, China.