ThermoLink: Bridging disulfide bonds and enzyme thermostability through database construction and machine learning prediction.

Journal: Protein science : a publication of the Protein Society
PMID:

Abstract

Disulfide bonds, covalently formed by sulfur atoms in cysteine residues, play a crucial role in protein folding and structure stability. Considering their significance, artificial disulfide bonds are often introduced to enhance protein thermostability. Although an increasing number of tools can assist with this task, significant amounts of time and resources are often wasted owing to inadequate consideration. To enhance the accuracy and efficiency of designing disulfide bonds for protein thermostability improvement, we initially collected disulfide bond and protein thermostability data from extensive literature sources. Thereafter, we extracted various sequence- and structure-based features and constructed machine-learning models to predict whether disulfide bonds can improve protein thermostability. Among all models, the neighborhood context model based on the Adaboost-DT algorithm performed the best, yielding "area under the receiver operating characteristic curve" and accuracy scores of 0.773 and 0.714, respectively. Furthermore, we also found AlphaFold2 to exhibit high superiority in predicting disulfide bonds, and to some extent, the coevolutionary relationship between residue pairs potentially guided artificial disulfide bond design. Moreover, several mutants of imine reductase 89 (IR89) with artificially designed thermostable disulfide bonds were experimentally proven to be considerably efficient for substrate catalysis. The SS-bond data have been integrated into an online server, namely, ThermoLink, available at guolab.mpu.edu.mo/thermoLink.

Authors

  • Ran Xu
    Department of Allied Health Sciences, University of Connecticut, 358 Mansfield Rd, Storrs, CT 06269, USA.
  • Qican Pan
    Zelixir Biotech Company Ltd, Shanghai, China.
  • Guoliang Zhu
    Zelixir Biotech Company Ltd, Shanghai, China.
  • Yilin Ye
    School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China.
  • Minghui Xin
    School of Physics, Shandong University, Jinan, China.
  • Zechen Wang
    School of Physics, Shandong University, Jinan, Shandong 250100, China.
  • Sheng Wang
    Intensive Care Medical Center, Tongji Hospital, School of Medicine, Tongji University, Shanghai, 200065, People's Republic of China.
  • Weifeng Li
    Department of Emergency Medicine, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou, 510080, China.
  • Yanjie Wei
    Joint Engineering Research Center for Health Big Data Intelligent Analysis Technology, Center for High Performance Computing, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, Guangdong, China.
  • Jingjing Guo
    The School of Management, Hefei University of Technology, Hefei, China.
  • Liangzhen Zheng
    Tencent AI Lab, Shenzhen, China.