PILOT: Deep Siamese network with hybrid attention improves prediction of mutation impact on protein stability.

Journal: Neural networks : the official journal of the International Neural Network Society
Published Date:

Abstract

Evaluating the mutation impact on protein stability (ΔΔG) is essential in the study of protein engineering and understanding molecular mechanisms of disease-associated mutations. Here, we propose a novel deep learning framework, PILOT, for improved prediction of ΔΔG using a Siamese network with hybrid attention mechanism. The PILOT framework leverages multiple attention modules to effectively extract representations for amino acids, atoms, and protein sequences, respectively. This approach significantly ensures the deep fusion of structural information at both residue and atom levels, the seamless integration of structural and sequence representations, and the effective capture of both long-range and short-range dependencies among amino acids. Our extensive evaluations demonstrate that PILOT greatly outperforms other state-of-the-art methods. We also showcase that PILOT identifies exceptional patterns for different mutation types. Moreover, we illustrate the clinical applicability of PILOT in highlighting pathogenic variants from benign variants and VUS (variants of uncertain significance), and distinguishing de novo mutations in disease cases and controls. In summary, PILOT presents a robust deep learning tool that could offer significant insights into drug design, medical applications, and protein engineering studies.

Authors

  • Yuan Zhang
    Department of Urology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China.
  • Junsheng Deng
    Key Laboratory of Intelligent Computing and Information Processing of Ministry of Education, Xiangtan University, Xiangtan 411105, China.
  • Mingyuan Dong
    Key Laboratory of New Energy and Rare Earth Resource Utilization of State Ethnic Affairs Commission, Key Laboratory of Photosensitive Materials & Devices of Liaoning Province, School of Physics and Materials Engineering, Dalian Minzu University, Dalian 116600, China.
  • Jiafeng Wu
    Key Laboratory of Intelligent Computing and Information Processing of Ministry of Education, Xiangtan University, Xiangtan 411105, China.
  • Qiuye Zhao
    Department of Computational Biology, Cornell University, Ithaca, NY 14853, USA; Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY 14853, USA. Electronic address: qyzhao23@gmail.com.
  • Xieping Gao
  • Dapeng Xiong
    MOE Key Laboratory of Bioinformatics, School of Life Sciences, Tsinghua University, Beijing, China.