MESM: integrating multi-source data for high-accuracy protein-protein interactions prediction through multimodal language models.

Journal: BMC biology
Published Date:

Abstract

BACKGROUND: Protein-protein interactions (PPIs) play a critical role in essential biological processes such as signal transduction, enzyme activity regulation, cytoskeletal structure, immune responses, and gene regulation. However, current methods mainly focus on extracting features from protein sequences and using graph neural network (GNN) to acquire interaction information from the PPI network graph. This limits the model's ability to learn richer and more effective interaction information, thereby affecting prediction performance.

Authors

  • Feng Wang
    Department of Oncology, Binzhou Medical University Hospital, Binzhou, Shandong, China.
  • Jinming Chu
    School of Computer Science and Artificial Intelligence, Aliyun School of Big Data, School of Software, Changzhou University, Changzhou, 213164, China.
  • Liyan Shen
    School of Computer Engineering, Suzhou Vocational University, Suzhou, 215104, China. liyan.shen@jssvc.edu.cn.
  • Shan Chang
    Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China.