Joint Masked Reconstruction and Contrastive Learning for Mining Interactions Between Proteins
Journal:
arXiv
Published Date:
Mar 6, 2025
Abstract
Protein-protein interaction (PPI) prediction is an instrumental means in
elucidating the mechanisms underlying cellular operations, holding significant
practical implications for the realms of pharmaceutical development and
clinical treatment. Presently, the majority of research methods primarily
concentrate on the analysis of amino acid sequences, while investigations
predicated on protein structures remain in the nascent stages of exploration.
Despite the emergence of several structure-based algorithms in recent years,
these are still confronted with inherent challenges: (1) the extraction of
intrinsic structural information of proteins typically necessitates the
expenditure of substantial computational resources; (2) these models are overly
reliant on seen protein data, struggling to effectively unearth interaction
cues between unknown proteins. To further propel advancements in this domain,
this paper introduces a novel PPI prediction method jointing masked
reconstruction and contrastive learning, termed JmcPPI. This methodology
dissects the PPI prediction task into two distinct phases: during the residue
structure encoding phase, JmcPPI devises two feature reconstruction tasks and
employs graph attention mechanism to capture structural information between
residues; during the protein interaction inference phase, JmcPPI perturbs the
original PPI graph and employs a multi-graph contrastive learning strategy to
thoroughly mine extrinsic interaction information of novel proteins. Extensive
experiments conducted on three widely utilized PPI datasets demonstrate that
JmcPPI surpasses existing optimal baseline models across various data partition
schemes. The associated code can be accessed via
https://github.com/lijfrank-open/JmcPPI.