Integrating Protein Language Models and Geometric Deep Learning for Peptide Toxicity Prediction.

Journal: Journal of chemical information and modeling
Published Date:

Abstract

Peptide toxicity prediction is a critical task in biomedical research, influencing drug safety and therapeutic development. Traditional methods, relying on sequence similarity or handcrafted features, struggle to capture the complex relationship between peptide structure and toxicity. In this study, we propose PeptiTox, an advanced deep learning framework that integrates protein language models (PLMs) and geometric deep learning to enhance peptide toxicity prediction. Specifically, ESM2 is employed to extract sequence embeddings, while ESMFold predicts the three-dimensional (3D) peptide structure. The structural information is then transformed into a graph representation, where residues serve as nodes, and interactions between residues form edges. A graph neural network (GNN) is subsequently used to learn peptide representations and classify their toxicity. Experimental results demonstrate that PeptiTox significantly outperforms state-of-the-art models across multiple evaluation metrics. Our findings highlight the importance of integrating sequence and structural knowledge for peptide toxicity prediction, paving the way for safer and more effective peptide-based therapeutics.

Authors

  • Yanling Wang
    Department of Neurology, The Fourth Affiliated Hospital of Harbin Medical University, Harbin, China.
  • Na Li
    School of Nursing, Fujian University of Traditional Chinese Medicine, Fuzhou, China.
  • Xiao Wang
    Research Centre of Basic Integrative Medicine, School of Basic Medical Sciences, Guangzhou University of Chinese Medicine, Guangzhou, Guangdong, China.
  • Feng Cao
    Department of Cardiology, Xijing Hospital, Fourth Military Medical University, Xi'an, Shaanxi, China; Department of Cardiology, Chinese PLA General Hospital, Beijing, China.
  • Shuwen Xiong
    Faculty of Applied Sciences, Macao Polytechnic University, R. de Luís Gonzaga Gomes, Macao 999078, China.
  • Leyi Wei
    School of Computer Science and Technology, Tianjin University, Tianjin, 30050, China.