PLPTP: A Motif-based Interpretable Deep Learning Framework Based on Protein Language Models for Peptide Toxicity Prediction.

Journal: Journal of molecular biology

PMID: 40158838

Abstract

Peptide toxicity prediction holds significant importance in drug development and biotechnology, as accurately identifying toxic peptide sequences is crucial for designing safer peptide-based drugs. This study proposes a deep learning-based model for peptide toxicity prediction, integrating Evolutionary Scale Modeling (ESM2), Bidirectional Long Short-Term Memory (BiLSTM), and Deep Neural Network (DNN). The ESM2 model captures evolutionary information from peptide sequences, providing a rich context for the sequences; the BiLSTM network focuses on extracting contextual dependencies, thereby capturing long-range dependencies within the sequence; and the DNN further classifies the extracted features to achieve the final toxicity prediction. To enhance the reliability and transparency of the model, we also conducted motif analysis to identify key patterns in the data, which helps to explain the model's attention mechanism and its classification performance. To address the class imbalance in the dataset, we employed Focal Loss as the loss function, which enhances the model's ability to identify minority class samples by reducing the contribution of easily classified samples. Experimental results demonstrate that the proposed model performs exceptionally well across multiple evaluation metrics, particularly in handling imbalanced data, achieving significant improvements over traditional methods. This result highlights the model's potential to improve the accuracy of peptide toxicity prediction and its valuable role in drug development and biotechnology research. The PLPTP web server is available at https://www.bioai-lab.com/PLPTP.

Authors

Shun Gao

School of Computer Science and Technology, Hainan University, Haikou 570228, China.
Yanna Jia

School of Computer Science and Technology, Hainan University, Haikou 570228, China.
Feifei Cui

School of Computer Science and Technology, Hainan University, Haikou 570228, China.
Junlin Xu

School of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan, Hubei 430065, China.
Yajie Meng

College of Computer Science and Electronic Engineering, Hunan University, Changsha, China.
Leyi Wei

School of Computer Science and Technology, Tianjin University, Tianjin, 30050, China.
Qingchen Zhang
Quan Zou
Zilong Zhang

School of Computer Science and Technology, Hainan University, Haikou 570228, China.

Keywords

Amino Acid Motifs Amino Acid Sequence Computational Biology Deep Learning Humans Neural Networks, Computer Peptides Proteins

External Resources

View on PubMed Access via DOI PubMed (40158838)

PLPTP: A Motif-based Interpretable Deep Learning Framework Based on Protein Language Models for Peptide Toxicity Prediction.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals