MSF-CPMP: A Novel Multi-Source Feature Fusion Model for Cyclic Peptide Membrane Permeability Prediction

Journal: bioRxiv
Published Date:

Abstract

Cyclic peptides are becoming attractive molecules for drug discovery because of their properties with inherent stability and structural diversity. However, the high potential of cyclic peptide drugs is challenged by the limited membrane permeability cross cell membrane. To predict cyclic peptide membrane permeability (CPMP), an increased number of computational models or tools are designed and used. But these existing algorithms or models do not appropriately capture feature diversity of cyclic peptides. In this study, we introduce a novel multi-source feature fusion model called MSF-CPMP, which aims to increase the accuracy of predicted CPMP. The MSF-CPMP model incorporates three features extracted from SMILES sequences, graph-based molecular structures, and physicochemical properties of cyclic peptides. By benchmarking with other machine learning and deep learning-based methods, MSF-CPMP achieved the highest levels of the evaluation metrics such as accuracy of 0.9062 and AUROC of 0.9546, and further validated MSF-CPMP robustness in learning capabilities and efficacy of its multi-source fusion. Our result demonstrates that MSF-CPMP outperforms other methods in predicting CPMP, that provides also exemplifies the power of advanced deep learning methods in tackling complex biological challenges, offering contributions to computational biology and clinical treatment. Code is available at https://github.com/wanglabhku/MSF-CPMP We have recognized that cyclic peptides are one type of main macro-molecular drugs useful for treatment of human diseases. However, the structural diversity of cyclic peptides results in limited permeability across cell membranes which is challenging drug research or industrial development. Previously some computational models or tools including machine learning or deep learning ones tried to solve this problem, but they still are not efficient. There are urgent requirement for more effective methodologies. Thus, we introduce a novel multi-source feature fusion model called MSF-CPMP. Our aim is to enhance accuracy and efficiency of predicted CPMP. We carried out evaluation of performance metrics by comparative analysis between MSF-CPMP and other machine learning or deep learning methods. Our result demonstrates that MSF-CPMP outperforms other models in predicting CPMP. We further validated MSF-CPMP robustness in learning capabilities and efficacy of its multi-source feature fusion. In the future we would further improve our methods that can integrate broader biomedicine and biomedical information and provide guide for drug diversity in clinically treating complex human diseases.

Authors

  • Yijun Zhang; Zimeng Chen; Zhuxuan Wan; Qianhui Jiang; Xiaoling Lu; Bin Yan; Jing Qin; Yong Liu; Junwen Wang