PractiCPP: a deep learning approach tailored for extremely imbalanced datasets in cell-penetrating peptide prediction.

Journal: Bioinformatics (Oxford, England)
Published Date:

Abstract

MOTIVATION: Effective drug delivery systems are paramount in enhancing pharmaceutical outcomes, particularly through the use of cell-penetrating peptides (CPPs). These peptides are gaining prominence due to their ability to penetrate eukaryotic cells efficiently without inflicting significant damage to the cellular membrane, thereby ensuring optimal drug delivery. However, the identification and characterization of CPPs remain a challenge due to the laborious and time-consuming nature of conventional methods, despite advances in proteomics. Current computational models, however, are predominantly tailored for balanced datasets, an approach that falls short in real-world applications characterized by a scarcity of known positive CPP instances.

Authors

  • Kexin Shi
    Department of Nuclear Medicine, West China Hospital of Sichuan University, Chengdu, China.
  • Yuanpeng Xiong
    Department of Computer Science and Technology, Tsinghua University, Beijing, China.
  • Yu Wang
    Clinical and Technical Support, Philips Healthcare, Shanghai, China.
  • Yifan Deng
    College of Informatics, Huazhong Agricultural University, Wuhan 430070, China.
  • Wenjia Wang
    School of Computing Sciences, University of East Anglia, Norwich Research Park, Norwich, NR4 7TJ, UK.
  • Bingyi Jing
    Department of Statistics and Data Science, Southern University of Science and Technology, Shenzhen 518000, China.
  • Xin Gao
    Department of Computer Science, New Jersey Institute of Technology, Newark, New Jersey, USA.