CysPresso: a classification model utilizing deep learning protein representations to predict recombinant expression of cysteine-dense peptides.

Journal: BMC bioinformatics
PMID:

Abstract

BACKGROUND: Cysteine-dense peptides (CDPs) are an attractive pharmaceutical scaffold that display extreme biochemical properties, low immunogenicity, and the ability to bind targets with high affinity and selectivity. While many CDPs have potential and confirmed therapeutic uses, synthesis of CDPs is a challenge. Recent advances have made the recombinant expression of CDPs a viable alternative to chemical synthesis. Moreover, identifying CDPs that can be expressed in mammalian cells is crucial in predicting their compatibility with gene therapy and mRNA therapy. Currently, we lack the ability to identify CDPs that will express recombinantly in mammalian cells without labour intensive experimentation. To address this, we developed CysPresso, a novel machine learning model that predicts recombinant expression of CDPs based on primary sequence.

Authors

  • Sébastien Ouellet
    , Ottawa, Canada.
  • Larissa Ferguson
    Neurobiology Division, MRC Laboratory of Molecular Biology, Cambridge, UK.
  • Angus Z Lau
    Medical Biophysics, University of Toronto, Toronto, ON, Canada.
  • Tony K Y Lim
    , Vancouver, Canada. tl581@cam.ac.uk.