Machine learning classifiers aid virtual screening for efficient design of mini-protein therapeutics.

Journal: Bioorganic & medicinal chemistry letters
Published Date:

Abstract

De novo design of mini-proteins (4-12 kDa) has recently been shown to produce new candidates for protein therapeutics. They are temperature stable molecules that bind to the drug target with high affinity for inhibiting its interactions. The development of mini-protein binders requires laboratory screening of tens of thousands of molecules for effective target binding. In this study we trained machine learning classifiers which can distinguish, with 90% accuracy and 80% precision, mini-protein binders from non-binding molecules designed for a particular target; this significantly reduces the number of mini protein candidates for experimental screening. Further, on the basis of our results we propose a multi-stage protocol where a small dataset (few hundred experimentally verified target-specific mini-proteins) can be used to train classifiers for improving the efficiency of mini-protein design for any specific target.

Authors

  • Neeraj K Gaur
    Beamline Development and Application Section, Bhabha Atomic Research Centre, Mumbai 400085, India; Division of Biochemical Sciences, CSIR-National Chemical Laboratory, Pune 411008, India. Electronic address: neerajkailashgaur@gmail.com.
  • Venuka Durani Goyal
    Beamline Development and Application Section, Bhabha Atomic Research Centre, Mumbai 400085, India.
  • Kiran Kulkarni
    Division of Biochemical Sciences, CSIR-National Chemical Laboratory, Pune 411008, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India.
  • Ravindra D Makde
    Beamline Development and Application Section, Bhabha Atomic Research Centre, Mumbai 400085, India.