NanoBinder: a machine learning assisted nanobody binding prediction tool using Rosetta energy scores.

Journal: Journal of cheminformatics
Published Date:

Abstract

Nanobodies offer significant therapeutic potential due to their small size, stability, and versatility. Although advancements in computational protein design have made designing de novo nanobodies increasingly feasible, there are limited tools specifically tailored for this purpose. Rosetta with its specialized protocols, is a prominent tool for nanobody design but is limited by a high false-negative rate, necessitating extensive high-throughput screening. This results in increased costs, time, and labor due to the need for large-scale experimentation and detailed structural analysis. To address current challenges in nanobody design, we introduce NanoBinder, an interpretable machine learning model that predicts nanobody-antigen binding using Rosetta energy scores. NanoBinder utilizes a Random Forest model trained on experimentally validated complexes and can be seamlessly integrated into the Rosetta software. It employs SHAP summary plots for interpretability, which helps identify key features influencing binding interactions. Experimentally validated on forty-nine diverse nanobodies, NanoBinder accurately predicts non-binders and shows reasonable performance in identifying binders. This approach significantly enhances predictive accuracy, reduces the need for extensive experimental assays, and accelerates nanobody development, thereby offering a powerful tool to mitigate the costs, time, and labor associated with high-throughput screening.Scientific contribution This study introduces NanoBinder, a machine learning framework for predicting nanobody-antigen binding using Rosetta-derived energy features. Through rigorous experimental validation across diverse nanobody sets, NanoBinder enhances nanobody screening workflows by reducing false positives and minimizing reliance on extensive wet-lab assays. The approach bridges the gap between physics-based modeling and data-driven prediction in nanobody design.

Authors

  • Palistha Shrestha
    Department of Electronics and Information Engineering, Jeonbuk National University, Jeonju-si, 54896, Jeollabuk-do, Republic of Korea.
  • Chandana S Talwar
    Disease Target Structure Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), 125 Gwahak-ro, Yuseong-gu, Daejeon, 34141, Republic of Korea.
  • Jeevan Kandel
    Graduate School of Integrated Energy-AI, Jeonbuk National University, Jeonju-si, 54896, Jeollabuk-do, Republic of Korea.
  • Kwang-Hyun Park
    Disease Target Structure Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), 125 Gwahak-ro, Yuseong-gu, Daejeon, 34141, Republic of Korea.
  • Kil To Chong
    Division of Electronic Engineering, and Advanced Research Center of Electronics and Information, Chonbuk National University, Jeonju-Si 54896, South Korea. Electronic address: kitchong@jbnu.ac.kr.
  • Eui-Jeon Woo
    Disease Target Structure Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), 125 Gwahak-ro, Yuseong-gu, Daejeon, 34141, Republic of Korea. ejwoo@kribb.re.kr.
  • Hilal Tayara
    Department of Electronics and Information Engineering, Chonbuk National University, Jeonju 54896, South Korea. Electronic address: hilaltayara@jbnu.ac.kr.

Keywords

No keywords available for this article.