Ensembling machine learning models to boost molecular affinity prediction.

Journal: Computational biology and chemistry
Published Date:

Abstract

This study unites six popular machine learning approaches to enhance the prediction of a molecular binding affinity between receptors (large protein molecules) and ligands (small organic molecules). Here we examine a scheme where affinity of ligands is predicted against a single receptor - human thrombin, thus, the models consider ligand features only. However, the suggested approach can be repurposed for other receptors. The methods include Support Vector Machine, Random Forest, CatBoost, feed-forward neural network, graph neural network, and Bidirectional Encoder Representations from Transformers. The first five methods use input features based on physico-chemical properties of molecules, while the last one is based on textual molecular representations. All approaches do not rely on atomic spatial coordinates, avoiding a potential bias from known structures, and are capable of generalizing for compounds with unknown conformations. Within each of the methods, we have trained two models that solve classification and regression tasks. Then, all models are grouped into a pipeline of two subsequent ensembles. The first ensemble aggregates six classification models which vote whether a ligand binds to a receptor or not. If a ligand is classified as active (i.e., binds), the second ensemble predicts its binding affinity in terms of the inhibition constant K.

Authors

  • Maksym Druchok
    SoftServe, Inc., 2d Sadova Str., 79021 Lviv, Ukraine; Institute for Condensed Matter Physics, NAS of Ukraine, 1 Svientsitskii Str., 79011 Lviv, Ukraine. Electronic address: maksym@icmp.lviv.ua.
  • Dzvenymyra Yarish
    SoftServe, Inc., 2d Sadova Str., 79021 Lviv, Ukraine.
  • Sofiya Garkot
    SoftServe, Inc., 2d Sadova Str., 79021 Lviv, Ukraine; Ukrainian Catholic University, 17 Svientsitskii Str., 79011 Lviv, Ukraine.
  • Tymofii Nikolaienko
    SoftServe, Inc., 2d Sadova Str., 79021 Lviv, Ukraine; Taras Shevchenko National University of Kyiv, 64/13, Volodymyrska Str., 01601 Kyiv, Ukraine.
  • Oleksandr Gurbych
    SoftServe, Inc., 2d Sadova Str., 79021 Lviv, Ukraine; Lviv Polytechnic National University, 5 Kniazia Romana Str., 79005 Lviv, Ukraine.