Development of Predictive QSAR Models of 4-Thiazolidinones Antitrypanosomal Activity Using Modern Machine Learning Algorithms.

Journal: Molecular informatics
Published Date:

Abstract

This paper presents novel QSAR models for the prediction of antitrypanosomal activity among thiazolidines and related heterocycles. The performance of four machine learning algorithms: Random Forest regression, Stochastic gradient boosting, Multivariate adaptive regression splines and Gaussian processes regression have been studied in order to reach better levels of predictivity. The results for Random Forest and Gaussian processes regression are comparable and outperform other studied methods. The preliminary descriptor selection with Boruta method improved the outcome of machine learning methods. The two novel QSAR-models developed with Random Forest and Gaussian processes regression algorithms have good predictive ability, which was proved by the external evaluation of the test set with corresponding Q =0.812 and Q =0.830. The obtained models can be used further for in silico screening of virtual libraries in the same chemical domain in order to find new antitrypanosomal agents. Thorough analysis of descriptors influence in the QSAR models and interpretation of their chemical meaning allows to highlight a number of structure-activity relationships. The presence of phenyl rings with electron-withdrawing atoms or groups in para-position, increased number of aromatic rings, high branching but short chains, high HOMO energy, and the introduction of 1-substituted 2-indolyl fragment into the molecular structure have been recognized as trypanocidal activity prerequisites.

Authors

  • Anna Kryshchyshyn
    Department of Pharmaceutical, Organic and Bioorganic Chemistry, Danylo Halytsky Lviv National Medical University, Pekarska str. 69, 79010, Lviv, Ukraine.
  • Oleg Devinyak
    Department of Pharmaceutical Disciplines, Uzhgorod National University, Narodna sq. 1, 88000, Uzhgorod, Ukraine.
  • Danylo Kaminskyy
    Department of Pharmaceutical, Organic and Bioorganic Chemistry, Danylo Halytsky Lviv National Medical University, Pekarska str. 69, 79010, Lviv, Ukraine.
  • Philippe Grellier
    National Museum of Natural History, UMR 7245 CNRS MCAM, Sorbonne Universités, CP 52, 57 Rue Cuvier, Paris, 75005, France.
  • Roman Lesyk
    Department of Pharmaceutical, Organic and Bioorganic Chemistry, Danylo Halytsky Lviv National Medical University, Pekarska str. 69, 79010, Lviv, Ukraine.