A Survey of Multi-task Learning Methods in Chemoinformatics.

Journal: Molecular informatics
Published Date:

Abstract

Despite the increasing volume of available data, the proportion of experimentally measured data remains small compared to the virtual chemical space of possible chemical structures. Therefore, there is a strong interest in simultaneously predicting different ADMET and biological properties of molecules, which are frequently strongly correlated with one another. Such joint data analyses can increase the accuracy of models by exploiting their common representation and identifying common features between individual properties. In this work we review the recent developments in multi-learning approaches as well as cover the freely available tools and packages that can be used to perform such studies.

Authors

  • Sergey Sosnin
    Center for Computational and Data-intensive Science and Engineering, Skolkovo Institute of Science and Technology, Nobelya Ulitsa 3 Moscow, 121205, Russia.
  • Mariia Vashurina
    Helmholtz Zentrum München - German Research Center for Environmental Health (GmbH), Institute of Structural Biology, Ingolstädter Landstraße 1, D-85764, Neuherberg, Germany.
  • Michael Withnall
    Helmholtz Zentrum München - German Research Center for Environmental Health (GmbH), Institute of Structural Biology, Ingolstädter Landstraße 1, D-85764, Neuherberg, Germany.
  • Pavel Karpov
    Helmholtz Zentrum München - German Research Center for Environmental Health (GmbH), Institute of Structural Biology, Ingolstädter Landstraße 1, D-85764, Neuherberg, Germany.
  • Maxim Fedorov
    Center for Computational and Data-Intensive Science and Engineering, Skolkovo Institute of Science and Technology Skolkovo Innovation Center, Moscow, 143026, Russia.
  • Igor V Tetko
    g Helmholtz Zentrum München - German Research Center for Environmental Health (GmbH) , Institute of Structural Biology , Neuherberg , Germany.