Comparative Study of Multitask Toxicity Modeling on a Broad Chemical Space.

Journal: Journal of chemical information and modeling
Published Date:

Abstract

Acute toxicity is one of the most challenging properties to predict purely with computational methods due to its direct relationship to biological interactions. Moreover, toxicity can be represented by different end points: it can be measured for different species using different types of administration, etc., and it is questionable if the knowledge transfer between end points is possible. We performed a comparative study of prediction multitask toxicity for a broad chemical space using different descriptors and modeling algorithms and applied multitask learning for a large toxicity data set extracted from the Registry of Toxic Effects of Chemical Substances (RTECS). We demonstrated that multitask modeling provides significant improvement over single-output models and other machine learning methods. Our research reveals that multitask learning can be very useful to improve the quality of acute toxicity modeling and raises a discussion about the usage of multitask approaches for regulation purposes. Our MultiTox models are freely available in OCHEM platform ( ochem.eu/multitox ) under CC-BY-NC license.

Authors

  • Sergey Sosnin
    Center for Computational and Data-intensive Science and Engineering, Skolkovo Institute of Science and Technology, Nobelya Ulitsa 3 Moscow, 121205, Russia.
  • Dmitry Karlov
    Skolkovo Institute of Science and Technology , Skolkovo Innovation Center , Moscow 143026 , Russia.
  • Igor V Tetko
    g Helmholtz Zentrum München - German Research Center for Environmental Health (GmbH) , Institute of Structural Biology , Neuherberg , Germany.
  • Maxim V Fedorov