Imputation of Assay Bioactivity Data Using Deep Learning.

Journal: Journal of chemical information and modeling
Published Date:

Abstract

We describe a novel deep learning neural network method and its application to impute assay pIC values. Unlike conventional machine learning approaches, this method is trained on sparse bioactivity data as input, typical of that found in public and commercial databases, enabling it to learn directly from correlations between activities measured in different assays. In two case studies on public domain data sets we show that the neural network method outperforms traditional quantitative structure-activity relationship (QSAR) models and other leading approaches. Furthermore, by focusing on only the most confident predictions the accuracy is increased to R > 0.9 using our method, as compared to R = 0.44 when reporting all predictions.

Authors

  • T M Whitehead
    Intellegens , Eagle Labs , Chesterton Road , Cambridge CB4 3AZ , United Kingdom.
  • B W J Irwin
    Optibrium , F5-6 Blenheim House, Cambridge Innovation Park, Denny End Road , Cambridge CB25 9PB , United Kingdom.
  • P Hunt
    Optibrium , F5-6 Blenheim House, Cambridge Innovation Park, Denny End Road , Cambridge CB25 9PB , United Kingdom.
  • M D Segall
    Optibrium , F5-6 Blenheim House, Cambridge Innovation Park, Denny End Road , Cambridge CB25 9PB , United Kingdom.
  • G J Conduit
    Intellegens , Eagle Labs , Chesterton Road , Cambridge CB4 3AZ , United Kingdom.