Active learning for computational chemogenomics.

Journal: Future medicinal chemistry

Published Date: Mar 6, 2017

Abstract

AIM: Computational chemogenomics models the compound-protein interaction space, typically for drug discovery, where existing methods predominantly either incorporate increasing numbers of bioactivity samples or focus on specific subfamilies of proteins and ligands. As an alternative to modeling entire large datasets at once, active learning adaptively incorporates a minimum of informative examples for modeling, yielding compact but high quality models. Results/methodology: We assessed active learning for protein/target family-wide chemogenomic modeling by replicate experiment. Results demonstrate that small yet highly predictive models can be extracted from only 10-25% of large bioactivity datasets, irrespective of molecule descriptors used.

Authors

Daniel Reker

Swiss Federal Institute of Technology (ETH), Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 4, 8093 Zürich, Switzerland.
Petra Schneider

Swiss Federal Institute of Technology (ETH), Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 4, CH-8093, Zurich, Switzerland.
Gisbert Schneider

Swiss Federal Institute of Technology (ETH), Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 4, CH-8093, Zurich, Switzerland.
J B Brown

Kyoto University Graduate School of Medicine, Center for Medical Education, Life Science Informatics Research Unit, Kyoto 606-8501, Japan.

Keywords

Binding Sites Computational Biology Computer Simulation Databases, Chemical Drug Discovery Genomics Ligands Machine Learning Models, Chemical Proteins

External Resources

View on PubMed Access via DOI PubMed (28263088)

Active learning for computational chemogenomics.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals