Active machine learning-driven experimentation to determine compound effects on protein patterns.

Journal: eLife

PMID: 26840049

Abstract

High throughput screening determines the effects of many conditions on a given biological target. Currently, to estimate the effects of those conditions on other targets requires either strong modeling assumptions (e.g. similarities among targets) or separate screens. Ideally, data-driven experimentation could be used to learn accurate models for many conditions and targets without doing all possible experiments. We have previously described an active machine learning algorithm that can iteratively choose small sets of experiments to learn models of multiple effects. We now show that, with no prior knowledge and with liquid handling robotics and automated microscopy under its control, this learner accurately learned the effects of 48 chemical compounds on the subcellular localization of 48 proteins while performing only 29% of all possible experiments. The results represent the first practical demonstration of the utility of active learning-driven biological experimentation in which the set of possible phenotypes is unknown in advance.

Authors

Armaghan W Naik

Computational Biology Department, Carnegie Mellon University, Pittsburgh, United States.
Joshua D Kangas

Computational Biology Department, Carnegie Mellon University, Pittsburgh, United States.
Devin P Sullivan

Computational Biology Department, Carnegie Mellon University, Pittsburgh, United States.
Robert F Murphy

Computational Biology Department, Center for Bioimage Informatics, and Departments of Biological Sciences, Biomedical Engineering and Machine Learning, Carnegie Mellon University, 5000 Forbes Avenue, Pittsburgh, PA, USA; Freiburg Institute for Advanced Studies and Faculty of Biology, Albert Ludwig University of Freiburg, Germany. Electronic address: murphy@cmu.edu.

Keywords

Automation, Laboratory Cell Physiological Phenomena Cytosol Drug Evaluation, Preclinical High-Throughput Screening Assays Microscopy Optical Imaging Proteins Supervised Machine Learning

External Resources

View on PubMed Access via DOI PubMed (26840049)

Active machine learning-driven experimentation to determine compound effects on protein patterns.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals