CellBox: Interpretable Machine Learning for Perturbation Biology with Application to the Design of Cancer Combination Therapy.

Journal: Cell systems
PMID:

Abstract

Systematic perturbation of cells followed by comprehensive measurements of molecular and phenotypic responses provides informative data resources for constructing computational models of cell biology. Models that generalize well beyond training data can be used to identify combinatorial perturbations of potential therapeutic interest. Major challenges for machine learning on large biological datasets are to find global optima in a complex multidimensional space and mechanistically interpret the solutions. To address these challenges, we introduce a hybrid approach that combines explicit mathematical models of cell dynamics with a machine-learning framework, implemented in TensorFlow. We tested the modeling framework on a perturbation-response dataset of a melanoma cell line after drug treatments. The models can be efficiently trained to describe cellular behavior accurately. Even though completely data driven and independent of prior knowledge, the resulting de novo network models recapitulate some known interactions. The approach is readily applicable to various kinetic models of cell biology. A record of this paper's Transparent Peer Review process is included in the Supplemental Information.

Authors

  • Bo Yuan
    New Use Agriculture and Natural Plant Products Program, Department of Plant Biology and Center for Agricultural Food Ecosystems, Institute of Food, Nutrition & Health, Rutgers University, 59 Dudley Road, New Brunswick, NJ, 08901, USA.
  • Ciyue Shen
    Department of Cell Biology, Harvard Medical School, Boston, MA, USA; cBio Center, Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA; Broad Institute, Cambridge, MA, USA. Electronic address: c_shen@g.harvard.edu.
  • Augustin Luna
    Computational Biology Branch, National Library of Medicine, Bethesda, MD, United States.
  • Anil Korkut
    Department of Bioinformatics & Computational Biology, the University of Texas M D Anderson Cancer Center, Houston, TX, USA.
  • Debora S Marks
    Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA; Broad Institute of Harvard and MIT, Cambridge, MA 02139, USA. Electronic address: debbie@hms.harvard.edu.
  • John Ingraham
    MIT Computer Science & Artificial Intelligence Laboratory, Boston, MA, USA.
  • Chris Sander
    Computational Biology Center, Memorial Sloan-Kettering Cancer Center, New York, 10065 NY; and arne@bioinfo.se debbie@hms.harvard.edu cccsander@gmail.com.