A pan-CRISPR analysis of mammalian cell specificity identifies ultra-compact sgRNA subsets for genome-scale experiments.

Journal: Nature communications
PMID:

Abstract

A genetic knockout can be lethal to one human cell type while increasing growth rate in another. This context specificity confounds genetic analysis and prevents reproducible genome engineering. Genome-wide CRISPR compendia across most common human cell lines offer the largest opportunity to understand the biology of cell specificity. The prevailing viewpoint, synthetic lethality, occurs when a genetic alteration creates a unique CRISPR dependency. Here, we use machine learning for an unbiased investigation of cell type specificity. Quantifying model accuracy, we find that most cell type specific phenotypes are predicted by the function of related genes of wild-type sequence, not synthetic lethal relationships. These models then identify unexpected sets of 100-300 genes where reduced CRISPR measurements can produce genome-scale loss-of-function predictions across >18,000 genes. Thus, it is possible to reduce in vitro CRISPR libraries by orders of magnitude-with some information loss-when we remove redundant genes and not redundant sgRNAs.

Authors

  • Boyang Zhao
    Sqilline Health, Boston, MA.
  • Yiyun Rao
    Huck Institute for the Life Sciences, Pennsylvania State University, State College, PA, USA.
  • Scott Leighow
    Department of Biomedical Engineering, Pennsylvania State University, State College, PA, USA.
  • Edward P O'Brien
    Department of Chemistry, Pennsylvania State University, State College, PA, USA.
  • Luke Gilbert
    Department of Urology, University of California at San Francisco, San Francisco, CA, USA.
  • Justin R Pritchard
    Department of Biomedical Engineering, Pennsylvania State University, State College, PA, USA. jrp94@psu.edu.