Prediction of on-target and off-target activity of CRISPR-Cas13d guide RNAs using deep learning.

Journal: Nature biotechnology
PMID:

Abstract

Transcriptome engineering applications in living cells with RNA-targeting CRISPR effectors depend on accurate prediction of on-target activity and off-target avoidance. Here we design and test ~200,000 RfxCas13d guide RNAs targeting essential genes in human cells with systematically designed mismatches and insertions and deletions (indels). We find that mismatches and indels have a position- and context-dependent impact on Cas13d activity, and mismatches that result in G-U wobble pairings are better tolerated than other single-base mismatches. Using this large-scale dataset, we train a convolutional neural network that we term targeted inhibition of gene expression via gRNA design (TIGER) to predict efficacy from guide sequence and context. TIGER outperforms the existing models at predicting on-target and off-target activity on our dataset and published datasets. We show that TIGER scoring combined with specific mismatches yields the first general framework to modulate transcript expression, enabling the use of RNA-targeting CRISPRs to precisely control gene dosage.

Authors

  • Hans-Hermann Wessels
    New York Genome Center, New York City, NY, USA.
  • Andrew Stirn
    New York Genome Center, New York City, NY, USA.
  • Alejandro Méndez-Mancilla
    New York Genome Center, New York City, NY, USA.
  • Eric J Kim
    Department of Computer Science, Columbia University, New York City, NY, USA.
  • Sydney K Hart
    New York Genome Center, New York City, NY, USA.
  • David A Knowles
  • Neville E Sanjana
    New York Genome Center, New York City, NY, USA. nsanjana@nygenome.org.