Determinants of Base Editing Outcomes from Target Library Analysis and Machine Learning.

Journal: Cell
PMID:

Abstract

Although base editors are widely used to install targeted point mutations, the factors that determine base editing outcomes are not well understood. We characterized sequence-activity relationships of 11 cytosine and adenine base editors (CBEs and ABEs) on 38,538 genomically integrated targets in mammalian cells and used the resulting outcomes to train BE-Hive, a machine learning model that accurately predicts base editing genotypic outcomes (R ≈ 0.9) and efficiency (R ≈ 0.7). We corrected 3,388 disease-associated SNVs with ≥90% precision, including 675 alleles with bystander nucleotides that BE-Hive correctly predicted would not be edited. We discovered determinants of previously unpredictable C-to-G, or C-to-A editing and used these discoveries to correct coding sequences of 174 pathogenic transversion SNVs with ≥90% precision. Finally, we used insights from BE-Hive to engineer novel CBE variants that modulate editing outcomes. These discoveries illuminate base editing, enable editing at previously intractable targets, and provide new base editors with improved editing capabilities.

Authors

  • Mandana Arbab
    Merkin Institute of Transformative Technologies in Healthcare, Broad Institute of Harvard and MIT, Cambridge, MA, USA.
  • Max W Shen
    Computational and Systems Biology Program, Massachusetts Institute of Technology, Cambridge, MA, USA.
  • Beverly Mok
    Merkin Institute of Transformative Technologies in Healthcare, Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA; Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA 02138, USA; Howard Hughes Medical Institute, Harvard University, Cambridge, MA 02138, USA.
  • Christopher Wilson
    Merkin Institute of Transformative Technologies in Healthcare, Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA; Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA 02138, USA; Howard Hughes Medical Institute, Harvard University, Cambridge, MA 02138, USA.
  • Żaneta Matuszek
    Merkin Institute of Transformative Technologies in Healthcare, Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA; Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA 02138, USA; Howard Hughes Medical Institute, Harvard University, Cambridge, MA 02138, USA; Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA 02138, USA.
  • Christopher A Cassa
    Division of Genetics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA.
  • David R Liu
    Merkin Institute of Transformative Technologies in Healthcare, Broad Institute of Harvard and MIT, Cambridge, MA, USA. drliu@fas.harvard.edu.