Predicting base editing outcomes with an attention-based deep learning algorithm trained on high-throughput target library screens.

Journal: Nature communications
PMID:

Abstract

Base editors are chimeric ribonucleoprotein complexes consisting of a DNA-targeting CRISPR-Cas module and a single-stranded DNA deaminase. They enable transition of C•G into T•A base pairs and vice versa on genomic DNA. While base editors have great potential as genome editing tools for basic research and gene therapy, their application has been hampered by a broad variation in editing efficiencies on different genomic loci. Here we perform an extensive analysis of adenine- and cytosine base editors on a library of 28,294 lentivirally integrated genetic sequences and establish BE-DICT, an attention-based deep learning algorithm capable of predicting base editing outcomes with high accuracy. BE-DICT is a versatile tool that in principle can be trained on any novel base editor variant, facilitating the application of base editing for research and therapy.

Authors

  • Kim F Marquart
    Institute of Pharmacology and Toxicology, University of Zurich, Zurich, Switzerland.
  • Ahmed Allam
    Department of Quantitative Biomedicine, University of Zurich, Zurich, Switzerland.
  • Sharan Janjuha
    Institute of Pharmacology and Toxicology, University of Zurich, Zurich, Switzerland.
  • Anna Sintsova
    Department of Quantitative Biomedicine, University of Zurich, Zurich, Switzerland.
  • Lukas Villiger
    Department of Pharmacology and Toxicology, University of Zurich, Zurich, Switzerland.
  • Nina Frey
    Institute of Molecular Health Sciences, ETH Zurich, Zurich, Switzerland.
  • Michael Krauthammer
    Yale School of Medicine, New Haven, CT.
  • Gerald Schwank
    Institute of Pharmacology and Toxicology, University of Zurich, Zurich, Switzerland. schwank@pharma.uzh.ch.