A universal deep-learning model for zinc finger design enables transcription factor reprogramming.

Journal: Nature biotechnology
Published Date:

Abstract

CysHis zinc finger (ZF) domains engineered to bind specific target sequences in the genome provide an effective strategy for programmable regulation of gene expression, with many potential therapeutic applications. However, the structurally intricate engagement of ZF domains with DNA has made their design challenging. Here we describe the screening of 49 billion protein-DNA interactions and the development of a deep-learning model, ZFDesign, that solves ZF design for any genomic target. ZFDesign is a modern machine learning method that models global and target-specific differences induced by a range of library environments and specifically takes into account compatibility of neighboring fingers using a novel hierarchical transformer architecture. We demonstrate the versatility of designed ZFs as nucleases as well as activators and repressors by seamless reprogramming of human transcription factors. These factors could be used to upregulate an allele of haploinsufficiency, downregulate a gain-of-function mutation or test the consequence of regulation of a single gene as opposed to the many genes that a transcription factor would normally influence.

Authors

  • David M Ichikawa
    Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY, USA.
  • Osama Abdin
    Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada.
  • Nader Alerasool
    Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada.
  • Manjunatha Kogenaru
    Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY, USA.
  • April L Mueller
    Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY, USA.
  • Han Wen
    School of Ophthalmology and Optometry, Eye Hospital, Wenzhou Medical University, Wenzhou 325027, China.
  • David O Giganti
    Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY, USA.
  • Gregory W Goldberg
    Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY, USA.
  • Samantha Adams
    Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY, USA.
  • Jeffrey M Spencer
    Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY, USA.
  • Rozita Razavi
    Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada.
  • Satra Nim
    Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada.
  • Hong Zheng
    School of Pharmaceutical Sciences, Wenzhou Medical University, Wenzhou 325035, China.
  • Courtney Gionco
    Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY, USA.
  • Finnegan T Clark
    Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY, USA.
  • Alexey Strokach
    Department of Computer Science, University of Toronto, Toronto, ON M5S 3E1, Canada.
  • Timothy R Hughes
    Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario M5S 3E1, Canada. Program on Genetic Networks and Program on Neural Computation & Adaptive Perception, Canadian Institute for Advanced Research, Toronto, Ontario M5G 1Z8, Canada. Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 1A8, Canada.
  • Timothee Lionnet
    Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY, USA.
  • Mikko Taipale
    Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada.
  • Philip M Kim
    Terrence Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, ON M5S 1AS, Canada; Department of Computer Science, University of Toronto, Toronto, ON M5S 3G4, Canada; Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 1AS, Canada. Electronic address: pi@kimlab.org.
  • Marcus B Noyes
    Institute for Systems Genetics, NYU Grossman School of Medicine, New York, NY, USA. marcus.noyes@nyulangone.org.