Machine learning sequence prioritization for cell type-specific enhancer design.

Journal: eLife
PMID:

Abstract

Recent discoveries of extreme cellular diversity in the brain warrant rapid development of technologies to access specific cell populations within heterogeneous tissue. Available approaches for engineering-targeted technologies for new neuron subtypes are low yield, involving intensive transgenic strain or virus screening. Here, we present Specific Nuclear-Anchored Independent Labeling (SNAIL), an improved virus-based strategy for cell labeling and nuclear isolation from heterogeneous tissue. SNAIL works by leveraging machine learning and other computational approaches to identify DNA sequence features that confer cell type-specific gene activation and then make a probe that drives an affinity purification-compatible reporter gene. As a proof of concept, we designed and validated two novel SNAIL probes that target parvalbumin-expressing (PV+) neurons. Nuclear isolation using SNAIL in wild-type mice is sufficient to capture characteristic open chromatin features of PV+ neurons in the cortex, striatum, and external globus pallidus. The SNAIL framework also has high utility for multispecies cell probe engineering; expression from a mouse PV+ SNAIL enhancer sequence was enriched in PV+ neurons of the macaque cortex. Expansion of this technology has broad applications in cell type-specific observation, manipulation, and therapeutics across species and disease models.

Authors

  • Alyssa J Lawler
    Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, United States.
  • Easwaran Ramamurthy
    Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, United States.
  • Ashley R Brown
    Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, United States.
  • Naomi Shin
    Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, United States.
  • Yeonju Kim
    Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, United States.
  • Noelle Toong
    Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, United States.
  • Irene M Kaplow
    Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, United States.
  • Morgan Wirthlin
    Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, United States.
  • Xiaoyu Zhang
    First Department of Infectious Diseases, The First Affiliated Hospital of China Medical University, Shenyang, China.
  • BaDoi N Phan
    Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, United States.
  • Grant A Fox
    Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, United States.
  • Kirsten Wade
    Department of Psychiatry, Translational Neuroscience Program, University of Pittsburgh, Pittsburgh, United States.
  • Jing He
    School of Management, Guilin University of Aerospace Technology, Guilin, China.
  • Bilge Esin Ozturk
    Department of Ophthalmology, University of Pittsburgh, Pittsburgh, United States.
  • Leah C Byrne
    Department of Neurobiology, University of Pittsburgh, Pittsburgh, United States.
  • William R Stauffer
    Department of Neurobiology, University of Pittsburgh, Pittsburgh, United States.
  • Kenneth N Fish
    Department of Psychiatry, Translational Neuroscience Program, University of Pittsburgh, Pittsburgh, United States.
  • Andreas R Pfenning
    Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, United States.