Kinetics of -induced gene silencing can be predicted from combinations of epigenetic and genomic features.

Journal: Genome research
PMID:

Abstract

To initiate X-Chromosome inactivation (XCI), the long noncoding RNA mediates chromosome-wide gene silencing of one X Chromosome in female mammals to equalize gene dosage between the sexes. The efficiency of gene silencing is highly variable across genes, with some genes even escaping XCI in somatic cells. A gene's susceptibility to -mediated silencing appears to be determined by a complex interplay of epigenetic and genomic features; however, the underlying rules remain poorly understood. We have quantified chromosome-wide gene silencing kinetics at the level of the nascent transcriptome using allele-specific Precision nuclear Run-On sequencing (PRO-seq). We have developed a Random Forest machine-learning model that can predict the measured silencing dynamics based on a large set of epigenetic and genomic features and tested its predictive power experimentally. The genomic distance to the locus, followed by gene density and distance to LINE elements, are the prime determinants of the speed of gene silencing. Moreover, we find two distinct gene classes associated with different silencing pathways: a class that requires -repeat A for silencing, which is known to activate the SPEN pathway, and a second class in which genes are premarked by Polycomb complexes and tend to rely on the B repeat in for silencing, known to recruit Polycomb complexes during XCI. Moreover, a series of features associated with active transcriptional elongation and chromatin 3D structure are enriched at rapidly silenced genes. Our machine-learning approach can thus uncover the complex combinatorial rules underlying gene silencing during X inactivation.

Authors

  • Lisa Barros de Andrade E Sousa
    Otto Warburg Laboratories, Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany.
  • Iris Jonkers
    Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14853, USA.
  • Laurène Syx
    Institut Curie, PSL Research University, CNRS UMR3215, INSERM U934, UPMC Paris-Sorbonne, 75005 Paris, France.
  • Ilona Dunkel
    Otto Warburg Laboratories, Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany.
  • Julie Chaumeil
    Institut Curie, PSL Research University, CNRS UMR3215, INSERM U934, UPMC Paris-Sorbonne, 75005 Paris, France.
  • Christel Picard
    Institut Curie, PSL Research University, CNRS UMR3215, INSERM U934, UPMC Paris-Sorbonne, 75005 Paris, France.
  • Benjamin Foret
    Institut Curie, PSL Research University, CNRS UMR3215, INSERM U934, UPMC Paris-Sorbonne, 75005 Paris, France.
  • Chong-Jian Chen
    Institut Curie, PSL Research University, CNRS UMR3215, INSERM U934, UPMC Paris-Sorbonne, 75005 Paris, France.
  • John T Lis
    Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, USA.
  • Edith Heard
    Institut Curie, PSL Research University, CNRS UMR3215, INSERM U934, UPMC Paris-Sorbonne, 75005 Paris, France.
  • Edda G Schulz
    Otto Warburg Laboratories, Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany.
  • Annalisa Marsico
    Max Planck Institute for Molecular Genetics, Ihnestr. 63-73, 14195 Berlin, Germany.