Cell Reprogramming Requires Silencing of a Core Subset of Polycomb Targets


Transcription factor (TF)–induced reprogramming of somatic cells into induced pluripotent stem cells (iPSC) is associated with genome-wide changes in chromatin modifications. Polycomb-mediated histone H3 lysine-27 trimethylation (H3K27me3) has been proposed as a defining mark that distinguishes the somatic from the iPSC epigenome. Here, we dissected the functional role of H3K27me3 in TF–induced reprogramming through the inactivation of the H3K27 methylase EZH2 at the onset of reprogramming. Our results demonstrate that, surprisingly, the establishment of functional iPSC proceeds despite global loss of H3K27me3. iPSC lacking EZH2 efficiently silenced the somatic transcriptome and differentiated into tissues derived from the three germ layers. Remarkably, the genome-wide analysis of H3K27me3 in Ezh2 mutant iPSC revealed the retention of this mark on a highly selected group of Polycomb targets enriched for developmental regulators controlling the expression of lineage-specific genes. Erasure of H3K27me3 from these targets led to a striking impairment in TF–induced reprogramming. These results indicate that PRC2-mediated H3K27 trimethylation is required on a highly selective core of Polycomb targets whose repression enables TF–dependent cell reprogramming.

Published in the journal: PLoS Genet 9(2): e32767. doi:10.1371/journal.pgen.1003292
Category: Research Article
doi: https://doi.org/10.1371/journal.pgen.1003292

Summary

Introduction

The realization that the expression of a few transcription factors can reassign cell fate has been a paradigm-shifting insight for biology and medicine, from the pioneering conversion of fibroblasts into myoblasts [1] to the induction of pluripotency in adult somatic cells [2]. In particular, for the derivation of induced pluripotent stem cells (iPSC), the enticing medical opportunities have, during the past few years, focused most efforts on improving the efficiency and safety of transcription factor (TF)-induced reprogramming, and on defining the molecular and functional equivalence between iPSC and embryonic stem cells (ESC). Substantial progress has been made in the characterization of distinct stages of the reprogramming process (reviewed in [3]) as well as of its key features, such as the mesenchymal-to-epithelial transition [4], [5] and the importance of cell-cycle progression [6]–[11]. Yet we still lack mechanistic insight into how the somatic gene expression program is silenced in order to enable the reacquisition of the pluripotent state. In particular, the functional relevance of defined chromatin modifications has only recently started to be explored [12]–[15], especially for those histone marks that orchestrate genome programming during development.

Genome-wide analyses have started to provide a detailed description of the chromatin changes that underlie TF-induced reprogramming to pluripotency. The very initial stages of the process entail the widespread acquisition of dimethylation of lysine-4 of histone H3 (H3K4me2) at promoters and enhancers of pluripotency genes that are, however, reactivated only at later stages, indicating that this modification primes but does not directly trigger gene activation [16]. Conversely, analyses of established iPSC at the end of the reprogramming process revealed that changes in Polycomb-mediated trimethylation of lysine-27 of histone H3 (H3K27me3) represent a key feature that distinguishes the fibroblast from the iPSC epigenomes [17].

The Polycomb axis is organized around two families of protein complexes, Polycomb Repressive Complex 1 (PRC1) and Polycomb Repressive Complex 2 (PRC2) (reviewed in [18]). Both contain enzymatic activities that catalyze, respectively, monoubiquitylation of lysine-119 on histone H2A (H2AK119Ub) (carried out by RING1B and, to a lesser extent, RING1A) and di- and tri-methylation of lysine-27 of histone H3 (catalyzed by EZH2 and, to a lesser extent, EZH1) [18]. H3K27me3 constitutes a docking site for the chromodomain of CBX proteins, members of PRC1, leading to a cascade model for Polycomb action, in which PRC2 deposits H3K27me3 that then recruits PRC1 to enable H2AK119Ub [18]. Gene silencing by PRC2 is required for normal development and differentiation ([19]; reviewed in [20]). In the mouse, inactivation of Ezh2 or of the essential non-enzymatic components of the complex, SUZ12 and EED, results in severe developmental failures in vivo and in varying degrees of impairment during ESC differentiation in vitro [21]–[27]. Thus, the finding that H3K27me3 constitutes the most significant chromatin mark that distinguishes somatic from iPSC epigenomes supports the notion that TF-induced reprogramming may represent the reverse trajectory of normal development and posits for this mark a key role in the repression of the somatic gene expression program. Furthermore, a hallmark feature of the ESC epigenome, fully shared with iPSC, is the presence of bivalent H3K4me3/H3K27me3 chromatin domains [28], [29]. These hold the promoters of key developmental regulator genes in a plastic state of minimal expression, poised for either full activation or definitive repression in a lineage-specific manner. Ablation of PRC2 components in ESC leads to the misexpression of many of these developmental targets [21], [22], indicating that bivalency is a critical feature of ESC pluripotency and predicting that its re-establishment is a necessary condition in iPSC reprogramming.
Here we provide functional validation of the relevance of H3K27me3 in the reacquisition of pluripotency, through the conditional inactivation of Ezh2 resulting in bulk depletion of H3K27me3 at the onset of TF-induced reprogramming. Our results demonstrate that, surprisingly, global H3K27me3 is dispensable for the reacquisition of the pluripotent state. Critically, however, we find that upon Ezh2 inactivation, H3K27me3 is both retained and de novo acquired on a specific core of Polycomb targets involved in lineage determination and transcriptional regulation. This highly selective silencing of the somatic gene program is catalyzed by an alternative PRC2 and its inhibition impairs reprogramming, thus uncovering a critical but highly circumscribed role for H3K27me3 in TF-induced reprogramming.

Results

Derivation of iPSC from mouse embryonic fibroblasts upon Ezh2 inactivation

In order to investigate the function of H3K27 trimethylation (H3K27me3) in TF-induced cell reprogramming, we derived embryonic fibroblasts (MEF) from mutant mice harboring a conditional Ezh2 knock-out allele carrying loxP sites (Ezh2^fl) flanking exons coding for the catalytic SET domain [30] (Figure 1 and Table 1). Experimental mice also carried an Oct4-GFP knock-in reporter gene [31] to monitor endogenous Oct4 gene re-activation during reprogramming. We expressed the Oct4, Klf4, c-Myc and Sox2 reprogramming factors from a single doxycycline-regulated bicistronic lentiviral vector (STEMCCA), in which each cistron codes for a fusion of two factors which are then released through, respectively, F2A and E2A self-cleaving peptide signals [32]. As depicted in Figure 1A, our reprogramming protocol entailed: i) treatment of experimental (Ezh2^fl/fl; Oct4-GFP) and control (Ezh2^+/fl; Oct4-GFP) MEF with cell-permeable TAT-Cre recombinase to inactivate Ezh2; ii) infection with STEMCCA and reverse tetracycline transactivator (rtTA) lentiviruses, followed by replating at clonal density, switch to a chemically-defined embryonic stem cell (ESC) medium (KSR) and administration of doxycycline for 23 days; iii) withdrawal of doxycycline to select cells that re-expressed the endogenous pluripotency factors; and iv) isolation, clonal propagation and molecular and functional characterization of individual iPSC clones. As controls, we used interchangeably iPSC from Ezh2^fl/+ MEF treated with TAT-Cre (referred to as Ezh2^ΔSET/+ iPSC) as well as iPSC from Ezh2^+/+ or Ezh2^fl/fl MEF that were not exposed to TAT-Cre (referred to as Ezh2^+/+ and Ezh2^fl/fl iPSC).
Upon verification of comparable efficiency in the infection with STEMCCA of control and mutant MEF (Figure S1A), we analyzed the efficiency of reprogramming following Ezh2 inactivation by comparing the number of primary iPSC colonies assessed through staining for alkaline phosphatase (AP) 7 days after doxycycline removal (Figure 1B). Starting from two different batches of MEF, we found no significant difference in the number of iPSC colonies generated from mutant (Ezh2^ΔSET/ΔSET) and control (Ezh2^ΔSET/+) fibroblasts (Figure 1B, 1C). As genome-wide chromatin analyses have postulated a critical role for H3K27me3 in the resetting of transcriptional programs during iPSC derivation [17], we tested whether this unexpected finding resulted from selective reprogramming of Ezh2^fl/fl MEF that had escaped Cre-mediated recombination, thereby preserving H3K27me3-dependent reprogramming proficiency. We ruled out this possibility as the majority of iPSC clones analyzed carried the SET-deleted Ezh2 allele regardless of the genotype (Figure S1B).

**Fig. 1. Derivation and biochemical analysis of iPSC upon conditional *Ezh2* inactivation.**

**Tab. 1. Mass spectrometry analysis: H3K27me3 levels below the limit of detection in *Ezh2^ΔSET/ΔSET* iPSCs.**

Representative iPSC clones of either genotype were analyzed by Western blot and found to be devoid of functional EZH2 and with undetectable H3K27me3 (Figure 1D). Levels of H3K27 dimethylation (H3K27me2) were also decreased in mutant iPSC clones, whereas H3K27 monomethylation (H3K27me1) remained unaltered (Figure 1D). Furthermore, global levels of the other major repressive histone modification, histone H3K9 trimethylation (H3K9me3), remained unaltered upon Ezh2 inactivation (Figure S1C).

Global loss of H3K27me3 was confirmed by high performance liquid chromatography (HPLC) coupled to tandem mass spectrometry (MS/MS) analysis of the histone H3 fraction purified from control and mutant iPSC clones. Specifically, we determined the relative abundance of the peaks corresponding to the various combinations of modifications harbored by the H3 peptide spanning lysine-27 through arginine-40 (H3^27–40). A significant change in the abundance and distribution of the various forms of methylated H3^27–40 was observed (Figure 1E). We then carried out a detailed MS/MS fragmentation analysis to pinpoint methylation at specific residues, which indicated that H3K27me3 was undetectable in Ezh2^ΔSET/ΔSET iPSC clones, confirming that changes in the relative abundance of methylated H3^27–40 are mainly attributable to loss of this modification (Table 1 and Figure S1E).

Finally, we measured by qRT-PCR the expression levels, in both mutant and control iPSC, of the exogenously provided reprogramming factors, to exclude that reprogramming in the absence of H3K27me3 had selected iPSC clones with a pronounced leakiness of the doxycycline-inducible transgenes, whose sustained expression enabled the maintenance of H3K27me3-depleted iPSC. As shown in Figure S1D, we excluded this possibility since EZH2-proficient and EZH2-deficient iPSC showed equivalent, minimal levels of transgene expression following doxycycline withdrawal.

Self-renewal and pluripotency of Ezh2^ΔSET/ΔSET iPSC

Having determined that Ezh2 inactivation in MEF is compatible with TF-induced cell reprogramming, we performed a comprehensive functional characterization of representative control and mutant iPSC clones. Ezh2^ΔSET/ΔSET and Ezh2^ΔSET/+ control MEF yielded iPSC colonies: i) with distinctive iPSC/ESC morphology (Figure 2A, left panel), ii) that stained positive for AP (Figure 2A, middle panel); and iii) that had reactivated the endogenous Oct4 gene (as assessed by GFP fluorescence, Figure 2A, right panel). iPSC clones of either genotype had the same percentage of cells co-expressing the pluripotency markers OCT4 and SSEA1, as measured by flow cytometry (Figure 2B). Next we assessed control and mutant iPSC clones for the two cardinal features that define the pluripotent state: self-renewal and the ability to differentiate into cell types of the three germ layers.

**Fig. 2. Characterization of pluripotency in iPSC clones reprogrammed upon *Ezh2* inactivation.**

Self-renewal is currently best assessed by growing ESC and iPSC under chemically defined conditions that entail the dual inhibition (2i) of mitogen activated protein kinase (MAPK) and glycogen synthase kinase 3 (GSK3) signaling, in the presence of leukemia inhibitory factor (LIF) (commonly referred to as 2i/LIF medium). These highly selective culture conditions were shown to capture the ground state of pluripotency exhibited by cells of the epiblast inner cell mass (ICM) [33]; importantly, they were also shown to selectively promote the full reprogramming of partially reprogrammed cells to the authentic pluripotent state [34]. As shown in Figure 2C, Ezh2 mutant and control (Ezh2^+/+) iPSC clones grew readily in 2i/LIF and have been cultivated for over 20 passages. Finally, we tested the ability of Ezh2-proficient (Ezh2^+/+) and mutant (Ezh2^ΔSET/ΔSET) iPSC to differentiate along the three embryonic lineages through the formation of teratomas. iPSC clones of either genotype (2 clones for each genotype) injected into NOD/SCID Il2rγc^−/− immunodeficient mice gave rise within three to four weeks to teratomas harboring terminally differentiated cell types derived from the three germ layers. Differentiation was assessed by combining hematoxylin/eosin with immunohistochemical stainings for lineage-specific markers (desmin for the mesodermal, S-100 for the neurectodermal and cytokeratin for the endodermal and ectodermal lineages, respectively) (Figure 2D).

Next, we asked to what extent the transcriptome of Ezh2 mutant iPSC had been reset correctly during reprogramming. As shown in Figure 2E and Figure S2, a stringent analysis of 4 independent control and mutant iPSC clones (t-test, FDR < 0.05) revealed that transcriptomes were indistinguishable from each other and equally divergent from MEF-specific ones. We validated this result through qRT-PCR on a panel of well-established genes associated with pluripotency, stemness and lineage specification (n = 7, 32 and 50 genes, respectively). Figure 2F shows a heat map representation of the expression levels (expression data of all genes are provided in Table S1), with iPSC clones of both genotypes clustering together with ESC and sharply distinguished from MEF.

Bulk H3K27me3 is dispensable from the onset of reprogramming

TAT-Cre-mediated inactivation of Ezh2 resulted in a threefold decrease in global H3K27me3 levels by the onset of the reprogramming process (Figure 3A). We therefore asked whether the unexpected ability to reprogram despite inactivation of Ezh2 was due to residual H3K27me3 that could still have ensured, during the very first days of reprogramming, a sufficient degree of repression of lineage-specific genes. To this end, we aimed at erasing the H3K27me3 mark completely by dilution, through serial passage of TAT-Cre treated MEF before the start of reprogramming. In MEF, however, PRC2 is a direct repressor of Cdkn2a, a locus encoding three key cell cycle regulators (p16, p19 and p15) whose activation promotes senescence [35]. Importantly, expression of this locus, and in particular of p19/Arf, was shown to hinder iPSC reprogramming [9]. Thus, to prevent senescence driven by Cdkn2a de-repression following Ezh2 inactivation (Figure S3A), we resorted to compound primary tail tip fibroblasts (TTF) harboring both the conditional Ezh2 allele and the Ink4/Arf knock-out allele [36]. We subjected TTF to two sequential rounds of TAT-Cre transduction and passaged them 5 times before infection with STEMCCA and doxycycline administration (Figure 3B). Following confirmation that H3K27me3 was undetectable by Western blot on the day of infection with the reprogramming lentivirus (Figure 3C), and that efficiency of infection was equivalent for control and mutant TTF (Figure S3B), we went on to measure the efficiency of TF-dependent reprogramming under these most stringent conditions. As shown in Figure 3D, AP staining revealed that TTF starting off with undetectable H3K27me3 and controls yielded iPSC colonies with similar efficiency. This indicates that bulk levels of H3K27me3 are not required to prime silencing during the first days of reprogramming.
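The logic of erasing a mark by dilution can be illustrated with a back-of-the-envelope model. The sketch below assumes, purely for illustration, that no new H3K27me3 is deposited after Ezh2 inactivation, that the mark is not actively removed, and that pre-existing modified histones are split evenly between daughter cells, so that the residual fraction halves at each division. The function names and the idealized halving are assumptions rather than measurements from this study, and any residual methylase activity is ignored.

```python
import math


def residual_fraction(divisions: int) -> float:
    """Fraction of the starting mark left after `divisions` cell divisions.

    Assumes pure passive dilution: the mark halves at every division and
    nothing is deposited or actively removed (an idealization).
    """
    if divisions < 0:
        raise ValueError("divisions must be non-negative")
    return 0.5 ** divisions


def divisions_for_fold_decrease(fold: float) -> float:
    """Idealized number of divisions needed for a `fold`-fold decrease."""
    if fold < 1:
        raise ValueError("fold must be >= 1")
    return math.log2(fold)


if __name__ == "__main__":
    # Residual fraction over the first few idealized divisions.
    for n in range(6):
        print(n, residual_fraction(n))
    # Divisions corresponding to a threefold decrease under this model.
    print(divisions_for_fold_decrease(3.0))
```

Under these assumptions a threefold decrease corresponds to roughly 1.6 divisions, which is why additional passaging would be expected to push the mark further toward undetectable levels.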

**Fig. 3. Establishment of iPSC clones upon genome-wide erasure of H3K27me3 at the onset of reprogramming.**

Ezh2 depleted iPSC retain H3K27me3 on selected Polycomb targets

The finding that bulk H3K27me3 was apparently dispensable for reprogramming even when erased at the onset of the process was at odds with its purported role as the critical mark that distinguishes MEF from iPSC epigenomes as well as, more broadly, with its pivotal role in the maintenance of gene repression through embryogenesis and adulthood [17]. We therefore asked whether, upon Ezh2 inactivation, residual levels of H3K27me3 below the threshold of Western blot and mass spectrometry sensitivity could still be deposited on selected targets. To address this point we performed chromatin immunoprecipitation coupled to high-throughput sequencing (ChIP-seq). We generated ChIP-seq profiles from two independent control and mutant iPSC clones for both H3K27me3 and H3K27me2 with highly specific monoclonal antibodies. Consistent with our prediction, the higher sensitivity of ChIP-seq did reveal the presence of residual H3K27me3 in mutant iPSC clones. Specifically, the mark was retained on 2477 genes (with an enriched region overlapping a ±5 kb interval around the transcriptional start site, TSS), comprising close to half of all H3K27me3 targets retrieved from wild type iPSC. Mutant clones showed a preferential retention of H3K27me3 proximal to the TSS of target genes. In comparison to the full complement of PRC2 targets in control iPSC, mutant clones displayed a clear tripartition in the genome-wide distribution of H3K27me3 and H3K27me2 marks (Figure 4A): 47% of genes retained both H3K27me3 and H3K27me2, 39% of genes were marked only by H3K27me2 and 13.7% of genes lost both marks. Importantly, the complement of genes enriched for H3K27me3 in mutant iPSC clones was almost entirely comprised within the group of H3K27me3 targets found in control iPSC, thus excluding a significant redistribution of the mark to new targets in cells reprogrammed in the absence of functional Ezh2 (Figure 4B).
Furthermore, we found only a small overlap in the distribution of H3K27me2-only targets between control and mutant iPSC clones. Instead, the complement of genes marked only by H3K27me2 in mutant iPSC clones was to a good extent comprised within the subset of genes that are H3K27 trimethylated in control iPSC (Figure 4C). Thus, we conclude that during reprogramming in the absence of Ezh2, i) H3K27me3 is selectively retained on a subset of the targets that are normally H3K27 trimethylated in iPSC, where it coexists with H3K27me2; ii) H3K27me2 is lost at targets that are normally carrying only this mark in iPSC; and iii) H3K27me2 is retained in 86% of the targets that are normally H3K27 trimethylated in iPSC, coexisting, in half of these, with residual H3K27me3. We validated these findings through individual ChIP-qPCR on genes selected among those that were downregulated in the MEF to iPSC transition (Figure 4D and Table S2). We confirmed the sharp distinction between a group of genes that retained both H3K27me3 and H3K27me2 and those that only retained H3K27me2, irrespective of the level of transcriptional repression that was equivalent for the two groups between control and mutant iPSC (Figure 4D and Table S2). Interestingly, we found a stronger enrichment for PRC2 on the genes that selectively retained the H3K27me3 mark, likely reflecting its ability to act as docking site for the EED subunit of PRC2.
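The gene-level bookkeeping behind this tripartition can be sketched as follows. This is a minimal illustration, assuming that a gene counts as marked when an enriched region overlaps the ±5 kb window around its TSS, and that each H3K27me3 target of control iPSC is then assigned to a class according to the marks it carries in mutant iPSC. All function names, gene names and coordinates are hypothetical, and peak calling and annotation as performed in the study are not reproduced here.

```python
from typing import Dict, Iterable, Set, Tuple

Region = Tuple[str, int, int]  # (chromosome, start, end), closed interval


def marked_genes(tss: Dict[str, Tuple[str, int]],
                 regions: Iterable[Region],
                 window: int = 5000) -> Set[str]:
    """Genes whose TSS +/- `window` bp overlaps at least one enriched region."""
    region_list = list(regions)
    marked = set()
    for gene, (chrom, pos) in tss.items():
        lo, hi = pos - window, pos + window
        for r_chrom, start, end in region_list:
            # Two closed intervals overlap if each starts before the other ends.
            if r_chrom == chrom and start <= hi and end >= lo:
                marked.add(gene)
                break
    return marked


def classify_targets(control_me3: Set[str],
                     mutant_me3: Set[str],
                     mutant_me2: Set[str]) -> Dict[str, Set[str]]:
    """Split control H3K27me3 targets by the marks they carry in the mutant."""
    classes: Dict[str, Set[str]] = {
        "me3+/me2+": set(), "me3-/me2+": set(),
        "me3-/me2-": set(), "me3+/me2-": set(),
    }
    for gene in control_me3:
        has_me3 = gene in mutant_me3
        has_me2 = gene in mutant_me2
        key = ("me3+" if has_me3 else "me3-") + "/" + ("me2+" if has_me2 else "me2-")
        classes[key].add(gene)
    return classes


if __name__ == "__main__":
    # Hypothetical toy annotation and enriched regions.
    tss = {"GeneA": ("chr1", 10000), "GeneB": ("chr1", 50000), "GeneC": ("chr2", 10000)}
    me3 = marked_genes(tss, [("chr1", 12000, 13000)])
    me2 = marked_genes(tss, [("chr1", 9000, 9500), ("chr1", 54000, 60000)])
    print(classify_targets(set(tss), me3, me2))
```

The fourth class (H3K27me3 without H3K27me2) is kept only so that every target is accounted for; the text above describes three populated classes.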

**Fig. 4. Genome-wide distribution of H3K27me3 in *Ezh2^ΔSET/ΔSET* iPSC revealed through ChIP–seq.**

In the absence of EZH2, EZH1 is the only other known enzymatic component of PRC2, although its enzymatic activity is twentyfold weaker than that of EZH2 [37]. Our findings therefore support a model in which, in the presence of EZH1-only PRC2 complexes, genes that are normally more enriched for PRC2 will be more likely to accumulate H3K27me3 because the weak activity of EZH1 has a higher chance of resulting in productive trimethylation. In turn this will further enhance PRC2 recruitment, establishing a feed-forward loop that guarantees residual levels of H3K27me3 on selected targets. We tested this hypothesis by analyzing, in control iPSC, the average enrichment for H3K27me3 in genes that, in mutant iPSC, respectively retain or lose this mark. As shown in Figure 4E and Figure S4B, genes that retain H3K27me3 in mutant iPSC indeed showed a significantly stronger enrichment for this mark in control iPSC, both in terms of maximum height and average read density (p-value < 2.2e-16, two-tailed t-test). In almost all cases, however, targets that retain H3K27me3 in mutant iPSC also feature H3K27me2 (contrary to what happens in control iPSC), indicating that EZH1 does not reach H3K27 trimethylation: 1) in all cells of the population, 2) on both alleles of the same cell, or 3) on both histone H3 tails of the same nucleosome. The non-randomness of H3K27me3 marking in mutant iPSC was confirmed by the virtually identical genome-wide distribution of H3K27me3 and H3K27me2 in two independent iPSC mutant clones (Figure S4D), as well as by the striking similarity between the distributions of, respectively, H3K27me3 in control and H3K27me2 in mutant iPSC (Figure S4C).
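A comparison of this kind, between the control-iPSC enrichment of genes that retain the mark and of genes that lose it, can be sketched as a two-sample t statistic. The text specifies only a two-tailed t-test; the unequal-variance (Welch) form used below is an assumption, as are the function name and the toy values. The sketch stops at the statistic and its degrees of freedom, since turning them into a p-value requires the t distribution from a statistics package.

```python
import math
from statistics import mean, variance
from typing import Sequence, Tuple


def welch_t(a: Sequence[float], b: Sequence[float]) -> Tuple[float, float]:
    """Welch's t statistic and Welch-Satterthwaite degrees of freedom.

    `a` and `b` are per-gene enrichment values for the two groups
    (for example, genes retaining vs. losing the mark).
    """
    if len(a) < 2 or len(b) < 2:
        raise ValueError("each group needs at least two values")
    # Squared standard errors of the two group means (sample variance / n).
    va = variance(a) / len(a)
    vb = variance(b) / len(b)
    se2 = va + vb
    if se2 == 0:
        raise ValueError("both groups have zero variance")
    t = (mean(a) - mean(b)) / math.sqrt(se2)
    df = se2 ** 2 / (va ** 2 / (len(a) - 1) + vb ** 2 / (len(b) - 1))
    return t, df


if __name__ == "__main__":
    # Hypothetical read-density values, for illustration only.
    retained = [8.1, 9.4, 7.7, 10.2, 8.8]
    lost = [3.2, 4.1, 2.9, 3.8, 4.4]
    print(welch_t(retained, lost))
```

A large positive statistic would indicate higher enrichment in the retained group; the two-tailed p-value then follows from the t distribution with the returned degrees of freedom.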

Targets of H3K27me3 in Ezh2 mutant iPSC are enriched for transcriptional regulators and developmental determinants

Finally, we asked whether the H3K27me tripartition of the mutant iPSC epigenome identified functionally relevant classes of genes. To this end we analyzed the three groups of PRC2 targets differentially methylated in mutant iPSC (H3K27me3⁺/H3K27me2⁺; H3K27me3⁻/H3K27me2⁺; and H3K27me3⁻/H3K27me2⁻) in terms of the Gene Ontology (GO) of their members (Figure 4F). Remarkably, the subset of genes that retained H3K27me3 in mutant iPSC showed a distinct GO profile when compared to the other two classes, with a clear enrichment for categories linked to development, cellular differentiation and transcriptional regulation. This unique signature was confirmed when the GO enrichment in H3K27me3-retaining genes was probed against the sole complement of all Polycomb targets, further underscoring the functional partition of the Polycomb epigenome. The H3K27me3⁻/H3K27me2⁺ and H3K27me3⁻/H3K27me2⁻ subsets were instead strongly enriched for genes involved in metabolic homeostasis and cellular transport.

Remarkably, while genes that retained H3K27me3 in mutant iPSC comprised 47% of the physiological PRC2 epigenome, they were disproportionately enriched for genes preferentially expressed in MEF versus iPSC (comprising 69.2% of the total, Figure 4G). This suggested that retention of H3K27me3 in Ezh2 mutant iPSC occurred on selected targets to enable silencing of the MEF-specific gene expression program. Specifically, we found 550 genes, among those that retained H3K27me3 in mutant iPSC, that were downregulated in the MEF to iPSC transition (Figure S5A). Among these, 175 acquire H3K27me3 de novo in the MEF to iPSC transition and are enriched in GO categories related to transcriptional regulation (Figure S5B). We hypothesized the presence within this group of one or more MEF-specific master regulators that were silenced during reprogramming through H3K27me3. We thus performed a master regulator analysis (MRA) to identify the TFs whose targets (predicted on the basis of their consensus binding sites) are over-represented in the differentially expressed genes (DEGs) between MEF and iPSC. To this end we applied the transcription factor binding site over-representation analysis algorithm [38]. This analysis uncovered four TFs, Egr1, Ets1, Prrx1 and Prrx2, whose binding sites were significantly over-represented (FDR < 5%) among the 175 DEGs that acquire H3K27me3 de novo in the transition from MEF to iPSC, and which are predicted to control the bulk (84%) of the genes up-regulated in MEF with respect to iPSC (Figure S5C). Notably, Egr1 and Ets1 transcript levels showed a strong reduction within the first week of reprogramming, corroborating their role in the silencing of the MEF-specific program (Figure S5D).
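The idea of over-representation can be illustrated with a one-sided hypergeometric test, which asks how likely it is to find at least the observed number of predicted targets of a TF in a gene list drawn at random from a background. This is a generic sketch and not the algorithm of [38]; the function name, the parameter names and the numbers in the example are hypothetical, and multiple-testing (FDR) correction is not included.

```python
from math import comb


def hypergeom_enrichment_p(population: int, successes: int,
                           draws: int, observed: int) -> float:
    """One-sided p-value P(X >= observed) for a hypergeometric X.

    population: background genes considered
    successes:  background genes predicted to be targets of the TF
    draws:      size of the gene list of interest
    observed:   predicted targets found in that list
    """
    if not (0 <= successes <= population and 0 <= draws <= population):
        raise ValueError("successes and draws must lie within the population")
    if not (0 <= observed <= draws):
        raise ValueError("observed must lie between 0 and draws")
    upper = min(successes, draws)
    # Sum the probability mass from the observed count up to the maximum.
    tail = sum(comb(successes, k) * comb(population - successes, draws - k)
               for k in range(observed, upper + 1))
    return tail / comb(population, draws)


if __name__ == "__main__":
    # Hypothetical numbers: 2000 background genes, 300 predicted targets,
    # a list of 175 genes of which 60 are predicted targets.
    print(hypergeom_enrichment_p(2000, 300, 175, 60))
```

In practice one such p-value would be computed per TF and the resulting set corrected for multiple testing before applying a threshold such as FDR < 5%.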

PRC2 catalyzes residual H3K27 trimethylation in Ezh2 mutant iPSC

Previous reports have highlighted partial redundancy between Ezh2 and Ezh1 in several cell types [27], [39]. To determine whether an alternative PRC2 complex is responsible for H3K27me3 of target genes also in Ezh2-mutant iPSC, we knocked down the essential PRC2 component EED in Ezh2-mutant iPSC. Infection of two Ezh2-mutant iPSC clones with lentiviruses expressing two independent shRNAs against Eed produced a substantial reduction of EED protein levels (Figure 5A). Ezh2-mutant iPSC with stable Eed knock-down (or infected with a control virus) were subjected to ChIP-qPCR assays on a representative set of target genes retaining the H3K27me3 mark in mutant iPSC. As shown in Figure 5B, levels of H3K27me3 decreased substantially in two independent Ezh2-mutant iPSC clones upon Eed knock-down. Importantly, loss of H3K27me3 led to the reactivation of target gene expression (Figure 5B, right panel). These results indicate that an alternative EZH1-containing PRC2 complex deposits H3K27me3 on a selected subset of Polycomb targets during reprogramming of Ezh2-mutant iPSC, to promote stable gene repression.

**Fig. 5. Effect of PRC2 inactivation on established *Ezh2^ΔSET/ΔSET* iPSC clones and TF–induced reprogramming.**

Given the existence of an alternative PRC2 complex that functions in Ezh2 mutant iPSC, we determined the effect of Eed knock-down on TTF reprogramming. Specifically, Ezh2-proficient Cdkn2a^−/− TTF were infected first with a lentivirus expressing an shRNA against Eed, and a few days later with the STEMCCA reprogramming virus (Figure S6A). As a control, TTF were infected first with a control virus followed by STEMCCA. Quantification of iPSC colonies by AP staining performed two weeks after the beginning of doxycycline treatment revealed an over 80% reduction in reprogramming efficiency upon Eed inactivation (Figure 5E), indicating that PRC2-mediated H3K27me3 deposition on a selected subset of Polycomb targets is necessary for correct fibroblast reprogramming.

Discussion

Trimethylation on lysine-27 of histone H3 by Polycomb proteins has been characterized as a critical mechanism that ensures the cell-type specific establishment and maintenance of gene silencing in multicellular organisms [19], [20], [40], [41]. From ESC to tissue-specific stem cells, this chromatin mark has been associated with the timed repression of developmental regulator genes, which underlies in turn the physiological unfolding of cell fate transitions [19]–[22], [42]–[44]. Consistent with its role in the physiology of genome programming during development, the Polycomb axis has been hypothesized to be a key player in the reprogramming of somatic cell epigenomes. In particular TF-induced reprogramming, through its relatively high efficiency coupled with the stability of the reprogrammed state and the ability to study defined stages of the process, enabled the first insight into the chromatin changes that underlie cell fate reassignment on a genome-wide scale. Among 16,500 promoters that were analyzed by ChIP-on-chip, roughly 1000 differed sharply in their H3K4me3 and H3K27me3 status when MEF were compared to ESC [17]. Nearly all of these signature genes displayed H3K4me3 and H3K27me3 patterns that were identical between ESC and iPSC, thus providing the first evidence that the transition from MEF to iPSC recapitulated to a large extent, in addition to the transcriptome, also the epigenome of ESC. Importantly, however, a comparison of methylation maps between MEF, ESC and iPSC revealed that H3K4me3 changed considerably less between the three cell types than H3K27me3, whose distribution was instead remarkably different between MEF, on the one hand, and ESC and iPSC on the other. This observation suggested that changes in H3K27me3 were more relevant to reprogramming than those in H3K4me3, and led to the proposal of a central role for the Polycomb pathway in the conversion of MEF into iPSC [17].
A similar approach was used to investigate the earliest stages of the reprogramming process, uncovering the acquisition of H3K4 dimethylation (H3K4me2) at several pluripotency loci and developmental regulators that precedes their transcriptional activation [16]. More recently the inactivation of the H3K27 demethylase UTX revealed the essential role of H3K27 demethylation in TF-induced reprogramming, through the reactivation of a handful of pluripotency genes during the initial stages of the process [13]. Similarly, Polycomb affected the ability of ESC to reassign fate in cell fusion-based short term reprogramming [45]. Thus, while these data reveal the importance, in the early phase of reprogramming, of the reactivation of the pluripotency network through widespread H3K4me2 acquisition and loss of H3K27 methylation, the relevance of H3K27 trimethylase activity in silencing somatic gene expression remains ill defined.

Here we provide functional validation for the role of H3K27me3 in TF-induced reprogramming. Our findings reveal that functional inactivation of EZH2, the main catalyst of H3K27 trimethylation within PRC2, is surprisingly compatible with TF-induced cell reprogramming. In its absence, reprogramming proceeds through the deposition of H3K27me3 by the alternative EZH1-PRC2 on a highly selected core of Polycomb targets controlling lineage determination and transcriptional regulation. Moreover, our data show that correct reprogramming requires an exquisitely low amount of H3K27me3, below the limit of detection of mass spectrometry. Bioinformatics analysis of ChIP-seq data revealed that H3K27me3 is retained in Ezh2 mutant iPSC on about 2500 genes, representing close to half of all PRC2 targets in iPSC and ESC. These include 2190 genes that are already marked by H3K27me3 in MEF, with the remaining ones acquiring the modification during the MEF to iPSC transition, thus indicating that residual PRC2 is able through cell division both to propagate the mark at pre-existing loci and to redistribute it to new targets. The correct reprogramming guaranteed by selective retention of H3K27me3 was also consistent with the normal differentiation of Ezh2-mutant iPSC in teratoma assays, defining the core subset of functionally relevant Polycomb targets. Similar results were also obtained when Ezh2 was inactivated in established iPSC (data not shown). These data exclude a functional difference between iPSC reprogrammed ab initio in the absence of Ezh2 and iPSC acutely depleted of the enzyme following reacquisition of pluripotency. In turn, this is consistent with the observation that H3K27me3 is partially retained upon Ezh2 inactivation.

Importantly, fully independent mutant iPSC clones were indistinguishable in their patterns of residual H3K27me3, indicating that retention of the mark either follows an instructive mechanism or, if stochastic, is subjected to a strong selection pressure that limits the range of residual targets. The former possibility appears more likely on the basis of two related observations: first, that residual H3K27me3 targets in mutant iPSC are preferentially associated with CpG islands, and second, that these residual H3K27me3 targets also correspond to genes with higher levels of H3K27me3 in normal iPSC. Combined, this evidence suggests that the basic mode of PRC2 recruitment is conserved in Ezh2-mutant cells, allowing prevalent targets to recruit enough mutant PRC2 for EZH1 to catalyze productive H3K27 trimethylation.

In mutant iPSC, further inhibition of PRC2 through knock-down of its critical subunit EED resulted in loss of H3K27me3 and re-expression of developmental regulator genes, indicating that maintenance and/or de novo acquisition of H3K27me3 is critical to ensure silencing of the MEF-specific gene expression program. Consistently, Eed knock-down in MEF prevented reprogramming. This result highlights the essential contribution of H3K27me3 to the repression of developmental regulator genes that enables successful cell fate reassignment.

Materials and Methods

Mice

Ezh2^fl, OCT4-GFP and Cdkn2a^−/− mice have been previously described [30], [31], [36]. Compound mutants were generated by intercrossing individual strains. Primers used for genotyping are listed in Table S5.

Derivation of primary MEF and TTF

MEF were produced from E13.5 compound mutants following standard procedures. Briefly, embryos were harvested from pregnant females and MEF were isolated by enzymatic dissociation of tissues. Cells were cultured in MEF medium (DMEM high-glucose, 10% fetal calf serum, 2 mM L-glutamine, 50 units/ml penicillin, 50 µg/ml streptomycin) at 37°C, 5% CO2 for up to passage 3 before TAT-Cre transduction and/or infection with reprogramming viruses. For the preparation of adult tail tip fibroblasts (TTF), 0.5 cm tail tips from 8- to 16-week-old compound mutants were biopsied aseptically, subjected to enzymatic dissociation and cultured in MEF medium at 37°C, 5% CO2 for one week.

Lentivirus production

Lentiviral constructs coding for doxycycline-inducible STEMCCA, reverse tetracycline transactivator (rtTA) and the Zs-green reporter gene have been previously described [32]. Production of lentiviral particles was performed as previously described [32]. Briefly, plasmids expressing viral proteins GAG, POL, REV, TAT, and the vesicular stomatitis virus envelope glycoprotein (VSV-G) were co-transfected with either STEMCCA or rtTA vectors into semi-confluent 293T cells by calcium phosphate precipitation in the presence of 25 µM chloroquine. Supernatants of transfected cells were collected every 12 hours over 2 consecutive days and concentrated by centrifugation. Viral particles were resuspended in MEF medium and either used fresh for infection or frozen at −80°C.

Infection of MEF and derivation of Oct4-GFP; Ezh2^fl iPSC

MEF at passage 1 to 3 carrying either one (control) or both (mutant) Ezh2 floxed alleles were seeded at a density of 2×10⁵ cells/well in a 6-well tissue culture dish. 24 hours later, MEF were transduced in a 1∶1 PBS/DMEM/GlutaMAX (Invitrogen) solution with 50 µg/ml of recombinant TAT-Cre recombinase [46]. 16 hours after transduction, medium was replaced, and MEF were infected with the STEMCCA and rtTA lentiviruses. 24 hours following infection with STEMCCA and rtTA, MEF were harvested and seeded at clonal density on a mitomycin-treated MEF feeder layer. One day later, MEF medium was replaced with ESC medium (high glucose DMEM, 15% knockout serum replacement-Invitrogen-, 2 mM L-glutamine, 50 units/ml penicillin, 50 µg/ml streptomycin, 0.1 mM non-essential amino acids, 1/500 home-made leukaemia inhibitory factor, 0.1 mM 2-β-mercaptoethanol) supplemented with 1 µg/ml doxycycline to induce expression of the reprogramming factors. Doxycycline-supplemented medium was replaced every other day for 23 days. Four days after doxycycline withdrawal, iPSC colonies were stained for alkaline phosphatase (AP) and counted to assess the efficiency of reprogramming, or picked and expanded on a mitomycin-treated MEF feeder layer to establish individual iPSC clones. To measure the efficiency of infection, an aliquot of MEF of each genotype was infected in parallel with a combination of STEMCCA, rtTA and a lentivirus expressing ZsGreen under the constitutive EF1α promoter. Three days after infection, cells were analyzed by flow cytometry. Established iPSC clones were grown in 2i/LIF medium (DMEM/F-12+GlutaMAX, Neurobasal Medium, N2 Supplement-Invitrogen-, B27 supplement-Invitrogen-, 1.5 mM Hepes, 2 mM glutamine, 0.1 mM β-mercaptoethanol, 1/500 home-made LIF, 3 µM CHIR99021, 1 µM PD0325901).

Infection of TTF and derivation of Cdkn2a^−/−; Ezh2^fl iPSC

Control Cdkn2a^−/−Ezh2^fl/+ and experimental Cdkn2a^−/−Ezh2^fl/fl TTFs at passage 1 were treated with 50 µg/ml of TAT-Cre recombinase and passaged 3 days later. At 80% confluency, cells underwent a second round of TAT-Cre transduction and were further expanded to allow the dilution of H3K27me3, before infection with STEMCCA and rtTA lentiviruses, as described above.

RNA interference

Puromycin-resistant lentiviral constructs expressing Eed shRNAs were purchased from Open Biosystems (TRCN0000095719, TRCN0000095721, TRCN0000095722). Lentiviruses were produced as described above. Cdkn2a^−/− TTFs were infected with viruses expressing one of the shRNAs (TRCN0000095719, TRCN0000095721 or TRCN0000095722) or an empty pLKO.1 vector and selected with 2 µg/ml of puromycin. After 6 days of selection, resistant TTFs were reprogrammed by infection with Tet-STEMCCA as described above. Ezh2^ΔSET/ΔSET iPSC clones were infected with viruses expressing either shRNA TRCN0000095719 or TRCN0000095721 or, as control, an empty pLKO.1 vector and expanded for two weeks in 0.7 µg/ml of puromycin. RNA, proteins and chromatin were collected on the same day.

Alkaline phosphatase staining

Alkaline phosphatase staining was performed using the Leukocyte Alkaline Phosphatase kit (Sigma Aldrich) following manufacturer's instructions.

Flow cytometry

One million cells were stained with phycoerythrin-conjugated anti-mouse/human SSEA1 (eBioscience, 12-8813) in FACS buffer (1% bovine serum albumin (BSA), 0.05% NaN₃ in PBS), washed and acquired on a FACSCalibur instrument (BD Biosciences). Data were analyzed using FlowJo software (Tree Star Inc.).

Teratoma assay

iPSC were cultured in 2i/LIF medium for 3 passages. 2×10⁶ cells were injected subcutaneously into NOD-SCID Il2rγc^−/− mice. 2–3 weeks after injection, mice were sacrificed and tumors were isolated and fixed in 4% formaldehyde for immunohistochemistry.

RNA extraction and cDNA synthesis

RNA was extracted using TRIzol Reagent (Invitrogen) and purified with RNeasy mini kit (QIAGEN) following manufacturer's instructions. cDNA was prepared using SuperScript VILO cDNA Synthesis Kit (Invitrogen) following manufacturer's instructions.

Quantitative gene expression analysis

Quantitative real-time PCR analysis for the expression of fibroblast-specific genes on MEF, 2 Ezh2^+/+ iPSC clones, 2 Ezh2^ΔSET/ΔSET iPSC clones, and 1 E14tg2a ESC line was performed using pre-designed Taqman gene expression assays following manufacturer's instructions (Applied Biosystems: Cd14 Mm00438094_g1, Cebpa Mm00514283_s1, Dok1 Mm00438532_m1, Emp2 Mm00801709_m1, Enpp1 Mm00501097_m1, Fads3 Mm00517643_m1, Bmp1 Mm00802220_m1, Ndrg1 Mm00440447_m1, Snai2 Mm00441531_m1, Sox9 Mm00448840_m1, Tram1l1 Mm00525200_s1, Vamp5 Mm00444144_m1, Cap2 Mm00482645_m1, Vcan Mm01283063_m1, Ptprm Mm00436095_m1, Lox Mm00495386_m1, Pftk1 Mm00448111_m1). Quantitative real-time PCR analysis for the expression of the STEMCCA vector was performed using a custom-designed Taqman gene expression assay, previously described [32].

cDNA arrays

iPSC were cultured in 2i/LIF medium for 3 passages before RNA extraction. Expression was assessed using the TaqMan Stem Cell Pluripotency Array (Applied Biosystems 4385363) following the manufacturer's instructions. dCT values were calculated by subtracting the geometric mean of the housekeeping genes (Actb, Raf1, Ctnnb1, Gapdh and Eef1a1) from each individual CT value. Hierarchical clustering and heat map were obtained using Gene Cluster 3.0 software (Michael Eisen, Copyright 1998–99 Stanford University; Michiel de Hoon, 2002 University of Tokyo, Human Genome Center).
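The dCT normalization described above can be illustrated with a minimal Python sketch. It assumes the usual convention dCT = CT(gene) − reference, where the reference is the geometric mean of the CT values of the five housekeeping genes; the function names and the CT values in the usage note are hypothetical and are not taken from the study.

```python
import math

# Housekeeping genes named in the text.
HOUSEKEEPING = ("Actb", "Raf1", "Ctnnb1", "Gapdh", "Eef1a1")


def geometric_mean(values):
    """Geometric mean of strictly positive numbers, computed in log space."""
    return math.exp(sum(math.log(v) for v in values) / len(values))


def delta_ct(ct_values):
    """Return dCT for every non-housekeeping gene.

    ct_values maps gene symbol -> CT value for one sample and must contain
    all housekeeping genes.
    Assumed convention: dCT = CT(gene) - geometric mean of housekeeping CTs.
    """
    reference = geometric_mean([ct_values[g] for g in HOUSEKEEPING])
    return {
        gene: ct - reference
        for gene, ct in ct_values.items()
        if gene not in HOUSEKEEPING
    }
```

For example, with hypothetical CT values of 20.0 for each housekeeping gene and 24.0 for a gene of interest, `delta_ct` returns a dCT of approximately 4.0 for that gene.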

Microarray analysis

Microarray analysis was performed on 4 Ezh2^+/+ iPSC clones, 4 Ezh2^ΔSET/ΔSET iPSC clones, 3 independent preparations of MEFs and one sample of E14tg2a ESC using Affymetrix Mouse Gene 1.0 ST arrays. Labeling, hybridization, and washing were performed according to Affymetrix guidelines. Data analysis was performed with R software version 2.15.0 (http://www.r-project.org) starting from raw data (CEL files). Data were normalized using the robust multichip analysis (RMA) algorithm [47] as implemented in [48] and made available through Bioconductor (http://www.bioconductor.org/). To perform differential expression analysis, we started from the log₂ normalized value for each gene. We then used a t-test to identify significant changes in gene expression between wild type and mutant samples, and a P value was calculated for each gene. A false discovery rate (FDR) procedure [49] was applied to correct for multiple testing. To identify differentially expressed genes (DEGs) we applied a threshold of 0.05 on the corrected P value, along with a cut-off of 1.5-fold change.
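The filtering step above (FDR-corrected P value below 0.05 together with a 1.5-fold change) can be sketched in Python as follows. The text cites an FDR procedure without naming the exact variant, so this sketch assumes the Benjamini-Hochberg step-up adjustment; the per-gene t-test P values and log₂ fold changes are assumed to be computed upstream, and all function names are hypothetical.

```python
import math


def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted P values in the input order."""
    m = len(p_values)
    # Indices of the P values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def select_degs(genes, log2_fold_changes, p_values, fdr=0.05, fold=1.5):
    """Keep genes with adjusted P < fdr and |fold change| >= fold.

    log2_fold_changes are differences of log2-normalized means
    (mutant minus wild type), so the fold cut-off is applied as log2(fold).
    """
    adjusted = benjamini_hochberg(p_values)
    min_abs_lfc = math.log2(fold)
    return [
        gene
        for gene, lfc, q in zip(genes, log2_fold_changes, adjusted)
        if q < fdr and abs(lfc) >= min_abs_lfc
    ]
```

Because the expression values are log₂-normalized, a 1.5-fold change corresponds to an absolute log₂ difference of about 0.585, which is why the cut-off is converted with `math.log2` rather than applied to the raw difference.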

Gene Ontology analysis

We used Cytoscape software [51] and the BiNGO plugin [50] to analyze Gene Ontology terms significantly enriched in the various gene sets. The enrichment for each term was tested using a hypergeometric test and P values were corrected using the FDR procedure. All terms with an FDR <0.01 were considered enriched.
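As an illustration of the hypergeometric enrichment test mentioned above, the following minimal Python sketch computes the one-sided (over-representation) P value for a single term. It is not BiNGO's implementation; the function and parameter names are hypothetical.

```python
from math import comb


def hypergeom_enrichment_p(population, annotated, sample, hits):
    """One-sided hypergeometric P value, P(X >= hits).

    population: number of genes in the reference set
    annotated:  reference genes annotated with the term
    sample:     number of genes in the tested gene set
    hits:       tested genes annotated with the term
    """
    upper = min(annotated, sample)
    total = comb(population, sample)
    # Sum the probability mass for hits, hits + 1, ..., upper.
    tail = sum(
        comb(annotated, k) * comb(population - annotated, sample - k)
        for k in range(hits, upper + 1)
    )
    return tail / total
```

With a reference set of 10 genes, 4 of them annotated, and a tested set of 3 genes containing 2 annotated ones, the P value is (36 + 4) / 120 = 1/3. In practice the resulting P values for all terms would then be corrected for multiple testing before applying the FDR threshold.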

Protein extraction and Western blot analysis

iPSC were cultured in 2i/LIF medium for 3 passages before protein extraction. Cell pellets were lysed with urea buffer (8 M Urea, 0.1 M NaH₂PO₄, 0.01 M Tris base diluted in water, pH 8.0) at room temperature for 30 minutes on a rotating wheel. Lysates were sonicated with a Bioruptor Sonication System (UCD200) (3 cycles of 30 seconds with one minute breaks, high power). Lysates were centrifuged at 13000 rpm for 15 minutes and supernatants were transferred to a new tube. Protein quantification was performed using Bio-Rad protein assay following manufacturer's instructions. For the detection of histone modifications 40 µg of total protein extracts were loaded onto a 12% acrylamide gel. For the measurement of EZH2 levels, 80 µg of total protein lysates were loaded onto an 8% acrylamide gel. Western blot was performed using standard procedures. Intensities of Western blot bands were determined using ImageJ software (rsbweb.nih.gov/ij/). Antibodies used for Western blot are listed in Table S3.

Mass spectrometry

A detailed description of core histone extraction and mass spectrometry analysis can be found in Text S1.

Histology and immunohistochemistry

Teratoma tissues were washed in PBS buffer for 30 minutes, fixed in 4% buffered formalin for 4 hours and paraffin embedded. Samples were processed in consecutive 3 µm thick sections and stained with haematoxylin and eosin. Immunostaining for desmin, protein S-100 and cytokeratin was performed using an automated immunostainer (Autostainer, DakoCytomation, Glostrup, Denmark) and a commercially available detection kit (DakoEnVision Plus-HRP), according to the manufacturer's instructions. Detailed information about antibodies and the immunostaining procedure is listed in Table S4. After immunostaining, sections were counterstained with 1% modified Harris haematoxylin, dehydrated and mounted. Standard reference positive and negative controls were run simultaneously. Haematoxylin and eosin and immunohistochemistry images were taken with an Olympus Upright BX 51 optical microscope equipped with a Nikon digital color camera. Digital images were processed with Adobe Photoshop CS3.

Chromatin immunoprecipitation

iPSC cultured in 2i/LIF medium were trypsinized and resuspended in 1% formaldehyde/PBS solution. Cross-linking was allowed to proceed for 10 min at room temperature and stopped by addition of glycine at a final concentration of 0.125 M, followed by an additional incubation for 5 min. Fixed cells were washed twice with PBS and resuspended in SDS buffer (SDS 0.5%, Tris-Cl pH 8.1 50 mM, NaCl 100 mM, EDTA pH 8 5 mM, NaN₃ 0.02%, diluted in ddH₂O) and stored at −80°C. SDS cell suspensions were thawed at room temperature using a water bath for 20 minutes, centrifuged at 2000 rpm and resuspended in ice cold IP buffer (SDS buffer/Triton dilution buffer = 2∶1; Triton dilution buffer: Triton X-100 5%, Tris-Cl pH 8.6 100 mM, NaCl 100 mM, EDTA pH 8 5 mM, NaN₃ 0.02%, diluted in ddH₂O). Fixed cells were sonicated yielding genomic DNA fragments with a bulk size of 400 to 1000 bp. Sonicated material was centrifuged at 13000 rpm for 30 minutes at 4°C and supernatants were transferred into a new tube. Chromatin was quantified using Bio-Rad protein assay following manufacturer's instructions. For immunoprecipitation using antibodies against H3 and specific histone modifications, 100 µg of chromatin were used. For immunoprecipitation of SUZ12, 500 µg of chromatin were used. For each immunoprecipitation assay, chromatin was diluted in 1 ml IP buffer and 10 µl were taken and stored at −80°C as 1% of input. Primary antibodies were incubated overnight at 4°C on a rotating platform. To each sample, 50 µl of 50% slurry of protein A-Sepharose (Amersham) beads were added for 2–3 h. Beads were washed three times in 150 mM wash buffer and one time in 500 mM wash buffer (Triton-X 1%, NaCl 150 mM or 500 mM, Tris-Cl pH 8.0 20 mM, SDS 0.1%, EDTA pH8 2 mM diluted in water). Beads (and input samples) were resuspended in 120 µl of 0.1% SDS, 0.1 M NaHCO3 buffer and de-cross-linked at 65°C overnight. 
DNA was purified using a PCR purification kit (QIAGEN) following the manufacturer's instructions and eluted in 100 µl of water. 1 µl of eluted material was used for each real-time quantitative PCR (qPCR) reaction. Quantitative real-time PCR analysis was performed in triplicate using Fast SYBR Green master mix (Applied Biosystems) in a 7500 Fast Real-Time PCR instrument (Applied Biosystems). Antibodies used for ChIP are listed in Table S3. Primers used for ChIP-qPCR are listed in Table S5.
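The text does not state how ChIP-qPCR signals were quantified. One common approach, shown here purely as an assumption, is the percent-of-input method, which uses the 1% input sample described above. The sketch assumes 100% amplification efficiency and that input and immunoprecipitated DNA were eluted and loaded in equal volumes; the function name is hypothetical.

```python
import math


def percent_input(ct_input, ct_ip, input_fraction=0.01):
    """Percent-of-input enrichment from qPCR CT values.

    ct_input:       CT measured on the stored input sample
    ct_ip:          CT measured on the immunoprecipitated sample
    input_fraction: fraction of chromatin kept as input (1% in the text)

    Assumes perfect doubling per cycle (100% amplification efficiency).
    """
    # Correct the input CT to what 100% of the chromatin would have given.
    adjusted_input_ct = ct_input - math.log2(1.0 / input_fraction)
    return 100.0 * 2.0 ** (adjusted_input_ct - ct_ip)
```

Under these assumptions, an immunoprecipitated sample with the same CT as the 1% input corresponds to a recovery of 1% of input.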

ChIP–seq

ChIP-seq analysis was performed on two Ezh2^+/+ iPSC clones and two Ezh2^ΔSET/ΔSET iPSC clones. 10 ng of sheared DNA obtained before (input) and after chromatin IP using anti-H3K27me3 and anti-H3K27me2 specific antibodies were prepared with the Illumina ChIPSeq sample prep kit and multiplexing oligonucleotide kit. DNA libraries were quantified using a high sensitivity Chip on Bioanalyzer (Agilent) and diluted to a concentration of 16 pM. Diluted libraries were used for cluster generation and sequencing on a HiSeq 2000 instrument (Illumina) following manufacturer's protocol.

Bioinformatics analysis of ChIP–seq data

Mapping of reads

After filtering for artifacts with FASTX-Toolkit v.0.0.13, reads were aligned to the mm9 genome using Bowtie v.0.12.7 [52], allowing up to two mismatches per read and discarding multiply-aligning reads. Each ChIP-seq experiment had a total of 18 to 28 million uniquely mapped reads. Aligned reads of a MEF H3K27me3 ChIP-seq dataset were downloaded from the Gene Expression Omnibus (accession id GSM656316) and processed similarly to the other datasets (see below).

Identification of enriched domains

We used two different strategies to identify enriched domains using, respectively, MACS v.1.4.0 [53] and RSEG v.0.4.8 [54]. For MACS analysis, we disabled the shifting model and the dynamic lambda (which are inappropriate for histone modifications) and used a stringent p-value threshold of 10e-10. A more generous identification of enriched regions was done using RSEG, which was specifically designed to identify extended regions marked by histone marks (RSEG was independently benchmarked in [55]). We used Hideaki's empirical method to determine bin size, 20 iterations for the training, and provided deadzones (regions where no read can map uniquely) as described in the documentation. Domains called as unconfident were discarded. In both approaches, we compared the signal over the input.

On the basis of a calibration performed in ESC and of biological and technical considerations (see Text S1 for further details), we applied the MACS method to identify H3K27me3 enriched genomic regions, while RSEG was used to identify domains of enriched H3K27me2.

Annotation

For both H3K27me3 and H3K27me2, we assigned enriched regions to genes by determining for each RefSeq transcript, whether an enriched domain overlapped with a +/−5 kb region around the transcription start site. We also tested a +/−2.5 kb interval, which resulted in a marginal decrease in the number of gene annotations. Transcripts were then collapsed to gene symbols using BioMart [56], and only the genes marked in both biological replicates (Figure S4D) were retained for further analysis.
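The annotation rule described above can be sketched in Python as follows. This is an illustrative reimplementation under stated assumptions, not the pipeline used in the study: transcripts are assumed to be already mapped to gene symbols, coordinates are treated as closed intervals, and all function names are hypothetical.

```python
def overlaps(a_start, a_end, b_start, b_end):
    """True if closed intervals [a_start, a_end] and [b_start, b_end] intersect."""
    return a_start <= b_end and b_start <= a_end


def marked_genes(transcripts, domains, flank=5000):
    """Genes with at least one transcript whose TSS window overlaps a domain.

    transcripts: iterable of (gene_symbol, chromosome, tss_position)
    domains:     iterable of (chromosome, start, end) enriched regions
    flank:       half-width of the window around the TSS (5 kb in the text)
    """
    by_chrom = {}
    for chrom, start, end in domains:
        by_chrom.setdefault(chrom, []).append((start, end))
    genes = set()
    for gene, chrom, tss in transcripts:
        lo, hi = tss - flank, tss + flank
        if any(overlaps(lo, hi, s, e) for s, e in by_chrom.get(chrom, [])):
            # Collapsing to a set merges multiple transcripts of one gene.
            genes.add(gene)
    return genes


def marked_in_all_replicates(transcripts, replicate_domains, flank=5000):
    """Keep only genes marked in every biological replicate."""
    gene_sets = [marked_genes(transcripts, d, flank) for d in replicate_domains]
    return set.intersection(*gene_sets) if gene_sets else set()
```

Calling `marked_in_all_replicates` with `flank=2500` reproduces the narrower interval that was also tested. For genome-scale inputs, the linear scan over domains would normally be replaced by an interval index, which is omitted here for brevity.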

Heat map and clustering of H3K27me3 and H3K27me2 distributions around the TSS were produced with Seqminer [57] using K-means linear normalization.

Venn diagrams showing overlap between different datasets were prepared using BioVenn (by Tim Hulsen http://www.cmbi.ru.nl/cdd/biovenn/).

Ethics statement

This project involved the minimum number of mice required to fulfil the research objectives. Experiments involving animals were performed in accordance with the Italian Laws (D.L.vo 116/92 and following additions), which enforce EU 86/609 Directive (Council Directive 86/609/EEC of 24 November 1986 on the approximation of laws, regulations and administrative provisions of the Member States regarding the protection of animals used for experimental and other scientific purposes). The authority responsible for ensuring compliance with the provisions of EU 86/609 Directive is the Italian Ministry of Health. Our mouse facility is authorized by the Ministry of Health (DM N°86/2005, 17/06/2005) and a veterinarian is responsible for the well-being of the experimental animals.

An Institutional Animal Care and Use Committee (IACUC) of the IFOM Foundation-FIRC Institute of Molecular Oncology Foundation supervises the ethical conduct of research involving non-human vertebrates.

Supporting Information

References

1. Davis RL, Weintraub H, Lassar AB (1987) Expression of a single transfected cDNA converts fibroblasts to myoblasts. Cell 51: 987–1000.

2. Takahashi K, Yamanaka S (2006) Induction of pluripotent stem cells from mouse embryonic and adult fibroblast cultures by defined factors. Cell 126: 663–676.

3. Plath K, Lowry WE (2011) Progress in understanding reprogramming to the induced pluripotent state. Nat Rev Genet 12: 253–265.

4. Li R, Liang J, Ni S, Zhou T, Qing X, et al. (2010) A mesenchymal-to-epithelial transition initiates and is required for the nuclear reprogramming of mouse fibroblasts. Cell Stem Cell 7: 51–63.

5. Samavarchi-Tehrani P, Golipour A, David L, Sung HK, Beyer TA, et al. (2010) Functional genomics reveals a BMP-driven mesenchymal-to-epithelial transition in the initiation of somatic cell reprogramming. Cell Stem Cell 7: 64–77.

6. Banito A, Rashid ST, Acosta JC, Li S, Pereira CF, et al. (2009) Senescence impairs successful reprogramming to pluripotent stem cells. Genes Dev 23: 2134–2139.

7. Hong H, Takahashi K, Ichisaka T, Aoi T, Kanagawa O, et al. (2009) Suppression of induced pluripotent stem cell generation by the p53–p21 pathway. Nature 460: 1132–1135.

8. Kawamura T, Suzuki J, Wang YV, Menendez S, Morera LB, et al. (2009) Linking the p53 tumour suppressor pathway to somatic cell reprogramming. Nature 460: 1140–1144.

9. Li H, Collado M, Villasante A, Strati K, Ortega S, et al. (2009) The Ink4/Arf locus is a barrier for iPS cell reprogramming. Nature 460: 1136–1139.

10. Marion RM, Strati K, Li H, Murga M, Blanco R, et al. (2009) A p53-mediated DNA damage response limits reprogramming to ensure iPS cell genomic integrity. Nature 460: 1149–1153.

11. Utikal J, Polo JM, Stadtfeld M, Maherali N, Kulalert W, et al. (2009) Immortalization eliminates a roadblock during cellular reprogramming into iPS cells. Nature 460: 1145–1148.

12. Liang G, He J, Zhang Y (2012) Kdm2b promotes induced pluripotent stem cell generation by facilitating gene activation early in reprogramming. Nat Cell Biol 14: 457–466.

13. Mansour AA, Gafni O, Weinberger L, Zviran A, Ayyash M, et al. (2012) The H3K27 demethylase Utx regulates somatic and germ cell epigenetic reprogramming. Nature

14. Onder TT, Kara N, Cherry A, Sinha AU, Zhu N, et al. (2012) Chromatin-modifying enzymes as modulators of reprogramming. Nature 483: 598–602.

15. Wang T, Chen K, Zeng X, Yang J, Wu Y, et al. (2011) The histone demethylases Jhdm1a/1b enhance somatic cell reprogramming in a vitamin-C-dependent manner. Cell Stem Cell 9: 575–587.

16. Koche RP, Smith ZD, Adli M, Gu H, Ku M, et al. (2011) Reprogramming factor expression initiates widespread targeted chromatin remodeling. Cell Stem Cell 8: 96–105.

17. Maherali N, Sridharan R, Xie W, Utikal J, Eminli S, et al. (2007) Directly reprogrammed fibroblasts show global epigenetic remodeling and widespread tissue contribution. Cell Stem Cell 1: 55–70.

18. Morey L, Helin K (2010) Polycomb group protein-mediated repression of transcription. Trends Biochem Sci 35: 323–332.

19. Mohn F, Weber M, Rebhan M, Roloff TC, Richter J, et al. (2008) Lineage-specific polycomb targets and de novo DNA methylation define restriction and potential of neuronal progenitors. Mol Cell 30: 755–766.

20. Mohn F, Schubeler D (2009) Genetics and epigenetics: stability and plasticity during cellular differentiation. Trends Genet 25: 129–136.

21. Boyer LA, Plath K, Zeitlinger J, Brambrink T, Medeiros LA, et al. (2006) Polycomb complexes repress developmental regulators in murine embryonic stem cells. Nature 441: 349–353.

22. Lee TI, Jenner RG, Boyer LA, Guenther MG, Levine SS, et al. (2006) Control of developmental regulators by Polycomb in human embryonic stem cells. Cell 125: 301–313.

23. Leeb M, Pasini D, Novatchkova M, Jaritz M, Helin K, et al. (2010) Polycomb complexes act redundantly to repress genomic repeats and genes. Genes Dev 24: 265–276.

24. O'Carroll D, Erhardt S, Pagani M, Barton SC, Surani MA, et al. (2001) The polycomb-group gene Ezh2 is required for early mouse development. Mol Cell Biol 21: 4330–4336.

25. Pasini D, Bracken AP, Hansen JB, Capillo M, Helin K (2007) The polycomb group protein Suz12 is required for embryonic stem cell differentiation. Mol Cell Biol 27: 3769–3779.

26. Pasini D, Bracken AP, Jensen MR, Lazzerini Denchi E, Helin K (2004) Suz12 is essential for mouse development and for EZH2 histone methyltransferase activity. EMBO J 23: 4061–4071.

27. Shen X, Liu Y, Hsu YJ, Fujiwara Y, Kim J, et al. (2008) EZH1 mediates methylation on histone H3 lysine 27 and complements EZH2 in maintaining stem cell identity and executing pluripotency. Mol Cell 32: 491–502.

28. Azuara V, Perry P, Sauer S, Spivakov M, Jorgensen HF, et al. (2006) Chromatin signatures of pluripotent cell lines. Nat Cell Biol 8: 532–538.

29. Bernstein BE, Mikkelsen TS, Xie X, Kamal M, Huebert DJ, et al. (2006) A bivalent chromatin structure marks key developmental genes in embryonic stem cells. Cell 125: 315–326.

30. Su IH, Basavaraj A, Krutchinsky AN, Hobert O, Ullrich A, et al. (2003) Ezh2 controls B cell development through histone H3 methylation and Igh rearrangement. Nat Immunol 4: 124–131.

31. Lengner CJ, Camargo FD, Hochedlinger K, Welstead GG, Zaidi S, et al. (2007) Oct4 expression is not required for mouse somatic stem cell self-renewal. Cell Stem Cell 1: 403–415.

32. Sommer CA, Stadtfeld M, Murphy GJ, Hochedlinger K, Kotton DN, et al. (2009) Induced pluripotent stem cell generation using a single lentiviral stem cell cassette. Stem Cells 27: 543–549.

33. Ying QL, Wray J, Nichols J, Batlle-Morera L, Doble B, et al. (2008) The ground state of embryonic stem cell self-renewal. Nature 453: 519–523.

34. Silva J, Barrandon O, Nichols J, Kawaguchi J, Theunissen TW, et al. (2008) Promotion of reprogramming to ground state pluripotency by signal inhibition. PLoS Biol 6: e253. doi:10.1371/journal.pbio.0060253

35. Bracken AP, Kleine-Kohlbrecher D, Dietrich N, Pasini D, Gargiulo G, et al. (2007) The Polycomb group proteins bind throughout the INK4A-ARF locus and are disassociated in senescent cells. Genes Dev 21: 525–530.

36. Serrano M, Lee H, Chin L, Cordon-Cardo C, Beach D, et al. (1996) Role of the INK4a locus in tumor suppression and cell mortality. Cell 85: 27–37.

37. Margueron R, Li G, Sarma K, Blais A, Zavadil J, et al. (2008) Ezh1 and Ezh2 maintain repressive chromatin through different mechanisms. Mol Cell 32: 503–518.

38. Hansen M, Gerds TA, Nielsen OH, Seidelin JB, Troelsen JT, et al. (2012) pcaGoPromoter–an R package for biological and regulatory interpretation of principal components in genome-wide gene expression data. PLoS ONE 7: e32394. doi:10.1371/journal.pone.0032394

39. Ezhkova E, Lien WH, Stokes N, Pasolli HA, Silva JM, et al. (2011) EZH1 and EZH2 cogovern histone H3K27 trimethylation and are essential for hair follicle homeostasis and wound repair. Genes Dev 25: 485–498.

40. Simon JA, Kingston RE (2009) Mechanisms of polycomb gene silencing: knowns and unknowns. Nat Rev Mol Cell Biol 10: 697–708.

41. Margueron R, Reinberg D (2011) The Polycomb complex PRC2 and its mark in life. Nature 469: 343–349.

42. Ezhkova E, Pasolli HA, Parker JS, Stokes N, Su IH, et al. (2009) Ezh2 orchestrates gene expression for the stepwise differentiation of tissue-specific stem cells. Cell 136: 1122–1135.

43. Hirabayashi Y, Suzki N, Tsuboi M, Endo TA, Toyoda T, et al. (2009) Polycomb limits the neurogenic competence of neural precursor cells to promote astrogenic fate transition. Neuron 63: 600–613.

44. Testa G (2011) The time of timing: How Polycomb proteins regulate neurogenesis. Bioessays 33: 519–528.

45. Pereira CF, Piccolo FM, Tsubouchi T, Sauer S, Ryan NK, et al. (2010) ESCs require PRC2 to direct the successful reprogramming of differentiated cells toward pluripotency. Cell Stem Cell 6: 547–556.

46. Munst B, Patsch C, Edenhofer F (2009) Engineering cell-permeable protein. J Vis Exp

47. Irizarry RA, Hobbs B, Collin F, Beazer-Barclay YD, Antonellis KJ, et al. (2003) Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics 4: 249–264.

48. Gentleman RC, Carey VJ, Bates DM, Bolstad B, Dettling M, et al. (2004) Bioconductor: open software development for computational biology and bioinformatics. Genome Biol 5: R80.

49. Reiner A, Yekutieli D, Benjamini Y (2003) Identifying differentially expressed genes using false discovery rate controlling procedures. Bioinformatics 19: 368–375.

50. Maere S, Heymans K, Kuiper M (2005) BiNGO: a Cytoscape plugin to assess overrepresentation of gene ontology categories in biological networks. Bioinformatics 21: 3448–3449.

51. Smoot ME, Ono K, Ruscheinski J, Wang PL, Ideker T (2011) Cytoscape 2.8: new features for data integration and network visualization. Bioinformatics 27: 431–432.

52. Langmead B, Trapnell C, Pop M, Salzberg SL (2009) Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol 10: R25.

53. Zhang Y, Liu T, Meyer CA, Eeckhoute J, Johnson DS, et al. (2008) Model-based analysis of ChIP-Seq (MACS). Genome Biol 9: R137.

54. Song Q, Smith AD (2011) Identifying dispersed epigenomic domains from ChIP-Seq data. Bioinformatics 27: 870–871.

55. Micsinai M, Parisi F, Strino F, Asp P, Dynlacht BD, et al. (2012) Picking ChIP-seq peak detectors for analyzing chromatin modification experiments. Nucleic Acids Res 40: e70.

56. Kinsella RJ, Kahari A, Haider S, Zamora J, Proctor G, et al. (2011) Ensembl BioMarts: a hub for data retrieval across taxonomic space. Database (Oxford) 2011: bar030.

57. Ye T, Krebs AR, Choukrallah MA, Keime C, Plewniak F, et al. (2011) seqMINER: an integrated ChIP-seq data interpretation platform. Nucleic Acids Res 39: e35.