S Phase Progression in Human Cells Is Dictated by the Genetic Continuity of DNA Foci

Download PDF České info

DNA synthesis must be performed with extreme precision to maintain genomic integrity. In mammalian cells, different genomic regions are replicated at defined times, perhaps to preserve epigenetic information and cell differentiation status. However, the molecular principles that define this S phase program are unknown. By analyzing replication foci within discrete chromosome territories during interphase, we show that foci which are active during consecutive intervals of S phase are maintained as spatially adjacent neighbors throughout the cell cycle. Using extended DNA fibers, we demonstrate that this spatial continuity of replication foci correlates with the genetic continuity of adjacent replicon clusters along chromosomes. Finally, we used bioinformatic tools to compare the structure of DNA foci with DNA domains that are seen to replicate during discrete time intervals of S phase using genome-wide strategies. Data presented show that a major mechanism of S phase progression involves the sequential synthesis of regions of the genome because of their genetic continuity along the chromosomal fiber.

Published in the journal: . PLoS Genet 6(4): e32767. doi:10.1371/journal.pgen.1000900
Category: Research Article
doi: https://doi.org/10.1371/journal.pgen.1000900

Summary

Introduction

DNA synthesis in eukaryotes must be performed with absolute precision as any defects compromise genetic integrity. In all eukaryotes, DNA is duplicated during S phase of the cell cycle, which is regulated to ensure that DNA synthesis is completed before mitosis can begin [1]–[3]. During synthesis, different regions of the genome are replicated at specific times [4]–[6], perhaps as a part of a fundamental mechanism that ensures the preservation of epigenetic information [7]. Within this timing program, chromatin within gene-rich chromosomal R-bands is known to begin early in S phase, before synthesis of heterochromatic G-bands takes place. This general structure can be revealed at low resolution, using cytological chromosome banding [8], and at higher resolution using genome-wide strategies [9]–[15].

Recent developments in genome-wide analysis have revolutionized our ability to define the structure of S phase in higher eukaryotes. However, detailed analysis of the replication program has been limited by our understanding of the molecular mechanisms that control how specific origins are used at different times. In mammalian cells, recent studies have shown that local chromatin environments define a general preference for origins that are activated during early S-phase [10]–[15]. Regions that engage synthesis at the onset of S phase frequently have a locally high gene density and correspondingly high levels of RNA synthesis. In addition, more detailed analysis is beginning to explore how local chromatin features such as the distribution of CpG islands [14] and local chromatin accessibility [15] contribute to patterns of origin selection.

Single cell studies provide an alternative strategy for understanding S phase progression. Active sites of DNA synthesis can be revealed as replication foci [16],[17], which contain groups of replicons that are replicated together within dedicated replication factories [18]; such replicon clusters typically contain 3–5 replicons within ∼1 Mbp of DNA [19],[20]. DNA foci are thought to represent fundamental unit of chromosome structure [19]–[23] that are defined by local chromatin environments [23]–[25] and replicated during defined intervals of S phase [26],[27]. Perhaps importantly, foci that are replicated during consecutive intervals of S phase appear to lie side-by-side in nuclei [28],[29], suggesting that their organization contributes to replication timing.

During S phase, the organization of replicons within replicon clusters defines how long individual DNA foci are engaged in synthesis. In HeLa cells, during early S phase, the average speed of fork elongation is ∼1.5 kbp/min/fork [19],[30]. As the average distance between adjacent origins in replicon clusters is ∼150 kb (90% of adjacent origins are typically ∼50–250 kb apart) most will be engaged in synthesis for 1–2 h before the internal forks from adjacent replicons meet and terminate by fork fusion. When this occurs, the rate of synthesis can only be maintained if new origins are activated. Hence the progressive activation and completion of synthesis within the ∼1 Mbp DNA foci defines a replication timing program within which different cohorts of foci are replicated within time zones that occupy ∼1–2 hours of S phase.

Mechanisms of origin selection that define S phase timing are known to show remarkable plasticity during cell differentiation [10],[12],[15]. However, within a particular cell type, the extent to which DNA replication is deterministic –⁠ and hardwired by chromosome structure –⁠ or stochastic –⁠ and so varies for cell to cell –⁠ remains a matter of debate. To address this question, we designed an experimental approach that would allow us to analyze the spread of DNA synthesis throughout nuclei of individual cells (Figure 1). Sites of DNA synthesis within DNA foci were labeled with thymidine analogues using pulse and pulse-chase-pulse strategies and analyzed over many days. Initially, labeled foci are distributed throughout all chromosomes but as cells proliferate random mitotic segregation reduces the number of labeled chromosomes within individual cells so that chromosome territories (CT) and their DNA foci are resolved. Immediately after labeling it is impossible to establish the extent to which adjacent foci are related by their spatial and/or genetic continuity, as the alternative models are indistinguishable. However, following chromosome segregation, the plasticity of CT structure [24] allows the spatially and genetically determined models to be distinguished (Figure 1B). Hence, over many cell division cycles, the analysis of individual CTs provides a high-resolution memory of cis -⁠ and trans-activation events that define the replication timing program.

**Fig. 1. Double-labeled replication foci are segregated in specific regions of mitotic chromosomes.**

We used 3D and 4D light microscopy to analyze the organization of DNA foci within individual CTs of nuclei and mitotic chromosomes. We show that the sequential replication of DNA foci is defined by their genetic association along individual chromosomes. To visualize the genetic association directly, we analyzed individual DNA fibers from cells that were labeled during sequential 1 h intervals of S phase. We conclude that the sequential activation of adjacent replicon clusters represents a major mechanism of S phase progression. Indeed, once early synthesis has begun, only a minority –⁠ about 10% -⁠ of de novo initiation events are genetically uncoupled from sites that were engaged in synthesis earlier during S phase. Finally, in order to integrate this conclusion with the analysis of replication using genome-wide strategies, we used bioinformatic tools to show that the structure of replicon clusters within DNA foci and lengths of replication timing domains correlate with extremely high significance. This is consistent with DNA foci being the stable higher-order units of chromatin packaging that define the replication timing program in mammalian cells.

Results

S phase progression is defined by the spatial organization of DNA foci

In HeLa cells in early S phase, the template for DNA synthesis is folded into DNA foci that can be labeled with a variety of modified thymidine analogues and visualized in both living and fixed cells (Figure S1). Different pulse and pulse-chase-pulse strategies can then be used to evaluate the relationship of foci that are engaged in DNA synthesis during different intervals of S phase (Figure S2). In mid/late S phase, the spatial relationship of foci that were labeled during consecutive intervals of S phase is evident because distinct patterns of active sites are seen at this time (Figure S1A and S1C). In early S phase (Figure S1B), in contrast, spatial analysis at the time of labeling is much less informative because of the high density of active sites.

To evaluate the alternative models of S phase progression described in Figure 1, cells were labeled with two consecutive pulse-labels and grown for many days to leave ∼3 labeled CTs/cell (Figure 1C, Figure 2, Figure 3). As a control, we first monitored the co-association of labeled foci in metaphase, as this defines their distribution within individual chromosomes (Figure 1). Metaphase images, from cells that were labeled during early S phase, showed that all labeled chromosomes within double-labeled cells contained early S phase foci that incorporated both the 1^st and 2^nd precursors. However, as chromosome condensation during metaphase limits the resolution of the spatial analysis, we next monitored the level of co-association within interphase CTs [23]. Analysis of CTs showed that foci labeled with the 1^st replication precursor were within 500 nm of a focus labeled with the 2^nd. In addition, time-lapse imaging of foci in living cells showed this co-association to be maintained when cells were monitored for up to 3 hours. Throughout the imaging time course (Figure 2 and Video S1, S2, S3), individual CTs showed dramatic plasticity [31], with shape transformations during cell movement resulting in early S phase foci displaying frequent relative positional shifts of 0.2–0.6 µm over 30 min. Notably, during these shifts, the association of adjacent foci labeled during the 1^st and 2^nd pulses was always maintained (25 CTs were analyzed by live imaging and labeled foci showed the same behavior in all cases).

**Fig. 2. The spatial architecture of DNA foci is maintained in living cells.**

**Fig. 3. The S phase program is defined by the temporal activation of DNA foci at adjacent positions within CTs.**

The sequential activation of DNA foci is defined by their spatial continuity within individual chromosomes

To reinforce the interpretation of time-lapse imaging, we evaluated the spatial relationship of interphase foci labeled during consecutive intervals of early S phase using 3D confocal microscopy (Figure 3). To test the models described in Figure 1, we measured the spatial separation of nearest foci containing the 1^st and 2^nd precursors using both consecutive pulses and pulses separated by intervening unlabeled periods of 1 or 2 hours. Experiments were performed using both fixed (Figure 3B and 3C) and living (Figure 3D and 3E) cells. Living cells were analyzed directly and fixed cells were processed for 3D confocal imaging by indirect immuno-labeling.

Following image capture, image analysis software was used to define the center of mass of labeled sites (Figure 3B6) and then measure the 3D separation of the nearest sites labeled during the 1^st and 2^nd pulses (Figure 3G and 3H). Under the experimental conditions used, DNA foci in HeLa cells have an average diameter of ∼350 nm (Figure 3F). Moreover, as living and fixed cells show the same size distribution, our experimental strategies do not appear to disrupt local chromatin architecture during processing and imaging (Figure 3F; t test association probability p<0.07 n = 60). While analysis of both fixed and living cells demonstrates the stability of foci with sizes of 300–500 nm, we note that recent advances in light microscopy (3D-SIM and SMI microscopy) reveal that individual foci can be resolved into sub-domains with an average size of ∼125 nm [Cristina Cardoso and Vadim Chagin, Technische Universität Darmstadt, personal communication].

When unsynchronized cells were labeled with consecutive pulses, most foci were labeled with both precursors (Figure 3B and 3D); as synthesis within individual foci is not synchronized, a minority of foci might be labeled with only one precursor because they began or completed synthesis during the 1^st or 2^nd labeling periods. However, when the pulses were separated by 1 hour (Figure 3C and 3E) ∼50% of foci were labeled with only one analogue (43.5% of foci in living cells (n = 200) and 52% in fixed cells (n = 146)). Nearest neighbor analysis was used to explore this spatial relationship quantitatively (Figure 3G and 3H). With consecutive pulses, the average center-center separation of the nearest red and green labeled sites was ∼150 nm (Figure 3B and 3D) –⁠ as most foci are double labeled this center-to-center separation is less than the average diameter of individual foci. With an intervening chase of 1 h, the separation between adjacent foci labeled during the 1^st and 2^nd pulses increased to ∼350 nm (Figure 3G and 3H). As this center-to-center separation is similar to the average diameter of foci in early S phase (Figure 3F) foci labeled during the consecutive intervals of S phase must lie close to or touching their nearest neighbor.

Two important controls emphasize the significance of this nearest neighbor analysis. First, we analyzed individual foci that were labeled simultaneously with 2 replication precursor analogues (Figure S3). This defines the reliability of distance measurements and the effect of experimental noise on the precision of data generated by the analysis. To demonstrate a worst-case-scenario, red and green foci with >2-fold average intensities were seen to give an average separation of no more than 75 nm (Figure S3). Second, we also measured the separation of foci labeled during either 1^st or 2^nd pulse to define the distribution of foci that were labeled with each precursor. Under all labeling conditions used, the average separation of nearest early S phase foci was ∼500 nm (Figure S4), which is highly significantly different to the separation of neighboring foci labeled by consecutive pulses with an intervening chase (t test = 2.955E-12 comparing separation of BrdU foci in Figure S4D with separation of biotin and BrdU foci in Figure 3H).

In a parallel study, we also performed a nearest neighbor analysis using normal diploid human fibroblasts (MRC5; Figure S5). While these diploid cells appear to have slightly larger early foci (513+/−116 nm; n = 200) than HeLa cells, perhaps as a consequence of their flattened shape, foci labeled with a separation of 1 h nevertheless maintained a strict side-by-side relationship (separation was 556+/−114nm; n = 155). These experiments show that DNA foci labeled during consecutive intervals of S phase retain a nearest neighbor relationship independently of changes in CT structure, consistent with the spatial relationship at the time of labeling being defined by the genetic connectivity of DNA foci along chromosomes. The significance of this strict side-by-side relationship was reiterated using in silico simulations to model the activation of DNA foci (Figure S6).

The replication timing program correlates with the spatial context of DNA foci

We next attempted to reinforce the links between S phase progression and the genetic continuity of DNA foci by monitoring the distribution of foci labeled during widely separated intervals of S phase. First, we analyzed cells in early S phase after labeling replication foci with 3 sequential replication precursors each separated by 1 hour (Figure S7). As expected, the separation of both consecutive labels –⁠ the separation between the 1^st–2^nd and 2^nd–3^rd precursors -⁠ was ∼350 nm (Figure S7). However, a significantly larger separation of ∼500 nm was seen when the separations of sites labeled with the 1^st and 3^rd precursors was measured (Figure S7C). This shows that even though the folding of DNA foci within individual CTs is complex and dynamic (Figure 2) the foci labeled at different times of early S phase show a progressive separation over time.

This progressive synthesis of early S phase replication foci is consistent with synthesis spreading along chromosomes at a rate of ∼200 nm/h. Over longer periods -⁠ with separations of >4 hours -⁠ the linear continuity of labeled sites is difficult to define because nearest neighbor relationship are degraded by chromosome folding (Figure S8) and the distribution of euchromatin and heterochromatin in CTs [17],[24],[25]. Based on this observation, we would not rule out the possibility that early and mid/late S phase have distinct characteristics. Towards the end of early S phase, as the replication of euchromatin completes, many forks appear to pass from the early to mid/late replication domains [10]–[12]. At this time of S phase, a significant fraction (∼5%) of chromatin is replicated by forks that extend for at least 500 kbp. Such temporal transition regions in the replication program [10] apparently engage synthesis for many hours without encountering and activating potential origins in heterochromatin.

Visualizing replication domains on single DNA molecules defines the genetic contribution to S phase progression

Nearest neighbor analysis is consistent with a genetically defined next-in-line model, which operates in cis within individual CTs (Figure 1). We next wanted to evaluate the extent to which this cis activation defines S phase progression. In nuclei, however, analysis is compromised by the dynamic properties of DNA foci within individual CTs. To avoid this limitation, we analyzed the genetic relationships of replication pulses along individual DNA fibers (Figure 4). DNA fibers were prepared by direct spreading of cells labeled with biotin-dUTP and BrdU with an intervening 1 h chase. Spreads were prepared directly from cells without prior DNA isolation in order to image isolated ∼1–2 Mbp DNA fibers. As careful spreading, with only ∼5 labeled cells per spread, prevents mixing of fibers from individual labeled cells [32], this approach allows us to capture biotin-labeled fibers from cells that were engaged in DNA synthesis during the 1^st labeling period. Regions of spreads with dispersed biotin-labeled fibers were located and randomly selected fields recorded; low magnification was used so that each imaging field contained fibers with at least 0.8 Mbp of DNA. In 144 fields, from 4 equivalent experiments, the fibers analyzed contained 450 Mbp of DNA in total.

**Fig. 4. S phase progression correlates with the sequential activation of replicon clusters as defined by their genetic continuity along individual chromosome.**

Double-labeled fibers were analyzed, as any forks growing throughout the labeling period will incorporate both 1^st and 2^nd precursors, which will be separated by a predictable distance that reflects the rate of fork elongation (Figure 4A and Figure S9). As seen before [19], the active replicons are often clustered into small groups that typically contain ∼0.5–1 Mbp of DNA. This clustering is exemplified by the DNA fibers shown in Figure 4B. The first example (Panels 1 and 2) shows two adjacent imaging fields that contain a single fiber of >1.5 Mbp. This fiber has 3 replicons in the center and 2 on the right that were active during the 1^st pulse (biotin in red). These replicons are linked genetically as replication in the DNA between them is completed during the 2^nd pulse (BrdU in green). On the left of the same fiber, three patches are labeled during the 2^nd pulse, showing that replicons in the adjacent DNA are activated during the 2^nd labeling period. The short cluster shown in panel 3 contains 4 active replicons with an average size of 90 kbp. In this particular example, secondary origins are activated in replicons on both sides of the central cluster during the 2^nd labeling period.

Using fibers like those shown (Figure 4B), two distinct classes of double-labeled fiber were scored, based on labeling within the proximal flanking DNA (Figure 4C and 4D). Replicon clusters with ‘extending’ forks were scored when replicons labeled during the 1^st pulse were flanked by single DNA tracks labeled during the 2^nd pulse, consistent with continued elongation of the out-growing forks from the flanking replicons of the primary cluster. Replicon clusters with ‘secondary activation’ were scored when DNA flanking the primary cluster also contained multiple tracks labeled during the 2^nd pulse, which is only possible if additional forks are activated within the flanking DNA. The structure of replicons within the primary (biotin-labeled) clusters defines the frequency of these two populations (Figure S9). Notably, clusters with extending forks had widely dispersed origins (∼200 kbp apart on average) whereas clusters with secondary initiations within the flanking DNA had shorter inter-origin distances (∼125 kbp apart on average). This difference presumably reflects the temporal relationship between the completion and activation of synthesis in adjacent replicon clusters.

Preparation and staining of DNA fibres that contain >1 Mbp of DNA is technically challenging. However, the use of quality controls to monitor spreading and measurement of the labeled tracks (Figure S9) ensure reliability of the data generated. In all of the scored fibres, the separation of the biotin -⁠ and BrdU-labeled tracks was consistent with fork elongation rates within the normally accepted range for early S phase of 1–2 kbp/min (Figure S9). In these fibers, the continuity of the labeled tracks demonstrates that the underlying DNA strand must be intact throughout the labeled region.

To complete this analysis, we recorded single-labeled regions in order to define de novo initiation events that were remote from previously active replicons and thus ‘uncoupled’ (Figure 4C and 4D) from synthesis during the 1^st labeling period. In the random fields used in this analysis, only 5% of labeled tracks were seen to be uniquely BrdU-labeled (Figure 4C). These observations suggest that genetically adjacent DNA foci are replicated during consecutive intervals of S phase. This genetic spread of synthesis appears to be a major mechanisms, as while the stochastic activation of potential origins is not precluded, remote initiation events, which are uncoupled from previously active replication foci, account for no more than 10% of initiation events once S phase has begun.

Individual DNA foci correlate with genome-wide replication timing domains

During our analysis of replication foci within individual cells we deliberately used a holistic approach in order to avoid any bias that might arise if specific genomic regions were targeted for analysis. To validate our conclusions, we next attempted to integrate the single cell data (Figure 1, Figure 2, Figure 3, Figure 4) with genome-wide data sets [10]–[15], which define the average pattern of synthesis across cell populations. To compare the structure of genome-wide timing domains with replication foci, we first defined the distribution profile of replication timing domains on selected regions of a specific human chromosome (Figure 5A) using genome-wide data sets taken from Desprat et al. [10]. Randomly selected regions of human chromosome 6 with ∼10 Mbp of DNA (1 region is shown in Figure 5B) were sampled and points of inflection in the data readout used to define peaks in the timing profile. Individual peaks represent domains of discrete replication timing and peak heights (Figure 5B) define the average time of replication of the domain across the cell population analyzed –⁠ the highest peaks are replicated predominantly at the onset of S phase. When replication domains from different regions of chromosome 6 were combined the resulting distribution profile (Figure 5A) showed the average domain to contain 529.5+/−208.0 kbp of DNA.

**Fig. 5. Replication timing domains correlate with DNA foci.**

For comparison with the timing data, we generated a series of distribution profiles that simulate the DNA content of populations of DNA foci in human cells. Profiles were generated using published data [19] that describes the distribution of replicon sizes and the number of replicons/cluster in human HeLa cells. In the two distribution profiles shown (Figure 5C) the first describes a typical profile for a population of 112 DNA foci –⁠ for direct comparison with the data set in Figure 5A –⁠ and the second shows the profile for a much larger sample. With average DNA contents of 527.9+/−312.2 kbp and 549.0+/−306.2 kbp of DNA, respectively, these simulations show that the DNA contents of replication timing domains and DNA foci have a high degree of similarity, with correlation coefficients in excess of 0.9 (Figure 5).

Figure 5 also shows the timing relationship between adjacent replication domains using genome-wide analysis of cell populations. The early replicating band p12.3 shows an example of how replication proceeds across a chromosomal domain, which in this typical example contains ∼5 Mbp of DNA. At the left side of this region, 6 timing domains (seen as peaks on the timing profile) are clearly structured so that the central region (Figure 5B, region a) is replicated at the onset of S phase and the adjacent flanking regions (Figure 5B, regions b–d) are replicated sequentially as S phase proceeds. While the structure of peaks and valleys in the timing profile shows that individual cells in the population activate replication of the respective domains at slightly different times, the general trend is clearly consistent with the sequential activation of genetically adjacent timing domains across this region of chromosome 6 in human ES cells.

This comparison highlights a number of fundamental features of chromatin organization that define the efficacy of DNA replication. Most importantly, it is clear that the amount of DNA within both DNA foci and replication timing domains is dramatically different from the average size of individual replicons, which typically contain 100–150 kbp of DNA in human cells [19],[20]. This implies that the replication timing domains must contain groups of replicons that are replicated together. In addition, if individual timing domains were single replicons it would only be possible to duplicate 1.5×10⁹ bp or ∼25% of their DNA in an S phase of 10 hours, given that synthesis during S phase of a diploid mammalian cells involves ∼750 replication sites at any time [16]–[23]. Hence, the co-replication of replicons clusters within replication timing domains is necessary to complete synthesis on schedule.

While the evidence for replication timing domains that contain multiple replicons is overwhelming, it is notable that individual replicons are not evident at the resolution provided by genome-wide analysis (Figure 5B). This is likely to reflect the redundancy of potential origins, which in human cells are present in ∼10-fold excess relative to actual sites where DNA synthesis initiates [1]–[3],[33]. Features of the local chromatin environment are thought to contribute to origin selection and define the relative efficiency with which different potential origins are used. Even so, origin activation clearly has a strong stochastic component so that different sites are used in different cells (Figure S10). As a result, the timing domains seen in population studies must generate a composite activation profile, which reflects how potential origins are used. The use of different potential origins in different cells will effectively smooth synthesis across chromatin domains so that the distribution of individual replicons is not seen. This means that replicon structure is defined by initiation events within individual cells and that the functional domains that are defined by DNA foci, and not the individual replicons themselves, are the regulatory targets for DNA synthesis.

The organization of DNA within chromosome territories defines the location of replication factories within the inter-chromatin compartment

The efficacy of a timing program that propagates using the genetic continuity of DNA foci will require that initiation sites that are used at the onset of S phase have an appropriate distribution throughout the genome. Notably, replication foci visualized in metaphase are uniformly spread along chromosomes (Figure 1). While it is not known how this is achieved, genome-wide approaches show that replication will often begin in regions of the genome that are rich in features linked to gene expression [10]–[15]. Interestingly, this conclusion was drawn from single cell studies 15 years ago [34], based on the co-localization of replication factories and active transcription sites at the onset of S phase.

Potential origins are thought to be equivalent when they are established well before the onset of S phase [1]–[3]. Hence, origin selection at the beginning of S phase must reflect the local chromatin environment within nuclear domains where replication factories are assembled. In this regard, it is notable that early replication factories are associated with nuclear domains that contain open chromatin whereas replication during mid/late S phase spreads to the chromatin-dense nuclear domains (Figure 6). This is confirmed by the structure of sites that contain nascent DNA (Figure 6A), which are located within the chromatin compartment at the interface between the chromatin and inter-chromatin nuclear domains [22],[23]. During synthesis, the organization of active sites means that DNA foci, which contains the unreplicated template, and the associated factories and nascent product occupy discrete nuclear compartments (Figure 6C). This spatial separation means that during replication of a DNA focus that was labeled with BrdU in an earlier cell cycle the nascent product shows very little immediate co-localization with Br-DNA within the template containing focus. Subsequently, as the nascent chromatin matures, a period of 1–2 h is required before almost complete co-localization is seen (Figure 6D). This arrangement shows how the spatial architecture of the template-containing DNA foci and synthetic factories (Figure 6C) contribute to the dynamic behavior of chromatin during S phase.

**Fig. 6. The proximity of DNA foci and the inter-chromatin domain defines the location of sites that are permissive for replication factory assembly.**

Discussion

Eukaryotic cells have such complex genomes that DNA synthesis must be highly regulated in order to ensure the preservation of genome integrity and epigenetic modifications that define cell type. Surprisingly little is known, however, about the molecular principles by which this is achieved in higher eukaryotes. One key feature of the process, which has been appreciated for many years, is that replication of euchromatin and heterochromatin is structured temporally to occur preferentially during early and mid/late S phase, respectively [8]. This temporal restriction correlates with the differential activity of specific cyclin-CDK complexes [35] and the replication of different classes of chromatin, as defined by post-translational histone modification [36],[37], during early and mid/late S phase.

While the spatial architecture of DNA foci appears to contribute to the structure of the mammalian S phase, the molecular mechanisms involved are not known. To address this question, we designed a single cell strategy to identify molecular links between chromosome organization and the timing of DNA synthesis (Figure 1). Analysis at the level of single cells is based on the structure of DNA foci, which are both functional units of DNA replication and structural units of chromosome organization [17],[22],[23]. The architecture of structural foci within chromosomal sub-domains has been analyzed in numerous recent studies. High-resolution analysis of the distribution of chromatin in domains of 2–10 Mbp has clearly demonstrated that foci typically contain 0.5–1 Mbp of DNA [24],[25],[38]. The most comprehensive study has shown that foci with ∼1 Mbp of DNA are a common feature of genome organization [25] and that foci within transcriptionally active and inactive chromatin domains have distinct properties and nuclear distributions [25]. The spatial architecture of the 1 Mbp DNA domains has been analyzed in detail over length scales ranging from 0.5 to 75 Mbp [39]. Notably, the domains in nuclei are separated in relation to their genetic co-ordinates in the range 0 to 3–5 Mbp but little further separation is seen when sequences are further apart, because of the 3D folding of chromosomes within CTs [39].

S phase timing is defined by the connectivity of DNA foci

Here, we wanted to assess how higher-order chromatin organization contributes to the S phase timing program in mammalian cells. To do this, we evaluated the relative importance of direct (genetic) and indirect (spatial) chromatin interactions during S phase progression (Figure 1). DNA foci were labeled at different times of S phase and their spatial organization analyzed within individual CTs. Using a nearest neighbor analysis of DNA foci (Figure 1, Figure 2, Figure 3, and Video S1, S2, S3), together with an analysis of labeling continuity on stretched DNA fibers (Figure 4), we show that DNA foci that were labeled during consecutive intervals of S phase maintain a strict spatial co-association over many cell cycles. This demonstrates that foci labeled during consecutive intervals of S phase are genetic neighbors along chromosomes and provides strong evidence that this relationship underlies a ‘next-in-line’ mechanism of S phase progression [28],[29]. Importantly, our experimental design is not directed to specific chromosomal loci or specific times of the cell cycle but instead uses an unbiased and holistic analysis of DNA foci, which are replicated during early S phase; as the labeled foci are not constrained by synthesis at the time of analysis their distribution must reflect a preferred organizational steady state within CTs.

As S phase proceeds, the majority of foci engage synthesis for 1–2 h (Figure S2) before the termination of synthesis by fusion of internal forks is coupled to activation of origins within adjacent DNA foci. The invasion of outgrowing forks into the genetically adjacent foci is one mechanism that in principle could cause structural alterations that allow or stimulate de novo origin activation. However, our analysis shows that this is not an inevitable outcome, as some forks grow without encountering conditions where de novo origin activation will occur; such regions might have a low density of potential origins [10]–[12]. Forks with these characteristics have been described using both DNA fibers [reviewed in 22] and in recent genome-wide studies [10]–[12], where extended forks of >250 kbp (representing ∼5% of the genome) correlate with the ‘temporal transition regions’ that link replication during early and mid/late S phase. This transition from early to mid/late S phase correlates with a timing transition that can be revealed as a ‘3C-pause’ in DNA synthesis under some conditions of replicative stress [40].

Genome-wide approaches to map replication timing

Single cell studies and genome-wide analysis of replication in cell populations provide complimentary strategies to explore DNA synthesis. Hence, it is important to understand the strengths and limitations of these strategies and evaluate how key information can be combined to develop a general model of S phase progression. A specific advantage of the genome-wide approach is that replication timing is anchored directly to DNA sequence and annotated features such as chromatin architecture and transcriptional activity. In doing this, genome-wide strategies also provide a composite view of DNA synthesis, which can be interpreted to define the average behavior of cells in the population. Such population approaches have shown that large regions of mammalian genomes are replicated during predictable intervals of S phase and that this generally correlates with features of the chromatin environment, so that highly expressed regions of the genome are replicated early during S phase [10]–[15]. The fact that syntenic regions of the human [9] and mouse [11] genomes replicate at equivalent times implies that this general principle is conserved.

During DNA synthesis, cells must also preserve the epigenetic information in chromatin that defines cell type specific patterns of gene expression. In exploring this aspect of mammalian S phase, genome-wide studies have shown that large genomic regions alter their replication timing when cells are induced to differentiate [10],[12],[15] and that distinct changes in replication timing arise as cells become epigenetically committed to differentiation [41]. Such changes raise obvious questions about mechanisms that link chromatin domains that are selected for synthesis during different periods of S phase and how these might relate to the next-in-line model of S phase progression [28],[29]. As described above, such changes are presumably linked to changes in the local chromatin environment, which modulates the efficiency with which potential origins are established and used.

While the ability to relate replication timing to DNA sequence and chromatin features, such as histone modifications, is compelling [10]–[15], one limitation of studies based on cell populations is that any cell-to-cell variability is lost. This is inevitable as population-based approaches will smooth any biological complexity that we might expect to see as experimental noise. In contrast, analysis of DNA synthesis within individual nuclei and on isolated DNA fibers [5],[20], is able to reveal detail related to the specific events that occur within individual cells. However, despite obvious experimental differences, our attempt to integrate data from genome-wide and single cell studies has shown that replicon clusters within domains that contain ∼500 kbp of DNA provide the functional targets during replication of mammalian genomes (Figure 5). Moreover, evidence discussed above shows how data derived from single cells and cell populations support a general model for S phase progression that is in part based on the stochastic activation of potential replication origins and in part on the sequential activation of replication domains, based on their genetic continuity along chromosomes.

A model of S phase progression

The preferential accessibility of potential origins within open chromatin and the differential sensitivity of early and late origins to different cyclin/CDK complexes are major regulators of origin selection. These properties then dictate the efficiency with which different loci –⁠ such as potential replication origins (pre-RCs; Figure 7B) –⁠ interact with the inter-chromatin compartment where active replication factories are formed (RF; Figure 7B2). Origin selection is never-the-less stochastic, as most potential origins are replicated passively throughout S phase [6]. However, once S phase has begun, our data suggest that a next-in-line principle [28],[29] defines the efficiency with which origins can be activated in the downstream replication program, so that only a minority (at most 10%) of de novo initiation events are uncoupled from synthesis within previously active replicon clusters (Figure 4). As replication within engaged replicon clusters approaches completion, the external forks might drive structural perturbations in neighboring foci that alter the exposure of potential origins to the replication machinery and so increases the probability of their activation (Figure 7B3). In this way, the genetic continuity of DNA foci along the chromosomal fiber provides a fundamental determinant of S phase progression in mammalian cells.

**Fig. 7. A model linking the organization of replicon clusters to S phase progression.**

In the absence of genetically defined initiation sites, it is interesting to speculate how the mammalian cells have evolved to ensure that their genetic information is preserved during cell proliferation. Given the demand for precision, it is perhaps surprising that a key regulatory principle involves the random activation of potential initiation sites that are significantly more numerous than necessary to perform synthesis on schedule [33]. This stochastic feature of initiation and the redundancy of potential origins ensures that the system has sufficient tolerance to complete synthesis on schedule if the synthetic environment happens to change; any condition that result in slowing or stalling of the engaged forks are counteracted by local increases in origin density [reviewed in 42]. This regulatory mechanism operates at the level of DNA foci, and recent studies have suggested that a replication-dependent memory mechanism, based on the structure of DNA loops, ensures that appropriate levels of synthesis are maintained from one cell cycle to the next [43].

During S phase, the co-ordinated activation of groups of replicons within DNA foci will reduce the number of active synthetic sites that are required to complete synthesis. In addition, as replicon clusters engage synthesis together, within dedicated replication factories [18], this organization minimizes the time that adjacent replicons are replicating before their growing forks meet and fuse to terminate synthesis. Growing forks are complex structures that are inevitably more prone to damage and recombination than DNA packaged into normal chromatin, hence limiting the number of exposed forks will minimize the risk of damaging the genome. In addition, the sequential activation of replicon clusters based on their genetic continuity along chromosomes will also limit the number of isolated forks. Hence, we propose that the orderly synthesis of replicon clusters within DNA foci has evolved as a mechanism to ensure that higher eukaryotes can duplicate their genomes with the required efficiency while ensuring the preservation of both genetic and epigenetic information.

Materials and Methods

Labeling replication foci in situ

HeLa cells were grown in DMEM (Sigma) with 5% FBS and antibiotics. MRC5 cells were grown in MEM with 10% FBS and antibiotics. Replication foci were pulse-labeled in culture medium containing 10 µM bromo-deoxyuridine (BrdU) or labeled with modified replication precursor analogues: Cy3-; AlexaFluor488 -⁠ (AF488-); biotin-; or digoxigenin-dUTP as described by Maya-Mendoza et al. [44]. Active replication factories were defined by transient expression of GFP-PCNA [29] or indirect immuno-fluorescence with a PCNA specific antibody (Immuno Concepts; Auto I.D. serum No 6006; 1/1000; 15 h; 4°C). Chromatin was visualized by transient expression of DsRed-histone-H2B. Unsynchronized cells were used throughout this study; this was a deliberate choice to avoid synchrony-dependent artefacts and preserve the natural structure of the S phase program.

DNA fiber experiments

DNA fiber spreads were prepared as previously described [19],[44] using very low densities of labeled cells –⁠ of 10³ cell/spread only 5–10 were labeled in these experiments. This low density minimizes DNA bundles and tangles within labeled fibers and allows visualization of Mbp fibers. In addition, the low density of labeled cells allows analysis of fibers from individual labeled cells. BrdU labeled tracks were detected with BrdU anti-sheep antibody (Biodesign; M20105S; 1∶1000 dilution; 1 h at 20°C) and biotin-11-dUTP tracks using a mouse monoclonal antibody (Clone BN-34, Sigma; 1∶1000 dilution; 1 h; 20°C). Primary antibodies were detected using Cy3 -⁠ or AF488-conjugated donkey anti-sheep and anti-mouse secondary antibodies. The slides with DNA fibers were mounted with 50∶50 PBS-glycerol.

Fibers were examined using a Zeiss LSM 510META confocal microscope using a 40× lens, labeled tracks measured using the LSM software and converted to kbp using a conversion factor of 1 µm = 2.59 kbp [19]; under these imaging conditions a single imaging field contains ∼0.8 Mbp DNA. Double-labeled fibers were imaged only in dispersed, untangled areas of the DNA spread, to ensure the continuity of adjacent replicon clusters on individual DNA fibers. Routine quality control for spreading was performed using direct DNA labeling with YOYO-1 (Figure S9F) or cells labeled for >24 h with 10 µM BrdU, to give fully Br-labeled fibers (Figure S9G).

Immuno-fluorescence and direct labeling of DNA foci

DNA foci labeled with BrdU, biotin-dUTP or digoxigenin-dUTP were visualized by indirect immuno-fluorescence as described [19],[44]. Cells were grown on coverslips, pulse labeled (directly or by transfection) and fixed in 4% paraformaldehyde. Fixed cells were acid treated (for BrdU labeling) and washed 3× in PBS, treated with 0.5 Triton ×100 in PBS, rinsed 3× in PBS, 3× PBS+ (PBS plus 1% BSA and 0.1% Tween 20), blocked for 1 h and incubated for 1 h with the appropriate antibody. Secondary antibodies were conjugated with Cy3, AF488, AF647 and Qdot reagents (Invitrogen). For 2^nd or 3^rd pulse detection, cells incubated after first detection including secondary antibody, were washed 3× in PBS and 3× in PBS+ and incubated with the appropriate first and second antibodies. In some experiments we used BrdU anti-rat (Immunologicals Direct Clone BU 1/75; 1∶1000 dilution; 1 h; 20°C) and a secondary anti-rat antibody conjugated with Qdot-605. Streptavidin-Qdot-525 was used to identify sites containing biotin-dUTP. Finally, slides were washed 3× in PBS+, 3× in PBS, incubated with 5 µg/ml Hoechst 33258 (Sigma) for 10 min, rinsed 3× in PBS and mounted with either Vectashield or Prolong mounting media. Mitotic chromosomes were prepared as described [44].

For confocal imaging, samples were examined using a Zeiss LSM 510META confocal microscope and 100× (1.45 NA) lens. 3D images were generated using Z stacks and processed in Imaris software. In order to ensure optimal imaging performance, instrument alignment was performed at regular intervals by Zeiss. Chromatic shift was corrected using multi-coloured TetraSpeck florescent beads; the maximum tolerated shifts were 50 nm in X–Y and 100 nm in Z (Figure S3B). To minimize chromatic shift, for all experimental conditions extreme care was taken to balance labeling intensities in different imaging channels. In addition, for each indirect labeling experiment multiple samples were prepared so that each replication pulse could be labeled with the different secondary reagents used. 4D time-lapse imaging was performed using a Deltavision microscope with a CoolSNAP-HQ2 camera and Olympus objective (100x; 1.4 NA). The intensity of light during imaging was kept to 32% using an acquisition speed of 100–200 ms. Chromosome spreads were captured using a Deltavision microscope and images deconvolved using 5–10 iterations and pre-filter cut-off values (microns) of 0.05.

The 3D and 4D images were analyzed using Imaris software. For LSM images of individual CTs a 0.02 µm Gaussian filter was applied. For nearest neighbor analysis, 3D projections were generated in Imaris software from confocal Z series and software used to identify 3D labeled sites and the mass centers of individual sites (foci). Individual channels were processed separately. The co-ordinates of the mass centers were then used to define the spatial relationship between adjacent foci, either within or between channels. For presentation, the imaging software represents the mass centers of DNA foci as computer generated spheres that correspond in size to average foci. Images generated in doing this are clearly artificial and while providing an accurate representation of the positions of foci are not intended to provide a realistic representation of the foci themselves.

Bioinformatic analysis of replication timing domains

Replication timing data from human ES cells [10] was taken from the Integrative Genomics Viewer website at: http://www.broadinstitute.org/igv. For analysis, we choose to use human chromosome 6, as we have used this chromosome recently to model S phase [45]. To map the replication timing domains, ∼10 Mbp regions were selected at random and points of inflection defined to identify peaks in the timing profile. Distances between adjacent peaks were then taken from the browser to develop a profile of distributions.

Profiles of distributions for replication foci were generated using parameters for the distribution of replicons per cluster and the length of replicons [19]. For simulation, the primary data for replicon length was approximated to a normal distribution (μ = 140.6238kbp, σ = 58.8192), which was then sampled to determine the length of each individual replicon and assimilated into replicon clusters using the published frequencies of replicons/cluster. Simulations were implemented in Matlab.

Supporting Information

Zdroje

1. BlowJJ

DuttaA

2005 Preventing re-replication of chromosomal DNA. Nat Rev Mol Cell Biol 6 476 486

2. DePamphilisML

BlowJJ

GhoshS

SahaT

NoguchiK

2006 Regulating the licensing of DNA replication origins in metazoa. Curr Opin Cell Biol 18 231 239

3. SclafaniRA

HolzenTM

2007 Cell cycle regulation of DNA replication. Annu Rev Genet 41 237 280

4. MachidaYJ

HamlinJL

DuttaA

2005 Right place, right time, and only once: replication initiation in metazoans. Cell 123 13 24

5. NorioP

KosiyatrakulS

YangQ

GuanZ

BrownNM

2005 Progressive activation of DNA replication initiation in large domains of the immunoglobulin heavy chain locus during B cell development. Mol Cell 20 575 87

6. MesnerLD

CrawfordEL

HamlinJL

2006 Isolating apparently pure libraries of replication origins from complex genomes. Mol Cell 21 719 726

7. GrothA

RochaW

VerreaultA

AlmouzniG

2007 Chromatin challenges during DNA replication and repair. Cell 128 721 733

8. DrouinR

LemieuxN

RicherCL

1990 Analysis of DNA replication during S-phase by means of dynamic chromosome banding at high resolution. Chromosoma 99 273 280

9. WoodfineK

FieglerH

BeareDM

CollinsJE

McCannOT

2004 Replication timing of the human genome. Hum Mol Genet 13 191 202

10. DespratR

Thierry-MiegD

LaillerN

LajugieJ

SchildkrautC

2009 Predictable dynamic program of timing of DNA replication in human cells. Genome Res 19 2288 2299

11. Farkash-AmarS

LipsonD

PoltenA

GorenA

HelmstetterC

2008 Global organization of replication time zones of the mouse genome. Genome Res 18 1562 1570

12. HirataniI

RybaT

ItohM

YokochiT

SchwaigerM

2008 Global reorganization of replication domains during embryonic stem cell differentiation. PLoS Biol 6 e245 doi:10.1371/journal.pbio.0060245

13. CadoretJ-C

MeischF

Hassan-ZadehV

LuytenI

GuilletC

2008 Genome-wide studies highlight indirect links between human replication origins and gene regulation. Proc Natl Acad Sci USA 105 15837 15842

14. Sequeira-MendesJ

Diaz-UriarteR

ApedaileA

HuntleyD

BrockdorffN

2009 Transcription initiation activity sets replication origin efficiency in mammalian cells. PLoS Genet 5 e1000446 doi:10.1371/journal.pgen.1000446

15. HansenRS

ThomasS

SandstromR

CanfieldTK

ThurmanRE

2010 Sequencing newly replicated DNA reveals widespread plasticity in human replication timing. Proc Natl Acad Sci USA 107 139 144

16. JacksonDA

1995 Nuclear organization: uniting replication foci, chromatin domains and chromosome structure. BioEssays 17 587 591

17. ZinkD

2006 The temporal program of DNA replication: new insights into old questions. Chromosoma 115 273 287

18. HozakP

HassanAB

JacksonDA

CookPR

1993 Visualization of replication factories attached to a nucleoskeleton. Cell 73 361 373

19. JacksonDA

PomboA

1998 Replicon clusters are stable units of chromosome structure: evidence that nuclear organization contributes to the efficient activation and propagation of S phase in human cells. J Cell Biol 140 1285 1295

20. LebofskyR

HeiligR

SonnleitnerM

WeissenbachJ

BensimonA

2006 DNA replication origin interference increases the spacing between initiation events in human cells. Mol Biol Cell 17 5337 5345

21. MaH

SamarabanduJ

DevdharRS

AcharyaR

ChengPC

1998 Spatial and temporal dynamics of DNA replication sites in mammalian cells. J Cell Biol 143 1415 1425

22. BerezneyR

DubeyDD

HubermanJA

2000 Heterogeneity of eukaryotic replicons, replicon clusters, and replication foci. Chromosoma 108 471 484

23. CremerT

CremerC

2001 Chromosome territories, nuclear architecture and gene regulation in mammalian cells. Nat Rev Genet 2 292 301

24. ShoplandLS

LynchCR

PetersonKA

ThorntonK

KepperN

2006 Folding and organization of a contiguous chromosome region according to the gene distribution pattern in primary genomic sequence. J Cell Biol 174 27 38

25. GoetzeS

Mateos-LangerakJ

GiermanHJ

de LeeuwW

GiromusO

2007 The three-dimensional structure of human interphase chromosomes in related to the transcriptome map. Mol Cell Biol 27 4475 4487

26. GoenA

CedarH

2003 Replicating by the clock. Nat Rev Mol Cell Biol 4 25 32

27. AladjemMI

2007 Replication in context: dynamic regulation of DNA replication patterns in metazoans. Nat Rev Genet 8 588 600

28. MandersEMM

StapJ

BrakenhoffGJ

van DrielR

AtenJA

1992 Dynamics of three-dimensional replication patterns during the S-phase, analyzed by double labeling of DNA and confocal microscopy. J Cell Sci 103 857 862

29. SporbertA

GahlA

AnkerholdR

LeonhardtH

CardosoMC

2002 DNA polymerase clamp shows little turnover at established replication sites but sequential de novo assembly at adjacent origin clusters. Mol Cell 10 1355 1365

30. TakebayashiS

SugimuraK

SaitoT

SatoC

FukushimaY

2004 Regulation of replication at the R/G chromosomal band boundary and pericentromeric heterochromatin of mammalian cells. Exp Cell Res 304 162 174

31. BornflethH

EdelmannP

ZinkD

CremerT

CremerC

1999 Quantitative motion analysis of subchromosomal foci in living cells using four-dimensional microscopy. Biophys J 77 2871 2886

32. BlowJJ

GillespiePJ

FrancisD

JacksonDA

2001 Replication origins in Xenopus egg extracts are 5–15 kb apart and are activated in clusters that fire at different times. J Cell Biol 152 15 26

33. HyrienO

MarheinekeK

GoldarA

2003 Paradoxes of eukaryotic DNA replication: MCM proteins and the random completion problem. BioEssays 25 116 125

34. HassanAB

ErringtonRJ

WhiteNS

JacksonDA

CookPR

1994 Replication and transcription sites are colocalized in human cells. J Cell Sci 107 425 434

35. KatsunoY

SuzukiA

SugimuraK

OkumuraK

ZineldeenDH

2009 Cyclin A-CDK1 regulates the origin firing program in mammalian cells. Proc Natl Acad Sci USA 106 3184 3189

36. ZhangJM

XuF

HashimshonyT

KeshetN

CedarH

2002 Establishment of transcriptional competence in early and late S phase. Nature 420 198 202

37. Lande-DinerL

ZhangJM

CedarH

2009 Shifts in replication timing actively affect histone acetylation during nucleosome reassembly. Mol Cell 34 767 774

38. AlbiezH

CremerM

TiberiC

VecchioL

SchermellehL

2006 Chromatin domains and the interchromatin compartment form structurally defined and functionally interacting nuclear networks. Chromosome Res 14 707 733

39. Mateos-LangerakJ

BohnM

de LeeuwW

GiromusO

MandersEMM

2009 Spatially confined folding of chromatin in the interphase nucleus. Proc Natl Acad Sci USA 106 3812 3817

40. FetniR

DrouinR

RicherCL

LemieuxN

1996 Complementary replication R -⁠ and G-band patterns induced by cell blocking at the R-band/G-band transition, a possible regulatory checkpoint within the S phase of the cell cycle. Cytogenet Cell Genet 75 172 179

41. HirataniI

RybaT

ItohM

RathjenJ

KulikM

2010 Genome-wide dynamics of replication timing revealed by in vitro models of mouse embryogenesis. Genome Res 20 155 169

42. Maya-MendozaA

TangCW

PomboA

JacksonDA

2009 Mechanisms regulating S phase progression in mammalian cells. Front Biosci 14 4199 4213

43. CourbetS