PNAS PLUS Unreplicated DNA remaining from unperturbed S phases passes through mitosis for resolution in daughter cells Alberto Morenoa,1, Jamie T. Carringtona,1, Luca Albergantea,b, Mohammed Al Mamuna,b, Emma J. Haagensena,2, Eirini-Stavroula Komselic, Vassilis G. Gorgoulisc,d,e, Timothy J. Newmana,b, and J. Julian Blowa,3 a School of Life Sciences, University of Dundee, Dundee DD1 5EH, United Kingdom; bSchool of Science and Engineering, University of Dundee, Dundee DD1 4HN, United Kingdom; cDepartment of Histology and Embryology, School of Medicine, University of Athens, GR-11527 Athens, Greece; dBiomedical Research Foundation of the Academy of Athens, GR-11527 Athens, Greece; and eFaculty Institute of Cancer Sciences, Manchester Academic Health Science Centre, University of Manchester, Manchester M20 4QL, United Kingdom To prevent rereplication of genomic segments, the eukaryotic cell cycle is divided into two nonoverlapping phases. During late mitosis and G1 replication origins are “licensed” by loading MCM2-7 double hexamers and during S phase licensed replication origins activate to initiate bidirectional replication forks. Replication forks can stall irreversibly, and if two converging forks stall with no intervening licensed origin—a “double fork stall” (DFS)—replication cannot be completed by conventional means. We previously showed how the distribution of replication origins in yeasts promotes complete genome replication even in the presence of irreversible fork stalling. This analysis predicts that DFSs are rare in yeasts but highly likely in large mammalian genomes. Here we show that complementary strand synthesis in early mitosis, ultrafine anaphase bridges, and G1-specific p53-binding protein 1 (53BP1) nuclear bodies provide a mechanism for resolving unreplicated DNA at DFSs in human cells. When origin number was experimentally altered, the number of these structures closely agreed with theoretical predictions of DFSs. The 53BP1 is preferentially bound to larger replicons, where the probability of DFSs is higher. Loss of 53BP1 caused hypersensitivity to licensing inhibition when replication origins were removed. These results provide a striking convergence of experimental and theoretical evidence that unreplicated DNA can pass through mitosis for resolution in the following cell cycle. DNA replication but capable of becoming active if necessary (5–9). We previously used mathematical analysis to show how the distribution of replication origins in yeasts can be explained by the need for complete genome replication in the presence of irreversible fork stalling (4). Our theory predicts that organisms with significantly larger genomes than yeast, such as those of mammals, will experience a much greater probability of replication failure genome-wide. In this work, we provide evidence for a postreplicative mechanism that allows the resolution of these unreplicated segments of DNA that involves segregation of template DNA strands during mitosis by the creation of ultrafine anaphase bridges (UFBs) and their recognition in the subsequent G1 phase by the DNA repair protein p53-binding protein 1 (53BP1). We show that 53BP1 nuclear bodies correlate with the expected number of DFSs, both when the number of replication origins is reduced and when the number of replication origins is increased. We also show that 53BP1 preferentially associates with DNA in larger replicons, as predicted Significance We provide evidence that in organisms with gigabase-sized genomes, such as humans, one or more stretches of DNA typically remain unreplicated when cells enter mitosis and are segregated to daughter cells via structures called ultrafine anaphase bridges. p53-binding protein 1 (53BP1) accumulates at the subsequent DNA structures inherited by each daughter cell in the following G1 phase to facilitate resolution in S phase. We show that the abundance of these structures match theoretical predictions for the number of unreplicated DNA segments when the number of replication origins is artificially increased or decreased. We show that 53BP1 preferentially binds to chromosomal regions with low numbers of replication origins. This work challenges the prevailing view of how genome stability is maintained in proliferating cells. | MCM | cell cycle | 53BP1 | UFB D uring the eukaryotic cell cycle, the genome must be precisely duplicated with no sections left unreplicated and no sections replicated more than once. To prevent rereplication, the process is divided into two nonoverlapping phases: during late mitosis and G1 replication origins are “licensed” for subsequent use by loading MCM2-7 double hexamers, and during S phase DNAbound MCM2-7 is activated to form processive CMG (CDC45MCM-GINS) helicases that drive replication fork progression. The prohibition of origin licensing during S phase and G2 ensures that rereplication of DNA cannot occur. However, the inability to license new origins after the onset of S phase provides a challenge for the cell to fully replicate the genome using its finite supply of licensed origins. Replication forks can irreversibly stall when they encounter unusual structures on the DNA, such as DNA damage or tightly bound protein–DNA complexes. When replication initiation occurs at a licensed replication origin the MCM2-7 double hexamer forms a pair of bidirectionally orientated CMG helicases (1–3). If one fork irreversibly stalls, the converging fork from a neighboring origin can compensate by replicating all of the DNA up to the stalled fork. However, if two converging forks both stall and there is no licensed origin between them—a “double fork stall” (DFS)—new replicative machinery cannot be recruited to replicate the intervening DNA (4). To compensate for this potential for underreplication, origins are licensed redundantly, with most (typically >70%) remaining dormant www.pnas.org/cgi/doi/10.1073/pnas.1603252113 Author contributions: A.M. designed, performed, and optimized experiments on 53BP1 nuclear bodies, UFBs and immunofluorescence, and ChIP-Seq; J.T.C. performed experiments on 53BP1 nuclear bodies, RPA and γ-H2AX foci, mitotic EdU, and clonogenic assays and performed the flow-cytometry experiments; L.A. performed the ChIP-Seq analysis; L.A. and M.A.M. performed the mathematical and computational analyses; A.M. and E.J.H. developed the 3D FACS protocol; E.-S.K. and V.G.G. made the HBEC cells; A.M. and J.T.C. analyzed data; T.J.N. coordinated the theoretical work; J.J.B. led the project; and A.M., J.T.C., and J.J.B. wrote the paper. The authors declare no conflict of interest. This article is a PNAS Direct Submission. Freely available online through the PNAS open access option. 1 A.M. and J.T.C. contributed equally to this work. 2 Present address: Northern Institute for Cancer Research, Medical School, Newcastle University, Newcastle upon Tyne NE2 4HH, United Kingdom. 3 To whom correspondence should be addressed. Email: [email protected]. This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10. 1073/pnas.1603252113/-/DCSupplemental. PNAS | Published online August 11, 2016 | E5757–E5764 SYSTEMS BIOLOGY Edited by James E. Cleaver, University of California, San Francisco, CA, and approved July 6, 2016 (received for review February 26, 2016) A 3 Mean number of DFSs Frequency x 103 4 3 2 1 0 wild-type B 5 10 100 1000 10000 origins added 2 1 0 1 origins removed Mathematical model Computer model: IMR90 Computer model: HeLa 40 60 80 100 120 140 160 Replication Origins (%) Inter−Origin Distances (kbp) C S/G2 mitosis D E 50 Frequency of cells (%) 12 μm DAPI 53BP1 Untreated S G1 G1 Cells Poisson Distr. (λ = 0.95) 40 30 20 10 0 Aphidicolin (0.2 μM) 0 1 2 3 4 5 6 7 Number of 53BP1 bodies per cell G 1 Mean 53BP1 bodies Mean 53BP1 bodies F 0.8 0.6 0.4 0.2 0 2 4 6 8 G1 (h after nocodazole release) 1.0 0.8 0.6 0.4 0.2 0 I II III IV V S phase stage (EdU morphology) Fig. 1. Potential mechanism for resolution of DFSs. (A) Distribution of replicon sizes in HeLa cells, based on data from ref. 13. The red bar represents the average replicon size of ∼31 kb. (B) Mean number of DFSs predicted using a mathematical model (4), and a computational model that uses origin data from HeLa and IMR-90 (13) when origins are added or depleted randomly. (C) Model for segregation of unreplicated DNA to daughter cells for resolution in the next cell cycle. (D) The 53BP1 nuclear bodies in untreated and aphidicolin-treated cells. (E) Frequency of G1-specific 53BP1 nuclear bodies (n = 100, three replicates). χ2 test for a fitted Poisson, P = 0.771. (F) Frequency of G1-specific 53BP1 nuclear bodies at the times indicated after nocodazole treatment and mitotic shake-off (n = 150, three replicates, error bars are SEM). χ2 test, P = 0.924. (G) Frequency of 53BP1 nuclear bodies at different stages of the replication timing program, as defined by O’Keefe et al. (32) (n = 150, three replicates, error bars are SEM). χ2 test, P = 4.998 × 10−4. by the theoretical analysis. This experimental work strongly supports the theoretical analysis of DFSs in organisms of differing genome size that we present in an accompanying paper (10). Results Refinements in technology have led to a convergence of origin mapping data in mammalian tissue culture cells (11–13). Fig. 1A shows the spacing between ∼90,000 replication origins (i.e., the replicon sizes) in HeLa cells derived from the data of Picard et al. (13). The average interorigin distance is ∼31 kb, consistent with initiation events being ∼100 kb apart (11, 14, 15) and ∼30% of origins being stochastically activated in any given S phase (6–9, 16, 17). Compared with yeast, human cells have an irregular E5758 | www.pnas.org/cgi/doi/10.1073/pnas.1603252113 distribution of origins with an unexpectedly high number of very large replicons (10). Using a mathematical approach that we have previously derived and validated (4) we estimate that one or two DFSs are expected to occur in every HeLa cell S phase (Fig. 1B). Similar numbers were obtained when we performed computational analyses based on the origin mapping data from both HeLa and primary IMR90 cells (Fig. 1B). The predicted number of DFSs increases when replication origins are removed and decreases when they are added (Fig. 1B). Theoretical analysis indicates that the distribution of origins in human cells is constrained to produce, on average, only a small number of DFSs and therefore indicates that human cells should possess a postreplicative mechanism capable of resolving these spontaneous events (10). Moreno et al. Mcm5 104 GAPDH 103 50k 20 50 D 1500 1000 500 0 32 48 64 3 2 1 Mcm5 RNAi 0 40 Total lysates - + F Dox Tubulin + Dox Chromatin 60 Frequency of cells (%) Cdc6 - Mcm5 Lamin B origins removed origins added 80 90 100 Non-induced Cdc6 overexpression 40 Poisson distr. (λ = 0.95) Poisson distr. (λ = 0.65) 30 20 10 0 1 2 3 4 5 1.0 I *** 1 0 Mathematical model Computer model HeLa: Mcm5 RNAi HeLa: Cdt1 RNAi HBEC: Cdc6 40 60 80 0.8 5 cm s iC on 2 tro l H Mean 53BP1 bodies Mean number of DFSs 70 Number of 53BP1 nuclear bodies per cell wild-type 3 60 50 0 G 50 Replication Origins (% control) Mcm5 RNAi (h) E 100k siM Early S-phase Mcm2 2000 16 50k DNA content 70 2500 Control 100k DNA content Mcm5 depletion (%) C 104 103 102 LaminB 0 G2 G1 Mcm5 105 Mcm5 0.6 Tubulin 0.4 1.00 0.23 0.2 0 100 120 140 160 Replication Origins (%) siControl siMcm5 RNAi Fig. 2. Origin number and the frequency of 53BP1 nuclear bodies. (A) Immunoblot of total and chromatin-bound MCM5 in HeLa cells after MCM5 RNAi. (B) Three-dimensional FACS of HeLa cells labeled with EdU (Left) and MCM2 (Right). Red, G1 phase: EdU negative and G1 DNA content. Blue, early S phase: incorporation of EdU without significant increase in total DNA content. Orange, late S phase: EdU positive cells with >G1 DNA content. Green, G2 phase: EdU negative and G2 DNA content. (C) FACS of chromatin-associated MCM2 signal in early S phase HeLa cells with indicated periods of MCM5 RNAi. (D) Frequency of G1-specific 53BP1 nuclear bodies (y-axis values) after MCM5 knockdown versus relative number of replication origins quantified by 3D FACS of DNA-bound MCM2 (x-axis values). Each point represents the mean of 100 cells. (E and F) CDC6-inducible HBEC cells. Immunoblot of CDC6 and tubulin in whole-cell lysates (E, Top) and MCM5 and Lamin B1 in chromatin samples (E, Bottom). Frequency distribution of 53BP1 nuclear bodies in HBEC cells (F) (n = 100, three replicates). χ2 tests for fitted Poissons, P > 0.87. The two conditions are significantly different (Wilcoxon rank sum test, P = 2.843 × 10−6). (G) Compilation of the predicted number of DFSs using the mathematical model and the computer simulation (from Fig. 1B) and the mean number of 53BP1 nuclear bodies in vivo (from D and F and Fig. S2E). (H) Frequency of G1-specific 53BP1 nuclear bodies in control and MCM5 RNAi-treated IMR-90 cells (n = 150, three replicates, error bars are SEM). t test, P = 1.79 × 10−4. (I) Immunoblot to show the depletion of MCM5 after RNAi. Quantification of band intensity is indicated below the blot. Because a DFS will create a large segment of unreplicated DNA, our analyses suggest that humans and metazoans in general with significantly larger genomes than yeasts have evolved mechanisms to resolve them. Unreplicated or damaged DNA may require topological unhooking for accurate segregation during mitosis and the lesions generated by this process could then be repaired in the following cell cycle (18, 19) (Fig. 1C). Chromatid condensation during mitosis could provide a procMoreno et al. essive unwinding activity to separate unreplicated DNA, leaving single-stranded gaps that could then be partially filled in during mitosis (20) or the following cell cycle. This mechanism depends on the resolution and segregation of topologically intertwined strands and could require a limited amount of DNA strand cutting (19–21). We therefore predict that this mechanism might be able to deal with only a small number of DFSs, as our theory predicts (10). PNAS | Published online August 11, 2016 | E5759 SYSTEMS BIOLOGY Chromatin 20 40 50 80 Mcm5 depletion (%) Mcm2 intensity 105 32 48 64 mid-late S 16 early S 32 48 64 EdU intensity Total lysates 16 PNAS PLUS B Control RNAi (h) Mcm5 RNAi (h) Mean number of 53BP1 bodies A B DAPI % 53BP1 bodies with RPA foci A 53BP1 5 μm RPA 40 *** 30 20 10 0 Merge siControl siMcm5 RNAi D Normalised 53BP1 signal strength Mean γ-H2AX foci 1.2 0.8 0.4 0 1.00 0.98 0.96 0.94 40 30 20 10 1 5 10 20 50 100 200 5001000 Replicon length (kbp) F 10000 5000 2000 Number of Replicons Replicon length (kbp) 1.02 siControl siMcm5 RNAi E 50+ 1.04 Number of replicons C 500 200 50 20 53BP1− Replicons 53BP1+ Replicons 1000 100 10 53BP1− 53BP1+ Replicon type 1 6 -6 10 - 10 16 - 1 6 25 - 2 5 40 - 4 60 - 60 10 - 1 0 160 - 00 1 250 - 60 2 40 0 - 50 60 0 - 400 10 0 - 60 0 16 00 100 - 0 2500 160 4000 250 0 60 00 40 0 00 - 6 00 - 1 00 00 0 00 5 Replicon length (kbp) Fig. 3. The 53BP1 is enriched at genomic loci that correspond to large replicons. (A) Representative image of the colocalization between G1-specific 53BP1 nuclear bodies and RPA foci. (B) Percentage of total cellular 53BP1 nuclear bodies that colocalize with RPA after treatment with control or MCM5 RNAi (n = 100, three replicates, error bars are SEM). t test, P = 3.01 × 10−3. (C) Mean frequency of G1-specific γ-H2AX foci in HeLa cells after MCM5 RNAi (n = 150, three replicates, error bars are SEM). t test, P = 0.585. (D) Plot of the average 53BP1/IgG signal ratio per kilobase against replicon size. A strong and significant correlation is observed (Spearman ρ = 0.91, P < 10−15). (E) Distribution of the size of 53BP1+ and 53BP1− replicons. t test, P < 10−15. (F) Frequency distribution of 53BP1+ and 53BP1− replicons across different replicon sizes. χ2 test, P < 10−15. Previous studies have suggested that 53BP1 recognizes these aberrant structures in the cell cycle following underreplication. The 53BP1 forms “nuclear bodies” in G1 phase that are symmetrically distributed between sister cells, possibly corresponding to lesions generated by transmission of DFSs through mitosis in the parent cell (18, 22). Consistent with this idea, the number of 53BP1 nuclear bodies increases in cells treated with replication inhibitors (23–25) (Fig. 1D and Fig. S1 A and B), suggesting that 53BP1 can recognize structures resulting from defective DNA replication. As our theory predicts (10), 53BP1 nuclear bodies in normal G1 phase HeLa cells conform to a Poisson distribution (suggesting that they are due to independent stochastic events) with a mean close to the predicted number of DFSs (Fig. 1E and Fig. S1 C and D). The number remains stable during G1 E5760 | www.pnas.org/cgi/doi/10.1073/pnas.1603252113 (Fig. 1F) but declines as cells progress through S phase, supporting the idea that they are resolved in a replication-dependent manner (Fig. 1G and Fig. S1E). To provide evidence for a link between DFSs and 53BP1 nuclear bodies, we depleted replication origins in HeLa cells using RNAi against two components of the origin licensing system, MCM5 and CDT1 (6) (Fig. 2A and Fig. S2A). We then used a 3D flow cytometry protocol, measuring DNA content, 5-ethynyl2′-deoxyuridine (EdU) incorporation, and chromatin-bound MCM2 to determine the amount of chromatin-bound MCM2-7 in cells entering S phase (Fig. 2B). Because origins are only licensed for use if they are associated with MCM2-7, the amount of DNA-bound MCM2 at the onset of S phase provides a measure of the number of available origins. Depletion of MCM5 by RNAi Moreno et al. Moreno et al. Discussion In this work we present evidence that in unperturbed cell cycles of human cells unreplicated DNA is frequently present at the end of G2, is partially filled in during early mitosis, and is segregated during mitosis for resolution during the following cell cycle. Our theoretical analysis (4, 10) suggests that in organisms such as humans with gigabase-sized genomes DFSs will routinely occur and create sections of unreplicated DNA that must be resolved by a postreplicative mechanism. The symmetrical distribution of 53BP1 nuclear bodies between daughter cells and their induction by replicative stresses means that they could mark the products of unreplicated DNA segregated to daughter cells. We show that when replication origins are deleted or added there is a strong correlation between the number of 53BP1 nuclear bodies and our theoretical predictions of DFSs as presented in our accompanying paper (10). We show that 53BP1 preferentially associates with larger replicons, in line with our theoretical predictions of DFS distribution. We also provide evidence for a mechanism for the processing of the unreplicated PNAS | Published online August 11, 2016 | E5761 PNAS PLUS chromosome fragile sites, and late-replicating DNA (Fig. S6 D and E). Taken together, these analyses show that 53BP1 is more likely to bind to DNA in large replicons, as predicted if 53BP1 recognizes DNA structures resulting from DFSs. The recent discovery that aphidicolin-treated cells exhibit EdU incorporation during early mitosis indicates that replication stress causes damage that is resolved postreplicatively by DNA repair synthesis (20). If unreplicated DNA is unwound during mitosis, ssDNA will be exposed, thereby providing a template for complementary strand synthesis. Consistent with this idea, MCM5 RNAi caused a significant increase in early-mitotic EdU foci (Fig. 4 A–C). This result, when combined with the colocalization of 53BP1 and RPA and lack of increased γ-H2AX foci (Fig. 3A), implies that the foci of EdU incorporation during early mitosis represent sites of DNA synthesis of unreplicated DNA (dashed lines in Fig. 1C). This postreplicative mechanism may not be able to complete replication of all of the unreplicated DNA generated by DFSs, which may be hundreds of kilobases in size (10). It has been suggested that UFBs, which contain single-stranded DNA, might represent a mechanism for resolving partially replicated stretches of DNA (20, 26–29) (dashed lines in Fig. 1C). Consistent with this idea, we observed that in untreated HeLa cells the number and distribution of UFBs closely matched the numbers of 53BP1 nuclear bodies. Further, the number of UFBs increased in line with 53BP1 nuclear bodies when MCM2-7 was partially depleted. This is consistent with the idea that UFBs provide a mechanism by which unreplicated DNA generated by DFSs is transmitted through mitosis to daughter cells to become ssDNA lesions coated with 53BP1 that form nuclear bodies (Fig. 4 D–F). Because we predict that DFSs occur frequently in normal cells, 53BP1 is likely to be performing a function in binding to the products of DFSs in G1 phase. The 53BP1 is known to protect damaged DNA from undergoing homologous recombination (30) and could perform this function at DFSs, which may allow the structures to be resolved by an alternative pathway in S phase. To explore this idea further, we examined a possible synthetic interaction between loss of 53BP1 and an increase in DFSs created by partial knockdown of MCM2-7. RNAi transfected cells were treated with increasing concentrations of hydroxyurea (HU) before a colony assay was performed. Cells partially depleted for MCM2-7 were hypersensitive to HU, due to their inability to use dormant replication origins (6–8). The 53BP1-depleted cells showed a sensitivity similar to that of control cells. However, cells depleted of 53BP1 showed a highly synergistic sensitivity to HU when combined with partial knockdown of MCM5 (Fig. 4 G and H and Fig. S6G). This shows that although 53BP1 is not essential it works together with dormant origins to protect cells from the consequences of replication fork failure. SYSTEMS BIOLOGY reduced the amount of DNA-bound MCM2 at S phase entry (Fig. 2C). Similar results were obtained with RNAi against the licensing factor CDT1 (Fig. S2 A and B). In both cases, overall EdU incorporation was not affected (Fig. S2C). In line with our theoretical predictions, the number of 53BP1 nuclear bodies increased in proportion to the reduction in DNA-bound MCM2 (Fig. 2D and Fig. S2 D and E). Similar results were obtained in U2OS cells treated with MCM5 RNAi (Fig. S2 F and G). Our theory also predicts that if cells have higher than normal numbers of licensed origins, the number of DFSs should reduce (Fig. 1B). To increase DNA-bound MCM2-7 we used a human bronchial epithelial cell line overexpressing the licensing protein CDC6 (Fig. 2E). The number of 53BP1 nuclear bodies in these hyperlicensed cells was reduced 30% compared with noninduced controls (Fig. 2F and Fig. S3 A and B), in line with our model. Fig. 2G combines all our data on the number of 53BP1 nuclear bodies (Fig. 2 D and F and Fig. S2E) to show that there is excellent agreement between our theoretical predictions (Fig. 1B) and the experimental data from cells with reduced or increased numbers of licensed origins. Fig. 2 H and I shows that there is also an increase in the frequency of 53BP1 nuclear bodies after MCM5 RNAi treatment of primary IMR-90 cells. This suggests a similar relationship between the amount of DNA-bound MCM2-7 and the number of 53BP1 nuclear bodies in both normal cells (IMR-90) and cancer cells (HeLa and U2OS). Taken together, our data provide strong support for the idea that failures of DNA replication caused by spontaneous DFSs cause the appearance of 53BP1 nuclear bodies in the subsequent G1. We next investigated the nature of the lesions marked by 53BP1 nuclear bodies in G1 cells. Our theory suggests that these structures represent single-stranded or partially single-stranded regions of DNA rather than double-strand DNA breaks (Fig. 1C). We therefore investigated the colocalization of 53BP1 nuclear bodies with the ssDNA binding protein RPA (replication protein A). Fig. 3 A and B shows that in untreated cells ∼7% 53BP1 nuclear bodies were associated with measurable levels of RPA (18), but partial depletion of MCM2-7 caused a large increase in colocalization to >30%. The increase of RPA in 53BP1 nuclear bodies in cells with a reduced origin number might reflect the larger distance between stalled forks, and hence longer stretches of unreplicated DNA that bind RPA. To rule out the possibility that G1-specific 53BP1 nuclear bodies mark double-strand breaks generated by synthetic reduction of licensed origins in the preceding S phase, we also quantified the frequency of G1-specifc γ-H2AX foci in response to MCM5 RNAi (Fig. 3C and Fig. S4). No significant increase of G1-specifc γ-H2AX foci was observed between the control and cells depleted of MCM5 (t test, P = 0.59). Although DFSs can occur at any region of the genome, theoretical analysis predicts that they are more likely in larger replicons rather than in smaller ones (4). To test this, we performed chromatin immunoprecipitation with anti-53BP1 antibodies and sequenced the precipitated DNA (Fig. S5). Cell fractionation and immunoblotting revealed that a majority of 53BP1 (∼75%) is associated with chromatin (Fig. S6B), and quantification of GFP-53BP1 intensity revealed that only ∼1% of 53BP1 signal originates from nuclear bodies (Fig. S6A). This means that the majority of DNA bound by 53BP1 is not associated with 53BP1 nuclear bodies. Consistent with this, the total genomic coverage from 53BP1 and IgG precipitations were comparable (Fig. S6C). However, the 53BP1/IgG binding ratio showed a highly significant correlation between replicon size and the strength of 53BP1 association (Fig. 3D and Fig. S5D). We then identified 1-kb regions of the genome with a high 53BP1/IgG ratio (P < 10−3); replicons were defined as 53BP1+ when they contained one or more 53BP1-enriched regions and 53BP1− otherwise. 53BP1+ replicons were on average approximately three times larger than 53BP1− replicons (Fig. 3 D and E and Fig. S6F). There was also a weak correlation between 53BP1 binding, DAPI EdU C *** 4 5 cm 5 siM on tro l 6 siC B 10 μm Mean mitotic EdU foci A Mcm5 3 Tubulin 2 1.00 0.23 1 0 siControl siMcm5 RNAi E Mean number of UFBs 3μm DAPI BLM F * 2.5 Frequency of cells (%) D 2.0 1.5 1.0 0.5 siControl siMcm5 H 1 s si iMc 53 m BP 5 + 1 si 53 BP 5 l tro cm on M si C si 53BP1 Mcm5 Tubulin Relative Number of Colonies (log) RNAi RNAi RNAi Control RNAi Mcm5 Poisson distr. RNAi Control ( =1.4) 30 Poisson distr. RNAi Mcm5 ( =2) 20 10 0 0 G 40 0 1 2 3 4 5 6 7 8 9 10 Number of UFBs per cell 100 10 RNAi Control RNAi Mcm5 RNAi 53BP1 RNAi Mcm5 + 53BP1 1 0 0.2 0.4 0.6 0.8 1.0 1.3 1.7 2.0 HU (μM) Fig. 4. MCM5 RNAi effects on mitosis. (A) Representative image of early-mitotic HeLa cell with foci of EdU incorporation. (B) Quantification of foci of EdU incorporation during prophase and prometaphase HeLa cells after MCM5 RNAi (n = 100, three replicates, error bars are SEM). t test, P = 3.43 × 10−8. (C) Immunoblot to show depletion of MCM5 after MCM5 RNAi. Quantification of band intensity is indicated below the blot. (D) Representative image of UFBs stained with BLM in an anaphase HeLa cell. (E) Frequency of UFBs after 48-h treatment with MCM5 RNAi (n = 75, three replicates, error bars are SEM). t test, P = 0.0473. (F) Frequency distribution of UFBs (n = 100 cells, four replicates). χ2 tests for Poissons, P > 0.85. A significant difference was observed. Wilcoxon rank sum test, P = 5.095 × 10−3. (G) Immunoblot showing the knockdown of MCM5 and 53BP1 by RNAi in HeLa cells. (H) Clonogenic assay after treatment, as seen in G, with increasing HU (three replicates, error bars are SEM). DNA between DFSs, involving complementary strand synthesis occurring in early mitosis, the resolution of partially replicated DNA via UFBs, and their association with 53BP1 in G1. A recent paper has shown that when DNA replication is inhibited the condensation of chromosomes during early mitosis is associated with the appearance of focal sites of DNA synthesis (20). Our theoretical work is based on the idea that, from the end of G1 through to the end of metaphase, MCM2-7 cannot be loaded onto DNA even if DFSs have occurred (4, 10). MCM2-7 forms the core of the replicative CMG helicase that unwinds DNA at the replication fork, and so the essential problem for completing replication at DFSs is to provide an alternative DNA unwinding activity. The chromosome condensation that resolves sister chromatids during early mitosis could provide such an alternative DNA unwinding activity. Once ssDNA is exposed, DNA polymerases will perform complementary strand synthesis to substantially fill in the gaps. Consistent with this we show that the frequency of foci of DNA synthesis in early mitosis is in line with our predictions of the number of DFSs and increases when origin number is reduced. We imagine that this complementary strand synthesis, which is not driven by a processive helicase, might not always fully complete DNA replication but may leave small gaps or lesions on the DNA. UFBs represent a potential intermediate that could allow such partially unreplicated DNA segments arising from DFSs to E5762 | www.pnas.org/cgi/doi/10.1073/pnas.1603252113 be segregated to daughter cells (18–20). We show that the number of UFBs in untreated cells increases in line with our predictions when origin number is reduced. Consistent with our theoretical model, we show that 53BP1 is preferentially bound to DNA in larger replicons, where DFSs are more likely to occur. Under normal circumstances, there is only a low colocalization of RPA with 53BP1 nuclear bodies, but this increases markedly when origin number is reduced. We imagine that normally a significant proportion of the remaining unreplicated DNA can undergo complementary strand synthesis in early mitosis, leaving little ssDNA to bind RPA in the subsequent G1. However, the distance between stalled forks in DFSs will increase following a reduction in origin number, and this should result in more ssDNA remaining after progression through mitosis, as we demonstrate. We also show that the increase in 53BP1 nuclear bodies in G1 is not significantly associated with an increase of γ-H2AX foci after partial depletion of origins, suggesting that 53BP1 nuclear bodies are not simply sites of double-stranded DNA breaks. The 53BP1 nuclear bodies are ultimately dispersed as the DNA replicates during S phase, suggesting that the unusual DNA structures that they mark are not fully resolved until another round of replication has occurred. Finally, we show that 53BP1 synergizes with dormant origins to protect genome integrity in the presence of replicative stress, as evidenced by its hypersensitivity to HU when the number of Moreno et al. Materials and Methods Cell Culture. HeLa and IMR-90 cells were obtained from the American Type Culture Collection and used at a population doubling level lower than 30 and 20, respectively, and maintained in DMEM (41966; Invitrogen), supplemented with 10% FBS (10270106; Invitrogen) and penicillin and streptomycin at 37 °C in 5% CO2. HBEC-Cdc6-Tet-On (human bronchial epithelial cells) were grown in keratinocyte serum-free medium (17005-075; Invitrogen) supplemented with 50 μg/mL bovine pituitary extract and 5 ng/mL hEGF (17005-075; Invitrogen). The HBEC cell line was developed as described in ref. 31. Briefly, immortalized HBECs were infected with PLVX-Tet-On with blasticidin resistance (3 μg/mL) and PLVX-TRE-Cdc6 with zeocin resistance (12.5 μg/mL). Clones with robust doxycycline-dependent induction (5 μg/mL) were selected. RNAi and Transfections. siRNA duplexes were obtained from Thermo Fisher Scientific, and the sequences were as follows: control: 5′-UAGCGACUAAACACAUCAA -3′; MCM5: 5′-GGAUCUGGCCAGCUUUGAU -3′; CDT1: SMARTPool M-003248-02; and 53BP1: 5′- GAAGGACGGAGUACUAAUA-3′. Transfection was performed with Lipofectamine RNAiMAX (Invitrogen). Fifty nanomolar siRNA was mixed with the Lipofectamine in Opti-MEM medium (Invitrogen). The mixture was added to 50–60% confluent cells in antibiotic-free DMEM (Invitrogen). Cells were subjected to different times of transfection to obtain variable reductions in protein level. Immunoblotting and Antibodies. Immunoblotting was performed as previously described (8). Western blotting was performed according to standard procedures. Extraction of the chromatin-bound fraction was performed by treatment with CSK extraction buffer (10 mM Hepes, pH 7.4, 300 mM sucrose, 100 mM NaCl, 3 mM MgCl2, and 0.5% Triton-X-100) for 10 min on ice. The pellet, containing chromatin-associated proteins, was processed for Western blotting. The antibodies used were MCM5 (sc-136366; Santa Cruz), CDT1 (ab183478; Abcam), tubulin (T6199; Sigma-Aldrich), 53BP1 (A300-272A; Bethyl), Lamin B1 (16048; Abcam), GAPDH (ab9484; Abcam), and CDC6 (05-550; Millipore). Immunofluorescence. Cells were seeded in six-well plates containing glass coverslips. At the required times for each experiment they were fixed with 4% (vol/vol) formaldehyde, permeabilized with 0.1% Triton X-100 in TBS, and blocked with 0.5% fish skin gelatin (G-7765; Sigma) for 1 h. Cells were then incubated with the relevant antibodies overnight at 4 °C and washed with 0.1% TBS-Tween before incubation with Alexa secondary antibodies (Invitrogen). Cells were incubated with DAPI (D9542; Sigma) and mounted using Vectashield mounting medium (H-1000; Vector Laboratories). The antibodies used were 53BP1 (NB-100-904; Novus), Cyclin A (ab16726; Abcam), BLM (sc7790; Santa Cruz), RPA (ab2175; Abcam), γ-H2AX (2577L; Cell Signaling), and phospho-histone H3 (9701S; Cell Signaling). For incorporation of EdU during early mitosis, asynchronous HeLa cells were incubated in 40 μM EdU (Invitrogen) for 30 min before fixation. To visualize incorporated EdU the cells were incubated in Click-iT EdU reaction, following the manufacturer’s pro- 1. Evrin C, et al. (2009) A double-hexameric MCM2-7 complex is loaded onto origin DNA during licensing of eukaryotic DNA replication. Proc Natl Acad Sci USA 106(48): 20240–20245. 2. Gambus A, Khoudoli GA, Jones RC, Blow JJ (2011) MCM2-7 form double hexamers at licensed origins in Xenopus egg extract. J Biol Chem 286(13):11855–11864. Moreno et al. PNAS PLUS tocol (Thermo Fisher Scientific). Cells in prophase and prometaphase were identified by phospho-H3 antibody signal and DAPI morphology. Image Acquisition and Analysis. Microscopy images were acquired using an Olympus IX70 deltavision deconvolution microscope. An Olympus 63× oil immersion objective was used, and images were captured using a CCD camera. Data from microscopy experiments was analyzed using Volocity 3D analysis software (PerkinElmer). The nucleus was outlined as the region of interest, and lower intensity threshold was set to a number that indicated the intranuclear background. Three-Dimensional Flow Cytometry. Cells were incubated with 40 μM EdU (Invitrogen) for 30 min before trypsinization and collection. Cells were preextracted with CSK extraction buffer for 10 min on ice and then fixed in 2% (vol/vol) formaldehyde for 15 min. For MCM2 labeling, cells were permeabilized in ice-cold 70% (vol/vol) ethanol for 10 min and incubated for 1 h with anti-BM28 primary antibody (1:500). After staining with AlexaFluor 488-labeled secondary antibody (Invitrogen) cells were washed and Click-it EdU reaction was performed for 30 min. Finally, cells were treated with propidium iodide (PI) solution (50 μg/mL PI, 50 μg/mL RNaseA, and 0.1% Triton-X-100) and transferred to FACS tubes for analysis. Samples were acquired using a BD FACSCanto and the results analyzed using FlowJo software. ChIP Sequencing. Cells were cross-linked for 30 min using 1.5 mM ethylene glycol-bis(succinimidyl succinate) followed by 10 min with 1% formaldehyde. Reactions were stopped with 2 M glycine and cells were resuspended in CSK buffer for 10 min. Cells were treated with 5 μL micrococcal nuclease (2,000 U/μL) for 10 min at 37 °C, neutralized with 2× RIPA [100 mM Tris·HCl, pH 7.4, 300 mM NaCl, 2% (vol/vol) IGE-Pal CA-630, 0.5% Na deoxycholate, and 1 mM EDTA] and incubated on ice for 10 min. Samples were then precleared with Protein A Dynabeads for 1 h at 4 °C and then incubated with 7 μg anti-53BP1 antibody (A300-272A; Bethyl) rotating overnight at 4 °C. DNA was eluted (1% SDS, 0.1 M NaHCO3, and 0.1% Tween-20) and used for library preparation using the NEBNext ChIP-seq kit and sequenced on an Illumina HiSEq 2500 by Edinburgh Genomics. The raw sequence data were assessed, aligned, and combined using R version 3.2.2, Rsubread version 1.20.2, and SAMtools version 1.2. Aligned reads were analyzed using a script based on R version 3.2.2 and Rsamtools 1.22. The quality assessment and a detailed description of the analysis pipeline are available in SI Materials and Methods. Files containing the aligned reads are available at the European Nucleotide Archive (accession no. PRJEB12222) and the R script used for the analysis is available as Dataset S1. Mathematical and Computational Analysis. The mean number of DFSs was computed with the formula logð2Þ K X Ng Ni − log 1 + logð2Þ , Ns N s i=1 where Ng indicates the genome size, NS indicates the median stalling distance, and the Ni ði = 1..kÞ indicate the length of the K replicons. Replication origin (RO) depletion and augmentation experiments were performed by randomly removing or increasing the number of ROs. More details on the mathematical model used are described in refs. 4 and 10 and an extended summary of the approach used is available in SI Materials and Methods. Clonogenic Assay. Cells were transfected in 10-cm dishes and replated into sixwell dishes before treatment. HU was added to cells for 48 h before medium was replaced with fresh growth medium. After 10 d, cells were washed, fixed, and stained with Crystal Violet. The number of colonies >1 mm were recorded. For each genotype, cell viability of untreated cells was defined as 100%. ACKNOWLEDGMENTS. This work was supported by Wellcome Trust Grants WT096598MA and 097945/B/11/Z; Greek General Secretariat for Research and Technology Program of Excellence II (Aristeia II) Grant 3020 (to V.G.G.); the Dundee Imaging Facility, supported by Wellcome Trust Award 097945/B/ 11/Z and Medical Research Council Award MR/K015869/1; and the Flow Cytometry and Cell Sorting Facility at the University of Dundee. V.G.G. and E.-S.K. received an Experimental Research Center Elpen Scholarship and National Scholarships Foundation-Siemens Aristeia Fellowship. 3. Remus D, et al. (2009) Concerted loading of Mcm2-7 double hexamers around DNA during DNA replication origin licensing. Cell 139(4):719–730. 4. Newman TJ, Mamun MA, Nieduszynski CA, Blow JJ (2013) Replisome stall events have shaped the distribution of replication origins in the genomes of yeasts. Nucleic Acids Res 41(21):9705–9718. PNAS | Published online August 11, 2016 | E5763 SYSTEMS BIOLOGY dormant origins was reduced. The 53BP1 gene (TP53BP1) is not essential and has been associated with protecting damaged DNA from undergoing inappropriate homologous recombination (30). The synthetic interaction that we show between loss of 53BP1 and partial MCM knockdown could therefore be a consequence of unreplicated DNA undergoing inappropriate homologous recombination during S phase. In the accompanying paper (10) we provide a theoretical analysis of origin distribution that leads to the conclusion that DFSs are almost inevitable in the large genomes of human cells. The experimental work presented here provides a potential mechanism by which DFSs can be processed, involving partial filling-in of unreplicated segments during early mitosis, segregation to daughter cells via UFBs, their association with 53BP1 nuclear bodies during G1, and their ultimate resolution during the next S phase. Our work therefore provides both experimental and theoretical evidence that structures resulting from replication failure can pass through mitosis for resolution in the next cell cycle. 5. Woodward AM, et al. (2006) Excess Mcm2-7 license dormant origins of replication that can be used under conditions of replicative stress. J Cell Biol 173(5):673–683. 6. Ge XQ, Jackson DA, Blow JJ (2007) Dormant origins licensed by excess Mcm2-7 are required for human cells to survive replicative stress. Genes Dev 21(24):3331–3341. 7. Ibarra A, Schwob E, Méndez J (2008) Excess MCM proteins protect human cells from replicative stress by licensing backup origins of replication. Proc Natl Acad Sci USA 105(26):8956–8961. 8. Ge XQ, Blow JJ (2010) Chk1 inhibits replication factory activation but allows dormant origin firing in existing factories. J Cell Biol 191(7):1285–1297. 9. Blow JJ, Ge XQ, Jackson DA (2011) How dormant origins promote complete genome replication. Trends Biochem Sci 36(8):405–414. 10. Mamun AM, et al. (2016) Inevitability and containment of replication errors for eukaryotic genome lengths spanning megabase to gigabase. Proc Natl Acad Sci USA 113:E5765–5774. 11. Besnard E, et al. (2012) Unraveling cell type-specific and reprogrammable human replication origin signatures associated with G-quadruplex consensus motifs. Nat Struct Mol Biol 19(8):837–844. 12. Mesner LD, et al. (2013) Bubble-seq analysis of the human genome reveals distinct chromatin-mediated mechanisms for regulating early- and late-firing origins. Genome Res 23(11):1774–1788. 13. Picard F, et al. (2014) The spatiotemporal program of DNA replication is associated with specific combinations of chromatin marks in human cells. PLoS Genet 10(5): e1004282. 14. Jackson DA, Pombo A (1998) Replicon clusters are stable units of chromosome structure: Evidence that nuclear organization contributes to the efficient activation and propagation of S phase in human cells. J Cell Biol 140(6):1285–1295. 15. Conti C, et al. (2007) Replication fork velocities at adjacent replication origins are coordinately modified during DNA replication in human cells. Mol Biol Cell 18(8): 3059–3067. 16. Blow JJ, Ge XQ (2009) A model for DNA replication showing how dormant origins safeguard against replication fork failure. EMBO Rep 10(4):406–412. 17. Kunnev D, et al. (2015) Effect of minichromosome maintenance protein 2 deficiency on the locations of DNA replication origins. Genome Res 25(4):558–569. 18. Lukas C, et al. (2011) 53BP1 nuclear bodies form around DNA lesions generated by mitotic transmission of chromosomes under replication stress. Nat Cell Biol 13(3): 243–253. 19. Naim V, Wilhelm T, Debatisse M, Rosselli F (2013) ERCC1 and MUS81-EME1 promote sister chromatid separation by processing late replication intermediates at common fragile sites during mitosis. Nat Cell Biol 15(8):1008–1015. E5764 | www.pnas.org/cgi/doi/10.1073/pnas.1603252113 20. Minocherhomji S, et al. (2015) Replication stress activates DNA repair synthesis in mitosis. Nature 528(7581):286–290. 21. Ying S, et al. (2013) MUS81 promotes common fragile site expression. Nat Cell Biol 15(8):1001–1007. 22. Harrigan JA, et al. (2011) Replication stress induces 53BP1-containing OPT domains in G1 cells. J Cell Biol 193(1):97–108. 23. Rappold I, Iwabuchi K, Date T, Chen J (2001) Tumor suppressor p53 binding protein 1 (53BP1) is involved in DNA damage-signaling pathways. J Cell Biol 153(3):613–620. 24. Anderson L, Henderson C, Adachi Y (2001) Phosphorylation and rapid relocalization of 53BP1 to nuclear foci upon DNA damage. Mol Cell Biol 21(5):1719–1729. 25. Polo SE, Jackson SP (2011) Dynamics of DNA damage response proteins at DNA breaks: a focus on protein modifications. Genes Dev 25(5):409–433. 26. Chan KL, North PS, Hickson ID (2007) BLM is required for faithful chromosome segregation and its localization defines a class of ultrafine anaphase bridges. EMBO J 26(14):3397–3409. 27. Chan KL, Hickson ID (2009) On the origins of ultra-fine anaphase bridges. Cell Cycle 8(19):3065–3066. 28. Barefield C, Karlseder J (2012) The BLM helicase contributes to telomere maintenance through processing of late-replicating intermediate structures. Nucleic Acids Res 40(15):7358–7367. 29. Biebricher A, et al. (2013) PICH: A DNA translocase specially adapted for processing anaphase bridge DNA. Mol Cell 51(5):691–701. 30. Bunting SF, et al. (2010) 53BP1 inhibits homologous recombination in Brca1-deficient cells by blocking resection of DNA breaks. Cell 141(2):243–254. 31. Petrakis TG, et al. (2016) Exploring and exploiting the systemic effects of deregulated replication licensing. Semin Cancer Biol 37-38:3–15. 32. O’Keefe RT, Henderson SC, Spector DL (1992) Dynamic organization of DNA replication in mammalian cell nuclei: Spatially and temporally defined replication of chromosome-specific alpha-satellite DNA sequences. J Cell Biol 116(5):1095–1110. 33. Morgan M, et al. (2009) ShortRead: A bioconductor package for input, quality assessment and exploration of high-throughput sequence data. Bioinformatics 25(19): 2607–2608. 34. Liao Y, Smyth GK, Shi W (2013) The Subread aligner: Fast, accurate and scalable read mapping by seed-and-vote. Nucleic Acids Res 41(10):e108. 35. Weddington N, et al. (2008) ReplicationDomain: A visualization tool and comparative database for genome-wide replication timing data. BMC Bioinformatics 9:530. Moreno et al. Supporting Information Moreno et al. 10.1073/pnas.1603252113 SI Materials and Methods RO Dataset. Genomic regions of HeLa cells associated with replication origins were obtained from the OriSeq project (13) using the h19 human genome. Each genomic region was associated with a single point on the genome, which was obtained by taking the middle points of the genomic region under consideration. The replication origin positions allowed us to divide each chromosome into a set of replicons, defined as an area on a chromosome delimited on both sides by replication origins. The largest replicon of each chromosome was removed from the analysis because these replicons correspond to the centromeric areas, which are difficult to study using sequencing experiments. When the replication origin positions were used to compute the probability of DFSs, the genome size was adjusted accordingly. These data were used in all of the circumstances that required the positions of ROs. Mathematical Model and Computational Simulations. The details of the mathematical model used with an extensive analysis of its theoretical consequences can be found in ref. 10 and only a limited description will be presented here. In ref. 4 we derived a simple model of DFS, using two key assumptions: that all of the licensed ROs can be activated as necessary (therefore any time constraint is excluded, a condition likely to be true for somatic human cells) and that there is a constant small probability per nucleotide for each individual replication fork to irreversibly stall. In the same article (4), we showed that if all of the replicons are significantly smaller than the median stalling distance and with a distribution, characterized by a limited dispersion (conditions compatible with experimentally derived replicon size distribution in yeasts), the probability of at least one DFS in a given cell cycle is Pðzero DFSÞ ≈ 1 − ðlog 2Þ2 Nl Ng 1 + R2 , 2 2 Ns [S1] where Nl is the average replicon size, Ng is the size of the genome, Ns is the median stalling distance, and R is the ratio of the SD to the mean of the replicon size distribution. Because in human cell lines a very large variation can be observed in replicon size and few replicons are as large as half of the median stalling distance, the above approximation is inappropriate. Hence, we lifted this approximation and used an exact approach (10). Under these new conditions, in a given genome with K replicons Ni (with i = 1, . . ., K − 1) we have ! K X Ng Ni Pðzero DFSÞ = exp −logð2Þ + log 1 + logð2Þ . Ns Ns i=1 Computational experiments on RO depletion were performed by randomly sampling a given percentage of ROs from the data for each chromosome with the extremes excluded, and by computing the mean number of DFSs using Eq. S2 and Eq. S4. Computational experiments on RO augmentation were performed by inserting a given percentage of new ROs in random positions on each chromosome. To bolster biological reality and for compatibility with the RO dataset used, insertions that resulted in the creation of replicons smaller than 4 kb were ignored. Each computational experiment was repeated 100 times to account for stochastic variability. The SEM associated with the expected number of DFSs was consistently lower than 0.1% and therefore was not reported. ChIP Sequencing Experiments. Two technical replicas were performed for each of five ChIP experiments on 53BP1 and IgG control. Each experiment was associated to a unique key (Fig. S5A). Quality of the reads was assessed using the qa function of the ShortRead R package (33). The analysis is based on a sample of 1 million reads for each replica. Diagnostic plots (Fig. S5 B and C) indicate a consistently good quality of the reads, and no trimming was deemed necessary. Reads were aligned to the human genome UCSC version h19, to ensure compatibility with the replication origin dataset, using the align function of the Rsubread package with default parameter values (34). Alignments containing indels were ignored. The number and percentage of aligned reads were consistently high, as indicated by Fig. S5A. Because cells were preextracted using CSK treatment, the aligned reads were tested for contamination from mitochondrial DNA. Contamination was detected in both technical replicas of 53BP1 in Experiment 5, with an amount of mitochondrial DNA up to ∼100 times higher than other replicas. This was considered an indication of potential problems in the experimental procedure, and both technical replicas were excluded from further analysis. All of the remaining experiment and technical replicas were combined into a single file containing the 53BP1 and IgG reads (http://www.ebi.ac.uk/ena/data/view/ PRJEB12222; files ERX1269088 and ERX1269089, and contains the control IgG and 53BP1 immunoprecipitates, respectively), sorted, and indexed using SAMtools version 1.2. Analysis of Binding Events. Aligned 53BP1 and IgG reads were associated with a unique replicon by considering the initial position of the reads. Because the total number of reads was significantly different for the two antibodies, the number of reads for each replicon was normalized by dividing it by the total number of chromosomal reads of the antibody under consideration. For each replicon, the ratio of the normalized number of reads was then considered. More formally, for each replicon i, 53BP1 signal Sigi was computed as [S2] The distribution of DFSs can be well approximated by a Poisson distribution, because in a Poisson distribution the probability of zero events is Pðzero eventsÞ = expð− λÞ, [S3] where λ is mean number of events. Then λ = −logðPðzero eventsÞÞ. [S4] Combining Eq. S4 with Eq. S2 we can therefore calculate the mean number of DFSs in human cell lines. Moreno et al. www.pnas.org/cgi/content/short/1603252113 53BP1i Sigi = P j 53BP1j !, ! IgGi P , j IgGj where 53BP1i and IgGi are the numbers of 53BP1 and IgG reads associated with replicon i, and the index j varies along all of the replicons of the chromosome under consideration. To account for the noise associated with the ChIP sequencing experimental pipeline, the replicons were then grouped according to their size and the mean of each group was computed. In the most populated length group the mean was associated with limited errors; however, very large and very small replicons are characterized by a much larger uncertainty. Moreover, areas with limited mappability can be 1 of 6 frequently observed in large replicons, resulting in a weakened 53BP1 signal (Fig. S5D, which reports the same information of Fig. 3B with error bars indicating the SEM). To assess 53BP1 enrichment, the genome was divided into contiguous areas of 1 kb and for each area k, Sigk was computed 5 4 3 Frequency of cells (%) EdU CyclinA Merge 1 0 Aphidicolin D 100 G1 Phase Population 90 S Phase Population 40 80 G2 Phase Population 70 Poisson Distribution G1 Phase 60 50 40 30 20 10 0 0 1 2 3 4 5 6 7 Number of 53BP1 nuclear bodies per cell E % of cells 53BP1 2 Control C B ** 6 100 90 80 70 60 50 40 30 20 10 0 G1 T0 T2 S T4 T6 Frequency of cells (%) Mean 53BP1 nuclear bodies A as discussed above. For each chromosome, Sigk is approximately log-normally distributed. It is therefore possible to use a fitted log-normal distribution to associate, with each area k, a P value that indicates the probability of observing by chance a value of Sigk as large as observed or more. Control 35 Aphidicolin 30 25 20 15 10 5 0 0 1 2 3 4 5 6 7 8 9 10 11 12 Number of 53BP1 nuclear bodies per cell G2 T8 T10 T12 Time after Noc release (h) Fig. S1. Dynamics of 53BP1 nuclear bodies. (A) Mean number of G1-specific 53BP1 nuclear bodies in untreated and aphidicolin-treated cells. Error bars are SEM of three replicates. (B) Representative image of 53BP1 nuclear body identification at different stages of the cell cycle. Cells were labeled for 53BP1 (green), EdU (orange), and Cyclin A (red). (C) Frequency of 53BP1 nuclear bodies at different stages of the cell cycle. Only G1-specific nuclear bodies fit a Poisson distribution. (D) Frequency of G1-specific 53BP1 nuclear bodies in untreated and aphidicolin-treated cells. n = 100. (E) Frequency of cells at different stages of the cell cycle after release from nocodazole arrest. Cells in G1 were assessed for the frequencies of 53BP1 nuclear bodies from T2–T8 (green area) and cells from T8–T12 were used to identify the S-phase pattern (yellow area). Moreno et al. www.pnas.org/cgi/content/short/1603252113 2 of 6 B Cdt1 RNAi (h) RNAi Control 16 32 48 64 Early S-phase Mcm2 Total Lysates A Mcm5 Cdt1 * GAPDH 5000 4000 3000 2000 1000 0 Control 16 32 48 64 Cdt1 RNAi (h) EdU C 10 Mcm5 RNAi (h) 5 Control 16 32 48 64 104 103 102 50k 100k 50k 100k 50k 100k 50k 100k 50k 100k DNA Content Cdt1 RNAi (h) Control 16 50k 100k 50k 100k 32 48 64 104 103 102 50k 100k 40 40 RNAi Control RNAi Control 20 15 10 5 25 20 15 10 5 0 0 1 2 3 4 5 6 7 8 9 10 11 12 0 1 2 3 4 5 6 7 8 9 10 11 12 Number of 53BP1 nuclear bodies Number of 53BP1 nuclear bodies E Poisson distr. RNAi Cdt1 ( = 2.40) 3.0 2.5 2.0 1.5 1.0 0.5 Cdt1 RNAi 0 40 50 60 70 80 90 100 Replication Origin (%) Mean 53BP1 nuclear bodies F 3.5 G 3 *** 5 25 Poisson distr. RNAi Control ( = 1.46) 30 cm Poisson distr. RNAi Mcm5 ( = 2.68) tro l 30 siM Poisson distr. RNAi Control ( = 1.46) RNAi Cdt1 (50-60% origin deletion) 35 RNAi Mcm5 (50-60% origin deletion) on 35 0 Mean 53BP1 nuclear bodies 50k 100k siC Number of cells (%) D 50k 100k DNA Content Number of cells (%) EdU 105 Mcm5 2 Tubulin 1.00 1 0 0.21 U2OS siControl siMcm5 RNAi (U2OS) Fig. S2. Quantification of origin reduction. (A) Western blot for replication origin depletion in HeLa cells using RNAi against Cdt1. Cells were transfected for the indicated time before harvesting. (B) Representative FACS plot of chromatin-associated Mcm2 in early S-phase HeLa cells after Cdt1 depletion. (C) PI/EdU FACS profiles for Mcm5 and Cdt1 depletion in HeLa cells. (D) Frequency distribution of 53BP1 nuclear bodies in Mcm5 RNAi-treated cells (Left) and Cdt1 RNAitreated cells (Right). (E) Mean number of G1-specific 53BP1 nuclear bodies in response to varying degrees of licensing knockdown as measured in B after Cdt1 RNAi. The 53BP1 nuclear bodies were quantified in triplicate in at least 100 cells. (F) Mean number of G1-specific 53BP1 nuclear bodies in U2OS cells after treatment with control or MCM5 RNAi. Error bars are SEM of three replicates, P = 1.4 × 10−12. (G) Total U2OS cell extract after treatment with control or MCM5 RNAi, immunoblotted for MCM5. Moreno et al. www.pnas.org/cgi/content/short/1603252113 3 of 6 B ** - Dox + Dox 104 Mcm2 1 103 0.5 (30% RO addition) 105 0 EdU Mean 53BP1 nuclear bocies A - Dox + Dox 104 103 102 50K 100K 50K 100K DNA Content Fig. S3. Cdc6 overexpression. (A) Mean number of 53BP1 nuclear bodies in uninduced and overexpression of Cdc6 in HBEC cells. Error bars are SEM of three replicates, P = 0.00662. (B) FACS profiles of the distribution of chromatin-associated Mcm2 and the EdU incorporation of uninduced and Cdc6 overexpressing HBEC cells. DAPI EdU/CyclinA B G1 5 Mcm5 S Tubulin 1.00 γ-H2AX s iM cm s iC on tro l A 0.27 Merge 10 μm Fig. S4. G1-specific γ-H2AX foci. (A) Representative staining of γ-H2AX foci in asynchronous HeLa cells. Intense γ-H2AX foci were quantified in cells that stained negatively for EdU and Cyclin A. (B) Transfected cells used for microscopy were tested for successful depletion by immunoblotting for Mcm5. Quantification of Mcm5 reduction is indicated below the corresponding treatments. Moreno et al. www.pnas.org/cgi/content/short/1603252113 4 of 6 A Experiment 53BP1 Experiment 1 – Technical replica 1 53BP1 Experiment 1 – Technical replica 2 53BP1 Experiment 2 – Technical replica 1 53BP1 Experiment 2 – Technical replica 2 53BP1 Experiment 3 – Technical replica 1 53BP1 Experiment 3 – Technical replica 2 53BP1 Experiment 4 – Technical replica 1 53BP1 Experiment 4 – Technical replica 2 53BP1 Experiment 5 – Technical replica 1 53BP1 Experiment 5 – Technical replica 2 IgG Experiment 1 – Technical replica 1 IgG Experiment 1 – Technical replica 2 IgG Experiment 2 – Technical replica 1 IgG Experiment 2 – Technical replica 2 IgG Experiment 3 – Technical replica 1 IgG Experiment 3 – Technical replica 2 IgG Experiment 4 – Technical replica 1 IgG Experiment 4 – Technical replica 2 IgG Experiment 5 – Technical replica 1 IgG Experiment 5 – Technical replica 2 2.0 30 20 Quality Score Reads 15’085’705 15’085’705 25’316’792 25’316’792 29’105’981 29’105’981 35’869’224 35’869’224 18’644’065 18’644’065 18’585’722 18’585’722 14’856’583 14’856’583 17’489’711 17’489’711 32’639’637 32’639’637 37’701’283 37’701’283 Proportion of reads B Key 1 2 3 4 5 6 7 8 9 10 1 2 3 4 5 6 7 8 9 10 10 9 10 30 20 10 5 6 7 8 30 20 10 1 2 3 10 30 20 10 7 8 30 20 10 2 1 2 3 4 5 0.5 0.0 2.0 1.5 1.0 0.5 102030 102030 102030 102030 3 6 7 8 9 10 1 2 3 4 5 2.0 1.5 1.0 0.5 0.0 2.0 1.5 1.0 0.5 0.0 4 0 50 100 0 50 100 0 50 100 0 50 100 Cycle D 10 102030 Proportion of reads Quality Score 10 1 9 Average (calibrated) base quality 20 6 8 0.0 4 30 5 7 1.0 Cycle 9 6 1.5 0 50 100 0 50 100 0 50 100 0 50 100 C Aligned reads 13’431’878 (89.04 %) 13’289’715 (88.09 %) 22’624’95’5 (89.37 %) 22’427’05’5 (88.59 %) 25’570’708 (87.85 %) 25’320’996 (87.00 %) 31’837’763 (88.76 %) 31’546’518 (87.95 %) 16’632’221 (89.21 %) 16’468’113 (88.33 %) 16’502’266 (88.79 %) 16’368’731 (88.07 %) 13’250’134 (89.19 %) 13’130’364 (88.38 %) 15’158’814 (86.67 %) 15’002’940 (85.78%) 28’864’826 (88.43 %) 28’550’761 (87.47 %) 34’159’997 (90.61 %) 33’851’465 (89.79 %) 102030 102030 102030 102030 102030 Average (calibrated) base quality Normalised signal strength 1.55 1.50 1.45 1.40 5 10 20 50 100 200 500 1000 Replicon length (kbp) Fig. S5. Details of 53BP1 ChIP sequencing. (A) Experiment names, key, and number of reads. Details of the alignment can be found in SI Materials and Methods. (B and C) ChIP sequencing quality analysis for 53BP1 (B) and IgG (C). Quality score with respect to the length of the reads (Left) and quality distribution (Right). (D) Plot of the average 53BP1/IgG signal ratio per kilobase against replicon size including SEM (error bars not reported when fewer than three values are present). Moreno et al. www.pnas.org/cgi/content/short/1603252113 5 of 6 B 10 9 C *** 10-1 108 107 CSK Soluble 106 53BP1 105 LaminB 104 Tubulin 103 102 101 % reads for 1kb regions Sum intensity 53BP1 (a.u.) A 53BP1 1.50 Replication Timing Log2(Early/Late) 1.55 2 1 0 -1 10D 10E 10F 10G 11C 11D 11E 11F 11G 11H 12B 12E 13A 13C 13D 14B 14C 15A 16C 16D 17B 18A 18B 1A 1B 1D 1E 1F 1I 1G 1K 1L 20B 22B 2C 2D 2E 2F 2G 2H 2I 2J 3A 3B 3C 3D 4A 4C 4D 4F 5C 5D 5E 5F 6B 6C 6E 6F 6G 7B 7C 7D 7E 7F 7G 7H 7I 7J 8B 8C 8D 9B 9D XB XC XD -2 Fragile site ID Genome wide F Density (x10-5) E Genome Wide 1.45 53BP1/IgG 10-5 10-7 Nuclear Nucleus Bodies D 10-3 Null 53BP1enriched 5 IgG 53BP1 - Replicons 53BP1 + Replicons 4 3 2 1 5 10 20 50 100 200 500 Replicon length (kbp) Number of colonies G 400 300 200 100 0 Control Mcm5 53BP1 RNAi Mcm5+ 53BP1 Fig. S6. 53BP1 ChIP sequencing distributions. (A) Total intensity of 53BP1 in the nucleus and in nuclear bodies obtained by quantifying microscopy images of cells expressing GFP-53BP1. (B) Immunoblot of 53BP1 in CSK-extracted pellets and CSK-solubilized fractions of HeLa cells. (C) Distribution of ChIP sequencing reads for 53BP1 and control IgG across the entire human genome, with DNA grouped into 1-kb bins. (D) ChIP sequencing 53BP1/IgG ratio for common fragile sites. (E) Mean replication timing computed for each 1-kb genomic region enriched in 53BP1 using timing data from Weddington et al. (35). The null distribution, using all of the values reported for HeLa, is plotted for comparison (Left). The distributions are significantly different (Wilcoxon signed rank test P < 10−10). (F) Frequency of 53BP1+ and 53BP1− replicons of different sizes (χ2 test, P = 5 × 10−4). (G) Plating efficiency of cells for the clonogenic assay. Error bars are SEM of three independent experiments for untreated cells and for the four different genotypes used in the clonogenic assays. Other Supporting Information Files Dataset S1 (PDF) Moreno et al. www.pnas.org/cgi/content/short/1603252113 6 of 6 # Clean memory ------------------------rm(list = ls()) # Load libraries ------------------------library("pastecs") library("Rsamtools") # Load replication origin distribution for Hela cells ------------------------# # Data are available at http://pbil.univ-lyon1.fr/members/fpicard/research.html # # Set working directory setwd("~") # Load and process data new.ori.Hela.u.2000.core.info.hg19 <- read.delim("new-ori-Hela-u-2000-core-infohg19.bed", stringsAsFactors=FALSE) MIdPos <- rowMeans(new.ori.Hela.u.2000.core.info.hg19[,2:3]) ChrData <- split(MIdPos, new.ori.Hela.u.2000.core.info.hg19[,1]) for(i in 1: length(ChrData)){ ChrData[[i]] <- sort(ChrData[[i]]) } ChrDist <- ChrData for(i in 1: length(ChrDist)){ ChrDist[[i]] <- ChrDist[[i]][-1] - ChrDist[[i]][-length(ChrDist[[i]])] } get_distances <- function(Positions){ return(Positions[-1] - Positions[-length(Positions)]) } ChrDistFil <- ChrDist for(i in 1: length(ChrDistFil)){ ChrDistFil[[i]] <- ChrDistFil[[i]][-which.max(ChrDistFil[[i]])] } Prob.noStall <- function(N,G,S){ return( exp( (-G/S)*log(2) + sum( log(1 + (N/S)*log(2)) ) ) ) } ComputeStallProb <- function(x){ # Compute distances if(length(x)<2){ return(NA) } x <- sort(unique(x)) Dists <- x[-1] - x[-length(x)] # mean(Dists) return(1-Prob.noStall(Dists, sum(Dists), 10^7)) } ComputeMaxGap <- function(x){ # Compute distances if(length(x)<2){ return(NA) } x <- sort(unique(x)) Dists <- x[-1] - x[-length(x)] return(max(Dists)) } # Load alignments for 53BP1 and IgG and compare with RO distribution --------------# # Processed data are saved to ReadVSRO.RData # AllRep <- NULL AllIgG <- NULL All53 <- NULL ChrCovIgG <- NULL ChrCov53 <- NULL ClustersVect <- rep(1, length(names(ChrData))) names(ClustersVect) <- names(ChrData) for(ChrToCheck in names(ChrData)){ print(paste("Loading", Sys.time())) print(ChrToCheck) data.gr<-GRanges(seqnames=c(ChrToCheck), ranges = IRanges(start=0, end=536870912)) which <- data.gr params<-ScanBamParam(which=which, what = c("pos")) Bam53BP1Pos <- scanBam("Sorted53BP1.bam", param = params) Bam53BP1Pos <- unlist(Bam53BP1Pos) print(paste("Loaded 53BP1", Sys.time())) params<-ScanBamParam(which=which, what = c("pos")) BamIgGPos <- scanBam("IgGFil1357.sorted.bam", param = params) BamIgGPos <- unlist(BamIgGPos) ---------- print(paste("Loaded IgG", Sys.time())) # Get the bin limits (RO and smallest/largest value for IgG and 53BP1) ExtRoPos <- c(0, ChrData[[ChrToCheck]], max(max(max(BamIgGPos, Bam53BP1Pos)), max(ChrData[[ChrToCheck]]))) BinnedIgG <- hist(BamIgGPos, breaks = ExtRoPos, plot = FALSE)$count Binned53 <- hist(Bam53BP1Pos, breaks = ExtRoPos, plot = FALSE)$count # Filter regions beyond the largest RO and before the smallest RO FilRep <- get_distances(ExtRoPos) FilBinIgG <- BinnedIgG FilBin53 <- Binned53 RemIds <- c(1, length(FilRep)) FilRep <- FilRep[-RemIds] FilBinIgG <- FilBinIgG[-RemIds] FilBin53 <- FilBin53[-RemIds] # Filter the centromeric area, determined by the largest replicon RemIds <- which.max(FilRep) FilRep <- FilRep[-RemIds] FilBinIgG <- FilBinIgG[-RemIds] FilBin53 <- FilBin53[-RemIds] # Save the information on a genome-wide matrix AllRep <- c(AllRep, FilRep) AllIgG <- c(AllIgG, FilBinIgG) All53 <- c(All53, FilBin53) rm(BamIgGPos) rm(Bam53BP1Pos) } # All53 contains the read count for 53BP1 # AllIgG contains the read count for IgG # AllRep contains the replicon length save(All53, AllIgG, AllRep, file = "ReadVSRO.RData") # Load alignments for 53BP1 and IgG and perform enrichment analysis -------------# # Processed data are saved to Enrich.RData # ConIntList <- list() ----------- ConIntVect <- NULL OE53BP1Bins <- NULL AllRep <- NULL AllBottomsInROs <- NULL AllTopsInROs <- NULL RandTopsInROs <- NULL Ro53OEStat <- NULL library("pastecs") library("Rsamtools") ClustersVect <- rep(1, length(names(ChrData))) names(ClustersVect) <- names(ChrData) SummaryMat <- NULL SummaryAreas <- NULL # Look at each chromosome independently # (this simplify the memory usage and account for potential ploidity problems) # for(ChrToCheck in names(ChrData)){ # Load data print(paste("Loading", Sys.time())) print(ChrToCheck) data.gr<-GRanges(seqnames=c(ChrToCheck), ranges = IRanges(start=0, end=536870912)) which <- data.gr params<-ScanBamParam(which=which, what = c("pos")) Bam53BP1Pos <- scanBam("Sorted53BP1.bam", param = params) Bam53BP1Pos <- unlist(Bam53BP1Pos) print(paste("Loaded 53BP1", Sys.time())) params<-ScanBamParam(which=which, what = c("pos")) BamIgGPos <- scanBam("IgGFil1357.sorted.bam", param = params) BamIgGPos <- unlist(BamIgGPos) print(paste("Loaded IgG", Sys.time())) # Divide the data into blocks of 1kbp print(paste("Computing histograms", Sys.time())) BrkPos <- seq(from=0, to=max(BamIgGPos)+1e3, by=1e3) HIgG <- hist(BamIgGPos, breaks = BrkPos, plot=FALSE) H53BP1 <- hist(Bam53BP1Pos, breaks = BrkPos, plot=FALSE) # For each bin, compute the normalised signal to noise ratio SigRat <- (H53BP1$counts/sum(H53BP1$counts))/(HIgG$counts/sum(HIgG$counts)) # Look at the distribution of signal to noise # mean(SigRat[!is.infinite(SigRat)], na.rm=TRUE) # # Approximate the distribution with a log-normal LogVal <- log(SigRat[!is.infinite(SigRat) & !is.nan(SigRat) & !is.na(SigRat)]) LogSd <- sd(LogVal[!is.infinite(LogVal)], na.rm=TRUE) # Obtain the pval for overexpression (OEPv) and underexpression (UEPv) OEPv <- plnorm(q = SigRat, meanlog = 0, sdlog = LogSd, lower.tail = FALSE) UEPv <- plnorm(q = SigRat, meanlog = 0, sdlog = LogSd, lower.tail = TRUE) # Look at the number of ROs for each bin HROs <- hist(ChrData[[ChrToCheck]], breaks = BrkPos, plot=FALSE) # How does the strength of overexpression compare to the presence of RO? # boxplot(OEPv ~ HROs$counts) # TB <- table(OEPv<1e-3, HROs$counts) # barplot(100*(t(TB)/colSums(TB))[,2], main = paste("53BP1 enrichment and RO", ChrToCheck), xlab="Number of ROs", # ylab="Percentage of genomic areas enriched in 53BP1") # # # # the # Record the information collected Code is rather slow. Parallelization over different chromosomes could be implemented to speed up analysis CombMat <- NULL for(ROdIdx in 2: length(ChrData[[ChrToCheck]])){ if(ROdIdx %% 100 ==0){ print(paste(ROdIdx-1, "of", length(ChrData[[ChrToCheck]]), "done!")) } SubIdx <- which(BrkPos < ChrData[[ChrToCheck]][ROdIdx] & BrkPos > ChrData[[ChrToCheck]][ROdIdx-1]) tOEpV <- OEPv[SubIdx] tUEpV <- UEPv[SubIdx] tOEpV <- tOEpV[!is.na(tOEpV)] tUEpV <- tUEpV[!is.na(tUEpV)] tIgGCt <- HIgG$counts[SubIdx] t53BCt <- H53BP1$counts[SubIdx] CombMat <- rbind(CombMat, c(min(tOEpV), sum(tOEpV<1e-3), min(tUEpV), sum(tUEpV<1e-3), ChrData[[ChrToCheck]][ROdIdx], ChrData[[ChrToCheck]][ROdIdx-1], sum(tIgGCt==0), sum(t53BCt==0), sum(tIgGCt==0 & t53BCt==0), length(SubIdx))) } SummaryMat <- rbind(SummaryMat, cbind(CombMat, rep(ChrToCheck, nrow(CombMat)))) SummaryAreas <- rbind(SummaryAreas, cbind(OEPv, UEPv, BrkPos[-1], BrkPos[length(BrkPos)], rep(ChrToCheck, length(OEPv)), H53BP1$counts, HIgG$counts)) rm(BamIgGPos) rm(Bam53BP1Pos) } colnames(SummaryMat) <- c("Min PV (OE)", "Nr PV<1e3 (OE)", "Min PV (UE)", "Nr PV<1e3 (UE)", "Replicon End", "Replicon Start", "0IgG count", "053BP1 count", "0IgG&53BP1 count", "Area Count", "Chromosome") # SummaryMat contains the summary statistics for each replicon # SummaryAreas contains the summary statistics for each 1kb slice save(SummaryMat, SummaryAreas, file = "Enrich.RData")
© Copyright 2026 Paperzz