#PAGE_PARAMS# #ADS_HEAD_SCRIPTS# #MICRODATA#

Pan-cancer analysis of somatic mutations and epigenetic alterations in insulated neighbourhood boundaries


Authors: Pietro Pinoli aff001;  Eirini Stamoulakatou aff001;  An-Phi Nguyen aff002;  María Rodríguez Martínez aff002;  Stefano Ceri aff001
Authors place of work: DEIB, Politecnico di Milano, Milano, Italy aff001;  IBM Research Zürich, Rüschlikon, Switzerland aff002
Published in the journal: PLoS ONE 15(1)
Category: Research Article
doi: https://doi.org/10.1371/journal.pone.0227180

Summary

Recent evidence shows that the disruption of constitutive insulated neighbourhoods might lead to oncogene dysregulation. We present here a systematic pan-cancer characterisation of the associations between constitutive boundaries and genome alterations in cancer. Specifically, we investigate the enrichment of somatic mutation, abnormal methylation, and copy number alteration events in the proximity of CTCF bindings overlapping with topological boundaries (junctions) in 26 cancer types. Focusing on CTCF motifs that are both in-boundary (overlapping with junctions) and active (overlapping with peaks of CTCF expression), we find a significant enrichment of somatic mutations in several cancer types. Furthermore, mutated junctions are significantly conserved across cancer types, and we also observe a positive selection of transversions rather than transitions in many cancer types. We also analyzed the mutational signature found on the different classes of CTCF motifs, finding some signatures (such as SBS26) to have a higher weight within in-boundary than off-bounday motifs. Regarding methylation, we find a significant number of over-methylated active in-boundary CTCF motifs in several cancer types; similarly to somatic-mutated junctions, they also have a significant conservation across cancer types. Finally, in several cancer types we observe that copy number alterations tend to overlap with active junctions more often than in matched normal samples. While several articles have recently reported a mutational enrichment at CTCF binding sites for specific cancer types, our analysis is pan-cancer and investigates abnormal methylation and copy number alterations in addition to somatic mutations. Our method is fully replicable and suggests several follow-up tumour-specific analyses.

Keywords:

DNA methylation – Methylation – Point mutation – Substitution mutation – Adenocarcinomas – Cancer genomics – breast cancer – Somatic mutation

Introduction

The human genome is organized hierarchically into discrete topologically associated domains (TADs), sub-megabase segments that tend to self-associate and are relatively insulated from neighboring domains [1, 2]. TAD boundaries, revealed by high-throughput chromatin conformation capture (HiC) techniques [3], are enriched for the insulator binding protein CTCF, which plays a central role in multiple complex genomic processes, including transcription [4, 5], imprinting [6, 7], long-range chromatin interactions and subnuclear localization [8, 9].

TADs are hierarchical in nature [10, 11] and usually contain smaller sub-TADs and individual loops that function as insulated neighborhoods [12, 13]. While TADs are largely invariant features of genome organization and are mostly conserved across cell types and even across species, sub-TADs, loops, and insulated neighborhoods appear to differ, at least partially, between different cell lineages [14].

Loss of CTCF-mediated insulation has been identified as a potential reason for domain disruption and spreading [15] and has been connected with multiple malignancies, such as cancer [1618], intellectual disability [19, 20] and developmental disorders [21]. In cancer, recent studies have shown that TAD disruption is often found in cancer cells and contributes to oncogenesis through several mechanisms. For instance, mutated or epigenetic alterations of a TAD boundary might lead to the fusion of two adjacent TADs [13, 18, 22, 23]. Hypermethylation of CTCF binding sites has been observed to lead to loss of insulation between topological domains and consequent aberrant gene activation in gliomas [22]. Microdeletions that eliminate the boundary sites of insulated neighborhoods containing prominent acute lymphoblastic leukemia proto-oncogenes have also been reported [23]. The same study also identified an enrichment in boundary CTCF site mutations in the genomes of esophageal and liver carcinoma [23]. Furthermore, genomic rearrangements with breakpoints within TADs can lead to breakage or fusion of TADs that might result in oncogene activation [2427], as observed in prostate cancer, where chromosomal deletions lead to the establishment of new domain boundaries and the rearrangement of gene interactions [27]. Hotspots of mutations within CTCF motifs have been independently observed in melanoma [28, 29] related to UV exposure, and in gastrointestinal cancers [30]. In parallel, statistical analyses have showed an enrichment of cancer-associated genes in proximity of mutated boundaries [28] and highlighted specific dysregulated genes [30]. Furthermore, 21 insulators (both cancer-specific and pan-cancer) have been suggested as cancer drivers in recent work [31]. Finally, a recent review [32] summarizes various findings about the increase of somatic mutations at binding sites of TFs including CTCF, and suggests that the mutation frequency increase is shaped by the complex interplay between DNA damage and repair levels. These studies have motivated our work to systematically characterize the dysregulation of CTCF binding across different cancer types and through different mechanisms of dysregulation.

Materials and methods

We first outline our approach to detect insulated neighbourhoods and CTCF binding motifs. To investigate mutations that can potentially affect TADs, we analyzed public datasets available from the International Cancer Genome Consortium (ICGC [33]) and The Cancer Genome Atlas (TCGA [34]), leveraging data from the ENCODE Consortium [35]. Table A in S1 File lists the cancer types we have considered. For the systematic exploration of large datasets we used a novel approach for genomic computing [36].

Regarding the definition of insulated neighbourhoods, we considered three datasets that have mapped insulated neighbourhoods using Chromatin Interaction Analysis by Paired-End Tag (ChIA-PET):

  • A breast cancer study that characterised CTCF-mediated chromatin interactions using ChIA-PET experiments on the MCF7 breast cancer cell line [37].

  • A study on the H1-hESC human embryonic stem cell line that inferred neighbourhoods using the SMC1 protein—cohesin subunit as the target of the ChIA-PET experiment [13].

  • A set of constitutive neighbourhoods provided by Hnisz and colleagues [23]. This set was obtained as the intersection of two RAD21, a subunit of the cohesin complex, ChIA-PET experiments on GM12878 and K562 cell lines, and a SMC1 ChIA-PET experiment on a Jurkat cell line. Only neighbourhoods confirmed in at least 2 cell lines were kept.

We used Biostrings [38] to identify the CTCF binding sites. Specifically, we used the 19bp long Jaspar [39] MA0139.1 motif (Fig A in S1 File) in the HG19 assembly, finding 107,230 (104,459 excluding chromosome Y) positions matching the motif with a score of at least 80%. Next, we classified motifs as in-boundary/off-boundary and active/inactive according to whether they were located respectively on a previously mapped insulated neighbourhood and on a peak of CTCF ChIP-seq data, as characterised by the Bernstein Laboratory and published on ENCODE [35] (ENCSR000AMF for H1-hESC, ENCSR000AKB for GM12878 and ENCSR560BUE for MCF7). Fig 1 illustrates the different types of CTCF motifs according to the above classification, while the Table 1 reports the number of boundaries and motifs in each category for each ChIA-PET dataset and ChIP-seq cell line.

Fig. 1. Classification of CTCF motifs, within a short portion of chromosome 11.
Classification of CTCF motifs, within a short portion of chromosome 11.
Motifs are classified as active (confirmed by a CTCF ChIP-seq peak) and inactive (not confirmed). Active motifs are further divided into in-boundary and off-boundary according to whether they overlap a boundary, as defined by a ChIA-PET experiment.
Tab. 1. Summary statistics of the number of boundaries and motifs.
Summary statistics of the number of boundaries and motifs.
The first two experiments performed ChIA-PET experiments on an MCF7 breast cancer cell line [37] and an H1-hESC human embryonic stem cell line [13]. The Hnisz dataset is defined as the intersection of three ChIA-PET experiments targeting respectively RAD21 on cell lines GM128178 and K562, and SMC1 on a Jurkat cell line, with only neighbourhoods predicted in at least 2 out of 3 ChIA-PET experiments being considered.

Point mutation analysis

Enrichment of somatic mutations on active, in-boundary CTCF motifs

While mutations in CTCF binding sites have been reported to occur frequently in several cancers, the significance of these findings across the broad spectrum of cancer types has not yet been evaluated. In order to test the enrichment of mutations on CTCF sites, we analyzed Whole Genome Sequencing (WGS) data publicly available at ICGC [33]. Specifically, we selected 14 cancer types for which at least 200,000 mutations across all patients were reported—this threshold was imposed in order to ensure adequate statistics. Opposed to earlier approaches that focused on identifying motif-disrupting mutations [28], we opt for a descriptive unbiased approach and choose to summarise all mutations identified in the 19bp long CTCF motif. Table B in S1 File summarizes the list of analyzed datasets and also reports the number of samples and total number of somatic mutations per cancer type.

Using GMQL [36], an in-house developed query language for genomic analyses, we observed an accumulation of somatic mutations on active in-boundary CTCF motifs. Fig 2, left subplot, shows the accumulation of mutations in esophageal adenocarcinoma (ESAD), one of the tumour types for which this phenomenon was more evident. As is evident in the figure, a peak in the mutation frequency distribution is clearly visible over the 19 bp CTCF active in-boundary motifs, while the mutation frequency over flanking regions is much smaller. In comparison, no such enrichment was found on active off-boundary motifs (Fig 2, right subplot). We find similar patterns in almost all other examined cancer types (see Fig B in S1 File for boundaries extracted from the ChIA-PET on the MCF7 cell line, Fig C in S1 File for boundaries from the hESC cell line, and Fig D in S1 File for the Hnisz dataset). This suggests that active, in-boundary CTCF motifs tend to acquire a larger number of mutations than expected by chance.

Fig. 2. Somatic mutation frequency in a range of (-1kbp, +1kbp) surrounding (a) an active in-boundary and (b) an active off-boundary CTCF motifs in eEsophageal adenocarcinoma (ESAD).
Somatic mutation frequency in a range of (-1kbp, +1kbp) surrounding (a) an active in-boundary and (b) an active off-boundary CTCF motifs in eEsophageal adenocarcinoma (ESAD).
A central peak overlapping the 19bp CTCF motif is evident in (a).

To estimate the statistical significance of the observed enrichment, we performed a per-patient analysis. Namely, for each patient we counted the number of mutations on in-boundary versus off-boundary motifs; we discarded patients with less than five mutations in at least on of the categories. We then created a two column contingency table, where the two columns correspond to in-boundary and off-boundary motifs, respectively, while each row corresponds to one of the considered patients. Finally, we run a chi-square test to test whether the distributions on the two columns are identical or distinct. The p-values of the chi-square test are reported in Table 2. We found the enrichment of somatic mutations in active in-boundary CTCF motifs to be significant in several cancer types and across the three datasets of insulated neighbourhoods.

Tab. 2. Significance of the enrichment of somatic mutations in active, in-boundary motifs for all studied cancer types and neighbourhoods datasets.
Significance of the enrichment of somatic mutations in active, in-boundary motifs for all studied cancer types and neighbourhoods datasets.
The p-values are computed using a chi-square test. Within parenthesis are the number of patient on which each test have ben run. Cancer names are defined in Table A in S1 File, according to ICGC nomenclature.

Interestingly, in five cancer types, namely Esophageal Adenocarcinoma (ESAD), Liver Cancer (LIRI), Melanoma (MELA), Skin Adenocarcinoma (SKCA) and Malignant Lymphoma (MALY), the enrichment is significant in all the datasets of insulated neighbourhoods; in breast cancer, gastric cancer and colorectal cancer it is significant in MCF7 and Hnisz datasets. The Hnisz dataset is associated with the most significant results in most cancer types. We speculate that, since these neighbourhoods were obtained by intersecting different cell lines, they are more likely to be constitutive for the cell and, therefore, be present in most of the tissues.

Mutation enrichment in gene promoters and exons

Neighbourhood boundaries tend to fall in promoter regions where chromatin is open to facilitate transcription, especially in active genes, [40]. Hence, it could be argued that the enrichment of mutations in active in-boundary motifs is due to the proximity of these motifs to promoters, rather than to a cancer specific mechanism. To rule out this possibility, we divided the active CTCF motifs into two classes: motifs overlapping gene promoters and motifs outside promoter regions. Here, promoters were defined as the genomic regions spanning -2kb to +2kb from the transcription start site of a RefSeq annotated gene. We then computed the accumulation of mutations in these two classes of active motifs.

Fig 3(a) and 3(b) shows the results of this analysis in Esophageal Adenocarcinoma (active CTCF motifs taken from the hESC dataset). A slightly higher frequency of somatic mutations is observable on active motifs outside promoters, ruling out the possibility that the enrichment in mutations is driven by the presence of a promoter. We performed similar analyses on all available cancer types (Table B in S1 File) and quantified the enrichment using the per-patient test, as described above. Empirical p-values are reported in the Table C in S1 File. In no cancer type did we observe an enrichment of mutations in promoters, supporting the hypothesis that the accumulation of mutations in active CTCF motifs is not due to the overlap of promoters, and hence, not due to the open state of the chromatin.

Fig. 3. Distribution of the number of mutations on active in-boundary CTCF motifs within and outside promoters and exons in Esophageal Adenocarcinoma using the hESC cell line.
Distribution of the number of mutations on active in-boundary CTCF motifs within and outside promoters and exons in Esophageal Adenocarcinoma using the hESC cell line.
A higher frequency of mutations is found outside promoters and exons, ruling out the possibility that the enrichment of somatic mutations in active CTCF motifs is due to their higher probability of having open chromatin.

We also tested the difference of mutation enrichment for the boundaries colocated with the exons of protein coding genes. Fig 3(c) and 3(d) shows the results of this analysis in Esophageal Adenocarcinoma (active CTCF motifs taken from the hESC dataset). Again, a sligthly higher frequency of somatic mutations is observed in CTCF motifs outside exons, ruling out the hypothesis that such enrichment is due to the colocation with open chromatin regions.

We also tried to test if mutations tend to cluster on active enhancers, defined as the intersection of H3K4me1 and H3K27Ac histone modifications, but for such regions we found a very small intersection with active in-boundary CTCF motifs and it was not possible to perform a significative statistical test.

Genes close to frequently mutated boundaries

We also look for genes close to mutated in-boundary motifs. In Fig 4, we report the genes having a TSS within 180 kbp from a mutated active in-boundary motif for melanoma (180 kpb is the average size of an insulated neighbourhood). The plot shows that several oncogenes and proto-oncogenes are in the near neighbourhood of some mutated junctions. Notably, in the list of oncogenes we find TGFB1, whose up-regulation has been recently associated with disruption of CTCF binding motif due to somatic mutations in the melanoma A375 cell line [31]. We also find PDGFRA, an oncogene associated with an array of clinically significant neoplasms, also reported by Flavahan et al. [22] as dysregulated due to its proximity to a mutated boundary in gliomas. Circular plots for the other four cancer types with a significant mutational enrichment in all cell lines can be found in the Fig G in S1 File.

Fig. 4. Genes close to mutated CTCF in-boundary motifs in melanoma.
Genes close to mutated CTCF in-boundary motifs in melanoma.
The height of the bars represents the number of samples in which the gene is close to a mutated in-boundary active CTCF motif. We considered only genes with top mutations whose TSS is within 180 kbp from the center of the mutated motif.

Analysis of the most frequent types of mutations

Next, we investigated whether the somatic mutations in active in-boundary motifs preferentially belong to a certain class of mutations, which could point to a specific tumourigenic mechanism. Hence, we split somatic point mutations into transitions (A↔G and C↔T) and transversions (A↔C, G↔T and A↔T, C↔G) and counted how many mutations fall into each category for each tumour. Table 3 reports the counts of transition and transversion mutations for active in-boundary and active off-boundary motifs for all considered cancers. We used a chi-square test on the contingency table associated with each cancer to assess whether the observed frequencies were homogeneously distributed between transitions and transversions. For seven cancer types—Esophageal Adenocarcinoma, Pancreatic Cancer Endocrine Neoplasm, Skin Cancer, Liver Cancer, Skin Adenocarcinoma, Malignant Lymphoma and Colorectal Cancer—one type of mutation occurred more frequently than expected by chance. Overall, most cancers acquire a similar or higher number of transversions than transitions, with the notable exception of skin cancers, where transitions are more frequent.

Tab. 3. Counts of transition and transversion mutations in in-boundary and off-boundary active motifs using the hESC dataset to define insulated neighbourhoods.
Counts of transition and transversion mutations in in-boundary and off-boundary active motifs using the hESC dataset to define insulated neighbourhoods.
P-values are computed using a chi-square test on the contingency table associated with each tumour.

In general, the ratio of transitions versus transversions depends on the type of DNA element. In particular, the ratio in binding sites and regulatory regions is not well characterized, although we note recent results in germline mutations [41] that indicate that transversions have a slightly larger effect on DNA shape and transcription factor binding than transitions. Assuming that similar results can be generalised to somatic alterations, this finding suggests that transversions on CTCF motifs might have a stronger effect than transitions, which could hint towards a positive selection of transversions rather than transitions in insulated neighborhood boundaries in tumour cells, as these alterations are more likely to disrupt boundaries than transitions.

Motivated by the non-random occurrences of transitions and tranversions in certain cancer types, we further investigated the mutation patterns most frequently found in the active CTCF motifs and flanking areas (19 bp ± 50 bp). We considered the five tumour types for which we have the highest number of mutations (see Table B in S1 File)—and studied their mutation patterns. Results are summarized in Fig 5.

Fig. 5. Mutations in active in-boundary CTCF motifs and flanking regions (19 bp ±50 bp).
Mutations in active in-boundary CTCF motifs and flanking regions (19 bp ±50 bp).
Clear peaks are observed within CTCF motifs in all cancers types. Additional peaks are observed in the flanking regions of the CTCF motifs in Liver Cancer, Esophageal Adenocarcinoma and Breast Cancer tumour types (sub-plots A,B,C).

Esophageal Adenocarcinoma (ESAD), Liver Cancer (LIRI) and Breast Cancer (BRCA) were associated with the largest enrichment of in-boundary mutations in all 3 considered ChIA-PET experiments (see Table 2). In Esophageal Adenocarcinoma and Liver Cancer, we note two very distinguishable peaks at positions 9 and 11 (the two positions flanking the central position of the motif, located at x—coordinate 10, see Fig 5A and 5B). Similar although less prominent peaks were also found in Breast Cancer (see Fig 5C). We further note significant peaks of mutations just outside the CTCF binding site, particularly marked in Esophageal Adenocarcinoma and Breast Cancer. We speculate that these mutations might correspond to the binding sites of the subunits of the largest cohesin complex that is assembled to define a boundary.

Point mutations in skin cancer (MELA) and skin adenocarcinoma (SKCA) are shown in Fig 5D and 5E. We note a prevalence of G→A mutations in positions 6-7 and C→T mutations in positions 13-14, which are symmetric with respect to the motif center at basis 10. These mutations are consistent with the observed enrichment of C→T and CC→TT mutations in ultraviolet exposure-driven melanoma tumours [42]. In these cases, no additional peaks in the flanking regions of the CTCF motif are observed.

Analysis of mutational signatures

We next investigated whether the active in-boundary motifs on which we observed the mutation enrichment are affected by specific mutational processes. In order to do so, we computed the exposure (or weight contribution) of the Alexandrov mutation signatures [43] on in-boundary motifs and on the whole genome. An Alexandrov mutational signature describes a mutational process as a vector of mutation probabilities for all the 96 possible single nucleotide variants within their context of adjacent bases, i.e., each of these 96 probabilities corresponds to a triplet whose central base is mutated (e.g. G[G → A]A). We considered the catalogue of single base substitution mutational signatures provider by COSMIC and computed the relative exposure of each of them in the considered tumors; this operation, known as signature refitting, was performed by means of the Bioconductor R package decompTumor2Sig [44]. The set of mutational signatures found in a tumor (or in a particular class of genome regions) depends on which mutational processes were active and the related exposure indicates how strongly such process contributes to the mutational load in the tumor (or within the specific regions). In our analysis we found that signature SBS26, which is associated with defective DNA mismatch repair, is particularly prominent in the active in-boundary motifs in Esophageal Adenocarcinoma (ESAD Fig 6), Liver Cancer (LIRI, Fig I in S1 File) and Breast Cancer (BRCA, Fig H in S1 File), although the exposure of the same signature is not relevant in the whole genome of the same tumors. For what it concerns Skin Sancer (MELA) and Skin Adenocarcinoma (SKCA) we found that within in-boundary motifs the signature SBS7b has the highest exposure and not signature SBS7a as in the whole genome (Figs J and K in S1 File). Both signatures are associated with exposure to ultraviolet light and their most frequent mutation type is T[C → T]C (i.e., G[G→A]A) whose corresponding triplet TCC—GGA is present in a significant subset of CTCF binding sites (e.g., positions 3-5 and 13-15).

Fig. 6. The average contributions or weights (i.e., exposures) of mutational signatures for the Esophageal Adenocarcinoma dataset.
The average contributions or weights (i.e., exposures) of mutational signatures for the Esophageal Adenocarcinoma dataset.
Signature refitting was done based on mutations falling a) in-boundaries motifs, b)in the whole genome, and and then c) on the difference between the two.

Overlap of mutated boundaries across cancer types

We also found a significant overlap of frequently mutated boundaries across tumour types in the same five tumour types (Melanoma, Esophageal Adenocarcinoma, Skin Adenocarcinoma, Liver Cancer and Breast Cancer) associated with the highest number of mutations, see Fig F in S1 File. We tested the significance of the pair-wise overlap of junctions across cancer types using the hypergeometric test. Results are shown in Table G in S1 File. All pair-wise comparisons resulted in very significant p-values, confirming that mutations in boundaries do not happen by random chance, hinting to a concerted oncogenic mechanism to dysregulate key cancer driver genes.

Methylation analysis

The CTCF consensus binding sequence contains a CpG dinucleotide and can, therefore, be methylated—20% of CTCF binding sites are reported to be methylated on average [45]. Furthermore, it has been reported that up to 41% of variable CTCF binding is linked to differential DNA methylation, concentrated at the two CpG dinucleotides on the CTCF motif [46]. Increased methylation leading to disruption of CTCF binding patterns has also been observed in immortalized cell lines [46], suggesting that abnormal methylation of CTFC motifs might be a mechanism of cancer gene dysregulation. Hence, we decided to investigate if hyper-methylation occurs on in-boudary motifs, considering for this analysis 12 cancer types for which at least 20 matched tumoral and normal methylation samples are available in TCGA [34]. Table D in S1 File lists the available datasets. Active in boundary CTCF motifs are preferentially located on coding and regulatory elements, which indirectly reflects the distribution of methylation probes, preferentially located in coding and regulatory elements (see Table E in S1 File).

To investigate the effect of methylation on CTCF DNA binding, we first looked at the methylation patterns found on CTCF motifs. Fig 7 shows a representative example of the distribution of the delta of methylation intensities (tumor beta—normal beta) on CTCF in-boundary motifs on a cohort of Breast Cancer patients. As previously reported, methylation peaks are found on the CpG dinucleotides. Evident in the figure, active in-boundary motifs are frequently over-methylated in tumour samples compared to normal samples.

Fig. 7. A representative distribution of delta methylation values on CTFC in-boundary motifs for Breast Cancer.
A representative distribution of delta methylation values on CTFC in-boundary motifs for Breast Cancer.
Hypo-methylated refers to probes with a delta beta value lower than -0.2, hyper-methylated to probes with a delta beta value greater than 0.2 and normal-methylated to all the others. The x-axis reports the CTCF motif. Two distinguishable methylation peaks are observed at positions 4 and 15, corresponding to the presence of 2 pairs of CpG dinucleotides.

In several cancer types we observed hypermethylation on in-boundary motifs. Specifically, the phenomenon is significant in Breast Cancer (BRCA), Head and Neck Squamous Cell Carcinoma (HNSC), Kidney Renal Papillary Cell Carcinoma (KIRP), Uterine Corpus Endometrial Carcinoma (UCEC), Lung Squamous Cell Carcinoma (LUSC) and Skin Cutaneous Melanoma (SKCM). In order to validate that such phenomenon is specific of in-boundary junctions and not due to an overall genome over-methylation of these tumors, we designed a bootstrap test. Specifically, we calculated Δβ = (βtumorβnormal) for all the probes. We then sampled an equal number of probes from in-boundary and off-boundary motifs and calculated the mean of these probes. We repeated the sampling 10,000 times and counted how many times the mean Δβ within in-boundary is greater than the one within off-boundary motifs. The empirical p-values are 0 in Breast Cancer (BRCA), Head and Neck Squamous Cell Carcinoma (HNSC), Uterine Corpus Endometrial Carcinoma (UCEC), Lung Squamous Cell Carcinoma (LUSC), Skin Cutaneous Melanoma (SKCM), Bladder Urothelial Cancer (BLCA), Liver Hepatocellular Carcinoma (LIHC) and Thyroid Cancer (THCA), indicating that in these cancer types the Δβ is significantly larger within in-boundary than off-boundary motifs, i.e., these motifs are more methylated than off-boundary ones. Conversely, in the cases of Kidney Renal Papillary Cell Carcinoma (KIRP), Kidney Renal Clear Cell Carcinoma (KIRC), Lung Adenocarcinoma (LUAD) and Prostate Adenocarcinoma (PRAD), we obtain a p-value equal to 1, implying that in these cancers, the in-boundary motifs are significantly less methylated than the off-boundary motifs.

To evaluate whether over-methylated motifs are conserved across different cancer types, we considered the five most over-methylated tumour types according to the boundaries defined for the hESC cell line (see Table 4, Left subplot): Breast Cancer, Uterine Corpus Endometrial Carcinoma, Head and Neck Squamous Cell Carcinoma, Kidney Renal Papillary Cell Carcinoma and Lung Squamous Cell Carcinoma. For each cancer type, we identified the set of active in-boundary motifs with the strongest methylation difference between tumour and normal, i.e. abs(tumour beta—normal beta)> 0.2, and we computed the overlap across all 5 cancer types. Although the sets of differentially methylated active in-boundary motifs are different for each tumour, they have strongly significant overlaps, as shown in Fig L in S1 File. Significance is computed by mean of a hypergeometric test, as previously discussed. Results are reported in Table H in S1 File.

Tab. 4.
a) Empirical p-values indicating the significance of the over-methylation of active, in-boundary CTCF motifs compared to off-boundary random positions in the genome, computed by means of a permutation test. b) p-values indicating the enrichment of CNA mutations on active in-boundary motifs with respect to off-boundary motis in tumour samples. P-values have been computed using the chi-square test on a P × 2 contingency table, where P is the number of patients. In brackets we report the median size of the CNA-mutated regions for the given cancer.

CNA analysis

Copy number alterations (CNA) are a main tumorigenic driver in many cancer types [47]. In the context of neighbourhood dysregulation, recent work has reported tandem duplications intersecting with a TAD that led to de novo 3D contact domain formation. The domain rearrangement affected a lineage-specific super-enhancer, resulting in high-level gene activation [48]. The same work revealed that TAD boundary intersecting deletions are associated with IRS4 dysregulation (a gene often over-expressed in different cancer types) in sarcoma and squamous cancers. Motivated by these findings, we investigate here whether copy number alterations may contribute to cancer phenotypes by disrupting topologically associated domain boundaries.

To evaluate the effect of CNAs on a neighbourhood’s dysregulation, we analysed CNA data for 11 different types of cancer provided by ICGC. We considered a genomic region mutated, i.e. either deleted or amplified, if it has an (absolute) mean value greater than 0.2 [49]. A major difference with respect to somatic alterations and methylation changes is that CNAs are not local alterations, but can span large areas of a chromosome. A large CNA that affects a sizeable portion of a chromosome is likely to induce gene dysregulation by a combination of mechanisms, including oncogene and/or tumor suppression dysregulation, as well as insulated neighbourhood dysregulation. Such multi-factorial dysregulatory effect is less likely to be present in small CNAs. To quantify this effect, we stratified CNAs by size splitting them in 4 groups according to the 25%, 50% (median), 75% and 100% percentiles of the size distribution of each cancer type. Note that the 100% percentile includes all CNAs available for analysis for each cancer.

Copy number alterations of active in-boundary vs off-boundary motifs

To test the potential association between cancer and copy-number mutation affecting insulated neighbourhoods, we compared the distribution of CNA-mutated regions overlapping in-boundary motifs with the distribution of regions overlapping off-boundary motifs in tumour samples. In practice, for each cancer, we built a P × 2 contingency table where each row represents a patient. We reported in Table 4b the p-values obtained by performing a chi-square test on such contingency tables, using the median of the CNA size distribution as a cutoff for CNA size. Results for all 4 cutoffs can be found in Tables I and J in S1 File. Interestingly, for all three the neighbourhood datasets and for all size cutoffs (with the exception of the 25th percentile), we obtained significant results for breast cancer, lung adenocarcinoma, and ovarian cancer. The non-significance of the results for small cutoffs may be explained by the number of patients considered (reported in brackets in Tables I and J in S1 File). Indeed, for the smaller cutoffs, we retained a small number of patients suggesting that the high p-values may be due to low statistical power.

In recent work [50], principal component analysis (PCA) on CNA data was performed on various cancer types, and Ovarian, Lung and Breast (basal subtype) cancers were found to have a similar signature characterised by a higher degree of copy number alterations compared to other types of cancer. Our results are in agreement with these findings: we observe that ovarian, lung and breast cancers have an enrichment of CNAs overlapping active, in-boundary motifs, which might potentially lead to the dysregulation of key cancer drivers. Note that we obtained our results by focusing on CNAs with an absolute segment mean greater than 0.2 and by considering only active motifs, while [50] used gene mean CNA values, which do not in general co-locate with motifs. Remarkably, we find similar patterns as in [50] while looking at different positions in the genome (active, in-boundary motifs versus coding genes). This strongly suggests that the dysregulation of neighbourhood boundaries through the accumulation of CNAs is not a fortuitous occurrence, but could instead be a driver mechanism in these cancer types.

Results and discussion

Cancer arises as a result of heterogeneous molecular mechanisms that lead to the silencing of tumour suppressor genes and activation of oncogenes. It has been reported in the past that disruption of constitutive insulated neighbourhoods leads to oncogene activation [22, 23, 31].

Recent reviews and opinion papers have highlighted the central role of boundary disruption leading to genomic rearrangement in cancer development [5155], and several recent articles have connected the disruption of constitutive neighbourhoods to oncogene dysregulation in specific tumours.

In [22] for instance, the loss of one boundary enables a constitutive enhancer to interact aberrantly with PDGFRA, a prominent oncogene in glioma. In [23], the loss of one boundary is linked to the activation of TAL1, a proto-oncogene in acute lymphoblastic leukemia. In [31], mutations of one insulator region identified as a melanoma driver are associated with the upregulation of TGFB1, although another study on melanoma could not find evidence of gene expression enrichment [28]. Other studies have described gene expression changes in the proximity of mutation hotspots at CTCF binding sites in gastrointestinal cancer [30]. While each of these results individually suggests an important role for the dysregulation of some constitutive neighbourhoods in specific tumors, a conclusive pan-cancer analysis is not yet available.

In this work, we present a systematic pan-cancer analysis of the alterations of neighbourhood boundaries that actively support DNA looping and the constitution of insulated neighbourhoods. We concentrate on CTCF motif occurrences that are confirmed by ChIP-seq experiments in three cell lines, and we further divide them as in-boundary vs off-boundary by using boundary regions as identified by biologically coherent ChIA-PET experiments. Specifically, we have investigated the alteration of CTCF bindings by means of three different genomic alterations: the acquisition of somatic mutations, abnormal methylation and copy number alterations. All three mechanisms can disrupt or completely eliminate a motif, hence preventing the binding of CTCF and compromising the integrity of its associated neighbourhood. In the case of somatic mutations, we opted for a descriptive unbiased approach where we summarised all mutations falling within a motif. Our result complements existing studies that have highlighted the prevalence of mutation enrichments at CTCF sites and provides a novel insights about the impact of methylation and copy number alteration on boundary dysregulation, which have not yet been systematically analysed.

Table 5 summarises our analysis: we find that somatic mutations, methylation, and copy number variations are significantly enriched in the neighbourhood boundaries in some specific cancer types. Specifically, we observed an enrichment of somatic mutations in at least two insulated neighbourhood datasets in eight cancer types. A systematic analysis of the type of mutation observed in each cancer type (Fig 5) reveals the appearance of mutation peaks on specific positions of the motif, which are not present on off-boundary CTCF motifs. We also observe a very significant overlap of frequently mutated active boundaries on these five cancers (Table G in S1 File), confirming that mutations in boundary do not happen at random. A positive selection of transversions versus transitions seems to be prevalent in most cancer types. We speculate that transversions might be positively selected, as they have a slightly larger effect on DNA shape and transcription factor binding than transitions [41].

Tab. 5. Significant alterations in cancer types.
Significant alterations in cancer types.
For each of the three analyses we performed (enrichment of somatic mutations, DNA methylation and CNA) we report the result corresponding to each of the three datasets of insulated neighbourhoods: hESC (h), MCF7 (M) and Hnisz (H). A dash indicates that data is not available for the specific cancer type, Y that the alteration is significant (p-value <.05), N that the alteration is not significant (p-value ≥ .05). For methylation Y(+) indicates that a significant hyper-methylation is observed, Y(-) that a significant hypo-methylation is observed.

Regarding methylation, we tested in-boundary CTCF motifs in twelve cancer types and we found that seven of them are significantly hyper-methylated while four of them are significantly hypo-methylated. We also observe a very significant overlap of frequently over-methylated active boundaries on these cancers, confirming that also the over-methylated dysregulation of boundaries does not happen by random chance. Finally, we observe that copy number alterations significantly overlap with active junctions in four cancer types, namely in Breast Cancer (BRCA), Lung Adenocarcinoma (LUAD), Lung Squamous Carcinoma (LUSC) and Ovarian Cancer (OV), all of them cancers where important oncogenic CNA signatures have been identified [50, 56].

In S2 File we report pan-cancer integrated information; rows describe CTCF motif occurrences indexed by genomic coordinates, their active-inactive state, their overlap with neighborhood boundaries and for each tumor their counts of mutations and hyper methylated probes.

Unfortunately, we do not have access to all studied data types for all considered cancers, and Breast Cancer is the only tumor type for which we have data about all three mutagenic processes. Remarkably, breast cancer appears to be significantly enriched in all 3 considered boundary dysregulation mechanims. We note that the amplification of estrogen response elements has been reported to be generated by abnormal long-range chromatin interactions [57], further highlighting the importance of maintaining a correctly regulated genome architecture.

To summarise, although additional analysis and experimental validation are needed to understand the causal relationship between these observations and the abnormal modifications of the genome 3D structure, our research presents a systematic observation of the relationships between topological boundaries and genome alterations. One of the immediate outcomes of our work is a method for the identification of deregulated junctions, enabling a detailed, tumour-specific analysis on a smaller scale, as in [22, 23, 27, 28, 30].

Supporting information

S1 File [pdf]
Supplementary material, containing Tables A-J and Figs A-L.

S2 File [xls]
Pancancer integrated information; rows describe CTCF motif occurrences indexed by genomic coordinates, their activeinactive state, their overlap with neighborhood boundaries and for each tumor their counts of mutations and hyper methylated probes.

S3 File [rtf]
Readme file for .

S1 Notebook [ipynb]
Jupyter Notebook for mutation analysis.


Zdroje

1. Bickmore WA, van Steensel B. Genome Architecture: Domain Organization of Interphase Chromosomes. Cell. 2013;152(6):1270–1284. https://doi.org/10.1016/j.cell.2013.02.001 23498936

2. Dixon JR, Selvaraj S, Yue F, Kim A, Li Y, Shen Y, et al. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature. 2012;485:376 EP –. doi: 10.1038/nature11082 22495300

3. Lieberman-Aiden E, van Berkum NL, Williams L, Imakaev M, Ragoczy T, Telling A, et al. Comprehensive mapping of long range interactions reveals folding principles of the human genome. Science. 2009;326(5950):289–293. doi: 10.1126/science.1181369 19815776

4. Baniahmad A, Steiner C, Köhne AC, Renkawitz R. Modular structure of a chicken lysozyme silencer: involvement of an unusual thyroid hormone receptor binding site. Cell. 1990;61(3):505–514. doi: 10.1016/0092-8674(90)90532-j 2159385

5. Filippova GN, Thienes CP, Penn BH, Cho DH, Hu YJ, Moore JM, et al. CTCF -binding sites flank CTG/CAG repeats and form a methylation-sensitive insulator at the DM1 locus. Nature Genetics. 2001;28(4):335–343. doi: 10.1038/ng570 11479593

6. Bell AC, Felsenfeld G. Methylation of a CTCF -dependent boundary controls imprinted expression of the Igf2 gene. Nature. 2000;405(6785):482–485. doi: 10.1038/35013100 10839546

7. Hark AT, Schoenherr CJ, Katz DJ, Ingram RS, Levorse JM, Tilghman SM. CTCF mediates methylation-sensitive enhancer-blocking activity at the H19/Igf2 locus. Nature. 2000;405(6785):486–489. doi: 10.1038/35013106 10839547

8. Yusufzai TM, Tagami H, Nakatani Y, Felsenfeld G. CTCF tethers an insulator to subnuclear sites, suggesting shared insulator mechanisms across species. Molecular Cell. 2004;13(2):291–298. doi: 10.1016/s1097-2765(04)00029-2 14759373

9. Hou C, Zhao H, Tanimoto K, Dean A. CTCF -dependent enhancer-blocking by alternative chromatin loop formation. Proceedings of the National Academy of Sciences. 2008;105(51):20398–20403. doi: 10.1073/pnas.0808506106

10. Phillips-Cremins JE, Sauria MEG, Sanyal A, Gerasimova TI, Lajoie BR, Bell JSK, et al. Architectural Protein Subclasses Shape 3D Organization of Genomes during Lineage Commitment. Cell. 2013;153(6):1281–1295. doi: 10.1016/j.cell.2013.04.053 23706625

11. Rao SSP, Huntley MH, Durand NC, Stamenova EK, Bochkov ID, Robinson JT, et al. A 3D Map of the Human Genome at Kilobase Resolution Reveals Principles of Chromatin Looping. Cell. 2014;159(7):1665–1680. https://doi.org/10.1016/j.cell.2014.11.021 25497547

12. Dowen JM, Fan ZP, Hnisz D, Ren G, Abraham BJ, Zhang LN, et al. Control of Cell Identity Genes Occurs in Insulated Neighborhoods in Mammalian Chromosomes. Cell. 2014;159(2):374–387. doi: 10.1016/j.cell.2014.09.030 25303531

13. Ji X, Dadon DB, Powell BE, Fan ZP, Borges-Rivera D, Shachar S, et al. 3D Chromosome Regulatory Landscape of Human Pluripotent Cells. Cell Stem Cell. 2016;18(2):262–275. doi: 10.1016/j.stem.2015.11.007 26686465

14. Dixon JR, Gorkin DU, Ren B. Chromatin Domains: The Unit of Chromosome Organization. Molecular Cell. 2016;62(5):668–680. https://doi.org/10.1016/j.molcel.2016.05.018 27259200

15. Jose CC, Xu B, Jagannathan L, Trac C, Mallela RK, Hattori T, et al. Epigenetic dysregulation by nickel through repressive chromatin domain disruption. Proceedings of the National Academy of Sciences of the United States of America. 2014;111(40):14631–14636. doi: 10.1073/pnas.1406923111 25246589

16. Filippova GN, Qi CF, Ulmer JE, Moore JM, Ward MD, Hu YJ, et al. Tumor-associated Zinc Finger Mutations in the CTCF Transcription Factor Selectively Alter Its DNA-binding Specificity. Cancer Research. 2002;62(1):48–52. 11782357

17. Prawitt D, Enklaar T, Gärtner-Rupprecht B, Spangenberg C, Oswald M, Lausch E, et al. Microdeletion of target sites for insulator protein CTCF in a chromosome 11p15 imprinting center in Beckwith—Wiedemann syndrome and Wilms’ tumor. Proceedings of the National Academy of Sciences. 2005;102(11):4085–4090. doi: 10.1073/pnas.0500037102

18. Katainen R, Dave K, Pitkänen E, Palin K, Kivioja T, Välimäki N, et al. CTCF /cohesin-binding sites are frequently mutated in cancer. Nature Genetics. 2015;47:818. doi: 10.1038/ng.3335 26053496

19. Gregor A, Oti M, Kouwenhoven EN, Hoyer J, Sticht H, Ekici AB, et al. De Novo Mutations in the Genome Organizer CTCF Cause Intellectual Disability. The American Journal of Human Genetics. 2013;93(1):124–131. doi: 10.1016/j.ajhg.2013.05.007 23746550

20. Bastaki F, Nair P, Mohamed M, Malik EM, Helmi M, Al-Ali MT, et al. Identification of a novel CTCF mutation responsible for syndromic intellectual disability—a case report. BMC Medical Genetics. 2017;18(1):68. doi: 10.1186/s12881-017-0429-0 28619046

21. Herold M, Bartkuhn M, Renkawitz R. CTCF: insights into insulator function during development. Development. 2012;139(6):1045–1057. doi: 10.1242/dev.065268 22354838

22. Flavahan WA, Drier Y, Liau BB, Gillespie SM, Venteicher AS, Stemmer-Rachamimov AO, et al. Insulator dysfunction and oncogene activation in IDH mutant gliomas. Nature. 2015;529:110 EP –. doi: 10.1038/nature16490 26700815

23. Hnisz D, Weintraub AS, Day DS, Valton AL, Bak RO, Li CH, et al. Activation of proto-oncogenes by disruption of chromosome neighborhoods. Science. 2016;351(6280):1454–1458. doi: 10.1126/science.aad9024 26940867

24. Gröschel S, Sanders MA, Hoogenboezem R, de Wit E, Bouwman BAM, Erpelinck C, et al. A Single Oncogenic Enhancer Rearrangement Causes Concomitant EVI1 and GATA2 Deregulation in Leukemia. Cell. 2014;157(2):369–381. doi: 10.1016/j.cell.2014.02.019 24703711

25. Northcott PA, Lee C, Zichner T, Stütz AM, Erkek S, Kawauchi D, et al. Enhancer hijacking activates GFI1 family oncogenes in medulloblastoma. Nature. 2014;511:428. doi: 10.1038/nature13379 25043047

26. Walker BA, Wardell CP, Brioli A, Boyle E, Kaiser MF, Begum DB, et al. Translocations at 8q24 juxtapose MYC with genes that harbor superenhancers resulting in overexpression and poor prognosis in myeloma patients. Blood Cancer Journal. 2014;4:e191. doi: 10.1038/bcj.2014.13 24632883

27. Taberlay PC, Achinger-Kawecka J, Lun ATL, Buske FA, Sabir K, Gould CM, et al. Three-dimensional disorganization of the cancer genome occurs coincident with long-range genetic and epigenetic alterations. Genome Research. 2016;26(6):719–731. doi: 10.1101/gr.201517.115 27053337

28. Poulos RC, Thoms JA, Guan YF, Unnikrishnan A, Pimanda JE, Wong JW. Functional mutations form at CTCF-cohesin binding sites in melanoma due to uneven nucleotide excision repair across the motif. Cell reports. 2016;17(11):2865–2872. doi: 10.1016/j.celrep.2016.11.055 27974201

29. Sabarinathan R, Mularoni L, Deu-Pons J, Gonzalez-Perez A, Lopez-Bigas N. Nucleotide excision repair is impaired by binding of transcription factors to DNA. Nature. 2016;532(7598):264. doi: 10.1038/nature17661 27075101

30. Guo YA, Chang MM, Huang W, Ooi WF, Xing M, Tan P, et al. Mutation hotspots at CTCF binding sites coupled to chromosomal instability in gastrointestinal cancers. Nature communications. 2018;9(1):1520. doi: 10.1038/s41467-018-03828-2 29670109

31. Liu EM, Martinez-Fundichely A, Diaz BJ, Aronson B, Cuykendall T, MacKay M, et al. Identification of Cancer Drivers at CTCF Insulators in 1,962 Whole Genomes. Cell systems. 2019;8(5):446–455. doi: 10.1016/j.cels.2019.04.001 31078526

32. Gonzalez-Perez A, Sabarinathan R, Lopez-Bigas N. Local Determinants of the Mutational Landscape of the Human Genome. Cell. 2019;177(1):101–114. doi: 10.1016/j.cell.2019.02.051 30901533

33. ICGC: International Cancer Genome Consortium homepage;. https://dcc.icgc.org/.

34. TCGA: The Cancer Genome Atlas homepage;. https://portal.gdc.cancer.gov/.

35. ENCODE: Encyclopedia of DNA Elements homepage;. https://www.encodeproject.org/.

36. Masseroli M, Pinoli P, Venco F, Kaitoua A, Jalili V, Palluzzi F, et al. GenoMetric Query Language: a novel approach to large-scale genomic data management. Bioinformatics. 2015;31(12):1881–1888. doi: 10.1093/bioinformatics/btv048 25649616

37. Tang Z, Luo OJ, Li X, Zheng M, Zhu JJ, Szalaj P, et al. CTCF -Mediated Human 3D Genome Architecture Reveals Chromatin Topology for Transcription. Cell. 2015;163(7):1611–1627. doi: 10.1016/j.cell.2015.11.024 26686651

38. Pagès H, Aboyoun P, Gentleman R, DebRoy S. Biostrings: Efficient manipulation of biological strings; 2017.

39. Sandelin A, Alkema W, Engström P, Wasserman WW, Lenhard B. JASPAR: an open-access database for eukaryotic transcription factor binding profiles. Nucleic acids research. 2004;32(suppl_1):D91–D94. doi: 10.1093/nar/gkh012 14681366

40. Perera D, Poulos RC, Shah A, Beck D, Pimanda JE, Wong JWH. Differential DNA repair underlies mutation hotspots at active promoters in cancer genomes. Nature. 2016;532:259 EP –. doi: 10.1038/nature17437 27075100

41. Guo C, McDowell IC, Nodzenski M, Scholtens DM, S AA, Lowe WL, et al. Transversions have larger regulatory effects than transitions. BMC Genomics. 2017;18:394:1–9. doi: 10.1186/s12864-017-3785-4

42. Alexandrov LB, Stratton MR. Mutational signatures: the patterns of somatic mutations hidden in cancer genomes. Current Opinion in Genetics & Development. 2014;24:52–60. https://doi.org/10.1016/j.gde.2013.11.014.

43. Alexandrov LB, Kim J, et al. The Repertoire of Mutational Signatures in Human Cancer. BMC Bioinformatics. 2019;4:20:152.

44. Krüger S, Piro RM decompTumor2Sig: identification of mutational signatures active in individual tumors. bioRxiv. 2018.

45. Stadler MB, Murr R, Burger L, Ivanek R, Lienert F, Schöler A, et al. DNA-binding factors shape the mouse methylome at distal regulatory regions. Nature. 2011;480:490. doi: 10.1038/nature10716 22170606

46. Wang H, Maurano MT, Qu H, Varley KE, Gertz J, Pauli F, et al. Widespread plasticity in CTCF occupancy linked to DNA methylation. Genome Research. 2012;22(9):1680–1688. doi: 10.1101/gr.136101.111 22955980

47. Beroukhim R, Mermel CH, Porter D, Wei G, Raychaudhuri S, Donovan J, et al. The landscape of somatic copy-number alteration across human cancers. Nature. 2010;463(7283):899–905. doi: 10.1038/nature08822 20164920

48. Weischenfeldt J, Dubash T, Drainas AP, Mardin BR, Chen Y, Stütz AM, et al. Pan-cancer analysis of somatic copy number alterations implicates IRS4 and IGF2 in enhancer hijacking. Nat Genet. 2017;49(1):65–74. doi: 10.1038/ng.3722 27869826

49. Laddha Saurabh V., Ganesan Shridar, Chan Chang S., and White Eileen. Mutational landscape of the essential autophagy gene becn1 in human cancers. Molecular cancer research: MCR, 12(4):485–490, Apr 2014. doi: 10.1158/1541-7786.MCR-13-0614 24478461

50. Graham NA, Minasyan A, Lomova A, Cass A, Balanis NG, Friedman M, et al. Recurrent patterns of DNA copy number alterations in tumors reflect metabolic selection pressures. Molecular Systems Biology. 2017;13(2). doi: 10.15252/msb.20167159 28202506

51. Corces MR, Corces VG. The three-dimensional cancer genome. Current Opinion in Genetics & Development. 2016;36:1–7. https://doi.org/10.1016/j.gde.2016.01.002.

52. Valton AL, Dekker J. TAD disruption as oncogenic driver. Current Opinion in Genetics & Development. 2016;36:34–40. https://doi.org/10.1016/j.gde.2016.03.008.

53. Krijger PHL, de Laat W. Regulation of disease-associated gene expression in the 3D genome. Nature Reviews Molecular Cell Biology. 2016;17:771 EP –. doi: 10.1038/nrm.2016.138 27826147

54. Kaiser VB, Semple CA. When TADs go bad: chromatin structure and nuclear organisation in human disease. F1000Res. 2017; 6:F1000 Faculty Rev–314.

55. Song SH, Kim TY. CTCF, Cohesin, and Chromatin in Human Cancer. Genomics Inform. 2017;15(4):114–122. doi: 10.5808/GI.2017.15.4.114 29307136

56. Zack TI, Schumacher SE, Carter SL, Cherniack AD, Saksena G, Tabak B, et al. Pan-cancer patterns of somatic copy number alteration. Nature genetics. 2013;45(10):1134. doi: 10.1038/ng.2760 24071852

57. Yang J, Wei X, Tufan T, Kuscu C, Unlu H, Farooq S, et al. Recurrent mutations at estrogen receptor binding sites alter chromatin topology and distal gene expression in breast cancer. Genome biology. 2018;19(1):190. doi: 10.1186/s13059-018-1572-4 30404658


Článek vyšel v časopise

PLOS One


2020 Číslo 1
Nejčtenější tento týden
Nejčtenější v tomto čísle
Kurzy

Zvyšte si kvalifikaci online z pohodlí domova

plice
INSIGHTS from European Respiratory Congress
nový kurz

Současné pohledy na riziko v parodontologii
Autoři: MUDr. Ladislav Korábek, CSc., MBA

Svět praktické medicíny 3/2024 (znalostní test z časopisu)

Kardiologické projevy hypereozinofilií
Autoři: prof. MUDr. Petr Němec, Ph.D.

Střevní příprava před kolonoskopií
Autoři: MUDr. Klára Kmochová, Ph.D.

Všechny kurzy
Kurzy Podcasty Doporučená témata Časopisy
Přihlášení
Zapomenuté heslo

Zadejte e-mailovou adresu, se kterou jste vytvářel(a) účet, budou Vám na ni zaslány informace k nastavení nového hesla.

Přihlášení

Nemáte účet?  Registrujte se

#ADS_BOTTOM_SCRIPTS#