Variegated gene expression caused by cell-specific long

ARTICLES
Variegated gene expression caused by cell-specific
long-range DNA interactions
Daan Noordermeer1,4,5 , Elzo de Wit1,5 , Petra Klous1,5 , Harmen van de Werken1 , Marieke Simonis1 ,
Melissa Lopez-Jones2 , Bert Eussen3 , Annelies de Klein3 , Robert H. Singer2 and Wouter de Laat1,6
Mammalian genomes contain numerous regulatory DNA sites with unknown target genes. We used mice with an extra β-globin
locus control region (LCR) to investigate how a regulator searches the genome for target genes. We find that the LCR samples a
restricted nuclear subvolume, wherein it preferentially contacts genes controlled by shared transcription factors. No contacted
gene is detectably upregulated except for endogenous β-globin genes located on another chromosome. This demonstrates
genetically that mammalian trans activation is possible, but suggests that it will be rare. Trans activation occurs not pan-cellularly,
but in ‘jackpot’ cells enriched for the interchromosomal interaction. Therefore, cell-specific long-range DNA contacts can cause
variegated expression.
High-resolution profiling of transcription-factor binding sites, the
discovery of conserved genetic elements and identification of regulatory
sites using technologies such as DNaseI hypersensitive site mapping1 ,
has demonstrated that the number of genomic sites with transcription
regulatory potential far exceeds the number of genes. One of the main
challenges of the post-genomic era therefore is to assign function
to each element. This requires understanding of the capacity of
regulatory sites to reach over distance and identify specific target
genes at the single-cell level. It is known that mammalian DNA
elements can modulate the activity of distant genes on the same
chromosome, up to a genomic distance of over a megabase2,3 . The
three-dimensional structure of the mammalian genome facilitates
long-range gene regulation. This was first shown for the β-globin
locus, which contains multiple β-globin genes arranged in the order
of their developmental expression (Fig. 1a). Upstream of the genes
resides a large regulatory DNA element that enhances expression of the
β-globin genes up to ∼100-fold4 . The element is called a locus control
region (LCR), as it has the capacity to confer position-independent and
copy-number-dependent expression on reporter genes in transgenic
assays in addition to its classical enhancer activity5 . Otherwise, the
β-globin LCR is not different from classical enhancers: it upregulates
gene expression over distance, it functions in a tissue-specific manner
and genes compete for its activity6,7 . At its endogenous location, the
LCR enhances expression of the β-globin genes through physical
interactions, thereby looping out the intervening DNA that may carry
non-responding genes8–10 (Supplementary Fig. S1). A similar mode of
action involving chromatin looping has been shown for enhancers in
several other gene loci11,12 .
At a higher-order level of organization, microscopy studies have
shown that genes can occupy differential positions in the nucleus
depending on their expression status13–15 and that regulatory DNA
elements are instrumental in targeting these genomic regions to specific
nuclear positions16–19 . Moreover, recent observations suggest that
functionally related genes on the same and different chromosomes
may come together in the nuclear space for co-transcription20 . Finally,
some reports have suggested functional communication between
regulatory sites located on different chromosomes21–25 . Collectively,
these studies raise questions on how genes and regulatory sequences
explore the nuclear space to engage in functional crosstalk with
preferred genomic partners.
RESULTS
Exploring mammalian transvection
To investigate the ability of regulatory DNA elements to relocate
chromosomal regions in the nuclear interior and search for preferred
target genes, we used transgenic mice with the human erythroid-specific
β-globin LCR (hLCR) site-specifically integrated into a gene-dense
region on chromosome 8. This region, 8C3–C4, contains many
housekeeping genes18 . Two transgenic lines were available, LCR-S and
LCR-AS, each carrying the hLCR without globin genes at 8C3–C4, but
in opposite orientations (Fig. 1b). In a previous study we showed that in
erythroid cells each LCR detectably upregulates several housekeeping
1
Hubrecht Institute-KNAW and University Medical Center Utrecht, Uppsalalaan 8, 3584 CT Utrecht, The Netherlands. 2 Department of Anatomy and Structural Biology,
Albert Einstein College of Medicine, Bronx, New York 10461, USA. 3 Department of Clinical Genetics, Erasmus Medical Centre, PO Box 2040, 3000 CA Rotterdam,
The Netherlands. 4 Present address: Laboratory of Developmental Genomics, School of Life Sciences, Swiss Federal Institute of Technology (EPFL), Lausanne,
Switzerland. 5 These authors contributed equally to this work.
6
Correspondence should be addressed to W.d.L. (e-mail: [email protected])
Received 12 September 2010; accepted 9 May 2011; published online 26 June 2011; DOI: 10.1038/ncb2278
944
NATURE CELL BIOLOGY VOLUME 13 | NUMBER 8 | AUGUST 2011
© 2011 Macmillan Publishers Limited. All rights reserved.
ARTICLES
a
Mouse β-globin locus
(mouse chromosome 7)
LCR β-globin genes
654 3 21 εy βh1 βmaj βmin
–85HS –62/–60HSs
2
5
4
1
E
Human β-globin locus
(human chromosome 11)
6
2
Gγ Aγ
P
5
4
3
1
hLCR-S
1
E/F
5
G
P: Puromycin
N: Neomycin
G: GFP
N
E12.5 fetal liver
0.5
0
– AS S
Aγ-globin: AS AS AS S
e
DNA FISH
Separate:
8C3–C4
(BAC A15)
8C3–C4
(BAC P23)
DAPI
0.5
0
– AS S – AS S
AS AS AS S S S
S S
Co-localized:
1.0
DNA FISH
E12.5 fetal liver
Co-localization
frequency
1.0
Relative
expression
Relative
expression
5 kb
G
N
c E10.5 Embryonic blood
d
20 kb
P
Rad23a
– AS S
3
1
4
3
Rad23a
hLCR:
2
B B
E Embryonic blood
F Fetal liver
B Adult bone marrow
2
2
8C3–C4
Aγ-globin–GFP-AS
Aγ-globin–GFP-S
4
3'HS1
1
β-globin gene
DNaseI hypersensitive site
Olfactory receptor gene
b hLCR-AS
3
5
E
/ HS
2
F/B F/B
LCR β-globin genes
4 3
7
E
54 3 2 1 ε
–110/–107HSs
8
3'HS1
1
3
is generally not observed in mammals28 , but may occur in special
instances where mono-allelic expression needs to be ensured29–31 .
As mammalian chromosomes intermingle extensively32 , individual
chromosomal segments may indeed invade the territories of other,
possibly homologous, chromosomes. We generated transgenic mice
carrying a human Aγ-globin reporter gene in one or the other
orientation at exactly the same location in 8C3–C4 (Fig. 1b). The
Aγ-globin gene is a human fetal globin gene that, in mouse transgenics
carrying a full human globin cluster including the hLCR, is most
highly expressed between embryonic day 10.5 and 12.5 (ref. 33;
Fig. 1a). Crossing the Aγ-globin reporter mice with transgenic mice
carrying the hLCR on the homologue revealed no increase in Aγ-globin
gene expression at any of the relevant developmental stages (Fig. 1c).
Fluorescence in situ hybridization (FISH) experiments also did not
reveal increased interaction between the homologues (Fig. 1d,e). Therefore, our experimental system provides no evidence for mammalian
transvection involving the hLCR at 8C3–C4. Although these data do
not rule out mammalian transvection at other genomic locations, they
do strongly suggest that the ectopic hLCR does not have unlimited
freedom to search the nuclear interior for natural target genes.
NS
NS
6.0
4.0
2.0
0
hLCR:
Aγ-globin:
–
–
S
–
S
S
Figure 1 An ectopic LCR does not activate a natural target gene on the
homologous chromosome. (a) Schematic representation of the endogenous
mouse and human β-globin loci. Below each globin gene, gene activity in
(transgenic) mice is indicated. (b) Targeting strategy for the insertion of
the human β-globin LCR and a human Aγ-globin–green fluorescent protein
(GFP) reporter gene into the 8C3–C4 locus on mouse chromosome eight.
(c) RT–qPCR of Aγ-globin transcript levels, normalized to Hprt1 transcript
levels. Data are from at least two independent samples. (d) Representative
examples of DNA FISH showing co-localized and separate 8C3–C4 alleles.
DAPI, 4,6-diamidino-2-phenylindole. Scale bar: 2 µm. (e) Co-localization
frequencies of 8C3–C4 alleles. Significance levels are indicated above the
graph (G -test).
genes that directly surround the integration site up to sixfold, with
the most distal one being 150 kilobases (kb) away. Microscopy studies
revealed that both LCRs positioned 8C3–C4 more often outside its
own chromosome territory, raising the question of to where the LCR
migrates its integration site.
We reasoned that, in the most extreme situation, the hLCR could
‘search’ for a natural target gene present anywhere in the genome,
including on its homologous chromosome, as is seen in transvection.
The term transvection was coined to describe transcriptional regulation
across (paired) homologous chromosomes, a phenomenon mainly
studied in Drosophila, where homologues frequently pair26,27 . Pairing
LCR motion is limited by chromosomal context
To investigate in more detail the ability of the hLCR to actively determine its genomic environment, we applied chromosome conformation
capture34 on chip (4C) technology to E14.5 fetal livers of wild-type and
homozygous LCR-AS mice (Supplementary Fig. S2). 4C captures and
identifies spatially proximal DNA sequences to enable an unbiased scan
for DNA elements interacting with a locus of choice35 . Analysis of the
4C data revealed that the ratio of inter- over intrachromosomal captures
increased in the LCR-AS mice (Supplementary Fig. S3), in agreement
with the hLCR causing looping out from the chromosome territory
(ref. 18). This might be a reflection of the LCR actively engaging in
interactions with new interchromosomal partners. To identify specific
DNA interactions, sliding-window algorithms were applied that scan
the linear chromosome templates for significant clustering of independently captured sequences36,37 . Using both conventional analysis and a
newly developed high-resolution analysis, we identified a highly similar
set of interacting regions for 8C3–C4 with or without an integrated
hLCR, both on the same chromosome (in cis) and on different
chromosomes (in trans) (Fig. 2a–c and Supplementary Figs S4, S5a). Extensive fluorescence in situ hybridization experiments on cryo-sections
(cryo-FISH; ref. 32) validated the 4C data (Fig. 2d,e). Some regions in
trans showed increased interaction frequencies with the hLCR, again
in agreement with the LCR causing looping out of the chromosome
territory. However, we did not find chromosomal regions that exclusively interacted with the hLCR. Altogether, this demonstrates that the
hLCR at 8C3–C4 does not search the genome to contact new preferred
interaction partners. Rather, the chromosomal context of 8C3–C4
dictates the nuclear space that can be explored by the integrated hLCR.
The LCR contacts GATA-1 and EKLF regulated genes
Whereas the overall genomic environment did not change, a
quantitative comparison of 4C results showed that the hLCR at 8C3–C4
captured a subset of pre-existing interchromosomal interaction
partners more efficiently. This was most obvious for the α-globin
locus on chromosome 11, but also for the endogenous β-globin
NATURE CELL BIOLOGY VOLUME 13 | NUMBER 8 | AUGUST 2011
© 2011 Macmillan Publishers Limited. All rights reserved.
945
ARTICLES
a
8C3–C4
max.
Chromosome 8
hLCR-AS 1
0
max.
hLCR-AS 2
0
max.
Wild type 1
0
max.
Wild type 2
0
0
20
80
60
Chromosomal location (Mb)
40
b
100
120
8C3–C4
200
hLCR-AS
0
200
10
100
0
8
0
20
40
60
80
Chromosomal location (Mb)
β-globin
c
200
hLCR-AS
120
6
α-globin
Chromosome 7
Chromosome 11
4
100
Window size
Wild type
100
0
200
–log10 P-value
Window size
Wild type
Chromosome 8
100
2
100
0
0
0
50
100
Chromosomal location (Mb)
150
e
d Cryo-FISH
50
Chromosomal location (Mb)
100
Cryo-FISH
Co-localization frequency (%)
Co-localized:
Separate:
8C3–C4
∗
8
∗
∗
∗
Fetal liver hLCR-AS
6
Fetal liver wild type
∗
4
∗
∗
∗
∗
∗
∗
∗
2
0
n = 717 509 724 785 256 255 504 482 509 258 250 252 254 251 519 508 528 503 251 251 253 506
Locus of interest
DAPI
0
14D2
11D/E1
14C3
11B5
7F3
10B1
7F5
10B5.3
7F3
15B3.1
7D3
Significant interacting region:
Fetal liver hLCR-AS:
Fetal liver wild type:
Figure 2 Contacts of 8C3–C4 with and without the LCR are similar.
(a) Intrachromosomal DNA interactions of 8C3–C4 with (top) and without
(bottom) an integrated β-globin LCR are essentially similar, as determined
by 4C analysed with a running-mean analysis of microarray data (average
probe spacing: 7 kb). (b) Intrachromosomal DNA interactions of 8C3–C4
with (top) and without (bottom) an integrated β-globin LCR are essentially
similar, as determined by 4C analysed with domainograms that visualize
probability scores (P -values indicated with colour codes) for the clustering
of positive 4C signals over windows ranging in size from 1 to 200 probes.
(c) Interchromosomal 4C data for two chromosomes (7 and 11), analysed
as described above. (d) Validation of 4C results by cryo-FISH; examples of
results. Scale bar: 2 µm. (e) Interaction frequencies with a series of genomic
regions, measured by cryo-FISH in wild-type and LCR transgenic fetal livers.
The number of cells analysed (n) is indicated. Colour codes indicate the
significance of the 4C signal (probe clustering), with green referring to
P < 0.01 and red referring to P ≥ 0.01.
locus on chromosome 7 (Fig. 3a). Homology between the human
and mouse LCRs is limited (<10%) and therefore not expected
to underlie the latter contact. The two loci have in common that
they are very highly expressed and carry genes controlled by EKLF
and GATA-1, two transcription factors that also bind to the LCR.
Ranking genomic regions on the basis of their difference in 4C signal
revealed significant enrichment (P < 0.01, hypergeometric test) of
EKLF (refs 38,39)- and GATA-1 (ref. 40)-regulated genes, as well as
of highly expressed genes, among regions contacted more strongly by
the LCR (Supplementary Table S1). The same conclusion was drawn
946
NATURE CELL BIOLOGY VOLUME 13 | NUMBER 8 | AUGUST 2011
© 2011 Macmillan Publishers Limited. All rights reserved.
ARTICLES
a
Chromosomal location (Mb)
111.0
7E3
110.8
Chromosomal location (Mb)
11A4
32.0
111.2
32.2
1% most highly
expressed genes
0.4
0.2
∗
0
(LCR–Rad23a) bin
0.6
GATA1regulated genes
0.4
0.2
0
∗
< –3
< –2
< –1
>1
>2
>3
>4
>5
>6
>7
0.6
Fraction of probes
0
(LCR–Rad23a) bin
Wild type
hLCR-AS
0.6
EKLFregulated genes
c GATA1
EKLF
28
119
0.4
96
88
0.2
0
∗
7
0
74
< –3
< –2
< –1
>1
>2
>3
>4
>5
>6
>7
2
10
8
6
4
2
0
Fraction of probes
4C signal (k = 21)
α-globin
4
< –3
< –2
< –1
>1
>2
>3
>4
>5
>6
>7
b
Fraction of probes
4C signal (k = 21)
β-globin
32.4
(LCR–Rad23a) bin
1% most highly
expressed
Figure 3 Within a predetermined genomic environment the ectopic LCR
shows preferential interactions with specific genes. (a) 4C results (running
median over sliding windows of 21 probes) for 8C3–C4 with (blue) and
without (red) the integrated LCR, at the endogenous β-globin locus on
chromosome 7 (left) and the α-globin locus on chromosome 11 (right).
(b) Probes were binned according to increasing difference in (LCR −
wild-type) 4C signal and characterized depending on their location relative
to highly expressed and GATA-1- and EKLF-regulated genes. The yellow
dashed line represents the expected frequency on the basis of all probes.
(c) Venn diagram showing the number of, and overlap between, probes
captured more frequently in the integrated LCR 4C experiment for each
category of genes analysed in the population marked with an asterisk in b.
from an analysis per microarray probe (Fig. 3b). Each category of genes
seems to independently attract the LCR (Fig. 3c and Supplementary
Fig. S5b). We conclude that the LCR at 8C3–C4 behaves like a ‘dog
on a lead’: the chromosomal context of 8C3–C4 imposes important
constraints on its freedom to move, but within the restricted nuclear
subcompartment that it occupies the LCR preferentially contacts genes
that are controlled by shared transcription factors.
strategies that measure gene expression across cell populations (Fig. 4d
and Supplementary Fig. S6e).
Interchromosomal gene activation by the LCR
Notwithstanding its restricted ability to change the spatial environment
of 8C3–C4, the ectopic LCR is involved in many long-range interactions
in cis and trans. We investigated whether any of the contacted genes
were upregulated by the LCR. For this, we profiled the transcriptomes
of wild-type and LCR-transgenic littermates using microarrays. A small
number of genes was found to be upregulated more than twofold in
transgenics. These were the previously characterized genes proximal
to the LCR in cis (ref. 18), plus a single gene in trans, Hbb-bh1, that
was confirmed to be upregulated by quantitative PCR with reverse
transcription (RT–qPCR) (Fig. 4 and Supplementary Fig. S6a–d).
Interestingly, Hbb-bh1, or βh1, is one of the endogenous β-globin
genes normally expressed at high, LCR-dependent levels earlier in
development, in embryonic blood cells (Fig. 1a). In E14.5 fetal livers
the gene is looped away from the endogenous LCR (Supplementary
Fig. S1). However, three independent probe-sets on the Affymetrix
microarray and two independent RT–qPCR primer-sets reveal that
the gene is not fully silent, but expressed at basal levels in wild-type
fetal livers (Fig. 4 and Supplementary Fig. S6). Importantly, increased
Hbb-bh1 expression was consistently found in transgenics carrying
the LCR in either one of the orientations when compared with their
wild-type littermates (Fig. 4a,b), but not when 8C3–C4 exclusively
carried a Neo selection cassette (Fig. 4c). No other β- or α-globin
gene detectably changed in expression in the presence of the ectopic
LCR (but see below), as judged by microarray analysis and RT–qPCR
Trans -contacts cause variegated expression
Given that the endogenous β-globin locus carrying βh1 is among the
chromosomal regions that are preferentially contacted by the ectopic
LCR (Fig. 3a), we hypothesized that such interchromosomal LCR–βh1
contacts drive increased βh1 expression. DNA and RNA-FISH analysis
showed that the interaction occurs in 5–10% of the cells, whereas
higher-resolution cryo-FISH identified contacts between 2 and 3%
of the alleles (Supplementary Table S2). Irrespectively, this raises the
question of how these relatively rare interchromosomal contacts can
account for the overall twofold increase in transcript levels measured
across the entire cell population. We reasoned that interaction
frequencies measured by FISH in fixed cells may reflect chromatin
dynamics, such that over time a given interaction occurs in every
cell. Alternatively, they reflect genome conformations that are cell
specific and relatively stable after mitotic exit, implying that, in a
given nucleus, genomic loci sample overlapping or non-overlapping
nuclear subvolumes. If the latter were true, ‘jackpot’ cells should exist
with accumulated βh1 messenger RNA levels in combination with
frequent interchromosomal LCR–βh1 interactions. To investigate this
we carried out RNA FISH using a mixture of probes visualizing both
primary transcripts and accumulated mRNA (ref. 41; Fig. 5a,b). In
wild-type E14.5 fetal liver cells, we failed to detect the βh1 primary
transcript signals that were visible in E10.5 embryonic blood cells
(Supplementary Fig. S7), consistent with its marked drop in expression
during development. Accumulated cytoplasmic βh1 mRNA was seen
in only a small percentage (3.5%) of cells. These cells also contained the
highest adult β-globin transcript levels (Hbb-b1, or β-major) and we
therefore assumed they represent the most differentiated erythroid cells
in the liver that had most time to accumulate β-globin mRNA. Notably,
in only one out of 50 of these cells was an interchromosomal interaction
NATURE CELL BIOLOGY VOLUME 13 | NUMBER 8 | AUGUST 2011
© 2011 Macmillan Publishers Limited. All rights reserved.
947
ARTICLES
a E14.5 fetal liver
Relative
expression
3
Wild type
2
hLCR-AS
1
hLCR-S
0
Rad23 –/–
βh1
βh1
βh1
(1437810_a_at)(1437990_x_at) (1450736_a_at)
b E14.5 fetal liver
c E14.5 fetal liver
Relative
expression
Relative
expression
4
3
2
1
0
Litter:
1
0
βh1
I
βh1
II
βh1
III
βh1
I
βh1
II
βh1
III
βh1
d E14.5 fetal liver
Relative
expression
3
2
1
0
εy
βh1
β maj
β min
ζ-globin
α-globin
Figure 4 The ectopic LCR on chromosome 8 enhances the expression of the
endogenous βh1 gene on chromosome 7. (a) Affymetrix gene-expression
data for all probe-sets analysing βh1 transcripts (n = 3). (b) RT–qPCR
comparison of βh1 gene expression between multiple wild-type and
homozygous LCR-AS littermates, normalized to Hprt1 transcript levels and to
own wild-type littermates. (c) RT–qPCR analysis showing that the insertion
at 8C3–C4 of a neomycin selection cassette instead of the hLCR does not
lead to upregulation of the βh1 gene. Data from two independent samples.
(d) RT–qPCR analysis of expression of β- and α-globin genes in wild-type
and homozygous LCR littermates. Error bars: standard error on the basis of
3 littermates (n).
between 8C3–C4 and β-globin observed, which does not exceed the
interaction frequency of 5.1% measured across the entire wild-type
red-blood-cell population (Fig. 5c and Supplementary Table S2). On
the other hand, in LCR transgenic fetal liver cells the percentage of cells
with high βh1 mRNA levels is increased to 7.0%. More importantly,
in these transgenics 15/50 cells (30%) with high βh1 levels showed
an interchromosomal interaction between 8C3–C4 (with the LCR)
and β-globin, a highly significant (P = 6.6 × 10−7 , hypergeometric
test) increase in interaction frequency when compared with the
8.5% measured across all red blood cells in the transgenics (Fig. 5c
and Supplementary Fig. S8a,b). As a control, we analysed α- versus
β-globin co-localization in transgenic cells and found no correlation
between this interchromosomal interaction and βh1 expression levels
(Supplementary Fig. S8c).
We next reversed the question and asked whether cells showing
the interchromosomal LCR–βh1 interaction also had increased βh1
transcript levels in their cytoplasm as compared with other red blood
cells in the transgenic fetal livers. For this, we developed automated
image-analysis software (see Methods) and analysed thousands of
cells with respect to their intensity of βh1 RNA FISH signal in the
cytoplasm. The analysis showed that in transgenics, but not in wild
type, cells with the interchromosomal interaction between 8C3–C4 and
948
the β-globin locus more often have high βh1 levels than the other cells
from the same tissue (Fig. 6a). Interestingly, albeit less pronounced,
the same was found for β-major (Fig. 6b), the adult β-globin gene
located next to βh1 in the endogenous globin locus (Fig. 1a). Its
natural extremely high expression level precluded uncovering an extra
contribution from the few cells with this interchromosomal interaction
by cell-population-based expression assays such as microarrays or
RT–qPCR. However, our analysis at the single-cell level revealed that
also this adult β-globin gene benefits from contacts with the extra copy
of the LCR. βh1 is looped away from the endogenous mouse LCR and
as such may be available for contacts with the ectopic human LCR.
β-major dynamically forms and breaks contacts with the endogenous
LCR (ref. 7), possibly providing opportunity for the ectopic LCR to also
engage in contacts and further boost transcription. The genes encoding
α-globin did not benefit from contacts with the ectopic LCR, as judged
from the single-cell analysis strategy (Fig. 6c). We conclude that our
transgenic mice contain a unique population of cells that have increased
levels of mRNA for β-globin because these genes on chromosome 7 are
contacted and trans activated by the ectopic LCR on chromosome 8.
DISCUSSION
One of the main challenges of the post-genomic era is to assign
function to genomic sites, many of which have regulatory potential.
Clearly, this cannot be done without considering the dynamics and
spatial configuration of the genome. Here, we uncover properties of
nuclear organization that dictate the action of regulatory elements in
nuclear space. Our findings should contribute to a working model
of genome function. The results demonstrate that regulatory DNA
elements can search for preferred interaction partners, which in the
case of the LCR are genes controlled by shared transcription factors.
The ability to roam the nucleus is however heavily constrained by the
chromosomal context. We predict the same to be true for almost all
genomic locations, although the degree of constraint may vary. The
concept of chromosomal context heavily influencing a gene’s specific
nuclear location seems to contradict more deterministic models of
nuclear organization, where functionally related genes are proposed to
meet at dedicated nuclear sites20 . We cannot exclude that the LCR
would have a more notable effect when placed at other genomic
locations, or that other enhancers exist that are better capable of
repositioning chromosomes and forming specific interchromosomal
interactions. We note however that very few, if any, regulatory elements
have been described with such a strong influence on gene expression
and chromatin organization as the β-globin LCR.
An important finding of this study was that the ectopic, orphan, LCR
on chromosome 8 contacted many different genes in cis and in trans,
including those sharing a similar set of regulatory proteins, but that no
measurable effect on the expression of most of them was detected.
This suggests that, in mammals, ultralong-range gene regulation
within and between chromosomes will be rare, or at least difficult
to measure in cell populations. Two endogenous β-globin genes on
chromosome 7 were the exception, as they were both upregulated
by the ectopic LCR in cells with the relevant interchromosomal
interaction. As these are natural (mouse) target genes of the (human)
LCR, promoter compatibility and spatial proximity seem essential for
transcription regulation over distance. Interestingly, a few examples
exist of endogenous tissue-specific enhancers activating not only target
NATURE CELL BIOLOGY VOLUME 13 | NUMBER 8 | AUGUST 2011
© 2011 Macmillan Publishers Limited. All rights reserved.
ARTICLES
a RNA FISH
c
RNA FISH
E14.5 fetal liver
8C3–C4 versus β-globin locus
b RNA FISH
I
II
III
IV
V
βh1
β maj
8C3–C4
DAPI
Co-localization
frequency (%)
P<< 0.001
30
All
erythroid
cells
20
β h1
'jackpot'
cells
10
NS
0
n = 1,400 50
775 50
Wild type hLCR-AS
Figure 5 Increased βh1 mRNA levels in cells showing interchromosomal
LCR–βh1 interactions. (a) RNA FISH on E14.5 fetal liver cells, with
one cell showing strongly increased βh1 mRNA levels in the cytoplasm
(‘jackpot cell’). Scale bars: 2 µm. (b) Enlargement of the ‘jackpot cell’,
revealing an interchromosomal interaction between the endogenous β-globin
locus on chromosome 7 and the ectopic LCR on chromosome 8. (I–IV)
Probes from one focal plane are shown separately and merged. (V) Z stack
showing all RNA signals for βmaj and 8C3–C4. (c) Quantification of RNA
FISH. Determining interchromosomal interaction frequencies between the
endogenous β-globin locus and 8C3–C4 without (wild type) and with an
integrated LCR (hLCR-AS), in all red blood cells and in ‘βh1 jackpot cells’.
The number of cells analysed (n) is indicated. NS: no significant difference.
genes but also non-target genes that happen to be in physical proximity
to the enhancer42,43 . Our results open the possibility that such bystander
activation may be more common in the genome, but appreciable only
in individual cells that have their genome folded such that an enhancer
and gene happen to be within contacting distance.
Our data provide genetic evidence for classical enhancer activity
between mammalian chromosomes, where the genetic addition or
deletion of a regulatory DNA element on one chromosome causes
increased or reduced expression of a physically interacting gene on
another chromosome. As such, we provide formal in vivo evidence that
mammalian regulatory sites do not need an intervening chromatin
fibre to propagate activating signals to responding gene promoters
elsewhere in the genome, but that spatial proximity, in combination
with enhancer–promoter compatibility, is sufficient for gene activation.
Interchromosomal interactions between mammalian regulatory sites
and genes have been observed before, but genetic evidence for trans
activation was lacking so far. For example, the alternatively expressed
TH 1 and TH 2 cytokine loci were seen to come together before their
activation in naive T cells. On differentiation to T-helper 1 or 2
cells, the interactions between these signature loci were lost and the
respective genes turned on25 . The functional consequences of this
interaction seemed complex, however, and different from classical
enhancer activity. The deletion of a regulatory element in the TH 2
locus caused a delay, rather than a reduction, in the expression of
the TH 1 gene, and intriguingly this effect was measurable only in
differentiating TH 1 cells that no longer showed the interchromosomal
interaction. The interaction was proposed to prepare loci for proper
expression during subsequent T-helper cell specification, an activity
not previously assigned to regulatory sites25 . Interchromosomal gene
regulation by a single enhancer was suggested to control the expression
of all ∼1,200 olfactory receptor genes spread across the genome23 , but
deletion of the enhancer demonstrated that the enhancer only affects
genes in cis44,45 . In another study the activation of human interferon
beta (IFN -β) expression in response to viral infection was reported
to coincide with interchromosomal interactions with three Alu repeat
elements harbouring cryptic NF-κB sites21 . Although transfection
experiments with plasmids carrying these elements supported the idea
that the DNA interactions boost IFN -β expression, formal evidence
for interchromosomal enhancer activity awaits demonstration that
the chromosomal deletion of one of these repeats causes a drop
in IFN -β expression. Finally, the imprinting control region (ICR)
of the H 19-Igf2 locus has been the subject of several studies on
interchromosomal DNA interactions22,24,46 . The data did not reveal
trans activation and were not necessarily consistent, as each study
identified different interchromosomal interactions with different
functional outcomes, possibly owing to the use of different cell types
and/or experimental approaches.
An interesting observation from our artificial system is that
interchromosomal interactions can lead to variegated levels of
accumulated transcripts in the individual cells. We propose to term the
observed phenomenon that cell-specific long-range DNA interactions
cause variable gene expression levels among otherwise identical
cells ‘spatial effect variegation’ (SEV; Fig. 6d). Stochastic cell-to-cell
variation in gene expression, or transcriptional noise, is common in cell
populations47 . Our data open the possibility that SEV may be one of the
underlying mechanisms of transcriptional noise. In such a scenario, the
nature of the interacting region will determine whether gene expression
goes up or down in the corresponding cell. This is different from
position effect variegation48 , where variable expression of ectopically
placed genes is classically thought to be caused by repressive effects
from directly surrounding chromatin. Future research should indicate
if SEV is acting on endogenous genes. If so, it may provide specific cells
within a larger population with a mechanism to make autonomous
cell-fate decisions, without the need for external signalling.
METHODS
Methods and any associated references are available in the online
version of the paper at http://www.nature.com/naturecellbiology
Note: Supplementary Information is available on the Nature Cell Biology website
ACKNOWLEDGEMENTS
We thank J. Marteijn and W. Vermeulen for providing Rad23a knockout material,
Y. Oz for counting FISH slides, V. Buckle for providing BAC probes, J. van Rheenen
and the Hubrecht Imaging Center for help with image analysis and E. Splinter and
other members of the group for assistance. This work was financially supported by
NATURE CELL BIOLOGY VOLUME 13 | NUMBER 8 | AUGUST 2011
© 2011 Macmillan Publishers Limited. All rights reserved.
949
ARTICLES
Pixel count top 1%
1,000
500
100
50
10
5
1
P = 0.0082
0
5,000
0.2
0.4
Quantile rank
0.6
Wild type
Pixel count top 1%
Pixel count top 1%
Pixel count top 1%
hLCR-AS
5,000
1,000
500
100
50
10
5
1
0
β -major—8C3–C4
b
0.2
0.4
Quantile rank
0.6
c
hLCR-AS
5,000
1,000
500
100
50
10
5
1
P = 0.0127
0.2
0.4
0.6
Quantile rank
0
5,000
α-globin—8C3–C4
Pixel count top 1%
β h1—8C3–C4
a
hLCR-AS
5,000
1,000
500
100
50
10
5
1
0
0.2
0.4
Quantile rank
0.6
Wild type
1,000
500
Interacting
100
50
Non-interacting
10
5
1
0
0.2
0.4
Quantile rank
0.6
d Spatial effect variagation:
Gene
Regulatory
element
t=0
t = +1
t = +2
Time
Figure 6 Increased βh1 and β-major mRNA levels in cells showing
interchromosomal LCR–βh1 interactions. (a,b) Automated RNA-FISH
image-analysis (see Methods) results, showing that cells in which
the ectopic LCR interacts in trans with the endogenous β-globin
locus more often have high βh1 (a) or β-major (b) transcript levels
than cells that have the loci apart. (c) Cells in which the ectopic
LCR interacts in trans with the endogenous α-globin locus do not
differ in their levels of mRNA for α-globin from cells without this
interchromosomal interaction. The probability score for the difference
in distributions for interacting and non-interacting cells is calculated by
a one-sided Kolmogorov–Smirnov test. (d) SEV: variegated expression
among otherwise identical cells caused by cell-specific long-range DNA
interactions (intra- or interchromosomal) that are relatively stable during
interphase.
grants from the Dutch Scientific Organization (NWO) (91204082 and 935170621)
and a European Research Council Starting Grant (209700, ‘4C’) to W.d.L.
5.
AUTHOR CONTRIBUTIONS
D.N. and W.d.L. designed the experiments, analysed the data and, with help from
E.d.W., wrote the manuscript. D.N. and P.K. carried out experiments. E.d.W.
analysed 4C data and developed the automated FISH image analysis. H.v.d.W.
analysed 4C and microarray expression data. M.S. carried out 4C experiments. M.L-J.
and R.H.S. designed and synthesized RNA-FISH probes. B.E. and A.d.K. helped with
the FISH experiments.
6.
9.
COMPETING FINANCIAL INTERESTS
The authors declare no competing financial interests.
10.
Published online at http://www.nature.com/naturecellbiology
Reprints and permissions information is available online at http://www.nature.com/
reprints
1.
2.
3.
4.
950
Boyle, A. P. et al. High-resolution mapping and characterization of open chromatin
across the genome. Cell 132, 311–322 (2008).
Amano, T. et al. Chromosomal dynamics at the Shh locus: limb bud-specific
differential regulation of competence and active transcription. Dev. Cell 16,
47–57 (2009).
Kleinjan, D. J. & Coutinho, P. Cis-ruption mechanisms: disruption of cis-regulatory
control as a cause of human genetic disease. Brief Funct. Genomic Proteom. 8,
317–332 (2009).
Epner, E. et al. The β-globin LCR is not necessary for an open chromatin structure or
developmentally regulated transcription of the native mouse β-globin locus. Mol. Cell
2, 447–455 (1998).
7.
8.
11.
12.
13.
14.
15.
16.
Grosveld, F., van Assendelft, G. B., Greaves, D. R. & Kollias, G. Position-independent,
high-level expression of the human β-globin gene in transgenic mice. Cell 51,
975–985 (1987).
Wasylyk, B., Wasylyk, C., Augereau, P. & Chambon, P. The SV40 72 bp repeat
preferentially potentiates transcription starting from proximal natural or substitute
promoter elements. Cell 32, 503–514 (1983).
Wijgerde, M., Grosveld, F. & Fraser, P. Transcription complex stability and chromatin
dynamics in vivo. Nature 377, 209–213 (1995).
Carter, D., Chakalova, L., Osborne, C. S., Dai, Y. F. & Fraser, P. Long-range chromatin
regulatory interactions in vivo. Nat. Genet. 32, 623–626 (2002).
Tolhuis, B., Palstra, R. J., Splinter, E., Grosveld, F. & de Laat, W. Looping and
interaction between hypersensitive sites in the active β-globin locus. Mol. Cell 10,
1453–1465 (2002).
de Laat, W. & Grosveld, F. Spatial organization of gene expression: the active
chromatin hub. Chromosome Res. 11, 447–459 (2003).
Spilianakis, C. G. & Flavell, R. A. Long-range intrachromosomal interactions in the
T helper type 2 cytokine locus. Nat. Immunol. 5, 1017–1027 (2004).
Vernimmen, D., De Gobbi, M., Sloane-Stanley, J. A., Wood, W. G. & Higgs, D. R.
Long-range chromosomal interactions regulate the timing of the transition between
poised and active gene expression. EMBO J. 26, 2041–2051 (2007).
Brown, K. E. et al. Association of transcriptionally silent genes with Ikaros complexes
at centromeric heterochromatin. Cell 91, 845–854 (1997).
Chambeyron, S., Da Silva, N. R., Lawson, K. A. & Bickmore, W. A. Nuclear
re-organisation of the Hoxb complex during mouse embryonic development.
Development 132, 2215–2223 (2005).
Skok, J. A. et al. Nonequivalent nuclear location of immunoglobulin alleles in B
lymphocytes. Nat. Immunol. 2, 848–854 (2001).
Hewitt, S. L. et al. Association between the Igk and Igh immunoglobulin loci
mediated by the 3’ Igk enhancer induces ‘decontraction’ of the Igh locus in pre-B
cells. Nat. Immunol. 9, 396–404 (2008).
NATURE CELL BIOLOGY VOLUME 13 | NUMBER 8 | AUGUST 2011
© 2011 Macmillan Publishers Limited. All rights reserved.
ARTICLES
17. Lundgren, M. et al. Transcription factor dosage affects changes in higher order
chromatin structure associated with activation of a heterochromatic gene. Cell 103,
733–743 (2000).
18. Noordermeer, D. et al. Transcription and chromatin organization of a housekeeping
gene cluster containing an integrated β-globin locus control region. PLoS Genet. 4,
e1000016 (2008).
19. Ragoczy, T., Bender, M. A., Telling, A., Byron, R. & Groudine, M. The locus
control region is required for association of the murine β-globin locus with
engaged transcription factories during erythroid maturation. Genes Dev. 20,
1447–1457 (2006).
20. Schoenfelder, S. et al. Preferential associations between co-regulated genes reveal
a transcriptional interactome in erythroid cells. Nat. Genet. 42, 53–61 (2010).
21. Apostolou, E. & Thanos, D. Virus infection induces NF-κB-dependent interchromosomal associations mediating monoallelic IFN-β gene expression. Cell 134,
85–96 (2008).
22. Ling, J. Q. et al. CTCF mediates interchromosomal colocalization between Igf2/H19
and Wsb1/Nf1. Science 312, 269–272 (2006).
23. Lomvardas, S. et al. Interchromosomal interactions and olfactory receptor choice.
Cell 126, 403–413 (2006).
24. Sandhu, K. S. et al. Nonallelic transvection of multiple imprinted loci is organized
by the H19 imprinting control region during germline development. Genes Dev. 23,
2598–2603 (2009).
25. Spilianakis, C. G., Lalioti, M. D., Town, T., Lee, G. R. & Flavell, R. A.
Interchromosomal associations between alternatively expressed loci. Nature 435,
637–645 (2005).
26. Lewis, E. B. The theory and application of a new method of detecting
chromosomal rearrangements in Drosophila melanogaster. Am. Naturalist 88,
225–239 (1954).
27. Pirrotta, V. Transvection and chromosomal trans-interaction effects. Biochim.
Biophys. Acta 1424, M1-8 (1999).
28. Bolzer, A. et al. Three-dimensional maps of all chromosomes in human male
fibroblast nuclei and prometaphase rosettes. PLoS Biol. 3, 826–842 (2005).
29. Bacher, C. P. et al. Transient colocalization of X-inactivation centres accompanies
the initiation of X inactivation. Nat. Cell Biol. 8, 293–299 (2006).
30. Hewitt, S. L. et al. RAG-1 and ATM coordinate monoallelic recombination
and nuclear positioning of immunoglobulin loci. Nat. Immunol. 10,
655–664 (2009).
31. Xu, N., Donohoe, M. E., Silva, S. S. & Lee, J. T. Evidence that homologous
X-chromosome pairing requires transcription and Ctcf protein. Nat. Genet. 39,
1390–1396 (2007).
32. Branco, M. R. & Pombo, A. Intermingling of chromosome territories in interphase
suggests role in translocations and transcription-dependent associations. PLoS Biol.
4, 780–788 (2006).
33. Strouboulis, J., Dillon, N. & Grosveld, F. Developmental regulation of a complete
70-kb human β-globin locus in transgenic mice. Genes Dev. 6, 1857–1864 (1992).
34. Dekker, J., Rippe, K., Dekker, M. & Kleckner, N. Capturing chromosome
conformation. Science 295, 1306–1311 (2002).
35. Simonis, M. et al. Nuclear organization of active and inactive chromatin domains
uncovered by chromosome conformation capture-on-chip (4C). Nat. Genet. 38,
1348–1354 (2006).
36. de Wit, E., Braunschweig, U., Greil, F., Bussemaker, H. J. & van Steensel, B.
Global chromatin domain organization of the Drosophila genome. PLoS Genet. 4,
e1000045 (2008).
37. Simonis, M., Kooren, J. & de Laat, W. An evaluation of 3C-based methods to capture
DNA interactions. Nat. Methods 4, 895–901 (2007).
38. Drissen, R. et al. The erythroid phenotype of EKLF-null mice: defects in hemoglobin
metabolism and membrane stability. Mol. Cell Biol. 25, 5205–5214 (2005).
39. Hodge, D. et al. A global role for EKLF in definitive and primitive erythropoiesis.
Blood 107, 3359–3370 (2006).
40. Welch, J. J. et al. Global regulation of erythroid gene expression by transcription
factor GATA-1. Blood 104, 3136–3147 (2004).
41. Femino, A. M., Fay, F. S., Fogarty, K. & Singer, R. H. Visualization of single RNA
transcripts in situ. Science 280, 585–590 (1998).
42. Cajiao, I., Zhang, A., Yoo, E. J., Cooke, N. E. & Liebhaber, S. A. Bystander gene
activation by a locus control region. EMBO J. 23, 3854–3863 (2004).
43. Lower, K. M. et al. Adventitious changes in long-range gene expression caused by
polymorphic structural variation and promoter competition. Proc. Natl Acad. Sci.
USA 106, 21771–21776 (2009).
44. Fuss, S. H., Omura, M. & Mombaerts, P. Local and cis effects of the H element on
expression of odorant receptor genes in mouse. Cell 130, 373–384 (2007).
45. Nishizumi, H., Kumasaka, K., Inoue, N., Nakashima, A. & Sakano, H. Deletion of
the core-H region in mice abolishes the expression of three proximal odorant receptor
genes in cis. Proc. Natl. Acad. Sci. USA 104, 20067–20072 (2007).
46. Zhao, Z. et al. Circular chromosome conformation capture (4C) uncovers extensive
networks of epigenetically regulated intra- and interchromosomal interactions. Nat.
Genet. 38, 1341–1347 (2006).
47. Raj, A. & van Oudenaarden, A. Nature, nurture, or chance: stochastic gene expression
and its consequences. Cell 135, 216–226 (2008).
48. Muller, H. J. Types of visible variations induced by X-rays in Drosophila. J. Genet.
22, 299–334 (1930).
NATURE CELL BIOLOGY VOLUME 13 | NUMBER 8 | AUGUST 2011
© 2011 Macmillan Publishers Limited. All rights reserved.
951
METHODS
DOI: 10.1038/ncb2278
METHODS
Gene targeting and generation of transgenic mice. Targeting of the human
β-globin LCR to the mouse Rad23a gene has been described18 . The human Aγ-globin
gene was targeted to the mouse Rad23a gene by substituting the ClaI Neo-resistance
cassette from an existing Rad23a targeting construct by a 7.6 kb ClaI fragment
containing a TK-Neo resistance cassette coupled to a ClaI–SmaI fragment containing
the human Aγ-globin gene with a gene encoding GFP at the translational start.
Constructs with the ClaI fragment in two orientations were obtained: γ-globin-S
(50 -γ-globin at the 50 -end of the Rad23a gene) and γ-globin-AS (50 -γ-globin gene
at the 30 -end of the Rad23a gene). Targeting in Ola129-derived embryonic stem
cells, blastocyst injection and breeding to obtain homozygous transgenic animals
in an FVB background has been described18 . Genotyping was carried out by
Southern blot. Animal experiments were carried out according to institutional
and national guidelines (Committee on Experiments with Laboratory Animals
(DEC-Consult); Ministry of Agriculture, Nature and Food Quality, The Hague,
The Netherlands).
Gene expression analysis. Total RNA was isolated using Trizol (Invitrogen) from
tissues of at least two independent embryos. Complementary DNA was synthesized
using SuperScriptII Reverse Transcriptase and oligo(dT)12–18 primer according to
the manufacturers’ instructions (Invitrogen). Products were quantified by qPCR,
using platinum Taq DNA polymerase (Invitrogen) and SYBR Green (Sigma) on
an Opticon 2 Real-Time PCR detection system (BioRad). Sequences of primers are
provided (Supplementary Table S3). Transcript levels were normalized to the Hprt1
transcript.
FISH. DNA FISH and cryo-FISH was carried out as described before18,35,49 .
Bacterial artificial chromosome (BAC) clones (BACPAC Resources Centre) used to
visualize genomic regions are listed in Supplementary Table S3. For DNA–FISH,
two adjacent BAC probes, RP24-136A15 and RP24-319P23, were labelled with
SpectrumGreen-dUTP (green, Vysis) or ChromaTide Texas Red-12-dUTP (red,
Invitrogen). BAC probes for cryo-FISH were labelled with SpectrumGreen-dUTP
(green) or ChromaTide Alexa Fluor 594-5-dUTP (red; Invitrogen). Probe specificity
was confirmed on mouse spleen metaphase spreads. 500 ng of labelled probe was
co-precipitated with 5 µg of mouse Cot1 DNA (Invitrogen). Images were collected
with a Zeiss Axio Imager Z1 epifluorescence microscope (100× plan apochromat,
1.4 oil objective) equipped with a CCD (charge-coupled device) camera. DNA-FISH
images were analysed with Zeiss AxioVision software (Zeiss). Cryo-FISH images
were analysed with Isis FISH Imaging System software (Metasystems). Filters
used for DNA FISH: DAPI (Zeiss), fluorescein isothiocyanate, AF594 (Chroma).
Filters used for cryo-FISH: DAPI, fluorescein isothiocyanate, RD-TR-PE (Zeiss).
No bleed-through was detected and images were collected without saturation
of intensities. The significance of co-localization (DNA FISH and cryo-FISH)
was determined by applying a replicated goodness-of-fit test (G-statistic). The
null hypothesis in DNA-FISH experiments was that integration of the LCR or
LCR and Aγ-globin gene resulted in similar co-localization frequencies as the
wild-type 8C3–C4 alleles. The null hypothesis in cryo-FISH experiments was that
associating regions identified in the 4C analysis had co-localization frequencies
comparable to those of negative regions. In DNA FISH a minimum of 150
alleles were counted for each genotype; in cryo-FISH a minimum of 500 alleles
were counted by a person not knowing the probe combination applied to
the sections.
To simultaneously detect accumulated mRNA in the cytoplasm and the
nuclear position of transcribed loci, RNA FISH was carried out50 with some
adjustments. E14.5 fetal livers were resuspended in 200 µl PBS and 20 µl was
spotted on a poly-l-lysine slide (Sigma) and air-dried for 5 min at RT before
fixation. For hybridization, slides were rehydrated into PBS plus magnesium and
equilibrated in 50% formamide/2× SSC for 20 min. Hybridization was carried
out three times overnight at 37 ◦ C, followed by a series of post-hybridization
washes and mounting in Prolong Gold antifade reagent (Molecular Probes).
Images were collected on a Leica DM6000 Fluorescence microscope (100 plan
apochromat, 1.4 oil objective), using Leica filters 11513888 (Y5), 11513887 (Y3),
11513890 (GFP) and 11513874 (DAPI), and analysed using Leica Application
Suite software. RNA probes for Hbb-bh1 and Hbb-b1 were synthesized as
described previously50 . A mixture of six different probes was labelled with Cy3
or Cy5 (GE Healthcare). Probe sequences are listed in Supplementary Table S3.
BAC clones RP23-319P23 and 14567 (Incyte), analysing respectively the actively
transcribed locus 8C3–C4 and the active α-globin locus, were labelled with
SpectrumGreen-dUTP (Vysis) as described previously35 . For each combination
of probes (Hbb-bh1 and Hbb-b1 with either 8C3–C4 or α-globin), two to five
slides were independently hybridized and analysed. Images with five to 25 cells
were checked for ‘jackpot cells’, with high levels of βh1 (Hbb-bh1) transcript
in their cytoplasm. After identification of these ‘jackpot’ cells, images for other
probes were opened and cells were analysed for co-localization between the
active endogenous β-globin locus (visualized using Hbb-b1 transcript probes) and
the locus of interest (8C3–C4 or α-globin). Loci were scored as co-localizing
if two spots overlapped or touched in a focal plane without black pixels in
between. Counting results are summarized in Supplementary Table S2. Results were
reproducible between different slides and between different investigators analysing
RNA-FISH slides.
Automated image analysis. For this study, we developed software to automatically
identify and analyse large numbers of microscopy images. In summary: we used the
DAPI and Rad23a signal to define cell boundaries in each image (on average ∼10–20
cells/image). Of all pixels inside cells in an image, we selected pixels with the 1%
highest intensity in the βh1 and β-major signal. Next, we determined the number
of high-intensity pixels for every cell, which is expected to be roughly equal under
the null hypothesis of homogeneous expression.
The R package EBImage (ref. 51) was used for the automated image analysis.
Within a z-stack, the focal point was determined by calculating the absolute
difference between the actual and the blurred version of the image. Because this
is highest in the image with the highest contrast, the focal point is the image
where the sum of this difference is maximal. We binarized Rad23a and DAPI
images by applying a threshold function (thresh) on both channels. This yields two
images with binary information, which are combined into a single binary image
by setting to a value of one pixels that are one in either or both of the original
images (logical OR function). Subsequently, a distance map (distmap) is calculated
using the binary image52 . Cell boundaries were defined by carrying out a watershed
(watershed) function on the distance map53 . 83 wild-type images yielded 1,757 cells;
149 hLCR-AS images yielded 1,563 cells; 214 α-globin images were analysed, yielding
2,108 cells.
Because of the differences in illumination settings between images, a nonparametric threshold rather than an absolute threshold was used for scoring
expression levels. Of all pixels found within cells we selected the 1% with highest
intensity. Homogeneous expression levels would return an equal distribution of
pixels in the cells. However, we see enrichment of high pixels in a population of
high-expressing cells. Cells were then classified as ‘interacting’ (active 8C3–C4 and
β-globin loci overlap or touch), or ‘non-interacting’ (four nascent transcript signals
(two 8C3–C4 and two βmaj ), but no overlap). The top 1% intense pixels in the
βh1 channel (‘intense pixels’) were defined and counted per cell. Per category, cells
were sorted according to their number of intense pixels. Probability scores for the
difference in distributions for interacting and non-interacting cells were calculated
by a one-sided Kolmogorov–Smirnov test.
4C analysis. 4C sample preparation was carried out as described35 , with HindIII
as a primary restriction enzyme, DpnII (wild type) or NlaIII (LCR-AS) as secondary
frequent cutter and EcoNI as tertiary restriction enzyme for linearization. Sequences
of primers are in Supplementary Table S3. Custom Nimblegen arrays carrying
4C probes for chromosomes 7, 8, 10, 11, 12, 14 and 15 were used (release
mm6 of the UCSC build mouse genome; ref. 2). The liftOver tool (UCSC)
was used to transfer probe position to the locations in release mm9 (NCBI37).
Competitive hybridization of 4C product versus genomic DNA was carried out as
described35 . Normalization of the array data was carried out with ‘lowess’, using
standard settings54 . Subsequently, we corrected for differences in ranges between
datasets. Because of the asymmetric, non-normal distribution of the data, standard
variance normalization is not suitable; therefore, we used the median absolute
deviation (a non-parametric estimator for the variance). For the visualization of
DNA interactions we adapted the domainogram algorithm36 , enabling multiscale
visualization of 4C signals. First, 4C data are binarized by calling probes that show
a more than threefold increase of median absolute deviation over the median
positive. Next we take the union (∪) of the positive signals in the two replicates,
which is used as input for subsequent analysis. A probability score for enrichment
of positive 4C signals with a given genomic region is calculated by a binomial
test, and for every chromosome we determine the probability (P) of observing
positive 4C signals by dividing the total number of positive signals by the total
number of probes. For a range of window sizes (w from 2 to 200) enrichment
of positive signals is determined in window w with given P. By carrying out this
analysis using sliding windows, we can analyse entire chromosomes and construct
a multiscale chromosomal map by visualizing the matrix of −log10 P-values of
chromatin interactions.
To identify differentially interacting regions we calculated the average of the two
median absolute deviation normalized replicates followed by the running median
(window size of 21) for both viewpoints for every trans chromosome. Windows with
a difference of two (log 2 scale) were scored as differentially interacting. Genes within
50 kb of a differentially contacted probe were scored as differentially contacted genes.
The resulting genes were intersected with a list of genes that were differentially
regulated on EKLF (refs 38,39) or GATA-1 (ref. 40) deletion, or the 1% most highly
expressed genes. Enrichment for EKLF and/or GATA-1 regulated genes among
NATURE CELL BIOLOGY
© 2011 Macmillan Publishers Limited. All rights reserved.
METHODS
DOI: 10.1038/ncb2278
differentially contacted genes was calculated by comparing this ratio to that of all
EKLF and/or GATA-1 regulated genes among all genes.
Accession numbers. Data are available from Gene Expression Omnibus (GEO):
GSE24614 and GSE5891.
49. Solovei, I. et al. Spatial preservation of nuclear chromatin architecture during
three-dimensional fluorescence in situ hybridization (3D-FISH). Exp. Cell Res. 276,
10–23 (2002).
50. Levsky, J. M., Shenoy, S. M., Pezo, R. C. & Singer, R. H. Single-cell gene expression
profiling. Science 297, 836–840 (2002).
51. Pau, G., Fuchs, F., Sklyar, O., Boutros, M. & Huber, W. EBImage–an R package
for image processing with applications to cellular phenotypes. Bioinformatics 26,
979–981 (2010).
52. Kolountzakis, M. N. & Kutulakos, K. N. Fast computation of the Euclidian distance
maps for binary images. Inform. Process. Lett. 43, 181–184 (1992).
53. Vincent, L. & Soille, P. Watersheds in digital spaces: an efficient algorithm based on
immersion simulations. Pattern Analysis and Machine Intelligence, IEEE Trans. 13,
583–598 (2002).
54. Smyth, G. K. & Speed, T. Normalization of cDNA microarray data. Methods 31,
265–273 (2003).
NATURE CELL BIOLOGY
© 2011 Macmillan Publishers Limited. All rights reserved.
S U P P L E M E N TA R Y I N F O R M AT I O N
DOI: 10.1038/ncb2278
βh1
Figure S1 Spatial organization of the endogenous mouse β-globin locus. Cartoon showing the spatial topology of the mouse endogenous β-globin locus in
definitive red blood cells at E14.5 and later stages of development, as inferred from 3C data. At this stage, βh1 is ignored by its own LCR in the fetal liver.
WWW.NATURE.COM/NATURECELLBIOLOGY
1
© 2011 Macmillan Publishers Limited. All rights reserved.
S U P P L E M E N TA R Y I N F O R M AT I O N
A
Wildtype
H: HindIII
D: DpnII
N: NlaIII
forward: TCACACGCGAAGTAGGCC
reverse: CCTTCCTCCACCATGATGA
H
D
H
HH
H
H
Rad23a
hLCR-AS forward: ACACTTTCAGTCCGGTCC
reverse: AGATTTCCTGTTCACTCACTG
N
H
H
Pu
B
Wildtype
H
hLCR-AS
H
5
4
H H
3
H
H
H
2
HH H
1
C Human β-globin locus
LCR
ε
Gγ Aγ
δ
β
max
4C / control
FL hLCR-AS1
0.0
max
FL hLCR-AS2
0.0
Figure S2 4C analysis of 8C3/C4 with and without the human globin LCR. (a)
Location of restriction fragments and primer sets used for the amplification
of 4C material from wildtype and transgenic animals. Different restriction
fragments and primer sets have been used to analyze DNA interactions
with wildtype 8C3/C4 and 8C3/C4 carrying the integrated human LCR in
either orientation. Only relevant restriction sites of frequent cutters have
been indicated. (b) Gel electrophoresis analysis of PCR amplified wildtype
and transgenic 4C material shows the reproducibility between replicate
samples and the differences between amplified 4C material generated with
different primer sets. (c) Normalized, non-smoothened, 4C signal intensities
for probes mapping to the human δ-globin locus which was present as a
control locus on the dedicated 4C microarray carrying probes for 7 mouse
chromosomes. Positive signal is restricted to the human LCR, while probes
mapping to other parts of the human globin locus are negative.
2
WWW.NATURE.COM/NATURECELLBIOLOGY
© 2011 Macmillan Publishers Limited. All rights reserved.
S U P P L E M E N TA R Y I N F O R M AT I O N
60000
hLCR
chr 8
chr 7
chr 15
40000
chr 14
chr 12
chr 11
20000
chr 10
0
probe count per chromsome
A
1
2
3
4
5
6
1
2
3
4
5
6
1
2
3
4
5
6
60000
40000
20000
0
probe count per chromsome
Wildtype
1.1
1.0
0.9
0.8
0.7
0.6
ratio cis−probes (hLCR/Wildtype)
1.2
B
increasing 4C signal
Figure S3 With the LCR, the ratio of high 4C signal on inter- versus intrachromosomal probes increases. (a) Average normalized 4C signals over two
replicates were sorted in an ascending manner for the wildtype and LCR
experiment. Probes were subsequently divided into six equal-sized bins. Barplots
show for each bin the frequency of probes for every chromosome. Note that in
bin 6 (probes with highest 4C signal) chromosome 8 (the cis chromosome) is
overrepresented, but less so with the LCR than in wildtype. (b) Barplot showing
the ratio of cis probe frequencies between hLCR and WT for the 4C signal
bins. The relative decrease in the amount of cis probes with high intensities is
proportional to more contacts with trans chromosomes by the hLCR.
WWW.NATURE.COM/NATURECELLBIOLOGY
3
© 2011 Macmillan Publishers Limited. All rights reserved.
S U P P L E M E N TA R Y I N F O R M AT I O N
A
4C running mean analysis
Chromosome 8
0
20
40
60
80
100
120
max
hLCR-AS 1
0
max
hLCR-AS 2
0
max
Wildtype 1
0
max
Wildtype 2
0
B
Correlation between 4C replicates:
hLCR
hLCR
Wildtype
hLCR-AS 1
hLCR-AS 2
Wildtype
hLCR-AS 1
hLCR-AS 2
WT 1
WT 2
1
0.64
0.65
0.65
1
0.64
0.59
1
0.77
WT 1
WT 2
Figure S4 Intra-chromosomal contacts of 8C3/C4 without and with LCR
are essentially similar. (a) 4C running mean data in cis of 4 independent
E14.5 fetal liver samples, 2 with and 2 without the ectopic LCR. Despite
1
amplification with different primer sets, the overall chromosome-wide
interaction pattern is comparable. (b) Pairwise correlation coefficients
(Spearman’s rank) between all experiments.
4
WWW.NATURE.COM/NATURECELLBIOLOGY
© 2011 Macmillan Publishers Limited. All rights reserved.
S U P P L E M E N TA R Y I N F O R M AT I O N
4C signal enrichment probability
Wildtype
hLCR-AS
Wildtype
Chromosome 10
Chromosome 14
Chromosomal position (Mb)
Chromosomal position (Mb)
Chromosome 11
Chromosome 15
Window size
Window size
Chromosomal position (Mb)
Window size
hLCR-AS
Chromosomal position (Mb)
Window size
Wildtype
Chromosome 12
Window size
hLCR-AS
Chromosome 7
Window size
A
Chromosomal position (Mb)
Chromosomal position (Mb)
B
GATA1
EKLF
895
All probes:
131
7
119
1053
2
914
GATA1
Probes
preferentially
contacted by
hLCR-AS:
EKLF
28
119
96
88
0
7
total probes:
n=324712
74
1% highest
expressed
1% highest
expressed
Figure S5 The genomic environment of 8C3/C4 without and with LCR is
essentially similar, but the LCR preferentially interacts with genes controlled by
shared transcription factors. (a) Domainograms showing interactions of 8C3/C4
with (top) and without (bottom) the integrated human LCR for all chromosomes
analyzed. Window size: 0-200 probes. (b) Venn diagrams showing the total
number of probes at or around (-/+ 50 kb) the most highly expressed genes,
GATA1-regulated genes and EKLF-regulated genes on the microarray (left) and
among the regions preferentially contacted by the LCR (right).
WWW.NATURE.COM/NATURECELLBIOLOGY
5
© 2011 Macmillan Publishers Limited. All rights reserved.
S U P P L E M E N TA R Y I N F O R M AT I O N
B
Affymetrix gene-expression analysis
Affymetrix gene-expression analysis
E14.5 Fetal liver
4.0
4.0
3.0
3.0
Relative
expression
E14.5 Fetal liver
Relative
expression
A
2.0
1.0
0.0
n=
3
3
3
CalR
3
3
CalR
3
CalR
3
3
3
Syce2
2.0
1.0
0.0
3
3
3
3
Scyl3
(1456170_x_at) (1433806_x_at) (1417606_a_at) (1429270_a_at) (1448717_at)
C
n=
Gcdh
3
3
3
Cdc20 Lsm14a
3
3
βh1
3
3
βh1
3
3
Lactb
(1434365_a_at) (1439394_x_at) (1428437_at) (1437810_a_at) (1437990_x_at) (1449014_at)
D
RT-qPCR analysis
E14.5 Fetal liver
RT-qPCR analysis
E14.5 Fetal liver
primary transcript level
3.0
1.0
0.0
E
2.0
Relative
expression
Relative
expression
2.0
n=
3
3
Scyl3
3
3
4
4
3
Cdc20 Lsm14a
3
4
1.0
0.0
4
Wildtype
hLCR-AS
hLCR-S
βh1
Lactb
βh1
Affymetrix gene-expression analysis
E14.5 Fetal liver
Relative
expression
1.0
0.5
0.0
n=
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
εy
εy
εy
βmaj
ζ-globin
α-globin
α-globin
(1450621_a_at)
(1436823_x_at)
(1436717_x_at)
(1417184_s_at)
(1448716_at)
(1428361_x_at)
(1452757_s_at)
Figure S6 The ectopic LCR on chromosome 8 activates a single gene
in trans. (a) Affymetrix gene-expression data of previously identified
upregulated genes surrounding the integrated LCR. (b) Affymetrix geneexpression data identifying over twofold upregulated genes in trans. (c)
Verification of upregulation in trans by RT-qPCR analysis identifies the
endogenous βh1 gene as the only gene upregulated over twofold in trans.
(d) RT-qPCR of βh1 primary transcript levels demonstrating increased
transcription activity of the βh1 gene at the nascent RNA level. Data
from two independent samples. (e) Affymetrix gene-expression data for
all probe-sets analyzing additional genes in the β- and α-globin loci.
Error bars: Standard Error based on the number of littermates indicated
below (n).
6
WWW.NATURE.COM/NATURECELLBIOLOGY
© 2011 Macmillan Publishers Limited. All rights reserved.
S U P P L E M E N TA R Y I N F O R M AT I O N
A
B
RNA FISH
E10.5 embryonic blood
RNA FISH
E14.5 fetal liver
βh1
βmaj
DAPI
Figure S7 RNA FISH analysis with probes specific for βh1 and βmaj transcripts. In E10.5 embryonic blood, βh1 primary transcripts and mRNA is detected,
but βmaj transcripts not. Scale bars: 2 μm
WWW.NATURE.COM/NATURECELLBIOLOGY
7
© 2011 Macmillan Publishers Limited. All rights reserved.
S U P P L E M E N TA R Y I N F O R M AT I O N
A
RNA FISH
E14.5 FL
1
B
C
2
zoom ins:
1
Z-stack:
1
RNA-FISH
E14.5 Fetal liver
Co-localization
frequency (%)
30
α-globin locus vs.
β-globin locus
zoom ins:
2
Z-stack:
2
βh1
8C3/C4
βmaj
DAPI
20
10
0
NS
n=574
n=44
hLCR-AS
Figure S8 RNA-FISH showing increased βh1 mRNA levels in cells with
inter-chromosomal LCR-βh1 interactions. (a) RNA-FISH on E14.5 fetal
liver cells, with one cell in both fields showing strongly increased βh1
mRNA levels in the cytoplasm (‘jackpot cell’). (b) Zoom in on the ‘jackpot
cell’ highlighted in a., showing an inter-chromosomal interaction between
the endogenous β-globin locus on chromosome 7 and the ectopic LCR on
chromosome 8. The zoom-in shows separate images of three channels from
a single focal plane and in the bottom right a image of the merged βmaj and
8C3/C4 is shown. The z-projection shows the entire z-stack highlighting
all primary transcript-signals for βmaj and 8C3/C4. Scale bars: 2 μm (c)
No increased βh1 transcript levels in cells showing an interchromosomal
interaction between the β- and α-globin locus. Quantification of interchromosomal interactions in LCR-AS transgenics between the β-globin locus
on chromosome 7 and the α-globin locus on chromosome 11, in e14.5
fetal liver total erythroid cells and in ‘βh1 jackpot cells’. Number of cells
analyzed (n) is indicated.
8
WWW.NATURE.COM/NATURECELLBIOLOGY
© 2011 Macmillan Publishers Limited. All rights reserved.
S U P P L E M E N TA R Y I N F O R M AT I O N
Supplementary Tables
Table S1 Genomic regions that are differentially contacted when the LCR is integrated in 8C3/C4. Genomic regions are ranked based on their difference in 4C
signal. The presence of EKLF and GATA-1 regulated genes in these regions is indicated.
Table S2 FISH results. (a) Interaction frequencies measured in E14.5 fetal liver cells as measured by RNA-FISH. (b) Interaction frequencies measured
in E14.5 fetal liver cells, between the endogenous β-globin locus on chromosome 7 and 8C3/C4 on chromosome 8, without (WT) and with (hLCR-AS) the
integrated hLCR. Measurements were made by cryo-FISH (which necessarily focuses on alleles), DNA –FISH (where interactions were scored only in cells
showing signals for all 4 alleles) and RNA-FISH (where interactions were scored only in the 50-80% of red blood cells in the fetal liver (i.e. cells showing
cytoplasmic β-major transcripts). Note that cryo-FISH gives much lower interaction frequencies, as expected from a technique providing ~5 -fold higher
resolution. Note also that frequencies measured by different techniques are difficult to compare, as they all score interactions differently.
Table S3 List of primers and BACs.
WWW.NATURE.COM/NATURECELLBIOLOGY
9
© 2011 Macmillan Publishers Limited. All rights reserved.
A. RNA-FISH results Analyzed loci 8C3/C4 vs β-globin 8C3/C4 vs β-globin α-globin vs β-globin Tissue Total β-globin
expressing
cells hLCR-AS Wildtype hLCR-AS 775 1400 574 Colocalization in βglobin expressing
cells
cells % 66 71 15 8.5 % 5.1 % 2.6 % βh1 positive cells
Colocalization in
(among β-globin
expressing cells)
cells % βh1 positive cells
50 50 44 cells % 15 1 1 30.0 % 2.0 % 2.2 % 6.5 % 3.6 % 6.7 % B. Comparison of FISH results (interaction frequency 8C3/C4 – β-globin locus)
FISH technique Tissue Cryo-FISH hLCR-AS Wildtype hLCR-AS Wildtype hLCR-AS Wildtype DNA-FISH
RNA-FISH
Total cells 534 262 129 90
775
1400
counted 8C3/C4 alleles Cells with DNAFISH signals Cells with βmaj
mRNA signal
Colocalization n
%
12 6 8 8
66
71
2.3 % 2.3 % 6.2 % 8.9 %
8.5 %
5.1 %
© 2011 Macmillan Publishers Limited. All rights reserved.
Significance of difference
(Fisher exact test)
Not significant Not significant p = 0.00437
4C primers
Primer set
Sequence
Rad23a
TCACACGCGAAGTAGGCC
CCTTCCTCCACCATGATGA
ACACTTTCAGTCCGGTCC
AGATTTCCTGTTCACTCACTG
hLCR-AS
Gene ID and Affymetrix probes
Gene name
Gene ID (ENSEMBL)
Rad23a
CalR
ENSMUSG00000003813
ENSMUSG00000003814
Syce2
ENSMUSG00000003824
Scyl3
Cdc20
Lsm14a
βh1 (Hbb-bh1)
ENSMUSG00000026584
ENSMUSG00000006398
ENSMUSG00000066568
ENSMUSG00000052217
Lactb
εy (Hbb-y)
ENSMUSG00000032370
ENSMUSG00000052187
βmaj (Hbb-b1)
βmin (Hbb-b2)
ζ-globin (Hba-x)
α-globin (Hba-a1)
ENSMUSG00000073940
ENSMUSG00000052305
ENSMUSG00000055609
ENSMUSG00000069919
Affymetrix Probes
1417606_a_at
1433806_x_at
1456170_x_at
1429270_a_at
1448717_at
1434365_a_at
1439394_x_at
1428437_at
1437810_a_at
1437990_x_at
1450736_a_at
1449014_at
1436823_x_at
1436717_x_at
1450621_a_at
1417184_s_at
1448716_at
1428361_x_at
1452757_s_at
Gene expression analysis
Primer set
Sequence
mRNA Hprt
AGCCTAAGATGAGCGCAAGT
ATGGCCACAGGACTAGAACA
TTCACCACCATGGAGAAGGC
GGCATGGACTGTGGTCATGA
AGGTGCTGACTTCCTTGGG
GGGTGAATTCTTTGCCGAA
TGGACAACCTCAAGGAGAC
AGTAGAAAGGACAATCACCAAC
GAACTTGTCCTCTGCCTCT
ATCACCAGCACATTACCCA
ATGCCAAAGTGAAGGCCCAT
CCCAGCACAATCACGATCAT
ATCCCAAGGTGAAGGCCCAT
CCCAGCACAATCACGATCGC
GAGAGAGCTATCATCATGTCC
AAGTAGGTCTTCGTCTGGG
TGGCCATGGTGCTGAATATG
TCTTGCCGTGACCCTTGAC
TCTGGGAGTTGAGACTGTGA
TGGACCCATGGACTCTAACA
CTTACCATCTGGACTTGCTG
GGGTGACGGAGTGTCTTTA
TGCTCCATCCTCTGGTCT
CGTGCTGTGTGTCCTTTG
CCACCCAAACCACAATGT
GGACTGAACTGACTGTATGC
CGTGGTTGGAGTTTCTGTAG
TGCTGATGCTTGCGATTC
mRNA Gapdh
mRNA human Aγ-globin
mRNA βh1
mRNA εy
mRNA βmaj
mRNA βmin
mRNA ζ-globin
mRNA α-globin
Primary transcripts βh1
mRNA Scyl3
mRNA Cdc20
mRNA Lsm14a
mRNA Lactb
© 2011 Macmillan Publishers Limited. All rights reserved.
BAC clones for DNA-FISH and Cryo-FISH
BAC clone
RP23-317H16
RP23-370E12
RP23-32C19
RP23-265I23
RP23-27B18
RP24-136A15
RP24-319P23
RP23-87K3
RP24-130O14
RP23-375D18
14567 (Incyte)
RP24-236L11
RP23-311P1
RP23-258M10
RP23-450E9
RP24-255K10
Locus
(Cytological band)
7D3, negative control
7E3, β-globin locus
7F3, negative control
7F3
7F5
8C3 (previously 8C3/C4)
8C3 (previously 8C3/C4)
10B1
10B5.3, negative control
11A4, α-globin locus
11A4, α-globin locus
11B5
11D/E1
14C3
14D2
15B3.1
Chromosome
Position (Mb)
(NCBI assembly m37)
7
7
7
7
7
8
8
10
10
11
11
11
11
14
14
15
93
111
131
134
148
87
87
41
74
32
32
78
102
56
70
37
RNA-FISH probes
Probe
Sequence
βh1 5UTR
GTGAGGTCTAGAAGCTTGGAGATGATCTCAAGTGTGCAAAAGCCAGAATG
βh1 Exon 1
TCCAAGTCCACTTTATCCCAGATGCTTGTGATAGCTGCCTTCTCCTCAGC
βh1 Intron 1
TGTGCCACAAAACCCTATAGAAACCCTGGAAATTTCTGCCATGCATAAGG
βh1 Intron 2
TCAAACCCAAGGCCCCAGAAATGCTGGGCGCTCACTCAAATCTGCACCCA
Alternate βh1 3'UTR 1
GGGCATCATAGACACATGGGATTGCCAGTGTACTGGAATGGAGTTTAATG
Alternate βh1 3'UTR 2
TTAAACAAAATTTGTGCTCTCAATGCCTAATCCAGTCCCCATGGACTCAA
βmaj 5’-UTR1
CTGTTTCTGGGGTTGTGAGTCAACACAACTATGTCAGAAGCAAATGTGAG
βmaj Exon1
TTCATCGGAGTTCACCTTTCCCCACAGGCAAGAGACAGCAGCCTTCTCAG
βmaj Intron1
GGAAAGCAGACCTCTGTCTCCAAGCACCCAACTTCTTCTTGTGAGCTGCC
βmaj Intron2
ACTCCACACACAGTCATGGAGACTGCTCCCTAGAATCGCTTCCCCTTTTT
Alternate bmaj 3’-UTR1
CTTGGGAACAATTAACCATTGTTCACAGGCAAGAGCAGGAAAGGGGGTTT
Alternate bmaj 3’-UTR2
AGAAGACAGATTTTCAAATGTCTATCATTTTGCCAACAACTGACAGATGC
© 2011 Macmillan Publishers Limited. All rights reserved.