Conserved sequence motifs upstream from the co

Volume 14 Numb** 2i 1986
Nucleic Acids Research
Conserved sequence motifs upstream from the co-ordinately expressed vfteUogenin and
apoVLDLII genes of chicken
Fred van het Schip, Rein Strijlcer, John Samallo, Max Grubcr and Geert AB
Department of Biochemistry, Groningen University, Nijenborgh 16, 9747 AG Groningen, The
Netherlands
Received 15 July 1986; Revised and Accepted 30 September 1986
ABSTRACT
The v i t e l l o g e n i n and apoVLDLII y o l k p r o t e i n genes of chicken are
transcribed 1n the l i v e r upon estrogenization. To get Information on putat i v e r e g u l a t o r y elements, we compared more than 2 kb of t h e i r 5' f l a n k i n g
DMA sequences. Connon sequence m o t i f s were found 1n regions, e x h i b i t i n g
estrogen-Induced changes 1n chromatin structure. Stretches of alternating
pyrinidines and purines of about 30-nucleot1des long are present at roughly
s i m i l a r positions. A d i s t i n c t box of sequence homology i n the chicken genes
also appears to be present at a s i m i l a r p o s i t i o n 1n f r o n t of the v i t e l l o genin genes of Xenopus laevis, but 1s absent from the estrogen-responsive
egg-wh1te protein genes expressed i n the oviduct. In f r o n t of the v i t e l l o genin (position -595) and the VLDLII gene (position -548), a DNA element of
about 300 base-pairs was found, which possesses structural characteristics
of a mobile genetic element and bears homology to the t r a n s p o s o n - l i k e V1
element of Xenopus laevis.
INTRODUCTION
The induction by estrogen of yolk p r o t e i n synthesis 1n the l i v e r of
oviparous vertebrates provides an attractive model for the study of hormoneregulated and tissue-specific gene expression (1-4). The vitellogenin (Vtg)
and apo Very Low Density Lipoproteinll (VLDLII) genes encode the major yolk
protein precursors synthesized In chicken l i v e r .
Expression of these genes
appears to be similarly regulated 1n many respects. Both genes are normally
t r a n s c r i b e d only 1n the l i v e r of l a y i n g hens, but can be a c t i v a t e d in the
l i v e r s of roosters, Immature chicks or even 11-days old embryos by injection
of estradiol (5-8). Induction of transcription Is very rapid and completely
r e v e r s i b l e . Transcripts from the genes appear w i t h i n a few hours a f t e r
e s t r a d i o l a d m i n i s t r a t i o n and r a p i d l y decline to undetectably low levels
after hormone withdrawal (6,9). In addition, preferential stabilization of
Vtg and VLDLII mRNAs by estrogen has been observed (9).
Estrogen-induced
expression
© IR L Press Limited, Oxford, England.
of the Vtg and VLDLII genes In l i v e r is as8669
Nucleic Acids Research
sod a ted with structural changes In the chromatin encompassing these genes.
Nuclease-hypersensitive sites, which are correlated with expression, appear
In equivalent regions In front of both genes shortly after estrogen Induct i o n (10-12).
These common characteristics suggest that the genes might share similar
regulatory sequence elements. The molecular organization and sequence of the
5'-end regions of the VLDLII (13,14) and Vtg (11,15,16) genes have been
described e a r l i e r . In the present paper we compare more than 2 kb of 5'
flanking DNA of both genes. These sequences and their homologies are discussed and compared to regulatory elements I d e n t i f i e d 1n other systems.
MATERIALS AND METHODS
DNA cloning and sequencing
B g l l l - f r a g m e n t s of 2.0- and 3.0-kb, covering the VLDLII 5' gene r e g i o n ,
were I s o l a t e d from X Charon4A clone 12 (see Ref.13 and Refs. t h e r e i n ) and
subcloned In pBR322. Sau3AI fragments of these subclones were cloned 1n
phage M13mp9 (17) and sequenced according to the chain-termination method
(18). Sequences 1n the 2.0-kb B g l l l fragment were p a r t i a l l y determined by
the sequencing s t r a t e g y of Hong (19), using randomly generated d e l e t i o n
mutants.
EcoRI-fragments of 0.6, 1.36, and 0.54 kb, covering the 5'-end region of
the Vtg gene, were I s o l a t e d from X Charon4A clone 22 (20). A 3.6-kb BamHI
fragment, p a r t i a l l y overlapping these fragments , was subcloned 1n pBR322
and digested w i t h Sau3AI and P s t I ; the same holds f o r the 1.36-kb EcoRI
fragment. Subsequently, a l l fragments were cloned Into the M13 derivatives
mp8 and mp9 and sequenced. To determine sequences from both strands, short
stretches of DNA were occasionally sequenced by the chemical modification
procedure of Haxam and G i l b e r t (21) a f t e r 3 ' - or 5 ' - e n d - l a b e l l 1 n g of the
appropriate fragments (see F1gs. 1 and 2).
DNA sequence analysis
The nucleotide sequences were analyzed by using a two-dimensional dotmatrix program prepared by Dr. P. Terpstra 1n our laboratory as well as the
program NAQ of the Protein I d e n t i f i c a t i o n Resource system (22).
RESULTS AND DISCUSSION
We have sequenced and compared 5 ' - f l a n k 1 n g DNA of the chicken v i t e l l o genin gene and the VLDLII gene, spanning the regions from p o s i t i o n s -2030
8670
Nucleic Acids Research
Yl
-2500
gaattctact
tggagacatg
tagtggatcc
atctcttgtt
cactgacaca
tttactacga
gaggggctgc
tgactgcaaa
ccgagtggat
ctagcaacta
tatgggtacc
ttttgtgtga
tgtgctcttg
gttcattctc
ctagaacagg
catcagttca
cctcttgcct
ttacagaatg
ccgtgatttc
atacatgaat
gagtatagtc
aacataacct
agtagcacat
cagaaaaagg
aaagaggagt
taaaagtctc
-2000
acttgtgtgc
cttacttcct
aaaaactgat
tgctgttcct
gtactcatat
tacataacat
ccgtgtattg
tccctgcaac
ttgaagccca
ctgaaatagc
ctgggaacag
tttgaaggcc
ttcttgttga
ttagttcact
cgattttctg
gctacttggc
tactgctgag
cttatgaaac
aataaataca
agcaaaggtt
aatgtagatt
gggcaaaacc
caattttaaa
ttttttatca
gtttacataa
accatgcctg
ctttcttttt
taactgcccc
ccttgaaaga
atcaaagctg
tcctttattc
ataagtttgc
ttgtgaatag
aacgacatag
gatgaaatcc
aaacgtactg
aaataaagtg
acaatctatt
tgcaactgcc
ctttgtttat
aatgtgttct
tgcattttgt
aatctctgca
ctttgttcat
tatgttccat
tatctggatt
atactgattg
agtctcatct
tttattgtaa
gagatgccaa
actgataaaa
gcagagccct
-1500
tcttttttct
cccacgtcta
agcattcaaa
caatttagtt
aagtttttgt
tatgaagaaa
tttatcaact
tcttctgatg
agttgaaaaa
cattgtggtc
actgaagagc
gcactgctgg
acccagcttg
ctcctatttc
tgtgttatca
atttggttct
ggttttaatt
ttaaaaatat
atattgtttc
agcctctggc
ctgattgggt
gtggcaggac
atgccgtagt
ggtattattt
aacttgagaa
—•—r
-1000
tttttttttt
atctggaata
ttgtttgata
aaagcatcag
tagtagcctt
tggtcaacta
gaggcacaga
tgaattctgt
gtcaatgcct
attaaaacac
cagtatgtca
aagttgcacc
ctttggagga
tgagggtacc
atataaatca
gtaggaaatg
gttcacattt
tcctggtcag
tcagtagcct
ctgcaggaat
tattatcagc
catgtaccag
agaagtgttt
gattttcttt
ttcagcagaa
attttggagt
aaatagatga
gtttatttat
tccatggtgt
ggatttcttt
caaaaaaata
ttacacttgt
agcagttccc
gtgggactga
tttgccacaa
tagattttga
tccaaaagca
ccaacttttt
tgatgagctg
cagttagtga
caaaagttct
tgctcccatt
cgtgaccgga
cttaaatcat
ggccataaac
tagataacaa
cagccagccg
tactgtgata
attcgccgtg
aacagccacg
taccatgaag
taaagaatga
tatctgttgc
atgctatgtt
tatttgtgca
aaattagtta
ctgcttgagc
aaaacagagg
aaatcc^gaa
aacactcttg
attagaatat
tcctccaatg
tgtagggttt
tttccacact
tgaagttggc
agctgacctg
tactttggaa
gctgaaagaa
gtgcgttggt
caaagctgag
cttgggtcag
tgacccaatc
cattgaaact
aagagaattt
tgttcctgaa
cttttgtggg
aagggagatg
gatatgtctg
ttacttttca
aggcttaaaa
aaaatgtagg
ataaattggt
taggagaact
agagcagttg
tggtgagtcc
aactgaatgt
cagaagaccg
acaggaaccc
cctgacattt
tgcaagcctg
cacttctatc
gataaaatat
cacattgatc
gcacatatga
ggaagaggga
gtgccaggtc
taggaaagca
tctggtcaat
atgattgcaa
cattcttcca
-1951
-1871
-1791
-1711
-1631
-1551
-1471
-1391
-1311
-1231
-1151
-1071
- 991
- 911
- 831
- 751
- 671
- 591
- 511
- 431
- 351
- 271
- 191
- Ill
- 31
1
Figure 1 Nucleotide sequence of 2030 bp upstream of the Vtg gene.
The restriction map and sequence strategy are outlined above the sequence.
Sequences determined by the chemical modification procedure are Indicated by
• (5'-end-labeled fragments) and o (3'-end-labeled fragments). The numbering
Indicates positions relative to the cap site.
and -2594 to the transcription start sites (+1) of the respective genes
(Figs. 1 and 2).
Our data extend and correct sequences published by us and others (11,1316). By sequencing both strands, we can decide unambiguously between minor
conflicts 1n the earlier data. One major discrepancy between the VLDLII
sequence we report here, and that published by Hache £ t ^ 1 ^ (14), deserves
further comment. Upstream from the Haelll site at position -254 their
sequence completely deviates from ours. This Is caused by their positioning
at -254 of a Bglll-Haelll fragment which In our sequence lies much further
upstream, viz. -2594 to -2363. Since we have read the sequence through the
8671
Nucleic Acids Research
=
Hi
-2500
-2000
<
gatctggtcc
tatgccttgc
gctataaatg
ctttttaaaa
ttctaggttg
aagaaataaa
cacctaagaa
atgccatgtg
attaaagaac
tgccattcca
gaaagtgctg
gacaaaagct
ctcaggcagc
gacacttgag
gtgccttgtg
gaatttataa
tgtttgcacg
atctgtcttt
caagacagaa
tgtgacctca
acaatacaca
agagagtcca
tgttttgttg
ccatattgca
ggctctcgga
gttccctagg
agcctgcgca
ttggttaata
ggtcggcttt
tttggtcagt
cacgtctcct
tccatcctgc
ttctgcataa
acaaaccact
aactgcaagc
aaagcctttg
gagagagtaa
tactttactc
tatgatctca
tgaattttgt
gaacagataa
tatgtatttt
ggaatgaagg
aaaggccctg
accatagcac
acagcagttt
gttatattaa
caggtcttct
ttaaaaatga
agaagtggaa
ttttgcttcc
gataccttta
gttgtatcac
aagctccatg
tctgttagtt
gatctcacat
aattttctgc
gattgagcta
cagtctgggt
aaaaggcttg
catttgttgg
caacatgagc
ttatgaaagg
gtgctcaggt
tgagcagaat
atgccagtgt
-1500
1
gtggttcata
agacagaaaa
caatggcact
attttgcagc
atttttaact
tttgtgagtt
gtatgaaagt
ggagatccac
ttaatttgag
acaaggactg
ttgctatctg
aaacctcttc
aaaacacacc
attcaccccc
gaacgttatc
gagaacttaa
ctgaaaattc
attgggattt
ataaatactt
tgctgcactc
aagaaaagtc
gaaattcatg
tgtaggtttc
ctcctnyttca
tgattgagat
agaagttatg
gctacgggga
aaacagaggg
cttcccacag
ggcctctatg
cagactgacc
ttgcttgcaa
ctcagatgag
-1000
»
aagcactgat
gtaggaagga
ggtcaatacc
tgaaagactg
ttttagccat
attttagact
gcctagatag
gctgctcatg
taaaaaaggg
ctttttgtgc
tggggctgga
agatatacat
cttaatatgg
acaggtaatg
tcagctgtaa
aactcttgtt
tctgttctga
ttttggatca
catcgagaca
actgatttgg
ttctagaaaa
tcagtgactt
atgtatgcct
atggttttat
tgagctacat
cttcccctgt
aaaatgaact
tgaactgtga
gaacagtggg
acatggttgc
ttccattacc
aagagcaaat
catc
cagtggtctg
aatagactac
acgtagactt
cacagtgaaa
tcciaaaagc
tttgtttctt
ccactggagt
tcagctgctt
cttgcaatat
aatcattgat
ttcatgatag
attactacgt
ccagagactg
actgcttatt
aagcaggcag
gttcccgtgt
cttctagaag
aacgttcttc
gtggaaaaac
ctgccatatg
acgtgttggg
agtaggtggc
catgcctgca
gtgtgagata
tccacagcat
gagatcttgg
gtgagagtac
atgagcataa
agcaaaaatc
ctgaaaatgt
aaatccgaac
ggggaaacaa
<
-500
'
tgaaataatt
actttgaacg
tatttaacag
gaacagtagg
tgatgtttgt
cctgtcgtgc
tatctgtgtt
tccacctcag
ctggctttct
ttgggcttgg
cactgggtaa
agggcagcta
ccatctgttg
gctatttctg
taaatctgta
gttcttccta
taaccttgtt
atgtttcagg
tgggaggcca
tgtaaaaggg
ctataaataa
taatactact
ccttttcctc
ctgatggttg
ggcactacag
caagtggctg
Cgtatgtata
tagacacaaa
agtccagctg
agggggctca
aacaggtcca
agcaggacct
aaaaaggaaa
cttatttctg
tgagattaca
aaatagttga
gtgtcaccca
aggtgaagga
aaaataaaga
accaaagatt
tgcttatttg
caattgcctg
agttagacaa
aattcccttt
acctgattta
ttaatttgaa
gagtctcctt
gaccataaat
agacattagt
gggctttttc
tgggcagccc
taatgcttac
agatactggt
ttatgtcact
agctaatgcc
accttactga
atgctttctg
agatttacac
catatacctg
taatttgcag
tagctacagt
gtgacccagg
gagtcctaca
ttgacccctc
tctatatata
tttttaatta
aggcctttga
gcttcattct
tttgtgatga
atatacctta
ataggtgact
cataagaaag
cagagcactc
attggcagta
acttagcagg
gttttactac
gaagagagat
tgcctgtaat
agtgattgta
tgctttttag
aataattttc
tttgtttttc
tctttaatga
tgtttggttt
tgagcttttc
gacataagat
atttgttgct
agacccaatg
actgggcaca
ccattttgtc
tgtgcaccag
ctgtcaaagt
taatgtggaa
agctgccttc
cagcttcttg
actatattag
^2515
-2435
-2355
-2275
-2195
-2115
-2035
-1955
-1875
-17 95
-1715
-1635
-1555
-1475
-1395
-1315
-1235
-1155
-1075
- 995
- 915
- 835
- 755
- 675
- 595
- 515
- 435
- 355
- 275
- 195
- 115
- 35
1
Figure 2 Nucleotide sequence of 2594 bp upstream of the VLDLII gene. The
r e s t r i c t i o n map and sequence strategy are o u t l i n e d above the sequence.
Sequences determined by the chemical modification procedure are Indicated by
• (5'-end-labeled fragments) and o (3'-end-1abeled fragments). The numbering
Indicates positions relative to the cap site.
H a e l l l s i t e at -254, we I n f e r that our arrangement Is the c o r r e c t one.
Moreover, our sequence and arrangement agrees with the r e s t r i c t i o n map of
the genoraic ONA determined by the other group (23) and ourselves (24). A
cloning a r t i f a c t Is therefore excluded. As a consequence, some of the homology elements noted by Burch (11) are affected: Element 2, constituting an
Imperfect Inverted repeat of element 1 (position -200), was I n i t i a l l y pos1-
8672
Nucleic Acids Research
.1000 .900
.800
.700
.600
.500
_<00
.380
.200
J00
1 (cop)
via.
repeots
1-»
enhancer ewes
i-,
v
- — 2 9 4 bp
"tronsposon"
«-i
v
Py/Pu
"20bp"
VLOLD
hss
»
repeats
enhancer cores
»
2-» 2-»
w
i«
276 bp b H
"Ironsposon"
p w,H
/
Py/Pu
(
»
?
"20 bp"
F i g u r e 3 Common s t r u c t u r a l f e a t u r e s 1n the 51 f l a n k i n g regions of the Vtg
and VLDLII gene. Nuclease-hypersens1t1ve s i t e s (hss) are given by black
t r i a n g l e s ( • ) . Open t r i a n g l e s ( V ) I n d i c a t e p o s i t i o n s of sequences r e sembling enhancer core elements: 5'-CTTTCCTAGA-3' (at Vtg -282),
5'-ACTGGTTTTG-3' (at Vtg -337), 5'-GTGGAAGG-3' ( a t VLDL -199),5'-GTGGGAAG-3'
(at VLDL -334), 5'-GTTGAAAG-3' (at VLDL -348) a l l 1n the lower strand and
5'-GTGGAATT-3' (at VLDL -280) 1n the upper strand. Arrows Indicate repeated
sequences: 5'-GTTTTTTATCAG-3' ( a t Vtg - 8 9 , -181 and -393); 5'-TGAACTGTGA-3'
(at VLDL -404 and -480). Py/Pu, °20 bp° and "transposon" refer to sequence
elements discussed 1n the text.
tioned around -290 but Is actually located about 2 kb further upstream.
Common structural elements In the regions exhibiting estradiol-induced
changes In chromatin structure
From studies on the chromatin structure of genes I t has emerged that the
upstream regions proximal to active genes, or genes which can be activated
1n the tissue studied, often contain sites which are hypersensitive to
DNAsel (25-27). These sites are believed to mark nucleosorae-free regions
Involved In the control of gene expression (25-27). The regions 5' to the
Vtg gene (10,11) and 5' to the VLDLII gene (12) contain several nucleasehypersensitive sites which are Induced by estradiol and only occur In liver.
The Vtg gene 1s marked by six sites, five of which have already been reported (11) and the sixth one recently been observed In our laboratory (K.Kok
£ t iU, to be published) , a l l mapping within the f i r s t 900 base-pairs
proximal to the cap s i t e . The four sites found for the VLDLII gene map
within the 500 nucleotide sequence preceding the transcription I n i t i a t i o n
site (12). In a search for common sequences that might have a bearing on the
8673
Nucleic Acids Research
GRE--
VLDLII
-2332
TTTG(ftGCTGAAAGA]CTGCACA6 fGAAAGAACA 3TAGGAAATAGTTG
VITELL06ENIN
TTCCTGGTCAGCGTGACCG<£GC fGAAAGAt\CA :ATTGATCCCGTGA
-631
1
ER-blndlng oito- -
J
Figure 4 Putative estrogen-receptor binding sites within the Vtg and VLDLII
51 flanking regions. Homologous decanucleotides are boxed similarly. Direct
and Inverted repeats are Indicated by arrows. The putative estrogen-receptor
(ER) binding site 1s Indicated. A sequence showing 82% (VLDLII) and 94%
(Vtg) horaology to the consensus sequence of the glucocort1co1d-respons1ve
element (GRE) 1s Indicated.
special chromatin organization of these regions, we have analyzed and compared their nucleotide sequences.
In front of the Vtg as well as the VLDLII gene we find about 30-nucleotides long stretches of alternating pyrimidines and purines at roughly
similar positions viz. -533 and -468, respectively (F1g. 3). Alternating
pyrimidine/purine sequences may promote the Z-conformat1on of DNA (28) which
prevents binding of nucleosomes (29). Such sequences have a potentiating
a c t i v i t y on transcription 1n which capacity they are found 1n the SV40
enhancer region (30), which 1s essentially nucleosome-free (31). Horeover, a
9-bp alternating pyrimidine/purine element has been found to be essential
for transcription of the hunan metallothionein gene (32).
Ch
Ch
XI
XI
XI
XI
Vtg
VLDLII
Al
A2
Bl
B2
Consensus:
T TTAC
275 «- T T T . C
98* T T T G C
107 - T T T A C
202 -• T T T . G
203 - T T T . G
99 -
TTT
C
• G
A A A C T G AT A
A AA
A A A C T G AC C
A AT
A A C CT G A T A
C AA
A A C CT G AT A
C AG
A A A GT G AC C
A G G
A A A G T G AC G
A G G
AA
T A
AC
T G A
CG
C C
A AG
C G A
Figure 5 A sequence element conserved I n yolk protein genes. Numbers r e f e r
t o the 51 n u c l e o t i d e of the sequences shown. The d i r e c t i o n of the a r r o w s
I n d i c a t e whether the sequence I s I n t h e upper (-») or i n the l o w e r ( « - )
s t r a n d . N u c l e o t i d e s I n v a r i a n t i n a l l s i x genes are boxed. Ch: c h i c k e n . X I :
Xenopus l a e v i s ; sequences were taken from r e f . 41.
8674
Nucleic Acids Research
In addition to the alternating pyrimidine/purine sequences, the Vtg and
VLDLII 5'-flank1ng regions contain other motifs which are reminiscent of
enhancers. F i r s t l y , four elements occur In the -190 to -350 region of the
VLDLII gene which, 7 out of 8 nucleotides , match the SV40 enhancer core
sequence (33). Similarly, the Vtg gene 1s flanked by two segments matching
the IgCjj (34) or IgCk (35,36) enhancer core elements by 9 out of 10
positions. Secondly , multiple repeated sequences are present; two copies of
a 10-bp repeat in front of the VLDLII gene and three (Im)perfect copies of a
12-bp repeat 1n front of the Vtg gene are most conspicuous. All elements
discussed above are depicted in F1g. 3. The presence of several structural
features which are attributed to potential enhancers 1n the regions of
nuclease-hypersensitive sites may f i t Into the modular structure proposed
for enhancers (for a review see Ref. 37). In this context, the elements
found In front of the Vtg and VLDLII genes might have a role, although
perhaps not an essential one , as modulators of a larger enhancer region
regulating transcription.
Distribution of putative estrogen receptor binding sites within the
viteliogenin and VLDLII 5' flanking regions
The region around -600 of the Vtg gene, containing the estrogen-dependent
hypomethylated Hpall site (38,39), has been Implicated 1n the estrogen
induction of the gene (40). In vitro DNAsel protection experiments have
Indicated the sequence running from -621 to -597 as a possible target site
for the estrogen-receptor (see also Fig. 4). This sequence segment overlaps
the symmetrical element -GGTCANNNTGACC- which has also been found In front
of the chicken VLDLII gene as well as In front of the four Xenopus laevis
Vtg genes (41), although positions and copy number differ even within the
Xenopus Vtg gene family. Moreover, at position -587 to -604 1n the lower
strand of the chicken Vtg gene, a sequence resembling the glucocorticoid
response element Is located (42).
In view of these data, we screened the DMA flanking the VLDLII gene for
s i m i l a r i t i e s to the -0.6 kb Vtg gene region. When the sequence of the
proposed estrogen-receptor binding site was considered, only sequences of
limited homology (624) were found, viz. at positions -1629 and -1452.
However when we searched for subsets of the sequence more striking similarities were noticed. Two different decanucleotides, which 1n Vtg overlap and
form part of the receptor binding s i t e , are also found in front of the
VLDLII gene.viz. at -2.3 kb where they are 8 bp apart (see Fig. 4). In both
genes these sequences are adjoined by Inverted repeats. Interestingly, a
8675
Nucleic Acids Research
sequence resembling the glucocorticoid-responsive element Is also present In
the -2.3 kb VLDLII gene region In the same configuration as 1n the Vtg gene
(see F i g . 4).
A s i m i l a r regulatory role of these sequences Is not excluded by t h e i r
vastly different distances relative to their genes. Steroid-responsive elements generally have not been found at fixed positions In front of genes and
have been located sometimes as far as 2.6 kb as observed f o r the r a b b i t
uteroglobin gene (43).
A sequence element conserved within the 5' flanking region of genes
controlled by estradiol 1n l i v e r
Comparative analysis of the Vtg and VLDLII gene reveals a 20- nucleotide
block of sequence horaology, lying In opposite orientation, within the -100
to -300 region (F1g. 5). We found a s i m i l a r sequence 1n the Xenopus laevis
vitellogenin genes (41) at corresponding p o s i t i o n s . The derived consensus
sequence contains a I0-nucleot1de motif that 1s perfectly conserved within
the six yolk protein genes (see F1g. 5). However, this motif 1s absent from
the egg-wh1te protein genes, which are also estrogen-responsive but only expressed In the o v i d u c t , v i z . the ovalbumin, ovalbumin-Hnked X and Y,
conalbumin (44), ovomucoid and the lysozyme gene, (Genbank, release 26.0).
LENGTH
294 kp
Ch«tg
ATTTTC [TJA) ATGTSTTCTT6TGT
_^
-591
CTGAAAGAACACAT ITGAI TCCCCTGATTTCAATAAA
276
ChtLOLII
-827
^_
GTTTTC i t t l S GATCTCACATTGTA
_^
-544
TCCCCTGTGA6ATC ITTGI GCAAGT66CTGAGATTTA
448
X1A1/B1
CACAAC [TTAj G6C6CACATTTACT
ACrAAAIfiI£CCCC fcAA) ATGTGTGTGtATTATATT
189
IlBJLf
-727
CTACAC ICtAl GGGGCACATTCACQ
«
-S31
AcfAAATGTGCCCC fcTA| ATGTAGCA6ATTTTCTTC
CTACAC (£TA) tCCGCACATATACT
AGTAAATGTGCCCC ETAI ATGTAGCAGATTTTCTTC
X1A2
CAACAC Rf*l CCGGCACATTTACT
TCTAAATCTGCCCC WTAIATGATATAGATT6tATTT
11A1
CCGCAT |TGT] TCSGCAAAUTACC
GGCAAATTTGCCCAG^ST] TGATAAATGAGCCCCACT
112
-692
^__
Figure 6 Transposon-Uke elements 1n chicken and Xenopus laevis yolk protein
genes. The bounding Inverted repeats are Indicated by arrows above the
sequence. The flanking direct repeats are boxed. The length given for each
element Includes the Inverted repeat. Positions, where known exactly, are
given relative to the cap sites. Short sequence motifs that are frequently
present 1n regulatory sequences are underlined. The Xenopus sequences were
taken frora Schubiger et a]^ (45).
8676
Nucleic Acids Research
Nor 1s the element found 1n the liver-specific, but hormone-Independent
albumin gene. This sequence element common to yolk protein genes forms the
core of a block of homology between the chicken and Xenopus Al and A2 Vtg
genes noticed earlier (41). In that study, the homology between the chicken
gene element and those of the Xenopus B genes ( at position -90) was rather
poor. More consistent results are obtained with a sequence track around -200
of the B genes (see Fig. 5),y1eld1ng better homology to the chicken genes
and to all four Xenopus genes as well.
In the chicken Vtg and VLDLII gene, this conserved element coincides
with an expression-1Inked nuclease-hypersensitive site at position -120 and
-270, respectively, suggesting a regulatory function. Since the conserved
element flanks estrogen-controlled genes expressed 1n liver only.1t might
Influence their expression 1n a tissue-specific manner.
A transposon-11ke DNA element within the upstream regions of the
vitellogenin and VLDLII gene
When we surveyed the 5' flanking regions of the Vtg and VLDLII gene our
attention was drawn to a common structural feature at similar positions. I t
1s a sequence track of about 300 basepairs bounded by perfect Inverted
Vtg
-900
CGATTTTcEfiS-ATGTGTTCTTGTGTTATCAATATAAATCACAGTTAGTGATGAAGTTGGCTGCAAGCCTGCATCAGT*****
** *
*** *
**
* * **
**
**********
*
TT6TTTT6ffT6TGATCTCACATTGT
A6GTTTCAT6TAT6CCTCATGCCT6CACCTTTT
VLDL -835
l
2
--TCAGCTACTTGGCTGCATTTTGTATTTGGTTCTGTAGGAAATGCA'MAGTTCTAGCTGACCTGCAC
******* *
CCTCAGCTAATGC
***** **
*
* *
CATTT-GTTGCTCCATAT
******
+***
**
****
TTCTATCCCTCTTG--*
** ***
*
TGCAAATTTTCTGCCT--CCTGTTCAATGGTTTTATGTGTGAGATAC
CCTTACTGCTGAGAATCTCTGCAGGTTTTAATTGTTCACATTTTGCTCCCATTTACTTTGGAAGATAAAATATTTACAGA
******** **
* **
**** *
• **
*
* * *
*****
TGATGGTTGACCTTACTGAAGACCCAATGGGCTCTCGGAGATTGAGCTATGATTGAGATTGAGCTAEATTCTABAGCATGGCACTACAG-
ATGCTTATGAAACCTTT---GTTCATTTAAAAATATTCCTGGTCAGCGTGACCGGAGCTGAAAGAACACATtTGA(TCCCGTGATTT
******
*
****
*
* *
****
**
*
*
* *
*
*
***
ATGCTTTCTGACTGGGCACAGTTCCCTAGGCAGT
-582
*
-CTGGGTA6AAGTTAT6CTTCCCCT6T6A6ATC-trnJ6CAA6T6GCT6 -535
Figure 7 Nucleotide sequences of the transposon-like elements within the
upstream region of the Vtg and VLDLII gene. Dashes were Inserted to achieve
optimal alignment of homologous sequences. The Inverted repeats bounding the
elements are Indicated by thick arrows. The flanking direct repeats are
boxed. Corrispondingly numbered arrows Indicate direct or Inverted repeats.
An 8-bp sequence Identical to the enhancer core consensus (33) 1s boxed.
Consecutive stretches of homology with the B2 Lf V1 element of Xenopus
laevis (45) are Indicated by bold lines.
8677
Nucleic Acids Research
repeats, which In turn are flanked by s1te-spec1f1c three base-pair direct
repeats(F1g. 6), a structure reminiscent of mobile genetic elenents.
Interestingly, a transposon-Uke DNA element (V1 element) Interspersed 1n
the Vtg locus of Xenopus laevis has recently been described (46). Both
chicken elements bear homology 1n structure, length of the direct repeats
and sequence of the Inverted repeats to these V1 elements (F1g. 6).In
Xenopus, seven copies of the V1 element are present near or 1n the Vtg genes
whereas a total of about 7500 copies 1s dispersed throughout the genome
(45). We found no evidence for a multiple occurrence of these elements 1n
the chicken genome, when Southern blots of cloned DNA containing sequences
of the Vtg or VLDLII element were probed with nick-translated genomic DNA
at low stringency (data not shown).
The transposon-11ke elements 1n chicken and Xenopus differ 1n length;
however they share homologous nucleotide stretches up to 12 basepairs as
11 lustrated 1n F1g. 7 for the chicken Vtg and VLDLII genes and the Xenopus
B2 gene. As yet we do not know the functional significance of these elements. However, the presence of a demethylation site (see above), the
occurrence of direct and Indirect repeats and a sequence Identical to the
enhancer core consensus (see Fig. 7) as well as the recurrence of the socalled recognition sequences TGTG/CACA and GAGA/TCTC (46) In the Inverted
repeats bounding the elements (see F1g. 6), argue 1n favour of a possible
regulatory role.
Our sequence analysis has uncovered a number of elements conserved In the
estrogen-responsive genes expressed exclusively In liver. Their precise role
In the hormone- and tissue-specific expression of these genes, which are
transcribed at high frequency, has s t i l l to be established. The availability
of the sequence makes direct studies of the significance and Importance of
these elements, of their Interaction with factors, and of their possibly cooperative function possible.
ACKNOWLEDGEMENTS
We are g r a t e f u l t o Dr. P. Terpstra f o r assistance i n the computer analys i s . We thank B. Dikkeschel for assistance I n the i n i t i a l p a r t of t h i s work.
This I n v e s t i g a t i o n was c a r r i e d out under auspices of the Netherlands Foundat i o n f o r Chemical Research (S.O.N.) w i t h f i n a n c i a l a i d from the Netherlands
Organization f o r the Advancement of Pure Research (Z.W.O.).
REFERENCES
1.
Bergink, E.W., Wallace, R.A., Van den Berg, J . A . , Bos, E . S . , Gruber, M.
and AB, G. (1974) Am. Zool. 1±, 1177-1195.
8678
Nucleic Acids Research
2.
3.
4.
5.
6.
7.
8.
9.
10.
11.
12.
13.
14.
15.
16.
17.
18.
19.
20.
21.
22.
23.
24.
25.
26.
27.
28.
29.
Deeley, R.G. and Goldberger, R.F. (1979) In Ontogeny of receptors and
reproductive hormone action pp. 291-307, eds. Hamilton, T . H . , Clark,
J . H . and Sadler, W.A., Raven Press, New York.
Chan, L., Means, A.R. and O'Halley, B.W. (1978) 1n Vitamins and Hormones 36, 259-295.
Wahli, W., Dawid, I . B . , R y f f e l , G.U. and Weber, R. (1981) Science 212.
298-304.
Bergink, E.W., Kloosterboer, H . J . , Gruber, M. and AB, G. (1973) B1och1m.
Biophys. Acta 294, 497-506.
W1skoc1l, R., Bensky, P., Dower, W., Goldberger, R.F., Gordon, J . I . and
Deeley, R.G. (1980) Proc. N a t l . Acad. Sc1. USA 77_, 4474-4478.
Wieringa, B., Van der Zwaag-GerHtsen, J . , Mulder, J . AB, G. and
Gruber, H. (1981) Eur. J . Biochem. 114, 635-641.
Elbrecht, A . , Lazier, C.B., P r o t t e r , A.A. and U11 Haras, D.L. (1984)
Science 225, 639-641.
Noteborn, M.H.M., Bakker, 0 . , de Jonge, M.A.W., Gruber, M. and AB, G.
(1986) J . Steroid Biochem. 24, 14-18.
Burch, J.B.E. and Weintraub. H. (1983) Cell 33, 65-76.
Burch, J.B.E. (1984) Nucleic Adds Res. U_, 1117-1135.
Kok, K., Snippe, L., AB, G. and Gruber, H. (1985) Nucleic Adds Res.
U, 5189-5202.
Van het Schip, A . D . , M e i j l i n k , F.C.P.W., S t H j k e r , R., Gruber, M., van
V H e t , A . J . , van de Klundert, J . A . H . and AB, G. (1983) Nucleic Adds
Res. U, 2529-2540.
Hache, R.J.G., W i s k o d l . R., Vasa, M., Roy, R.N., Lau, P.C.K. and
Deeley, R.G. (1983) J . B1ol. Chem. 258, 4556-4564.
Walker, P., Brown-Luedi, M., Germond, J . - E . , WahH, W., M e i j U n k ,
F.C.P.W., van het Schip, A . D . , Roelink, H., Gruber, M. and AB, G.
(1983) EMBO J . 2, 2271-2279.
Geiser, M., M a t t a j , I.W., W1lks, A.F. and Jost, J . - P . (1983) J . B1ol.
Chem. 258, 9024-9030.
Messing, J . and V1e1ra, J . (1982) Gene ^ 9 , 269-276.
Sanger, F . , Coulson, A.R., B a r r e l l , B.G., Smith, A.J.H. and Roe, B.A.
(1980) J . Hoi. B1ol. 143, 161-178.
Hong, G.F. (1982) J . Mol. B i o l . J J 8 , 539-549.
Arnberg, A . C . , M e i j U n k , F.C.P.W., Mulder, J . , van Bruggen, E . F . J . ,
Gruber, M. and AB, G. (1981) Nucleic Adds Res. £ , 3271-3286.
Maxam, A. and G i l b e r t , W. (1980) In Methods In Enzymology, v o l . 65,
part I (Grossman, L. ed.) pp. 499-560 Academic Press, New York.
George, D.G., Barker, W.C. and Hunt, L.T. (1986) Nucleic Adds Res.
U, 11-15.
Colgan, V . , Elbrecht, A . , Goldman, P . , Lazier, C.B. and
Deeley,
R.G. (1982) J . B i o l . Chem. 257, 14453-14460.
M e i j l i n k , F.C.P.W. (1983) Ph.D. Thesis, The University of Groningen.
E l g i n , S.C.R. (1981) Cell 27, 413-415.
Igo-Kemenes, T . , Hflrz, W. and Zachau, H.G. (1982) Ann. Rev. Biochem.
51., 89-121.
E l g i n , S.C.R. (1984) Nature 309, 213-214.
Nordheim, A. and Rich, A. (1983) Nature 303, 674-679.
Nickol, J . . Behe, M. and Felsenfeld, G7T1982) Proc. N a t l . Acad. S d .
USA 79, 1771-1775.
8679
Nucleic Acids Research
30.
31.
32.
33.
34.
35.
36.
37.
38.
39.
40.
41.
42.
43.
44.
45.
46.
8680
Herr, W., Clarke, J . andGluzman, Y.(1985) In Eukaryotic transcription :
the role of e l s - and trans-acting elements (Ed. Y. Gluzman) pp. 27-32,
Cold Spring Harbor Laboratory, New York.
Jongstra, J . , Reudelhuber, T . L . , Oudet, P . , Benoist, C , Chae, C.B.,
J e l t s c h , J . H . , Mathis, D.J. and Chambon, P. (1984) Nature 307, 708-714.
Karin, H . , Haslinger, A., Holtgreve, H . , Richards, R . I . , Krauter, P . ,
Westphal, H.H. and Beato, H. (1984) Nature 308, 513-519.
Weiher, H . , Konig, M. and Gruss, P. (1983) Science 219, 626-631.
G i l l i e s , S . D . , Morrison, S . L . , 0 1 , V.T. and Tonegawa, S. (1983) Cell
3_3. 717-728.
Picard, 0. and Schaffner, W. (1984) Nature 307, 80-82.
Queen, C. and S t a f f o r d , J . (1984) J . Mol. C e l l . B i o l . 4_, 1042-1049.
Schaffner, W. (1985) In Eukaryotic transcription : the role of e l s - and
trans-acting elements (Ed. Y. Gluzman) pp. 1-18, Cold Spring Harbor
Laboratory, New York.
M e i j l i n k , F.C.P.W., PhiHpsen, J . N . J . , Gruber, H. and AB, G. (1983)
Nucleic Adds Res. 1J., 1361-1373.
Wilks, A . F . , Cozens, P . J . , M a t t a j , I.W. and Jost, J . P . (1982) Proc.
N a t l . Acad. Sc1. USA 7£, 4252-4255.
Jost, J . - P . , Seldran, H. and Geiser, H. (1984) Proc. N a t l . Acad. Sc1.
USA 81., 429-433.
Walker, P . , Germond, J . - E . , Brown-Luedi, M., G1vel, F. andWahU, W.
(1984) Nucleic Adds Res. 12^, 8611-8626.
Beato, M. (1984) Proc. of the 10th annual EMBO symposium on control of
eukaryotic gene transcription.
Cato, A.C.B., Geisse, S., Wenz, M., Westphal, H.H. and Beato, H. (1984)
EMBO J . 3, 2771-2778.
Cochet, M., Gannon, F . , Hen, R., Maroteaux, L., P e r r i n , F. and Chambon,
P. (1979) Nature 282, 567-574.
Schubiger, J . - L . , Germond, J . - E . , ten Heggeler, B. and Wahli, W. (1985)
J . Mol. B1ol. JL86, 491-503.
Nussinov, R. (1986) Biochim. Biophys. Acta 866^, 93-108.