Volume 14 Numb** 2i 1986 Nucleic Acids Research Conserved sequence motifs upstream from the co-ordinately expressed vfteUogenin and apoVLDLII genes of chicken Fred van het Schip, Rein Strijlcer, John Samallo, Max Grubcr and Geert AB Department of Biochemistry, Groningen University, Nijenborgh 16, 9747 AG Groningen, The Netherlands Received 15 July 1986; Revised and Accepted 30 September 1986 ABSTRACT The v i t e l l o g e n i n and apoVLDLII y o l k p r o t e i n genes of chicken are transcribed 1n the l i v e r upon estrogenization. To get Information on putat i v e r e g u l a t o r y elements, we compared more than 2 kb of t h e i r 5' f l a n k i n g DMA sequences. Connon sequence m o t i f s were found 1n regions, e x h i b i t i n g estrogen-Induced changes 1n chromatin structure. Stretches of alternating pyrinidines and purines of about 30-nucleot1des long are present at roughly s i m i l a r positions. A d i s t i n c t box of sequence homology i n the chicken genes also appears to be present at a s i m i l a r p o s i t i o n 1n f r o n t of the v i t e l l o genin genes of Xenopus laevis, but 1s absent from the estrogen-responsive egg-wh1te protein genes expressed i n the oviduct. In f r o n t of the v i t e l l o genin (position -595) and the VLDLII gene (position -548), a DNA element of about 300 base-pairs was found, which possesses structural characteristics of a mobile genetic element and bears homology to the t r a n s p o s o n - l i k e V1 element of Xenopus laevis. INTRODUCTION The induction by estrogen of yolk p r o t e i n synthesis 1n the l i v e r of oviparous vertebrates provides an attractive model for the study of hormoneregulated and tissue-specific gene expression (1-4). The vitellogenin (Vtg) and apo Very Low Density Lipoproteinll (VLDLII) genes encode the major yolk protein precursors synthesized In chicken l i v e r . Expression of these genes appears to be similarly regulated 1n many respects. Both genes are normally t r a n s c r i b e d only 1n the l i v e r of l a y i n g hens, but can be a c t i v a t e d in the l i v e r s of roosters, Immature chicks or even 11-days old embryos by injection of estradiol (5-8). Induction of transcription Is very rapid and completely r e v e r s i b l e . Transcripts from the genes appear w i t h i n a few hours a f t e r e s t r a d i o l a d m i n i s t r a t i o n and r a p i d l y decline to undetectably low levels after hormone withdrawal (6,9). In addition, preferential stabilization of Vtg and VLDLII mRNAs by estrogen has been observed (9). Estrogen-induced expression © IR L Press Limited, Oxford, England. of the Vtg and VLDLII genes In l i v e r is as8669 Nucleic Acids Research sod a ted with structural changes In the chromatin encompassing these genes. Nuclease-hypersensitive sites, which are correlated with expression, appear In equivalent regions In front of both genes shortly after estrogen Induct i o n (10-12). These common characteristics suggest that the genes might share similar regulatory sequence elements. The molecular organization and sequence of the 5'-end regions of the VLDLII (13,14) and Vtg (11,15,16) genes have been described e a r l i e r . In the present paper we compare more than 2 kb of 5' flanking DNA of both genes. These sequences and their homologies are discussed and compared to regulatory elements I d e n t i f i e d 1n other systems. MATERIALS AND METHODS DNA cloning and sequencing B g l l l - f r a g m e n t s of 2.0- and 3.0-kb, covering the VLDLII 5' gene r e g i o n , were I s o l a t e d from X Charon4A clone 12 (see Ref.13 and Refs. t h e r e i n ) and subcloned In pBR322. Sau3AI fragments of these subclones were cloned 1n phage M13mp9 (17) and sequenced according to the chain-termination method (18). Sequences 1n the 2.0-kb B g l l l fragment were p a r t i a l l y determined by the sequencing s t r a t e g y of Hong (19), using randomly generated d e l e t i o n mutants. EcoRI-fragments of 0.6, 1.36, and 0.54 kb, covering the 5'-end region of the Vtg gene, were I s o l a t e d from X Charon4A clone 22 (20). A 3.6-kb BamHI fragment, p a r t i a l l y overlapping these fragments , was subcloned 1n pBR322 and digested w i t h Sau3AI and P s t I ; the same holds f o r the 1.36-kb EcoRI fragment. Subsequently, a l l fragments were cloned Into the M13 derivatives mp8 and mp9 and sequenced. To determine sequences from both strands, short stretches of DNA were occasionally sequenced by the chemical modification procedure of Haxam and G i l b e r t (21) a f t e r 3 ' - or 5 ' - e n d - l a b e l l 1 n g of the appropriate fragments (see F1gs. 1 and 2). DNA sequence analysis The nucleotide sequences were analyzed by using a two-dimensional dotmatrix program prepared by Dr. P. Terpstra 1n our laboratory as well as the program NAQ of the Protein I d e n t i f i c a t i o n Resource system (22). RESULTS AND DISCUSSION We have sequenced and compared 5 ' - f l a n k 1 n g DNA of the chicken v i t e l l o genin gene and the VLDLII gene, spanning the regions from p o s i t i o n s -2030 8670 Nucleic Acids Research Yl -2500 gaattctact tggagacatg tagtggatcc atctcttgtt cactgacaca tttactacga gaggggctgc tgactgcaaa ccgagtggat ctagcaacta tatgggtacc ttttgtgtga tgtgctcttg gttcattctc ctagaacagg catcagttca cctcttgcct ttacagaatg ccgtgatttc atacatgaat gagtatagtc aacataacct agtagcacat cagaaaaagg aaagaggagt taaaagtctc -2000 acttgtgtgc cttacttcct aaaaactgat tgctgttcct gtactcatat tacataacat ccgtgtattg tccctgcaac ttgaagccca ctgaaatagc ctgggaacag tttgaaggcc ttcttgttga ttagttcact cgattttctg gctacttggc tactgctgag cttatgaaac aataaataca agcaaaggtt aatgtagatt gggcaaaacc caattttaaa ttttttatca gtttacataa accatgcctg ctttcttttt taactgcccc ccttgaaaga atcaaagctg tcctttattc ataagtttgc ttgtgaatag aacgacatag gatgaaatcc aaacgtactg aaataaagtg acaatctatt tgcaactgcc ctttgtttat aatgtgttct tgcattttgt aatctctgca ctttgttcat tatgttccat tatctggatt atactgattg agtctcatct tttattgtaa gagatgccaa actgataaaa gcagagccct -1500 tcttttttct cccacgtcta agcattcaaa caatttagtt aagtttttgt tatgaagaaa tttatcaact tcttctgatg agttgaaaaa cattgtggtc actgaagagc gcactgctgg acccagcttg ctcctatttc tgtgttatca atttggttct ggttttaatt ttaaaaatat atattgtttc agcctctggc ctgattgggt gtggcaggac atgccgtagt ggtattattt aacttgagaa —•—r -1000 tttttttttt atctggaata ttgtttgata aaagcatcag tagtagcctt tggtcaacta gaggcacaga tgaattctgt gtcaatgcct attaaaacac cagtatgtca aagttgcacc ctttggagga tgagggtacc atataaatca gtaggaaatg gttcacattt tcctggtcag tcagtagcct ctgcaggaat tattatcagc catgtaccag agaagtgttt gattttcttt ttcagcagaa attttggagt aaatagatga gtttatttat tccatggtgt ggatttcttt caaaaaaata ttacacttgt agcagttccc gtgggactga tttgccacaa tagattttga tccaaaagca ccaacttttt tgatgagctg cagttagtga caaaagttct tgctcccatt cgtgaccgga cttaaatcat ggccataaac tagataacaa cagccagccg tactgtgata attcgccgtg aacagccacg taccatgaag taaagaatga tatctgttgc atgctatgtt tatttgtgca aaattagtta ctgcttgagc aaaacagagg aaatcc^gaa aacactcttg attagaatat tcctccaatg tgtagggttt tttccacact tgaagttggc agctgacctg tactttggaa gctgaaagaa gtgcgttggt caaagctgag cttgggtcag tgacccaatc cattgaaact aagagaattt tgttcctgaa cttttgtggg aagggagatg gatatgtctg ttacttttca aggcttaaaa aaaatgtagg ataaattggt taggagaact agagcagttg tggtgagtcc aactgaatgt cagaagaccg acaggaaccc cctgacattt tgcaagcctg cacttctatc gataaaatat cacattgatc gcacatatga ggaagaggga gtgccaggtc taggaaagca tctggtcaat atgattgcaa cattcttcca -1951 -1871 -1791 -1711 -1631 -1551 -1471 -1391 -1311 -1231 -1151 -1071 - 991 - 911 - 831 - 751 - 671 - 591 - 511 - 431 - 351 - 271 - 191 - Ill - 31 1 Figure 1 Nucleotide sequence of 2030 bp upstream of the Vtg gene. The restriction map and sequence strategy are outlined above the sequence. Sequences determined by the chemical modification procedure are Indicated by • (5'-end-labeled fragments) and o (3'-end-labeled fragments). The numbering Indicates positions relative to the cap site. and -2594 to the transcription start sites (+1) of the respective genes (Figs. 1 and 2). Our data extend and correct sequences published by us and others (11,1316). By sequencing both strands, we can decide unambiguously between minor conflicts 1n the earlier data. One major discrepancy between the VLDLII sequence we report here, and that published by Hache £ t ^ 1 ^ (14), deserves further comment. Upstream from the Haelll site at position -254 their sequence completely deviates from ours. This Is caused by their positioning at -254 of a Bglll-Haelll fragment which In our sequence lies much further upstream, viz. -2594 to -2363. Since we have read the sequence through the 8671 Nucleic Acids Research = Hi -2500 -2000 < gatctggtcc tatgccttgc gctataaatg ctttttaaaa ttctaggttg aagaaataaa cacctaagaa atgccatgtg attaaagaac tgccattcca gaaagtgctg gacaaaagct ctcaggcagc gacacttgag gtgccttgtg gaatttataa tgtttgcacg atctgtcttt caagacagaa tgtgacctca acaatacaca agagagtcca tgttttgttg ccatattgca ggctctcgga gttccctagg agcctgcgca ttggttaata ggtcggcttt tttggtcagt cacgtctcct tccatcctgc ttctgcataa acaaaccact aactgcaagc aaagcctttg gagagagtaa tactttactc tatgatctca tgaattttgt gaacagataa tatgtatttt ggaatgaagg aaaggccctg accatagcac acagcagttt gttatattaa caggtcttct ttaaaaatga agaagtggaa ttttgcttcc gataccttta gttgtatcac aagctccatg tctgttagtt gatctcacat aattttctgc gattgagcta cagtctgggt aaaaggcttg catttgttgg caacatgagc ttatgaaagg gtgctcaggt tgagcagaat atgccagtgt -1500 1 gtggttcata agacagaaaa caatggcact attttgcagc atttttaact tttgtgagtt gtatgaaagt ggagatccac ttaatttgag acaaggactg ttgctatctg aaacctcttc aaaacacacc attcaccccc gaacgttatc gagaacttaa ctgaaaattc attgggattt ataaatactt tgctgcactc aagaaaagtc gaaattcatg tgtaggtttc ctcctnyttca tgattgagat agaagttatg gctacgggga aaacagaggg cttcccacag ggcctctatg cagactgacc ttgcttgcaa ctcagatgag -1000 » aagcactgat gtaggaagga ggtcaatacc tgaaagactg ttttagccat attttagact gcctagatag gctgctcatg taaaaaaggg ctttttgtgc tggggctgga agatatacat cttaatatgg acaggtaatg tcagctgtaa aactcttgtt tctgttctga ttttggatca catcgagaca actgatttgg ttctagaaaa tcagtgactt atgtatgcct atggttttat tgagctacat cttcccctgt aaaatgaact tgaactgtga gaacagtggg acatggttgc ttccattacc aagagcaaat catc cagtggtctg aatagactac acgtagactt cacagtgaaa tcciaaaagc tttgtttctt ccactggagt tcagctgctt cttgcaatat aatcattgat ttcatgatag attactacgt ccagagactg actgcttatt aagcaggcag gttcccgtgt cttctagaag aacgttcttc gtggaaaaac ctgccatatg acgtgttggg agtaggtggc catgcctgca gtgtgagata tccacagcat gagatcttgg gtgagagtac atgagcataa agcaaaaatc ctgaaaatgt aaatccgaac ggggaaacaa < -500 ' tgaaataatt actttgaacg tatttaacag gaacagtagg tgatgtttgt cctgtcgtgc tatctgtgtt tccacctcag ctggctttct ttgggcttgg cactgggtaa agggcagcta ccatctgttg gctatttctg taaatctgta gttcttccta taaccttgtt atgtttcagg tgggaggcca tgtaaaaggg ctataaataa taatactact ccttttcctc ctgatggttg ggcactacag caagtggctg Cgtatgtata tagacacaaa agtccagctg agggggctca aacaggtcca agcaggacct aaaaaggaaa cttatttctg tgagattaca aaatagttga gtgtcaccca aggtgaagga aaaataaaga accaaagatt tgcttatttg caattgcctg agttagacaa aattcccttt acctgattta ttaatttgaa gagtctcctt gaccataaat agacattagt gggctttttc tgggcagccc taatgcttac agatactggt ttatgtcact agctaatgcc accttactga atgctttctg agatttacac catatacctg taatttgcag tagctacagt gtgacccagg gagtcctaca ttgacccctc tctatatata tttttaatta aggcctttga gcttcattct tttgtgatga atatacctta ataggtgact cataagaaag cagagcactc attggcagta acttagcagg gttttactac gaagagagat tgcctgtaat agtgattgta tgctttttag aataattttc tttgtttttc tctttaatga tgtttggttt tgagcttttc gacataagat atttgttgct agacccaatg actgggcaca ccattttgtc tgtgcaccag ctgtcaaagt taatgtggaa agctgccttc cagcttcttg actatattag ^2515 -2435 -2355 -2275 -2195 -2115 -2035 -1955 -1875 -17 95 -1715 -1635 -1555 -1475 -1395 -1315 -1235 -1155 -1075 - 995 - 915 - 835 - 755 - 675 - 595 - 515 - 435 - 355 - 275 - 195 - 115 - 35 1 Figure 2 Nucleotide sequence of 2594 bp upstream of the VLDLII gene. The r e s t r i c t i o n map and sequence strategy are o u t l i n e d above the sequence. Sequences determined by the chemical modification procedure are Indicated by • (5'-end-labeled fragments) and o (3'-end-1abeled fragments). The numbering Indicates positions relative to the cap site. H a e l l l s i t e at -254, we I n f e r that our arrangement Is the c o r r e c t one. Moreover, our sequence and arrangement agrees with the r e s t r i c t i o n map of the genoraic ONA determined by the other group (23) and ourselves (24). A cloning a r t i f a c t Is therefore excluded. As a consequence, some of the homology elements noted by Burch (11) are affected: Element 2, constituting an Imperfect Inverted repeat of element 1 (position -200), was I n i t i a l l y pos1- 8672 Nucleic Acids Research .1000 .900 .800 .700 .600 .500 _<00 .380 .200 J00 1 (cop) via. repeots 1-» enhancer ewes i-, v - — 2 9 4 bp "tronsposon" «-i v Py/Pu "20bp" VLOLD hss » repeats enhancer cores » 2-» 2-» w i« 276 bp b H "Ironsposon" p w,H / Py/Pu ( » ? "20 bp" F i g u r e 3 Common s t r u c t u r a l f e a t u r e s 1n the 51 f l a n k i n g regions of the Vtg and VLDLII gene. Nuclease-hypersens1t1ve s i t e s (hss) are given by black t r i a n g l e s ( • ) . Open t r i a n g l e s ( V ) I n d i c a t e p o s i t i o n s of sequences r e sembling enhancer core elements: 5'-CTTTCCTAGA-3' (at Vtg -282), 5'-ACTGGTTTTG-3' (at Vtg -337), 5'-GTGGAAGG-3' ( a t VLDL -199),5'-GTGGGAAG-3' (at VLDL -334), 5'-GTTGAAAG-3' (at VLDL -348) a l l 1n the lower strand and 5'-GTGGAATT-3' (at VLDL -280) 1n the upper strand. Arrows Indicate repeated sequences: 5'-GTTTTTTATCAG-3' ( a t Vtg - 8 9 , -181 and -393); 5'-TGAACTGTGA-3' (at VLDL -404 and -480). Py/Pu, °20 bp° and "transposon" refer to sequence elements discussed 1n the text. tioned around -290 but Is actually located about 2 kb further upstream. Common structural elements In the regions exhibiting estradiol-induced changes In chromatin structure From studies on the chromatin structure of genes I t has emerged that the upstream regions proximal to active genes, or genes which can be activated 1n the tissue studied, often contain sites which are hypersensitive to DNAsel (25-27). These sites are believed to mark nucleosorae-free regions Involved In the control of gene expression (25-27). The regions 5' to the Vtg gene (10,11) and 5' to the VLDLII gene (12) contain several nucleasehypersensitive sites which are Induced by estradiol and only occur In liver. The Vtg gene 1s marked by six sites, five of which have already been reported (11) and the sixth one recently been observed In our laboratory (K.Kok £ t iU, to be published) , a l l mapping within the f i r s t 900 base-pairs proximal to the cap s i t e . The four sites found for the VLDLII gene map within the 500 nucleotide sequence preceding the transcription I n i t i a t i o n site (12). In a search for common sequences that might have a bearing on the 8673 Nucleic Acids Research GRE-- VLDLII -2332 TTTG(ftGCTGAAAGA]CTGCACA6 fGAAAGAACA 3TAGGAAATAGTTG VITELL06ENIN TTCCTGGTCAGCGTGACCG<£GC fGAAAGAt\CA :ATTGATCCCGTGA -631 1 ER-blndlng oito- - J Figure 4 Putative estrogen-receptor binding sites within the Vtg and VLDLII 51 flanking regions. Homologous decanucleotides are boxed similarly. Direct and Inverted repeats are Indicated by arrows. The putative estrogen-receptor (ER) binding site 1s Indicated. A sequence showing 82% (VLDLII) and 94% (Vtg) horaology to the consensus sequence of the glucocort1co1d-respons1ve element (GRE) 1s Indicated. special chromatin organization of these regions, we have analyzed and compared their nucleotide sequences. In front of the Vtg as well as the VLDLII gene we find about 30-nucleotides long stretches of alternating pyrimidines and purines at roughly similar positions viz. -533 and -468, respectively (F1g. 3). Alternating pyrimidine/purine sequences may promote the Z-conformat1on of DNA (28) which prevents binding of nucleosomes (29). Such sequences have a potentiating a c t i v i t y on transcription 1n which capacity they are found 1n the SV40 enhancer region (30), which 1s essentially nucleosome-free (31). Horeover, a 9-bp alternating pyrimidine/purine element has been found to be essential for transcription of the hunan metallothionein gene (32). Ch Ch XI XI XI XI Vtg VLDLII Al A2 Bl B2 Consensus: T TTAC 275 «- T T T . C 98* T T T G C 107 - T T T A C 202 -• T T T . G 203 - T T T . G 99 - TTT C • G A A A C T G AT A A AA A A A C T G AC C A AT A A C CT G A T A C AA A A C CT G AT A C AG A A A GT G AC C A G G A A A G T G AC G A G G AA T A AC T G A CG C C A AG C G A Figure 5 A sequence element conserved I n yolk protein genes. Numbers r e f e r t o the 51 n u c l e o t i d e of the sequences shown. The d i r e c t i o n of the a r r o w s I n d i c a t e whether the sequence I s I n t h e upper (-») or i n the l o w e r ( « - ) s t r a n d . N u c l e o t i d e s I n v a r i a n t i n a l l s i x genes are boxed. Ch: c h i c k e n . X I : Xenopus l a e v i s ; sequences were taken from r e f . 41. 8674 Nucleic Acids Research In addition to the alternating pyrimidine/purine sequences, the Vtg and VLDLII 5'-flank1ng regions contain other motifs which are reminiscent of enhancers. F i r s t l y , four elements occur In the -190 to -350 region of the VLDLII gene which, 7 out of 8 nucleotides , match the SV40 enhancer core sequence (33). Similarly, the Vtg gene 1s flanked by two segments matching the IgCjj (34) or IgCk (35,36) enhancer core elements by 9 out of 10 positions. Secondly , multiple repeated sequences are present; two copies of a 10-bp repeat in front of the VLDLII gene and three (Im)perfect copies of a 12-bp repeat 1n front of the Vtg gene are most conspicuous. All elements discussed above are depicted in F1g. 3. The presence of several structural features which are attributed to potential enhancers 1n the regions of nuclease-hypersensitive sites may f i t Into the modular structure proposed for enhancers (for a review see Ref. 37). In this context, the elements found In front of the Vtg and VLDLII genes might have a role, although perhaps not an essential one , as modulators of a larger enhancer region regulating transcription. Distribution of putative estrogen receptor binding sites within the viteliogenin and VLDLII 5' flanking regions The region around -600 of the Vtg gene, containing the estrogen-dependent hypomethylated Hpall site (38,39), has been Implicated 1n the estrogen induction of the gene (40). In vitro DNAsel protection experiments have Indicated the sequence running from -621 to -597 as a possible target site for the estrogen-receptor (see also Fig. 4). This sequence segment overlaps the symmetrical element -GGTCANNNTGACC- which has also been found In front of the chicken VLDLII gene as well as In front of the four Xenopus laevis Vtg genes (41), although positions and copy number differ even within the Xenopus Vtg gene family. Moreover, at position -587 to -604 1n the lower strand of the chicken Vtg gene, a sequence resembling the glucocorticoid response element Is located (42). In view of these data, we screened the DMA flanking the VLDLII gene for s i m i l a r i t i e s to the -0.6 kb Vtg gene region. When the sequence of the proposed estrogen-receptor binding site was considered, only sequences of limited homology (624) were found, viz. at positions -1629 and -1452. However when we searched for subsets of the sequence more striking similarities were noticed. Two different decanucleotides, which 1n Vtg overlap and form part of the receptor binding s i t e , are also found in front of the VLDLII gene.viz. at -2.3 kb where they are 8 bp apart (see Fig. 4). In both genes these sequences are adjoined by Inverted repeats. Interestingly, a 8675 Nucleic Acids Research sequence resembling the glucocorticoid-responsive element Is also present In the -2.3 kb VLDLII gene region In the same configuration as 1n the Vtg gene (see F i g . 4). A s i m i l a r regulatory role of these sequences Is not excluded by t h e i r vastly different distances relative to their genes. Steroid-responsive elements generally have not been found at fixed positions In front of genes and have been located sometimes as far as 2.6 kb as observed f o r the r a b b i t uteroglobin gene (43). A sequence element conserved within the 5' flanking region of genes controlled by estradiol 1n l i v e r Comparative analysis of the Vtg and VLDLII gene reveals a 20- nucleotide block of sequence horaology, lying In opposite orientation, within the -100 to -300 region (F1g. 5). We found a s i m i l a r sequence 1n the Xenopus laevis vitellogenin genes (41) at corresponding p o s i t i o n s . The derived consensus sequence contains a I0-nucleot1de motif that 1s perfectly conserved within the six yolk protein genes (see F1g. 5). However, this motif 1s absent from the egg-wh1te protein genes, which are also estrogen-responsive but only expressed In the o v i d u c t , v i z . the ovalbumin, ovalbumin-Hnked X and Y, conalbumin (44), ovomucoid and the lysozyme gene, (Genbank, release 26.0). LENGTH 294 kp Ch«tg ATTTTC [TJA) ATGTSTTCTT6TGT _^ -591 CTGAAAGAACACAT ITGAI TCCCCTGATTTCAATAAA 276 ChtLOLII -827 ^_ GTTTTC i t t l S GATCTCACATTGTA _^ -544 TCCCCTGTGA6ATC ITTGI GCAAGT66CTGAGATTTA 448 X1A1/B1 CACAAC [TTAj G6C6CACATTTACT ACrAAAIfiI£CCCC fcAA) ATGTGTGTGtATTATATT 189 IlBJLf -727 CTACAC ICtAl GGGGCACATTCACQ « -S31 AcfAAATGTGCCCC fcTA| ATGTAGCA6ATTTTCTTC CTACAC (£TA) tCCGCACATATACT AGTAAATGTGCCCC ETAI ATGTAGCAGATTTTCTTC X1A2 CAACAC Rf*l CCGGCACATTTACT TCTAAATCTGCCCC WTAIATGATATAGATT6tATTT 11A1 CCGCAT |TGT] TCSGCAAAUTACC GGCAAATTTGCCCAG^ST] TGATAAATGAGCCCCACT 112 -692 ^__ Figure 6 Transposon-Uke elements 1n chicken and Xenopus laevis yolk protein genes. The bounding Inverted repeats are Indicated by arrows above the sequence. The flanking direct repeats are boxed. The length given for each element Includes the Inverted repeat. Positions, where known exactly, are given relative to the cap sites. Short sequence motifs that are frequently present 1n regulatory sequences are underlined. The Xenopus sequences were taken frora Schubiger et a]^ (45). 8676 Nucleic Acids Research Nor 1s the element found 1n the liver-specific, but hormone-Independent albumin gene. This sequence element common to yolk protein genes forms the core of a block of homology between the chicken and Xenopus Al and A2 Vtg genes noticed earlier (41). In that study, the homology between the chicken gene element and those of the Xenopus B genes ( at position -90) was rather poor. More consistent results are obtained with a sequence track around -200 of the B genes (see Fig. 5),y1eld1ng better homology to the chicken genes and to all four Xenopus genes as well. In the chicken Vtg and VLDLII gene, this conserved element coincides with an expression-1Inked nuclease-hypersensitive site at position -120 and -270, respectively, suggesting a regulatory function. Since the conserved element flanks estrogen-controlled genes expressed 1n liver only.1t might Influence their expression 1n a tissue-specific manner. A transposon-11ke DNA element within the upstream regions of the vitellogenin and VLDLII gene When we surveyed the 5' flanking regions of the Vtg and VLDLII gene our attention was drawn to a common structural feature at similar positions. I t 1s a sequence track of about 300 basepairs bounded by perfect Inverted Vtg -900 CGATTTTcEfiS-ATGTGTTCTTGTGTTATCAATATAAATCACAGTTAGTGATGAAGTTGGCTGCAAGCCTGCATCAGT***** ** * *** * ** * * ** ** ********** * TT6TTTT6ffT6TGATCTCACATTGT A6GTTTCAT6TAT6CCTCATGCCT6CACCTTTT VLDL -835 l 2 --TCAGCTACTTGGCTGCATTTTGTATTTGGTTCTGTAGGAAATGCA'MAGTTCTAGCTGACCTGCAC ******* * CCTCAGCTAATGC ***** ** * * * CATTT-GTTGCTCCATAT ****** +*** ** **** TTCTATCCCTCTTG--* ** *** * TGCAAATTTTCTGCCT--CCTGTTCAATGGTTTTATGTGTGAGATAC CCTTACTGCTGAGAATCTCTGCAGGTTTTAATTGTTCACATTTTGCTCCCATTTACTTTGGAAGATAAAATATTTACAGA ******** ** * ** **** * • ** * * * * ***** TGATGGTTGACCTTACTGAAGACCCAATGGGCTCTCGGAGATTGAGCTATGATTGAGATTGAGCTAEATTCTABAGCATGGCACTACAG- ATGCTTATGAAACCTTT---GTTCATTTAAAAATATTCCTGGTCAGCGTGACCGGAGCTGAAAGAACACATtTGA(TCCCGTGATTT ****** * **** * * * **** ** * * * * * * *** ATGCTTTCTGACTGGGCACAGTTCCCTAGGCAGT -582 * -CTGGGTA6AAGTTAT6CTTCCCCT6T6A6ATC-trnJ6CAA6T6GCT6 -535 Figure 7 Nucleotide sequences of the transposon-like elements within the upstream region of the Vtg and VLDLII gene. Dashes were Inserted to achieve optimal alignment of homologous sequences. The Inverted repeats bounding the elements are Indicated by thick arrows. The flanking direct repeats are boxed. Corrispondingly numbered arrows Indicate direct or Inverted repeats. An 8-bp sequence Identical to the enhancer core consensus (33) 1s boxed. Consecutive stretches of homology with the B2 Lf V1 element of Xenopus laevis (45) are Indicated by bold lines. 8677 Nucleic Acids Research repeats, which In turn are flanked by s1te-spec1f1c three base-pair direct repeats(F1g. 6), a structure reminiscent of mobile genetic elenents. Interestingly, a transposon-Uke DNA element (V1 element) Interspersed 1n the Vtg locus of Xenopus laevis has recently been described (46). Both chicken elements bear homology 1n structure, length of the direct repeats and sequence of the Inverted repeats to these V1 elements (F1g. 6).In Xenopus, seven copies of the V1 element are present near or 1n the Vtg genes whereas a total of about 7500 copies 1s dispersed throughout the genome (45). We found no evidence for a multiple occurrence of these elements 1n the chicken genome, when Southern blots of cloned DNA containing sequences of the Vtg or VLDLII element were probed with nick-translated genomic DNA at low stringency (data not shown). The transposon-11ke elements 1n chicken and Xenopus differ 1n length; however they share homologous nucleotide stretches up to 12 basepairs as 11 lustrated 1n F1g. 7 for the chicken Vtg and VLDLII genes and the Xenopus B2 gene. As yet we do not know the functional significance of these elements. However, the presence of a demethylation site (see above), the occurrence of direct and Indirect repeats and a sequence Identical to the enhancer core consensus (see Fig. 7) as well as the recurrence of the socalled recognition sequences TGTG/CACA and GAGA/TCTC (46) In the Inverted repeats bounding the elements (see F1g. 6), argue 1n favour of a possible regulatory role. Our sequence analysis has uncovered a number of elements conserved In the estrogen-responsive genes expressed exclusively In liver. Their precise role In the hormone- and tissue-specific expression of these genes, which are transcribed at high frequency, has s t i l l to be established. The availability of the sequence makes direct studies of the significance and Importance of these elements, of their Interaction with factors, and of their possibly cooperative function possible. ACKNOWLEDGEMENTS We are g r a t e f u l t o Dr. P. Terpstra f o r assistance i n the computer analys i s . We thank B. Dikkeschel for assistance I n the i n i t i a l p a r t of t h i s work. This I n v e s t i g a t i o n was c a r r i e d out under auspices of the Netherlands Foundat i o n f o r Chemical Research (S.O.N.) w i t h f i n a n c i a l a i d from the Netherlands Organization f o r the Advancement of Pure Research (Z.W.O.). REFERENCES 1. Bergink, E.W., Wallace, R.A., Van den Berg, J . A . , Bos, E . S . , Gruber, M. and AB, G. (1974) Am. Zool. 1±, 1177-1195. 8678 Nucleic Acids Research 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. 12. 13. 14. 15. 16. 17. 18. 19. 20. 21. 22. 23. 24. 25. 26. 27. 28. 29. Deeley, R.G. and Goldberger, R.F. (1979) In Ontogeny of receptors and reproductive hormone action pp. 291-307, eds. Hamilton, T . H . , Clark, J . H . and Sadler, W.A., Raven Press, New York. Chan, L., Means, A.R. and O'Halley, B.W. (1978) 1n Vitamins and Hormones 36, 259-295. Wahli, W., Dawid, I . B . , R y f f e l , G.U. and Weber, R. (1981) Science 212. 298-304. Bergink, E.W., Kloosterboer, H . J . , Gruber, M. and AB, G. (1973) B1och1m. Biophys. Acta 294, 497-506. W1skoc1l, R., Bensky, P., Dower, W., Goldberger, R.F., Gordon, J . I . and Deeley, R.G. (1980) Proc. N a t l . Acad. Sc1. USA 77_, 4474-4478. Wieringa, B., Van der Zwaag-GerHtsen, J . , Mulder, J . AB, G. and Gruber, H. (1981) Eur. J . Biochem. 114, 635-641. Elbrecht, A . , Lazier, C.B., P r o t t e r , A.A. and U11 Haras, D.L. (1984) Science 225, 639-641. Noteborn, M.H.M., Bakker, 0 . , de Jonge, M.A.W., Gruber, M. and AB, G. (1986) J . Steroid Biochem. 24, 14-18. Burch, J.B.E. and Weintraub. H. (1983) Cell 33, 65-76. Burch, J.B.E. (1984) Nucleic Adds Res. U_, 1117-1135. Kok, K., Snippe, L., AB, G. and Gruber, H. (1985) Nucleic Adds Res. U, 5189-5202. Van het Schip, A . D . , M e i j l i n k , F.C.P.W., S t H j k e r , R., Gruber, M., van V H e t , A . J . , van de Klundert, J . A . H . and AB, G. (1983) Nucleic Adds Res. U, 2529-2540. Hache, R.J.G., W i s k o d l . R., Vasa, M., Roy, R.N., Lau, P.C.K. and Deeley, R.G. (1983) J . B1ol. Chem. 258, 4556-4564. Walker, P., Brown-Luedi, M., Germond, J . - E . , WahH, W., M e i j U n k , F.C.P.W., van het Schip, A . D . , Roelink, H., Gruber, M. and AB, G. (1983) EMBO J . 2, 2271-2279. Geiser, M., M a t t a j , I.W., W1lks, A.F. and Jost, J . - P . (1983) J . B1ol. Chem. 258, 9024-9030. Messing, J . and V1e1ra, J . (1982) Gene ^ 9 , 269-276. Sanger, F . , Coulson, A.R., B a r r e l l , B.G., Smith, A.J.H. and Roe, B.A. (1980) J . Hoi. B1ol. 143, 161-178. Hong, G.F. (1982) J . Mol. B i o l . J J 8 , 539-549. Arnberg, A . C . , M e i j U n k , F.C.P.W., Mulder, J . , van Bruggen, E . F . J . , Gruber, M. and AB, G. (1981) Nucleic Adds Res. £ , 3271-3286. Maxam, A. and G i l b e r t , W. (1980) In Methods In Enzymology, v o l . 65, part I (Grossman, L. ed.) pp. 499-560 Academic Press, New York. George, D.G., Barker, W.C. and Hunt, L.T. (1986) Nucleic Adds Res. U, 11-15. Colgan, V . , Elbrecht, A . , Goldman, P . , Lazier, C.B. and Deeley, R.G. (1982) J . B i o l . Chem. 257, 14453-14460. M e i j l i n k , F.C.P.W. (1983) Ph.D. Thesis, The University of Groningen. E l g i n , S.C.R. (1981) Cell 27, 413-415. Igo-Kemenes, T . , Hflrz, W. and Zachau, H.G. (1982) Ann. Rev. Biochem. 51., 89-121. E l g i n , S.C.R. (1984) Nature 309, 213-214. Nordheim, A. and Rich, A. (1983) Nature 303, 674-679. Nickol, J . . Behe, M. and Felsenfeld, G7T1982) Proc. N a t l . Acad. S d . USA 79, 1771-1775. 8679 Nucleic Acids Research 30. 31. 32. 33. 34. 35. 36. 37. 38. 39. 40. 41. 42. 43. 44. 45. 46. 8680 Herr, W., Clarke, J . andGluzman, Y.(1985) In Eukaryotic transcription : the role of e l s - and trans-acting elements (Ed. Y. Gluzman) pp. 27-32, Cold Spring Harbor Laboratory, New York. Jongstra, J . , Reudelhuber, T . L . , Oudet, P . , Benoist, C , Chae, C.B., J e l t s c h , J . H . , Mathis, D.J. and Chambon, P. (1984) Nature 307, 708-714. Karin, H . , Haslinger, A., Holtgreve, H . , Richards, R . I . , Krauter, P . , Westphal, H.H. and Beato, H. (1984) Nature 308, 513-519. Weiher, H . , Konig, M. and Gruss, P. (1983) Science 219, 626-631. G i l l i e s , S . D . , Morrison, S . L . , 0 1 , V.T. and Tonegawa, S. (1983) Cell 3_3. 717-728. Picard, 0. and Schaffner, W. (1984) Nature 307, 80-82. Queen, C. and S t a f f o r d , J . (1984) J . Mol. C e l l . B i o l . 4_, 1042-1049. Schaffner, W. (1985) In Eukaryotic transcription : the role of e l s - and trans-acting elements (Ed. Y. Gluzman) pp. 1-18, Cold Spring Harbor Laboratory, New York. M e i j l i n k , F.C.P.W., PhiHpsen, J . N . J . , Gruber, H. and AB, G. (1983) Nucleic Adds Res. 1J., 1361-1373. Wilks, A . F . , Cozens, P . J . , M a t t a j , I.W. and Jost, J . P . (1982) Proc. N a t l . Acad. Sc1. USA 7£, 4252-4255. Jost, J . - P . , Seldran, H. and Geiser, H. (1984) Proc. N a t l . Acad. Sc1. USA 81., 429-433. Walker, P . , Germond, J . - E . , Brown-Luedi, M., G1vel, F. andWahU, W. (1984) Nucleic Adds Res. 12^, 8611-8626. Beato, M. (1984) Proc. of the 10th annual EMBO symposium on control of eukaryotic gene transcription. Cato, A.C.B., Geisse, S., Wenz, M., Westphal, H.H. and Beato, H. (1984) EMBO J . 3, 2771-2778. Cochet, M., Gannon, F . , Hen, R., Maroteaux, L., P e r r i n , F. and Chambon, P. (1979) Nature 282, 567-574. Schubiger, J . - L . , Germond, J . - E . , ten Heggeler, B. and Wahli, W. (1985) J . Mol. B1ol. JL86, 491-503. Nussinov, R. (1986) Biochim. Biophys. Acta 866^, 93-108.
© Copyright 2026 Paperzz