1 Biochem. Soc. Symp. 73, 1–10 (Printed in Great Britain) 2006 The Biochemical Society Investigations of the modular structure of bacterial promoters Nora S. Miroslavova and Stephen J.W. Busby1 School of Biosciences, The University of Birmingham, Birmingham B15 2TT, U.K. Abstract Bacterial RNA polymerase holoenzyme carries different determinants that contact different promoter DNA sequence elements. These contacts are essential for the recognition of promoters prior to transcript initiation. Here, we have investigated how active promoters can be built from different combinations of elements. Our results show that the contribution of different contacts to promoter activity is critically dependent on the overall promoter context, and that certain combinations of contacts can hinder transcription initiation. Overview All bacterial genes are transcribed by a multisubunit RNA polymerase that contains two large subunits, β and β′, and two copies of the smaller α subunit. Biochemical, genetic and structural studies have shown that the β and β′ subunits form a cleft that contains the enzyme’s active site for RNA polymerization, and that the N‑terminal domains of the two α subunits are responsible for holding the assembly together [1]. Although the RNA polymerase α2ββ′ assembly (known as the core enzyme) is competent to make RNA, it is unable to initiate transcription. In order to recognize promoters and to begin transcription, core RNA polymerase must acquire a σ subunit (resulting in the formation of RNA polymerase holoenzyme) [2]. All bacteria contain σ subunits, which function by binding to the surface of RNA polymerase core enzyme, thus providing determinants for the recognition of specific promoter elements. Following promoter recognition, the bound σ subunit choreographs a series of isomerizations that lead to promoter DNA unwinding and entry of the DNA template into the enzyme active site, where transcript formation is initiated. Most σ subunits contain four domains that are connected by flexible linkers [3]. Domains 2, 3 and 4 carry determinants for recognition of different promoter To whom correspondence should be addressed (email [email protected]). 1 N.S. Miroslavova and S.J.W. Busby elements, namely the −10, extended −10 and −35 elements respectively (Figure 1A). Structural studies with Thermus aquaticus RNA polymerase holoenzyme in complexes with DNA substrates [4,5] have resulted in a clear view of how the interactions between σ and different promoter elements are organized during promoter recognition (Figure 1B). Most bacteria contain a few thousand genes that are expressed from 1000– 2000 promoters. The best‑studied example is E. coli K‑12, which appears typical of many bacteria, in that the expression of its genes is very tightly regulated, with changes controlled by alterations in the environment. In most conditions, a single E. coli cell contains 1000–2000 molecules of RNA polymerase, and thus effective regulation requires controlled distribution of the polymerase between the promoters in its genome [6]. This distribution is mainly a result of the different base sequence determinants found at the promoters, although an important role is also played by supercoiling and the proteins that interact with the bacterial chromosome to fold and organize it (these are known as nucleoid‑associated proteins). Changes in the distribution of RNA polymerase between promoters are controlled by the production of different σ factors, and by transcription factors that bind at or near specific promoters and either activate or repress those promoters in response to specific metabolic signals [7]. Although most studies of the regulation of transcription initiation in E. coli have focused on promoter sequence determinants, σ factors and transcription factors, it is now clear that other factors, such as small ligands, small effector proteins that contact RNA polymerase without contacting promoter DNA, and changes in the bacterial folded chromosome structure, also play a key role [8,9]. Elements of promoter recognition In this article, we focus on the primary determinants of promoter activity in E. coli, which are the initial contacts between RNA polymerase and different promoter elements. Our study is concerned with RNA polymerase containing the major σ70 subunit, though the take‑home messages apply to most σ factors. It has long been known [2,10] that promoter −10 elements are recognized by a helix in Region 2.4 of σ70 Domain 2, and that a helix–turn–helix in Domain 4 is responsible for recognition of promoter −35 elements (Figure 1). The consensus −10 and −35 elements are established as the hexamer sequences TATAAT and TTGACA respectively. More recent work has shown that a helix in Domain 3 of σ70 contacts the extended −10 element (consensus TRTGN), located immediately upstream of the −10 element [3,11]. Finally, the RNA polymerase α subunits play a key role [12,13]. Each α subunit carries an independently folded C‑terminal domain (residues 250–329), joined to the α subunit N‑terminal domain by a flexible linker. The two C‑terminal domains of the two RNA polymerase α subunits can bind to a 20 bp DNA element, known as the UP element, located at promoters immediately upstream of the −35 element. The UP element consists of two half‑elements, each recognized by one α subunit C‑terminal domain (Figure 1A). A consensus UP element has been determined, and it has been shown that, at certain promoters, the interactions of RNA polymerase with UP © 2006 The Biochemical Society Modular structure of bacterial promoters Figure 1 RNA polymerase–promoter interactions. (A) The diagram illustrates the subunit structure of RNA polymerase holoenzyme, showing the disposition of the different subunits as the enzyme makes contact with a bacterial promoter. Each α subunit consists of an N‑terminal domain (αNTD) and a C‑terminal domain (αCTD), joined by a linker. The σ subunit is divided into four segments to illustrate the four domains. The locations of the promoter UP, −35, extended −10 and −10 elements are indicated, together with consensus sequences. The diagram illustrates the different interactions between RNA polymerase and promoter elements. The small 91‑amino‑acid ω subunit acts as a chaperone for the β′ subunit [24]. (B) Model of RNA polymerase holoenzyme docking to promoter DNA, based on structural studies with Thermus aquaticus RNA polymerase from Seth Darst and co‑workers [4,5]. [Reproduced with permission from Murakami, K.S., Masuda, S., Campbell, E.A., Muzzin, O. and Darst, S.A. (2002) Science 296, 1285–1290. Copyright 2002 AAAS.] The locations of domains 2, 3 and 4 of the σ subunit (coloured orange) and their interactions with the −10, extended −10 and −35 elements are shown. Note that αCTD and σ domain 1 are not seen in the T. aquaticus RNA polymerase structure. © 2006 The Biochemical Society N.S. Miroslavova and S.J.W. Busby Figure 2 Building a promoter from different elements. (A) The complete base sequence of the starting 144 bp NM513 EcoRI–HindIII fragment, which carries a consensus −10 element (bold type) but non‑consensus extended −10, −35 and UP elements. The locations of the different elements are indicated by horizontal lines. The transcription start site, +1, is in bold and underlined. (B) Partial base sequences of the starting NM513 promoter and three derivatives, carrying different promoter elements, are shown from positions −55 to +1. NM535 carries consensus −35 and −10 elements, shown in bold type. NM511 carries consensus extended −10 and −10 elements (bold). NM501 carries a consensus −10 element and a DNA site for CRP (bold). All the promoters carry identical sequences downstream of the transcription start site. Each promoter was fused to the lacZ gene (in the expression vector pRW50 [18]), and its ability to initiate transcription was deduced from measurements of β‑galactosidase activities in the ∆lac E. coli strain MC4100 growing exponentially. Cells were grown at 37°C in Luria–Bertani medium, supplemented with tetracycline (35 µg/ml). Activities are expressed as nmol of o‑nitrophenyl β‑d‑galactoside hydrolysed⋅min−1⋅mg dry cell mass−1 (Miller units) and are listed in the final column. (C) Partial base sequence from positions −55 to +1 of the NM500 promoter, which carries a DNA site for CRP, and consensus extended −10 and −10 elements (shown in bold). Expression from this promoter was deduced from the β‑galactosidase activity measurement shown in the final column. © 2006 The Biochemical Society Modular structure of bacterial promoters Figure 3 Reduction of promoter activity due to too many consensus promoter elements. (A) Partial base sequences from positions −50 to +1 of four promoters containing a consensus −10 element, with or without a consensus −35 hexamer, and with or without a consensus extended −10 element. Consensus sequence elements are indicated by bold type. Each promoter was fused to the lacZ gene, and its ability to initiate transcription was deduced from measurements of β‑galactosidase activities in a ∆crp derivative of E. coli strain MC4100, growing exponentially in Luria–Bertani medium. Activities are expressed as nmol of o‑nitrophenyl β‑d‑galactoside hydrolysed⋅min−1⋅mg dry cell mass −1 (Miller units) and are listed in the final column. (B) Derivatives of the four promoters in (A) in which the consensus −10 hexamer, TATAAT, was replaced with TATGGT. Activities were assayed exactly as in (A). element sequences can improve promoter activity by a factor of 10‑fold or much more [14,15]. Our strategy was to investigate how different combinations of promoter elements could make functional promoters. The starting point was the EcoRI– HindIII fragment illustrated in Figure 2(A) that carries the NM513 promoter. This fragment was derived from the E. coli galactose operon promoter region [16] which had been engineered to carry a consensus −10 hexamer element, TATAAT, but no consensus extended −10 or −35 element and no UP element, so that it possessed little or no promoter activity. Our aim was to introduce © 2006 The Biochemical Society N.S. Miroslavova and S.J.W. Busby different combinations of promoter elements and to measure the resulting promoter activities. Figure 2(B) illustrates the NM535 promoter, with an added consensus −35 element, and the NM511 promoter, with a consensus extended −10 element. We also constructed the NM501 promoter, which carries a DNA site for the transcription activator CRP (cAMP receptor protein). The reason for this was to show that promoter activity could be created by the intervention of trans‑acting factors as well as by the addition of cis‑acting promoter elements. CRP is a well understood bacterial transcription activator [17], and thus we placed a DNA site for CRP at a location where bound CRP would be competent to activate transcription. To measure the activity of the new promoter constructions, the different EcoRI–HindIII fragments were transferred to plasmid pRW50, a promoter‑probe vector that places the lac genes under the control of cloned promoters [18]. The resulting pRW50 derivatives were then transformed into the ∆lac E. coli K‑12 strain MC4100, and β‑galactosidase activities were measured as an index of the activity of the cloned promoter (as described in [18]). Data in Figure 2(B) show that, as expected, the starting NM513 promoter had very low activity (expression was close to background levels), whereas the presence of a consensus −35 element (in NM535), an extended −10 element (in NM511) or bound CRP (in NM501) created promoter activity. Promoter activity: more can mean less Figure 2(C) illustrates the NM500 derivative of the NM513 promoter, which carries both an extended −10 element and a DNA site for CRP. The activity of the NM500 promoter is greater than the individual activities of the NM511 and NM501 promoters, showing that an extended −10 element can work together with CRP. Figure 3(A) illustrates an experiment to investigate whether, in the context of a promoter with a consensus −10 element, an extended −10 element can function positively with a consensus −35 element. The data show that the addition of an extended −10 element to a −35 element in the NM530 promoter caused a decrease in promoter activity. Thus we conclude that too many strong contacts between a promoter and RNA polymerase may hinder rather than help a promoter, presumably because the bound RNA polymerase becomes anchored at the promoter and is less able to escape. Support for this point of view comes from the experiment illustrated in Figure 3(B), in which the −10 element was altered from the consensus, TATAAT, to TATGGT. Thus the activity of the NM130 promoter carrying both a consensus −35 element and an extended −10 element was higher than the activities of the NM131 or NM100 promoters that carry either one element or the other. UP element contributions to promoter activity are contingent on the −35 element In the promoters illustrated in Figure 2, the DNA sequence just upstream of the −35 region was chosen so as not to resemble any known UP element. © 2006 The Biochemical Society Modular structure of bacterial promoters Indeed, in the starting NM513 promoter, we used a DNA sequence that had been characterized previously as an ‘anti‑UP’ sequence, i.e. the sequence least likely to enhance promoter activity due to binding of the C‑terminal domain of the RNA polymerase α subunit [15]. In order to measure the effects of introducing a UP element, a 20 bp sequence, selected previously for its optimal UP element function [15], was cloned upstream of the −35 region at the NM511 or NM535 promoters (Figure 4). This yielded the NM510 promoter (carrying a UP element, and consensus −10 and extended −10 elements) and the NM534 promoter (carrying a UP element, and consensus −10 and −35 elements). To measure the effects of these UP elements, DNA fragments carrying the NM510 and NM534 promoters were cloned into the pRW50 lac expression vector as before. The data in Figure 4 show that the UP elements had small effects on promoter activity at these promoters. We reasoned that the effects of the UP elements at the NM510 and NM534 promoters might be increased if the other promoter elements were weakened. Thus we made a series of derivatives of NM534 in which the consensus −35 element was systematically mutated. These changes are illustrated in Figure 5; positions 2–5 of the consensus −35 hexamer, TTGACA, were changed to each of the three alternative bases (note that position 1 was not altered, as we wished to retain the BglII restriction site immediately upstream of the −35 element). The same set of 15 mutations was also created in the context of the NM535 promoter, which lacked a UP element. Figure 5 shows a comparison of the effects of each base change on promoter activity in either NM534 or NM535. Two clear results emerged. First, at the NM535 promoter without a UP element, the effects of the different −35 hexamer changes were in accordance with those reported previously in similar studies with the E. coli lacUV5 promoter [19] and Figure 4 Effects of a UP element at different promoters. Partial base sequences of promoters containing or lacking UP element sequences in combination with either consensus extended −10 and −10 elements (NM511 and NM510) or consensus −35 and −10 elements (NM535 and NM534) are shown. Consensus sequence elements are indicated by bold type. All of the promoters carry identical sequences downstream of the transcription start site. Each promoter was fused to the lacZ gene, and its ability to initiate transcription was deduced from measurements of β‑galactosidase activities in the ∆lac E. coli strain MC4100 growing exponentially in Luria–Bertani medium. Activities are expressed as nmol of o‑nitrophenyl β‑d‑galactoside hydrolysed⋅min−1⋅mg dry cell mass−1 (Miller units), and are listed in the final column. © 2006 The Biochemical Society N.S. Miroslavova and S.J.W. Busby Figure 5 Base changes in the promoter −35 element affect UP element function. The bar chart shows measured b‑galactosidase activities during exponential growth of E. coli MC4100 ∆lac cells containing pRW50 plasmid carrying lac fusions to the NM535 promoter and its derivatives (no UP element; black bars) or to the NM534 promoter and its derivatives (with a UP element; grey bars). The derivatives of both promoters carry single base changes at positions 2–6 of the consensus −35 element, TTGACA, as indicated under the bar chart; the location of each substitution is underlined. Assays were performed as in Figure 2. Activities are expressed as a percentage of the activity with the starting NM535 promoter (100%; equivalent to 625 Miller units). Values are means+S.E.M. for three independent experiments. the Salmonella P22 ant promoter [20]. Thus different changes reduce promoter activities by different amounts. Secondly, the presence of the UP element resulted in larger increases in promoter activity at the promoters where the −35 elements caused the biggest decreases. Hence it is clear that the UP element can compensate for defects in the −35 element. To explore further the idea that the effects of the UP element are greater at weaker promoters, mutations that altered the consensus −10 element from TATAAT to CATAAT or TGTAAT were made in the NM534 and NM535 promoters carrying a consensus −35 element, TTGACA, and derivatives in which the −35 element was changed to either TCGACA or TTGGCA. The results illustrated in Figure 6 show that UP‑element‑dependent stimulation of promoter activity is greater at promoters that have a defective −35 element than at promoters with a defective −10 element. Conclusions The recognition of promoters by the multisubunit bacterial RNA polymerase involves four different protein determinants that recognize four different promoter elements. Our results show that an efficient promoter can be built by different © 2006 The Biochemical Society Modular structure of bacterial promoters Figure 6 Effects of −10 and −35 element sequences on UP element function. The bar chart shows measured β‑galactosidase activities during exponential growth of E. coli MC4100 ∆lac cells containing pRW50 plasmid carrying lac fusions to the NM535 promoter and its derivatives (no UP element; black bars) or to the NM534 promoter and its derivatives (with a UP element; grey bars). The derivatives of both promoters carry combinations of the −35 hexamer elements TTGACA, TCGACA and TTGGCA and the −10 hexamer elements TATAAT, CATAAT and TGTAAT, as indicated. The location of each substitution is underlined. Assays were performed as described in Figure 2. Activities are expressed as a percentage of the activity with the starting NM535 promoter (100%; equivalent to 6251 Miller units). Values are means+S.E.M. for three independent experiments. combinations of these contacts. It is also clear that promoter activity reaches a ceiling if too many contacts are created (see also [21]). Presumably there is a trade‑off between contacts being sufficiently strong for promoter binding, but not too strong so that promoter escape is hindered. Our results suggest that, to some extent, the −35 element and the UP element are redundant. Presumably this is because they are both involved in the very initial contacts between a target promoter and RNA polymerase [22]. Hence there is greater synergy between the UP element and the promoter −10 element. Interestingly, all naturally occurring promoters use sub‑optimal combinations of the different contacts [23]. There appear to be two reasons for this. First, because expression levels of genes need to be controlled, the ‘tuning’ of a promoter’s activity is more important than the optimization of its activity. Secondly, sub‑optimally active promoters can, if needed, be activated by transcription factors. Hence, in E. coli (and probably also in other organisms), the government of the cell favours modest promoters and rejects greedy ones! We thank the Wellcome Trust for funding this work with a project grant, and the Darwin Trust of Edinburgh for supporting N.M.’s Ph.D. programme. Automated DNA sequencing was performed by the University of Birmingham Functional Genomics Laboratory, supported by grant 6/JIF13209 from the U.K. Joint Infrastructure Fund. © 2006 The Biochemical Society 10 N.S. Miroslavova and S.J.W. Busby References 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. 12. 13. 14. 15. 16. 17. 18. 19. 20. 21. 22. 23. 24. Ebright, R.H. (2000) J. Mol. Biol. 304, 687–698 Gross, C.A., Chan, C., Dombroski, A., Gruber, T., Sharp, M., Tupy, J. and Young, B. (1998) Cold Spring Harbor Symp. Quant. Biol. 63, 141–155 Campbell, E.A., Muzzin, O., Chlenov, M., Sun, J.L., Olson, C.A., Weinman, O., Trester‑Zedlitz, M.L. and Darst, S.A. (2002) Mol. Cell 9, 527–539 Murakami, K.S., Masuda, S., Campbell, E.A., Muzzin, O. and Darst, S.A. (2002) Science 296, 1285–1290 Murakami, K.S. and Darst, S.A. (2003) Curr. Opin. Struct. Biol. 13, 31–39 Ishihama, A. (2000) Annu. Rev. Microbiol. 54, 499–518 Browning, D.F. and Busby, S.J.W. (2004) Nat. Rev. Microbiol. 2, 1–9 Gralla, J.D. (2005) Mol. Microbiol. 55, 973–977 Travers, A. and Muskhelishvili, G. (2005) Nat. Rev. Microbiol. 3, 157–69 Busby, S. and Ebright, R.H. (1994) Cell 79, 743–746 Bown, J.A., Barne, K.A., Minchin, S.D. and Busby, S.J.W. (1997) Nucleic Acids Mol. Biol. 11, 41–52 Ebright, R.H. and Busby, S. (1995) Curr. Opin. Genet. Dev. 5, 197–203 Gourse, R.L., Ross, W. and Gaal, T. (2000) Mol. Microbiol. 37, 687–695 Estrem, S.T., Gaal, T., Ross, W. and Gourse, R.L. (1998) Proc. Natl. Acad. Sci. U.S.A. 95, 9761–9766 Estrem, S.T., Ross, W., Gaal, T., Chen, Z.W.S., Niu, W., Ebright, R.H. and Gourse, R.L. (1999) Genes Dev. 13, 2134–2147 Burns, H.D., Belyaeva, T.A., Busby, S.J.W. and Minchin, S.D. (1996) Biochem. J. 317, 305–311 Busby, S. and Ebright, R.H. (1999) J. Mol. Biol. 293, 199–213 Lodge, J., Fear, J., Busby, S., Gunasekaran, P. and Kamini, N.R. (1992) FEMS Microbiol. Lett. 95, 271–276 Kobayashi, M., Nagata, K. and Ishihama, A. (1990) Nucleic Acids Res. 18, 7367–7372 Moyle, H., Waldburger, C. and Susskind, M.M. (1991) J. Bacteriol. 173, 1944–1950 Grana, D., Gardella, T. and Susskind, M.M. (1988) Genetics 120, 319–327 Sclavi, B., Zaychikov, E., Rogozina, A., Walther, F., Buckle, M. and Heumann, H. (2005) Proc. Natl. Acad. Sci. U.S.A. 102, 4706–4711 Lisser, S. and Margalit, H. (1993) Nucleic Acids Res. 21, 1507–1516 Ghosh, P., Ramakrishnan, C. and Chatterji, D. (2003) Biophys. Chem. 103, 223–237 © 2006 The Biochemical Society
© Copyright 2026 Paperzz