JB Accepts, published online ahead of print on 3 November 2006 J. Bacteriol. doi:10.1128/JB.01356-06 Copyright © 2006, American Society for Microbiology and/or the Listed Authors/Institutions. All Rights Reserved. Naturally occurring adenines within mRNA coding sequences affect ribosome binding and expression in Escherichia coli Authors: Jay E. Brock1, Robert L. Paz2, Patrick Cottle, and Gary R. Janssen* Running Title: Adenine stimulation of gene expression T P Key Words: translation initiation, ribosome binding, gene expression, adenine E C stimulation, translation enhancer Author Affiliation: 1Department of Microbiology Miami University 32 Pearson Hall Oxford, Ohio 45056 USA C A Phone: 513-529-5448 Fax: 513-529-2431 Email: [email protected] Address for correspondence: Gary Janssen Department of Microbiology Miami University 32 Pearson Hall Oxford, Ohio 45056 USA Phone: 513-529-5422 Fax: 513-529-2431 Email: [email protected] 2 The Fuqua School of Business Duke University Box 90120 Durham, NC 27708-0120 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 D E *corresponding author Abstract 1 2 Translation initiation requires the precise positioning of a ribosome at the start codon. The major signals of bacterial mRNA that direct the ribosome to a translational 4 start site are the Shine-Dalgarno (SD) sequence, within the untranslated leader, and the 5 start codon. Evidence for many non-SD-led genes in prokaryotes provides motive for 6 studying additional interactions between ribosomes and mRNA that contribute to 7 translation initiation. A high incidence of adenines have been reported downstream of 8 the start codon for many E. coli genes and addition of downstream adenine-rich 9 sequences increases expression from several genes in E. coli. We describe here site- D E T P E C 10 directed mutagenesis of the E. coli aroL, pncB and cysJ coding sequences to assess the 11 contribution of naturally occurring adenines to in vivo expression and in vitro ribosome 12 binding from mRNAs with different SD-containing untranslated leaders. Base 13 substitutions that decreased the downstream adenines by one or two nucleotides 14 decreased expression significantly from aroL-, pncB- and cysJ-lacZ fusions; mutations 15 increasing downstream adenines by one or two nucleotides increased expression 16 significantly from aroL- and cysJ-lacZ fusions. Using primer extension inhibition 17 (toeprint) and filter binding assays to measure ribosome binding, the changes in in vivo 18 expression correlated closely with changes in in vitro ribosome binding strength. Our 19 data are consistent with a model in which downstream adenines influence expression 20 through their effects on the mRNA-ribosome association rate and the amount of ternary 21 complex formed. This work provides evidence that adenine-rich sequence motifs might 22 serve as a general enhancer of E. coli translation. C A 2 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 3 23 Introduction 24 Initiation of bacterial protein synthesis requires the selection of an mRNA’s translation initiation region (TIR) and initiator tRNA (fMet-tRNAfMet) by the 30S 26 ribosomal subunit aided by the three initiation factors (IF1, IF2 and IF3). Initiation is the 27 rate-limiting step of translation, and the formation of ribosome/initiator tRNA/mRNA 28 ternary complexes is influenced by sequence and structural motifs in and around the 29 mRNA’s ribosome binding site (RBS) (33). Features of the mRNA RBS contributing to 30 the efficiency of ribosome binding and translation include the start codon (i.e., AUG most 31 frequently in Escherichia coli) and the purine-rich Shine-Dalgarno (SD) sequence, 32 located within the untranslated leader region (UTR), which base-pairs with the anti-SD 33 (ASD) sequence near the 3’ end of 16S rRNA (29, 11). In addition to the start codon and 34 SD:ASD interaction, other sequence and structural motifs within mRNA have been 35 suggested to influence ribosome binding and translation including mRNA secondary 36 structure within the TIR (6, 7), specific translation-enhancing sequences upstream (10, 37 21, 37, 12) and downstream (32, 9, 19, 26) to the start codon, upstream pyrimidine-rich 38 tracts (1, 42, 38), AU-rich sequences within the UTR (14, 15), and downstream A (3) and 39 CA-rich tracts (16). In addition, an increasing number of non-SD-led genes (2) and genes 40 encoding mRNA lacking a 5’-untranslated leader region (leaderless mRNA) (13, 18) are 41 being identified, raising the potential for novel sequence and structural motifs within the 42 coding sequence that contribute to the formation of translation initiation complexes. 43 Statistical analysis of E. coli translational start sites (24, 27) revealed that the D E T P E C C A 44 region downstream of the initiation codon for several genes contain CA- and/or A-rich 45 sequences. In a recent analysis of nucleotide sequences around the boundaries of all open 3 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 25 46 reading frames in E. coli, nucleotide biases were observed immediately downstream of 47 the initiation codon and the two most frequent second codons, AAA and AAU, were 48 found to enhance translational efficiency (26). In addition to these studies, adenine-rich 49 sequences have been proposed to enhance translation in E. coli, as demonstrated by the 50 increased expression observed for the human gamma interferon (γ-IFN) and 51 chloramphenicol acetyltransferase (cat) genes after the addition of A-rich motifs 52 downstream of the initiation codon (3). T P The insertion of CA multimers immediately downstream of the lacZ initiation 54 codon results in an increase in gene expression (16, 30; G. R. Janssen, unpublished data). 55 CA nucleotides, inserted downstream of the start codon for leaderless or SD-leadered 56 mRNAs, exerted a greater stimulation when close to the initiation codon and exhibited a 57 dose-dependent increase in expression with an increasing number of CA repeats. 58 Increased expression was observed also after the addition of CA repeats to SD-leadered 59 and leaderless neo, kan and gusA genes indicating that CA-rich sequences might serve as 60 a general enhancer of expression for leadered and leaderless mRNAs in E. coli (16). 61 E C C A The work of Chen et al. (3) and Martin-Farmer and Janssen (16) demonstrate that 62 the addition of adenines and CA repeats, respectively, can dramatically increase gene 63 expression. The experiments described here investigate the influence of naturally 64 occurring downstream adenines on in vivo expression and in vitro ribosome binding. The 65 mRNAs encoded by the E. coli aroL, pncB and cysJ genes include SD-containing 66 untranslated leaders and adenines downstream of their AUG start codons; site-directed 67 mutagenesis was used to decrease or increase the A-richness in order to generate putative 68 “down” or “up” mutations, respectively. The putative “down” mutations in aroL, pncB 4 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 53 D E 69 and cysJ mRNAs decreased both in vivo expression and in vitro ribosome binding, 70 whereas putative “up” mutations in the aroL and cysJ mRNAs increased both in vivo 71 expression and in vitro ribosome binding. Using toeprint, filter binding and expression 72 assays, our data show that downstream adenines contribute significantly to the rate and 73 amount of ternary complex formed as well as the in vivo expression levels for the aroL, 74 pncB and cysJ genes even in the presence of a canonical SD sequence. E C C A 5 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 T P D E Materials and Methods 75 Bacterial strains. E. coli DH5α [F-, phi80d, lacZM15∆ (lacZYA-argF), U169, recA1, 77 endA1, hsdR17 (rk-, mk+), supE44, lambda-, thi-1, gyrA, relA1] was used as the host 78 strain for all plasmid constructs. E. coli RFS859 [F-, thr-1, araC859, leuB6, ∆lac74, tsx- 79 274, lambda-, gyrA111, recA11, relA1, thi-1], a lac-deletion strain (28), was used as the 80 host for the expression and assay of β-galactosidase activity from lacZ reporter genes. E. 81 coli K12 total genomic DNA was used for the isolation of wild type (WT) aroL, cysJ and 82 pncB gene fragments. E. coli MRE600 (39) was used for the isolation of 30S ribosomal 83 subunits. 84 Reagents and recombinant DNA procedures. Radiolabelled nucleotides, [γ-32P]ATP 85 (6,000 Ci/mmol; 150 mCi/ml) and [α-32P]CTP (3,000 Ci/mmol; 10 mCi/ml), were 86 purchased from Perkin Elmer. Oligonucleotides were synthesized using a Beckman 87 1000M DNA synthesizer or purchased from commercial suppliers. Restriction 88 endonucleases, T4 DNA ligase, T4 polynucleotide kinase, T4 DNA polymerase and T7 89 RNA polymerase were obtained from New England Biolabs. Pfu DNA polymerase was 90 obtained from Stratagene. AMV reverse transcriptase was obtained from Life Sciences 91 and RNase-free DNase I was purchased from Ambion. All enzymes were used according 92 to the manufacturer’s recommendations. Plasmid isolation, E. coli transformations and 93 other DNA manipulations were carried out in a standard manner (25). 94 Mutant Constructions. The aroL-, cysJ- and pncB-lacZ fusions were constructed 95 containing either the lacZ untranslated leader or the gene’s natural untranslated leader 96 and the first sixteen codons of the coding sequence fused to a lacZ reporter gene. 97 Transcription of the lac fusions was provided by the E. coli lac promoter. Plasmids D E T P E C C A 6 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 76 (pBR322-derived) containing the aroL-, cysJ- and pncB-lacZ fusions were used as 99 templates for site-directed mutagenesis in which one oligonucleotide primer contained 100 the desired mutation(s) (Table 1) and the other primer annealing within the lacZ coding 101 sequence. After amplification, the PCR product was trimmed with appropriate restriction 102 enzymes to facilitate cloning, ultimately producing identical plasmids varying only by the 103 single or double nucleotide mutations made within the coding sequence as shown in 104 Table 1. All DNA regions generated by PCR amplification were verified by DNA 105 sequencing. 106 β-galactosidase assays. Triplicate cultures of plasmid-containing strains were grown in 107 2XYT (per liter, 16g of Difco Bacto Tryptone, 10 g of Difco Bacto yeast extract, 10 g of 108 NaCl, pH 7.4) supplemented with 200 µg/ ml ampicillin and 0.2 mM IPTG, at 37°C to an 109 O.D.600 of 0.4-0.6 and quick chilled on ice. Triplicate β-galactosidase assays (17) were 110 performed with each of the triplicate cultures. 111 Primer extension inhibition (toeprinting). The mRNAs used in primer extension 112 inhibition (toeprint) experiments were generated in vitro with T7 RNA polymerase. 113 DNA templates for T7 transcription were prepared by PCR amplification of miniprep 114 plasmid DNA. Oligonucleotide primers T7lac.lead (5’- 115 ggaattctaatacgactcactatagaattgtgagcggataacaatttc-3’) and lac.comp2 (5’- 116 attaagttgggtaacgccag-3’) were used to generate DNA fragments containing the T7 117 promoter, lac leader and cysJ sequences (with or without mutations) fused to lacZ from 118 the respective template plasmids. T7pncB.lead (5’- 119 ctaatacgactcactatagttcctgaagatgtttattgtac-3’) and lac.comp2 were used to generate DNA 120 fragments containing the T7 promoter, pncB leader and pncB sequences (with or without D E T P E C C A 7 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 98 mutations) fused to lacZ. T7aroL.lead (5’- 122 ggaattctaatacgactcactatagattgagattttcactttaagtgg-3’) and lac.comp2 were used to generate 123 DNA fragments containing the T7 promoter, aroL leader and aroL sequences (with or 124 with out mutations) fused to lacZ. After gel purification of the resulting PCR-generated 125 fragments, transcription reactions [15 mM DTT, 4 mM NTPs, 40 mM Tris-HCL (pH7.9), 126 20 mM MgCl2, 1 µg template DNA, and 1 µl T7 RNA polymerase (NEB; 500 U/ µl); in a 127 volume of 20 µl] were carried out at 37°C for 1 hour. This was followed by 1 µl of 128 RNase-free DNase I [2 U/µl] for 30 minutes at 37°C. In vitro synthesized mRNA was 129 then gel purified on 6% polyacrylamide 7M urea denaturing gels. After UV shadowing 130 and elution of the mRNA with 0.2% SDS, 0.005M EDTA and 0.5M NH4OAc, mRNA 131 was extracted with phenol:chloroform:isoamyl alcohol (25:24:1) and precipitated with 132 0.2 M NH4OAc and 2.5 volumes of EtOH. 133 D E T P E C C A Isolation of E. coli 30S ribosomal subunits and toeprint assays were done as 134 previously described (16). Briefly, 9 µl reactions containing mRNA (44nM) with a 32P- 135 labelled primer annealed to the 3’ end, 30S ribosomal subunits, tRNAfMet (1:10:20 ratio) 136 and 1XSB [60mM NH4Cl, 10mM Tris-OAc (pH 7.4), 10mM MgOAc, 6mM beta- 137 mercaptoethanol] were incubated at 37°C for 30 minutes or the indicated times. 138 Reactions were placed on ice and 1µl of reverse transcriptase (1 U/µl) was added 139 followed by a 37°C incubation for 10 minutes. Reactions were precipitated with 40 µl of 140 0.3M NaOAc and 2.5 volumes of EtOH for at least 2.5 hours at -20°C and 141 electrophoresed on 6% polyacrylamide (7M urea) gels. Dideoxy sequencing reactions 142 mapped toeprint signals to the +16 position of mRNA relative to the first base of the start 143 codon (+1). Toeprint signals were quantified with a Molecular Dynamics 8 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 121 PhosphorImager (Storm 800) and expressed as relative toeprint complexes (RTC) 145 [toeprint pixel value/(toeprint pixel value + full-length pixel value)] or [toeprint pixel 146 value/(toeprint pixel value + full-length pixel value + upstream signal pixel value)] for 147 aroL RTCs. Comparisons of RTCs between different mRNAs were made from reactions 148 electrophoresed on the same gel. 149 Filter binding assays. mRNA used in filter binding assays was synthesized in a 5 µl 150 reaction containing 5mM DTT, 2mM NTPs, 14mM MgCl2, 0.5 µl 10X T7 RNA 151 polymerase buffer (NEB), 0.05 µg template DNA, 0.35 µl T7 RNA polymerase (NEB; 152 500 U/ µl) and 2.3 µl [α-32P]CTP. Filter binding reactions (10 µl) containing 153 radiolabeled mRNA (44nM), 30S ribosomal subunits, tRNAfMet (1:10:20 ratio) and 1XSB 154 were incubated for various amounts of time at 37°C. Reaction mixtures were then diluted 155 to 500 µl with chilled 1XSB and filtered through a nitrocellulose membrane (0.45-µm 156 pore size) in a Mini-Fold slot blot manifold (Schleicher and Schuell) followed by the 157 washing of each well with 3 ml of 1XSB. Membranes were then dried at room 158 temperature and samples cross-linked to the membranes by UV light (Fisher Scientific, 159 FB-UVXL-1000). The amount of complex bound to the membrane was quantified with a 160 Molecular Dynamics PhosphorImager (Storm 800); standard curves converted pixel 161 values to picomoles, as previously described (4), and data expressed as picomoles of 162 mRNA bound. D E T P E C C A 9 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 144 163 Results 164 All base substitutions in the aroL, pncB and cysJ genes were made via sitedirected mutagenesis and maintain the natural amino acid sequence; putative “down” 166 mutations decrease the downstream adenine content whereas putative “up” mutations 167 increase the downstream adenine content (Table 1). The effects of the mutations on gene 168 expression were assessed by translational fusions of aroL, pncB and cysJ gene fragments 169 to an E. coli lacZ reporter gene followed by β-galactosidase assays. Toeprint and filter 170 binding assays were used to compare the rate of ternary complex formation, defined here 171 as the amount of complex formed (ribosomes/tRNA/mRNA) over time, for aroL, pncB 172 and cysJ mRNAs with or without mutations that alter the downstream adenine content 173 (Table 1). In toeprint assays, variations in a specific RTC might be observed between 174 different assays but the correlation between wild type and mutant binding strengths were 175 consistent and reproducible. These assays were performed in an effort to correlate the in 176 vitro ribosome binding strength of aroL, pncB and cysJ mRNAs with the in vivo 177 expression data. 178 Downstream adenines of aroL 179 D E T P E C C A Transcription of the aroL gene, encoding shikimate kinase II, results in an mRNA 180 with a 124 nucleotide leader containing an optimally spaced (7-9 nucleotides), canonical 181 SD sequence (5). Mutations in aroL’s coding sequence altering the downstream adenine 182 content are shown in Table 1. 183 i. The effect of aroL downstream adenines on in vivo expression 184 185 Comparisons of β-galactosidase activity between cells containing the wild type (pAaroL.WT) or the putative “down” aroL-lacZ constructs revealed that the A→G 10 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 165 substitutions in codons two and three (pAaroL.DN1) decreased lacZ expression 72%; 187 incorporation of the individual substitutions resulted in a 15% (pAaroL.DN1a) and 44% 188 (pAaroL.DN1b) decrease in lacZ expression (Figure 1-A). The putative “up” aroL-lacZ 189 construct revealed that the U→A substitution at codon four (pAaroL.UP1) increased lacZ 190 expression 38% (Figure 1-A). These results indicate that the downstream adenines 191 influence aroL expression significantly, despite the presence of a canonical SD:ASD 192 interaction. 193 ii. The effect of aroL downstream adenines on in vitro ribosome binding 194 D E T P Toeprint assays with aroL-lacZ mRNA and 30S subunits revealed a tRNA- E C 195 dependent signal corresponding to position +16 relative to the A (+1) of the AUG start 196 codon (Figure 2-A, lanes 7-9). Phosphorimage analysis of toeprint signal intensity 197 revealed a 97% reduction from the putative “down” mutant mRNA (Figure 2-B, lane 15) 198 compared to wild type (Figure 2-B, lane 14). Incorporation of the U→A putative “up” 199 mutation at codon four increased the toeprint signal 278% (Figure 2-B, lane 16) 200 compared to wild type (Figure 2-B, lane 14). The toeprint results, when taken with the 201 aroL-lacZ expression levels, show a positive correlation between the downstream adenine 202 content, in vitro ribosome binding and in vivo expression levels. 203 iii. The effect of aroL downstream adenines on the rate and amount of ternary complex 204 formed 205 C A Incorporation of the A→G putative “down” mutations at aroL’s second and third 206 codons (Figure 3-A, lanes 10-19) resulted in a reduced rate and amount of ternary 207 complex formation compared to wild type (Figure 3-A, lanes 1-9), as quantified in Figure 208 3-B. Incorporation of the U→A putative “up” mutation at aroL’s fourth codon (Figure 3- 11 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 186 A, lanes 20-29) resulted in an increased rate and amount of ternary complex formation 210 compared to wild type (Figure 3-A, lanes 1-9), as quantified in Figure 3-B. The 211 significant amount of toeprint signal at t=0 (Figure 3-A and B) reflects the complex 212 formed and cDNA produced during the ten minute incubation time for reverse 213 transcriptase. An additional uncharacterized ternary complex signal is observed within 214 aroL’s untranslated leader and is noted with an asterisk (*) in Figure 3-A. Because this 215 signal occurs upstream to the aroL toeprint signal (arrow, Figure 3-A), it was included 216 with the full length signal for phosphorimage quantification of RTC values in Figure 3-B. 217 Calculation of RTC values without the additional band (*) resulted in higher RTC values 218 but the line slopes, relative to each other, were basically unchanged (data not shown). 219 D E T P E C In order to get a more accurate assessment of complex formation at earlier time 220 points, filter binding assays were used. Filter binding assays measure complex formation 221 based on the amount of mRNA bound by ribosomes when filtered through a 222 nitrocellulose membrane. Incorporation of the A→G putative “down” mutations at 223 aroL’s second and third codons reduced the rate and amount of ternary complex formed, 224 whereas the U→A putative “up” mutation at codon four increased the amount of ternary 225 complex formed compared to wild type (Figure 3-C). Ternary complex formation with 226 the “up” mutant demonstrated a similar rate to the wild type during the first 60 seconds of 227 incubation, after which the amount of product formed continued to increase with the “up” 228 mutant but not with the wild type (Figure 3-C). The toeprint and filter binding results 229 provide evidence that downstream adenines enhance the rate and/or amount of ternary 230 complex formation with aroL mRNA. C A 12 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 209 231 232 Downstream adenines of pncB Transcription of the pncB gene, encoding a nicotinic acid 233 phosphoribosyltransferase, results in an mRNA with a 58 nucleotide untranslated leader 234 containing a canonical SD sequence with suboptimal spacing (12 nucleotides) from the 235 start codon (40) and identical second and third codons to aroL. Mutations in pncB’s 236 coding sequence altering the downstream adenine content are shown in Table 1. 237 i. The effect of pncB downstream adenines on in vivo expression T P Comparisons of β-galactosidase activities between cells containing the wild type 239 (pApncB.WT) or the putative “down” pncB-lacZ constructs revealed that the A→G 240 substitutions in codons two and three (pApncB.DN1) decreased lacZ expression 99.9%; 241 incorporation of the individual substitutions resulted in an 89% (pApncB.DN1a) and 92% 242 (pApncB.DN1b) decrease in lacZ expression (Figure 1-B). The putative “up” pncB-lacZ 243 construct revealed that the U→A substitutions in codons five and six (pApncB.UP1) 244 unexpectedly decreased lacZ expression 67% (Figure 1-B). 245 E C C A To determine if altering the pncB downstream adenine content has a similar effect 246 on expression irrespective of the untranslated leader sequence, analogous pncB-lacZ 247 coding sequence fusions were placed downstream of the E. coli lacZ untranslated leader 248 containing a canonical SD sequence with optimal spacing (7 nucleotides) from the start 249 codon. Putative “down” mutations in codons two and three of the lac-leadered pncB-lacZ 250 mRNA reduced gene expression 87% relative to wild type (data not shown). These 251 results indicate that the properly spaced canonical SD sequence of the lac leader could 252 not fully compensate for the loss of downstream adenines. 13 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 238 D E 253 ii. The effect of pncB downstream adenines on in vitro ribosome binding 254 Toeprint assays with pncB-lacZ mRNA and 30S subunits revealed a tRNAdependent signal corresponding to position +16 relative to the A (+1) of the AUG start 256 codon (Figure 2-A, lanes 1-3). pncB-lacZ mRNA containing putative “down” mutations 257 at codon three (Figure 2-B, lane 4) or codons two and three (Figure 2-B, lane 5) nearly 258 eliminated ribosome binding (toeprint signal reduced >95%) compared to wild type 259 (Figure 2-B, lane 3). Incorporation of the putative “up” mutations at codons five and six 260 resulted in a 62% reduction in toeprint signal intensity (Figure 2-B, lane 6) compared to 261 wild type (Figure 2-B, lane 3). The toeprint results, when taken with the pncB-lacZ 262 expression levels, show a positive correlation between in vitro ribosome binding and the 263 in vivo expression levels. 264 iii. The effect of pncB downstream adenines on the rate and amount of ternary complex 265 formed 266 D E T P E C C A Incorporation of the A→G putative “down” mutation at pncB’s third codon 267 (Figure 4-A, lanes 9-16) resulted in a dramatically reduced rate and amount of ternary 268 complex formation when compared to wild type (Figure 4-A, lanes 1-8), as quantified in 269 Figure 4-B. Bands observed in the absence of tRNA and ribosomes are artifacts likely 270 due to structures formed within the mRNA. When analyzed by filter binding assays, 271 incorporation of the A→G putative “down” mutation at pncB’s third codon resulted in a 272 reduced rate and amount of ternary complex formation compared to wild type (Figure 4- 273 C). The toeprint and filter binding results provide evidence that downstream adenines 274 contribute to the rate and amount of ternary complex formation with pncB mRNA. 14 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 255 275 Downstream adenines of cysJ The E. coli cysJ gene, encoding a NADPH-sulfite reductase flavoprotein 277 component (22), has different second and third codons from aroL and pncB. Cloning a 278 DNA fragment encoding the cysJ untranslated leader and a fragment of cysJ coding 279 sequence into a multicopy plasmid caused a dramatic reduction in plasmid copy number 280 (data not shown); therefore, cysJ-lacZ coding sequence fusions were placed downstream 281 of the 38 nucleotide lacZ untranslated leader containing a canonical SD sequence with 282 optimal spacing (7-9 nucleotides) from the cysJ start codon. Mutations in cysJ’s coding 283 sequence altering the downstream adenine content are shown in Table 1. 284 i. The effect of cysJ downstream adenines on in vivo expression D E T P E C 285 Comparisons of β-galactosidase activities between cells containing the wild type 286 (pAZcysJ.WT) or the putative “down” (pAZcysJ.DN1) cysJ-lacZ construct revealed that 287 the third codon A→G base substitution resulted in a 77% decrease in lacZ expression 288 (Figure 1-C). The putative “up” cysJ-lacZ constructs revealed that the G→A 289 substitutions in codons two and four (pAZcysJ.UP1) increased lacZ expression 525%; 290 incorporation of the individual substitutions resulted in a 174% (pAZcysJ.UP1a) and 291 225% (pAZcysJ.UP1b) increase in expression (Figure 1-C). 292 ii. The effect of cysJ downstream adenines on in vitro ribosome binding 293 C A Toeprint assays with cysJ-lacZ mRNA and 30S subunits revealed a tRNA- 294 dependent signal corresponding to position +16 relative to the A (+1) of the AUG start 295 codon (Figure 2-A, lanes 4-6). Phosphorimage analysis of toeprint signal intensity 296 revealed an 86% reduction from the putative “down” mutant (Figure 2-B, lane 10) 297 compared to wild type (Figure 2-B, lane 9). Incorporation of putative “up” mutations at 15 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 276 298 codons two and four resulted in a 202% increase in toeprint signal intensity (Figure 2-B, 299 lane 11) compared to wild type (Figure 2-B, lane 9). The toeprint results, when taken 300 with the cysJ-lacZ expression levels, show a strong correlation between the downstream 301 adenine content, in vitro ribosome binding and in vivo expression levels. 302 iii. The effect of cysJ downstream adenines on the rate and amount of ternary complex 303 formed 304 D E Incorporation of the A→G putative “down” mutation at cysJ’s third codon (Figure 5-A, lanes 10-18) resulted in a reduced rate and amount of ternary complex formation 306 compared to wild type (Figure 5-A, lanes 1-9), as quantified in Figure 5-B. Incorporation 307 of the G→A putative “up” mutations at cysJ’s second and fourth codons (Figure 5-A, 308 lanes 19-27) resulted in an increased rate and amount of ternary complex formation 309 compared to wild type (Figure 5-A, lanes 1-9), as quantified in Figure 5-B. 310 Phosphorimage analysis of the full length, run-off signal yielded consistently high pixel 311 values, resulting in low RTC values; however, the rate comparisons between mutant and 312 WT mRNAs still correlated well with results visualized by autoradiography. When 313 analyzed by filter binding assays, incorporation of the A→G putative “down” mutation at 314 cysJ’s third codon resulted in a similar rate and amount of ternary complex formation as 315 with wild type mRNA during the first three minutes of incubation; after three minutes, 316 the rate and amount of ternary complex formed was reduced compared to wild type 317 (Figure 5-C). Incorporation of the G→A putative “up” mutations at cysJ’s second and 318 fourth codons resulted in an increased rate and amount of ternary complex formation 319 compared to wild type, with the most significant difference in rate occurring within the 320 initial three minutes of binding (Figure 5-C). The toeprint and filter binding results E C C A 16 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 T P 305 321 provide evidence that downstream adenines enhance the rate and amount of ternary 322 complex formed with cysJ mRNA. T P E C C A 17 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 D E Discussion 323 324 The correlation of adenine-rich regions with efficient translation in E. coli was first noted by Dreyfus (8), with later studies reporting stimulatory effects on translation 326 by the addition of adenines (3, 16, 30). We report here that adenines occurring naturally 327 downstream of the initiation codon can contribute significantly to ribosome binding and 328 expression in E. coli. Our data show that downstream adenines can exert major effects 329 on expression even in the presence of canonical SD sequences. Demonstration that 330 downstream nucleotides can function as major effectors of expression is especially 331 interesting in light of a recent report (2) that non-SD led genes, including leaderless 332 mRNAs, are as common as SD-led genes in prokaryotes. The absence of a SD sequence 333 requires that other sequence or structural features of the mRNA help direct the ribosome 334 to the correct initiation site. From work described here, the position and number of 335 downstream adenines may contribute to the ribosome binding strength of mRNAs with, 336 and possibly without, canonical SD sequences. 337 D E T P E C C A Our experiments investigate the influence of naturally occurring downstream 338 adenines on in vitro ribosome binding and in vivo expression for the E. coli aroL, pncB 339 and cysJ mRNAs. When compared to the wild type mRNAs, the putative “down” 340 mutations significantly decreased in vitro ribosome binding and in vivo expression; 341 putative “up” mutations in aroL and cysJ significantly increased in vitro ribosome 342 binding and in vivo expression. These data are consistent with earlier reports whereby A- 343 rich sequences added immediately downstream of the start codon stimulated expression 344 from several natural and heterologous genes in E.coli (16, 3). While the pncB “down” 345 mutations demonstrate clearly that adenines in codons two and three are important for 18 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 325 346 expression, it is unclear why the putative “up” mutations in codons five and six did not 347 increase expression; possible explanations are that the substitutions reside too far from 348 the start codon (i.e., compared to codons two and four for cysJ and aroL ”up” mutations) 349 or the mutations resulted in a secondary structure that reduced access of a ribosome to the 350 initiation region. 351 Downstream adenines affect rate and amount of ternary complex formed. 352 D E Using toeprint and filter binding assays as two independent approaches to estimate ribosome binding, we observed a strong correlation between the downstream 354 adenine content, in vivo expression and the rate and amount of in vitro ribosome binding, 355 suggesting strongly that the variations in expression were mediated through the observed 356 effects on ribosome binding. Toeprint assays revealed also that aroL, pncB and cysJ 357 ternary complexes, once formed, did not dissociate in the presence of a competitor 358 mRNA (data not shown), in agreement with a previous report that ternary complexes are 359 stable and irreversible in the presence of competitor mRNA (31). Therefore, downstream 360 adenines appear to influence expression through their affect on mRNA-ribosome 361 association and not by preventing ternary complex dissociation. 362 Possible contributions of downstream adenines to ribosome binding and expression 363 Comparison of the codons resulting from aroL, pncB and cysJ mutagenesis to an 364 E. coli genomic codon usage table (GenBank) suggests that the effects on expression are 365 not due to the introduction of rare codons or to low tRNA availability. Also, 366 conservation of the wild type amino acid sequence ensured that expression differences 367 from the mutant constructs were not the result of an altered peptide sequence. 368 Furthermore, the strong correlation between in vitro ribosome binding and in vivo E C C A 19 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 T P 353 369 expression, for both wild type and mutants, suggests the downstream adenines exert their 370 affect at initiation rather than elongation. 371 It is possible that the adenine effects seen here are due to structure within the TIR. It has been reported that secondary structure within the TIR may inhibit expression, 373 whereas a more open TIR with less structure allows increased expression (6, 7). In an 374 effort to explore the possible contribution of structure to our results, various lengths of 375 the cysJ-, pncB- and aroL-lacZ mRNAs, with or without mutations, were subjected to 376 MFOLD analysis (43) for prediction of mRNA secondary structures by energy 377 minimization. While slight variations in the predicted structures resulted from the 378 downstream base substitutions, a general correlation of more closed or open structures 379 with respective decreases or increases in expression and ribosome binding was not 380 observed (data not shown). In addition, comparing the signals from reverse transcriptase 381 pausing or terminating at inhibitory secondary structures during toeprint assays does not 382 suggest significant differences in structure between wild type and mutant mRNAs 383 (Figures 3-5). Another possibility involving structure is the formation of adenine-rich 384 pseudoknots, described previously as high-affinity RNA ligands to 30S subunits and 385 ribosomal protein S1 (23). While a contribution by downstream adenines to a stimulatory 386 secondary structure has not been discounted, we do not have any data to support this 387 model. 388 D E T P E C C A It is also possible that downstream adenines, perhaps in association with 389 pyrimidines (16), provide a region within the RBS that is recognized specifically by 390 ribosome-associated proteins that promote initiation complex formation. For example, 391 the 30S subunit protein S1 is a strong RNA binding protein reported to concentrate 20 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 372 mRNA near the ribosome decoding site (36, 35, 1). Previous studies revealed the binding 393 of S1 to various nucleotide motifs within the untranslated leader including an AU-rich 394 omega sequence (10, 1, 14, 15), a CAU-rich omega-like sequence (38) and a (CAA)n 395 repeat (38). From these data it is conceivable that S1 could bind downstream A-rich 396 regions for a more efficient delivery of mRNA to the ribosome decoding site; however, 397 there has been no direct evidence that S1 binds specific nucleotide motifs downstream of 398 the start codon. Interaction between downstream A-rich regions and 16S rRNA might 399 also contribute to mRNA-ribosome association. In possible support of this, poly (A) 400 RNA formed crosslinks to 16S rRNA nucleotides 1394-1399, located adjacent to the 401 ribosomal P-site (34). 402 D E T P E C Crystal structures of ribosome:mRNA complexes provide evidence that mRNA 403 positions -3 to +10 (with the A of the AUG start codon as +1) are wrapped closely around 404 the neck of the 30S subunit, with most ribosome contacts involving the mRNA backbone 405 rather than its bases (41). However, these crystal structures represent stable complexes 406 and lend little information on interactions occurring during ribosome loading that lead to 407 a stable complex. 408 C A Recent experiments from our lab also suggest contact between a natural adenine- 409 rich sequence of a leaderless mRNA and proteins associated with E. coli 30S subunits. In 410 this work, leaderless mRNA encoded by the cI gene of bacteriophage lambda, with a 411 photoactivatable 4-thio uridine in the AUG start codon, has been crosslinked to 30S 412 subunit proteins (J. E. Brock and G. R. Janssen, unpublished data). Crosslinking is 413 inhibited, however, by a competitor cI mRNA that lacked a start codon but contains the 414 natural adenine-rich sequence immediately downstream of the start codon position. Also, 21 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 392 415 the same cI competitor mRNA inhibits ribosome binding to wild type cI mRNA in 416 toeprint assays suggesting that the adenines contribute to an interaction between mRNA 417 and components of the ribosome that facilitate ternary complex formation. Additional 418 support for downstream adenine stimulation of leaderless mRNA translation in E. coli is 419 provided by a report (16) where addition of CA multimers increased expression from 420 leaderless lacZ, gusA and neo mRNAs. 421 Implications and application of downstream adenines for expression T P Although the mechanistic details for adenine-enhanced ribosome binding and 423 expression remain to be determined, their ability to stimulate translation from leaderless 424 mRNA (16) indicates that an untranslated leader or SD:ASD interaction is not required. 425 Even in the presence of a canonical SD:ASD interaction, downstream adenines 426 significantly influenced ribosome binding and expression from aroL, pncB and cysJ 427 mRNAs. The first five codons of coding sequence are protected by a ribosome during 428 formation of an initiation complex and thereby represent potential contacts that could 429 contribute to an mRNA’s ribosome binding strength; however, constraints imposed by 430 the encoded amino acid sequence has made it difficult to consider sequence-specific 431 translation enhancers in this region of the mRNA. Genetic code degeneracy and “wobble 432 position” variability, however, could allow for a bias towards adenines within the first 433 few codons while imposing minimal constraints on the encoded amino acids. 434 Downstream adenines, possibly in conjunction with the start codon or other features of 435 the mRNA, might present a recognition motif (sequence and/or structural) to a 436 component of the translational machinery and contribute importantly to the ribosome E C C A 22 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 422 D E 437 binding strength, with the number and position of adenines allowing for “fine-tuning” of 438 expression. 439 If downstream adenines merely provide an open structure for access of the start codon to ribosomes, then one might expect adenines to stimulate expression in a variety 441 of translation systems; however, addition of downstream adenines to a neo reporter gene 442 did not increase expression from the G+C-rich, gram-positive Streptomyces lividans (J. 443 M. Day and G. R. Janssen, unpublished data). The possibility that adenine stimulation 444 might not occur in G+C-rich organisms argues that the enhanced ribosome binding and 445 expression is at least partially based on sequence and suggests that adenine stimulation of 446 translation might be limited to E. coli and related organisms. Using crosslinking 447 techniques to identify downstream adenine/ribosome interactions, along with further 448 structural studies, may provide evidence to distinguish between sequence and structural 449 components of adenine stimulation. 450 D E T P E C C A Characterization of the adenine effect might allow also for simple engineering of 451 expression levels, up or down, by increasing or decreasing downstream adenines. 452 Identification of mRNA features contributing to ribosome binding and translation, from 453 coding sequences with or without SD sequences (2), is essential to understanding these 454 important stages of gene expression in E. coli and other organisms. 23 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 440 References 455 456 1. Boni, I.V., D. M. Isaeva, M. L. Musychenko, and N. V. Tzareva. 1991. 457 Ribosome-messenger recognition: mRNA target sites for ribosomal protein S1. 458 Nucleic Acids Res. 19:155-162. 459 D E 2. Chang, B., S. Halgamuge, and S. Tang. 2006. Analysis of SD sequences in 460 completed microbial genomes: Non-SD-led genes are as common as SD-led 461 genes. Gene 373:90-99. T P 3. Chen, H., L. Pomeroy-Cloney, M. Bjerknes, J. Tam, and E. Jay. 1994. The 463 influence of adenine rich motifs in the 3’ portion of the ribosome binding site on 464 human IFN-γ gene expression in Escherichia coli. J. Mol. Biol. 240:20-27. 465 466 467 468 469 470 471 E C 4. Day, M. J., and G. R. Janssen. 2004. Isolation and characterization of ribosomes and translational initiation factors from the gram-positive soil bacterium C A Streptomyces lividans. J. Bacteriol. 186:6864-6875. 5. DeFeyter, R. C., B. E. Davidson, and J. Pittard. 1986. Nucleotide sequence of the transcription unit containing the aroL and aroM genes from Escherichia coli K-12. J. Bacteriol. 165:233-239. 6. de Smit, M.H., and J. van Duin. 1990a. Control of prokaryotic translational 472 initiation by mRNA secondary structure. Proc. Nucleic Acid Res. Mol. Biol. 38:1- 473 35. 474 7. de Smit, M.H., and J. van Duin. 1990b. Secondary structure of the ribosome 475 binding site determines translational efficiency: a quantitative analysis. Proc. Natl. 476 Acad. Sci. USA 87:7668-7672. 24 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 462 477 478 479 8. Dreyfus, M. 1988. What constitutes the signal for the initiation of protein synthesis on Escherichia coli mRNAs? J. Mol. Biol. 204(1):79-94. 9. Etchegaray, J.P., and M. Inouye. 1999. Translational enhancement by an 480 element downstream of the initiation codon in Escherichia coli. J. Biol. Chem. 481 274:10079-10085. 10. Gallie, D.R., D. E. Sleat, J. W. Wyatts, P. C. Turner, and T. M. A. Wilson. 483 1987. The 5 prime leader sequence of tobacco mosaic virus RNA enhances the 484 expression of foreign gene transcripts in vitro and in vivo. Nucleic Acids Res. 485 15:3257-3273. T P E C 486 11. Gualerzi, C. O., L. Brandi, E. Caserta, A. La Teana, R. Spurio, J. Tomsic, 487 and C. L. Pon. 2000. Translation initiation in bacteria, p. 477-494. In R. A. 488 Garrett, S. R. Douthwaite, A. Liljas, A. T. Matheson, P. B. Moore, and H. F. 489 490 491 492 493 494 C A Noller (ed.), The ribosome: structure, function, antibiotics, and cellular interactions. ASM Press, Washington, D.C. 12. Ivanov, I.G., R. Alexandrova, B. Dragulev, D. Leclerc, A. Saraffova, V. Maximova, and M. G. Abouhaidar. 1992. Efficiency of the 5’-terminal sequence (omega) of tobacco mosaic virus RNA for the initiation of eukaryotic gene translation in Escherichia coli. Eur. J. Biochem. 209:151-156. 495 13. Janssen, G.R. 1993. Eubacterial, archaeabacterial, and eukaryotic genes that 496 encode leaderless mRNA, p. 59-67. In R. Baltz, G. D. Hegeman, and P. L. 497 Skatrud (ed.), Industrial microorganisms: basic and applied molecular genetics. 498 American Society for Microbiology, Washington, D.C. 25 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 D E 482 499 14. Komarova, A. V., L. S. Tchufistova, E. V. Supina, and I. V. Boni. 2002. 500 Protein S1 counteracts the inhibitory effect of the extended Shine-Dalgarno 501 sequence on translation. RNA 8:1137-1147. 502 15. Komarova, A. V., L. S. Tchufistova, M. Dreyfus, and I. V. Boni. 2005. AU- D E 503 rich sequences within 5’ untranslated leaders enhance translation and stabilize 504 mRNA in Escherichia coli. J. Bacteriol. 187:1344-1349. 505 16. Martin-Farmer, J. A., and G. R. Janssen. 1999. A downstream CA repeat sequence increases translation from leadered and unleadered mRNA in 507 Escherichia coli. Mol. Microbiol. 31:1025-1038. 508 509 510 511 512 513 514 515 516 517 E C 17. Miller, J. 1992. In A Short Course in Bacterial Genetics. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY. 18. Moll, I., S. Grill, C. O. Gualerzi, and U. Blasi. 2002. Leaderless mRNAs in C A bacteria: surprises in ribosomal recruitment and translational control. Mol. Microbiol. 43:239-246. 19. O’Conner, M., T. Asai, C. L. Squires, and A. E. Dahlberg. 1999. Enhancement of translation by the downstream box does not involve base pairing of mRNA with the penultimate stem sequence of 16S rRNA. Proc. Natl. Acad. Sci. USA 96:8973-8978. 20. O’Donnell, S. M., and G. R. Janssen. 2001. The initiation codon effects 518 ribosome binding and translational efficiency in Escherichia coli of cI mRNA 519 with or without the 5’ untranslated leader. J. Bacteriol. 183:1277-1283. 26 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 T P 506 520 21. Olins, P.O., and H. S. Rangwala. 1989. A novel sequence element derived from 521 bacteriophage T7 mRNA acts as an enhancer of translation of the lacZ gene in E. 522 coli. J. Mol. Biol. 264:16973-16976. 523 22. Ostrowski, J., and N. M. Kredich. 1989. Molecular characterization of the cysJIH promoters of Salmonella typhimurium and Escherichia coli: regulation by 525 cysB protein and N-acetyl-L-serine. J. Bacteriol. 171:130-140. 526 23. Ringquist, S., T. Jones, E. E. Snyder, T. Gibson, I. Boni, and L. Gold. 1995. 527 High-affinity RNA ligands to Escherichia coli ribosomes and ribosomal protein 528 S1: comparison of natural and unnatural binding sites. Biochemistry 34:3640- 529 3648. T P E C 530 24. Rudd, K. E., and T. D. Schneider. 1992. Compilation of E. coli ribosome 531 binding sites, p. 17.19-17.46. In J. Miller (ed.), A Short Course in Bacterial 532 533 534 535 536 C A Genetics. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY. 25. Sambrook, J., E. F. Fritsch, T. Maniatis. 1989. Molecular Cloning: A Laboratory Manual Second Edition. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY. 26. Sato, T., M. Terabe, H. Watanabe, T. Gojobori, C. Hori-Takemoto, and K. 537 Miura. 2001. Codon and base biases after the initiation codon of the open reading 538 frames in the Escherichia coli genome and their influence on the translation 539 efficiency. J. Biochem. 129:851-860. 27 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 D E 524 540 27. Scherer G. F. E., M. D. Walkinshaw, S. Arnott, and D. J. Morr. 1980. The 541 ribosome-binding sites recognized by E. coli ribosomes have regions with signal 542 character in both the leader and protein coding segments. Nucleic Acids Res. 543 8:3895-3907. 544 545 D E 28. Schleif, R. 1972. Fine-structure deletion map of the Escherichia coli L-arabinose operon. Proc. Natl. Acad. Sci. USA 11:3479-84. 29. Shine, J., and L. Dalgarno. 1974. The 3’-terminal sequence of Escherichia coli 547 16S ribosomal RNA: complementarity to nonsense triplets and ribosome binding 548 sites. Proc. Natl. Acad. Sci. USA 71:1342-1346. 549 550 551 552 553 554 555 556 557 558 T P E C 30. Slovek, L. E. 1999. M.S. thesis. Miami University, Oxford. Ribosomal binding and enhanced translation of CA-containing mRNA in Escherichia coli. 31. Spedding, G., T. Gluick, and D. Draper. 1993. Ribosome initiation complex C A formation with the pseudoknotted α operon messenger RNA. J. Mol. Biol. 229:609-622. 32. Sprengart, M.L., H. P. Farscher, and E. Fuchs. 1990. The initiation of translation in E. coli: apparent base pairing between the 16S rRNA and the downstream sequences of the mRNA. Nucleic Acid Res. 18:1719-1723. 33. Steitz, J. A. 1969. Polypeptide chain initiation: nucleotide sequences of the three ribosomal binding sites in bacteriophage R17 RNA. Nature 224:957-964. 559 34. Stiege, W., K. Stade, D. Schuller, and R. Brimacombe. 1988. Covalent cross- 560 linking of poly (A) to Escherichia coli ribosomes, and localization of the cross- 561 link site within the 16S rRNA. Nucleic Acids Res. 16:2369-2388. 28 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 546 562 563 564 35. Subramanian, A. R. 1984. Structure and function of the largest Escherichia coli ribosomal protein. TIBS 9:491-494. 36. Suryanarayana, T., and A. R. Subramanian. 1983. An essential function of 565 ribosomal protein S1 in messenger ribonucleic acid translation. Biochemistry 566 22:2715-2719. 567 D E 37. Thanaraj, T.A., and M. W. Pandit. 1989. An additional ribosome binding site on mRNA of highly expressed genes and a bifunctional site on the colicin 569 fragment of 16S rRNA from E. coli: important determinants of the efficiency of 570 translation initiation. Nucleic Acid Res. 18:2973-2985. 571 T P E C 38. Tzareva, N.V., V. I. Makhno, and I. V. Boni. 1994. Ribosome-messenger 572 recognition in the absence of the Shine-Dalgarno interactions. FEBS Lett. 573 337:189-194. 574 575 576 577 578 579 580 C A 39. Wade, H. E., and H. K. Robinson. 1966. Magnesium ion-independent ribonucleic acid depolymerases in bacteria. Biochem. J. 101:467-479. 40. Wubbolts, M. G., P. Terpstra, J. B. van Beilen, J. Kingma, H. A. R. Meesters, and B. Witholt. 1990. Variation of cofactor levels in Escherichia coli. J. Biol. Chem. 265:17665-17672. 41. Yusupova, G. Z., M. M. Yusupova, J. H. D. Cate, H. F. Noller. 2001. The path of messenger RNA through the ribosome. Cell 106:233-241. 581 42. Zhang, J., and M. P. Deutscher. 1991. A uridine rich sequence required for 582 translation of prokaryotic mRNA. Proc. Natl. Acad. Sci. USA 89:2605-2609. 583 584 43. Zuker, M. 22 Aug 2006. MFOLD version: 5.a (17:13). Rensselaer Polytechnic Institute. [Online.] http://bioweb.pasteur.fr/cgi-bin/seqanal/mfold.pl 29 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 568 618 619 620 621 622 Table 1. DNA sequence of gene fragments, with or without mutations, from the aroL, pncB and cysJ genes. Plasmid a Sequence b aroL-lacZ fusions pAaroL.WT: 5’…tggggaaaacccacgATG pAaroL.DN1: …ATG pAaroL.DN1a: …ATG pAaroL.DN1b: …ATG pAaroL.UP1: …ATG Amino Acid: M ACA ACG ACG ACA ACA T pncB-lacZ fusions pApncB.WT: 5’…caggatactgcgcacctATG pApncB.DN1: …ATG pApncB.DN1a: …ATG pApncB.DN1b: …ATG pApncB.UP1: …ATG Amino Acid: M cysJ-lacZ fusions pAZcysJ.WT: 5’…caggaaacagccATG pAZcysJ.DN1: …ATG pAZcysJ.UP1: …ATG pAZcysJ.UP1a: …ATG pAZcysJ.UP1b: …ATG Amino Acid: M a CAA CAG CAA CAG CAA Q ACA ACG ACG ACA ACA T ACG ACG ACA ACA ACG T CCT CCT CCT CCT CCA P CAA CAG CAA CAG CAA Q ACA ACG ACA ACA ACA T TTC TTC TTC TTC TTC F CAG CAG CAA CAG CAA Q CTT CTT CTT CTT CTT P TTT TTT TTT TTT TTT F GCT GCT GCT GCT GCA A GTC GTC GTC GTC GTC V CTG…3’ CTG… CTG… CTG… CTG… P TCT CCT…3’ TCT CCT… TCT CCT… TCT CCT… TCA CCT… S P D E T P CCA…3’ CCA… CCA… CCA… CCA… P E C The pA-series of plasmid constructs contain the genes’ natural untranslated C A leader while the pAZ-series contain the lac untranslated leader. b Lower case letters represent a portion of the untranslated leader and is shown only for the wild type (WT) constructs, but is present also in each mutant below it. The lower case, underlined sequences indicate the presumed SD sequence. The upper case ATG identifies the translational start site for the aroL, pncB 623 and cysJ coding sequences. The single letter code corresponding to the 624 encoded amino acids is given below the sequence. The upper case, underlined, 625 bold-faced nucleotides indicate base substitutions within the coding region to 626 create putative “down” or “up” mutations. 30 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 585 586 587 588 589 590 591 592 593 594 595 596 597 598 599 600 601 602 603 604 605 606 607 608 609 610 611 612 613 614 615 616 617 Figure Legends 627 Figure 1. The effect of aroL, pncB and cysJ downstream adenines on expression in 629 vivo. β-galactosidase activities from aroL-, pncB- and cysJ-lacZ fusions are compared 630 between cells containing plasmids with or without mutations altering downstream 631 adenine content (see Table 1). A) Plasmid-free cells and cells containing plasmids with 632 the following aroL-lacZ fusions: pAaroL.WT, 100%; pAaroL.DN1, 28%; pAaroL.DN1a, 633 85%; pAaroL.DN1b, 56%; pAaroL.UP1, 138%. B) Plasmid-free cells and cells 634 containing plasmids with pncB-lacZ fusions: pApncB.WT, 100%; pApncB.DN1, 0.1%; 635 pApncB.DN1a, 11%; pApncB.DN1b, 8%; pApncB.UP1, 33%. C) Plasmid-free cells and 636 cells containing plasmids with cysJ-lacZ fusions: pAZcysJ.WT, 100%; pAZcysJ.DN1, 637 23%; pAZcysJ.Up1, 625%; pAZcysJ.UP1a, 274%; pAZcysJ.UP1b, 325%. 638 Figure 2. The effects of aroL, pncB and cysJ downstream adenines on ribosome 639 binding in vitro. Toeprint assays with aroL-, pncB- and cysJ-lacZ mRNAs and 30S 640 subunits compare ribosome binding to mRNAs with or without mutations altering 641 downstream adenine content (see Table 1). The closed arrows indicate the ternary 642 complex-dependent toeprint signals at position +16 relative to the A (+1) of the AUG 643 start codons that result from bound 30S subunits and the asterisk (*) in (B) indicates an 644 uncharacterized ternary complex-dependent signal within the aroL upstream UTR. 645 Bands observed in the absence of tRNA and ribosomes are artifacts likely due to 646 structures formed within the mRNA. A) Lanes: GATC, dideoxy DNA sequence ladder 647 for the pncB template, with the ATG initiation triplet boxed; 1-3, pncB-lacZ WT mRNA; 648 4-6, cysJ-lacZ WT mRNA; 7-9, aroL-lacZ WT mRNA. B) Lanes 1-6 utilize pncB-lacZ 649 mRNAs made from the following templates: (1-3) pApncB.WT, (4) pApncB.DN1b, (5) D E T P E C C A 31 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 628 pApncB.DN1, (6) pApncB.UP1. Lanes 7-11 utilize cysJ-lacZ mRNAs made from the 651 following templates: (7-9) pAZcysJ.WT, (10) pAZcysJ.DN1, (11) pAZcysJ.UP1. Lanes 652 12-16 utilize aroL-lacZ mRNAs made from the following templates: (12-14) 653 pAaroL.WT, (13) pAaroL.DN1, (14) pAaroL.UP1. 654 Figure 3. The effects of aroL downstream adenines on the rate and amount of 655 ternary complex formed. Toeprint and filter binding assays compare ternary complex 656 formation on aroL-lacZ mRNAs with or without mutations altering downstream adenine 657 content. A) In toeprint assays, 30S subunits were incubated with tRNAfMet and aroL-lacZ 658 mRNA for increasing times (minutes). Lanes 1-29 utilize aroL-lacZ mRNAs made from 659 the following templates: (1-9) pAaroL.WT, (10-19) pAaroL.DN1, (20-29) pAaroL.UP1. 660 The closed arrow indicates the position of the ternary complex-dependent toeprint signals 661 at +16 and the asterisk (*) indicates an uncharacterized ternary complex-dependent signal 662 within the upstream UTR. B) The relative toeprint complex (RTC) formed with time was 663 quantified and the data averaged from three independent assays are plotted: ν, aroL-lacZ 664 WT mRNA (pAaroL.WT); Ο, aroL-lacZ “down” mutant mRNA (pAaroL.DN1); ∆, aroL- 665 lacZ “up” mutant mRNA (pAaroL.UP1). C) In filter binding assays, 30S subunits were 666 incubated with tRNAfMet and aroL-lacZ mRNA for increasing time and the amount of 667 bound RNA, averaged from three independent assays, is plotted: ν, aroL-lacZ WT 668 mRNA (pAaroL.WT); Ο, aroL-lacZ “down” mutant mRNA (pAaroL.DN1); ∆, aroL-lacZ 669 “up” mutant mRNA (pAaroL.UP1). 670 Figure 4. The effects of pncB downstream adenines on the rate and amount of 671 ternary complex formed. Toeprint and filter binding assays compare ternary complex 672 formation on pncB-lacZ mRNAs with or without mutations altering downstream adenine D E T P E C C A 32 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 650 content. A) In toeprint assays, 30S subunits were incubated with tRNAfMet and pncB- 674 lacZ mRNA for increasing times (minutes). Lanes 1-16 utilize pncB-lacZ mRNAs made 675 from the following templates: (1-8) pApncB.WT, (9-16) pApncB.DN1b. The arrow 676 indicates the position of the ternary complex-dependent toeprint signals. B) The relative 677 toeprint complex (RTC) formed with time was quantified and the data averaged from 678 three independent assays are plotted: ν, pncB-lacZ WT mRNA (pApncB.WT); Ο, pncB- 679 lacZ “down” mutant mRNA (pApncB.DN1b). C) In filter binding assays, 30S subunits 680 were incubated with tRNAfMet and pncB-lacZ mRNA for increasing time and the amount 681 of bound RNA, averaged from three independent assays, is plotted: ν, pncB-lacZ WT 682 mRNA (pApncB.WT); Ο, pncB-lacZ “down” mutant mRNA (pApncB.DN1b). 683 Figure 5. The effects of cysJ downstream adenines on the rate and amount of 684 ternary complex formed. Toeprint and filter binding assays compare ternary complex 685 formation on cysJ-lacZ mRNAs with or without mutations altering downstream adenine 686 content. A) In toeprint assays, 30S subunits were incubated with tRNAfMet and cysJ-lacZ 687 mRNA for increasing times (minutes). Lanes 1-27 utilize cysJ-lacZ mRNAs made from 688 the following templates: (1-9) pAZcysJ.WT, (10-18) pAZcysJ.DN1, (19-27) 689 pAZcysJ.UP1. The arrow indicates the position of the ternary complex-dependent 690 toeprint signals. B) The relative toeprint complex (RTC) formed with time was 691 quantified and the data averaged from three independent assays are plotted: ν, cysJ-lacZ 692 WT mRNA (pAZcysJ.WT); Ο, cysJ-lacZ “down” mutant mRNA (pAZcysJ.DN1); ∆, 693 cysJ-lacZ “up” mutant mRNA (pAZcysJ.UP1). C) In filter binding assays, 30S subunits 694 were incubated with tRNAfMet and cysJ-lacZ mRNA for increasing time and the amount 695 of bound RNA, averaged from three independent assays, is plotted: ν, cysJ-lacZ WT D E T P E C C A 33 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 673 696 mRNA (pAZcysJ.WT); Ο, cysJ-lacZ “down” mutant mRNA (pAZcysJ.DN1); ∆, cysJ- 697 lacZ “up” mutant mRNA (pAZcysJ.UP1). T P E C C A 34 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 D E Acknowledgements 698 699 700 701 This work was supported by grant GM065120 from the National Institutes of Health. J.E.B. thanks the Miami University Graduate School for a Graduate Student Achievement Award and the Center for Bioinformatics and Functional Genomics for a 703 summer research fellowship. Special thanks to Eileen Bridge for helpful comments on 704 the manuscript, and Holly Rovito and Michael Day for inspiring discussions and 705 assistance with assay development. T P E C C A 35 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 D E 702 ys J.U P1 b 600 pA Zc 700 ys J.U P1 a C. cysJ D N 1 pA 1b 1a P1 N N pn cB .U pn cB .D pn cB .D E C pA pA pn cB . 0 pA Zc N 1 pn cB .W T 20 ys J.U P1 sJ .D pA pA 40 pA Zc Zc y ys J.W T RF S8 59 % Beta-Galactosidase Activity P1 1b N D 1a N 1 N ar oL .U pA ar oL . pA ar oL .D pA W T ar oL .D pA 85 9 ar oL . pA RF S % Beta-Galactosidase Activity 20 0 B. pncB 120 100 80 60 500 400 300 200 100 0 T P D E Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 pA pA Zc RF S8 59 C A % Beta-Galactosidase Activity Figure 1. A. aroL 160 140 120 100 80 60 40 Figure 2. A. pncB B. pncB cysJ aroL cysJ aroL D E * +16 E C → Lane C T AG mRNA tRNAfMet 30S → 12 ++ + - + 3 + + + 4 + + - 5 + + 6 + + + 7 + + - 8 + + C A 9 + + + Lane mRNA tRNAfMet 30S 37 1 + + - 2 + + 3 + + + 4 + + + 5 6 ++ ++ ++ 7 + + - 8 + + 9 10 11 12 13 14 15 16 + + + + + + + + + + + + - + + + + + + - + + + + \ Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 T P A T G Figure 3. A. WT DN1 UP1 T P E C → C A Lane mRNA tRNAfMet 30S Time B. 0.9 0.8 0.7 0.6 0.5 1 + + - 2 + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + + 9 + + + 60 60 0 5 10 15 20 40 60 10 + + - 11 + + 12 + + + 60 60 0 13 14 + + + + + + 15 + + + 16 + + + 5 10 15 15 17 + + + 18 + + + 19 + + + 20 21 + + + - + 20 40 60 60 60 22 + + + 0 23 24 25 26 + + + + + + + + + + + + 27 28 29 + + + + + + + + + 5 10 15 15 20 40 60 C. 0.2 0.18 0.16 0.14 0.12 0.1 0.4 0.08 0.3 0.06 0.2 0.04 0.1 0.02 0 0 0 10 20 30 40 50 60 70 0 Time (minutes) 100 200 300 400 Time (seconds 38 500 600 700 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 D E * Figure 4. A. WT DN1b B. 0.25 0.2 0.15 0.1 D E 0.05 0 0 10 20 30 40 50 60 70 500 600 700 T P Time (minutes) C. 0.16 0.14 E C 0.12 0.1 0.08 Lane mRNA tRNAfMet 30S 1 + + 2 + + - 3 + + + 4 + + + Time 60 60 0 5 5 + + + 6 + + + 7 + + + 8 + + + 9 10 11 12 + + + + - + + + + - + + 13 14 15 16 + + + + + + + + + + + + C A 10 20 40 60 60 60 0 5 10 20 40 60 0.06 0.04 0.02 0 0 39 100 200 300 400 Time (seconds) Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 → Figure 5. A. WT DN1 UP1 Lane mRNA tRNAfMet 30S Time B. 0.16 0.14 0.12 0.1 0.08 0.06 1 + + + 2 + + + 3 + + + 4 + + + 5 + + + 6 + + + 7 + + + 8 + + 9 + + - 0 5 10 15 20 40 60 60 60 10 + + + C A E C 0 T P 11 + + + 5 12 + + + 13 14 + + + + + + 15 16 + + + + + + 17 18 + + - + + - 10 15 20 40 60 60 60 19 + + + 20 + + + 0 21 22 + + + + + + 23 24 25 + + + + + + + + + 26 27 + + - + + - 5 10 15 20 40 60 60 60 C. 0.12 0.1 0.08 0.06 0.04 0.04 0.02 0.02 0 0 0 10 20 30 40 50 60 70 0 Time (minutes) 100 200 300 400 Time (secon 40 500 600 700 Downloaded from jb.asm.org at Penn State Univ on February 8, 2008 D E →
© Copyright 2026 Paperzz