MIAMI UNIVERSITY The Graduate School Certificate for Approving the Dissertation We hereby approve the Dissertation of Heather Joann Beck Candidate for the Degree DOCTOR OF PHILOSOPHY ______________________________________ Mitch Balish, Director ______________________________________ Eileen Bridge, Reader ______________________________________ DJ Ferguson, Reader ______________________________________ Xiao-Wen Cheng, Reader ______________________________________ Jack Vaughn, Graduate School Representative ABSTRACT Roles of Escherichia coli 5’-terminal AUG triplets in translation initiation and regulation by Heather J. Beck The process of translation initiation involving the purine-rich Shine-Dalgarno sequence in bacteria has been well studied. However, approximately one-half of the mRNAs across prokaryotic genomes do not contain a Shine-Dalgarno sequence. Therefore, it is important to understand the mechanism and ribosome recognition signals involved in the Shine-Dalgarno-independent translation initiation, which is yet to be elucidated. To do this, leaderless mRNAs lacking a 5’-UTR are utilized to examine the roles of 5’-terminal AUGs in ribosome recognition. In this study, we identified AUGs at the 5’-terminus of canonical Shine-Dalgarno-led mRNAs and examined their ability to act as initiation codons for leaderless mRNA translation. The identified 5’-terminal AUGs were found to specify open reading frames that were translated at varying levels of expression, with some playing a role in downstream coding sequence regulation. Further investigation into one 5’-terminal AUG-led canonical mRNA, ptrB, identified a novel regulation mechanism that is distinct from the Shine-Dalgarno mechanism. This novel mechanism is instead controlled by the 5’-terminal AUG and can regulate in a gene context independent manner making it ideal for genetic engineering purposes. To further understand how the 5’-AUG is recognized and bound by ribosomes, a cross-linking method was developed to identify the initial interactions between leaderless mRNAs and the ribosome. Overall, this study provides insight into 5’-terminal AUG-directed translation and the role of 5’-AUGs in ribosomal recognition, binding and translational regulation. Roles of Escherichia coli 5’-terminal AUG triplets in translation initiation and regulation A DISSERTATION Presented to the Faculty of Miami University in partial fulfillment of the requirements for the degree of Doctor of Philosophy Department of Microbiology by Heather J. Beck The Graduate School Miami University Oxford, Ohio 2016 Dissertation Director: Mitch Balish © Heather J Beck 2016 TABLE OF CONTENTS General Introduction ........................................................................................................... 1 Chapter 1 ........................................................................................................................... 14 Abstract ......................................................................................................................... 15 Introduction ................................................................................................................... 16 Materials and Methods .................................................................................................. 18 Bacterial strains ......................................................................................................... 18 Reagents..................................................................................................................... 18 Construction of lacZ fusions...................................................................................... 18 β-galactosidase assay ................................................................................................. 18 Preparation of in vitro synthesized transcripts .......................................................... 18 Ribosome isolation .................................................................................................... 19 Primer extension inhibition (toeprint) assay .............................................................. 19 Results ........................................................................................................................... 20 5’-uORFs support translation .................................................................................... 20 5’-uAUGs bind 70S ribosomes.................................................................................. 35 5’-uAUGs can influence downstream gene expression ............................................. 46 Discussion ..................................................................................................................... 50 Chapter 2 ........................................................................................................................... 53 Abstract ......................................................................................................................... 54 Introduction ................................................................................................................... 55 Materials and Methods .................................................................................................. 60 Bacterial strains and reagents .................................................................................... 60 Recombinant DNA procedures .................................................................................. 60 Construction of DNA molecules with randomized sequence regions .................. 60 Gene fusions.......................................................................................................... 61 Preparation of competent cells .................................................................................. 61 Transformations ......................................................................................................... 62 β-galactosidase assay ................................................................................................. 62 In vitro synthesis of RNA .......................................................................................... 62 Primer extension inhibition (toeprint) assay .............................................................. 62 iii Results ........................................................................................................................... 64 ptrB 5’-uAUG as a ribosome binding signal ............................................................. 64 Predicted SD sequence has unexpected effects ......................................................... 64 Ribosome binding signals present within the ptrB 5’-UTR ...................................... 72 Spacing optimized between the 5’-uAUG and downstream RBS ............................. 78 Expression of the uORF negatively impacts downstream CDS expression .............. 81 Transplantability of regulation by the 5’-UTR of ptrB ............................................. 82 Initiation signals present within a bona fide CDS ..................................................... 87 Discussion ..................................................................................................................... 93 Appendix A ....................................................................................................................... 99 Concluding Remarks ....................................................................................................... 109 References ....................................................................................................................... 117 iv LIST OF TABLES Table 1-1. RegulonDB identified genes............................................................................ 21 Table 1-2. Selected genes examined that contain an AUG triplet at the 5’-terminus of its transcribed canonical mRNA. ........................................................................................... 29 Table 1-3. Translation from the 5’-uORF as a percent of translation relative to its downstream canonical CDS as determined by β-galactosidase assays performed in triplicate. ........................................................................................................................... 36 Table 1-4. Expression levels from downstream CDS fusions to lacZ that contain an intact 5’-uAUG (CDS), a mutated 5’-uAUC (KO), or an intact 5’-AUG with a premature stop codon (StartStop) (see Figure 1-1) including the standard deviation from triplicate cultures. ............................................................................................................................. 41 Table 1-5. Sequences of mRNAs tested ........................................................................... 44 Table 2-1. DNA sequences of gene fragments, with and without mutations, from ptrB.. 65 Table 2-2. ptrB mutations expressed as a percentage of either the 5’-uAUG in-frame leader fusion with lacZ (100%) (column 2) or the CDS fused with lacZ (100%) (column 3). ...................................................................................................................................... 82 v LIST OF FIGURES Figure I-1: Structural representation of the bacterial ribosome. ......................................... 2 Figure 1-1. Schematic of lacZ fusions constructed for each gene tested .......................... 31 Figure 1-2. 5’uAUGs support varying levels of translation compared to known leaderless mRNA in E. coli. .............................................................................................................. 33 Figure 1-3. 5’-uAUGs bind 70S ribosomes following the proposed leaderless mRNA initiation mechanism ......................................................................................................... 38 Figure 1-4. Loss of 30S subunit binding to CDS start codon as a result of the 5’-uAUG mutation. ........................................................................................................................... 48 Figure 2-1. The ptrB gene sequence. ................................................................................ 58 Figure 2-2. Role of SD sequence in ptrB regulation ........................................................ 69 Figure 2-3. ptrB mRNA structure may play a secondary role in expression .................... 73 Figure 2-4. Scanning mutagenesis of ptrB 5’-UTR .......................................................... 75 Figure 2-5. 5’-UTR mutations and deletions to affect spacing between 5’-uAUG and downstream RBS. ............................................................................................................. 79 Figure 2-6. Transplantability of regulation by the 5’UTR of ptrB. .................................. 85 Figure 2-7. ptrB 5’-UTR insufficient to stimulate expression of internal RNA fragments. ........................................................................................................................................... 88 Figure 2-8. Expression regulated by both upstream and downstream signals .................. 91 Figure A-1. Schematic of S1 domains and their amino acid positions in the S1 r-protein. ......................................................................................................................................... 101 Figure A-2. Cross-linking of 4S-U cI RNA to purified S1 protein ................................ 103 Figure A-3. Cross-linked positions on S1 ribbon structure. ........................................... 106 vi DEDICATION I would like to dedicate this dissertation to my late mentor, Dr. Gary R. Janssen. Calling him my mentor does not do him justice. He was also my trusted colleague, my source of constant encouragement and inspiration, my idol, but most importantly, my beloved friend. I feel blessed to have learned how to be a researcher and a scientist from him. He has made such an impact on me and will continue to influence my future. I will always aspire to be the scientist, the teacher and the friend that he was. The man, the beard, the legend. He was truly inspiring. vii ACKNOWLEDGEMENTS First, I would like to thank my lab mates, Racheal Devine, Sarah Steimer, and Jackie Giliberti for their continued guidance and assistance in the lab. To Racheal for teaching me how to do most everything in lab, being patient with me and for being such a dear friend. To Sarah, without whom I would have never made it through the constant struggles we faced together in the worst year of my life. Thank you for the very bottom of my heart for all the laughs and acting as my sounding board while writing even after you graduated. To Jackie, for also teaching me so much my first year in lab and trusting me to take over part of her project and for also being such a giant help in editing my dissertation. I would also like to thank the wonderful undergrads I’ve worked with in the lab, particularly Leah Carter and Brendan Radel. You have both become very close friends of mine and I so enjoyed watching you grow in your confidence in the lab. Next, I would like to thank my committee who all stepped in to help guide me in the last year of my graduate career. To Dr. Bridge and Dr. Ferguson who graciously read and edited my dissertation and for their help throughout the years in making my project better. To Dr. Vaughn who has also helped in the development of my project throughout the years. To Dr. Cheng who agreed to join my committee without much notice. Also to Dr. Morgan-Kiss who was also a very supportive member of my committee, even if it was for just a short time. Finally, to Dr. Balish who has become my new advisor. I know how much work it was to edit my dissertation and I can never thank you enough for agreeing to help me after Gary’s passing. Your guidance in this endeavor and throughout the years has been enormously helpful. I would also like to thank the rest of the microbiology department at Miami University, both the faculty and the graduate students who have been supportive over the years. I have made such wonderful life-long friends throughout the years, without whom, I likely would have never made it this far. A special thank you to Ryan R., Bill P., Dan Z., Liz F., Steve F., Jenna D., Ryann B., Steve, D., and Tzvia S. Also to our favorite Canadian, Chris Sedlacek. viii I would like to thank my Janssen lab family. Attending the reunions over the years has allowed me to meet so many previous graduate students who have been brought together by Dr. Gary Janssen. They are some of the most kind, approachable and helpful people I’ve ever met and they so easily welcomed me in as part of the family and have been there for me, especially after Gary’s passing. In addition to the graduate students, Kob, Leah and Ezra Janssen: thank you for being so kind and helpful to me. Lastly, I would like to thank my family, whose unwavering support and guidance have made me the person I am today. Thank you all. ix General Introduction Leaderless mRNAs utilize a 5’-terminal AUG as the signal for translation initiation. However, the exact mechanism for ribosome recognition and binding of these non-canonical mRNAs is unknown. This work seeks to examine the 5’-terminal AUG as a ribosome recognition signal to determine how leaderless mRNAs initially interact with the ribosome and if the 5’-terminal AUG can be used as a signal for canonical Shine-Dalgarno (SD) - led mRNAs to influence translation. Translation is the process of protein synthesis via the ribosome, a macromolecular ribonucleoprotein complex with enzymatic activity to catalyze peptide bond formation. The ribosome has the ability to decode the template mRNA to synthesize a polypeptide product. Due to the lack of a nuclear membrane in prokaryotes, translation is tightly coupled with transcription in the cell, so much so that translation can occur before transcription is terminated. Translation occurs in four steps: initiation, elongation, termination, and ribosome recycling. Translation initiation is the rate-limiting step in protein synthesis, occurring on the order of seconds (Laursen et al., 2005). During initiation, the ribosome complex consisting of a large and small subunit is assembled on the mRNA to be translated. It then proceeds to elongation, a much faster process, synthesizing polypeptides at 12 amino acids per second in Escherichia coli (Kennell and Riezman, 1977). The mRNA is decoded within the ribosome complex three nucleotides at a time with each triplet pairing with anticodons of its cognate tRNA until the ribosome reaches a stop codon. The stop codon signals the termination step in which the ribosome complex dissociates from the mRNA, the polypeptide is released, and the subunits are recycled. The Ribosome The bacterial ribosome consists of two subunits, 30S and 50S, which join together to form a 70S ribosome that catalyzes protein synthesis. The small 30S ribosomal subunit contains 21 distinct proteins and one rRNA (16S) and has a mass of 0.8 MDa (Laursen et al., 2005). This subunit is traditionally divided into an upper third, called the head, connected by the neck to the body and platform with a protrusion called the spur (Figure I-1A) (Laursen et al., 2005). The 30S subunit contains the decoding center, positioned at the upper part of the body and lower part of the head, made entirely of RNA, which controls translational fidelity by monitoring base pairing 1 Figure I-1: Structural representation of the bacterial ribosome. Adapted from Simonetti et al., 2009. (A) Diagram of 30S ribosomal subunit (brown) with view from above and the different regions indicated. The mRNA binding platform is highlighted in pink/red and the position of the anti-Shine-Dalgarno sequence (aSD) of the 16S rRNA is in cyan. (B) The 30S subunit with the initial mRNA interaction: docking onto the platform and binding the aSD before being loaded into the mRNA channel. Also included is a view of the ribbon structure of the 30S subunit preinitiation complex with an mRNA bound from a side vantage point. (C) Diagram of the large 50S ribosomal subunit outlining the three protuberances. (D) Depiction of the 50S subunit (green) bound to the 30S subunit (brown) forming the 70S initiation complex with the mRNA (purple) loaded into the mRNA channel and interacting with the initiator tRNA (dark red). 2 3 between each mRNA codon and its cognate tRNA (Green and Noller, 1997). Cognate tRNAs have lower dissociation rates from the ribosome and, when bound, cause an induced-fit conformational change of the ribosome (Ogle et al., 2001). This process allows for tightly controlled decoding with an error rate of tRNA selection in E. coli from 10-3 to 10-4 (Laursen et al., 2005). The 30S subunit also contains three tRNA binding sites, the acceptor (A), peptidyl (P), and exit (E) sites. The functions of these sites will be discussed below. The large 50S ribosomal subunit contains 34 distinct proteins and two rRNAs (5S and 23S) with a total mass of 1.5 MDa (Laursen et al., 2005). The 50S subunit is more compact than the 30S subunit, consisting of a rounded base with three protuberances (L1 stalk, central protuberance and L7/L12 stalk Figure I-1C) (Wilson and Nierhaus, 2003). The 50S subunit contains the catalytic peptidyltransferase center (PTC) where formation of the peptide bond occurs. The PTC consists primarily of RNA with no proteins within 18Å of the catalytic site (Nissen et al., 2000). A tunnel starts at the PTC and the nascent polypeptide exits through this tunnel, which is approximately 100 Å long with an average diameter of 15 Å (Nissen et al., 2000). The process of translation initiation For polypeptide formation to occur, a charged cognate tRNA (with the exception of the initiator tRNA which binds the P-site) must bind to the A-site (Laursen et al., 2005). The αamino group of the tRNA in the A-site attacks the carbonyl group of the P-site peptidyl group tRNA to form a peptide bond (Laursen et al., 2005). The now deacylated tRNA in the P-site moves to the E-site, which is specific for deacylated tRNAs, for ejection (Schmeing et al., 2003). The A-site peptidyl-tRNA moves to the P-site and a new cognate aminoacyl-tRNA binds in the A-site (Laursen et al., 2005). In the conventional pathway of initiation, a ternary complex is formed from the 30S ribosomal subunit, the messenger RNA (mRNA), and the initiator tRNA. 30S ribosomal subunits bind the purine-rich SD sequence of the mRNA via complementary pairing to the anti-Shine-Dalgarno (aSD) sequence of the 16S rRNA of the 30S subunit, thereby forming a binary complex (Shine and Dalgarno, 1974; Yusupova et al., 2001; Benelli et al., 2003). This initial interaction creates an unstable 30S pre-initiation complex which positions the mRNA on the ribosomal platform (Marzi et al., 2007). In the pre-initiation complex, the 30S subunit platform proteins S2, S7, S11, and S18 are near and may contact the mRNA, 4 suggesting a role for these proteins in docking the mRNA on the platform (Simonetti et al., 2009). Once bound to the platform, the mRNA is then adjusted into the mRNA tunnel of the 30S subunit. This may require unwinding the mRNA, a process for which several platform proteins (S2, S7, S11, S18, S21) as well as S1 with its helicase properties may be responsible, to promote movement into the tunnel and proper codon-anticodon pairing between the mRNA and the tRNA (Simonetti et al., 2009). The initiator tRNA binds to form the ternary complex, which is stabilized by the SD-aSD interaction and leads to proper placement of the start codon in the Psite of the ribosome (Ringquist et al., 1993; Wimberly et al., 2000). Final adjustments are then made in the decoding center after which translation proceeds into elongation. Three additional proteins, initiation factor 1 (IF1), initiation factor 2 (IF2), and initiation factor 3 (IF3) are also involved in translation initiation. These factors exert their effects by binding within the tRNA binding sites on the 30S subunit (Ramakrishnan, 2002). IF1 binds to the A-site of the 30S ribosomal subunit to block tRNA binding to the A-site (Ramakrishnan, 2002), directing the initiator tRNA to the P-site (Laursen et al., 2005). IF1 also increases the affinity of the 30S subunit for IF2 and IF3 and stimulates their activity (Gualerzi and Pon, 1990). IF3 binds the E-site on the interface side of the platform and neck of the 30S subunit to directly prevent subunit association, thereby also playing a role in ribosomal recycling (McCutcheon et al., 1999; Ramakrishnan, 2002). IF1 and IF3 are ejected from the 30S subunit to allow for association of the 50S subunit, which is stimulated by IF2 (Laursen et al., 2005). IF2 is thought to bind over the A-site to interact with IF1 while also binding the aminoacyl end of initiator tRNA in the P-site for proper positioning (Ramakrishnan, 2002). Once the initiator tRNA is properly positioned in the P-site, the GTP bound to IF2 is hydrolyzed and IF2 is released (Laursen et al., 2005). This process results in the ternary complex formation of mRNA, initiator tRNA and 70S ribosome, with the initiator tRNA placed in the P-site codon-anticodon pairing with the mRNA start codon. Factors involved in the regulation of translation initiation The variable element in translation initiation is the mRNA, which can differ in sequence and structural components. Most mRNAs have a 5’-untranslated region (5’-UTR) upstream of their initiation codon that contains ribosomal recognition features that influence the mRNA’s translation efficiency. The start codon itself can also influence translation efficiency. Typically, 5 an AUG codon is most efficient for initiation and is the most prevalent start codon in E. coli (90%) (Schneider et al., 1986). However, other codons also act as start codons in E. coli, with GUG used in 8% of mRNAs, UUG in 1% of mRNAs (Schneider et al., 1986), and AUU used in two mRNAs in E. coli (infC and pcnB) (Binns and Masters, 2002). The sequences flanking the start codon comprise the ribosome binding site, which typically extends approximately 30 nucleotides (Laursen et al., 2005). This region can include the SD sequence, the spacer region between the SD sequence and the start codon (7±2 nucleotides) and the start codon (Laursen et al., 2005). The ribosome can recognize the binding site through mRNA sequence and/or secondary structural features. Multiple sequence features, both upstream and downstream of the start codon, have been implicated in influencing ribosome binding and therefore translation efficiency. The “cumulative specificity initiation mechanism” theory suggests that both upstream and downstream sequence elements work cooperatively, yet independently to ensure no singular interaction is essential, to select the translation initiation site (Nakamoto, 2011). The sequence elements that influence expression and may therefore be part of this theory include, but are not limited to, the SD sequence, CA repeats, AU richness, and enhancer elements. The SD sequence directly interacts with the aSD sequence at the 3’end of the 16S rRNA (Shine and Dalgarno, 1974). This interaction is thought to contribute to ribosome binding and proper placement of the start codon in the P-site of the ribosome (Shine and Dalgarno, 1974). AU-rich sequences stimulate translation when located upstream or downstream of a start codon (Dreyfus, 1988; Qing et al., 2003) through an interaction with ribosomal protein S1 (Komarova et al., 2005). It has been hypothesized that the AU richness prevents the formation of secondary structure around the start codon allowing for easier ribosomal entry and binding to the mRNA (Kaberdin and Blasi, 2006). Downstream adenine-rich sequence motifs have been implicated as general enhancers of E. coli translation (Brock et al., 2007). CA multimers introduced downstream of the start codon increased expression independent of translation signals contained within the untranslated leader region by increasing the ribosome binding strength (Martin-Farmer and Janssen, 1999). The impact of these sequence features on translation demonstrate the importance of the mRNA’s primary sequence in translation efficiency. 6 Apart from sequence features, mRNA secondary structure can play roles in translation initiation. In general, ribosome binding requires local single-stranded structure for proper interaction (Draper, 1987; Draper et al., 1998; Gualerzi and Pon, 1990). Therefore, secondary structure can occlude regions of mRNA from ribosome binding and can inhibit translation. Since RNAs are dynamic, they can be highly influenced by their environment and can alternate between various secondary structures with different thermodynamic stabilities. Changes in temperature can result in changes in RNA secondary structure, a property utilized by bacteria as a mechanism of regulation (Klinkert and Narberhaus, 2009). Increase in temperature destabilizes secondary structure that would otherwise occlude the translation initiation region, thereby allowing for ribosome binding (Kaberdin and Blasi, 2006). Conversely, some mRNAs are upregulated during a decrease in temperature due to structural changes that shift the site of a hairpin loop and as a result eliminate the secondary structure at the translation initiation region (Thieringer et al., 1998; Gualerzi et al., 2003; Kaberdin and Blasi, 2006). In addition to secondary structure, trans-acting elements also affect translation. There are many examples of proteins binding to translational operators to repress translation (Salavati and Oliver, 1995; Kozak, 1999; Kaberdin and Blasi, 2006). This mechanism is especially effective in operons in which downstream translation is coupled to and therefore dependent upon translation of the preceding region. In this way, translation of multiple ORFs can be repressed by a single binding event. Examples of this type of regulation include the α operon and the spc operon of E. coli, both of which encode ribosomal proteins and are repressed by binding of a ribosomal protein to the operon, thereby self-regulating expression (Singer and Nomura, 1985; Kaberdin and Blasi, 2006). It has also been hypothesized that ribosome binding can indirectly increase expression by protecting the mRNA from RNase cleavage by competing for binding sites (Kaberdin and Blasi, 2006). Low-molecular weight effectors such as amino acids, coenzymes, and vitamins also bind to structured noncoding segments of mRNA, known as riboswitches, to control expression without the assistance of protein factors (Tucker and Breaker, 2005; Winkler and Breaker, 2005). Riboswitches are able to recognize the effectors and fold into an alternative structure to regulate the activity of the mRNA by opening or sequestering the RBS (Breaker, 2012). 7 Translation can also be regulated by binding of small noncoding RNAs termed antisense RNAs. Antisense RNAs typically base-pair to complimentary sequences within the translation initiation region of their target mRNAs to block ribosome binding and lead to degradation (Gottesman, 2005). Trans-encoded antisense RNAs are less specific and can affect multiple targets for widespread regulation but require the chaperone protein Hfq to exert their effects (Wagner et al., 2002). Antisense RNAs can have positive or negative effects on the translation initiation of their target mRNAs by either blocking ribosome binding or causing structural rearrangements that make the ribosome binding site more accessible (Kaberdin and Blasi, 2006). The expendable role of the Shine-Dalgarno sequence The SD sequence can be important in both ribosomal recognition and proper spatial mRNA binding. The mechanism for mRNA-ribosome binding via the SD-aSD interaction has been well studied and is well understood (Laursen et al., 2005 and references therein). However, the SD-aSD interaction is not the sole determinant of translational efficiency or selection of a translational start site (Melancon et al., 1990; Calogero et al., 1988; Benelli et al., 2003). Initiation complex formation can be achieved in the absence of a SD sequence but requires ribosomal protein S1 (Tzareva et al., 1994). Often translation initiation regions are characterized by signals other than the SD sequence, especially in translationally coupled open reading frames that are more highly dependent on the AUG start codon for recognition of the ribosome binding site (Andre et al., 2000). In multiple cases, deletion of the entire 5’-UTR leads to higher levels of expression than mutation of only the SD sequence (Van Etten and Janssen, 1998; Condo et al., 1999; Benelli et al., 2003), suggesting that mRNAs absent of a 5’-UTR are more efficiently expressed that those with absent SD sequences. Although the SD site may be important in proper positioning of the start codon in the P-site, it has been suggested that there is an initial SD-independent binding interaction (de Smit and van Duin, 2003). This suggestion is based on evidence that the rate constant for association of mRNA and the 30S subunit is independent of SD-aSD interaction strength. An increasing number of non-SD led genes (i.e., those with no recognizable SD sequence within the untranslated leader) as well as genes lacking a 5’-UTR altogether, thereby constituting leaderless mRNA, have been identified (Chang, 2006). A bioinformatic study of 162 prokaryotic genomes concluded that the number of SD-led genes is equal to the number of non-SD led genes 8 (Chang, 2006). Although an SD sequence is not required for initiation, it is unknown how the ribosome is recruited to non-SD led mRNAs. Especially in the case of leaderless mRNA, lacking both a 5’-UTR and SD sequence, the ribosome binding signals and mechanism of start codon positioning in the P-site have yet to be identified. Leaderless mRNA The study of leaderless mRNA has expanded our knowledge regarding the evolution of translation. Leaderless mRNAs have been identified in every domain of life and are proposed to be ancestral to all mRNAs (Grill et al., 2000). Translation factors from all three kingdoms share a significant degree of homology, suggesting that current components might have been present at the universal ancestor stage (Moll et al., 2002). Regardless of the origin of the leaderless mRNA, it can be translated by the translational machinery from any of the three domains (Grill et al., 2000). In contrast, leadered bacterial mRNAs are poorly translated in archaeal and eukaryotic cell-free translational assays (Grill et al., 2000). Leaderless mRNA are more frequent in Gram-positive bacteria than Gram-negative bacteria but have been identified in E. coli, Caulobacter crescentus, and Thermus thermophilus and have been predicted to be underrepresented due to the difficulties in identifying leaderless mRNAs bioinformatically (Moll et al., 2002). Leaderless mRNAs have been identified in many Gram-positive genera including Streptococcus, Lactococcus, Streptomyces, and Corynebacterium (Janssen, 1993) and appear to be quite common in archaea (Benelli et al., 2003, 2009; Brenneis and Soppa, 2009). The trend in archaea suggests that monocistronic transcripts and genes located at the 5’-proximal end of operons are leaderless, but internal genes in operons are preceded by SD sequences (Tolstrup et al., 2000; Slupska et al., 2001; Benelli et al., 2003). Distinct SD sequence-dependent and leaderless mechanisms for ribosome/mRNA interactions have been demonstrated in archaeal translation initiation (Benelli et al., 2003), supported by the prevalence of both SD sequence-led and leaderless mRNAs in archaea. In bacteria, leaderless mRNAs appear to undergo translation initiation through a novel pathway initially binding a 70S ribosome (Balakin et al., 1992; Udagawa et al., 2004; Moll et al., 2004). This binding is dependent on an AUG initiation codon and is stabilized by initiator tRNA in E. coli (Van Etten and Janssen, 1998; O’Donnell and Janssen, 2002; Brock et al., 2008). This binding is also positively influenced by IF2 through stabilization of fMet-tRNA binding, and 9 antagonized by IF3, which promotes dissociation of 70S ribosomes (Tedin et al., 1999; Grill et al., 2000, 2001). However, ribosome binding to leaderless mRNAs (Balakin et al., 1992) and progression from initiation into the elongation phase of translation occur in the absence of all IFs (Udagawa et al., 2005). Identification of a ribosomal particle deficient in up to 11 small subunit ribosomal proteins (r-proteins) that retains the ability to exclusively translate leaderless mRNA has provided further insight into the mechanism of leaderless mRNA translation and ribosomal scaffolding (Kaberdina et al., 2009). This r-protein-deficient ribosome, termed 61S, forms in the presence of the antibiotic kasugamycin (Ksg), which inhibits initiation complex formation by binding the mRNA track at the P- and E-site codons, thereby indirectly dissociating the P-site bound fMet-tRNAfMet from 30S subunits (Schulenzen et al., 2006; Schurwirth et al., 2006). After treatment with Ksg, translation of leadered mRNAs is inhibited but leaderless mRNA translation is still fully functional in vivo and in vitro (Kaberdina et al., 2009). The r-proteins absent in the 61S ribosome cluster in regions associated with the mRNA’s path through the ribosome, including regions associated with the aSD sequence (Wilson, 2009). Therefore, these regions might be dispensable for mRNAs lacking the SD sequence and 5’-UTR altogether. Although the active centers of the ribosome are primarily constructed of rRNA, certain r-proteins are necessary to support the rRNA structure (Wilson and Nierhaus, 2005). The case of the 61S ribosome translating leaderless mRNAs highlights the existence of a minimal set of r-proteins necessary for rRNA stabilization and active translation. The 61S ribosome also has evolutionary implications, suggesting that an ancestral form of the small subunit with the proteins deficient from the 61S may have been added later in history (Kaberdina et al., 2009). The 61S can also provide insight into the mechanism of ribosome binding as well as mRNA: ribosome contacts and mRNA movement through the ribosome. The set of ribosomal contacts for canonical leadered mRNA has been established (La Teana et al., 1995; Simonetti et al., 2009). Crosslink studies using E. coli 30S subunits have shown that the mRNA’s +2 position contacts the 16S rRNA at the 1530 position with the mRNA’s +3 position contacting the ribosomal platform proteins S18 and S21 (La Teana et al., 1995). The crosslinks shift upon addition of IFs, with the 16S rRNA’s 1530 position instead contacting the mRNA’s -3 position and a new crosslink forming between the mRNA’s +2 and -3 10 positions and the ribosomal head protein S7 (La Teana et al., 1995). These results suggest that the ribosome shifts in the 5’ direction on the mRNA, situating the mRNA closer to the P-site in the presence of IFs for canonical leadered mRNAs. Examination of 5’-terminal AUGs This study aims to provide insight into the mechanism of 5’-AUG recognition, ribosome binding, and translation initiation of leaderless mRNA. Since the 5’-AUG has been implicated as the signal for ribosome binding in leaderless mRNA, I hypothesize that an AUG triplet at the 5’-terminus of canonical SD-led mRNAs will also act as a signal for binding and as an initiation codon for upstream open reading frame translation. I will then investigate the 5’-AUGs’ translational efficiency and regulatory effects. Lastly, to provide insight into the initial interactions between the ribosome and the 5’-AUG, I will identify contact sites between ribosomal protein S1 and leaderless mRNA through development of a novel method that utilizes photochemical cross-linking and coupled mass spectrometry analysis. The importance of a 5’-terminal AUG start codon has been demonstrated in leaderless and “unleadered” (i.e., after removal of the leader sequence) mRNAs through the introduction of non-cognate initiation codons, resulting in reduced ribosome binding and translational efficiency (Van Etten and Janssen, 1998; O’Donnell and Janssen, 2002). However, the effect observed due to the use of non-cognate initiation codons is specific to leaderless mRNAs and does not occur in leadered mRNA (O’Donnell and Janssen, 2002). Addition of a 5’-terminal AUG to an internal segment of lacZ RNA makes it competent to form ternary complexes with 70S ribosomes and tRNA (Brock et al., 2008), suggesting that a 5’-terminal AUG is a sufficient signal for ribosomes to identify a leaderless mRNA. Also, a naturally occurring leaderless mRNA with a deletion of 30 nucleotides from its 5’-terminus cannot bind ribosomes; however, addition of a 5’-terminal AUG triplet to the truncated leaderless mRNA restores 70S ribosome binding and enables it to compete successfully with naturally leaderless mRNA for ribosome binding in vitro (Brock et al., 2008). These data illustrate that the presence of a 5’-terminal AUG is sufficient for an RNA molecule to bind ribosomes and be translated as a leaderless mRNA and identified the 5’terminal AUG as a critical signal for recognition by a ribosome. In the present study, I examined not only naturally occurring leaderless mRNAs but also 5’-terminal AUGs at the end of 11 canonical SD-led mRNAs, to further the understanding of leaderless mRNA prevalence and the mechanism of translation initiation utilized by leaderless mRNA. Because 5’-AUGs are signals for ribosomal recognition, 5’-AUGs at the end of canonical SD-led mRNAs may also bind ribosomes. In this work I searched for these mRNAs in the E. coli genome and sought to determine if the newly discovered 5’-terminal AUGs are efficiently translated in a manner similar to leaderless mRNAs. I hypothesize that the 5’-terminal AUGs present on canonical mRNAs are bound by 70S ribosomes and translated to varying degrees. These 5’-terminal AUG-specified ORFs may exhibit a variety of functions, such as acting as binding signals, exerting regulatory functions, or coding for small peptides. I further examined one identified 5’-terminal AUG-led canonical mRNA, ptrB, to determine its ability to exert regulatory functions on the downstream coding sequence. I hypothesize that the 5’-terminal AUG, in addition to the predicted SD sequence, acts as a regulatory signal to control downstream translation in a novel fashion. This study also aimed to explore how leaderless mRNAs interact with the translational machinery. Although contacts between the ribosome and mRNA are identified for canonical mRNAs, it is unclear how leaderless mRNA are initially bound by ribosomes. The contacts between specific ribosomal proteins and leaderless mRNA were investigated using 4-thiouridine (4S-U) photochemical cross-linking. Previous cross-linking studies done with leaderless mRNA showed association with several ribosomal proteins (S1, S3, S4, S7, and S10/S18) even in the absence of initiator tRNA (Brock et al., 2008). To expand on this research, I cross-linked purified ribosomal protein S1 to 4S-U-tagged, naturally occurring cI leaderless mRNA to identify specific sites on the protein that contact the mRNA’s 5’-AUG. This study will help to expand the knowledge on the alternative mechanism used by leaderless mRNAs for translation initiation. Variations in conventional translation initiation mechanisms exist in both prokaryotes and eukaryotes (Malys and McCarthy, 2010). It is important to understand these deviations from traditional initiation and the signals used for ribosome recognition and binding because they represent a significant proportion of all mRNAs (Chang et al., 2006). My research focuses on how 5’-AUGs are recognized by ribosomes to initiate translation and the roles of 5’-AUGs in regulation in E. coli. Due to the ancestral nature of leaderless mRNAs, it is likely that understanding leaderless mRNA initiation mechanisms will provide insight into universal 12 principles of translation initiation in all organisms. Identification of ribosomal recognition and binding signals could also be used in genetic engineering and designing expression vectors to fine-tune translational expression levels. 13 Chapter 1 5’-terminal AUGs in Escherichia coli mRNAs with Shine-Dalgarno sequences: identification and analysis of their roles in non-canonical translation initiation Heather J. Beck1*, Ian M. C. Fleming1#a, Gary R. Janssen† 1 Department of Microbiology, Miami University, Oxford, Ohio, United States of America #a Guild BioSciences, Dublin, Ohio, United States of America * Corresponding author Email: [email protected] (HJB) Published in PLoS One 14 Abstract Analysis of the Escherichia coli transcriptome identified a unique subset of messenger RNAs (mRNAs) that contain a conventional untranslated leader and Shine-Dalgarno (SD) sequence upstream of the gene’s start codon while also containing an AUG triplet at the mRNA’s 5’- terminus (5’-uAUG). Fusion of the coding sequence specified by the 5’-terminal putative AUG start codon to a lacZ reporter gene, as well as primer extension inhibition assays, reveal that the majority of the 5’-terminal upstream open reading frames (5’-uORFs) tested support some level of lacZ translation, indicating that these mRNAs can function both as leaderless and canonical SD-leadered mRNAs. Although some of the uORFs were expressed at low levels, others were expressed at levels close to that of the respective downstream genes and as high as the naturally leaderless cI mRNA of bacteriophage λ. These 5’- terminal uORFs potentially encode peptides of varying lengths, but their functions, if any, are unknown. In an effort to determine whether expression from the 5’-terminal uORFs impacts expression of the immediately downstream cistron, we examined expression from the downstream coding sequence after mutations were introduced that inhibit efficient 5’-uORF translation. These mutations were found to affect expression from the downstream cistrons to varying degrees, suggesting that some 5’-uORFs may play roles in downstream regulation. Since the 5’-uAUGs found on these conventionally leadered mRNAs can function to bind ribosomes and initiate translation, this indicates that canonical mRNAs containing 5’-uAUGs should be examined for their potential to function also as leaderless mRNAs. 15 Introduction Translation initiation is the rate-limiting step in protein synthesis and requires ribosomes to recognize and bind messenger RNAs (mRNA). In the conventional pathway of initiation, a ternary complex is formed between the 30S ribosomal subunit, the mRNA, and the initiator tRNA with the aid of three initiation factors (for review see Laursen et al., 2005). Briefly, 30S ribosomal subunits bind the Shine-Dalgarno (SD) sequence of the mRNA via complementary pairing to the 16S rRNA anti-Shine-Dalgarno (aSD) sequence (Shine and Dalgarno, 1974; Yusupova et al., 2001). The initiator tRNA binds to the complex, stabilized by the SD-aSD interaction, and promotes the proper placement of the start codon in the P-site of the ribosome with subsequent translation initiation (Ringquist et al., 1993; Wimberly et al., 2000). The mechanism for mRNA-ribosome binding via the SD-aSD has been well studied and is thought to be a prerequisite step for translation initiation (Laursen et al., 2005; Gualerzi and Pon, 2015 and references therein). However, an increasing number of genes have been identified that lack SD sequences or lack a 5’ untranslated region (5’-UTR) altogether (Chang et al., 2006). mRNAs lacking a 5’-UTR are referred to as leaderless mRNAs and have been reported in all domains of life (Janssen, 1993). Leaderless mRNA lack ribosome binding signals that would otherwise be contained within the 5’-UTR, such as the SD sequence. The widespread occurrence of these noncanonical mRNAs suggests that they contain features allowing recognition by ribosomes from all translation systems. Leaderless mRNA appear to follow a novel pathway by which a 70S ribosome binds the 5’-terminal AUG to initiate translation (Udagawa et al., 2004; Moll et al., 2004). Binding depends on an AUG initiation codon and is stabilized by initiator tRNA in Escherichia coli (Van Etten and Janssen, 1998; O’Donnell and Janssen, 2002; Brock et al., 2008). Addition of a 5’terminal AUG to an internal segment of lacZ mRNA makes it competent to form ternary complexes with 70S ribosomes and initiator tRNA (Brock et al., 2008), suggesting that a 5’terminal AUG might be a sufficient signal for ribosomes to identify a leaderless mRNA. Furthermore, a thirty-nucleotide deletion from the 5’-terminus of the naturally occurring leaderless cI mRNA results in the loss of the ability to bind ribosomes. However, addition of a 5’-terminal AUG triplet to the truncated cI leaderless mRNA restores 70S ribosome binding and allows it to compete with the native leaderless mRNA for ribosome binding in vitro and form translationally active complexes in vivo (Brock et al., 2008). Taken together, these results show 16 that the presence of a 5’-terminal AUG is sufficient for an RNA molecule to bind ribosomes and be translated as a leaderless mRNA. To further investigate the hypothesis that a 5’-terminal AUG is sufficient for translation of an mRNA, we sought to identify AUG triplets that occur at the 5’-termini of canonical, SDled mRNAs. Such 5’-terminal AUGs would have the potential to bind ribosomes and allow for translation of a second open reading frame (ORF), in addition to translation from the start codon of the downstream SD-led coding sequence (CDS). Previous work has demonstrated the abundance of small ORFs within 5’-UTRs in both eukaryotic and prokaryotic organisms. In prokaryotic organisms, these small ORFs often encode small peptides (Hobbs et al., 2011) and can regulate downstream ORF expression (Lovett and Rogers, 1996; Gaba et al., 2001). In this study, we utilized the RegulonDB database (Gama-Castro et al., 2011) to conduct an in silico search of E. coli for canonical, SD-led mRNAs that contain 5’-terminal AUG triplets. These 5’terminal upstream AUGs (5’-uAUGs) were assayed for their ability to act as initiation codons for translation of the putative 5’-terminal upstream open reading frame (5’-uORF). The 5’-uORF translational activity and ability to bind ribosomes was also analyzed, as well as their effect on the translation efficiency of their respective downstream SD-led CDS. Our results suggest that a number of canonical SD-led mRNAs contain 5’-uAUGs that bind 70S ribosomes and support biologically relevant levels of translation. Our results also suggest that certain 5’-uAUGs and their defined ORFs impact regulation of downstream expression. 17 Materials and Methods Bacterial strains. E. coli DH5α (New England Biolabs [NEB]) was used as the host for all plasmid DNA manipulations. E. coli RFS859 (F-, thr-1, araC859, leuB6, Dlac74, tsx-274, l-, gyrA111, recA11, relA1, thi-1) (Schleif, 1972) was used as host for the expression and assay of lacZ fusion mRNA constructs. E. coli K12 total genomic DNA was used as a template for PCR amplifications to isolate the genes of interest used in this study. Reagents. Radio-labeled [γ-32P] ATP (6000 Ci/mmol, 150 mCi/mL) was purchased from Perkin Elmer. Restriction endonucleases, T4 DNA ligase, T4 polynucleotide kinase (PNK), and T7 RNA polymerase were purchased from New England Biolabs and used according to manufacturer’s instructions. RNase-free DNase I (Roche), AMV reverse transcriptase (Life Sciences), and Pfu DNA polymerase (Stratagene) were used according to the manufacturer’s specifications. DNA oligonucleotides were purchased commercially (IDT). The lacZ-specific oligonucleotide 5’-GTTTTCCCAGTCACGACGTTG-3’, which anneals to positions +99 to +78 of the lacZ coding sequence in lacZ fusions, was used in the primer extension inhibition (toeprint) assays. Construction of lacZ fusions. Codons 1-16 of each gene tested, including the putative upstream open reading frame (uORF) expressed within 5’-UTR, were fused to the fifth codon of a lacZ reporter gene and cloned into pUC18-derivative plasmids (Janssen and Bibb, 1993) containing an ampicillin resistance marker. The constructs contained an upstream lac promoter (TATAAT). The 5’-uORF for each gene tested was fused to the fifth codon of a lacZ reporter gene just upstream of its in-frame stop codon. β-galactosidase assay. β -galactosidase assays were performed as previously described (Miller, 1992). Preparation of in vitro synthesized transcripts. The cloned plasmids were used as templates in PCR amplifications utilizing a primer to incorporate the T7 RNA polymerase promoter sequence (5’-TAATACGACTCACTATAG-3’). This produced DNA fragments containing a T7 promoter sequence, allowing for in vitro transcription with T7 RNA polymerase and production of RNA used in toeprint reactions. RNAs were synthesized and purified as described (Fredrick and Noller, 2002). RNAs used in toeprint assays were synthesized by combining purified PCR 18 amplicons (constructs with lacZ fusions containing a T7 promoter) and T7 RNA polymerase in 1X buffer (40 mM Tris-HCl at pH 7.8, 25 mM MgCl2, 1 mM spermidine, 0.01% Triton X-100, 5 mM each NTP, and 30 mM dithiothreitol). Transcription reactions were incubated for approximately 4 h at 37°C, and 40 mM ethylenediamineetetraacetic acid (EDTA) was added. Samples were treated with DNase (Roche) for 15 min at 37°C. RNA was ethanol-precipitated and suspended in RNA loading dye (50% formamide, 0.05% bromophenol blue, 0.05% xylene cyanol). Samples were subjected to polyacrylamide gel electrophoresis (PAGE; 6% acrylamide, 7 M urea) and full-length products were excised using UV shadowing. Gel slices were incubated overnight at room temperature in elution buffer [(300 mM NaOAc at pH 5.2, 0.1% sodium dodecyl sulfate (SDS), 1 mM EDTA)] with gentle rocking. The supernatant was phenolextracted and ethanol-precipitated. Ribosome isolation. Isolation of E. coli MRE600 70S ribosomes and 30S ribosomal subunits was performed as previously described (Martin-Farmer and Janssen, 1999). The same batch of ribosome preparations was used in each initial and duplicate primer extension inhibition assay. For confirmation of unexpected binding signals, a second ribosomal batch preparation was used and in each case the binding signals were reproduced (not shown). Primer extension inhibition (toeprint) assay. DNA oligonucleotides were phosphorylated at the 5’-terminus using [γ-32P] ATP (6000 Ci/mmol, 150 mCi/mL; Perkin Elmer) and T4 PNK in 1X kinase buffer for 30 min at 37°C and annealed to 3’-termini of RNA as previously described (Fredrick and Noller, 2002). Annealed RNA was incubated with 30S subunits or 70S ribosomes with or without tRNAfMet for 15 min at 37°C. Reactions were transferred to ice, and reverse transcriptase was added to extend from the labelled oligonucleotide primer to produce cDNA. The reactions were incubated for 15 min at 37°C and stopped by the addition of 0.3M NaOAc and 100% ethanol and precipitated overnight at -80oC. Precipitated complexes were collected by centrifugation and dissolved in loading dye (80% deionized formamide, 10 mM NaOH, 1 mM EDTA, 0.5% bromophenol blue, and xylene cyanol), followed by heat treatment (95°C, 5 min) and PAGE (6% acrylamide, 7 M urea) in 1X TBE. Gels were visualized via autoradiography. In each case, toeprint assays were performed at least twice with reproducible results. 19 Results 5’-uORFs support translation Bioinformatic analysis of the E. coli RegulonDB transcriptome database (Gama-Castro et al., 2011) of all promoter types identified several canonical mRNAs with an untranslated leader and SD-led open reading frame (ORF) that also contained an AUG triplet within three nucleotides of the mRNA’s 5’-terminus (i.e., 5’-AUG, NAUG, NNAUG, NNNAUG) (Krishnan et al., 2010). Of the 3,456 E. coli transcripts in RegulonDB, 115 transcripts have an AUG at the experimentally demonstrated or predicted 5’-terminus and 287 transcripts have an AUG within three nucleotides of their 5’-terminus (Table 1-1). In addition to undergoing translation as canonical SD-led mRNAs from an internal start codon, we predicted that these mRNAs might also be translated as leaderless mRNAs from AUG triplets located at, or near, their 5’-termini, thereby categorizing them as bicistronic mRNA. We selected a subset of thirteen genes for further study, chosen on the basis of characteristics that include the predicted gene’s function, length of the putative peptide encoded from the 5’-uORF, the distance of the 5’-uAUG triplet from the 5’-terminus, and the position of the uORF’s stop codon relative to the downstream ORF’s start codon (Table 1-2). We chose genes in which the 5’-uAUG was out of frame with the downstream start codon to select bicistronic mRNAs rather than genes in which translation of the 5’-uORF could produce longer isoforms of the canonical gene product. The translational activity of each of the 5’-uORFs was assessed by an in-frame fusion to a lacZ reporter gene and transcribed from the lac promoter (Figure 1-1A). β-galactosidase assays (Miller, 1992) were performed to measure translation from the 5’-uORFs. Activity measured from the products of lacZ fusions to naturally leaderless E. coli bacteriophage λ cI (Ptashne et al., 1976) and transposable element Tn1741 tetR (Klock and Hillen, 1986) mRNAs were used as controls for comparison of expression levels. Varying levels of expression were seen from the 5’-uORFs, with many falling within the expression range measured for the tetR and cI leaderless mRNAs (Figure 1-2). cmk, glpF, and luxS 5’-uORFs were expressed at levels 10-fold lower than tetR, whereas uvrY and pncB 5’uORFs were expressed at levels only 2-fold lower than tetR (Figure 1-2, inset). Conversely, ptrB, rhaB and xap 5’-uORFs were expressed at levels very similar to tetR (Figure 2, inset). Several 20 Table 1-1. RegulonDB identified genes. Complete list of genes identified by in silico analysis of the RegulonDB E. coli transcriptome as having a Shine-Dalgarno sequence within their 5’UTR as well as an AUG triplet within three nucleotides of the 5’-terminal (i.e., 5’-AUG, NAUG, NNAUG, and NNNAUG). The nucleotides defined as the 5’-uAUG for each gene are underlined. The table includes the gene name, the first 21 nucleotides at the 5’-terminal end of the mRNA, its start position on the E. coli chromosome, strand of location, accession number (ECK#), and the sigma factor associated with its transcription. 21 22 23 24 25 26 27 28 Table 1-2. Selected genes examined that contain an AUG triplet at the 5’-terminus of its transcribed canonical mRNA. An overlap refers to a uORF stop codon that is within the downstream coding sequence, albeit out of frame. 29 30 Figure 1-1. Schematic of lacZ fusions constructed for each gene tested. lacZ fusions to RNA encoding the 5’-uORF (A) or to the coding sequence (CDS) of the downstream gene (B-D). The 5’-uAUG is either maintained as AUG (CDS), mutated to AUC (KO) or contains a premature stop codon (StartStop). The arrow indicates the in-frame AUG start codon defining the ORF providing expression of the lacZ reporter gene. 31 32 Figure 1-2. 5’uAUGs support varying levels of translation compared to known leaderless mRNA in E. coli. Translation from select 5’-uORFs fused to lacZ (see Figure 1-1A) using βgalactosidase assays performed in triplicate. Translation is compared to β-galactosidase activity measured from cI-lacZ (= 100%) fusions. A subset of 5’-uORFs displaying lower levels of translation are shown as compared to β-galactosidase activity measured from cI-lacZ (=100%) fusions (inset). 33 34 5’-uORFs were actually expressed at levels 2-fold to 10-fold higher than tetR (e.g., mngR, fucP, rcnR and pnp) (Figure 1-2), albeit still significantly lower than cI (Figure 1-2). The most highly expressed 5’-uORF tested was iscR, which was expressed at a level 2-fold greater than cI (Figure 1-2). These results show that the 5’-uAUGs can act as an initiation codon to support translational expression, thereby identifying previously unexplored ORFs. lacZ fusions were also constructed to assess the translational activity of the canonical SDled downstream CDS (Figure 1-1B). In each instance, the 5’-uORF and the downstream SD-led ORF are in different reading frames or the uORF’s stop codon is upstream of the annotated ORF and could therefore be compared individually via LacZ fusions. This is important to note because in some instances the two cistrons overlap but will still produce two different gene products (Table 1-2). Only one gene tested, rcnR, supported equal levels of expression from both ORFs (Table 1-3). However, three genes, pnp, iscR, and mngR, had 5’-uORFs that were translated at levels ranging from 10-50% of downstream CDS expression (Table 1-3). The majority of 5’uORFs tested were not expressed at comparable levels (Table 1-3). There is large variation in the levels of translation from these 5’-uAUGs and the basis for these differences is still unclear. These results demonstrate that an AUG codon at the 5’-terminus of an ORF in the untranslated region of a canonical, SD-led mRNA has the potential to function as a start codon and support significant translational activity. However, the presence of a 5’-uAUG is not always sufficient for translation. Comparable expression to known leaderless mRNAs, as well as to their downstream cognate CDS in some cases, suggests the potential importance of 5’-uAUGs. 5’-uAUGs bind 70S ribosomes Primer extension inhibition (toeprint) assays were performed to assess ribosome binding patterns to the 5’-uAUG start codons and the internal SD-led start codons. In vitro transcribed mRNAs, corresponding to 5’-uORFs exhibiting varying expression levels (fucP, iscR, rcnR, and ptrB) (Figure 1-2), were tested by toeprint assays to analyze the inherent affinity of ribosome complexes for the 5’-uAUG. Assays were performed in the presence of initiator tRNA and either 30S ribosomal subunits or intact 70S ribosomes. As expected, 70S ribosomes bound the 5’uAUGs, whereas 30S subunits bound the internal CDS start codon for each gene tested (Figure 1-3). This ribosome binding pattern suggests that these mRNAs interact with ribosomes both as 35 Table 1-3. Translation from the 5’-uORF as a percent of translation relative to its downstream canonical CDS as determined by β-galactosidase assays performed in triplicate. 36 37 Figure 1-3. 5’-uAUGs bind 70S ribosomes following the proposed leaderless mRNA initiation mechanism. Primer extension inhibition (toeprint) reactions contain mRNA (A) fucP, (B) iscR (C) rcnR (D) ptrB, as well as initiator tRNA, and 30S subunits or 70S ribosomes as indicated by + or - symbols; reaction products are separated on denaturing polyacrylamide gels and visualized by autoradiography. Predicted position of toeprint signals (+15) to the 5’-uAUG (open arrow) and downstream AUG (closed arrow) are indicated. Additional positions displaying ribosome dependent bands are also indicated. Figure is representative of experiments performed in biological triplicate. 38 39 canonical leadered mRNAs, with 30S subunits binding to the CDS start codon, and as leaderless mRNAs with 70S ribosomes binding to the 5’-uAUGs. This further supports the notion that the 5’-uAUGs are functioning as initiation codons for uORF translation. In the case of fucP, a 70S toeprint signal was observed at the 5’-uAUG which displays weak intensity compared to the downstream 30S subunit signal. The relationship between the two signals correlates with the fucP expression data (Table 1-3). The presence of a 70S toeprint signal at the 5’-uAUG confirms the ability of the fucP 5’-uORF to bind ribosomes (Figure 1-3A) and be expressed as a leaderless mRNA (Figure 1-2). The fucP downstream CDS was shown to be relatively highly expressed (Table 1-4), which agrees with the strong internal toeprint signals. Interestingly, 70S ribosomes also appeared to bind fucP’s internal AUG in vitro. The internal 70S ribosome binding phenomenon was also seen in the case of iscR mRNA. 30S subunits bound to the SD-led CDS start codon of iscR mRNA and 70S ribosomes bound to the 5’-uAUG (Figure 1-3B), as expected. 70S ribosomes binding to iscR mRNA’s 5’uAUG (Figure 1-3B) corresponds with a high level of iscR 5’-uORF expression compared to leaderless cI mRNA (Figure 1-2). In addition, 70S ribosome binding to the internal SD-led start codon of iscR was observed, as with fucP above. The toeprint signal strength of 70S ribosome binding of the SD-led start codon was nearly as strong as that of the 30S subunit binding to the internal start codon. There have been other reports of internal 70S ribosome binding in toeprint assays, but the 70S ribosome binding is typically weaker than 30S subunit binding at the internal position (Hartz et al., 1989; Udagawa et al., 2004). It is interesting that internal 70S ribosomal binding was seen for both fucP and iscR, although not for rcnR or ptrB (Figure 1-3), suggesting that, for reasons that are still unclear, certain mRNAs have features that allow 70S internal binding in vitro. It is possible that 70S internal binding is always present, but less stable for certain mRNAs although further investigation must be completed to examine this line of inquiry. The observation that 70S ribosomes did not bind the CDS of all tested transcripts (Figure 1-3) demonstrates the absence of contamination of 30S subunits and absence of ribosomal splitting occurring within the 70S ribosomal preparations. Assays reproduced with different 70S batch preparations and different mRNA preparations consistently gave the same results (data not shown). 40 Table 1-4. Expression levels from downstream CDS fusions to lacZ that contain an intact 5’-uAUG (CDS), a mutated 5’-uAUC (KO), or an intact 5’-AUG with a premature stop codon (StartStop) (see Figure 1-1) including the standard deviation from triplicate cultures. NA=not available 41 42 As expected, 30S subunits bound to the SD-led CDS start codon of rcnR mRNA and 70S ribosomes bound to the 5’-uAUG (Figure 1-3C). The toeprint signal strength for 70S ribosomes bound to the 5’-uAUG of rcnR was equal to or greater than that of 30S subunits binding to the internal SD-led AUG. This indicates similar toeprint signal strength, and may reflect the observation that translation from lacZ fusions to the rcnR 5’-uORF and the downstream SD-led CDS were equivalent (Table 1-3). The 70S ribosomes binding at the 5’-uAUG, with or without initiator tRNA, displayed multiple toeprint signals at the expected +16 position, but also at the +5 position and the +25 position (Figure 1-3C lanes 4 and 6). These signals are ribosomedependent, indicating that they are not the product of secondary structure. Ribosomes appear to stably bind the 5’-terminus without the AUG start codon positioned in the P-site, demonstrating that tRNA codon-anticodon pairing is unnecessary, supporting tRNA-independent binding. Although this signal pattern has been reproduced in subsequent toeprint assays (data not shown), further experimentation is necessary to explore if this ribosome binding pattern reflects the process of ribosomal loading on 5’-terminal AUGs. Aside from the predicted 30S subunit toeprint binding signal binding to the downstream rcnR CDS start codon, there was another tRNA-dependent signal at position +102 corresponding to 30S subunit binding to an AUG at position +88 within the 5’-UTR (Figure 1-3C lane 3, Table 1-5). This potential start codon is out of frame with the rcnR CDS start codon and specifies an uORF that would produce a 22-amino acid long putative peptide before encountering a stop codon. These results indicate that there are additional ORFs within the rcnR mRNA that may represent additional peptide production. Similarly to the other mRNAs tested, 30S subunits bound to the internal SD-led start codon of ptrB mRNA, and 70S ribosomes bound to the 5’-uAUG. In the case of ptrB, however, the 70S toeprint signal intensity at the 5’-uAUG was not a reliable predictor of expression level. ptrB’s 5’-uAUG showed very strong 70S binding as well as tRNA-independent 70S binding (Figure 1-3D), but expression from the 5’-uORF was low compared to either its downstream CDS (Table 1-3) or known leaderless mRNA (Figure 1-2). In this case, the ribosome binding data does not correlate to translational activity seen in β-galactosidase assays which may indicate that select 5’-uAUGs are bound by ribosomes for purposes other than translation initiation. 43 Table 1-5. Sequences of mRNAs tested. List of mRNAs tested in this study and their sequences including their 5’UTRs (lower case) and the first 15 codons of the coding sequences (upper case). The 5’-uAUGs and their in-frame stop codons are upper case and bold. The underlined sequences correspond to the additional rcnR putative uORF identified using toeprint assays. 44 45 5’-uAUGs can influence downstream gene expression The 70S toeprint signal intensity of ptrB led us to suspect that 5’-uAUGs function not only in peptide production but also as regulatory features. Genes in the same transcriptional unit, such as in an operon, typically have related functions, and in some cases the uORFs can affect expression of downstream cistrons. One example of this type of regulation is referred to as translational coupling (Sarabhai and Brenner, 1967) and is typically necessary due to sequestration of the downstream ribosome binding regions by secondary structure (Rex et al., 1994) and is most efficient when the upstream stop codon is in close proximity to the downstream start codon (Cone and Steege, 1985). To assess effects on downstream CDS expression that might result from disruption of 5’-uORF translation, lacZ was fused to the SDled CDS in the presence of a 5’-uAUG knockout mutation (i.e., uAUGAUC) (Figure 1-1C). Mutation of the 5’-terminal start codon would be expected to prevent ribosomal binding as well as prevent translation of the 5’-uORF. Mutation of a subset of genes’ 5’-uAUGs to AUCs affected expression of the SD-led downstream CDS to varying degrees (Table 1-4). Many remained minimally affected, whereas some increased in expression (e.g., cmk, iscR, luxS and mngR) and others decreased in expression (e.g., fucP, glpF, ptrB, and rcnR) (Table 1-4). To determine whether the change in downstream CDS expression was due to loss of translation of the 5’-uORF or loss of ribosome binding to the mRNAs’ 5’-terminus, we made a separate start-stop mutation to introduce a premature stop codon two codons after the 5’-uAUG start codon (i.e., AUGxxxUGA) in a subset of genes (Figure 1-1D). The 5’-uAUGs may stabilize the mRNA due to their inherent ability to bind ribosomes thereby protecting the mRNA from degradation by RNases (Komarova et al., 2005). This protection may contribute to the positive effects the 5’-uAUG has on downstream expression, independent of 5’-uORF translation. In some cases, the start-stop mutation produced results similar to those of the corresponding 5’-uAUG knockout mutations, emphasizing the importance of translation of the 5’-uORF (Table 1-4). The luxS and mngR mRNAs both had increased expression levels in the knockout mutant and decreased expression levels in the start-stop mutant (Table 1-4), highlighting the negative effect of the 5’-uAUG codon on CDS expression. Of the thirteen different genes assayed containing 5’-uAUG5’-AUC mutations, the ptrB gene was particularly affected by the 5’-uAUG mutations. As a result of the 5’-uAUG 5’46 AUC mutation, expression from the downstream ptrB CDS was drastically reduced by 93% (Table 1-4). Toeprint assays revealed that 30S subunit binding to the downstream start codon in the presence of the 5’-AUC mutation was nearly eliminated (Figure 1-4), correlating with the expression data (Table 1-4). However, in the presence of the start-stop mutation, expression was only reduced by 59% compared to the wild-type level (Table 1-4) and reappearance of a 30S subunit toeprint was observed (Figure 1-4). These data show that ptrB’s 5’-uAUG influences 30S subunit binding to the downstream start codon and concomitantly influences expression. This suggests there is a potential regulatory function of ptrB’s 5’-uAUG on downstream expression that is not dependent on uORF translation. 47 Figure 1-4. Loss of 30S subunit binding to CDS start codon as a result of the 5’-uAUG mutation. Primer extension inhibition (toeprint) reactions contain mRNA, initiator tRNA, and 30S subunits or 70S ribosomes as indicated by + or - symbols; reaction products are separated on denaturing polyacrylamide gels and visualized by autoradiography. Predicted position of toeprint signals (+15) to the 5’-AUG (open arrow) and downstream AUG (closed arrow) are indicated. ptrB sequencing ladder is included on the left. Figure is representative of experiments performed in biological triplicate. 48 49 Discussion The results presented here indicate that AUG triplets at the 5’-termini of SD-led mRNAs can function as start codons and have the ability to bind ribosomes and be translated as leaderless mRNAs. Using β-galactosidase and toeprint assays, we have shown that many 5’-uORFs were translated at biologically relevant levels and bound by 70S ribosomes, with some appearing to play a role in expression of the downstream CDS. A number of the previously uncharacterized 5’-uORFs supported levels of expression comparable to known leaderless mRNAs or their respective downstream CDS. rcnR is one example of an mRNA this study has identified as an efficiently translated bicistronic mRNA. Stable ribosome binding at the 5’-terminus supports high expression levels observed from rcnR’s 5’-uORF. Since rcnR’s 5’-uORF and CDS expression were similar (Table 1-3), this suggests nearly equivalent amounts of polypeptide product are being made for each cistron. This is surprising because leaderless mRNA is typically thought to be less efficiently translated than leadered mRNA (Moll et al., 2002). Since the rcnR 5’-uAUGAUC mutation reduced downstream CDS expression (Table 1-4), this 5’-uAUG may also play a role in downstream expression, possibly through translational coupling. Therefore, rcnR is an example of a 5’-uAUG that specifies a highly expressed uORF whose expression is linked to downstream expression. Additional regulatory features, possibly independent of the putative translated peptide, may be present within the 5’-uORF that could influence expression efficiency or function of the downstream CDS. This form of regulation is widespread and has been seen in both prokaryotic (Dincbas et al., 1999; Delgardo-Olivares et al., 2006; Sharma et al., 2012) and eukaryotic systems (Morris and Geballe, 2000 and references therein; Meijer and Thomas, 2002; Medenbach et al., 2011). Mutation of the 5’-uAUG to AUC impacted downstream expression in iscR, mngR, ptrB, and rcnR mRNA (Table 1-4). In each case, this suggests that disruption of ribosome binding and/or 5’-uORF translation has an effect on the downstream CDS. Conversely, in some mRNAs (i.e., rhaB, uvrY, and xap), no change was seen in CDS expression as a result of the 5’-uAUG mutation (Table 1-4) indicating that the CDS is expressed independently from any 5’-uORF ribosome binding or translation. This reinforces the idea that 5’-uAUGs have diverse functions, and may function differently in different contexts. 50 The ptrB CDS showed a dramatic decrease in expression, as well as loss of internal 30S subunit binding, in the presence of the 5’-uAUG knockout (Table 1-4, Figure 1-4). The dependency on the 5’-uAUG, but not translation of the putative peptide, for ribosome binding and expression of the ptrB CDS, suggests regulatory roles. Since the ptrB 5’-UTR is only 26 nucleotides long (Table 1-5), the regulatory effects may be related to an overlap in ribosomal occupancy causing a steric hindrance and prohibiting both 70S ribosomes and 30S subunits from being stably bound at the same time. We propose that the 5’-uAUG is the major signal for ribosome recruitment and binding to the mRNA for CDS expression, although the mechanism remains unclear. In one possible scenario, rather than translating the 5’-uORF, this region may act as a standby site (de Smit and van Duin, 2003) for ribosome loading onto the mRNA, taking advantage of inherent binding strength to 5’-AUGs. Once the ribosome is bound to the 5’terminus, it could then more efficiently access and bind the downstream CDS translation initiation region. This pre-loading may result in more stable ribosome binding and more efficient internal CDS translation. Further investigation into this potential model may elucidate a novel mechanism of initiation or regulation of translation via a 5’-uAUG. While the majority of the 5’-uAUGs we tested function in either 5’-uORF translational expression or regulation, some 5’-uAUGs showed no obvious role in these activities. The 5’uORFs of cmk, glpF, and luxS mRNA were translated at levels much lower than tetR (Figure 12), and do not appear to have a substantial effect on downstream CDS expression (Table 1-4). It is possible that these 5’-uAUGs may have formed by chance and have not been evolutionarily selected for a regulatory role. Alternatively, it is possible that there is an unknown function for these 5’-uAUGs that we have not considered or tested. Overall, the 5’-uAUGs we tested may possess a variety of potential functions, such as providing protection to stabilize the mRNA transcript via ribosome binding, producing peptides, and contributing to regulation of the downstream CDS. The ribosome binding studies revealed the translation potential for unannotated uORFs, which may be more widespread than previously thought. Ribosome binding studies similar to the work we have performed could also reveal ORFs that are not detected by visual inspection or bioinformatic analysis. This study provides insight into pitfalls of our current methods of identifying translation initiation sites, and imply that in silico analyses may be biased by imposing size limitations and overlooking uORFs when 51 analyzing genomes. The 5’-uORFs may represent an additional subtype of cistron that should continue to be considered when annotating genomes because this study shows their potential to be functional at biologically relevant levels. 52 Chapter 2 Novel translation initiation regulation mechanism in Escherichia coli ptrB directed by 5’terminal AUG and downstream coding sequence elements Heather Beck, Gary Janssen 53 Abstract Translation of Escherichia coli ptrB requires an AUG triplet at the 5’-terminus of its messenger RNA (mRNA). The 5’-terminal AUG (5’-uAUG) acts as a ribosome recognition signal to attract ribosomes to the ptrB mRNA. A mutation in the 5’-uAUG of ptrB reduced downstream ptrB expression by more than 90%, even in the presence of the predicted ptrB Shine-Dalgarno (SD) sequence. Strengthening of the ptrB SD sequence relieved the necessity for the 5'-uAUG, suggesting that the regulation imposed by the 5'-uAUG is distinct from canonical SD sequencemediated regulation. Additional sequences within the ptrB 5’-untranslated region (5’-UTR) were identified that contribute to ribosome binding and expression of the downstream ptrB coding sequence (CDS). Replacement of 5’-UTRs from other mRNAs with the ptrB 5’-UTR sequence showed a similar dependence on the 5’-uAUG for downstream expression, suggesting that the regulatory features contained within the ptrB 5’-UTR are sufficient to control the expression of other CDS. Mutational analysis also identified two specific adenine nucleotides within the ptrB CDS at position +6 and +9 that contribute to ptrB expression. The primary sequence features identified in ptrB mRNA reveal a novel form of translation regulation driven by the 5’-uAUG. Due to the abundance of AUG triplets at the 5’-termini of E. coli mRNAs and the ability of the ptrB 5’-UTR regulation to function independent of gene context, the regulatory effects of 5’uAUGs on downstream CDS may be widespread throughout the E. coli genome. 54 Introduction Protein synthesis, in which the ribosome translates a messenger RNA (mRNA) sequence to form polypeptides, is a four-step process, including initiation, elongation, termination, and ribosome recycling. Translation initiation is energy-dependent and the most highly regulated phase of translation (Laursen et al., 2005). Both the mRNA’s sequence and secondary structure can affect how it interacts with the translational machinery and therefore influence its translation efficiency. The translational machinery interacts with the mRNA at its ribosome binding site (RBS), along with initiator tRNA, to form a ternary complex that is equipped for polypeptide production. The RBS of a canonical mRNA contains an initiation codon, a purine-rich ShineDalgarno (SD) sequence complementary to the anti-Shine-Dalgarno (aSD) sequence of the 16S rRNA (Shine and Dalgarno, 1974), and an appropriately sized spacer region between the two aforementioned elements (Ringquist et al., 1992; McCarthy and Brimacombe, 1994). The SD sequence and its spacing upstream of the initiation codon are important for ribosome binding and proper placement of the initiation codon in the ribosomal P-site to initiate translation (Laursen et al., 2005). These primary structural elements, as well as potential upstream and downstream enhancer sequences can contribute to translation efficiency. The cooperative effect of multiple nucleotides at specified regions is referred to as the “cumulative specificity initiation mechanism,” and the multiple elements can collectively contribute to ribosome recognition, strength of ribosome binding, and reading frame selection (Nakamoto, 2011). The mRNA’s secondary structure can also impact the rate of translation. The ribosome requires a local single stranded region for efficient binding (Draper, 1987; Draper et al., 1998; Gualerzi and Pon, 1990). Secondary structure that occludes the RBS can cause the region to be unrecognizable to the ribosome and negatively impact translation (de Smit and van Duin, 1990; de Smit and van Duin, 1994; Gu et al., 2010). Nevertheless, secondary structure can act to regulate translation and allow for certain mRNAs to be translated only under specific conditions. For example, cold or heat stress can cause thermodynamic changes in mRNA structure that could either open or occlude the RBS and therefore result in a change in initiation efficiency (Storz, 1999; Narberhaus et al., 2006; Klinkert and Narberhaus, 2009). There are also examples of cis-acting elements, such as upstream open reading frames (uORFs), which can influence the expression of downstream open reading frames (ORFs) on the 55 same transcript. In prokaryotes, uORFs exert their effects primarily through translational coupling (Sarabhai and Brenner, 1967; Min Jou et al., 1972; Andre, 2000). These effects occur after the ribosome completes translation of the uORF and, rather than dissociating, remains bound to the mRNA. The ribosome then repositions to the downstream initiation region using a scanning-like movement (Adhin and van Duin, 1990). This process is beneficial because it allows polycistronic mRNAs to be controlled from a single point (Spanjaard and van Duin, 1989) and may eliminate inhibitory RNA structures that would otherwise disrupt initiation of the downstream ORF (Rex et al., 1994; Takyar et al., 2005). uORFs can also encode nascent peptides that cause ribosomal stalling by binding to regions of the peptidyltransferase center, blocking the ribosome exit tunnel or through attenuation (Lovett and Rogers, 1996; Tenson and Ehrenberg, 2002). This stalling can then impact downstream translation by blocking the RBS, causing conformational changes in secondary structure, or increasing degradation of the mRNA (Nakatogawa and Ito, 2002; Cruz-Vera et al., 2011). We recently reported another example of the influences of uORFs on downstream expression (Beck et al., 2016). In that study, we identified novel examples of uORFs contained within the 5’-untranslated region (5’-UTR) of SD sequence-led mRNAs in Escherichia coli. The newly discovered uORFs were all positioned at the 5’-terminus, thereby classifying them as leaderless mRNAs. Leaderless mRNAs lack a 5’-UTR and are thought to initiate translation via a mechanism distinct from leadered mRNAs, in which an intact 70S ribosome binds to the 5’terminus of the mRNA (Balakin et al., 1992; Udagawa et al., 2004; Moll et al., 2004). The leaderless uORFs tested bound 70S ribosomes and were translated to varying degrees (Beck et al., 2016). A subset of the uORFs also contributed to downstream ORF expression levels, either through uORF translation or through their respective 5’-terminal AUG’s ability to act as a ribosome binding signal. Downstream expression of ptrB, an excellent example, was dependent upon the 5’-terminal AUG (5’-uAUG) even though the 5’-terminal uORF was not efficiently translated (Beck et al., 2016). The necessity of the ptrB 5’-uAUG and its role in ptrB coding sequence (CDS) regulation is the focus of this study. The annotated ptrB CDS produces a 686-amino acid protease II predicted to be a member of the prolyl endopeptidase family (Kanatani et al., 1991), but little is currently known about its regulation. The ptrB mRNA has a 26-nucleotide untranslated leader 56 region, containing a predicted SD sequence located nine nucleotides upstream of the ptrB initiation codon with imperfect complementarity to the aSD sequence (Figure 2-1). The 5’uAUG defined uORF overlaps with the downstream CDS reading frame, such that the uORF stop codon is at the +35 position (with the ‘A’ of the CDS start codon at +1). Therefore, if translated, the uORF would produce a 21-amino acid peptide. This study reinforces the dependency of ptrB translation on the 5’-uAUG based on its ability to act as a ribosome-binding signal rather than through translation of the uORF. Ribosome binding and expression assays revealed a regulatory process distinct from SD sequence-mediated regulation, suggesting a novel mechanism for translation regulation driven by a 5’-uAUG. We propose two potential regulatory models that could play a role in the translation initiation of ptrB. These findings are significant because the ptrB 5’-UTR regulatory regions can act autonomously regardless of gene context, suggesting that this form of regulation may be widespread in E. coli. 57 Figure 2-1. The ptrB gene sequence. The 5’-ATG is bolded at the 5’-terminus. The 5’-UTR is indicated by the lowercase letters. The predicted SD sequence is underlined. The CDS is indicated by the uppercase letters. The 5’-uAUG defined ORF’s stop codon is within the coding sequence and also bolded (TGA). The SalI site used to construct the in-frame lacZ fusion is indicated by the dashed underline. 58 59 Materials and Methods Bacterial strains and reagents. E. coli DH5α (New England Biolabs [NEB]) was used as the host for all plasmid DNA manipulations. E. coli RFS859 (F-, thr-1, araC859, leuB6, Dlac74, tsx-274, l-, gyrA111, recA11, relA1, thi-1) (Schleif, 1972) was used as the host for the expression and assay of lacZ fusion mRNA constructs. Chromosomal DNA from E. coli K12 was used as a template for initial PCR amplifications of the ptrB gene. Two plasmids were used in the study, pM1108 (for the incorporation of a mutated lac promoter) and pA906 (to allow for the lacZ gene fusion) (Janssen and Bibb, 1993). Radiolabeled [γ-32P] ATP (6000 Ci/mmol, 150 mCi/mL) was purchased from Perkin Elmer. Restriction endonucleases, T4 DNA ligase, T4 polynucleotide kinase (PNK), and T7 RNA polymerase were purchased from NEB and used according to the manufacturer’s instructions. RNase-free DNase I (Roche), AMV reverse transcriptase (Life Sciences), and Pfu DNA polymerase (Stratagene) were used according to the manufacturers’ specifications. DNA oligonucleotides were purchased (IDT). Recombinant DNA procedures. PCR amplifications were performed using E. coli K12 chromosomal DNA or purified plasmids containing the gene of interest as templates. Specific oligonucleotides (IDT) were added to incorporate specific mutations (Table 2-1), in addition to the template, 5X Pfu buffer (commercially bought with subsequent addition of 1 mM dNTPs), and Pfu DNA polymerase, to complete PCR amplifications. After amplification, PCR products were electrophoresed on 1.3% low-melting point agarose gels for DNA purification. DNA bands were then excised from the gels and purified using a phenol: chloroform: isoamyl alcohol (25:24:1) mixture and isopropanol precipitation. Purified DNA products were then digested with restriction endonucleases and ligated into plasmid pA906, containing an ampicillin resistance marker and the mutated lac promoter (AATAAT), to fuse the regions of interest in-frame to the fifth codon of the lacZ reporter gene. i. Construction of DNA molecules with randomized sequence regions Oligonucleotides with regions of randomized nucleotides were purchased (IDT). The randomized regions in each primer were three nucleotides long and designed to allow for the 60 possibility of each nucleotide (A, T, G, or C) at each position within a single primer mixture, except in an instance that would result in the introduction of a termination codon. The randomized region extended from the +4 to +16 position (with the ‘A’ of the 5’-uAUG at position +1). Mutants were cloned as previously described and screened via lacZ blue-white screening on Luria-Bertani plates with the addition of 40 µg/mL 5-bromo-4-chloro-3-indolylbeta-D-galactopyranoside (X-gal). Colonies with detectable color differences compared to the wild-type ptrB were selected and sequenced on an Applied Biosystems 3730 DNA analyzer to identify the sequence of the randomized region. ii. Gene fusions Fusions of regions from two separate genes (Table 2-1) were constructed using trimolecular ligations. The primers at the amplicons’ junction were phosphorylated with T4 PNK, T4 PNK buffer (2 M Tris-HCl, pH 7.6, 1 M MgCl2, 1 M DTT) and ATP (25 mM) for 30 minutes at 37oC. The reactions were then heat-inactivated for 20 minutes at 65oC. The regions within the genes used for the fusions were amplified using PCR with the phosphorylated primers, purified as previously described, and digested to allow for incorporation into pM1108. The two amplicons and the plasmid were then ligated using T7 ligase and 10X ligation buffer (NEB) and incubated at 16oC overnight. The ligation was then used as a template for a subsequent PCR amplification using a 5’ primer specific for the plasmid and a 3’ primer specific to the downstream amplicon to ensure proper ligation. The resulting amplicon was then purified and subcloned in pA906 as previously described for constructing lacZ fusions. Preparation of competent cells. This is a modified version of the previously described Hanahan procedure (Hanahan, 1983). An overnight stationary phase culture of E. coli DH5α or RFS859 cells was diluted 1:50 with L broth (10 g/L Difco Bacto tryptone, 5 g/L Difco Bacto yeast extract, 10 g/L NaCl, 1 g/L glucose) supplemented with 20 mM MgCl2 and incubated at 37oC, with shaking at 200 rpm, for two hours. Cultures were then incubated on ice for 10 minutes and centrifuged at 2795 × g for 5 minutes. The supernatant was discarded and the cell pellet was resuspended in 20 mL of ice-cold 10 mM NaCl. After resuspension, the culture was again centrifuged at 2795 × g for 5 minutes. The supernatant was discarded and the cell pellet was resuspended with 20 mL of 30 mM CaCl2 and incubated on ice for 1 hour. The resuspension was 61 again centrifuged at 2795 x g for 5 minutes and the pellet was resuspended in 3 mL of 30 mM CaCl2 and stored at 4oC for up to two weeks. Transformations. Transformations were performed by adding ligation mix to chemically competent E. coli cells and incubated on ice for 30 minutes. The mixture was then heat shocked at 37oC for 2 minutes then incubated on ice for 2 minutes. The transformation mixture was then plated onto L agar (L broth with agar, 15 g/L) containing 200 µg/mL ampicillin for selection and 40 µg/mL X-gal for screening and incubated at 37oC for approximately 24 hours. β-galactosidase assay. β -galactosidase assays were performed as previously described (Miller, 1992). In vitro synthesis of RNA. The cloned plasmids were used as templates in PCR amplifications utilizing a primer to incorporate the T7 RNA polymerase promoter sequence (5’TAATACGACTCACTATAG-3’). This produced DNA fragments containing a T7 promoter sequence, allowing for in vitro transcription with T7 RNA polymerase and production of RNA used for toeprint reactions. RNAs were synthesized and purified as described (Fredrick and Noller, 2002). RNAs used in toeprint assays were synthesized by combining purified PCR amplicons (constructs with lacZ fusions containing a T7 promoter) and T7 RNAP in 1X buffer (40 mM Tris, pH 7.8, 25 mM MgCl2, 1 mM spermidine, 0.01% Triton X-100, 5 mM each NTP, and 30 mM DTT). Transcription reactions were incubated for approximately 4 h at 37°C, and 40 mM EDTA was added. Samples were treated with DNase (Roche) for 15 min at 37°C. RNA was ethanol-precipitated and suspended in RNA loading dye (50% formamide, 0.05% bromophenol blue, 0.05% xylene cyanol). Samples were subjected to PAGE (6% acrylamide, 7 M urea) and full-length products were excised using UV shadowing. Gel slices were incubated overnight at room temperature in elution buffer (300 mM NaOAc at pH 5.2, 0.1% SDS, 1 mM EDTA) with gentle rocking. The supernatant was phenol-extracted and ethanol-precipitated. Primer extension inhibition (toeprint) assay. DNA oligonucleotides were phosphorylated at the 5’-terminus using [γ-32P] ATP (6000 Ci/mmol, 150 mCi/mL; Perkin Elmer) and T4 PNK in 1X kinase buffer for 30 min at 37°C and annealed to 3’-termini of RNA as previously described (Martin-Farmer and Janssen, 1999). Annealed RNA was incubated with 30S subunits or 70S ribosomes with or without tRNAfMet for 15 min at 37°C. Reactions were transferred to ice, and 62 reverse transcriptase was added. The reactions were incubated for 15 min at 37°C and stopped by the addition of 0.3M NaOAc and 100% ethanol and precipitated overnight at -80oC. Precipitated complexes were collected by centrifugation and dissolved in loading dye (80% deionized formamide, 10 mM NaOH, 1 mM EDTA, and 0.5% bromophenol blue and xylene cyanol), followed by heat treatment (95°C, 5 min) and PAGE (6% acrylamide, 7 M urea) in 1X TBE. Gels were visualized via autoradiography. 63 Results ptrB 5’-uAUG as a ribosome binding signal Our recent study showed that the ptrB 5’-uAUG is necessary for downstream CDS expression, although the 5’-uORF is expressed at a low level (Beck et al., 2016). lacZ fusions after five, ten, and fifteen codons of the uORF also resulted in low levels of expression (data not shown) suggesting that no region of the uORF is efficiently translated. Because the 5’-uORF is not efficiently translated, we hypothesize that the 5’-uAUG’s effect on CDS expression is due to the use of the 5’-uAUG as a signal for ribosome binding, rather than through translation of the uORF. However, it is possible that the two ORFs may be translationally coupled (Andre et al., 2000) because there are examples of coupling that do not exhibit a linear relationship of a gene pair with fixed stoichiometry (Rex et al., 1994; Yu et al., 2001), so we chose to first investigate this possibility. To rule out the possibility of translational coupling in ptrB, the stop codon of the uORF was mutated, allowing for ribosomal read-through and therefore disrupting potential coupling. Mutation of the uORF stop codon resulted in no discernible change in CDS expression (data not shown), suggesting that this regulation is not a result of translational coupling. Additional stop codons in-frame with the 5’-uAUG were also introduced upstream of the natural uORF stop codon (Table 2-1, pAptrB.prestop1, pAptrB.prestop2, pAptrB.prestop3) to examine the influences of ribosomal progression on the transcript. The additional stop codons also had no effect on CDS expression (data not shown) supporting the conclusion that translational coupling does not contribute to ptrB regulation. This also reinforces the notion that it is the 5’-uAUG itself that is necessary for downstream CDS expression, rather than uORF translation. Predicted SD sequence has unexpected effects Expression of the downstream CDS was drastically reduced in the presence of the 5’uAUG mutation despite the fact that the predicted SD sequence was still intact (Beck et al., 2016). These data suggest that the SD sequence is not the sole ribosome binding signal and its presence cannot overcome the negative impact of the 5’-uAUG mutation. To address the SD sequence’s role in ptrB CDS expression, the SD sequence was mutated to its complement 64 Table 2-1. DNA sequences of gene fragments, with and without mutations, from ptrB. The predicted SD sequence is underlined, the mutated nucleotides are bolded and the stop codons are italicized. The pA series of plasmid constructs contained the ptrB coding sequence, and the pAB series contained the CDSs of different genes. 65 Plasmid Sequence Variable 5’UTR Variable coding sequence pAptrB.WT ATGTTTCAACCAGAAAGAACAATAAC ATG CTA CCA AAA GCC GCC pAptrB.5’KO ATCTTTCAACCAGAAAGAACAATAAC ATG CTA CCA AAA GCC GCC pAptrB.prestop1 ATGTTTTAACCAGAAAGAACAATAAC ATG CTA CCA AAA GCC GCC pAptrB.prestop2 ATGTTTCAACCAGAATGAACAATAAC ATG CTA CCA AAA GCC GCC pAptrB.prestop3 ATGTTTCAACCAGAAAGAACAATAAC ATG CTA CTA AAA GCC GCC pAptrB.SDmut ATGTTTCAACCACTTTCAACAATAAC ATG CTA CCA AAA GCC GCC pAptrB.5’KO.SDmut ATCTTTCAACCACTTTCAACAATAAC ATG CTA CCA AAA GCC GCC pAptrB.SDstr ATGTTTCAACCAGGAGGAACAATAAC ATG CTA CCA AAA GCC GCC pAptrB.5’KO.SDstr ATCTTTCAACCAGGAGGAACAATAAC ATG CTA CCA AAA GCC GCC pAptrB.structmut ATGAAACAACCAGAAAGAACAATAAC ATG CTA CCA AAA GCC GCC pAptrB.structmutcomp ATGAAACAACCAGAAACTTCAATAAC ATG CTA CCA AAA GCC GCC pAptrB.scan1 ATGAAAGTACCAGAAAGAACAATAAC ATG CTA CCA AAA GCC GCC pAptrB.scan2 ATGTTAGTTGCAGAAAGAACAATAAC ATG CTA CCA AAA GCC GCC pAptrB.scan3 ATGTTTCATGGTCAAAGAACAATAAC ATG CTA CCA AAA GCC GCC pAptrB.scan4 ATGTTTCAACGTCCCAGAACAATAAC ATG CTA CCA AAA GCC GCC pAptrB.scan5 ATGTTTCAACCACCCCCAACAATAAC ATG CTA CCA AAA GCC GCC pAptrB.scan6 ATGTTTCAACCAGAACCTTGTATAAC ATG CTA CCA AAA GCC GCC pAptrB.scan7 ATGTTTCAACCAGAAAGAACTTATTC ATG CTA CCA AAA GCC GCC pAptrB.5ntdel ATGTTCAACCAGAAAGAACAA ATG CTA CCA AAA GCC GCC pAptrB.9ntdel ATGCAACGAAAGAACAA ATG CTA CCA AAA GCC GCC pAptrB.SDdel ATGTTTCAACCAAACAATAAC ATG CTA CCA AAA GCC GCC pAptrB.9ntadd ATGTTTCAACCATTTCAACCAGAAAGAAC AATAAC ATG CTA CCA AAA GCC GCC pAptrB.21ntadd ATGTTTCAACCACTTTCAACAATAACATT TCAACCAGAAAGAACAATAAC ATG CTA CCA AAA GCC GCC 66 pAptrB.33ntadd ATGTTTCAACCACTTTTTCAACCACTTTC AACAATAACATTTCAACCAGAAAGAACA ATAAC ATG CTA CCA AAA GCC GCC pAptrB.6GCA ATGTTGCAACCAGAAAGAACAATAAC ATG CTA CCA AAA GCC GCC pAptrB.6TCG ATGTTTCGACCAGAAAGAACAATAAC ATG CTA CCA AAA GCC GCC pAptrB.6GCG ATGTTGCGACCAGAAAGAACAATAAC ATG CTA CCA AAA GCC GCC pAptrB.7CGC ATGTTTCGCCCAGAAAGAACAATAAC ATG CTA CCA AAA GCC GCC pAptrB.7AGA ATGTTTAGACCAGAAAGAACAATAAC ATG CTA CCA AAA GCC GCC pABptrB-pnp ATGTTTCAACCAGAAAGAACAATAAC TTG CTT AAT CCG ATC GTT pABptrB-pncB ATGTTTCAACCAGAAAGAACAATAAC ATG ACA CAA TTC GCT TCT pABptrB5’KO-pnp ATCTTTCAACCAGAAAGAACAATAAC TTG CTT AAT CCG ATC GTT pABptrB5’KO-pncB ATCTTTCAACCAGAAAGAACAATAAC ATG ACA CAA TTC GCT TCT pABptrB-pncBdblmut ATGTTTCAACCAGAAAGAACAATAAC ATG ACG CAG TTC GCT TCT pABptrB5’KO-pncBdblmut ATCTTTCAACCAGAAAGAACAATAAC ATG ACG CAG TTC GCT TCT pAptrBCDSdblmut ATGTTTCAACCAGAAAGAACAATAAC ATG CTG CCG AAA GCC GCC pAptrB5’KO.CDSdblmut ATCTTTCAACCAGAAAGAACAATAAC ATG CTG CCG AAA GCC GCC pABptrB-tna.IN ATGTTTCAACCAGAAAGAACAATAAC ATG GTG GCG TTC TCT AAC pABptrB-aroL.IN ATGTTTCAACCAGAAAGAACAATAAC ATG ACG GTC GCG GAG ATC pABptrB5’KO-tna.IN ATCTTTCAACCAGAAAGAACAATAAC ATG GTG GCG TTC TCT AAC pABptrB5’KO-aroL.IN ATCTTTCAACCAGAAAGAACAATAAC ATG ACG GTC GCG GAG ATC 67 (5’-CTTTC-3’) to disrupt its complementarity with the 16S rRNA aSD sequence (Table 2-1, pAptrB.SDmut). This change significantly reduced CDS expression by approximately 60% (Figure 2-2A). However, an SD sequence mutation in E. coli would typically result in complete disruption of expression because it is the sole ribosome binding signal (de Boer et al., 1983; Hui and de Boer, 1987; Van Etten and Janssen, 1998), which does not appear to be the case for ptrB. Also, in the ptrB SD sequence mutant (pAptrB.SDmut, Table 2-1), a 30S subunit binding signal at the internal start codon is present in the primer extension inhibition assay (toeprint assay) (Figure 2-2B, lane 15), supporting the levels of expression observed (Figure 2-2A). These data suggest that ribosomes can still initiate translation, albeit at reduced efficiency, in the absence of the predicted ptrB SD sequence. Next, the SD sequence was strengthened such that its sequence had absolute complementarity to the 16S rRNA aSD sequence (i.e. 5’-GAAAG-3’ 5’-GGAGG-3’, pAptrB.SDstr, Table 2-1). The mutation increased the downstream expression levels by 2.5-fold (Figure 2-2A). To understand the relationship between the 5’-uAUG and the SD sequence, the 5’-uAUG was mutated to 5’-AUC in concert with the strengthened SD sequence (pAptrB.5’KO.SDstr, Table 2-1). An insignificant change in ptrB CDS expression was observed as a result of the 5’-uAUG mutation in the presence of the strengthened SD sequence (Figure 22A), demonstrating that the necessity for the 5’-uAUG to regulate ptrB CDS expression was relieved by strengthening the SD sequence. The loss of 5’-uAUG dependence was supported by the toeprint assay, which displayed a loss of 70S binding to the 5’-terminus due to the 5’-uAUG mutation, whereas the strong internal 30S binding signal was maintained (Figure 2-2C). Therefore, the SD sequence appears to be acting as a part of a mechanism completely separate from 5’-uAUG regulation. Because the 5’-uAUG and SD sequence appear to be acting through distinct regulatory mechanisms, we tested whether ptrB expression would be completely abolished by disrupting both. In the presence of both the 5’-uAUGAUC mutation and the SD sequence mutated to its complement (pAptrB.5’KOSDmut, Table 2-1), ptrB CDS expression was reduced by 45% (Figure 2-2A). Taken together, ptrB CDS expression is higher as a result of the mutations in 68 Figure 2-2. Role of SD sequence in ptrB regulation. (A) Expression from the ptrB CDS fused to lacZ in the presence of the various SD sequence mutations (see Table 2-1), shown as a percentage of ptrB wild-type (WT) CDS (pAptrB.WT) fused to lacZ (100%). * refers to a statistical significance compared to pAptrB.WT where p<0.001. (B) Primer extension inhibition (toeprint) reactions contain mRNA, 30S subunits or 70S ribosomes, and initiator tRNA as indicated by + or - symbols; reaction products are separated on denaturing polyacrylamide gels and visualized by autoradiography. Predicted positions of toeprint signals (+15) to the 5’ AUG (open arrow) and downstream AUG (closed arrow) are indicated. (C) Toeprint assay as previously described containing strengthened SD sequence mutations (see Table 2-1). Reaction components are indicated. 69 70 71 tandem than it is in the presence of the mutations separately (Figure 2-2A). This finding was supported by the ribosome binding data as well, exhibiting a stronger band corresponding to 30S subunit binding to the CDS start codon in the presence of the tandem mutations, compared to each single mutation (Figure 2-2B, lane 21 compared to lanes 9 and 15). Using secondary structure prediction software (Zuker, 2003), both singular mutations (pAptrB.5’KO and pAptrB.SDmut, Table 2-1) were predicted maintain the wild-type structure, including a hairpin loop within the 5’-UTR (Figure 2-3A). However, the double mutation (pAptrB.5’KO.SDmut, Table 2-1) caused a loss of the secondary structure, resulting in an open conformation with no steric hindrance to prevent ribosome binding (Figure 2-3B). The complete loss of secondary structure in the double mutant could allow for SD sequence-independent initiation, because the absence of secondary structure is necessary and sufficient to initiate SD sequence-independent translation (Scharff et al., 2013). To examine the importance of secondary structure in ptrB regulation, a construct was created to mutate one side of the aforementioned hairpin loop in the 5’-UTR to disrupt the predicted secondary structure (pAptrB.structmut, Table 2-1). Secondly, a compensatory mutant was constructed to reconstitute the secondary structure by mutating the second side of the hairpin loop (pAptrB.structmutcomp, Table 2-1). If structure were the primary means of ptrB regulation, the compensatory mutant would be expected to restore wild-type expression levels. The mutant that disrupted the secondary structure, resulting in an open conformation, caused a 1.5-fold increase in ptrB CDS expression (Figure 2-3C), following the trend suggested by the previous expression data (Figure 2-2A). In contrast, the compensatory mutant caused a significant reduction in CDS expression of over 80% (Figure 2-3C). The same hairpin structure is present in both the wild-type mRNA and the pAptrB.structmutcomp mRNA suggesting that the reduction in expression seem in the pAptrB.structmutcomp mRNA is a result of the changes in the primary sequences rather than secondary structure. Therefore, although secondary structure may play a role in regulation, the sequence features are more important in influencing CDS expression. Ribosome binding signals present within the ptrB 5’-UTR The occurrence of ptrB CDS expression in the absence of both the 5’-uAUG and the SD sequence suggests there may be other sequence features within the 5’-UTR that act as ribosome 72 Figure 2-3. ptrB mRNA structure may play a secondary role in expression. (A) Predicted secondary structure of ptrB WT mRNA using computational modeling (Zuker et al., 2003). The predicted SD sequence is underlined and the CDS initiation codon is boxed. (B) Predicted secondary structure of ptrB mRNA with 5’-uAUG mutated to AUC and SD sequence mutated to its complement (pAptrB.5’KO.SDmut, Table 2-1) using computational modeling (Zuker et al., 2003). The mutated SD sequence is underlined and the CDS initiation codon is boxed (C) Expression from the ptrB CDS fused to lacZ and in the presence of structural mutations (see Table 2-1) shown as a percentage of ptrB WT CDS (pAptrB.WT) fused to lacZ (100%). * refers to a statistical significance compared to pAptrB.WT where p<0.05. 73 74 Figure 2-4. Scanning mutagenesis of ptrB 5’-UTR. (A) Schematic of regions within the 5’UTR mutated to its complementary DNA sequence, corresponding to lines 1-7 beneath the ptrB sequence. (B) Expression from the ptrB CDS fused to lacZ in the presence of the various 5’-UTR mutations (1-7), shown as a percentage of ptrB WT CDS (pAptrB.WT) fused to lacZ (100%). * refers to a statistical significance compared to pAptrB.WT where p<0.05. (C) Toeprint assay comparing pAptrB.WT mRNA to pAptrB.scan2 and pAptrB.scan6 mRNA performed as previously described (see Figure 2-2). Reaction components are indicated. Predicted positions of toeprint signals (+15) to the 5’-uAUG (open arrow) and downstream AUG (closed arrow) are indicated. 75 76 77 binding signals. To test whether there are other regions of importance within the 5’-UTR, scanning mutagenesis was conducted, spanning the length of the ptrB 5’-UTR with overlapping mutations (Figure 2-4A). The mutations changed the wild-type sequence to its complement, except in the case of the SD region, in which the nucleotides were all mutated to cytosines. The majority of the mutations significantly disrupted CDS expression, some more drastically than others (Figure 2-4B). Mutations of the regions closer to the 5’-terminus caused a more severe decrease in CDS expression, corresponding with the region of ribosomal coverage when bound to mRNA, overlapping to approximately position +19 (Huttenhofer and Noller, 1994). A toeprint assay was also conducted to analyze the ribosome binding pattern in two of the scanning mutant constructs (pAptrB.scan2 and pAptrB.scan6, Table 2-1). In both mutants, the 70S binding to the 5’-terminus and the internal 30S binding was greatly reduced compared to the ptrB wildtype even in the presence of an intact 5’-uAUG (Figure 2-4C). These data suggest that there are signals within the 5’-UTR that may help to stabilize the ribosome once bound to the mRNA’s 5’terminus. Spacing optimized between the 5’-uAUG and downstream RBS The spacing between primary sequence elements in the mRNA also contributes to efficient expression. The importance of spacing is seen with regards to the SD sequence, which must be spaced an appropriate distance from the initiation codon to place the initiation codon in the P-site when the SD sequence is properly aligned with the 16S rRNA aSD sequence (Ringquist et al., 1992). To determine if the spacing between the 5’-uAUG and the CDS initiation codon is similarly important, five and nine nucleotides within the 5’-UTR, shown to be nonessential to CDS expression, were deleted to shorten the 5’-UTR (pAptrB.5ntdel, pAptrB.9ntdel, Table 2-1) and therefore reduce the distance between the 5’-uAUG and the downstream RBS. The predicted SD sequence, also five nucleotides in length, was deleted (pAptrB.SDdel, Table 2-1) to determine if a deletion of the SD sequence would produce results similar to the complementary mutation, pAptrB.SDmut (Figure 2-2). Deletion of five nucleotides or deletion of the SD sequence each significantly reduced CDS expression by 98%, whereas deletion of nine nucleotides abolished CDS expression completely (Figure 2-5A). This suggests that the 5’-UTR cannot be shortened without a significant reduction in downstream expression 78 Figure 2-5. 5’-UTR mutations and deletions to affect spacing between 5’-uAUG and downstream RBS. (A). Expression from the ptrB CDS fusion to lacZ in the presence of the deletion or insertion mutants (see Table 2-1) to either reduce or increase the length of the 5’UTR expressed as a relative percent of ptrB wild-type CDS expression (100%). * above the error bars refer to a statistical significance compared to pAptrB.WT where p<0.05 and ** where p<0.001. A horizontal line and ** above two bars denotes to a statistical comparison between those two constructs where p<0.001. (B) Toeprint assay containing mRNA corresponding to either the pAptrB.WT or pAptrB.21ntadd constructs performed as previously described (see Figure 2-2). Reaction components are indicated. Double plus sign (++) refers to component which was added first for pre-binding. Predicted positions of toeprint signals (+15) to the 5’uAUG of pAptrB.WT (open arrow), the 5’-uAUG of pAptrB.21add (shaded arrow) and the downstream AUG (closed arrow) are indicated. 79 80 and the positioning of the sequence elements relative to one another may be influential for CDS expression. Due to the short nature of the ptrB 5’-UTR, there is not enough space to allow for simultaneous binding of both a 70S ribosome at the 5’-terminus and a 30S subunit at the internal RBS. Therefore, the 5’-UTR was lengthened, which would presumably allow for dual ribosome binding. Nine, twenty-one and thirty-three nucleotides of intervening sequence were then added to the 5’-UTR, immediately following the 5’-uAUG (pAptrB.9ntadd, pAptrB.21ntadd, pAptrB.33ntadd, Table 2-1). The addition of nine nucleotides had no impact on downstream expression however, as the region between the 5’-uAUG and the CDS initiation codon was extended, CDS expression was reduced in a linear fashion (Figure 2-5A). To understand the relationship between ribosome binding now possible on the elongated ptrB 5’-UTR, a toeprint assay was performed with the ptrB mRNA containing the 21-nucleotide insertion (pAptrB.21ntadd). 30S subunits and 70S ribosomes were added separately and simultaneously to compare the differences seen as a result of ribosomal interactions (Figure 25B). When added separately (Figure 2-5B, lanes 7-12), no significant difference was observed in the ribosomal binding pattern when compared to ptrB wild-type mRNA (Figure 2-5B, lanes 1-6). However, when added simultaneously (Figure 2-5B, lane 13), internal binding was observed with no discernable binding at the 5’-terminus. The 30S subunits were then pre-bound to the mRNA for 15 minutes with subsequent addition of the 70S ribosomes and vice versa. When the 30S subunits were pre-bound (Figure 2-5B, lanes 14-16), binding was observed at both the internal RBS and the 5’-terminus. Binding at both RBSs was also seen when the 70S ribosomes were pre-bound (Figure 2-5B, lanes 17-19), however 70S ribosome pre-binding resulted in a more pronounced band at the 5’-terminus when compared to the internal RBS. Overall, the toeprint assay shows that there are differences that occur depending on the order of ribosomal addition to the system, suggesting that the 70S ribosome and 30S subunits may be interacting or competing for binding regions, which could contribute to CDS expression levels. Expression of the uORF negatively impacts downstream CDS expression 81 The ptrB uORF is not efficiently translated despite strong 70S ribosome binding to the 5’-uAUG (Figure 2-2), leading to the prediction that expression from the 5’-uAUG is repressed to allow for CDS translation. Therefore, we introduced randomized sequence mutations across the 5’-UTR to test whether the ptrB 5’-uORF translational repression could be relieved. A tandem mutation at positions +6 and +8 (with the A of the 5’-uAUG at +1) from TCA to GCG was identified (pAptrB.6GCG, Table 2-1) that caused a 16-fold increase in 5’-uORF expression (Table 2-2). Conversely, in the presence of the GCG mutation (pAptrB.6GCG), expression from the CDS was reduced to 30% of wild-type expression (Table 2-2). Individual mutations were also made at each position (pAptrB.6GCA and pAptrB.6TCG, Table 2-1) but neither produced as drastic results as seen by the mutations together (Table 2-2). Although relieving the apparent repression of the uORF translation reduced CDS expression, other sequence mutations within the 5’-UTR that did not allow for efficient uORF translation also reduced CDS expression (Table 22). Therefore, inhibition of uORF may indirectly influence downstream regulation, while specific sequence features within the 5’-UTR have significant effects on CDS translational efficiency. The mutation at position +8 is predicted to cause a change in amino acid sequence in the event of uORF translation, with glutamine (CAA) being changed to an arginine (CGA) at the third position. Because the CGA codon is rare in E. coli, we tested whether the amino acid sequence influenced expression levels, by mutating the third codon of the uORF to either CGC (pAptrB.7CGC, Table 2-1), a non-rare arginine codon, or AGA (pAptrB.7AGA, Table 2-1), another rare arginine codon. The CGC mutation led to a 22% reduction in uORF expression and the AGA mutation resulted in a nearly 4-fold increase in uORF expression (Table 2-2). Conversely, the CGC mutation resulted in an 83% reduction in CDS expression, whereas the AGA mutation resulted in a 25% reduction in CDS expression (Table 2-2). These data indicate that the changes in both uORF expression and downstream CDS expression are not a result of the amino acid changes within the uORF or tRNA availability, but rather the nucleotide sequence changes, highlighting important sequence features within the 5’-UTR. Heterologous regulation by the 5’-UTR of ptrB There appear to be multiple regions of importance contained within the ptrB 5’-UTR that influence downstream CDS expression. To determine if the ptrB 5’-UTR could similarly influence other genes’ CDSs, gene fusions were constructed linking the ptrB 5’-UTR to two 82 Table 2-2. ptrB mutations expressed as a percentage of either the 5’-uAUG in-frame leader fusion with lacZ (100%) (column 2) or the CDS fused with lacZ (100%) (column 3). 83 Construct Relative percentage 5-uAUG Leader frame Relative percentage CDS frame pAptrB.6GCA 107 ± 18 50 ± 3 pAptrB.6TCG 176 ± 40 40 ± 4 pAptrB.6GCG 1640 ± 30 30 ± 10 pAptrB.7CGC 80 ± 9 17 ± 5 pAptrB.7AGA 379 ± 32 74 ± 13 84 Figure 2-6. Heterologous regulation by the 5’UTR of ptrB. (A) LacZ activity expressed from ptrB 5’UTR (AUG or AUG AUC KO) fusions to pnp or pncB coding sequence fused to lacZ (Table 2-1). * refers to a statistical significance where p<0.001. (B) Toeprint reactions (as described in Figure 2) with mRNA containing the ptrB 5’UTR (5’-uAUG or AUG AUC KO) fused to the pnp coding sequence. Reaction components are indicated. Predicted positions of toeprint signals (+15) to the 5’-uAUG (open arrow) and downstream CDS AUG (closed arrow) are indicated. 85 86 other genes’ ORFs, pncB and pnp (pABptrB-pnp and pABptrB-pncB, Table 2-1). To examine the 5’-uAUG regulatory effect on the other genes, the ptrB 5’-uAUG was again mutated to AUC to disrupt ribosome binding and 5’-uORF translation. The change caused the expression of both pncB and pnp CDSs to be significantly reduced by more than 90% (Figure 2-6A), following the trend that was seen previously for the ptrB CDS. The ribosome binding pattern of the ptrB-pnp fusion supported the expression data, with the 5’-uAUGAUC mutation resulting in the loss of binding signals at both the 5’-terminus and the internal start codon (Figure 2-6B). Therefore, the ptrB 5’-UTR regulatory signals can exert their effects on other E. coli genes when transferred upstream of their CDS. Initiation signals present within a bona fide CDS Because the ptrB 5’-UTR conferred positive regulation for downstream ORFs, we sought to determine if the addition of the ptrB 5’-UTR would allow for translation initiation from an internal fragment of RNA. Presumably, an internal fragment would be devoid of any translation initiation signals, but would still have the ability to be translated due to its coding potential. Internal fragments of two E. coli genes, tna and aroL, were used. An internal in-frame methionine, 381 nucleotides and 126 nucleotides from the native tna and aroL initiation codons, respectively, was selected to act as an initiation codon. Each truncated ORF was then fused to the ptrB 5’-UTR (pABptrB-tna.IN and pABptrB-aroL.IN, Table 2-1). This fusion would allow the ptrB 5’-UTR to regulate expression of the downstream ORF to produce a truncated version of the Tna or AroL proteins, respectively. The ptrB 5’-UTR was not sufficient for efficient expression of the internal tna or aroL ORFs (Figure 2-7). In the case of the aroL fusion, expression was just above levels of detection Figure 2-7). The tna fusion produced low levels of expression but interestingly, when the ptrB 5’-uAUG was mutated to AUC, it caused a drastic reduction in expression of approximately 90%, similar to that seen with the bona fide coding sequences of ptrB, pncB and pnp (Figure 22A, Figure 2-6A, and Figure 2-7). It would appear that the lack of translation initiation signals present on the internal fragments of RNA prevent efficient translation. Taken together, there are signals within translation initiation regions of bona fide coding sequences that are important for translation, and the ptrB 5’-UTR is not able to confer efficient expression independently and must instead work in concert with signals within the CDS to identify a translation initiation 87 Figure 2-7. ptrB 5’-UTR insufficient to stimulate expression of internal RNA fragments. Expression levels of internal RNA fragments (aroL.IN/tna.IN) fused to lacZ with or without ptrB 5’-uAUG mutation (see Table 2-1). All values compared to ptrB CDS WT (pAptrB.WT) fused to lacZ (100%). * refers to a statistical significance compared to pAptrB.WT where p<0.001. A horizontal line above two bars denotes a statistical significance between those two constructs where p<0.001. 88 89 region. However, the 5’-uAUG still acts as an important signal for even low levels of translation, supporting its regulatory role. To further examine the role of CDS regions, we utilized pncB, which relies on two nucleotides within its CDS for expression (Brock et al., 2007). pncB has a weak SD sequence, does not possess a 5’-uAUG, and is not influenced by its native 5’-UTR for expression. Instead, two nucleotides, both adenines, at positions +6 and +9 of the CDS, appear to influence translation. When these two adenines are mutated to guanines, pncB expression is completely abolished (Brock et al., 2007). A construct was made replacing the pncB 5’-UTR with the ptrB 5’-UTR (pABptrB-pncB, Table 1) to determine if the positive effects of the ptrB 5’-UTR could overcome the negative impact of the pncB CDS mutations. In the presence of the ptrB 5’-UTR, the CDS mutations (pABptrB-pncBdblmut, Table 2-1) caused pncB expression to be significantly reduced by approximately 80% (Figure 2-8A), rather than completely abolished, suggesting that the signals within the ptrB 5’-UTR partially rescue the negative impact of the pncB CDS mutations on expression. The ptrB 5’-uAUG to AUC mutation (pABptrB5’KOpncB, Table 2-1) resulted in an even more dramatic reduction in pncB CDS expression compared to the pncB CDS mutations (pABptrB-pncBdblmut, Table 2-1) in the presence of the ptrB 5’UTR. In this case, expression was reduced by approximately 88% (Figure 2-8A), which follows the trends previously seen with ptrB 5’-uAUG mutation (Figure 2-2A). These data indicate that, in this background, the 5’-uAUG has more of an impact on pncB CDS expression than the influential regions within the pncB CDS itself. Finally, when the ptrB 5’-uAUG and the two nucleotides within the pncB CDS were mutated in tandem (pABptrB5’KO-pncBdblmut, Table 21), pncB CDS expression was again abolished (Figure 2-8A), supporting the notion that the 5’uAUG and the two nucleotides within the CDS are the signals that control pncB CDS expression in this context. Interestingly, similar to the pncB CDS, the ptrB CDS also has adenines at position +6 and +9 (relative to the A of the CDS start codon at +1). When both adenines were mutated to guanines in ptrB (pAptrB.CDSdblmut, Table 2-1), CDS expression was also reduced by approximately 70% (Figure 2-8B). In addition, when the 5’-uAUG and the two nucleotides within the ptrB CDS are mutated in tandem (pAptrB5’KO.CDSdblmut, Table 2-1), expression 90 Figure 2-8. Expression regulated by both upstream and downstream signals. (A) Expression values represented as a percent of the ptrB 5’-UTR fused to the pncB CDS with the pncB CDS in-frame with lacZ (100%). Comparison of constructs with mutated upstream and downstream signals (Table 1). (B) Expression values represented as a percent of that of the ptrB WT CDS inframe with lacZ (100%). Comparison of constructs with same mutated upstream and downstream signals as in (A) (also see Table 1). * refers to a statistical significance compared to pABptrBpncB or pAptrB.WT respectively where p<0.001. 91 A B 92 was abolished (Figure 2-8B). These results are concordant with the expression data derived from similar mutations in the ptrB-pncB construct (Figure 2-8A). Taken together, these data suggest that specific upstream and downstream sequence elements are responsible for CDS expression in this novel regulation mechanism to compensate for the weak SD sequence. 93 Discussion Our results suggest that translation initiation of E. coli ptrB mRNA is controlled via its 5’-terminal AUG, which represents a novel form of regulation that can act as an autonomous regulatory element independent of gene context. We present two models that might explain the mechanism of ptrB regulation. The models differ in the stability of the initial interaction between the mRNA and the ribosome. In the first model, the 5’-uAUG acts as a recognition signal to attract ribosomes through transient, unstable binding at the 5’-terminus. The transient binding increases the local concentration of ribosomes to allow for more efficient binding to the downstream ORF RBS resulting in ternary complex formation and increased CDS initiation. In the second model, the 5’-uAUG acts as both a recognition signal and stable binding platform. Ribosomes can stably bind the 5’-terminus of the mRNA through their inherent ability to bind 5’-AUGs, i.e., leaderless mRNAs. Additional sequences in the ptrB 5’-UTR then help stabilize the ribosome at the 5’-terminus to prevent the ribosome from dissociating. The ribosome can then transition down the mRNA until it reaches the CDS initiation codon. In each model, the CDS also plays a role, likely supporting the proper positioning of the ribosome on the CDS initiation codon, thereby defining the reading frame. In both models, the ptrB mRNA contains upstream and downstream binding signals to compensate for its weak SD sequence which function cooperatively in a novel regulatory mechanism. The models only differ as to whether the ribosome dissociates from the 5’-terminus of the mRNA or remains associated to the mRNA. Both models agree with the previously described “cumulative specificity initiation mechanism” of Nakamoto (2011), in which multiple preferred upstream and downstream bases at specific positions work cumulatively, but independent of each other, to regulate translation initiation, thereby making no singular interaction completely essential. In both models presented, the 5’-uAUG is required, but as a binding signal rather than acting as an initiation codon. We have shown the inefficiency of uORF translation to the extent in which translation appears to be repressed to prevent elongation and instead directs ribosomes to the downstream RBS. This repression allows ptrB to take advantage of a sequence, the 5’terminal AUG, that strongly binds ribosomes without the added energetic consequence of uORF translation. It is still unclear exactly how translation is repressed but poor coding potential appears to be involved. Although the uORF is not efficiently translated, the 5’-uAUG still plays a major role in ribosome binding to the ptrB mRNA to influence expression of the downstream 94 CDS. It is evident that the regulation is acting via the 5’-uAUG, rather than another trans-acting element, because the effect of the 5’-uAUG mutation is also seen in the closed system of the toeprint assay. Therefore, we can conclude that the 5’-uAUG is interacting with either the initiator tRNA or the ribosomes rather than a trans-acting element such as a RNA-binding protein or small RNA. To corroborate this hypothesis, we also measured ptrB expression levels in an Hfq deletion strain (Baba et al., 2006) and saw no change in expression (data not shown), greatly reducing the possibility of the involvement of a small RNA interaction. If the 5’-uAUG acts as the sole recognition signal rather than the SD sequence, the SD sequence could be mutated without loss of expression. However, mutation of the SD sequence significantly decreased CDS expression, indicating that the SD sequence may be used as a signal to properly position the start codon in the ribosomal P-site. Thus, the ribosome can bind to the mRNA but cannot be efficiently directed to the proper start codon in the absence of an intact SD sequence, negatively impacting expression. The SD sequence appears to be too weak to effectively act as both the binding and positioning signal, causing the 5’-uAUG to adopt one of these roles. However, when the SD sequence is strengthened, it can again take on both roles, making the 5’-uAUG obsolete (Figure 2-2). The 5’-uAUG might be necessary to compensate for secondary structure within the 5’UTR that negatively impacts ribosome binding, because a local single stranded region is needed for efficient ribosome binding (Draper, 1987; Gualerzi and Pon, 1990; de Smit and van Duin, 1990; Draper et al., 1998). Single-stranded regions upstream of the RBS have been shown to act as “standby sites” to allow for temporary ribosome binding until the inhibitory secondary structure opens (de Smit and van Duin, 1994; Studer and Joseph, 2006; Unoson and Wagner, 2007). It has been suggested that the ribosome can then transition downstream in a scanning-like movement to reach the start codon (Petersen et al., 1978; Adhin and van Duin, 1990). However, it appears that, in the case of ptrB, the 5’-UTR must be of a certain length to accommodate ribosome binding and downward progression (Figure 2-5A). Examples of such energyindependent ribosome movement have also been demonstrated during translational reinitiation (Schmidt et al., 1987; Berkhout et al., 1987; Adhin and van Duin, 1990; Osterman et al., 2013; Shell et al., 2015, Yamamoto et al., 2015), translation of the MazF-regulon in E. coli (Sauert et al., 2016), and in chloroplasts (Hirose and Sugiura, 2004). Therefore, the 5’-uAUG may act as a 95 standby-like site for initial ribosomal loading, after which the ribosome can then slide/scan down the mRNA to the RBS when the structure relaxes momentarily. However, increasing the distance that the ribosome must travel leads to a decrease in CDS initiation efficiency (Figure 25A). The same effect occurs in translation reinitiation in which longer intercistronic gaps lead to reduced downstream initiation in operons due to ribosome drop-off (Cone and Steege, 1985). Although it is unclear whether the 30S subunit or 70S ribosome is responsible for ptrB regulation, our data raise the possibility that the 30S subunit may be initially loaded onto the 5’terminus and occupy the upstream region of the mRNA. When 30S subunits are pre-bound or added simultaneously with 70S ribosomes, they appear to block the ribosome binding of the 70S ribosome at the 5’-terminus (Figure 2-5B). However, the 30S subunit binding is known to be unstable and therefore unlikely to block the action of the reverse transcriptase (Brock et al., 2008), which can explain why we are unable to capture the 30S subunit binding at the 5’terminus in the toeprint assay (Figure 2-5B). This idea also explains why there is reduced 30S subunit binding internally when the 70S ribosome is pre-bound (Figure 2-5B), because the 70S ribosome may be blocking the 30S subunit entry site at the 5’-terminus. Although 70S ribosomes are thought to primarily bind at the 5’-terminus, Brock et al. (2008) demonstrated that the ribosomal proteins implicated in 5’-terminus binding are all 30S subunit proteins, suggesting that the 30S subunit could also bind the 5’-terminus, and may therefore be implicated in ptrB regulation. It is also possible, due to the short nature of the ptrB 5’-UTR, that binding of the ribosome at the 5’-terminus can resolve secondary structure at the RBS. It is unclear how the 70S ribosome is loaded on to the 5’-terminus of leaderless mRNA; however, it is thought that the 5’end is pulled through the channel between the 30S and 50S subunits (Moll et al., 2002). The 5’AUG then finds its way through the tunnel until it is positioned in the P-site through interactions with the initiator tRNA (Moll et al., 2004). The tunnel is only, on average, 15 Å in diameter (Nissen et al., 2000; Takyar et al., 2005), so it cannot accommodate secondary structure. Therefore, the structure must be eliminated as the mRNA is fed through the tunnel such that the region of mRNA protected by the ribosome, from the 5’-terminus to approximately position +19 (Huttenhofer et al., 1994), becomes single-stranded. Because the ptrB 5’-UTR is only 26 96 nucleotides long, the loaded ribosome may cover the majority of the 5’-UTR, thereby eliminating secondary structure. Regardless of the model, this 5’-uAUG regulation mechanism has the ability not only to influence ptrB CDS expression, but also to regulate other E. coli CDS. However, the inefficient use of internal RNA fragments (Figure 2-7) suggests that signals within a genuine CDS must nonetheless be present to direct the ribosome downstream for start site selection. Mutational studies indicate that adenines at certain positions play a large part in the signal important for CDS expression. Statistical analyses have shown an abundance of CA and A-rich downstream sequences (Scherer et al., 1980; Rudd et al., 1992) with a large bias in codons at positions +2, +3, and +4 (Sato et al., 2001) supporting this notion. Adenine-rich regions enhance expression through increasing the rate and amount of ternary complex formation during initiation (Brock et al., 2007). It is thought that the increased efficiency is due to ribosomal protein S1’s high affinity for polypyrimidines (Draper and von Hippel, 1978; Subramanian, 1983) and S1’s ability to bind different base motifs (Tzareva et al., 1994). Similarly, our results demonstrate a decrease in expression of both pncB and ptrB as a result of a decrease in adenine richness (Figure 2-8). A similar effect was also seen when analyzing the uORF expression levels via mutational analysis. The mutations that introduce adenines increase uORF expression by relieving the apparent uORF repression (Table 2-2), presumably by improving the codon usage and potential ribosome interactions. Overall, these data suggest a novel mechanism of translational regulation, functioning in a distinct, yet cooperative mechanism from the canonical SD sequence mechanism, and instead highlight the 5’-uAUG. Our previous study demonstrated the abundance of 5’-uAUGs within E. coli transcripts (Beck et al., 2016); therefore, it is possible that this mechanism of regulation is widespread in E. coli and may contribute to regulation of mRNA with weak or absent SD sequences. Understanding mechanisms of non-SD sequence translation is essential because approximately one half of prokaryotic mRNAs do not contain a SD sequence (Chang et al., 2006). There are numerous examples of translation events occurring in the absence of an SD sequence (Boni et al., 2001; Nakamoto, 2006 and references therein); leaderless mRNAs efficiently initiate translation without a 5’-UTR altogether. There is also evidence that suggests that, in certain instances, the SD sequence is important only if the initiation codon is not an AUG 97 or the RBS is masked by secondary structure (Munson et al., 1984; de Smit and van Duin, 1994; Fargo et al., 1998), and the aforementioned S1-mediated initiation acts independently of the SD sequence (Boni et al., 2001; Osterman et al., 2013). Clearly, there are more diverse mechanisms for translation initiation than the conventional SD sequence-regulated mechanism. Others agree that alternatives to the SD sequence-dependent mechanism are much more prevalent in bacteria than anticipated (Hering et al., 2009; Malys and McCarthy, 2011). It is important to understand the different mechanisms of translational regulation because this information can be applied to synthetic biology and genetic engineering to finely tune protein production. Other studies exploring optimization of recombinant gene expression have discovered upstream and downstream sequence elements that work cooperatively to influence expression (Berg et al., 2012). However, the elements identified work in a gene-dependent manner (Berg et al., 2012). The sequence elements identified in this study, however, appear to function in a geneindependent manner, suggesting that they may be more easily adapted to control gene expression. 98 Appendix A Leaderless mRNAs initiate translation via a mechanism of ribosome binding distinct from canonical leadered initiation. Rather than the multi-step process of a 30S subunit binding followed by a 50S subunit binding to form the 70S initiation complex, the intact 70S ribosome binds directly to the 5’-terminus of leaderless mRNA (Balakin et al., 1992; Udagawa et al., 2004; Moll et al., 2004). While the process of leadered mRNA recognition and binding by the 30S subunit has been elucidated, it is currently unknown how leaderless mRNAs undergo these initial steps involving the 70S ribosome. In Escherichia coli, the ribosomal proteins (r-proteins) that appear to be involved in leaderless mRNA loading were discovered using 4-thiouridine (4SU) cross-linking studies with intact ribosomal subunits in either the presence or absence of initiator tRNA and initiation factors (IFs) (Brock et al., 2008). The r-proteins found to cross-link leaderless mRNA were all part of the 30S subunit and include S1, S3, S4, S7, and S10/S18. Since the r-proteins involved in the initial mRNA: ribosome interaction have been identified, it is now of interest to determine the specific cross-link positions on each purified rprotein (Giliberti, dissertation) to map the path of the leaderless mRNA when loaded into the 70S ribosome. The objective of this study is to develop a method to identify specific cross-linked positions between a RNA molecule and a protein that can be utilized to determine how leaderless mRNAs interact with the 70S ribosome. Similar to previous reports (Brock et al., 2008), a 4S-U at the +2 position (the U of the 5’AUG) of the leaderless mRNA cI from bacteriophage λ was used as a substrate for photochemical cross-linking. 4S-U is used as a cross-linker due to its ability to covalently crosslink to nucleotides or amino acids, preferentially lysines, that are within 4 Å after UV excitation (Favre, 1990). Although the initiator tRNA and IFs have been implicated in contributing to efficient translation initiation of leaderless mRNAs, the cross-links to the ribosomal subunits were unchanged in the absence of the initiator tRNA and IFs (Brock et al., 2008). This suggests that their influence on translation efficiency occurs after the initial ribosome interaction and were therefore not considered in this study. For initial method development, r-protein S1 from E. coli was used due to its size, ease of purification and abundance of previously identified cross-links (Brock et al., 2008). S1 also 99 contains four known RNA binding domains (R1-R4) located at the C-terminus and two domains at the N-terminus involved in ribosome binding (Figure A-1) (Subramanian, 1983; McGinness and Sauer, 2004). It is thought that domains R1-R3 are involved in mRNA binding whereas R4 is involved in the autogenous regulation of S1 expression (McGinness and Sauer, 2004). For cross-linking, 1 nmol of purified His-tagged S1 (Giliberti, dissertation) was incubated with 2 nmol of a 20-nucleotide RNA oligomer. The oligonucleotide sequence was 5’-A [4S-U] GAGCACAAAAAAGAAACC-3’, corresponding to the first 20 nucleotides of the bacteriophage cI leaderless mRNA. The cross-linking was performed for 20 minutes at 37oC in 5X sample buffer (300 mM NH4Cl, 50 mM Tris-acetate pH 7.6, 50 mM Mg(OAc)2, 3 mM βmercaptoethanol). The reaction mixture was then distributed in 20-µL spots onto a Petri dish resting on ice to reduce temperature changes as a result of UV light exposure. The pre-warmed hand held UV light was then placed approximately 0.5 inches above the samples and the reactions were exposed for 15 minutes to UV light (366 nm). Reactions were then pooled and purified to remove non-cross-linked RNA from the S1 protein using metal ion affinity chromatography (PrepEase Histidine-tagged Protein Purification Midi Kit, Affymetrix). The column was first equilibrated with wash buffer (10 mM Tris, 0.3 M NaCl) prior to sample loading. After the sample was loaded onto the column, it was then washed three times with the wash buffer and eluted three times using the elution buffer (10 mM Tris, 0.3 M NaCl, 250 mM imidazole). Fractions were collected and analyzed by SDS-PAGE and Coomassie staining (Figure A-2). The gel indicates that the process of cross-linking is not 100% efficient because a band representing uncross-linked S1 remains. However, the shifted band represents purified S1 protein that has been cross-linked with the cI RNA oligonucleotide (Figure A-2). Fractions containing S1 were pooled for further analysis. After purification, the samples were trypsindigested, treated with nuclease P1 and subjected to inductively coupled plasma mass spectrometry (ICP-MS) and electrospray ionization mass spectrometry (ESI-MS), both coupled with on-line high-performance liquid chromatography (HPLC) for identification of cross-linked amino acid residues (Giliberti, dissertation). 100 Figure A-1. Schematic of S1 domains and their amino acid positions in the S1 r-protein. The gray boxes are the two N-terminal domains and the black boxes are the four C-terminal RNA-binding domains (R1-R4). Adapted from McGinness and Sauer, 2004. 101 102 Figure A-2. Cross-linking of 4S-U cI RNA to purified S1 protein. Purified His-tagged S1 (lane 2) was cross-linked to 4S-U cI RNA oligonucleotide and bound to PrepEase His-tag purification column. Lane 3 represents the fraction eluted off the column using imidazole. Lane 4 represents the fraction collected after washing the column prior to the elution step. Lane 1 contains the molecular weight marker and sizes of each band are indicated. 103 104 Several cross-links were detected reproducibly in multiple biological replicates and at least five technical replicates suggesting these are structured interactions between the RNA fragment and r-protein S1. These cross-links were identified at lysine residues K196 (domain R1), K260 (domain R1), K272 (domain R1/R2), K450 (domain R3/R4), K457 (domain R4) and K464 (domain R4) (Figure A-1, Figure A-3). Spatially, the cross-links appear to lie in two distinct regions and within those regions, the cross-links all lie on the same face of the protein (Figure A-3). The cross-links all also lie within mRNA binding domains, with no cross-links occurring within N-terminal domains known to play a role in association with the 30S ribosomal subunit. Unfortunately, high resolution X-ray crystal structures of the ribosome are intentionally depleted of S1 due to its high flexibility (Schuwirth et al., 2005). Therefore, it is difficult to relate, with any certainty, our identified cross-links to the ribosome crystal structure. S1 was expected to allow for mRNA: protein cross-links due to its multiple RNA binding domains (Subramanian, 1983). The mRNA binding domains R1-R4 in the C-terminus have been predicted to extend at least 10 nm from the 30S subunit into the cytoplasmic space to interact with mRNA (Walleczek, et al., 1990). A flexible hinge region has been identified that could allow the C-terminus to scan the area surrounding the ribosome for mRNA and then contract to position the mRNA close to the mRNA track (Byrgazov et al., 2015) supporting the prediction that S1 is the first r-protein to interact with leadered mRNA (Komarova et al., 2002). S1 also interacts with polyuracil regions of the 5’-untranslated region to facilitate 30S binding to mRNAs with weak Shine-Dalgarno sequences (Boni et al., 2001; Simonetti et al., 2009). S1 is highly involved in leadered mRNA binding (Suryanarayana and Subramanian, 1983; Boni et al., 1990), and thought to be required for most, if not all, mRNA translation (Sørensen et al., 1998). Although it has been suggested that leaderless mRNA initiation can occur in the absence of S1 in vitro (Tedin et al., 1997; Krishnan, 2010), these results suggest that S1 might play an influential role in ribosome binding to cI leaderless mRNA due to the reproducibility and non-random distribution of cross-links. These results suggest that a method has been successfully developed to identify crosslinks between mRNA and r-proteins, generating reproducible results with biological relevance. Further use of this method to determine the interactions between leaderless mRNA and r-proteins 105 Figure A-3. Cross-linked positions on S1 ribbon structure. Crystal structure of ribosomal protein S1 in green with cI 20mer oligonucleotide highlighted in red depicting potential crosslinked sites shown from two different views. 106 107 is important because the r-proteins have been implicated to play a role in progression of the start codon to the P-site. Cross-linking results produced by Brock et al. (2008) indicated that the presence of IFs and tRNA does not change the cross-link pattern between the ribosomal subunits and the leaderless mRNA. This result differs from previous leadered mRNA cross-linking studies performed by La Teana et al. (1995), suggesting that rather than IFs and tRNA causing the progression of the start codon to the P-site, the r-proteins may instead be playing this role in leaderless mRNA, reinforcing the differences between the two mechanisms of initiation. It is now of interest utilize this method to determine the cross-link positions for the other previously identified cross-linked r-proteins (S3, S4, S7, and S10/S18) since they may play larger roles in leaderless mRNA binding and progression to the P-site. Using this newly developed method, researchers can determine the numerous roles the r-proteins may be playing in translation initiation through their interactions with leaderless mRNAs. 108 Concluding Remarks The process of translation initiation in bacteria is typically simplified to include a single factor, the SD sequence, which governs efficiency and is used to define open reading frames. However, advancements in bioinformatics and transcriptomics have uncovered novel types of RNAs and regulation mechanisms revealing the abundance of non-canonical mRNAs in prokaryotic organisms. Through in silico analysis, this study reinforces the abundance of noncanonical mRNAs in Escherichia coli by identifying previously unannotated open reading frames (ORFs) at the 5’-terminus of canonical Shine-Dalgarno (SD) sequence-led mRNAs that are translated at biologically relevant levels. In these messages, an AUG triplet at the 5’-terminus (5’-uAUG) acts as an initiation codon and is bound by 70S ribosomes, similar to the 70S initiation mechanism used by leaderless mRNA (Balakin et al., 1992; Udagawa et al., 2004; Moll et al., 2004). A subset of 5’-uAUGs identified were also implicated in playing a role in regulation of the downstream coding sequence (CDS). In one mRNA, ptrB, we were able to define a novel regulation mechanism dependent upon the 5’-uAUG as a signal itself, rather than its ability to act as an initiation codon, that works in concert with downstream sequence features to govern expression of the CDS. Overall, this study supports the idea that leaderless mRNAs are more abundant than originally expected. Since leaderless mRNAs are relatively prevalent, it is important to understand how they are recognized by the ribosome in the absence of conventional binding signals. The translation initiation mechanism used by leaderless mRNAs is distinct from the mechanism used by canonical leadered mRNAs but the process of mRNA loading into the 70S ribosome is yet to be elucidated. This study provided further insight into a leaderless mRNA’s initial interactions with the ribosome through the use of photochemical cross-linking. We contributed to the development of a method that makes use of 4-thiouridine (4S-U) cross-linking to identify specific contact sites between a protein and an RNA molecule, specifically ribosomal protein S1 and the naturally occurring leaderless mRNA cI from bacteriophage λ. This method will allow researchers to map the initial interaction between leaderless mRNAs and the ribosome, providing insight into the long-sought after mechanism of leaderless mRNA loading into the 70S ribosome. Efforts to correct the underrepresentation of leaderless mRNAs and small peptides 109 Bioinformatic analysis performed in this study identified 287 transcripts in E. coli with an AUG triplet within three nucleotides of the 5’-terminus. Although only thirteen transcripts have been tested, the 5’-uORFs were expressed at varying levels and all the 5’-uAUGs appear to perform diverse functions, indicating that extensive testing of all transcripts identified could uncover novel functions and peptides. This study highlights the current underrepresentation of leaderless mRNAs in E. coli, as only three leaderless mRNAs were previously described (Ptashne et al., 1976; Christie and Calendar, 1985; Klock and Hillen, 1986). If leaderless mRNAs are underestimated in one of the most thoroughly studied model organisms used in molecular genetics, the occurrence of leaderless mRNAs is likely to be miscalculated throughout the prokaryotes. Similar in silico analyses performed in prokaryotes with published transcriptomes are likely to uncover a large number of 5’-uAUGs on canonical mRNAs that can function to initiate translation of an uORF. Although leaderless mRNAs in E. coli must initiate with an AUG triplet, bacteria of other genera contain leaderless mRNAs that can also initiate with a GUG triplet (Cortes et al., 2013; Schrader et al., 2014), expanding the potential number of transcripts analyzed in other bacteria and revealing a vast number of putative peptides encoded by uORFs. Although the number of small peptides is likely to be underestimated because they are harder to identify through both experimental and bioinformatic methods, the small peptides that have been identified participate in a wide array of cellular processes in both bacteria and archaea (Wang et al., 2008; Hobbs et al., 2011). Since small peptides require less energy to be translated and folded, organisms often use them to signal environmental changes and to globally regulate relevant pathways (Seligmann, 2003; Wang et al., 2008), suggesting there may be a selective advantage to maintaining small peptides in the cell. There are many examples of peptides as small as three amino acids in length with a variety functions such as detoxification of electrophiles (Ferguson, 1999), penicillin biosynthesis (Martin et al., 2010) and protease inhibition (Tashiro et al., 1987), demonstrating the ability of very small peptides to have specific functions. All of the uORFs in this study encode peptides that exceed three amino acids in length, suggesting that they might also exhibit relevant functions and should be analyzed further. Whereas some small peptides function in a variety of cellular processes, others are used to regulate those processes. Some uORFs as small as six codons in length are used for 110 translation of nascent peptides that influence the functionality of the ribosome on which they are synthesized by inducing ribosomal stalling (Tenson and Ehrenberg, 2002; Cruz-Vera et al., 2011). Nascent peptides are able to interact with the ribosome in variety of ways, including binding within and blocking the ribosome exit tunnel, acting within the peptidyltransferase center to inhibit its catalytic activity, or altering 23S rRNA conformation (Lovett and Rogers, 1996; Tanner et al., 2009; Yan et al., 2010). Although some uORFs produce small regulatory peptides, others as small as two codons in length are used as regulators to increase the rate of tRNA starvation, thereby inhibiting further protein synthesis (Dincbas et al., 1999; DelgadoOlivares et al., 2006). Yet another uORF function was identified in this study, in which the ptrB 5’-uAUG acts to regulate downstream translation from a SD sequence-led initiation codon, rather than initiating translation to produce a peptide. The 5’-uORFs identified through the bioinformatic analysis in this study could function similarly to any of the aforementioned uORFs, or could show novel types of regulation. Further experimentation should be performed to define possible functions of these 5’-uORFs. Mechanisms to regulate protein production Translation is highly regulated in a variety of ways because it is an energetically expensive process. Since initiation is the rate-liming step of translation, many systems are in place to specifically govern initiation to prevent an unnecessary use of energy. The abundance of non-SD-led mRNAs has led to the belief that there are a variety of different mechanisms utilized by prokaryotes to regulate initiation, including many alternatives to the canonical mechanism (Malys and McCarthy, 2011). Therefore, it is not surprising that another regulation mechanism has been identified to control expression of ptrB mRNA. However, more work must be completed to fully understand the energetics and ribosomal movement in the mechanism. Although ribosomal movement along a transcript when not actively translating is energyindependent (Schmidt et al., 1987; Yamamoto et al., 2015), the energy consumption in ptrB regulation should be analyzed as well. The conformation of the ribosome and its initial binding site must also be elucidated. Also, because the regulation operates in a gene context-independent manner, this regulation may be more widespread. Therefore, it is of interest to identify other transcripts that utilize the same method of regulation, potentially even in other genera. 111 The regulation of translation initiation employed by ptrB, though unique, has parallels with that of E. coli tisAB mRNA, in which the uORF, tisA, is not translated but is simply used as a platform for ribosome binding due to the absence of secondary structure (Darfeuille et al., 2007). In the case of tisAB, upstream ribosome loading onto the standby site is required for downstream initiation due to a highly structured SD sequence-led translation initiation region (Darfeuille et al., 2007). The uORF in this case, although not at the 5’-terminus, is essential due to inhibitory secondary structure, in contrast to the regulation of ptrB, which has a weak SD sequence and instead must take advantage of the inherent binding strength of the ribosome for 5’-AUGs to allow for ribosomal binding and CDS expression. Therefore, the regulation observed in translation initiation of ptrB may be a result of evolutionary selection to possess a 5’-terminal AUG to be used as a ribosome binding platform to enhance translation that would be inefficient if it were reliant upon canonical ribosome binding signals. Leaderless mRNAs provide insight into the evolution of translation Leaderless mRNA translation initiation has been hypothesized to reflect the mechanism used by ancestral mRNAs because, regardless of origin, an mRNA with a 5’-terminal start codon can be translated using the translational machinery from any of the three domains (Grill et al., 2000). This evidence suggests that the inherent ability to translate leaderless mRNAs was present prior to domain separation and has been retained throughout evolution. Genomic studies have shown that more slowly-evolving bacteria and those located near the root of the 16S rRNA phylogenetic tree have more leaderless mRNAs, and phylogenetically related species have a similar abundance of leaderless mRNAs (Zheng et al., 2011). It is thought that distinct mechanisms of initiation evolved due to the presence of the nuclear envelope in eukaryotes, which decouples transcription and translation and requires added protection from RNases through the addition of the 5’-7-methylguanylate cap and 3’-polyA tail (Malys and McCarthy, 2011). Although leaderless mRNAs are present in every domain of life (Janssen, 1993), the reduced abundance of leaderless mRNAs in prokaryotes has also been attributed to an increased number of operons (Zheng et al., 2011). Organizing genes into operons ensures that only the proximal cistron can be leaderless, whereas the distal cistrons must have upstream sequence features to recruit the ribosome. These sequence features are necessary for ribosomal retention and proper alignment of the ribosome on downstream initiation codons (Cone and Steege, 1985). 112 This trend is also seen in archaea, in which proximal genes in operons are leaderless and the distal genes are SD sequence-led (Tolstrup et al., 2000). This operon organization appears to be the case for several of the genes identified in this study because they are the first cistron in their operon, i.e., fucP, glpF, iscR, and uvrY, and have been shown in this study to also contain a proximal leaderless 5’-uORF. Although the mechanism of leaderless mRNA initiation reflects an ancestral initiation mechanism, the translational machinery still possesses an inherent functionality to translate leaderless mRNAs. The number of leaderless mRNAs present in prokaryotes suggests there may be an advantage to retaining the ability to translate mRNAs lacking a 5’-UTR. During translation, the mRNA must pass through a closed channel of the 30S ribosomal subunit, so an initiation codon at the 5’-terminus allows for leaderless mRNAs to be threaded directly into the decoding channel of the 70S ribosome (Benelli and Londei, 2009). Therefore, there are structural advantages to lacking a 5’-UTR. These advantages manifest under distinct conditions such as cold stress, carbon starvation and amino acid depletion (Uchida et al., 1970; Ruscetti and Jacobson, 1972; Jones and Inouye, 1996; Vesper et al., 2011). Under these conditions, the concentration of 70S ribosomes increases, which may influence their binding efficiency to leaderless mRNAs (Vesper et al. 2011). This response to different conditions would allow 70S binding to be globally regulated and certain mRNAs, such as leaderless mRNAs, to be translated only under those specific conditions. Under specific stress conditions, changes occur not only in the concentration of 70S ribosomes but also in their composition. The composition of the 70S ribosome is altered under kasugamycin stress, resulting in the loss of multiple ribosomal proteins from the 30S subunit, giving rise to the formation of the 61S particle (Kaberdina et al., 2009). This reduced ribosomal particle can still efficiently translate leaderless mRNAs, but not leadered mRNAs, suggesting that the 61S particle reflects an ancient form of the ribosome that was present before domain separation (Kaberdina et al., 2009). The proteins absent from the 61S particle were likely added later in evolution to facilitate leadered mRNA translation. It would be of interest to determine if the expression levels of the newly identified 5’-uORFs vary under stress conditions that alter the concentration and composition of 70S ribosomes. Applying newly discovered regulatory sequences to genetic engineering 113 A deeper understanding of microbial genetics has allowed scientists to begin the enterprise of creating synthetic life (Hutchison et al., 2016). However, this study has demonstrated that there are extensive areas of the process of translation, a fundamental component of the central dogma, that are still unclear. These includes the different levels of initiation regulation, and the abundance and mechanism of initiation of non-canonical mRNAs. Because translation is such a tightly regulated and important process in the microbial life cycle, it is important to understand how to control translational efficiency so we can effectively manipulate microorganisms in the laboratory to benefit scientific research, as well as exploiting them as protein production centers with maximum efficiency to provide limitless supplies of indemand proteins. The goal is to identify regulatory elements that can be applied to any variety of CDSs to produce the expected concentrations of each protein. This study has identified novel regulation mechanisms that can be utilized to help control translation initiation, and therefore help control protein production. The presence of an uORF has been shown in genetic engineering studies to be beneficial for translation of a downstream cistron due to the positive effects of translational coupling (Mutalik et al., 2013). Coupling was also shown to overcome the effects of a weak SD sequence, similar to the results seen in this study. The presence of an uORF was beneficial when the same regulatory element was used to control the translation of different CDSs, by reducing the variability of expression that naturally arises when comparing expression of different CDSs due to the differences in coding potential (Mutalik et al., 2013). Nevertheless, there was still variability in expression, demonstrating that the regulatory elements tested function in a CDS-dependent manner. This variability could cause problems in complex synthetic genetic systems because the expression is not as tightly controlled to maintain the balance needed to sustain cellular metabolism and growth. However, this study successfully identified regulatory sequence features that function in a CDS-independent manner. The ptrB regulation mechanism influenced expression of multiple CDSs in E. coli similarly, suggesting that we can utilize the ptrB regulatory elements to control expression of diverse CDSs. The use of regulatory signals that function independently of gene context with low variability has vast implications for its use in designing synthetic transcripts. The use of ptrB regulatory sequences may be one step towards the goal of designing controllable regulatory elements. 114 Using forward engineering, others could incorporate the sequence features identified in this study to synthetically design a ribosome binding site to drive translation at a specified rate (Salis et al., 2009). Designing synthetic translation initiation regions would allow for fine-tuning of protein production, especially through the discovery of additional regulatory signals. Therefore, rather than simply controlling translation through the strength of the SD sequence, a transcript could also contain a variety of important upstream and downstream signals to result in even more efficient expression. This tightly regulated control will be necessary as construction of larger and more complicated genetic systems progresses. Leaderless mRNAs advancing antibiotic research The process of translation and the ribosome are both targets of numerous antibiotics (Lambert, 2012). Further understanding of the process of translation and how it is regulated may help us to improve current antibiotics or develop new antibiotics. There are antibiotics that are ineffective in repressing leaderless mRNA translation, specifically kasugamycin and pactamycin (Chin et al., 1993). Leaderless mRNAs can be translated under kasugamycin inhibition because the binding region between the mRNA and the antibiotic is reduced due to the absence of the 5’UTR, and the presence of the 50S subunit stabilizes the binding of tRNA such that kasugamycin is unable to cause tRNA dissociation to elicit its effect (Schuwirth et al., 2006). Pactamycin is thought to specifically disrupt the SD-anti-SD interaction to block protein production (Brodersen et al., 2000); however, because leaderless mRNAs are not reliant upon this interaction they are successfully translated in the presence of this drug. Although this information may currently be of little concern, certain bacteria undergo a stress response that induces sequence specific endonuclease cleavage of mRNA 5’-UTRs resulting in leaderless mRNAs (Vesper et al., 2011; Sauert et al., 2016). If a significant portion of bacteria could acquire a similar response system through horizontal gene transfer, or already possess a homologous system, the organisms may respond to antibiotic stress by generating antibiotic-resistant leaderless mRNAs. This study has demonstrated the potential abundance of previously unannotated leaderless mRNAs in E. coli, suggesting that there are likely many leaderless mRNAs throughout prokaryotes, any of which might have resistance to certain antibiotics. Therefore, it is important to not only understand the details of translation to generate new antibiotics but to also understand how leaderless mRNAs are translated, in case bacteria can 115 utilize them as a countermeasure to antibiotic stress. The methods developed in this study can be utilized to provide this invaluable information in understanding leaderless mRNA interactions with the ribosome. Conclusions Overall, the information gained in this study has illustrated the need for more comprehensive genomic and transcriptomic studies to identify potential uORFs. Although advancements in high-throughput DNA sequencing have expanded our knowledge at a genomic level, there are biases in the annotation of those genomes, with small ORFs and uORFs being drastically underrepresented. Performing laboratory experiments is necessary to properly and meticulously identify the transcriptome and proteome of various organisms. Unfortunately, complete annotation of the transcriptomes will not provide us with detailed mechanisms of translational regulation. We have worked to identify more sequences involved in controlling protein production; however, we hypothesize that there are still many that are unknown. It appears that each mRNA may possess different combinations of regulatory sequences. This idea is termed the “cumulative specificity initiation mechanism,” in which the initiation site is selected by the cumulative and cooperative actions of preferred nucleotides at certain positions, both upstream and downstream relative to the initiation site (Nakamoto, 2011). These independent interactions allow the ribosome to have broad specificity and ensure that no singular interaction is absolutely essential (Nakamoto, 2011). Therefore, researchers should continue to try to identify as many important regulatory sequences as possible to further our understanding of a cellular process essential to every domain of life. Because there appears to be an advantage to maintaining a 5’-uAUG and producing small peptides, we should continue our research in both of these areas to utilize the millions of years of natural selection performed by bacteria. This information can then be applied to antibiotic manipulation, genetic engineering and –omics annotation. 116 References Adhin, M.R. and van Duin, J. 1990. Scanning model for translational reinitiation in eubacteria. J. Mol. Biol. 213: 811-818. Andre, A., Puca, A., Sansone, F., Brandi, A., Antico, G., Calogero, R.A. 2000. Reinitiation of protein synthesis in Escherichia coli can be induced by mRNA cis-elements unrelated to canonical translation initiation signals. FEBS Letters 468:73-78. Baba, T., Ara, T., Hasegawa, M., Takai, Y., Okurmura, Y., Baba, M., Datsenko, K.A., Tomita, M., Wanner, B.L., Mori, H. 2006. Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: the Keio collection. Mol. Syst. Biol. 2:1-11. Balakin, A.G. Skripkin, E.A., Shatsky, I.N., Bogdanov, A.A. 1992. Unusual ribosome binding properties of mRNA encoding bacteriophage λ repressor. Nuc. Acids Res. 20:563-571. Benelli, D., Maone, E., Londei, P. 2003. Two different mechanisms for ribosome/mRNA interaction in archaeal translation initiation. Mol. Microbiol. 50:635-643. Benelli, D., Londei, P. 2009. Begin at the beginning: evolution of translation initiation. Res Microbiol. 160:493-501. Berkhout B., Schmidt, B.F., van Strien, A., van Boom, J., van Westenen, J., van Duin, J. 1987. Lysis gene of bacteriophage MS2 is activated by translation termination at the overlapping coat gene. J. Mol. Biol. 195: 517-524. Binns, N., and Masters, M. 2002. Expression of the Escherichia coli pcnB gene is translationally limited using an inefficient start codon: a second chromosomal example of translation initiated at AUU. Mol. Microbiol. 44:1287-1298. Boni, I.V., Isaeva, D.M., Musychenko, M.L., Tzareva, N.V. 1990. Ribosome-messenger recognition: mRNA target sites for ribosomal protein S1. Nuc. Acids Res. 19:155-162. Boni, I.V., Artamonova, V.S., Tzareva, N.V., Dreyfus, M. 2001. Non-canonical mechanism for translational control in Bacteria: synthesis of ribosomal protein S1. EMBO J. 20: 4222-4232. Breaker, R.R. 2012. Riboswitches and the RNA world. Cold Spring Harb Perspect Biol. 4:a003566. 117 Brenneis, M., Soppa, J. 2009. Regulation of translation in haloarchaea: 5’- and 3’-UTRs are essential and have to functionally interact in vivo. PLOS One 4:e4484. Brock, J.E., Paz, R.L., Cottle, P., Janssen, G.R. 2007. Naturally occurring adenines within mRNA coding sequences affect ribosome binding and expression in Escherichia coli. J. Bacteriol. 189:501-510. Brock, J.E., Pourshahian S., Giliberti, J., Limbach, P.A., Janssen, G.R. 2008. Ribosomes bind leaderless mRNA in Escherichia coli through recognition of their 5’-terminal AUG. RNA. 14:2159-2169. Brodersen, D.E., Clemons, W.M., Carter, A.P., Morgan-Warren, R.J., Wimberly, B.T., Ramakrishnan, V. 2000. The structural basis for the action of antibiotics tetracycline, pactamycin, and hygromycin B on the 30S ribosomal subunit. Cell. 103: 1143-1154. Byrgazov, K., Manoharadas, S., Kaberdina, A.C., Vesper, O., Moll, I. 2012. Direct interaction of the Nterminal domain of ribosomal protein S1 with protein S2 in Escherichia coli. PLoS One 7(3): e32702. doi:10.1371/journal.pone.0032702. Byrgazov, K., Grishkovkaya, I., Arenz, S., Coudevylle, N., Temmel, H., Wilson, D.N., DjinovicCarugo, K., Moll, I. 2015. Structural basis for the interaction of protein S1 with the Escherichia coli ribosome. Nuc. Acids Res. 43: 661-673. Calogero, R.A., Pon, C.L., Canonaco, M.A., and Gualerzi, C.O. 1988. Selection of the mRNA translation initiation region by Escherichia coli ribosomes. Proc. Natl. Acad. Sci. USA. 85:64276431. Chang, B., Halgamuge, S., Tang, S. 2006. Analysis of SD sequences in completed microbial genomes: Non-SD-led genes are as common as SD-led genes. Gene. 373:90-99. Chin, K., Shean, C.S., Gottesman, M.E. 1993. Resistance of λ cI translation to antibiotics that inhibit translation initiation. J. Bacteriol. 175:7471-7473. Christie, G.E., Calendar, R. 1985. Bacteriophage P2 late promoters II. Comparison of the four late promoter sequences. J Mol Biol. 181: 373-382. 118 Condo, I., Ciammaruconi, A., Benelli, D, Ruggero, D. Londei, P. 1999. Cis-acting signals controlling translational initiation in the thermophilic archaeon Sulfolobus solfataricus. Mol. Microbiol. 34:377-384. Cone, K.C., and Steege, D.A. 1985. Functional analysis of lac repressor restart sites in translational initiation and reinitiation. J. Mol. Biol. 186: 733-742. Cortes, T., Shubert, O.T., Rose, G., Arnvig, K.B., Comas, I., Aebersold, R., Young, D.B. 2013. Genome-wide mapping of transcriptional start site defines an extensive leaderless transcriptome in Mycobacterium tuberculosis. Cell Reports. 5: 1121-1131. Cruz-Vera, L.R., Sachs, M.S., Squires, C.L., Yanofsky, C. 2011. Nascent polypeptide sequences that influence ribosome function. Curr. Opin. In Microbiol. 14: 160-166. Darfeuille, F., Unoson, C., Vogel, J., Wagner, E.G.H. 2007. An antisense RNA inhibits translation by competing with standby ribosomes. Mol. Cell. 26: 381-392. de Boer, H.A., Hui, A., Comstock, L.J., Wong, E., Vasser, M. 1983. Portable Shine-Dalgarno regions: a system for a systematic study of defined alterations of nucleotide sequences within E.coli ribosome binding sites. DNA. 2:231-235. Delgardo-Olivares, L., Zamora-Romo, E., Guarneros, G., Hernandez-Sanchez, J. 2006. Codon-specific and general inhibition of protein synthesis by the tRNA-sequestering minigenes. Biochimie. 88: 793-800. de Smit, M.H., van Duin, J. 1990. Secondary structure of the ribosome binding site determines translational efficiency: a quantitative analysis. Proc. Natl. Acad. Sci. USA. 87: 7668-7672. de Smit, M.H., van Duin, J. 1994. Translational initiation on structured messengers. Another role for the Shine-Dalgarno interaction. J. Mol. Biol. 235:173-184. de Smit, M.H. and van Duin, J. 2003. Translational standby sites: How ribosomes may deal with the rapid folding kinetics of mRNA. J. Mol. Biol. 331:737-743. Dincbas, V., Heurgue-Hamard, V., Buckingham, R.H., Karimi, R., Ehrenberg, M. 1999. Shutdown in protein synthesis due to the expression of min-genes in bacteria. J Mol Biol. 291: 745-759. 119 Draper, D.E., von Hippel, P.H. 1978. Nucleic acid binding properties of Escherichia coli ribosomal protein S1. J. Mol. Biol. 122: 321-359. Draper, D.E. 1987. Translational regulation of ribosomal proteins in Escherichia coli. In Translational Regulation of Gene Expression. Ilan, J. (ed.) New York: Plenum Press, pp.1-26. Draper, D.E., Gluick, T.C., Schlax, P.J. 1998. Pseduoknots, RNA folding and translational regulation. In RNA Structure and Function. Simons R.W., and Grubery-Manago, M. (eds). Cold Spring Harbor, New York: Cold Spring Harbor Laboratory Press, pp. 415-436. Dreyfus, M. 1988. What constitutes the signal for initiation of protein synthesis on Escherichia coli mRNAs? J. Mol. Biol. 204:79-94. Fargo, D.C., Zhang, M., Gillham, N.W., Boynton, J.E. 1998. Shine-Dalgarno-like sequences are not required for translation of chloroplast mRNAs in Chlamydomonas reinhardtii chloroplasts or in Escherichia coli. Mol. Gen. Genet. 257:271-282. Favre, A. 1990. Photochemistry and the nucleic acids. In Bioorganic photochemistry (ed. H. Morrison), pp. 379–425. John Wiley &Sons, New York. Ferguson, G.P. 1999. Protective mechanisms against toxic electrophiles in Escherichia coli. Trends in Microbiology. 7:242-247. Fredrick, K., Noller, H.F. 2002. Accurate translocation of mRNA by the ribosome requires a peptidyl group or its analog on the tRNA moving into the 30S P site. Mol. Cell. 5: 1125-1131. Gaba, A., Wang, Z., Krishnamoorthy, T., Hinnebusch, A., Sachs, M. 2001. Physical evidence for distinct mechanisms of translational control by upstream open reading frames. EMBO. 20: 64536463. Gama-Castro, S., Salgado, H., Peralta-Gil, M., Santos-Zavaleta, A., Muniz-Rascado, L., Solano-Lira, H. et al. 2011. RegulonDB version 7.0: transcription regulation of Escherichia coli K-12 integrated within genetic sensory response unit (Gensor Units). Nuc. Acids Res. 39: D98-D105. 120 Gottesman, S. 2005. Micros for microbes: non-coding regulatory RNAs in bacteria. Trends Genet. 21:399-404. Green, R., and Noller, H.F. 1997. Ribosomes and translation. Annu. Rev. Biochem. 66:679-716. Grill, S., Gualerzi, C.O., Londei, P., Blasi, U. 2000. Selective stimulation of translation of leaderless mRNA by initiation factor 2: evolutionary implications for translation. The EMBO Journal. 19: 4101-4110. Grill, S., Moll, I., Hasenohrl, D., Gualerzi, C.O., Blasi, U. 2001. Modulation of ribosomal recruitment to 5’-terminal start codons by translation initiation factors IF2 and IF3. FEBS Letters 495:167-171. Gu, W., Zhou, T., Wilke, C.O. 2010. A universal trend of reduced mRNA stability near the translationinitiation site in prokaryotes and eukaryotes. PLoS Comput Biol. 6:e1000664. Gualerzi, C.O., Pon, C.L. 1990. Initiation of mRNA translation in prokaryotes. Biochemistry 29:58815889. Gualerzi, C.O., Giuliodori, A. M., Pon, C.L. 2003. Transcriptional and post-transcriptional control of cold-shock genes. J. Mol. Biol. 331:527-539. Gualerzi, C.O., and Pon, C.L. 2015. Initiation of mRNA translation in bacteria: structural and dynamic aspects. Cell Mol. Life Sci. 72: 4341-4367. Hanahan, D. 1983. Studies on transformation of Escherichia coli with plasmids. J. Mol. Biol. 4:557-580. Hartz, D., McPheeters, D.S., Gold, L. 1989. Selection of the initiator tRNA by Escherichia coli initiation factors. Genes Dev. 3: 1899-1912. Hering, O., Brenneis, M., Beer, J., Suess, B., Soppa, J. 2009. A novel mechanism for translation initiation operates in haloarchaea. Mol. Microbiol. 71: 1451-1463. Hirose, T., Sugiura, M. 2004. Multiple elements required for translation of plastid atpB mRNA lacking the Shine-Dalgarno sequence. Nuc. Acids Res. 32: 3503-3510. Hobbs, E.C., Fontaine, F., Yin, X., Storz, G. 2011. An expanding universe of small proteins. Curr. Opin. in Microbiol. 14: 167-173. 121 Hui, A., de Boer, H.A., 1987. Specialized ribosome system: Preferential translation of a single mRNA species by a subpopulation of mutated ribosomes in Escherichia coli. Proc. Natl. Acad. Sci. USA. 84:4762-4766. Hutchison III, C.A., Chuang, R., Noskov, V.N, Assad-Garcia, N., Deerinck, T.J., Ellisman, M.H., Gill, J., Kannan, K., Karas, B.J., Ma, L., Pelletier, J.F., Qi, Z., Richter, R.A., Strychalski, E.A., Sun, L., Suzuki, Y., Tsvetanova, B., Wise, K.S., Smith, H.O., Glass, J.I., Merryman, C., Gibson, D.G., Venter, J.C. 2016. Design and synthesis of a minimal bacterial genome. Science. 351: DOI: 10.1126/science.aad6253. Huttenhofer, A., Noller, H.F. 1994. Footprinting mRNA-ribosome complexes with chemical probes. The EMBO Journal. 13:3892-3901. Islam, S., Shaw, R., Frankel, G., Pallen, M.J., Busby, S.J.W. 2012. Translation of a minigene in the 5’leader sequence of the enterohaemorrhagic Escherichia coli LEE1 transcription unit affects expression of neighboring downstream gene. Biochem J. 441: 247-253. Janssen, G.R. Eubacterial, archaebacterial, and eukaryotic genes that encode leaderless mRNA. In Industrial Microorganisms: Basic and Applied Molecular Genetics. 1993. Baltz, R.H., Hegeman, G.D. and Skartrud, P.L. (eds). Washington, DC: American Society for Microbiology Press. Pp. 59-67. Janssen, G.R., Bibb, M.J. 1993. Derivatives of pUC18 that have BglII sites flanking a modified multiple cloning site and that retain the ability to identify recombinant clones by visual screening of Escherichia coli colonies. Gene. 124: 133-134. Jones, P.G. and Inouye, M. RbfA, a 30S ribosomal binding factor, is a cold-shock protein whose absence triggers the cold-shock response. Mol. Microbiol. 1996; 21: 1207-1218. Kaberdin, V.R., and Blasi, U. 2006. Translation initiation and the fate of bacterial mRNAs. FEMS Microbiol. Rev. 30:967-979. Kaczanowska, M., and Ryden-Aulin, M. 2007. Ribosome biogenesis and the translation process in Escherichia coli. Microbiol. and Mol. Biol. Rev. 71:477-494. 122 Kanatani, A., Masuda, T., Shimoda, T., Misoka, F., Sheng Lin, X., Yoshimoto, T., Tsuru, D. 1991. Protease II from Escherichia coli: Sequencing and expression of the enzyme gene and characterization of the expressed enzyme. J. Biochem. 110: 315-320. Kennell, D. and Riezman, H. 1977. Transcription and translation initiation frequencies of the Escherichia coli lac operon. J. Mol. Biol. 114-1-21. Klinkert B, Narberhaus F. 2009. Microbial thermosensors. Cell. Mol. Life Sci. 66:2661-2676. Klock, G., and Hillen, W. 1986. Expression, purification and operator binding of the transposon Tn1721-encoded Tet repressor. J. Mol. Biol. 189: 633-641. Komarova, A.V., Tchufistova, L.S., Supina, E.V., Boni, I.V. 2002. Protein S1 counteracts the inhibitory effect of the extended Shine-Dalgarno sequence on translation. RNA. 8:1137-1147. Komarova, A.V., Tchufistova, L.S., Dreyfus, M., Boni, I.V. 2005. AU-rich sequences within 5’ untranslated leaders enhance translation and stabilize mRNA in Escherichia coli. J Bacteriol. 187:1344-1349. Kozak M. 1999. Initiation of translation in prokaryotes and eukaryotes. Gene. 234: 187-208. Krishnan, K.M., Van Etten, III W.J., Janssen, G.R. 2010. Proximity of the start codon to a leaderless mRNA is a strong positive determinant of ribosome binding and expression in Escherichia coli. J Bacteriol. 192: 6482-6485. Lambert, T. 2012. Antibiotics that affect the ribosome. Rev. Sci. Tech. 31:57-64. La Teana, A., Gualerzi, C.O., Brimacombe, R. 1995. From stand-by to decoding site. Adjustment of the mRNA on the 30S ribosomal subunit under the influence of the initiation factors. RNA. 1:772782. Laursen, B., Sorensen, H., Mortensen, K., Sperlink-Petersen, H. 2005. Initiation of protein synthesis in bacteria. Micro. and Mol. Biol. Rev. 6:101-123. Lovett, P.S. and Rogers, E.J. 1996. Ribosome regulation by the nascent peptide. Microbiol. Rev. 60: 366-385. 123 Malys, N., McCarthy, J.E.G. 2011. Translation initiation: variations in the mechanism can be anticipated. Cell. Mol. Life Sci. 68: 991-1003. Martin, J.F., Ullan, R.V., Garcia-Estrada, C. 2010. Regulation and compartmentalization of β-lactam biosynthesis. Microbial Biotech. 3:285-299. Martin-Farmer, J., and Janssen, G.R. 1999. A downstream CA repeat sequence increases translation from leadered and unleadered mRNA in Escherichia coli. Mol. Micro. 31:1025-1038. Marzi, S., Myansnikov, A.G., Sergavov, A., Ehresmann, C., Romby, P., Yuspupov, M., Klaholz, B.P. 2007. Structured mRNAs regulate translation initiation by binding to the platform of the ribosome. Cell. 130:1019-1031. McCarthy, J.E., Brimacombe, R. 1994. Prokaryotic translation: the interactive pathway leading to initiation. Trend Genet. 10:402-407. McCutcheon, J.P., Agrawal, R.K., Rhilips, S.M., Grassucci, R.A., Gerchman, S.E., Clemons, W.M., Jr., Ramakrishnan, V., and Frank, J. 1999. Location of the translation initiation factor IF3 on the small ribosomal subunit. Proc. Natl. Acad. Sci. USA. 96:4301-4306. McGinness, K.E., Sauer, R.T. 2004. Ribosomal protein S1 binds mRNA and tmRNA similarly but plays distinct roles in translation of these molecules. Proc. Natl. Acad. Sci. 101:13454-13459. Medenbach, J., Seiler, M., Hentze, M.W. 2011. Translational control via protein-regulated upstream open reading frames. Cell. 145: 902-913. Meijer, H., and Thomas, A.M. 2002. Control of eukaryotic protein synthesis by upstream open reading frames in the 5’-untranslated region of an mRNA. Biochem J. 367: 1-11. Melancon, P., Leclerc, D., Destroismasisons, N., Brakier-Gingras L. 1990. The anti Shine-Dalgarno region in Escherichia coli 16S ribosomal RNA is not essential for the correct selection of translational starts. Biochem. 29:3402-3407. Miller, J. A short course in bacterial genetics. In A laboratory manual for Escherichia coli and related bacteria. Vol 1, pp. 72-74. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY. 1992. 124 Min Jou, W., Haegeman, G., Ysebaert M., Fiers, W. 1972. Nucleotide sequence of gene coding for the bacteriophage MS2 coat protein. Nature. 237:82-88. Moll, I., Grill, S., Gualerzi, C.O., Blasi, U. 2002. Leaderless mRNAs in bacteria: surprises in ribosomal recruitment and translational control. Mol. Microbiol. 43: 239-246. Moll, I., Hirokawa, G., Kiel, M.C., Kaji, A., Blasi,U. 2004. Translation initiation with 70S ribosomes: an alternative pathways for leaderless mRNAs. Nuc. Acids Res. 32:3354-3363. Morris, D.R. and Geballe, A.P. 2000. Upstream open reading frames as regulators of mRNA translation. Mol. and Cell Biol. 20: 8635-8642. Munson, L.M., Stormo, G.D., Niece, R.L., Reznikoff, W.S. 1984. lacZ translation initiation mutations. J. Mol. Biol. 177: 663-683. Mutalik, V.K., Guimaraes, J.C., Cambray, G., Lam, C., Christoffersen, M.J., Mai, Q., Tran, A.B., Paull, M., Keasling, J.D., Arkin, A.P., Endy, D. 2013. Precise and reliable gene expression via standard transcription and translation initiation elements. Nature Methods. 10: 354-360. Nakamoto, T. 2006. A unified view of initiation of protein synthesis. Biochem. and Biophys. Res. Comm. 341: 675-678. Nakamoto, T. 2011. Mechanisms of the initiation of protein synthesis: in reading frame binding of ribosomes to mRNA. Mol. Biol. Rep. 38: 847-855. Nakatogawa, H., Ito, K. 2002. The ribosomal exit tunnel functions as a discriminating gate. Cell 108: 629-636. Narberhaus, F., Waldminghaus, T., Chowdhury, S. 2006. RNA thermometers. FEMS Microbiol Rev. 30: 3-16. Nissen, P., Hansen, J., Ban, N., Moore, P.B., Steitz, T.A. 2000. The structural basis of ribosome activity in peptide body synthesis. Science. 289:920-930. O’Donnell, SM., and Janssen, GR. 2002. Leaderless mRNAs bind 70S ribosomes more strongly than 30S ribosomal subunits in Escherichia coli. J. Bacteriol. 184:6730-6733. 125 Ogle, J.M., Brodersen, D.E., Clemons, W.M., Tarry, M.J., Carter, A.P., Ramakrishnan V. 2001. Recognition of cognate transfer RNA by the 30S ribosomal subunit. Science. 292:897-902. Osterman, I.A., Evfratov, S.A., Sergiev, P.V., Donstova, O.A. 2013. Comparison of mRNA features affecting translation initiation and reinitiation. Nuc. Acids Res. 41: 474-486. Petersen, H.U., Joseph, E., Ullmann, A., Danchin, A. 1978. Formylation of initiator tRNA methionine in prokaryotic protein synthesis: in vivo polarity in lactose operon expression. J. Bacteriol. 135: 453-459. Perego, M. 1997. A peptide export-import control circuit modulating bacterial development regulates protein phosphatases of the phosphorelay. Proc. Natl. Acad. Sci. USA. 94:8612-8617. Ptashne, M., Backman, K., Humayun, M.Z., Jeffrey, A., Maurer, R., Meyer, B., et al. 1976. Autoregulation and function of a repressor in bacteriophage lambda. Science. 194: 156-161. Qing, G., Xia, B., Inouye M. 2003. Enhancement of translation initiation by A/T –rich sequences downstream of the initiation codon in Escherichia coli. J. Mol. Microbiol. Biotechnol. 6:133144. Ramakrishnan, V. 2002. Ribosome structure and the mechanism of translation. Cell. 108:557-572. Rex, G., Surin, B., Besse, G., Schneppe, B., McCarthy, J.E.G. 1994. The mechanism of translational coupling in Escherichia coli. J. Biol. Chem. 269: 18118-18127. Ringquist, S., Shinedling, S., Barrick, D., Green, L., Binkley, J., Stormo, G.D., Gold, L. 1992. Translation initiation in Escherichia coli: sequence within the ribosome-binding site. Mol. Microbiol. 6:1219-1229. Ringquist, S., MacDonald, M., Gibson, T., Gold, L. 1993. Nature of the ribosomal mRNA track: Analysis of ribosome-binding sites containing different sequences and secondary structures. Biochem. 32: 10254-10262. Ruscetti, F.W. and Jacobson, L.A. 1972. Accumulation of 70S monoribosomes in Escherichia coli after energy source shift-down. J. Bacteriol. 111: 142-151. 126 Rudd, K.E., Schneider, T.D. 1992. Complication of E.coli ribosome binding site, p 17.19-17.46. In J. Miller (ed.), A short course in bacterial genetics. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY. Salavati R, Oliver D. 1995. Competition between ribosome and SecA binding promotes Escherichia coli secA translational regulation. RNA. 1:745-753. Salis, H.M., Mirsky, E.A., Voigt, C.A. 2009. Automated design of synthetic ribosome binding sites to control protein expression. Nature Biotechnology. 27: 946-950. Sarabhai, A., Brenner, G. 1967. A mutant which reinitiates the polypeptide chain after chain termination. J. Mol. Biol. 27: 145-162. Sato, T., Terabe, M., Watanabe, H., Gojobori, T., Hori-Takemoto, C., Mirua, K. 2001. Codon and base biases after the initiation codon of the open reading frames in Escherichia coli genome and their influence on the translational efficiency. .J Biochem. 129: 851-860. Sauert, M., Wolfinger, M.T., Versper, O., Muller, C., Byrgazov, K., Moll, I. 2016. The MazF-regulon: a toolbox for the post-transcriptional stress response in Escherichia coli. Nuc. Acids Res. 10:1-16. Scharff, L.B., Childs, L., Walther, D., Bock, R. 2013. Local absence of secondary structure permits translation of mRNAs that lack ribosome-binding sites. PLoS Genetics 7: 1-11. Scherer, G.F.E., Walkinshaw, M.D., Arnott, S., Morr, D.J. 1980. The ribosome-binding sites recognized by E.coli ribosomes have regions with signal character in both the leader and protein coding segments. Nucleic Acids Res. 8: 3895-3907. Schleif, R. 1972. Fine-structure deletion map of the Escherichia coli L-arabinose operon. Proc. Natl. Acad. Sci. USA. 11: 3479-3484. Schluenzen F., Toakemoto, C., Wilson, D.N., Kaminishi, T., Harms, J.M., Hanawa-Suetsugu, K., Szaflarski, W., Kawazoe, M., Shirouzu, M., Nierhaus, K.H. 2006. The antiobiotic kasugamycin mimics mRNA nucleotides to destabilize tRNA binding and inhibit canonical translation initiation. Nat. Struct. Mol. Biol. 13:871-878. 127 Schmeing, T.M., Moore, P.B., Steitz, T.A. 2003. Structures of deacylated tRNA mimics bound to the E site of the large ribosomal subunit. RNA. 9:1345-1352. Schmidt, B.F., Berkhout, B., Overbeek, G.P., van Strien, A., van Duin, J. 1987. Determination of the RNA secondary structure that regulates lysis gene expression in the bacteriophage MS2. J Mol. Biol. 195: 505-516. Schneider, T.D., Stormo, G.D., Gold, L., Ehrenfeucht, A. 1986. Information content of binding sites on nucleotide sequences. J. Mol. Biol. 188:415-431. Schrader, J.M., Zhou, B., Li, G., Lasker, K., Childers, W.S., Williams, B., Long, T., Crosson, S., McAdams, H.H., Weissman, J.S., Shapiro, L. 2014. The coding and noncoding architecture of the Caulobacter crescentus genome. PLOS Genetics. 10(7): e1004463. doi:10.1371/journal.pgen.1004463. Schuwirth, B.S., Borovinskaya, M.A., Hau, C.W., Zhang, W., Vila-Sanjurjo, A., Holton, J.M., Cate, J.H.D. 2005. Structure of the 70S ribosome at 3.5 Å resolution. Science. 310:827-834. Schuwirth, B.S., Day, J.M., Hau, C.W., Janssen, G.R., Dahlberg, A.E., Cate, J.H., Vila-Sanjurjo, A. 2006. Structural analysis of kasugamycin inhibition of translation. Nat. Struct. Mol. Biol. 13:879-886. Seligmann, H. 2003. Cost-minimization of amino acid usage. J. Mol. Evol. 56: 151-161. Sharma, P., Yan, F., Doronina, V.A., Escuin-Ordinas, H., Ryan, M.D., Brown, J.D. 2012. 2A peptides provide distinct solutions to driving stop-carry on translation recoding. Nuc. Acids Res. 40: 3143-3151. Shell, S.S., Wang, J., Lapierre, P., Mir, M., Chase, M.R., Pyle, M.M., Gawande, R., Ahmad, R., Sarracino, D.A., Ioerger, T.R., Fortune, S.M., Derbyshire, K.M., Wade, J.T., Gray, T.A. 2015. Leaderless transcripts and small proteins are common features of the mycobacterial translational landscape. PLoS Genet. 11(11): e1005641. doi:10.1371/journal.pgen.1005641 Shine, J., and Dalgarno, L. 1974. The 3’-terminal sequence of Escherichia coli 16S ribosomal RNA: Complementarity to nonsense triplets and ribosome binding sites. Proc. Natl. Acad. Sci. USA. 71:1342-1346. 128 Simonetti, A., Marzi, S., Jenner, L., Mayansnikov, A., Romby, P., Yusupova, G., Klaholz, B.P., Yusupov, M. 2009. A structural view of translation initiation in bacteria. Cell Mol. Life Sci. 66:423-436. Singer, P. and Nomura, M. 1985. Stability of ribosomal protein mRNA and translational feedback regulation in Escherichia coli. Mol. Gen. Genet. 199:543-546. Slupska, M.M., King, A.G., Fitz-Gibbon, S., Besemer, J., Borodovsky, M., Miller, J.H. 2001. Leaderless transcripts of the crenarchaeal hyperthermophile Pyrobaculum aerophilum. J. Mol. Biol. 309:347-360. Sørensen, M.A., Fricke, J., Pedersen, S. 1998. Ribosomal protein S1 is required for translation of most, if not all, natural mRNAs in Escherichia coli in vivo. J. Mol. Biol. 280: 561-569. Spanjaard, R.A., and van Duin, J. 1989. Translational reinitiation in the presence and absence of a Shine and Dalgarno sequence. Nuc. Acids Res. 17: 5501-5507. Storz, G. 1999. An RNA thermometer. Genes Dev. 13:633-636. Studer, S.M., Joseph, S. 2006. Unfolding of mRNA secondary structure by the bacterial translation initiation complex. Mol. Cell. 22:105-115. Subramanian, A.R. 1983. Structure and functions of ribosomal protein S1. Prog. Nucleic Acid Res. Mol. Biol. 28:101-142. Suryanarayana, T., Subramanian, A.R. 1983. An essential function of ribosomal protein S1 in messenger ribonucleic acid translation. Biochemistry. 22:2715-2719. Takyar, S., Hickerson, R.P., Noller, H.F. 2005. mRNA helicase activity of the ribosome. Cell 120:49-58. Tanner, D.R., Cariello, D.A., Woolstenhulme, C.J., Broadbent, M.A., Buskirk, A.R. 2009. Genetic identification of nascent peptides that induce ribosome stalling. J. Biol. Chem. 284:34809-34818. Tashiro, M., Klenk, H.D., Rott, R. 1987. Inhibitory effect of a protease inhibitor, leupeptin, on the development of influenza pneumonia, mediated by concomitant bacteria. J. Gen. Virol. 68:20392041. 129 Tedin K., Resch A., Blasi U. 1997. Requirements for ribosomal protein S1 for translation initiation of mRNAs with and without a 5’ leader sequence. Mol. Microbiol. 25: 189-199. Tedin, K., Moll, I., Grill, S., Resch, A., Grashopf, A., Gualerzi, C.O., Blasi, U. 1999. Translation initiation factor 3 antagonizes authentic start codon selection on leaderless mRNAs. Mol. Microbiol. 31:67-77. Tenson, T., Ehrenberg, M. 2002. Regulatory nascent peptides in the ribosomal tunnel. Cell. 108:591594. Thieringer H.A., Jones, P.G., Inouye, M. 1998. Cold shock and adaptation. Bioessays. 20:49-57. Tolstrup, N., Sensen, C.W., Garrett, R.A., Clausen, I.G. 2000. Two different and highly organized mechanisms of translation initiation in the archaeon Sulfolobus solfataricus. Extremophiles. 4:175-179. Tucker, B.J. and Breaker, R.R. 2005. Riboswitches as versatile gene control elements. Curr. Opin. Struct. Biol. 15:342-348. Tzareva, N.V., Makhno, V.I., Boni, I.V. 1994. Ribosome-messenger recognition in the absence of the Shine-Dalgarno interactions. FEBS Letters 337:189-194. Uchida, T., Abe, M., Matsuo, K., Yoneda, M. 1970. Amounts of free 70S ribosomes and ribosomal subunits found in Escherichia coli at various temperatures. Biochem. Biophys. Res. Commun. 41: 1048-1054. Udagawa, T., Shimizu, Y., Ueda, T. 2004. Evidence for the translation initiation of leaderless mRNAs by the intact 70S ribosome without its dissociation into subunits in Eubacteria. J. Biol. Chem. 279:8539-8546. Unoson, C., Wagner, E.G.H. 2007. Dealing the stable structures at ribosome binding sites: bacterial translation and ribosome standby. RNA Biol. 4:113-117. Van Etten, W.J. and Janssen, G.R. 1998. An AUG initiation codon, not codon-anticodon complementarity, is required for the translation of unleadered mRNA in Escherichia coli. Mol. Microbiol. 27:987-1001. 130 Vesper, O., Amitai, S., Belitsky, M., Byrgazov, K., Kaberdina, A.C., Engelberg-Kulka, H., Moll, I. 2011. Selective translation of leaderless mRNAs by specialized ribosomes generated by MazF in Escherichia coli. Cell. 147: 147-157. Wagner, E.G., Altuvia, S. Romby, P. 2002. Antisense RNAs in bacteria and their genetic elements. Adv. Genet. 46:361-398. Walleczek, J., Albrecht-Ehrlich, R., Stoffler, G., Stoffler-Meilicke, M. 1990. Three-dimensional localization of the NH2- and carboxyl-terminal domain of ribosomal protein S1 on the surface of the 30S subunit from Escherichia coli. J. Biol. Chem. 265: 11339-11344. Wang, F., Xiao, J., Pan, L., Yang, M., Zhang, G., Jin, S., Yu, J. 2008. A systematic survey of miniproteins in bacteria and archaea. PLOS One. 3: e4027. doi: 10.1371/journal.pone.0004027. Wilson, D.N., and Nierhaus, K.H. 2003. The ribosome through the looking glass. Angew. Chem. Int. Ed. Engl. 42:3464-3486. Wilson, D.N., Nierhaus, K.H. 2005. Ribosomal proteins in the spotlight. Crit. Rev. Biochem, Mol. Biol. 40: 243-267. Wilson, DN. 2009. Less is more for leaderless mRNA translation. Mol. Cell. 33:141-142. Wimberly, B., Broderson, D., Clemons, W., Morgan-Warren, R., Carter, A., Vonrhein, C., Hartsch, T., Ramakrishnan, V. 2000. Structure of the 30S ribosomal subunit. Nature. 407:327-339. Yamamoto, H., Wittek, D., Gupta, R., Qin, B., Ueda, T., Krause, R., Yamamoto, K., Albrecht, R., Pech, M., Nierhaus, K.H. 2016. 70S-scanning initiation is a novel and frequent initiation mode of ribosomal translation in bacteria. Proc. Natl. Acad. Sci. USA. 113:E1180-1189. Yan, F., Doronina, V.A., Sharma, P., Brown, J.D. 2010. Orchestrating ribosomal activity from the inside: effects of the nascent chain on the peptidyltransferase centre. Biochem. Soc. Trans. 38: 1576-1580. Yusupova, G., Yusupov, M., Cate, J., Noller H. 2001. The path of the messenger RNA through the ribosome. Cell. 106:233-241. 131 Winkler, W.C. and Breaker, R.R. 2005. Regulation of bacterial gene expression by riboswitches. Annu. Rev. Microbiol. 59:487-517. Zheng, X., Hu, G., She, Z., Zhu, H. 2011. Leaderless genes in bacteria: clue to evolution of translation initiation mechanisms in prokaryotes. BMC Genomics. 12: 361-374. Zuker, M. 2003. Mfold web server for nucleic acid folding and hybridization prediction. Nuc. Acids Res. 31: 4306-3415. 132
© Copyright 2026 Paperzz