RNA-MEDIATED PROGRAMMING OF ACTIVE CHROMATIN A DISSERTATION SUBMITTED TO THE PROGRAM IN CANCER BIOLOGY AND THE COMMITTEE ON GRADUATE STUDIES OF STANFORD UNIVERSITY IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF DOCTOR OF PHILOSOPHY Yul Wonjun Yang August 2012 © 2012 by Yul Wonjun Yang. All Rights Reserved. Re-distributed by Stanford University under license with the author. This work is licensed under a Creative Commons AttributionNoncommercial 3.0 United States License. http://creativecommons.org/licenses/by-nc/3.0/us/ This dissertation is online at: http://purl.stanford.edu/zj169pb5921 ii I certify that I have read this dissertation and that, in my opinion, it is fully adequate in scope and quality as a dissertation for the degree of Doctor of Philosophy. Howard Chang, Primary Adviser I certify that I have read this dissertation and that, in my opinion, it is fully adequate in scope and quality as a dissertation for the degree of Doctor of Philosophy. Paul Khavari I certify that I have read this dissertation and that, in my opinion, it is fully adequate in scope and quality as a dissertation for the degree of Doctor of Philosophy. Seung Kim I certify that I have read this dissertation and that, in my opinion, it is fully adequate in scope and quality as a dissertation for the degree of Doctor of Philosophy. Joanna Wysocka Approved for the Stanford University Committee on Graduate Studies. Patricia J. Gumport, Vice Provost Graduate Education This signature page was generated electronically upon submission of this dissertation in electronic format. An original signed hard copy of the signature page is on file in University Archives. iii iv ABSTRACT Various long noncoding RNAs (lncRNAs) have been recently identified, but their functions remain unknown. To better understand the role of lncRNAs in gene activation, we have characterized a 3.76 kilobase highly conserved lncRNA, called HOTTIP, found at the 5’ distal tip of the HoxA locus. HOTTIP is required for the expression of HoxA9 to HoxA13 in human fibroblasts and chicken embryos. This region spans a linear 40 kilobases, but forms a compact three dimensional DNA structure by high throughput chromosome conformation capture. Knockdown of HOTTIP causes loss of the activating histone H3K4 trimethylation mark, as well as lost occupancy of the MLL methylase complex by chromatin immunoprecipitation coupled with tiling microarray analysis. To cause these changes, HOTTIP binds WDR5, a component of the MLL complex, to alter histone H3K4 trimethylation and gene activation. Together, the characterization of HOTTIP demonstrates an example of lncRNA-mediated epigenetic activation. A close examination of the binding interaction between HOTTIP and WDR5 has further revealed a novel pathway of lncRNA-regulated proteolysis. HOTTIP acts as a “molecular switch,” causing increased WDR5 protein levels by preventing proteasomal degradation through thermodynamic stabilization post poly-ubiquitination. Increased WDR5 deposition then causes gene activation. One HOTTIP RNA binds a single WDR5 protein through a direct RNA-protein interaction, and RNA-mediated stabilization requires a specific HOTTIP RNA domain in a long RNA context. Using a small scale alanine scanning mutagenesis screen, the HOTTIP binding interface on WDR5 has been identified as the cleft between blades 5 and 6. WDR5 mutations that abrogate lncRNA binding cannot be stabilized by HOTTIP, and are defective in gene activation, maintenance of histone H3K4 trimethylation, and embryonic stem cell self renewal. By altering protein turnover, lncRNAs may be able to regulate the temporal landscape of proteins in cells, potentially altering epigenetic states and cellular functions. v ACKNOWLEDGEMENTS First and foremost, I would like to thank my advisor, Howard Chang, for his mentorship, his unflappable demeanor in light of “not so good data,” and countless pages of hand drawn figures to organize experiments. Without his guidance, none of the experiments in this thesis could have occurred. In addition, I am deeply indebted to his patience and understanding for my growth as an individual. Thanks to him, I leave the Chang lab not only as a better scientist, but a more mature person. I also thank Kevin Wang, who was gracious to teach every technique to a rotation student with no prior experience in epigenetics or chromatin. I only truly realized his skill in teaching when I began working with my own rotation students. I thank my collaborators at University of Michigan, Yong Chen, Bingbing Wan, and Ming Lei, who provided vital reagents and data fundamental to my work. I also thank Kun Qu and Jiajing (Jenny) Zhang for their bioinformatics help. I also thank Karla Kirkegaard, Michel Brahic, and J. Antonio Gomez. I am also grateful to the rest of my labmates for reagents, data analysis, bench space, and/or a friendly person to talk to. I would like to thank especially Ryan Corces-Zimmerman, Ryan Flynn, and Angeline Protacio for their endless support and enthusiasm for HOTTIP, as well as Ci Chu, Paul Giresi, Tiffany Hung, Lingjie Li, Nicole Rapicavoli, Robert Spitale, Eduardo Torre, Miao Tsai, Yue Wan, Orly Wapinski, and Grace Zheng for their various contributions and much needed advice. This research was supported by the Stanford Medical Scientist Training Program, who provided both funding and much needed overall support, especially from Seung Kim, PJ Utz, Lorie Langdon, and Moira Louca. Finally, I would like to thank my parents, Sung-Chul and Young-Ok Yang, for encouraging me to do whatever I wanted. My brother, Gene Yang, for being always proud of his little brother, and somehow knowing about epigenetics as a Sociology/ Urban Studies major. I also thank my in-laws, Pete and Kelly Liberda, who always enjoy the fact that my research involves foreskin fibroblasts. Most importantly, I am forever grateful to my wife, Kristine Yang, for her love and caring for me. Without her, the work in this thesis would have never transpired. vi TABLE OF CONTENTS ABSTRACT iv ACKNOWLEDGEMENTS v TABLE OF CONTENTS vi LIST OF ILLUSTRATIONS viii CHAPTER 1: BACKGROUND 1 1.1 Significance of long noncoding RNAs 2 1.2 Introduction to epigenetics 4 1.3 Long noncoding RNAs in epigenetic silencing 5 1.4 Long noncoding RNAs in epigenetic activation 7 1.5 References 8 CHAPTER 2: A LONG NONCODING RNA MAINTAINS ACTIVE CHROMATIN TO COORDINATE HOMEOTIC GENE EXPRESSION 2.1 Abstract 14 2.2 Results 15 2.3 Discussion 30 2.4 Materials and Methods 31 2.5 Supplementary Figures 39 2.7 References 57 CHAPTER 3: LONG NONCODING RNA AS REGULATORY SWITCH OF PROTEIN TURNOVER 3.1 Abstract 62 3.2 Introduction 63 3.3 Results 66 3.4 Discussion 89 3.5 Materials and Methods 93 3.6 Supplementary Figures 101 13 61 vii 3.7 References CHAPTER 4: CONCLUDING REMARKS 106 111 4.1 Conclusion 112 4.2 Future Directions 115 4.3 References 117 viii List of Illustrations Figure 2.1. HOTTIP is a lincRNA transcribed in distal anatomic sites. 16 Figure 2.2. HOTTIP is required for coordinate activation of 5′ HOXA genes. 20 Figure 2.3. HOTTIP RNA is required for the active chromatin state of 5′ HOXA cluster. 24 Figure 2.4. HOTTIP RNA programs active chromatin via WDR5. 27 Figure 2S1. Molecular characterization of HOTTIP. 39 Figure 2S2. Single molecule RNA-fluorescence in situ hybridization (FISH) confirms low copy number of HOTTIP RNA. 41 Figure 2S3. Lack of antisense HOTTIP RNA with HOTTIP knockdown. 42 Figure 2S4. Efficient retroviral infection in developing chick limb buds. 43 Figure 2S5. HOTTIP knockdown decreases expression of 5’ HoxA genes in vivo. 44 Figure 2S6. HOTTIP depletion causes little change to higher-order chromosome configuration. 46 Figure 2S7. ChIP-qPCR validation of HOTTIP-dependence of H3K4me3 occupancy at distal HOXA. 48 Figure 2S8. Zoom-in view of Figure 2.3 highlighting HOTTIP dependence of MLL1 and WDR5 localization to HOXA. 49 Figure 2S9. HOTTIP binds stably expressed FLAG-WDR5 in HeLa cells. 50 Figure 2S10. HOTTIP overexpression does not affect distal HOXA expression. 51 Figure 2S11. Ectopic HOTTIP expression does not activate 5’HOXA genes nor rescue the effects of depleting endogenous nascent HOTTIP. 53 Figure 2S12. HOTTIPExons1-2 binds to WDR5 and acts in a dominant negative manner to inhibit 5’ HOXA gene expression. 55 Figure 2S13. Model of HOTTIP action. HOTTIP is positioned near the active 5’ HOXA genes via chromosomal looping. 56 Figure 3.1. HOTTIP RNA overexpression stabilizes exogenous FLAG-WDR5. 68 ix Figure 3.2. HOTTIP RNA thermodynamically stabilizes WDR5 and does not affect ubiquitination. 72 Figure 3.3. Full length HOTTIP RNA stabilizes WDR5 through specific interactions. 75 Figure 3.4. HOTTIP binds and stabilizes WDR5 through the same binding site as RbBP5. 78 Figure 3.5. HOTTIP binding mutations prevent HOTTIP mediated stabilization of WDR5, as well as show decreased ability to activate target genes in 293T cells. 81 Figure 3.6. WDR5 F266A mutation decreases protein stability and causes defects in mouse embryonic stem cell self renewal. 86 Figure 3.7. WDR5 F266A causes loss of self renewal genes and increased expression of differentiation genes. 88 Figure 3S1. 101 Figure 3S2. Two regions of HOTTIP bind WDR5. 102 Figure 3S3. HOTTIP bases 1953-2453 (Frag D.3) demonstrate equivalent binding affinity to bases 1953-3760 (Frag D). 104 Figure 3S4. 105 1 CHAPTER 1 Background 2 1.1 Significance of long noncoding RNAs With the conclusion of the human genome project, researchers were surprised to find that much of the genome did not appear to encode for proteins (Lander et al., 2001; Venter et al., 2001). Because proteins were deemed to be the “building blocks of life,” researchers at the time dubbed the extensive, nonttranslated regions as the “dark matter” of the genome, or more simply as “junk DNA.” Since then, many groups have been probing this untranslated “dark matter” using traditional Sanger sequencing, or the more high throughput microarray and next generation sequencing technologies (Birney et al., 2007; Cabili et al., 2011; Carninci et al., 2005; Cawley et al., 2004; Core et al., 2008; Okazaki et al., 2002; Preker et al., 2008). While much of the “dark matter” are repetitive elements, regulatory regions, and small RNAs (eg. miRNAs), a surprising proportion is transcribed into long RNA molecules with little or no coding potential. Ranging from several hundred to kilobases in size, these long RNAs lacking coding potential have been called large/long intergenic noncoding RNAs (lincRNAs) or large/long noncoding RNAs (lncRNAs), somewhat interchangably, although the latter term is more inclusive. Many of these lncRNAs are spliced and polyadenylated, and further demonstrate evolutionary stability (Lebenthal and Unger, 2010; Ponjavic et al., 2007). By correlating histone modifications signifying transcription with coding potential, researchers have estimated that lncRNAs may number in the thousands (Guttman et al., 2009; Khalil et al., 2009). In fact, recent research has suggested that most of the genome may actually be transcribed, albeit some regions at very low levels with unknown functionality (Mercer et al., 2012). Over the last several years, many of these lncRNAs have been characterized, revealing important roles in all aspects of cellular function, from gene transcription to posttranslational control. For example, to regulate transcription, a noncoding RNA transcribed from the minor DHFR promoter represses DHFR expression by forming a stable triplex at the major promoter, inhibiting transcription factor binding (Martianov et al., 2007). Similarly, the Alu RNA, transcribed from short interspersed elements (SINEs) throughout the genome in heat shock, binds RNA polymerase II to prevent 3 genome-wide transcription (Mariner et al., 2008). Post transcriptionally, a noncoding RNA has been described that occludes the 5’ splice site of the Zeb2 mRNA, reducing splicing to cause retention of an internal ribosomal entry site (Beltran et al., 2008). Likewise regulating mRNA levels, ½-sbsRNAs pair with Alu repeats in 3’UTRs, to cause STAU1-mediated RNA decay (Gong and Maquat, 2011). At the protein level, the noncoding RNA NRON regulates NFAT subcellular localization (Willingham et al., 2005). Other noncoding RNAs have been found to regulate localization of splicing factors and active/silent chromatin domains (Tripathi et al., 2010; Yang et al., 2011). LncRNAs have also been found to be associated with disease processes. For example, in heart disease, expression of the antisense noncoding RNA in the INK4 locus (ANRIL) has been found to be associated with increased atherosclerosis (Holdt et al., 2010), and expression of the noncoding RNA MIAT correlates with increased risk of myocardial infarction (Ishii et al., 2006). In another example, the stress-induced BACE1-AS noncoding RNA is overexpressed in Alzheimer’s disease, where it forms an RNA duplex with and stabilizes BACE1, resulting in a feed-forward progression of the amyloid cascade (Faghihi et al., 2008). In cancer, ceRNAs (competitive endogenous RNAs) bind to and reduce miRNAs, thus regulating tumor suppressor genes (eg. PTEN) and proto-oncogenes (eg. KRAS) (Poliseno et al., 2010). Multiple other long noncoding RNAs have been reported in malignancies of the colon (Pibouin et al., 2002), prostate (Bussemakers et al., 1999; Fu et al., 2006), breast (Guffanti et al., 2009), and liver (Lin et al., 2007), as well as in leukemia (Calin et al., 2007; Yu et al., 2008), melanoma (Pasmant et al., 2007), and in metastasis-prone lung and breast cancers (Gupta et al., 2010; Ji et al., 2003). 4 1.2 Introduction to epigenetics Epigenetics refers to inheritable changes occurring through mechanisms other than DNA sequence alterations. While potentially an expansive definition, much of today’s epigenetics focuses on two changes in the DNA state: 1) methylation at CpG sites on the DNA molecule itself and 2) chromatin remodeling by covalent histone modifications. Both DNA methylation and histone modifications are faithfully transmitted from mother to daughter cells. Thus, the state of gene expression can be transmitted as well. For example, CpG methylation, H3K9 methylation, and H3K27 methylation indicate silenced genes. In contrast, histone acetylation, H3K4 methylation, and lack of DNA methylation mark active genes. As the field progresses, more diverse histone modifications are being described and characterized, such as propionylation, sumoylation, crotonylation, citrullination, butyrylation, proline isomerization, ADP ribosylation, formylation, phosphorylation, ubiquylation, and hydroxylation (Tan et al., 2011). Together, these modifications generate a “histone code,” which denotes the DNA state of particular loci (Campos and Reinberg, 2009). Current research is ongoing to determine the functionality of each of these marks, and their relative contributions to particular gene states. 5 1.3 Long noncoding RNAs in epigenetic silencing LncRNAs have been found to play major roles in silencing genes by altering DNA methylation and histone modifications. For example, transcription of antisense RNA to HBA2 causes CpG methylation and gene silencing, resulting in thalassemia (Tufarelli et al., 2003). Similarly, the pRNA (promoter-associated RNA) form DNARNA triplexes that recruit the DNA methyltransferase DNMT3b for similar methylation-mediated silencing (Schmitz et al., 2010). As an example in regulating histone modifications, the Air noncoding RNA recruits the histone methyltransferase G9a to the Slc22a2 promoter to cause H3K9 methylation and transcriptional repression in the placenta (Nagano et al., 2008). Silencing can also occur by inhibiting the addition of histone activating marks. DNA damage-induced noncoding RNAs activate the TLS protein to bind and inhibit the CBP/p300 histone acetyltransferase activity on the Cyclin D1 locus, thus reducing gene expression (Wang et al., 2008). In the best understood example of lncRNA-mediated epigenetic control, other lncRNAs silence genes by recruiting the PRC2 methyltransferase complex to cause H3K27 trimethylation. In females, the long noncoding RNA Xist binds the PRC2 complex and recruits it to silence one of the two copies of the X chromosome in dosage compensation (Morey and Avner, 2011). Likewise, the Kcnq1ot1 RNA recruits the PRC2 complex as well as the H3K9 methyltransferase G9a to silence the Kcnq1 domain in the placenta (Pandey et al., 2008), and a similar interaction has been found to regulate p15/INK4B (Kotake et al., 2011). In addition, HOTAIR, a lncRNA expressed only from distal and posterior anatomic sites, acts as a molecular scaffold, simultaneously binding the H3K27 methyltransferase complex PRC2 and H3K4 demethyltransferase complex LSD1/CoREST/REST (Tsai et al., 2010). It then brings these proteins to silence target genes in trans by two mechanisms: 1) adding the H3K27 trimethyl silencing mark and 2) demethylating the H3K4 dimethyl activating mark. In cancer, HOTAIR can silence metastasis suppressor genes – overexpression of HOTAIR in epithelial cancer cells causes increased invasiveness and metastasis (Gupta et al., 2010). Additional PRC2-binding lncRNAs have also been characterized, such as COLDAIR (Heo and Sung, 2011), as well as other candidates genome-wide 6 (Guil et al., 2012; Khalil et al., 2009; Zhao et al., 2010), suggesting a common mechanism of lncRNA-mediated gene silencing. 7 1.4 Long noncoding RNAs in epigenetic activation While lncRNAs involved in gene silencing have been well characterized, their role in gene activation is less understood. As the best studied example, the lncRNAs roX1 and roX2 recruit the MSL complex to the single X chromosome in male Drosophila dosage compensation, causing H4K16 acetylation and gene activation (Deng and Meller, 2006). Also in Drosophila, noncoding RNA transcription from trithorax response elements correlates with activation of entire regulatory domains in segmental development, possibly by noncoding RNA-mediated recruitment of the H3K4 methyltransferase Ash1 (Petruk et al., 2006; Sanchez-Elsner et al., 2006). Research has also shown a role for activating lncRNAs in mammals. Recently, lncRNAs from various cell types have been found to display enhancer-like functions (Orom et al., 2010). siRNA-mediated knockdown of these lncRNAs causes downregulation of nearby genes, and these enhancer-like lncRNAs can upregulate nearby luciferase reporter genes in cis in reporter assays. As an individual example, the lncRNA HOTAIRM1 has been found to be required to maintain expression of nearby myelopoiesis-related HoxA genes (Zhang et al., 2009). Also in the Hox loci, the Mistral RNA (Mira) has been described to bind and recruit MLL to activate transcription of HoxA6-HoxA7 (Bertani et al., 2011). Finally, multiple lines of evidence have shown correlation between lncRNA transcription and nearby coding gene activation, with noncoding RNAs arising from enhancer elements and conserved regions (Kim et al., 2010; Ponjavic et al., 2009; Rada-Iglesias et al., 2011). Because of their extensive involvement in all aspects of cellular function, a better understanding of lncRNAs would provide valuable in explaining disease and developmental processes. The body of work in this dissertation focuses on the initial discovery and ongoing characterization of the novel lncRNA HOTTIP, and the role of lncRNAs in WDR5-mediated epigenetic activation. 8 1.5 References Beltran, M., Puig, I., Pena, C., Garcia, J.M., Alvarez, A.B., Pena, R., Bonilla, F., and de Herreros, A.G. (2008). A natural antisense transcript regulates Zeb2/Sip1 gene expression during Snail1-induced epithelial-mesenchymal transition. Genes Dev 22, 756-769. Bertani, S., Sauer, S., Bolotin, E., and Sauer, F. (2011). The noncoding RNA Mistral activates Hoxa6 and Hoxa7 expression and stem cell differentiation by recruiting MLL1 to chromatin. Mol Cell 43, 1040-1046. Birney, E., Stamatoyannopoulos, J.A., Dutta, A., Guigo, R., Gingeras, T.R., Margulies, E.H., Weng, Z., Snyder, M., Dermitzakis, E.T., Thurman, R.E., et al. (2007). Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature 447, 799-816. Bussemakers, M.J., van Bokhoven, A., Verhaegh, G.W., Smit, F.P., Karthaus, H.F., Schalken, J.A., Debruyne, F.M., Ru, N., and Isaacs, W.B. (1999). DD3: a new prostate-specific gene, highly overexpressed in prostate cancer. Cancer Res 59, 5975-5979. Cabili, M.N., Trapnell, C., Goff, L., Koziol, M., Tazon-Vega, B., Regev, A., and Rinn, J.L. (2011). Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses. Genes Dev 25, 1915-1927. Calin, G.A., Liu, C.G., Ferracin, M., Hyslop, T., Spizzo, R., Sevignani, C., Fabbri, M., Cimmino, A., Lee, E.J., Wojcik, S.E., et al. (2007). Ultraconserved regions encoding ncRNAs are altered in human leukemias and carcinomas. Cancer Cell 12, 215-229. Campos, E.I., and Reinberg, D. (2009). Histones: annotating chromatin. Annu Rev Genet 43, 559-599. Carninci, P., Kasukawa, T., Katayama, S., Gough, J., Frith, M.C., Maeda, N., Oyama, R., Ravasi, T., Lenhard, B., Wells, C., et al. (2005). The transcriptional landscape of the mammalian genome. Science 309, 1559-1563. Cawley, S., Bekiranov, S., Ng, H.H., Kapranov, P., Sekinger, E.A., Kampa, D., Piccolboni, A., Sementchenko, V., Cheng, J., Williams, A.J., et al. (2004). Unbiased mapping of transcription factor binding sites along human chromosomes 21 and 22 points to widespread regulation of noncoding RNAs. Cell 116, 499-509. Core, L.J., Waterfall, J.J., and Lis, J.T. (2008). Nascent RNA sequencing reveals widespread pausing and divergent initiation at human promoters. Science 322, 1845-1848. Deng, X., and Meller, V.H. (2006). Non-coding RNA in fly dosage compensation. Trends Biochem Sci 31, 526-532. Faghihi, M.A., Modarresi, F., Khalil, A.M., Wood, D.E., Sahagan, B.G., Morgan, T.E., Finch, C.E., St Laurent, G., 3rd, Kenny, P.J., and Wahlestedt, C. (2008). Expression of a noncoding RNA is elevated in Alzheimer's disease and drives rapid feed-forward regulation of beta-secretase. Nat Med 14, 723-730. Fu, X., Ravindranath, L., Tran, N., Petrovics, G., and Srivastava, S. (2006). Regulation of apoptosis by a prostate-specific and prostate cancer-associated noncoding gene, PCGEM1. DNA Cell Biol 25, 135-141. 9 Gong, C., and Maquat, L.E. (2011). lncRNAs transactivate STAU1-mediated mRNA decay by duplexing with 3' UTRs via Alu elements. Nature 470, 284-288. Guffanti, A., Iacono, M., Pelucchi, P., Kim, N., Solda, G., Croft, L.J., Taft, R.J., Rizzi, E., Askarian-Amiri, M., Bonnal, R.J., et al. (2009). A transcriptional sketch of a primary human breast cancer by 454 deep sequencing. BMC Genomics 10, 163. Guil, S., Soler, M., Portela, A., Carrere, J., Fonalleras, E., Gomez, A., Villanueva, A., and Esteller, M. (2012). Intronic RNAs mediate EZH2 regulation of epigenetic targets. Nat Struct Mol Biol. Gupta, R.A., Shah, N., Wang, K.C., Kim, J., Horlings, H.M., Wong, D.J., Tsai, M.C., Hung, T., Argani, P., Rinn, J.L., et al. (2010). Long non-coding RNA HOTAIR reprograms chromatin state to promote cancer metastasis. Nature 464, 1071-1076. Guttman, M., Amit, I., Garber, M., French, C., Lin, M.F., Feldser, D., Huarte, M., Zuk, O., Carey, B.W., Cassady, J.P., et al. (2009). Chromatin signature reveals over a thousand highly conserved large non-coding RNAs in mammals. Nature 458, 223-227. Heo, J.B., and Sung, S. (2011). Vernalization-mediated epigenetic silencing by a long intronic noncoding RNA. Science 331, 76-79. Holdt, L.M., Beutner, F., Scholz, M., Gielen, S., Gabel, G., Bergert, H., Schuler, G., Thiery, J., and Teupser, D. (2010). ANRIL expression is associated with atherosclerosis risk at chromosome 9p21. Arterioscler Thromb Vasc Biol 30, 620-627. Ishii, N., Ozaki, K., Sato, H., Mizuno, H., Saito, S., Takahashi, A., Miyamoto, Y., Ikegawa, S., Kamatani, N., Hori, M., et al. (2006). Identification of a novel non-coding RNA, MIAT, that confers risk of myocardial infarction. J Hum Genet 51, 1087-1099. Ji, P., Diederichs, S., Wang, W., Boing, S., Metzger, R., Schneider, P.M., Tidow, N., Brandt, B., Buerger, H., Bulk, E., et al. (2003). MALAT-1, a novel noncoding RNA, and thymosin beta4 predict metastasis and survival in early-stage nonsmall cell lung cancer. Oncogene 22, 8031-8041. Khalil, A.M., Guttman, M., Huarte, M., Garber, M., Raj, A., Rivea Morales, D., Thomas, K., Presser, A., Bernstein, B.E., van Oudenaarden, A., et al. (2009). Many human large intergenic noncoding RNAs associate with chromatinmodifying complexes and affect gene expression. Proc Natl Acad Sci U S A 106, 11667-11672. Kim, T.K., Hemberg, M., Gray, J.M., Costa, A.M., Bear, D.M., Wu, J., Harmin, D.A., Laptewicz, M., Barbara-Haley, K., Kuersten, S., et al. (2010). Widespread transcription at neuronal activity-regulated enhancers. Nature 465, 182-187. Kotake, Y., Nakagawa, T., Kitagawa, K., Suzuki, S., Liu, N., Kitagawa, M., and Xiong, Y. (2011). Long non-coding RNA ANRIL is required for the PRC2 recruitment to and silencing of p15(INK4B) tumor suppressor gene. Oncogene 30, 1956-1962. Lander, E.S., Linton, L.M., Birren, B., Nusbaum, C., Zody, M.C., Baldwin, J., Devon, K., Dewar, K., Doyle, M., FitzHugh, W., et al. (2001). Initial sequencing and 10 analysis of the human genome. Nature 409, 860-921. Lebenthal, I., and Unger, R. (2010). Computational evidence for functionality of noncoding mouse transcripts. Genomics 96, 10-16. Lin, R., Maeda, S., Liu, C., Karin, M., and Edgington, T.S. (2007). A large noncoding RNA is a marker for murine hepatocellular carcinomas and a spectrum of human carcinomas. Oncogene 26, 851-858. Mariner, P.D., Walters, R.D., Espinoza, C.A., Drullinger, L.F., Wagner, S.D., Kugel, J.F., and Goodrich, J.A. (2008). Human Alu RNA is a modular transacting repressor of mRNA transcription during heat shock. Mol Cell 29, 499-509. Martianov, I., Ramadass, A., Serra Barros, A., Chow, N., and Akoulitchev, A. (2007). Repression of the human dihydrofolate reductase gene by a non-coding interfering transcript. Nature 445, 666-670. Mercer, T.R., Gerhardt, D.J., Dinger, M.E., Crawford, J., Trapnell, C., Jeddeloh, J.A., Mattick, J.S., and Rinn, J.L. (2012). Targeted RNA sequencing reveals the deep complexity of the human transcriptome. Nat Biotechnol 30, 99-104. Morey, C., and Avner, P. (2011). The demoiselle of X-inactivation: 50 years old and as trendy and mesmerising as ever. PLoS Genet 7, e1002212. Nagano, T., Mitchell, J.A., Sanz, L.A., Pauler, F.M., Ferguson-Smith, A.C., Feil, R., and Fraser, P. (2008). The Air noncoding RNA epigenetically silences transcription by targeting G9a to chromatin. Science 322, 1717-1720. Okazaki, Y., Furuno, M., Kasukawa, T., Adachi, J., Bono, H., Kondo, S., Nikaido, I., Osato, N., Saito, R., Suzuki, H., et al. (2002). Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs. Nature 420, 563-573. Orom, U.A., Derrien, T., Beringer, M., Gumireddy, K., Gardini, A., Bussotti, G., Lai, F., Zytnicki, M., Notredame, C., Huang, Q., et al. (2010). Long noncoding RNAs with enhancer-like function in human cells. Cell 143, 46-58. Pandey, R.R., Mondal, T., Mohammad, F., Enroth, S., Redrup, L., Komorowski, J., Nagano, T., Mancini-Dinardo, D., and Kanduri, C. (2008). Kcnq1ot1 antisense noncoding RNA mediates lineage-specific transcriptional silencing through chromatin-level regulation. Mol Cell 32, 232-246. Pasmant, E., Laurendeau, I., Heron, D., Vidaud, M., Vidaud, D., and Bieche, I. (2007). Characterization of a germ-line deletion, including the entire INK4/ARF locus, in a melanoma-neural system tumor family: identification of ANRIL, an antisense noncoding RNA whose expression coclusters with ARF. Cancer Res 67, 3963-3969. Petruk, S., Sedkov, Y., Riley, K.M., Hodgson, J., Schweisguth, F., Hirose, S., Jaynes, J.B., Brock, H.W., and Mazo, A. (2006). Transcription of bxd noncoding RNAs promoted by trithorax represses Ubx in cis by transcriptional interference. Cell 127, 1209-1221. Pibouin, L., Villaudy, J., Ferbus, D., Muleris, M., Prosperi, M.T., Remvikos, Y., and Goubin, G. (2002). Cloning of the mRNA of overexpression in colon carcinoma-1: a sequence overexpressed in a subset of colon carcinomas. Cancer Genet Cytogenet 133, 55-60. Poliseno, L., Salmena, L., Zhang, J., Carver, B., Haveman, W.J., and Pandolfi, P.P. 11 (2010). A coding-independent function of gene and pseudogene mRNAs regulates tumour biology. Nature 465, 1033-1038. Ponjavic, J., Oliver, P.L., Lunter, G., and Ponting, C.P. (2009). Genomic and transcriptional co-localization of protein-coding and long non-coding RNA pairs in the developing brain. PLoS Genet 5, e1000617. Ponjavic, J., Ponting, C.P., and Lunter, G. (2007). Functionality or transcriptional noise? Evidence for selection within long noncoding RNAs. Genome Res 17, 556-565. Preker, P., Nielsen, J., Kammler, S., Lykke-Andersen, S., Christensen, M.S., Mapendano, C.K., Schierup, M.H., and Jensen, T.H. (2008). RNA exosome depletion reveals transcription upstream of active human promoters. Science 322, 1851-1854. Rada-Iglesias, A., Bajpai, R., Swigut, T., Brugmann, S.A., Flynn, R.A., and Wysocka, J. (2011). A unique chromatin signature uncovers early developmental enhancers in humans. Nature 470, 279-283. Sanchez-Elsner, T., Gou, D., Kremmer, E., and Sauer, F. (2006). Noncoding RNAs of trithorax response elements recruit Drosophila Ash1 to Ultrabithorax. Science 311, 1118-1123. Schmitz, K.M., Mayer, C., Postepska, A., and Grummt, I. (2010). Interaction of noncoding RNA with the rDNA promoter mediates recruitment of DNMT3b and silencing of rRNA genes. Genes Dev 24, 2264-2269. Tan, M., Luo, H., Lee, S., Jin, F., Yang, J.S., Montellier, E., Buchou, T., Cheng, Z., Rousseaux, S., Rajagopal, N., et al. (2011). Identification of 67 histone marks and histone lysine crotonylation as a new type of histone modification. Cell 146, 1016-1028. Tripathi, V., Ellis, J.D., Shen, Z., Song, D.Y., Pan, Q., Watt, A.T., Freier, S.M., Bennett, C.F., Sharma, A., Bubulya, P.A., et al. (2010). The nuclear-retained noncoding RNA MALAT1 regulates alternative splicing by modulating SR splicing factor phosphorylation. Mol Cell 39, 925-938. Tsai, M.C., Manor, O., Wan, Y., Mosammaparast, N., Wang, J.K., Lan, F., Shi, Y., Segal, E., and Chang, H.Y. (2010). Long noncoding RNA as modular scaffold of histone modification complexes. Science 329, 689-693. Tufarelli, C., Stanley, J.A., Garrick, D., Sharpe, J.A., Ayyub, H., Wood, W.G., and Higgs, D.R. (2003). Transcription of antisense RNA leading to gene silencing and methylation as a novel cause of human genetic disease. Nat Genet 34, 157165. Venter, J.C., Adams, M.D., Myers, E.W., Li, P.W., Mural, R.J., Sutton, G.G., Smith, H.O., Yandell, M., Evans, C.A., Holt, R.A., et al. (2001). The sequence of the human genome. Science 291, 1304-1351. Wang, X., Arai, S., Song, X., Reichart, D., Du, K., Pascual, G., Tempst, P., Rosenfeld, M.G., Glass, C.K., and Kurokawa, R. (2008). Induced ncRNAs allosterically modify RNA-binding proteins in cis to inhibit transcription. Nature 454, 126130. Willingham, A.T., Orth, A.P., Batalov, S., Peters, E.C., Wen, B.G., Aza-Blanc, P., Hogenesch, J.B., and Schultz, P.G. (2005). A strategy for probing the function 12 of noncoding RNAs finds a repressor of NFAT. Science 309, 1570-1573. Yang, L., Lin, C., Liu, W., Zhang, J., Ohgi, K.A., Grinstein, J.D., Dorrestein, P.C., and Rosenfeld, M.G. (2011). ncRNA- and Pc2 methylation-dependent gene relocation between nuclear structures mediates gene activation programs. Cell 147, 773-788. Yu, W., Gius, D., Onyango, P., Muldoon-Jacobs, K., Karp, J., Feinberg, A.P., and Cui, H. (2008). Epigenetic silencing of tumour suppressor gene p15 by its antisense RNA. Nature 451, 202-206. Zhang, X., Lian, Z., Padden, C., Gerstein, M.B., Rozowsky, J., Snyder, M., Gingeras, T.R., Kapranov, P., Weissman, S.M., and Newburger, P.E. (2009). A myelopoiesis-associated regulatory intergenic noncoding RNA transcript within the human HOXA cluster. Blood 113, 2526-2534. Zhao, J., Ohsumi, T.K., Kung, J.T., Ogawa, Y., Grau, D.J., Sarma, K., Song, J.J., Kingston, R.E., Borowsky, M., and Lee, J.T. (2010). Genome-wide identification of polycomb-associated RNAs by RIP-seq. Mol Cell 40, 939-953. 13 CHAPTER 2 A long noncoding RNA maintains active chromatin to coordinate homeotic gene expression This chapter was published in Wang, K.C., Yang, Y.W.*, Liu, B.*, Sanyal, A., Corces-Zimmerman, R., Chen, Y., Lajoie, B.R., Protacio, A., Flynn, R.A., Gupta, R.A., Wysocka J., Lei M., Dekker J., Helms J.A., and Chang H.Y. (2011). A long noncoding RNA maintains active chromatin to coordinate homeotic gene expression. Nature 472, 120-124. *These authors contributed equally to this work. Y.W. Yang’s primary contributions to this publication were portions of the control regions (Figure 2.2B), chick HOTTIP shRNA screen (later used in Figure 2.2C-F, 2S4-2S5), GST-tagged protein pull down assays (Figure 2.4A-B, 2S12A), antibody screen (later used for Figure 2.4C), WDR5 siRNA experiments (Figure 2.4D), strand specific qPCR (Figure 2S1B, 2S3A), titrated qRT-PCR (Figure 2S2C), tagged WDR5 RNA-immunoprecipitation assays (Figure 2S9A-B), and generation of HOTTIP and HOTTIP fragment overexpression cell lines (Figure 2S10, 2S11, 2S12B). Y.W. Yang also had a minor role in preparing and writing the manuscript. 14 2.1 Abstract The genome is extensively transcribed into long intergenic noncoding RNAs (lincRNAs), many of which are implicated in gene silencing (Mercer et al., 2009; Ponting et al., 2009). Potential roles of lincRNAs in gene activation are much less understood (Dinger et al., 2008; Petruk et al., 2006; Sanchez-Elsner et al., 2006). Development and homeostasis require coordinate regulation of neighbouring genes through a process termed locus control (Dean, 2006). Some locus control elements and enhancers transcribe lincRNAs (Ashe et al., 1997; De Santa et al., 2010; Kim et al., 2010; Orom et al., 2010), hinting at possible roles in long-range control. In vertebrates, 39 Hox genes, encoding homeodomain transcription factors critical for positional identity, are clustered in four chromosomal loci; the Hox genes are expressed in nested anterior-posterior and proximal-distal patterns colinear with their genomic position from 3′ to 5′of the cluster (Chang, 2009). Here we identify HOTTIP, a lincRNA transcribed from the 5′ tip of the HOXA locus that coordinates the activation of several 5′ HOXA genes in vivo. Chromosomal looping brings HOTTIP into close proximity to its target genes. HOTTIP RNA binds the adaptor protein WDR5 directly and targets WDR5/MLL complexes across HOXA, driving histone H3 lysine 4 trimethylation and gene transcription. Induced proximity is necessary and sufficient for HOTTIP RNA activation of its target genes. Thus, by serving as key intermediates that transmit information from higher order chromosomal looping into chromatin modifications, lincRNAs may organize chromatin domains to coordinate long-range gene activation. 15 2.2 Results We examined chromosome structure and histone modifications in human primary fibroblasts derived from several anatomic sites (Rinn et al., 2007), and found distinctive differences in the HOXA locus. High throughput chromosome conformation capture (5C) (Dostie et al., 2006) across HOXA revealed that its higher order structure is dependent on positional identity. In anatomically distal cells (for example, foreskin and foot fibroblasts), we detected abundant chromatin interactions within the transcriptionally active 5′ HOXA locus (with reference to the directions of transcription of constituent Hox genes), pointing to a compact and looped conformation. In contrast, no long-range chromatin interactions are detected within the transcriptionally silent 3′ HOXA which seems largely linear (Figure 2.1A). Strikingly, anatomically proximal cells (for example, lung fibroblasts) have the diametrically opposite pattern. The ON and OFF states of Hox and other key developmental genes are maintained by the MLL/Trithorax (Trx) and polycomb group (PcG) proteins, which mediate trimethylation of histone H3 lysine 4 (H3K4me3) to activate genes or lysine 27 (H3K27me3) to repress genes (Schuettengruber et al., 2007). The portions of HOXA in tight physical interaction are marked by broad domains of H3K4me3, whereas H3K27me3 marks the physically extended and transcriptional silent regions (Figure 2.1A). 16 Figure 2.1. HOTTIP is a lincRNA transcribed in distal anatomic sites. a, Chromatin state map of distal versus proximal cells. Top panels, chromosome conformation capture-carbon copy (5C) analysis of distal (foreskin) and proximal (lung) human fibroblasts. Heat map representations (generated by my5C, ref. (Lajoie et al., 2009)) of 5C data (bin size 30 kb, step size 3 kb) for HOXA in foreskin and lung fibroblasts. Red intensity of each pixel indicates relative interaction between the two points on the 17 genomic coordinates. The diagonal represents frequent cis interactions between regions located in close proximity along the linear genome. 5C signals that are away from the diagonal represent long-range looping interactions. Bottom panels, chromatin occupancy across HOXA. x-axis is genomic coordinate; y-axis depicts occupancy of the indicated histone marks or protein (ChIP/input). Box and arrows highlight chromatin states of HOTTIP gene. b, HOTTIP RNA expression in primary human fibroblasts from 11 anatomic sites. Means ± s.d. are shown (n = 2). c, In situ hybridization of HOTTIP RNA in E13.5 mouse embryo. 18 On the very 5′ and 3′ edges of the two respective interaction clusters are two lincRNA loci that exhibit distinct chromatin modifications. The 3′element has been previously identified as the myelopoiesis-associated lincRNA HOTAIRM1 (Zhang et al., 2009). The 5′ element, for which we suggest the name HOTTIP for ‘HOXA transcript at the distal tip’, exhibits bivalent H3K4me3 and H3K27me3, a histone modification pattern associated with poised regulatory sequences (Bernstein et al., 2006). Comparison with RNA polymerase II occupancy and RNA expression showed that the bivalent H3K4me3 and H3K27me3 modifications on HOTTIP gene do not require HOTTIP transcription, but transcription of HOTTIP is associated with increased H3K4me3 and decreased H3K27me3 (Figure 2.1A, left). Chromatin immunoprecipitation (ChIP) analysis confirmed that the HOTTIP gene is occupied by both polycomb repressive complex 2 (PRC2) and MLL complex, consistent with the bivalent histone marks (Figure 2S1A). HOTTIP transcription yields a 3,764-nucleotide, spliced and polyadenylated lincRNA that initiates ~330 bases upstream of HOXA13. Only the strand antisense to HOXA genes is transcribed (Figure 2S1B). Genes near the 5′ end of each HOX cluster tend to be expressed in more posterior and/or distal anatomical locations. Consistent with its genomic location 5′ to HOXA13, HOTTIP is expressed in distal and/or posterior anatomic sites (Figure 2.1B). In situ hybridization of developing mouse and chick embryos confirmed that HOTTIP is expressed in posterior and distal sites in vivo, indicating a conserved expression pattern from development to adulthood (Figure 2.1C and Figure 2S1C). Even in distal cells where HOTTIP is expressed, its RNA level is very low and estimated to be ~0.3 copies per cell (Figure 2S2). We employed small interfering RNAs (siRNAs) to knock down HOTTIP RNA in fibroblasts from a distal anatomic site (foreskin), and examined expression of 5′ HOXA genes by quantitative reverse transcription PCR. Notably, HOTTIP RNA knockdown abrogated expression of distal HOXA genes across 40 kilobases with a trend dependent on the distance to HOTTIP. The strongest blockade was observed for HOXA13 and HOXA11, with progressively less severe effects on HOXA10, HOXA9 and HOXA7 (Figure 2.2A). The effect on gene transcription appeared to be 19 unidirectional, as there were no appreciable changes in the levels of EVX1, located ~40 kilobases 5′ of the HOXA cluster (data not shown). HOTTIP knockdown did not affect expression of the highly homologous HOXD genes, other control genes, nor induce antisense transcription at its own locus (Figure 2.2B, Figure 2S3A). Several independent siRNAs targeting HOTTIP yielded similar results (Figure 2S3B). These results indicate that HOTTIP RNA is necessary to coordinate activation of 5′ HOXA genes. 20 Figure 2.2. HOTTIP is required for coordinate activation of 5′ HOXA genes. a, b, Knockdown of HOTTIP RNA abrogates expression of 5′ HOXA genes in foreskin fibroblasts (a), but not HOXD or BID genes (b). Means + s.d. are shown (n = 3). GFP, green fluorescent protein. c, Schematic of chick RNAi experiment. d, HOTTIP RNA is required for 5′ HoxA gene expression in vivo. RT–PCR of the indicated genes from control or HOTTIP-depleted distal limb bud is shown; quantification and normalization by Actin signal is shown below each band. GAG signal confirms successful retroviral transduction in all cases. e, In situ hybridization of 5′ HoxA genes 21 in chick limb buds. Arrowheads highlight distal domains of high HoxA gene expression that are affected by HOTTIP knockdown. f, Shortening of distal bony elements in HOTTIP-depleted forelimbs. Alcian blue staining highlights the skeletal elements. Red and purple lines highlight radius and 3rd digit lengths, respectively. 22 We next addressed the function of HOTTIP RNA in vivo in the developing chick limb bud (Figure 2.2C). Whereas prior genetic studies of noncoding RNAs (ncRNAs) involved deletion or insertion into the gene locus (Kmita et al., 2002), we wished to distinguish the functions of HOTTIP RNA from its corresponding DNA element. HOTTIP gene can nucleate H3K4 and H3K27 methylation independent of transcription (Figure 2.1A), and the precise genomic distance between upstream enhancer elements and Hox genes is critical for their proper colinear activation (Kmita et al., 2002). Therefore, we used RNA interference (RNAi) in chick embryos, where replication-competent retroviruses can deliver short-hairpin RNAs (shRNAs) with high penetrance and precise spatiotemporal control (Harpavat and Cepko, 2006) (Figure 2S4). In the limb bud, 5′ HoxA genes are transcribed in a nested pattern along the proximal-distal axis (Nelson et al., 1996). In this tissue, HoxA function is highly redundant with that of the HoxD locus, which allowed us to assess altered HoxA expression patterns without major changes in anatomic landmarks (Kmita et al., 2005). We injected retroviruses carrying shRNAs against chick HOTTIP into upper limb buds of stage 13 chicks; RT–PCR and in situ hybridization were performed on both control and knockdown samples after 2–4 days. Knockdown of HOTTIP RNA by two independent shRNAs in limb buds decreased expression of HoxA13, HoxA11 and HoxA10—again with a graded impact depending on genomic proximity to HOTTIP gene. Vector control or an shRNA that fails to deplete HOTTIP RNA had little effect on Hox gene expression (Figure 2.2D). In situ hybridization on whole embryos (Figure 2.2E) and sections (Figure 2S5) revealed that HOTTIP RNA most strongly affects HoxA gene expression at the distal edge of the developing limb bud, where the 5′ HoxA genes are most strongly expressed. By stage 36, limbs depleted of HOTTIP RNA showed notable shortening and bending of distal bony elements, including the radius, ulna and third digit (~20% length reduction for each compared to contralateral and stage-matched limbs treated with control virus, P < 0.05, Student’s t-test, Figure 2.2F). This phenotype resembled some of the defects in mice lacking HoxA11 and HoxA13 (Davis et al., 1995; Fromental-Ramain et al., 1996; Small and Potter, 1993). 23 Together, these data indicate that HOTTIP RNA controls activation of distal Hox genes in vivo. The broad impact of HOTTIP RNA on gene activation across the HOXA locus is reminiscent of the broad domains of chromatin modifications demarcating active and silent chromosomal domains (Rinn et al., 2007). 5C analysis of control and HOTTIP-depleted cells showed little change in higher order chromosomal structure, indicating that the chromosomal looping is pre-configured and upstream of gene expression (Figure 2S6A). In contrast, HOTTIP RNA knockdown led to broad loss of H3K4me3 and H3K4me2 across the HOXA locus, most prominently over 5′ HOXA and HOTTIP gene itself (Figure 2.3A, Figure 2S6B-7). HOTTIP RNA knockdown also increased H3K27me3 focally over HOTTIP gene, but had little impact on H3K27me3 across HOXA. These results indicate that HOTTIP RNA is required for maintenance of H3K4me3 across the HOXA. These findings also imply that loss of 5′ HOXA gene transcription upon HOTTIP RNA knockdown is likely to be due to loss of H3K4me3 (or other changes) rather than ectopic spread of H3K27me3. 24 Figure 2.3. HOTTIP RNA is required for the active chromatin state of 5′ HOXA cluster. a, Knockdown of HOTTIP RNA broadly decreases H3K4me3 across 5′ HOXA locus but focally affects H3K27me3 at HOTTIP gene. Display is as in Figure 2.1A. b, Knockdown of HOTTIP RNA abrogates peaks of MLL1 and WDR5 occupancy near TSSs of 5′ HOXA genes and leads to accumulation of these proteins at HOTTIP gene itself. Arrows highlight peaks of MLL1 and WDR5 occupancy; open arrowheads highlight chromatin state of HOTTIP gene upon HOTTIP RNA knockdown. 25 H3K4 methylation of the HOX loci is carried out by the MLL family of complexes (Ruthenburg et al., 2007). In mammals, at least six MLL family members of SET-domain-containing lysine methyltransferases interact with a core complex of WDR5, ASH2L, RBBP5, as well as with other proteins, for substrate recognition and genomic targeting (Ruthenburg et al., 2007). Genetic analyses indicate that MLL1 and 2 are most essential for HOX gene expression in fibroblasts (Wang et al., 2009), and MLL1 in particular is recruited to promoters of HOX genes to maintain their activation states (Guenther et al., 2005). In distally-derived human fibroblasts, MLL1 and WDR5 densely occupied extended region of the 5′ HOXA cluster, coincident with the H3K4me3 domain, with specific ‘peaks’ of occupancy near the transcriptional start sites (TSS) of multiple 5′ HOXA genes (Figure 2.3B). Strikingly, HOTTIP RNA knockdown abrogated the peaks of MLL1 and WDR5 occupancy near TSS, resulting in diffuse and less intense binding of MLL1 and WDR5 across HOXA cluster, most prominently over the 5′ HOXA domain. HOTTIP RNA knockdown also led to increased accumulation of MLL1 and WDR5 on HOTTIP gene itself (Figure 2S8). Thus, HOTTIP RNA seems critical for maintaining a specific pattern of MLL complex occupancy across the HOXA locus to facilitate H3K4me3 and active transcription. To define the molecular link between HOTTIP RNA and MLL complex, we reasoned that HOTTIP RNA may physically interact with one or more subunits of the MLL complex. Purified, in-vitro-transcribed, full-length HOTTIP RNA bound specifically to recombinant glutathione-S-transferase-conjugated WDR5 (GST– WDR5), but not to GST, RBBP5, ASH2L, or the telomeric protein TRF1 (also known as TERF1; Figure 2.4A-B). The C terminus of MLL1, containing the SET domain, bound non-specifically to all RNAs, consistent with previous studies (Krajewski et al., 2005). Immunoprecipitation of endogenous WDR5 from two different cell lines each specifically retrieved endogenous HOTTIP RNA (Figure 2.4C), indicating that WDR5 and HOTTIP RNA interact in living cells. Immunoprecipitation of an epitopetagged WDR5 from a stable cell line that previously enabled stoichiometric purification of WDR5-interacting proteins (Wysocka et al., 2005) also specifically retrieved HOTTIP RNA (Figure 2S9). Knockdown of WDR5 broadly inhibited 26 expression of 5′ HOXA genes, and also abrogated HOTTIP transcription, demonstrating mutual interdependence between HOTTIP RNA and WDR5 (Figure 2.4D). 27 Figure 2.4. HOTTIP RNA programs active chromatin via WDR5. a, Summary of RNA–protein interaction studies. Each of the indicated recombinant protein was purified and used to retrieve purified HOTTIP RNA or control histone RNA in vitro. Only GST–WDR5 specifically retrieved HOTTIP. b, HOTTIP RNA binds directly and 28 specifically to WDR5. Left, purified GST and GST–WDR5 are visualized by SDS– PAGE and Coomassie Blue staining. Right, retrieved RNAs are quantified by qRT– PCR. c, HOTTIP RNA binds specifically to WDR5 in cells. Immunoprecipitation (IP) of endogenous WDR5 protein from PC3 (prostate) and T24 (bladder) carcinoma cells specifically retrieved HOTTIP, but not control IPs with IgG or chromatin binder SIRT6. U1 spliceosomal RNA served as negative control. d, WDR5 is required for 5′ HOXA gene expression, including HOTTIP RNA. e, HOTTIP RNA recruitment potentiates transcription. Left, the BoxB tethering system. BoxB–RNA specifically binds λN fused to GAL4 DNA binding domain, recruiting the complex to a UASluciferase reporter gene. After transient transfection, IP of GAL4- λN specifically retrieves BoxB–HOTTIP RNA. Right, luciferase activity after co-transfection of the indicated constructs. *P < 0.05 Student’s t-test comparing BoxB–LacZ versus BoxB– HOTTIP). Sloped triangle indicates increasing input of plasmids encoding ncRNAs. Means ± s.d. (n = 3) are shown for all panels. 29 HOTTIP RNA seems to regulate genes in cis, due to its low copy number, distance dependence of HOXA target gene activation on endogenous HOTTIP, and the physical proximity of HOTTIP and its target genes as seen in 5C. Indeed, ectopic expression of HOTTIP RNA by retroviral transduction of lung fibroblasts, which do not express HOTTIP, failed to activate expression of distal HOXA genes, and did not change H3K4me3 and H3K27me3 patterns across HOXA (Figure 2S10). Moreover, in foreskin fibroblasts that express endogenous HOTTIP, ectopic HOTTIP expression did not induce 5′ HOXA genes, nor rescue the effects of depleting endogenous nascent HOTTIP RNA (Figure 2S11). The lack of response in foreskin fibroblasts is notable because endogenous HOTTIP RNA is active in these cells, indicating that the protein partners of HOTTIP are all present and target genes are receptive. Ectopically expressed HOTTIP RNA, being transcribed from retroviral insertion sites scattered randomly in the genome, may not be able to find 5′HOXA genes. In contrast, endogenous HOTTIP RNA is directly positioned near the 5′ HOXA genes by chromosomal looping, allowing interaction and control. To test the requirement of an exogenous targeting mechanism, we engineered an allele of HOTTIP RNA that can be artificially recruited to a reporter gene. Addition of five copies of the BoxB RNA element (Baron-Benhamou et al., 2004) to HOTTIP RNA allows the fusion transcript to be recruited to the λN RNA binding domain fused to a GAL4 DNA-binding domain (Figure 2.4E). Recruitment of HOTTIP RNA to a silent GAL4 promoter is not sufficient to initiate transcription, but can significantly boost transcription if the promoter is also bound by WDR5 and transcriptionally active (Figure 2.4E). By uncoupling the sites of HOTTIP transcription versus HOTTIP RNA function, this experiment indicates that the proximity of HOTTIP RNA—rather than the act of transcription—maintains target gene expression. To further support the functionality of HOTTIP RNA, deletion analysis identified a ~1 kb domain in the 5′ of HOTTIP RNA (HOTTIPExons 1–2) that retains WDR5 binding activity (Figure 2S12A). Enforced overexpression of HOTTIPExons 1–2 in foreskin fibroblasts inhibited 5′ HOXA gene expression in an apparently dominant negative manner (Figure 2S12B). 30 2.3 Discussion In summary, HOTTIP RNA is a key locus control element of HOXA genes and distal identity. Chromosomal looping brings HOTTIP RNA in close proximity to the 5′ HOXA genes. HOTTIP transcription acts as a switch to produce HOTTIP lincRNA, which binds to and targets WDR5–MLL complexes to the 5′ HOXA locus, yielding a broad domain of H3K4me3 and transcription activation (Figure 2S13). The mutual interdependence between HOTTIP RNA and WDR5 creates a positive feedback loop that maintains the ON state of the locus. These findings provide an integrated view linking three dimensional genome organization to dynamic programming of chromatin states, and ultimately to developmental pattern formation. H3K4 methylation is a feature of almost all transcribed genes, and MLL family proteins are involved in many cell fate decisions in development and disease (Ruthenburg et al., 2007). Our findings suggest that additional lincRNAs, especially those associated with enhancers or enhancer-like activities (De Santa et al., 2010; Kim et al., 2010; Orom et al., 2010), may also be involved in gene activation by programming active chromatin states, and highlight WDR5 and other WD40 repeat proteins as candidate adaptors that link chromatin remodelling complexes to lincRNAs. Cis-restricted lincRNAs may be ideally suited to link chromosome structure and gene expression. Because such lincRNA can only act on its neighbours in space, information in higher order chromosomal looping can be faithfully transmitted to chromatin modification via RNA recruitment of enzymatic activities, and thus into gene expression. 31 2.4 Materials and Methods Cells Primary human fibroblasts derived from different anatomic sites were as described (Bernstein et al., 2005; Chang et al., 2002; Chang et al., 2004; Rinn et al., 2006; Rinn et al., 2007; Rinn et al., 2008a; Rinn et al., 2008b). Primary human fibroblasts in culture retain their positional identity and have been used to examine chromatin states associated with positional memory, which have been confirmed in vivo (Bernstein et al., 2005; Rinn et al., 2006; Soshnikova and Duboule, 2009). Chromatin immunoprecipitation followed by microarray analysis ChIP-chip was performed using anti-H3K27me3 (Abcam), anti-H3K4me3 (Abcam), anti-H3K4me2 (Abcam), anti-histone H3 (Abcam), anti-PolII (Abcam), anti-MLL1 (gift of R. Roeder), and anti-WDR5 (Wysocka et al., 2005) antibodies as previously described (Rinn et al., 2007). Chromatin from each indicated cell type or RNAi treatment is split into multiple tubes and subject to ChIP with different antibodies in parallel. Retrieved DNA and input chromatin were competitively hybridized to custom tiling arrays interrogating human HOX loci at 5-bp resolution as previously described (Rinn et al., 2007). 5C analysis of the ENm010 HoxA1 region 5C primers were designed at HindIII restriction sites using 5C primer design tools previously developed (Dostie et al., 2006) and made available online at http://my5C.umassmed.edu (Lajoie et al., 2009). Reverse primers were designed for fragments overlapping a known transcription start site from GENCODE transcripts (Harrow et al., 2006), or overlapping a start site as experimentally determined by CAGE Tag data of the ENCODE pilot project (Birney et al., 2007). Forward primers were designed for all other HindIII restriction fragments. Primers were excluded if highly repetitive sequences prevented the design of a sufficiently unique 5C primer. Primers settings were: U-BLAST: 3; S-BLAST: 130: 15-MER: 1320; MIN_FSIZE: 40; MAX_FSIZE: 50000; OPT_TM: 65; OPT_PSIZE: 40. DNA sequence of the universal 32 tails of forward primers was CCTCTCTATGGGCAGTCGGTGAT; DNA sequence for the universal tails of reverse primers was AGAGAATGAGGAACCCGGGGCAG. A 6-base barcode was included between the specific part of the primers and the universal tail. In total 17 reverse primers and 90 forward primers were designed in the 500 kb HoxA1 locus (ENm010) and hence a total of 1,530 cis interaction were interrogated in this region. Primer sequences are available separately (Table S1). 3C was performed with HindIII as previously described (Dostie and Dekker, 2007) separately for fetal lung and foreskin fibroblasts (FB) and also for the control and HOTTIP knockdown foreskin FBs. For the 5C reaction, a total of 107 forward and reverse primers of HoxA1 region were mixed with either the ENCODE random region (ENr) primer pool comprising of 2,673 forward and 523 reverse primers (covering 30 additional ENCODE regions) or the ENr313 primer pool comprising of 57 forward and 58 reverse primers (covering 1 additional ENCODE region). 5C was then performed in 10 reactions each containing an amount of 3C library that represents 200,000 genome equivalents and 1 fmol of each primer. The 5C analysis of HoxA1 region was carried out in two biological replicates of fetal lung and foreskin FBs. 5C ligation products were amplified using a pair of universal primers that recognize the common tails of the 5C forward and reverse primers described above and pooled together. To facilitate paired-end DNA sequence analysis on the Illumina GA2 platform, paired-end adaptor oligonucleotides were ligated to the 5C library using the Illumina PE protocol and PCR amplification of the library was carried out for 18 cycles with Illumina PCR primer PE 1.0 and 2.0. The 5C library was then sequenced on the Illumina GA2 platform generating 36 base paired end reads. For fetal lung FBs we obtained 7,625,276 and 10,947,424 mapped reads for two biological replicates of which 1,339,861 and 242,301 could be specifically mapped back to interactions within ENm010 using Novoalign (http://www.novocraft.com), respectively. For two biological replicates of foreskin FBs we obtained 7,311,386 and 5,731,107 mapped reads of which 2,752,789 and 66,769 could be mapped back to the ENm010 region, respectively. In the case of the knockdown study, control green fluorescent protein (GFP) knockdown foreskin FB 5C library yielded 4,909,482 mapped reads whereas 33 HOTTIP knockdown foreskin FB had 5,565,389 mapped reads of which 39,168 and 38,950 could be mapped back to ENm010 for control GFP and HOTTIP knockdown, respectively. In the set with fetal lung and foreskin fibroblast samples, 5C for ENm010 was multiplexed for deep sequencing with 5C of one other region, ENr313; in the set containing the knockdown samples, ENm010 was multiplexed with 5C of 30 other genomic regions. The different extent of multiplexing resulted in different number of sequencing reads mapping back to ENm010. In all instances the mappable reads were proportional to the degree of multiplexing, indicating equivalent library quality despite different read numbers. Table S2 outlines the library composition of each experiment. The heat maps are scaled as follows—for Figure 2.1A, distal (foreskin) FBs: 262– 17,467, proximal (lung) FBs: 7–5,846; for Figure 2S6, siGFP: 1–100, siHOTTIP: 1– 100. Raw data from the 5C experiments used to generate the binned heat maps in Figure 2.1A and Figure 2S6 can be found in File S1. Raw data are available by request. HOTTIP cloning, sequence and expression analysis We previously identified a portion of HOTTIP as a non protein-coding transcribed region named ncHOXA13-96 (Rinn et al., 2007). This region also overlaps expressed sequence tag (EST) clone AK093987 that was previously observed to be expressed in cancer cell lines derived from posterior anatomic sites (Sasaki et al., 2007). 5′ and 3′ RACE (RLM Race kit, Applied Biosystems/Ambion) showed fulllength HOTTIP RNA to be 3,764 nucleotides, extending the known transcribed region by more than 1,400 bases. BLAST and BLAT confirmed that portions of HOTTIP are well conserved in mammals and even in avians but had no protein coding potential. Full-length HOTTIP RNA sequence has been deposited at NCBI (accession number GU724873). qRT–PCR with SYBR Green was conducted as recommended by the manufacturer (Agilent Technologies). Primer sequences specific for HOTTIP were CCTAAAGCCACGCTTCTTTG (HOTTIP-F) and TGCAGGCTGGAGATCCTACT (HOTTIP-R). For Figure 2S11, endogenous nascent HOTTIP was distinguished from 34 ectopic HOTTIP expressed from cDNA using primers that spanned intron–exon junctions. Strand-specific RT–PCR RNA extracted from primary foreskin fibroblasts was reverse transcribed (SuperScript III, Invitrogen) using combinations of the previously described HOTTIPspecific primers HOTTIP-F and/or HOTTIP-R as diagrammed in Figure 2S1B. Resulting cDNA was then PCR-amplified using both HOTTIP-F and HOTTIP-R primers to visually determine strand specificity. HOTTIP transcript count per cell The level of HOTTIP transcript per cell was calculated from the level of HOTTIP in 500,000 cells. Full-length HOTTIP in pcDNA3.1+ was assayed by qPCR using primers HOTTIP-F and HOTTIP-R at predetermined concentrations in triplicate to generate a linear amplification curve dependent on the moles of template DNA (Figure 2S2). The qRT–PCR value from 500,000 foreskin fibroblasts was determined and plotted, and the corresponding total molecules of transcript was divided by 500,000 to determine the approximate number of transcripts per cell. Single-molecule RNA fluorescence in situ hybridization (RNA-FISH) Single molecule RNA-FISH was performed as described in (Raj et al., 2008) with the following modifications: the amount of hybridization solution per chamber was doubled to allow for proper coating of the chamber and the amount of glucoseoxidase buffer was tripled to assist in image acquisition. Images were acquired using an Olympus FV1000 confocal microscope within 2 h of the addition of the glucoseoxidase buffer. RNA interference Primary foreskin fibroblasts were transfected with siRNAs targeting HOTTIP and WDR5 using Lipofectamine 2000 (Invitrogen) as per manufacturer’s instructions. Total RNA was harvested 48–72 h later using TRIzol (Invitrogen) and RNeasy Mini 35 Kits (Qiagen) as previously described (Rinn et al., 2006). For the intronic HOTTIP knockdown experiment in Figure 2S11, a pool of 10 siRNAs (Table S3) targeting intronic regions in HOTTIP were transfected into foreskin fibroblasts, and RNA isolated as above. Generation of shRNAs against chicken HOTTIP A reporter construct encoding a GFP–chicken HOTTIP fusion transcript was used in a small-scale screen to identify highly effective shRNA constructs. Eleven shRNAs targeting conserved regions of chicken HOTTIP were designed and inserted into the pSMP system (Thermo/Open Biosystems). The reporter construct and shRNA constructs were cotransfected into Phoenix cells, and HOTTIP transcript levels were analysed via reduced GFP fluorescence and by qRT–PCR. Three shRNAs that were effective in vitro were then cloned into RCAS vector for studies in chick embryos (Harpavat and Cepko, 2006). Chick RNAi RCAS HOTTIP hairpin and RCAS AP viruses were made by transfecting DF1 cells with viral DNA. Transfected DF-1 cells were grown and passaged, after which the virus-containing supernatant was collected, concentrated and titred. Fertilized chicken eggs were incubated in a humidified rotating incubator at 37 °C until they reached Hamilton/Hamburger stage 10. Eggs were then windowed to expose the embryos. After gently removing the vitelline membrane, chicken embryos were microinjected with RCAS-HOTTIP hairpins and RCAS-AP viruses at the prospective wing and leg buds. All viral stocks have titres of 1 × 108 IU ml−1, and each limb was injected five times. The infected embryos were allowed to incubate at 37 °C and were harvested 2 or 4 days after injection to detect viral infection by immunohistochemistry. Total RNA was extracted from injected forelimbs, and RT–PCR analysis was performed 4 days after injection. Chicken embryos were harvested 9 days postinjection to carry out whole-mount Alcian blue staining. A total of 50 animals were injected. 36 Hairpin sequences for chick HOTTIP were TGCTGTTGACAGTGAGCGACCCGAAGATGTGTCTGATTTGTAGTGAAGCCA CAGATGTACAAATCAGACACATCTTCGGGCTGCCTACTGCCTCGGA (2-2-1), TGCTGTTGACAGTGAGCGCCGCTCTGCTCTCCTCTCTCTCTAGTGAAGCCA CAGATGTAGAGAGAGAGGAGAGCAGAGCGATGCCTACTGCCTCGGA (3-21), and TGCTGTTGACAGTGAGCGAATCCTTAATCGAATCTGATTTTAGTGAAGCCA CAGATGTAAAATCAGATTCGATTAAGGATCTGCCTACTGCCTCGGA (4-4-1). HOTTIP overexpression Full-length HOTTIP and a truncated transcript consisting of exons 1 and 2 (HOTTIPExons 1–2) were cloned into the LZRS vector (gift of P. Khavari), and then transfected into Phoenix cells (gift of G. Nolan) to generate amphotropic retroviruses. Primary human fibroblasts were infected with either LZRS-full length HOTTIP (lung), LZRS-truncated HOTTIP (foreskin), or LZRS-GFP (both lung and foreskin), then passaged over 60 days, with periodic testing of HOXA and HOTTIP expression by qRT–PCR. These cells were used in the rescue experiments depicted in Figure 2S11. GST pull-down Full-length HOTTIP, truncated HOTTIP containing exons 1 and 2 (HOTTIPExons 1–2), and histone H2B1 mRNA were transcribed in vitro using T7 polymerase according to manufacturer’s instructions (Promega), denatured, and refolded in folding buffer (100 mM KCl, 10 mM MgCl2, Tris pH 7.0). GST-tagged WDR5, C-terminal MLL1, RBBP5/Ash2L and TRF1 were expressed in Escherichia coli and purified as described (Smith and Johnson, 1988). Each GST-fusion protein was bound to glutathione beads (Amersham/GE Healthcare) and blocked with excess yeast total mRNA in PB100 buffer (20 mM HEPES pH 7.6, 100 mM KCl, 0.05% NP40, 1 mM DTT, 0.5 mM PMSF) for 1 h at room temperature. Beads were then incubated with either in-vitro-transcribed HOTTIP or histone H2B1 mRNA for 45 min at room temperature. After three washes in PB200 buffer (20 mM HEPES pH 7.6, 37 200 mM KCl, 0.05% NP40, 1 mM DTT, 0.5 mM PMSF), bound RNAs were extracted and analysed by qRT–PCR, as previously described. RNA immunoprecipitation HeLa-WDR5-Flag cells: 48 h after Lipofectamine 2000-mediated transfection of HOTTIP into HeLa WDR5-Flag cells (approximately 107 cells), total protein was extracted as previously described, with modifications (Dignam et al., 1983). Briefly, cells were resuspended in Buffer A (10 mM HEPES pH 7.5, 1.5 mM MgCl2, 10 mM KCl, 0.5 mM DTT, 1.0 mM PMSF), lysed in 0.25% NP40, and fractionated by low speed centrifugation. The nuclear fraction was resuspended and lysed in Buffer C (20 mM HEPES pH 7.5, 10% glycerol, 0.42 M KCl, 4 mM MgCl2, 0.5 mM DTT, 1.0 mM PMSF). Combined nuclear and cytoplasmic fractions were immunoprecipitated with mouse anti-Flag M2 monoclonal antibody (Sigma) or mouse IgG affixed to agarose beads (Sigma) for 3 to 4 h at 4 °C. Beads were washed four times with wash buffer (50 mM TrisCl pH 7.9, 10% glycerol, 100 mM KCl, 5 mM MgCl2, 10 mM β-mercaptoethanol, 0.1% NP40). After elution using Flag peptide (Sigma), co-immunoprecipitated RNA was extracted and analysed by qRT–PCR. Endogenous WDR5 and SIRT6 RIP: cellular fractions were isolated as above and incubated with the anti-WDR5 (Millipore 07-706) or anti-Sirt6 (ab62739, Abcam) antibodies overnight at 4 °C. Samples were washed in wash buffer, and coimmunoprecipitated RNA was extracted and analysed by qRT–PCR. RNA chromatography Full-length in-vitro-transcribed HOTTIP RNA was conjugated to adipic acid dehydrazide agarose beads as described (Michlewski and Caceres, 2010). The complexed beads were incubated with whole cell lysates from Hela WDR5-Flag cells, washed, and bound proteins visualized by western blotting. BoxB tethering assay 293T cells were grown to about 50% confluence in 6-well plates on the day of transfection. Using Lipofectamine 2000 (Invitrogen), a plasmid encoding a luciferase 38 gene under the control of five tandem GAL4 UAS sites were co-transfected with plasmids encoding GAL4-WDR5, GAL4-λN (the 22 amino acid RNA-binding domain of the lambda bacteriophage antiterminator protein N) peptide fused to a C-terminal GFP tag, BoxB (containing five repeats of the λN-specific 19 nucleotide binding site), BoxB fused to full-length LacZ, or BoxB fused to full-length HOTTIP. Cells were lysed 48 h after transfection, and luciferase assay kit (Promega) was used to determine relative levels of the luciferase gene product, following the manufacturer’s protocol. 39 2.5 Supplementary Figures Figure 2S1. Molecular characterization of HOTTIP. (A) ChIP-chip confirmed that Suz12, a subunit of PRC2, and MLL1 both occupy HOTTIP in human lung fibroblasts, where HOTTIP RNA is not expressed. Chromatin occupancy map across the ~100 kilobase HOXA locus is shown. Genomic coordinate is on the X-axis; Y-axis depicts relative occupancy of MLL1, Suz12, and RNA polymerase II (Pol II) on a linear scale. Arrows depict MLL1 and Suz12 binding on HOTTIP DNA. Note that the peak of MLL1 occupancy coincides with diminished Suz12 binding, which are flanked by regions of higher Suz12 occupancy. This reciprocal binding of MLL and Polycomb subunits, even when both are present and juxtaposed, have been previously observed in bivalent domains21. (B) Strand-specific RT-PCR of HOTTIP RNA. Primer combinations used for reverse transcription (1, 2) and HOTTIP PCR (F, R) are 40 diagrammed on the schematic. (C) Conserved distal expression pattern of chick HOTTIP. In situ hybridization of stage 28 chick embryo limbs are shown. White scale bar=0.1 mm. (D) Chromatin marks at the distal HOXA locus in human embryonic stem cells (ESC). In ES cells, the entire HOXloci, including HOTTIP, is marked by a broad domain of H3K27me3; the HOTTIP locus has no H3K4me3 or H3K36me3 marks, consistent with its silent status. Genome graphic data exported from http://genome.ucsc.edu/, utilizing the ENCODE whole genome data sets22. 41 Figure 2S2. Single molecule RNA-fluorescence in situ hybridization (FISH) confirms low copy number of HOTTIP RNA. (A) Primary human foreskin fibroblasts were hybridized with a mixture of Cy-5-labeled probes against HOTTIP and assayed for HOTTIP expression via confocal microscopy. Representative confocal images showing an example of a cell with one (left) or no (right) copies of HOTTIP RNA (green, highlighted by arrow). DAPI staining of cellular nuclei is in red. White scale bar=15μm. (B) Quantitation of RNA-FISH experiments, demonstrating that the majority (>97%) of the cells examined contain zero or one copy of HOTTIP RNA. HOTTIP RNA is present on on average at 0.31 copies per cell. The counts were pooled from 3 independent replicates (n=449 cells). (C) Titration curve used to determine number of HOTTIP molecules per 500,000 cells. Y-axis depicts 2^-ct values from qRT-PCR; X-axis depicts scaled total molecules of transcript in a sample. Blue points outline the titration curve using a known quantity of HOTTIP molecules to determine corresponding qRT-PCR 2^-ct values; error bars represent s.d. from three technical replicates. The pink square represents the qRT-PCR value from a standard sample of 500,000 foreskin fibroblasts. 42 Figure 2S3. Lack of antisense HOTTIP RNA with HOTTIP knockdown. (A) No antisense transcription upon HOTTIP depletion. RNA from foreskin fibroblasts treated with control or HOTTIP siRNA was reverse transcribed with strand-specific primers, and then transcript abundance is analyzed by qPCR. N.D. not detectable. (B) Independent siRNAs targeting HOTTIP abrogated 5’ HOXA gene expression in primary fibroblasts. Mean + s.d. are shown. 43 Figure 2S4. Efficient retroviral infection in developing chick limb buds. (A) Broad expression of the avian virus protein Gag in mesenchymal cells of chick forelimb 2 days after injection with RCAS-shHOTTIP. (B) Higher magnification of A showing detection of Gag protein in mesenchymal and muscle cells of the injected limb. (C) After 4 days of injection, Gag protein was detected in limb chondrocytes. (D) Alkaline phosphatase activities were detected in hindlimb injected with control RCAS-AP. r, radius, u, ulna. 44 Figure 2S5. HOTTIP knockdown decreases expression of 5’ HoxA genes in vivo. Replication competent avian virus (RCAS) expressing short hairpin RNA targeting HOTTIP was injected into chick limb buds at St. 17. After 4 days of incubation, embryos were harvested and prepared for sections and in situ hybridization. Contralateral limbs served as controls. Shown are forelimb zeugopod and autopod (i.e. elbow to fingertip). (A)HOTTIP expression in the limb bud is reduced (B) following over-expression of HOTTIP shRNA. In control limbs (C,E,G), HoxA13(C), HoxA11 (E), and HoxA10 (G) expression domains are strongest in distal, undifferentiated limb 45 mesenchyme. In treated limbs (D,F,H) the expression domains of the HoxA genes are reduced in size and fewer transcripts are detectable. Abbreviations: r=radius (anterior side of limb); u=ulna; d= digit, D=distal tip of limb bud; P= proximal end . 46 Figure 2S6. HOTTIP depletion causes little change to higher-order chromosome configuration. (A) Chromosome Conformation Capture (5C) analysis of foreskin fibroblasts treated with control siRNAs (siGFP) or siRNA targeting HOTTIP. Heatmap representations of 5C data (bin size 30 kb, step size 3 kb) across human HOXA are as in Figure 2.1A. The diagonal represents frequent cis interactions between regions located in proximity to each other in the HoxA locus. Long-range looping interactions are 5C signals that are away from the diagonal. Each pixel represents median interactions in a 30kb region; the intensity of pixel corresponds to the total number of reads. (B) Knockdown of HOTTIP broadly decreases H3K4me2 across 5’ HOXA locus but does not change overall occupancy of histone H3. ChIPchip data across ~100 kilobase HOXAlocus are shown for foreskin fibroblasts with 47 control (top) versus HOTTIP (bottom) knockdown. Genomic coordinate is on the Xaxis; Y-axis depicts relative occupancy of H3K4me2 and pan histone H3 on a linear scale. 48 Figure 2S7. ChIP-qPCR validation of HOTTIP-dependence of H3K4me3 occupancy at distal HOXA. Chromatin-immunoprecipitation with H3K4me3 was performed on foreskin fibroblasts transfected with siRNA against GFP (siGFP) or HOTTIP (siHOTTIP), and the subsequent enrichment of distal (HOXA13,HOXA11) and proximal (HOXA1)HOXA genes were analyzed by quantitative PCR. Mean + s.d. are shown. 49 Figure 2S8. Zoom-in view of Figure 2.3 highlighting HOTTIP dependence of MLL1 and WDR5 localization to HOXA. Knockdown of HOTTIP abrogates MLL1(A) and WDR5 (B) peaks across the 5’ HOXA locus, and results in diffuse and less intense binding of MLL1 and WDR5 across the 5’HOXA. HOTTIP knockdown also led to increased accumulation of MLL1 and WDR5 on HOTTIP itself. Arrows highlight peaks of MLL1 and WDR5 occupancy. 50 Figure 2S9. HOTTIP binds stably expressed FLAG-WDR5 in HeLa cells. (A) HOTTIP expression levels in primary foreskin fibroblasts, FLAG-WDR5 HeLa cells, and FLAG-WDR5 HeLa cells transduced with HOTTIP. Error bars represent mean + s.d.. (B) HOTTIP binds specifically to WDR5 in cells. Top: IP of FLAGWDR5 verified by WDR5 immunoblot. Bottom: Quantitation of RIP by qRT-PCR, normalized by the levels of RNA retrieved by IgG control. (C) RNA binding by H3K4 methylase complexes. Extract from from FLAG-WDR5 HeLa cells were passed over beads (control) or bead coupled to full-length HOTTIP RNA, extensively washed, and interrogated for retrieval of specific proteins by immunoblot. HOTTIP retrieved WDR5, Menin (unique to MLL1/MLL2 complexes), and WDR82 (unique to the Set1A/Set1B), but not PTIP (unique to MLL3/MLL4 complexes). 51 Figure 2S10. HOTTIP overexpression does not affect distal HOXA expression. (A) Ectopic overexpression of HOTTIP in lung fibroblasts does not activate 5’HOXA genes. (B) HOTTIP overexpression in lung fibroblasts do not change HOXA chromatin state. ChIP-chip data across ~100 kilobase HOXA locus are shown for lung fibroblasts overexpressing HOTTIP (top) compared those from regular lung 52 fibroblasts (bottom). Genomic coordinate is on the X-axis; Y-axis depicts relative occupancy of H3K27me3, H3K4me3, and RNA polymerase II (Pol II) on a linear scale. 53 Figure 2S11. Ectopic HOTTIP expression does not activate 5’HOXA genes nor rescue the effects of depleting endogenous nascent HOTTIP. (A) HOTTIP overexpression in foreskin fibroblasts is unable to activate 5’HOXA genes, and has apparently dominant Wang et al., p. 6 negative effect on HOXA13 expression. GFP transduction served as negative control, which did not change HOTTIP level or 5’ HOXA genes. (B) Knockdown of endogenous nascent HOTTIP was achieved by siRNAs targeting intronic regions in HOTTIP, leading to decreased expression of 5’ HOXA genes. siRNAs targeting HOTTIP introns do not affect ectopic HOTTIP overexpression from 54 cDNA (>100 fold in both siScr vs. siHOTTIP-intronic treated cells compared to wild type). Cells transduced with ectopic HOTTIP cDNA responded to endogenous HOTTIP depletion in the same manner, indicating that ectopic HOTTIP cannot rescue depletion of endogenous HOTTIP. Mean + s.d. are shown. *, p<0.05, Student’s t-test. N.S., not significantly different between control and ectopic HOTTIP overexpression. 55 Figure 2S12. HOTTIPExons1-2 binds to WDR5 and acts in a dominant negative manner to inhibit 5’ HOXA gene expression. (A) Schematic of HOTTIP fragments and their relative affinities in binding to GST-WDR5. Each fragment was in vitro transcribed and purified, and incubated with purified recombinant GST or GSTWDR5. Specific retrieval of RNA by GST-WDR5 was quantified by qRT-PCR. + indicates higher interaction with GST-WDR5 than GST. Notably, HOTTP exons 1-2 bound GST-WDR5 ~2 fold better than full length HOTTIP on a molar basis. Mean +s.d. is shown. (B) Overexpression of HOTTIPExons1-2 in foreskin fibroblasts exerts a dominant negative effect, resulting in reduced 5’ HOXA gene expression.Error bars represent s.d. from biologic duplicates.Bottom:Overexpression is confirmed by RTPCR using primers specific for HOTTIPExons1-2 and GFP; GAPDH is included as a loading control. 56 Figure 2S13. Model of HOTTIP action. HOTTIP is positioned near the active 5’ HOXA genes via chromosomal looping. The top panel depicts the situation in distal cells when HOTTIP is depleted: the chromosome is looped to bring 5’ HOXA genes together, but MLL1/WDR5 is not properly localized, nor is there H3K4me3 and gene transcription. Upon transcription (bottom panel), HOTTIP RNA binds to WDR5MLL1 complex, and drives MLL1/WDR5 occupancy and H3K4 trimethylation of 5’ HOXA genes. Because HOTTIP transcription is dependent on WDR5 and WDR5 occupancy is dependent on HOTTIP, this mutual interdependence creates a positive feedback loop that maintains the ON state of the 5’ HOXA locus. 57 2.7 References Ashe, H.L., Monks, J., Wijgerde, M., Fraser, P., and Proudfoot, N.J. (1997). Intergenic transcription and transinduction of the human beta-globin locus. Genes Dev 11, 2494-2509. Baron-Benhamou, J., Gehring, N.H., Kulozik, A.E., and Hentze, M.W. (2004). Using the lambdaN peptide to tether proteins to RNAs. Methods Mol Biol 257, 135154. Bernstein, B.E., Kamal, M., Lindblad-Toh, K., Bekiranov, S., Bailey, D.K., Huebert, D.J., McMahon, S., Karlsson, E.K., Kulbokas, E.J., 3rd, Gingeras, T.R., et al. (2005). Genomic maps and comparative analysis of histone modifications in human and mouse. Cell 120, 169-181. Bernstein, B.E., Mikkelsen, T.S., Xie, X., Kamal, M., Huebert, D.J., Cuff, J., Fry, B., Meissner, A., Wernig, M., Plath, K., et al. (2006). A bivalent chromatin structure marks key developmental genes in embryonic stem cells. Cell 125, 315-326. Birney, E., Stamatoyannopoulos, J.A., Dutta, A., Guigo, R., Gingeras, T.R., Margulies, E.H., Weng, Z., Snyder, M., Dermitzakis, E.T., Thurman, R.E., et al. (2007). Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature 447, 799-816. Chang, H.Y. (2009). Anatomic demarcation of cells: genes to patterns. Science 326, 1206-1207. Chang, H.Y., Chi, J.T., Dudoit, S., Bondre, C., van de Rijn, M., Botstein, D., and Brown, P.O. (2002). Diversity, topographic differentiation, and positional memory in human fibroblasts. Proc Natl Acad Sci U S A 99, 12877-12882. Chang, H.Y., Sneddon, J.B., Alizadeh, A.A., Sood, R., West, R.B., Montgomery, K., Chi, J.T., van de Rijn, M., Botstein, D., and Brown, P.O. (2004). Gene expression signature of fibroblast serum response predicts human cancer progression: similarities between tumors and wounds. PLoS Biol 2, E7. Davis, A.P., Witte, D.P., Hsieh-Li, H.M., Potter, S.S., and Capecchi, M.R. (1995). Absence of radius and ulna in mice lacking hoxa-11 and hoxd-11. Nature 375, 791-795. De Santa, F., Barozzi, I., Mietton, F., Ghisletti, S., Polletti, S., Tusi, B.K., Muller, H., Ragoussis, J., Wei, C.L., and Natoli, G. (2010). A large fraction of extragenic RNA pol II transcription sites overlap enhancers. PLoS Biol 8, e1000384. Dean, A. (2006). On a chromosome far, far away: LCRs and gene expression. Trends Genet 22, 38-45. Dignam, J.D., Lebovitz, R.M., and Roeder, R.G. (1983). Accurate transcription initiation by RNA polymerase II in a soluble extract from isolated mammalian nuclei. Nucleic Acids Res 11, 1475-1489. Dinger, M.E., Amaral, P.P., Mercer, T.R., Pang, K.C., Bruce, S.J., Gardiner, B.B., Askarian-Amiri, M.E., Ru, K., Solda, G., Simons, C., et al. (2008). Long noncoding RNAs in mouse embryonic stem cell pluripotency and differentiation. Genome Res 18, 1433-1445. Dostie, J., and Dekker, J. (2007). Mapping networks of physical interactions between genomic elements using 5C technology. Nat Protoc 2, 988-1002. 58 Dostie, J., Richmond, T.A., Arnaout, R.A., Selzer, R.R., Lee, W.L., Honan, T.A., Rubio, E.D., Krumm, A., Lamb, J., Nusbaum, C., et al. (2006). Chromosome Conformation Capture Carbon Copy (5C): a massively parallel solution for mapping interactions between genomic elements. Genome Res 16, 1299-1309. Fromental-Ramain, C., Warot, X., Messadecq, N., LeMeur, M., Dolle, P., and Chambon, P. (1996). Hoxa-13 and Hoxd-13 play a crucial role in the patterning of the limb autopod. Development 122, 2997-3011. Guenther, M.G., Jenner, R.G., Chevalier, B., Nakamura, T., Croce, C.M., Canaani, E., and Young, R.A. (2005). Global and Hox-specific roles for the MLL1 methyltransferase. Proc Natl Acad Sci U S A 102, 8603-8608. Harpavat, S., and Cepko, C.L. (2006). RCAS-RNAi: a loss-of-function method for the developing chick retina. BMC Dev Biol 6, 2. Harrow, J., Denoeud, F., Frankish, A., Reymond, A., Chen, C.K., Chrast, J., Lagarde, J., Gilbert, J.G., Storey, R., Swarbreck, D., et al. (2006). GENCODE: producing a reference annotation for ENCODE. Genome Biol 7 Suppl 1, S4 19. Kim, T.K., Hemberg, M., Gray, J.M., Costa, A.M., Bear, D.M., Wu, J., Harmin, D.A., Laptewicz, M., Barbara-Haley, K., Kuersten, S., et al. (2010). Widespread transcription at neuronal activity-regulated enhancers. Nature 465, 182-187. Kmita, M., Fraudeau, N., Herault, Y., and Duboule, D. (2002). Serial deletions and duplications suggest a mechanism for the collinearity of Hoxd genes in limbs. Nature 420, 145-150. Kmita, M., Tarchini, B., Zakany, J., Logan, M., Tabin, C.J., and Duboule, D. (2005). Early developmental arrest of mammalian limbs lacking HoxA/HoxD gene function. Nature 435, 1113-1116. Krajewski, W.A., Nakamura, T., Mazo, A., and Canaani, E. (2005). A motif within SET-domain proteins binds single-stranded nucleic acids and transcribed and supercoiled DNAs and can interfere with assembly of nucleosomes. Mol Cell Biol 25, 1891-1899. Lajoie, B.R., van Berkum, N.L., Sanyal, A., and Dekker, J. (2009). My5C: web tools for chromosome conformation capture studies. Nat Methods 6, 690-691. Mercer, T.R., Dinger, M.E., and Mattick, J.S. (2009). Long non-coding RNAs: insights into functions. Nat Rev Genet 10, 155-159. Michlewski, G., and Caceres, J.F. (2010). RNase-assisted RNA chromatography. RNA 16, 1673-1678. Nelson, C.E., Morgan, B.A., Burke, A.C., Laufer, E., DiMambro, E., Murtaugh, L.C., Gonzales, E., Tessarollo, L., Parada, L.F., and Tabin, C. (1996). Analysis of Hox gene expression in the chick limb bud. Development 122, 1449-1466. Orom, U.A., Derrien, T., Beringer, M., Gumireddy, K., Gardini, A., Bussotti, G., Lai, F., Zytnicki, M., Notredame, C., Huang, Q., et al. (2010). Long noncoding RNAs with enhancer-like function in human cells. Cell 143, 46-58. Petruk, S., Sedkov, Y., Riley, K.M., Hodgson, J., Schweisguth, F., Hirose, S., Jaynes, J.B., Brock, H.W., and Mazo, A. (2006). Transcription of bxd noncoding RNAs promoted by trithorax represses Ubx in cis by transcriptional interference. Cell 127, 1209-1221. 59 Ponting, C.P., Oliver, P.L., and Reik, W. (2009). Evolution and functions of long noncoding RNAs. Cell 136, 629-641. Raj, A., van den Bogaard, P., Rifkin, S.A., van Oudenaarden, A., and Tyagi, S. (2008). Imaging individual mRNA molecules using multiple singly labeled probes. Nat Methods 5, 877-879. Rinn, J.L., Bondre, C., Gladstone, H.B., Brown, P.O., and Chang, H.Y. (2006). Anatomic demarcation by positional variation in fibroblast gene expression programs. PLoS Genet 2, e119. Rinn, J.L., Kertesz, M., Wang, J.K., Squazzo, S.L., Xu, X., Brugmann, S.A., Goodnough, L.H., Helms, J.A., Farnham, P.J., Segal, E., et al. (2007). Functional demarcation of active and silent chromatin domains in human HOX loci by noncoding RNAs. Cell 129, 1311-1323. Rinn, J.L., Wang, J.K., Allen, N., Brugmann, S.A., Mikels, A.J., Liu, H., Ridky, T.W., Stadler, H.S., Nusse, R., Helms, J.A., et al. (2008a). A dermal HOX transcriptional program regulates site-specific epidermal fate. Genes Dev 22, 303-307. Rinn, J.L., Wang, J.K., Liu, H., Montgomery, K., van de Rijn, M., and Chang, H.Y. (2008b). A systems biology approach to anatomic diversity of skin. J Invest Dermatol 128, 776-782. Ruthenburg, A.J., Allis, C.D., and Wysocka, J. (2007). Methylation of lysine 4 on histone H3: intricacy of writing and reading a single epigenetic mark. Mol Cell 25, 15-30. Sanchez-Elsner, T., Gou, D., Kremmer, E., and Sauer, F. (2006). Noncoding RNAs of trithorax response elements recruit Drosophila Ash1 to Ultrabithorax. Science 311, 1118-1123. Sasaki, Y.T., Sano, M., Kin, T., Asai, K., and Hirose, T. (2007). Coordinated expression of ncRNAs and HOX mRNAs in the human HOXA locus. Biochem Biophys Res Commun 357, 724-730. Schuettengruber, B., Chourrout, D., Vervoort, M., Leblanc, B., and Cavalli, G. (2007). Genome regulation by polycomb and trithorax proteins. Cell 128, 735-745. Small, K.M., and Potter, S.S. (1993). Homeotic transformations and limb defects in Hox A11 mutant mice. Genes Dev 7, 2318-2328. Smith, D.B., and Johnson, K.S. (1988). Single-step purification of polypeptides expressed in Escherichia coli as fusions with glutathione S-transferase. Gene 67, 31-40. Soshnikova, N., and Duboule, D. (2009). Epigenetic temporal control of mouse Hox genes in vivo. Science 324, 1320-1323. Wang, P., Lin, C., Smith, E.R., Guo, H., Sanderson, B.W., Wu, M., Gogol, M., Alexander, T., Seidel, C., Wiedemann, L.M., et al. (2009). Global analysis of H3K4 methylation defines MLL family member targets and points to a role for MLL1-mediated H3K4 methylation in the regulation of transcriptional initiation by RNA polymerase II. Mol Cell Biol 29, 6074-6085. Wysocka, J., Swigut, T., Milne, T.A., Dou, Y., Zhang, X., Burlingame, A.L., Roeder, R.G., Brivanlou, A.H., and Allis, C.D. (2005). WDR5 associates with histone H3 methylated at K4 and is essential for H3 K4 methylation and vertebrate 60 development. Cell 121, 859-872. Zhang, X., Lian, Z., Padden, C., Gerstein, M.B., Rozowsky, J., Snyder, M., Gingeras, T.R., Kapranov, P., Weissman, S.M., and Newburger, P.E. (2009). A myelopoiesis-associated regulatory intergenic noncoding RNA transcript within the human HOXA cluster. Blood 113, 2526-2534. 61 CHAPTER 3 Long noncoding RNA as regulatory switch of protein turnover This chapter has been submitted as Yang, Y.W., Chen, Y., Wan, B., Qu, K., Lei, M., and Chang, H.Y. Long noncoding RNA as regulatory switch of protein turnover. Submitted to Cell/Molecular Cell on 3/14/12 Y.W. Yang’s was the primary contributor to all aspects of this chapter, excepting protein purification used for scanning alanine mutagenesis screen (Figure 3.4A-C, Figure 3S4B), microarray data analysis (Figure 3.7A), and in vitro ubiquitination assay (Figure 3S1B). Y.W. Yang and H.Y. Chang prepared and wrote the manuscript with input from all authors. 62 3.1 Abstract Long intervening noncoding RNAs (lincRNAs) are prevalent genes with poorly understood functions. Here we discover a pathway of lincRNA-regulated proteolysis. The enhancer-like lincRNA HOTTIP extends the half-life of its binding protein WDR5, a subunit of the MLL H3K4 methylase complex, resulting in increased chromatin occupancy and gene activation. LincRNA-mediated stabilization requires direct RNA-protein interaction in a long RNA context, and blocks turnover at a step after target protein poly-ubiquitination. We elucidate the lincRNA binding interface on WDR5. A WDR5 mutant that selectively abrogates lincRNA binding becomes unstable, and is defective in gene activation, maintenance of histone H3 lysine 4 trimethylation, and embryonic stem cell self renewal. The ability to modulate protein turnover may allow lincRNAs to control the lifespan of molecular interactions at chromatin and elsewhere, broadening their scope in epigenetics and cell fate control. 63 3.2 Introduction Eukaryotic genomes are expansively transcribed, generating thousands of long intervening noncoding RNA (lincRNA) transcripts ranging from several hundred nucleotides to dozens of kilobases (Guttman et al., 2009; Guttman et al., 2010). These lincRNAs are capped, spliced, and often polyadenylated in a manner similar to canonical messenger RNAs, but do not appear to function by coding for proteins. Of the large number of lincRNAs, only a small fraction has been functionally characterized, revealing diverse functions in development and disease processes (Wang and Chang, 2011). Intriguingly, some protein coding transcripts may also possess lincRNA-like regulatory functions independent of their coding capacity (Candeias et al., 2008; Zofall et al., 2012), suggesting a broad scope for RNA-based regulation. LincRNAs have been demonstrated to play key roles in epigenetic regulation by controlling chromatin states (Wang and Chang, 2011). Specific lincRNAs can bind repressive or activating chromatin modification complexes, and localize these activities to specific gene loci. For example, the lincRNA XIST binds the Polycomb Repressive Complex 2 (PRC2) to cause histone H3 lysine 27 trimethylation and silence the X chromosome for dosage compensation in females (Morey and Avner, 2011). As another example, the lincRNA HOTAIR acts as a molecular scaffold, binding both PRC2 and the H3K4 demethylase LSD1 complex to silence hundreds of loci throughout the genome (Chu et al., 2011; Gupta et al., 2010; Rinn et al., 2007; Tsai et al., 2010). In gene activation, the roX RNAs act in Drosophila dosage compensation by binding the MSL complex to acetylate and activate the single X chromosome in males (Deng and Meller, 2006). Recent studies have suggested that enhancers and promoters often produce lincRNAs, and some of these lincRNAs indeed have enhancer-like functions in the activation of nearby genes (Hung et al., 2011; Kim et al., 2010; Orom et al., 2010; Rada-Iglesias et al., 2011). Furthermore, a recent study of lincRNAs expressed in mouse embryonic stem cells has found that many lincRNAs bind protein complexes involved in reading, writing, and erasing chromatin marks, suggesting a comprehensive role of lincRNAs throughout epigenetic 64 regulation (Guttman et al., 2011). HOTTIP is a prime example of an enhancer-like lincRNA (Wang, 2011). HOTTIP is a 3.8 kilobase polyadenylated RNA transcribed from the 5’ distal tip of the HOXA locus. HOTTIP coordinates expression of HOXA9 to HOXA13, which are important for distal identity. HOTTIP RNA directly binds to and recruits the MLL H3K4 methylase complex via the WDR5 protein to maintain H3K4 trimethylation (H3K4me3). The MLL family of H3K4 methylases are evolutionarily conserved (known as trithorax in Drosophila and COMPASS in yeast) (reviewed by Smith et al., 2011), and MLL translocations are an important cause of human leukemias (Meyer et al., 2009). Members of the MLL family encode single SET domain-containing H3K4 methylases, and are associated with WDR5, Ash2L, RbBP5, and additional proteins for localization and enzymatic activity. WDR5, which directly interacts with HOTTIP, is a multifunctional adaptor protein that is present in several gene activating protein complexes. WDR5 can discriminate posttranslational modifications on histone tails, as well as bind to MLL and RbBP5 (Migliori et al., 2012; Wysocka et al., 2005). WDR5 is highly expressed in embryonic stem cells (ESC), and is particularly required for ESC self renewal and maintenance of active chromatin for pluripotency genes (Ang et al., 2011). WDR5 is also targeted by the DDB1-CUL4B E3 ubiquitin ligase in the nucleus, and is thus under control of the ubiquitin-proteasome system (UPS) (Nakagawa and Xiong, 2011). WDR5 is a key substrate of CUL4B to regulate neuronal genes, and is believed to underlie the role of CUL4B mutations in human X-linked mental retardation. Regulated proteolysis is an essential strategy for cell fate control, ranging from cell cycle progression to immunity. The UPS controls numerous aspects of gene expression (reviewed by Muratani and Tansey, 2003), including transcription factor half life and chromatin modifications. After a ubiquitin activating enzyme (E1) adenylates ubiquitin and transfers it to a ubiquitin conjugating enzyme (E2), a ubiquitin ligase (E3) transfers ubiquitin to the target protein. The proteasome then recognizes, unfolds, and degrades the polyubiquitinated protein. The entire ubiquitinproteasome system is highly regulated, including at the level of ubiquitin ligase 65 substrate specificity and at the level of proteasome activity (Bader et al., 2011). Nucleosomes are turned over in a dynamic and regulated fashion at specific genomic loci (Deal et al., 2010; Dion et al., 2007; Mito et al., 2007) , and histone tails can be proteolyzed during differentiation to erase prior histone modifications (Duncan et al., 2008). These examples suggest that the half-lives of molecular interactions at chromatin are a critical feature of epigenetic regulation, but the basis of temporal regulation at chromatin is not well understood. Here, we show that a lincRNA can regulate protein turnover. HOTTIP RNA not only recruits WDR5 to specific genes, but also stabilizes WDR5 by preventing proteasomal degradation. We define the mechanism of this novel mode of lincRNA function, and demonstrate its importance for WDR5 function in chromatin regulation and embryonic stem cell self renewal. 66 3.3 Results HOTTIP lincRNA blocks WDR5 protein turnover In the course of studying the HOTTIP-WDR5 interaction, we observed a surprising effect of HOTTIP lincRNA on WDR5 stability. Co-expression of increasing levels of HOTTIP lincRNA with constant levels of a FLAG-WDR5 expression construct in 293T human embryonic kidney cells led to increased levels of FLAGWDR5 protein as judged by immunoblot, without a concordant effect on the FLAGWDR5 mRNA levels (Figure 3.1A). In contrast, co-expression with the PRC2associated lincRNA HOTAIR did not affect FLAG-WDR5 protein levels, and expression of mRNA encoding either LacZ or eGFP (data not shown) caused a slight decrease in FLAG-WDR5 protein. These results suggest that HOTTIP lincRNA increases FLAG-WDR5 protein levels via a post-transcriptional mechanism. Next, we performed pulse-chase analysis to determine the effects of HOTTIP on WDR5 protein turnover. Addition of cycloheximide to stop protein translation after transient transfection showed that FLAG-WDR5 is turned over with a half life of ~8 h in control conditions, as previously described in HCT116 cells (Nakagawa and Xiong, 2011) (Figure 3.1B-D), and in concordance with our later experiments in embryonic stem cells. WDR5 turnover is mediated by the proteasome as it is blocked by proteasome inhibitor MG-132. Notably, co-expression of HOTTIP lincRNA significantly extended the half life of FLAG-WDR5 to greater than 16 hours, demonstrating that HOTTIP RNA prevents turnover of FLAG-WDR5. In contrast, HOTAIR lincRNA or LacZ mRNA did not increase WDR5 protein half life. Cycloheximide treatment did not affect transcription of HOTTIP lincRNA or FLAGWDR5 mRNA, as seen by qRT-PCR (Figure 3S1A). In contrast to ectopically expressed WDR5 from transient transfection, endogenous WDR5 or stably transfected WDR5 displayed a very long half life (>16 hours) in 293T cells, HeLa cells, and human foreskin fibroblasts (data not shown), in concordance with previously published data (Schwanhausser et al., 2011). In addition, endogenous WDR5 protein levels were not affected by HOTTIP expression or exogenous WDR5 expression (data not shown). Upon transfection in 293T cells, two separate populations of WDR5 67 appear to exist: (i) endogenous WDR5 already stabilized by endogenous HOTTIP-like lincRNAs (more on this below); and (ii) ectopically expressed WDR5 protein that require lincRNA stabilization. We reasoned that the ability of HOTTIP to increase WDR5 protein half-life may also prolong WDR5 chromatin occupancy, independently of the ability of HOTTIP to recruit WDR5 to specific target genes. To test this idea, we utilized an established reporter cell line, in which GAL4 binding sites driving a luciferase gene have been stably integrated into 293T cells (Wysocka et al., 2005) (Figure 3.1E). Coexpression of HOTTIP lincRNA with WDR5 fused to GAL4 DNA binding domain (GAL4-WDR5) substantially increased GAL4-WDR5 occupancy on reporter gene chromatin that further amplified over time, as measured by chromatin immunoprecipitation (ChIP) (Figure 3.1E). In contrast, vector- or HOTAIRexpressing cells showed less GAL4-WDR5 occupancy that quickly plateaued. Analysis of luciferase activity confirmed that HOTTIP significantly increased reporter gene output (Figure 3.1F). Because the GAL4 DNA binding domain also acts as a nuclear localization signal (Chan et al., 1998), it is unlikely that the HOTTIP effect occurs through altering the nuclear localization of GAL4-WDR5. These results suggest that the ability of HOTTIP lincRNA to prolong WDR5 protein half life extends to the chromatin-bound pool, which increases WDR5 chromatin occupancy at a defined locus and boosts gene expression. 68 69 Figure 3.1. HOTTIP RNA overexpression stabilizes exogenous FLAG-WDR5. A. Increased HOTTIP RNA expression caused increased levels of exogenous FLAGWDR5 at the protein level. Similar effects were not seen with control HOTAIR or LacZ overexpression. Top: Representative western blot. Bottom: Average fold western blot density quantitation normalized to FLAG-WDR5 RNA levels. B-D. HOTTIP RNA overexpression extended FLAG-WDR5 half-life during cycloheximide treatment. Similar effects were not seen with vector only, HOTAIR, or LacZ control conditions. B. Representative western blots. Hours post cycloheximide indicated above. MG: MG132 treated sample. C. Average WDR5 western blot density quantitation, normalized to B actin. D. RT-PCR of samples at T=0. E-F. HOTTIP RNA expression potentiated GAL4-WDR5-mediated activation of chromosomally integrated luciferase by increasing protein occupancy on chromatin. E. Top: Schematic of chromosomally integrated luciferase and ChIP qPCR primer locations. Bottom: HOTTIP RNA increased GAL4-WDR5 occupancy on chromatin over time, as seen by GAL4 ChIPqPCR. Similar results were not seen with vector only or HOTAIR controls.F. HOTTIP RNA expression increased GAL4-WDR5-mediated expression of luciferase. Similar effects were not seen with control vector only or HOTAIR overexpression. 70 HOTTIP RNA stabilizes WDR5 at a step after poly-ubiquitination We next defined the step that lincRNA impacts the ubiquitin-proteasome pathway. A series of enzymes (E1, E2, E3) activate and conjugate ubiquitin to target proteins, after which the poly-ubiquitin chains are then recognized by the proteasome for protein degradation (Muratani and Tansey, 2003). We co-expressed epitope-tagged ubiquitin, WDR5, and HOTTIP followed by high stringency immunoprecipitation to map the effect of HOTTIP on WDR5 ubiquitination (Figure 3.2A). Surprisingly, HOTTIP lincRNA quantitatively increased the levels of both un-ubiquitinated and poly-ubiquitinated forms of WDR5 protein. The effect of HOTTIP lincRNA is similar to prolonged incubation with MG132, which causes accumulation of substrate proteins bearing poly-ubiquitin chains. We also used an in vitro ubiquitination assay with purified E1, E2, and the E3 ligase CUL4B (which generates ubiquitin lysine 48 linkages), and found that HOTTIP does not interfere with CUL4B mediated polyubiquitination (Figure 3S1B). Both in vivo and in vitro studies show that HOTTIP does not block WDR5 ubiquitination. Rather, the accumulation of ubiquitinated WDR5 with HOTTIP expression suggests stabilization at a step after protein ubiquitination. Poly-ubiquitinated proteins targeted for degradation are unfolded and proteolyzed by the proteasome. Thermodynamic stabilization of ubiquitinated proteins, e.g. by strong ligand-protein interactions, can forestall proteasomal degradation, presumably by increasing the energetic barrier for protein unfolding that is required to thread the protein into the proteasome (Johnston et al., 1995). To test whether HOTTIP binding thermodynamically stabilizes WDR5, we conducted differential scanning fluorimetry (DSF) experiments (Niesen et al., 2007) (Figure 3.2B-C). Purified WDR5 protein was incubated with HOTTIP RNA in the presence of the hydrophobic dye SYPRO orange, and then slowly heated. Protein denaturation was then detected by SYPRO orange binding to newly exposed core hydrophobic residues, and the melting temperatures (Tm) were approximated by the first derivative maximum of fluorescence. Whereas RNA alone did not cause any signal (data not shown), WDR5 alone demonstrated a bimodal denaturation pattern (Figure 3.2B). Previous 71 analysis of DSF experiments has shown that the first peak likely defines the true melting temperature, whereas the second peak represents protein aggregation post denaturation (Kopec and Schneider, 2011). Incubation of HOTTIP increased the WDR5 Tm by 5.1°C (p=0.002, Figure 3.2C). In contrast, addition of the nonbinding lincRNA HOTAIR increased the Tm by only 1.65°C (p=0.058). These results demonstrate that HOTTIP RNA binds WDR5 to physically stabilize the protein and prevent unfolding. Indeed, thermodynamic stability may be a common mechanism by which other nucleic acids can inhibit proteasomal processing of cognate protein partners (Coppotelli et al., 2011; Sorokin et al., 2005). 72 Figure 3.2. HOTTIP RNA thermodynamically stabilizes WDR5 and does not affect ubiquitination. A. HOTTIP RNA overexpression did not reduce in vivo ubiquition of WDR5. Left: FLAG-WDR5 was immunoprecipitated, and ubiquitinated FLAGWDR5 was identified using anti-HA antibody. Right: FLAG-ubiquitin was immunoprecipitated, and ubiquitinated V5-WDR5 was identified. B-C. Differential scanning fluorimetry using purified GST-WDR5 and in vitro transcribed RNAs revealed HOTTIP RNA thermodynamically stabilizes GST-WDR5, compared to no RNA or HOTAIR controls. B. Representative first derivative fluorescence plots. Arrows indicate melting temperatures, as seen as initial first derivative maximums. C. Average melting temperature of GST-WDR5 when incubated with indicated RNAs. 73 A specific domain of HOTTIP confers WDR5 stabilization To identify the region(s) of HOTTIP lincRNA that bind WDR5 protein, we conducted in vivo native RNA immunoprecipitation (RIP) experiments (Figure 3.3A). We found that while full length HOTTIP consistently bound purified WDR5 in vitro, in vitro transcribed HOTTIP fragments behaved with substantial batch-to-batch variation that was not seen in vivo, perhaps due to limitations of RNA folding in vitro (data not shown). In RIP experiments, we co-expressed equal levels of successively smaller HOTTIP RNA fragments with FLAG-WDR5 in 293T cells, immunoprecipitated FLAG-WDR5, and quantified retrieval of RNAs using the same qRT-PCR primer for either the 5’ or 3’ region (Figures 3S2, 3S3). To further prevent biases, all RIP experiments were conducted in parallel with a positive control full length HOTTIP RNA for normalization. Negative control RNAs (HOTAIR, U1), no reverse transcription control, and control IgG immunoprecipitations revealed very little or no enrichment (data not shown). We found two regions, Frag A (bases 1-1659) and Frag D (1953-3760), that displayed strong binding affinity to WDR5 (Figure 3S2). Because HOTTIP bases 1953-3760 demonstrated the strongest binding affinity to WDR5, we used further deletion mapping to narrow the binding site to a 500 nt region of HOTTIP, termed Frag D.3 (bases 1953-2453, Figure 3S3). Because two distinct regions of HOTTIP bind WDR5, we also determined the valency of protein:RNA interaction. Theoretically, each HOTTIP region may bind a distinct WDR5 molecule, in which HOTTIP would act as an “RNA scaffold” to multiplex proteins (Tsai et al., 2010). Alternatively, one HOTTIP molecule may bind a single WDR5 molecule via multiple contacts. To distinguish between these possibilities, we coexpressed FLAG- and V5- tagged WDR5 in 293T cells, and conducted FLAG immunoprecipitation in native conditions (Figure 3S2I). HOTTIP expression increased both V5-WDR5 and FLAG-WDR5 protein levels, demonstrating that both forms of WDR5 were bound to and stabilized by HOTTIP RNA. Yet, FLAG IP did not retrieve V5-WDR5 with or without HOTTIP addition. These results suggest that each HOTTIP RNA binds to and stabilizes a single WDR5 protein by multiple contacts. 74 Since full length HOTTIP can stabilize WDR5, we next examined whether minimal WDR5 binding domains of HOTTIP RNA are sufficient to stabilize WDR5. Cycloheximide chase experiments revealed that FLAG-WDR5 turnover was not affected by any HOTTIP fragment tested, including either of the strong binding regions (Figure 3.3B-C). Thus, strong binding by one HOTTIP fragment appears to be insufficient to stabilize WDR5, suggesting additional RNA regions are required. Similarly, Frag D was not sufficient to increase the Tm of WDR5 by DSF (data not shown). One possible explanation is that a tandem combination of strong WDR5binding RNA sequences may be required for WDR5 stabilization. Alternatively, a WDR5-binding RNA motif may need to be placed in the context of a long RNA in order to function. To distinguish between these possibilities, we designed two fusion RNAs (Figure 3.3A). The first was a duplication of the high affinity HOTTIP fragment D, thus representing two tandem strong WDR5-binding regions. The second fusion RNA was the minimal high affinity binding fragment D.3 fused to the 3’ end of LacZ mRNA, thus representing a strong WDR5-binding region fused to a nonspecific long RNA. Surprisingly, both chimeric RNAs were able to stabilize WDR5, albeit with slightly less potency compared to full length HOTTIP (Figure 3.3B-C). Moreover, the LacZ-FragD.3 fusion could also potentiate GAL4-WDR5-mediated reporter gene expression, whereas LacZ or FragD.3 alone could not (Figure 3.3D). Thus, lincRNA-mediated stabilization of WDR5 requires a specific WDR5-binding RNA domain in the context of a nonspecific long RNA sequence. 75 Figure 3.3. Full length HOTTIP RNA stabilizes WDR5 through specific interactions. A. Schematic of analyzed HOTTIP fragments and fusion RNAs. HOTTIP bases 3731659 (horizontal shading) and 1953-2453 (diagonal shading) demonstrated increased binding affinity, as analyzed by cell-based native RNA immunoprecipitation. B-C. 76 HOTTIP fragment overexpression was insufficient to stabilize FLAG-WDR5 protein with cycloheximide treatment. In contrast, WDR5-binding HOTTIP fragment fusion RNAs was sufficient to stabilize WDR5. B. Representative western blots. Hours post cycloheximide indicated above. MG: MG132 treated sample. C. Average western blot density quantitation depicted by heat map, normalized to B actin. Full length HOTTIP data same as Figure 3.1. D. HOTTIP region 1953-2453 (Frag D.3) fused to LacZ moderately potentiated GAL4-WDR5-mediated expression of luciferase. Similar effects were not seen with LacZ or HOTTIP bases 1953-2453 (Frag D.3) overexpression. Full length HOTTIP data same as Figure 3.1. 77 Discovering the RNA binding interface on WDR5 To determine the HOTTIP RNA binding site, we next performed alanine scanning mutagenesis of WDR5, guided by its crystal structure (Avdic et al., 2011; Odho et al., 2010; Trievel and Shilatifard, 2009) (Figure 3.4A-B). WDR5 is a barrelshaped protein with several charged clefts on its surface, several of which are known to mediate protein-protein interactions with MLL, histone H3 tail, or RbBP5 (Figure 3.4A, 3S4A). We generated 19 WDR5 point mutants, expressed them in E. coli as GST-fusion proteins, and purified them to homogeneity (Figure 3S4B). Four out of nineteen mutants significantly reduced the ability to retrieve HOTTIP lincRNA in vitro: Y228A, L240A, K250A, and F266A. These WDR5 mutations defined a cleft between blades 5 and 6, partially encompassing the same surface previously described to bind RbBP5 amino acids 371-381 (Avdic et al., 2011; Odho et al., 2010). Thus, a focal binding site defines the interaction between WDR5 and HOTTIP. To confirm that HOTTIP binds an overlapping WDR5 protein surface as RbBP5, we preincubated wild type GST-WDR5 with an excess of RbBP5 peptide (amino acids 371381) or control H3K4me3 peptide (amino acids 1-20), and then assayed for HOTTIP binding (Figure 3.4C). Whereas addition of H3K4me3 peptide had no effect, preincubation with RbBP5 peptide prevented HOTTIP binding to WDR5, thus confirming the shared binding cleft. To verify the HOTTIP binding site in living cells, we conducted RIP experiments with select WDR5 mutants in 293T cells (Figure 3.4D). Immunoblot analysis confirmed the expected interactions of wild type WDR5 with MLL complex proteins (Figure 3.4D). As previously described, the D107A mutation affected MLL1 interactions (Song and Kingston, 2008), and the K250A mutation reduced RbBP5 binding (Avdic et al., 2011; Odho et al., 2010). Consistent with the direct in vitro binding assay, both K250A and F266A mutations fully abrogated WDR5 binding to HOTTIP in vivo. In contrast, mutations at D107A and R181A showed minimal effects on WDR5-HOTTIP interactions. K250A and F266A did not affect the accumulation of HOTTIP lincRNA (Figure 3S4C). Taken together, the in vitro and in vivo analyses demonstrate that HOTTIP RNA binds WDR5 through a specific binding pocket. 78 Figure 3.4. HOTTIP binds and stabilizes WDR5 through the same binding site as RbBP5. A. Crystal structure of WDR5 (PDB:3P4F, visualized with PyMol) revealed that mutations abrogating HOTTIP RNA binding align along a cleft between blades 5 and 6, opposite the H3K4/MLL binding site. This binding surface is the same as that for RbBP5. Top: Schematic of tested and HOTTIP binding mutations. Bottom: Magnification of HOTTIP binding cleft. Yellow: MLL peptide. Blue: RbBP5 peptide. B. qRT-PCR results of indicated GST-WDR5 mutants tested by in vitro assay for binding to HOTTIP RNA or control Histone 1H2BG RNA.C. RbBP5 peptide competition assay indicates that RbBP5 amino acids 371-381 can fully compete HOTTIP binding to WDR5. Wild type data same as in Figure 3.4B. D. Average fold pulldown of full length HOTTIP by cell-based native RNA immunoprecipitation of select WDR5 mutants. Top: qRT-PCR results of RNA immunoprecipitation of FLAGWDR5 mutants. All values were normalized to input, then to FLAG pulldown of wild type and positive control D107A. Negative RNA controls (U1, HOTAIR) and reaction without reverse transcription (-RT) showed minimal enrichment (data not shown). Bottom: Representative western blots of immunoprecipitations. 79 A selective WDR5 mutant reveals the importance of lincRNA binding on WDR5 half life and gene activation Because HOTTIP binding blocks WDR5 turnover, we tested whether HOTTIP can stabilize select WDR5 mutants with cycloheximide chase experiments (Figure 3.5A-C). Whereas without HOTTIP expression all WDR5 mutants displayed short half lives, WDR5 D107A and WDR5 R181A, two mutants that do bind HOTTIP, were protected from turnover by HOTTIP co-expression. In contrast, WDR5 K250A and F266A, both of which cannot bind HOTTIP, could no longer be stabilized by HOTTIP co-expression. Thus, binding to HOTTIP lincRNA is apparently required for WDR5 stabilization. To pinpoint the functional consequences of a selective lincRNA-binding mutation of WDR5, we further analyzed WDR5 F266A. In contrast to the other HOTTIP binding mutations, WDR5 F266A is solely defective in HOTTIP binding in vitro and in vivo, but without any defects in binding MLL complex subunits RbBP5 or MLL1 (Figure 3.4D). Isothermal calorimetry confirmed that F266A minimally affected the affinity of WDR5 for RbBP5 peptide (WDR5 WT KD = 1.25 + 0.06 µM, WDR5 F266A Kd= 5.13 + 0.49 µM). We reasoned that the F266A mutation offered an experimental strategy to test the requirement of lincRNA binding for WDR5 function. Preliminary RIP-seq and CLIP-seq experiments have revealed multiple RNAs that bind WDR5 (R. Flynn and Y.W.Y., unpublished observation); it would be challenging to knock down all of these RNAs at once to determine the impact on WDR5 function. However, the F266A mutation selectively abrogates the ability of WDR5 to interact with HOTTIP (and possibly other lincRNAs), allowing examination of the effects of lincRNA binding to WDR5. To test the effects of the F266A mutation on gene activation, we transfected increasing amounts of plasmids encoding GAL4-WDR5 WT or GAL4-WDR5 F266A to activate luciferase reporter gene expression (Figure 3.5D). Even in the absence of exogenously expressed lincRNA, WDR5 F266A showed a severe defect in luciferase gene activation, requiring >100 fold transfected plasmid compared with wild type to achieve similar reporter gene activity. We infer that some endogenous HOTTIP-like 80 activating lincRNAs may bind to and stabilize GAL4-WDR5, and the lack of RNA binding severely reduces WDR5 function. Moreover, HOTTIP could no longer potentiate activation by GAL4-WDR5 F266A (Figure 3.5E). Thus, even though the WDR5 F266A mutation can bind the full MLL complex, the inability to bind lincRNAs strongly compromises WDR5 function in activating gene expression. 81 82 Figure 3.5. HOTTIP binding mutations prevent HOTTIP mediated stabilization of WDR5, as well as show decreased ability to activate target genes in 293T cells. A-C. HOTTIP binding mutations abrogated HOTTIP-mediated stabilization of WDR5 during cycloheximide treatment. A-B. Representative western blots without (A) and with (B) HOTTIP co-expression. Hours post cycloheximide indicated above. MG: MG132 treated sample. C. Average western blot density quantitation visualized as heat map, normalized to B actin. Wild type WDR5 data same as Figure 3.1. D-E. WDR5 F266A is unable to activate luciferase expression. D. GAL4-WDR5 F266 was defective in activating luciferase expression, as seen in luciferase titration tests.E. HOTTIP could not potentiate GAL4-WDR5 F266A activation of luciferase gene expression. Wild type WDR5 data same as Figure 3.1. 83 lincRNA binding to WDR5 is essential for H3K4 methylation and embryonic stem cell self renewal WDR5 has been recently shown to be required to maintain H3K4 trimethylation (H3K4me3) in mouse embryonic stem cell (ESC) genes for pluripotency and self renewal (Ang et al., 2011). To test whether endogenous WDR5 requires lincRNA-mediated stabilization, we created “rescue” ESC lines that replace endogenous mouse WDR5 with either wild type human WDR5 or F266A mutant WDR5 (Figure 3.6A). ESCs were infected by lentiviruses containing two tandem gene expression cassettes (Ang et al., 2011). The first cassette constitutively expresses a highly efficient shRNA to repress endogenous mouse WDR5. WDR5 deficiency is rescued by the second cassette, in which human WDR5 wild type (WT) or human WDR5 F266A linked to GFP is expressed under control of a doxycycline (dox)inducible promoter. In the presence of dox, the ability of WDR5 mutant to support ESC self renewal can be compared; upon dox withdrawal, the half-life of the mutant protein and its regulatory impact are further revealed. GFP+ cells were sorted and cultured for 4 days in the presence (+dox) and absence (-dox) of doxycycline, and then analyzed by western blot for human WDR5 and global H3K4me3 levels (Figure 3.6B). Even though both WDR5 WT and WDR5 F266A ESCs were sorted using the same fluorescence parameters, slightly more WDR5 WT protein was present when compared with WDR5 F266A, suggesting that the WDR5 WT protein is more stable. Indeed, WDR5 mRNA levels between the WT and F266A forms were similar (data not shown). In WDR5 WT ESCs, promoter shutoff of WDR5 resulted in ~50% reduction of H3K4me3 (Lanes 1 and 2), as previously reported (Ang et al., 2011). Interestingly, ESCs harboring WDR5 F266A displayed a global 50% reduction of H3K4me3 (Lane 3), although WDR5 F266A protein was expressed. In addition, promoter shutoff of WDR5 F266A caused a drastic >80% reduction of H3K4me3 levels (Lanes 3 and 4). Thus, the F266A mutation resembles a WDR5 loss of function, and exhibits a striking inability to maintain global H3K4me3 levels over time. We reasoned that because WDR5 F266A presumably cannot bind endogenous 84 HOTTIP-like lincRNAs in ESCs, WDR5 F266A is more unstable, thus causing both reduced H3K4me3 levels and faster H3K4me3 decay. To test this hypothesis, we conducted pulse chase experiments using doxycycline withdrawal to shut off FLAGhWDR5 transcription, and followed the fate of pre-existing wild type or F266A WDR5 protein in the nucleus and cytoplasm (Figure 3.6C). Different cullin ring ligases can ubiquitinate WDR5, depending on cell type and whether the WDR5 protein is cytoplasmic (CUL4A) or nuclear (CUL4B) (Nakagawa and Xiong, 2011). Indeed, we found that in the cytoplasm, both WDR5 WT and F266A have equally short half lives of less than 12 hours. In contrast, nuclear WDR5 WT persists for up to 24 hours, but WDR5 F266A is turned over at least 6 hrs sooner (Figure 3.6C). The selective instability of WDR5 F266A in the nucleus is apparently sufficient to deplete F266A from the nuclear and chromatin fractions compared to WT (Figure 3.6D, E). The alteration in cellular distribution of F266A can be explained by differential stability in nuclear vs. cytoplasmic pools, or alternatively, by a role of F266 in directly controlling nuclear localization. We believe that the former is the correct explanation based on direct pulse chase data (Figure 3.6C) and also on the fact that fusion of F266A to a strong nuclear localization signal (GAL4 DNA binding domain) did not rescue its function (Figure 3.5D). Loss of lincRNA binding also impacts the ability of WDR5 to promote ESC self renewal. (Figure 3.6F-G). In WDR5 WT ESCs, promoter shut-off caused a ~50% reduction in the number of alkaline phosphatase-positive colonies, a specific indicator of ESC state. Notably, F266A WDR5 ESCs grown in the presence of doxycycline displayed a similar ~50% reduction in colony number. By morphology, both WDR5 WT ESCs –dox and WDR5 F266A ESCs +dox formed similar small, partially differentiated colonies. Doxycycline withdrawal in WDR5 F266A ESCs caused a >90% reduction in number of colonies, with the majority of cells forming clusters of fully differentiated alkaline phosphatase-negative cells. WDR5 directly binds to the chromatin of key genes required for in ESC self renewal (Ang et al., 2011). ESCs expressing WDR5 F266A have significantly reduced mRNA levels of pluripotency regulators Oct4, Nanog, Sox2, and Esrrb (Figure 3.6H). Genome-wide expression 85 profiling further confirmed that WDR5 F266A was unable to sustain the gene expression program associated with ESC self renewal and pluripotency, and instead allowed ectopic expression of genes indicative of ectodermal and mesodermal lineages (Figure 3.7A). Collectively, these data suggest that the ability to bind endogenous lincRNAs are likely essential for WDR5 in maintaining H3K4me3, ESC gene expression, and ESC self renewal. 86 Figure 3.6. WDR5 F266A mutation decreases protein stability and causes defects in mouse embryonic stem cell self renewal. A. Schematic of lentiviral vectors, modified from (Ang et al., 2011). B. Western blot 4 days after doxycycline removal. C. WDR5 F266A mutation decreased protein stability in the nucleus after doxycycline withdrawal. Both WDR5 WT and WDR5 F266A were similarly unstable in the cytoplasmic fraction. D. WDR5 F266A was defective in nuclear accumulation, compared with WDR5 WT. E. WDR5 F266A reduced chromatin association, as seen in chromatin isolation experiments. F-G. Alkaline phosphatase staining and morphology of embryonic stem cell colonies after 6 days growth in conditions specified. WDR5 F266A demonstrated reduced alkaline phosphatase positive colonies 87 with increased differentiation compared with wild type. F. Representative wells and colony morphology. Scale bar represents 200 µm. G. Quantitation of colonies per well. H. Loss of WDR5 or WDR5 F266A cause reduction of ESC pluripotency marker expression by qRT-PCR. 88 Figure 3.7. WDR5 F266A causes loss of self renewal genes and increased expression of differentiation genes. A. Microarray data reveal that both WDR5 F266A +dox and WDR5 WT -dox displayed loss of self renewal genes. WDR5 F266A +dox further demonstrated increased expression of ectodermal and mesodermal markers. B. Model for HOTTIP-mediated switch of protein turnover to activate target genes. Expressed lincRNAs bind the WDR5 protein, causing thermodynamic stability to allow protein chromatinization, MLL complex assembly, and methylase activity for target gene activation. Without HOTTIP or HOTTIP-like lincRNA binding, WDR5 cannot associate effectively with chromatin and is rapidly degraded. 89 3.4 Discussion RNA as protein timers By examining the interaction between the HOTTIP lincRNA and the WDR5 protein, we have identified a novel mechanism by which a lincRNA regulates the turnover of its cognate protein partner (Figure 3.7B). The ability of HOTTIP to stabilize WDR5 is notable for several reasons. First, the interaction between HOTTIP and WDR5 is strikingly specific. Cells are full of diverse species of RNAs; some, such as ribosomal RNAs and snoRNAs, are very abundant. But the vast majority of cellular RNAs are not capable of stabilizing WDR5, and HOTTIP co-expression is necessary to stabilize ectopic WDR5. Second, the magnitude of protein stabilization is significant. It is intriguing to consider that HOTTIP more than doubled the half life of WDR5 protein such that WDR5 is not detectably degraded after 16 hours, which starts to approach the time scale of an entire cell division cycle (especially in ESCs). These results suggest that in addition to their roles in recruiting protein complexes to specific genomic loci, some lincRNAs, like HOTTIP, can regulate the longevity of their interacting protein complexes. As we showed, such an interaction can increase the level and lifespan of chromatin occupancy, which is likely to have important impacts on the transmission of chromatin states across cell generations, a critical aspect of epigenetic regulation (Rando and Chang, 2009). Hundreds of lincRNAs have been recently reported to bind to multiple readers, writers, and erasers of chromatin marks (Guttman et al., 2009; Khalil et al., 2009; Zhao et al., 2010); one or more of these lincRNAs may also control their partner proteins’ temporal activity by regulating protein turnover, possibly with different gradations of half lives. More generally, as there is no intrinsic barrier to this mechanism operating on proteins in other cellular compartments, many long RNAs may modulate the half lives of RNA binding proteins (RBPs), a large class of regulators that outnumbers transcription factors (Cook et al., 2011). Coupled with the well-known capacity of microRNAs to inhibit protein synthesis, this mechanism of lincRNA-mediated protein stabilization points to the broad scope of noncoding RNAs in shaping the proteome. Our experiments uncovered several unexpected features in the pathway of 90 lincRNA-mediated protein stabilization. We found that direct RNA-protein interaction in the context of a long RNA is important for WDR5 stabilization. The requirement for a long RNA context is correlated with the ability of RNA to increase WDR5 folding energy; while interaction with full length HOTTIP increased the Tm of WDR5 by DSF, the smaller WDR5 binding domain of HOTTIP did not. The requirement of a specific RNA domain within a long RNA context is consistent with recent work demonstrating the importance of short conserved RNA sequences in the conserved functions of apparently divergent lincRNAs (Ulitsky et al., 2011). As we were able to confer protein stabilizing activity to LacZ mRNA by simply grafting on a HOTTIP domain, it is likely that other RNAs, including mRNAs, may similarly regulate the stability and half-life of their cognate RNA-binding proteins. In addition, we found that HOTTIP does not block WDR5 poly-ubiquitination, but appears to block protein turnover at a subsequent step. Thus, the consequence of lincRNA-mediated protein stabilization is the accumulation of poly-ubiquitinated RNA binding protein that resists degradation. Our findings may be applicable to understanding development and disease processes beyond epigenetic regulation. The accumulation of ubiquitinated protein inclusions is a hallmark of many neurodegenerative diseases, several of which are known to involve RNA binding proteins (Lagier-Tourenne et al., 2010; Polymenidou et al., 2011; Wang et al., 2008). While prior studies have focused on the impact of altered RBPs on RNA metabolism, our study suggests that specific RNAs may play important initiating roles by altering protein turnover. Protein interaction with nucleic acids, either DNA or RNA, has been documented to block proteolysis(Coppotelli et al., 2011; Sorokin et al., 2005). Proteasome activity is also regulated by accessory factors (Bader et al., 2011). The precise mechanism by which long RNA interactions inhibits proteolysis should be addressed in future studies. Requirement of a lincRNA interface in WDR5 for gene activation and ESC self renewal Guided by the crystal structure and comprehensive mutagenesis, we identified 91 a lincRNA binding cleft in WDR5. WDR5 is a multifunctional adaptor protein that can interact with multiple subunits of the MLL complex, such as MLL itself and RbBP5, as well as with regulators and substrates of the complex, such as HOTTIP and histone H3 arginine 2 symmetric demethylation (H3R2me2s) (Migliori et al., 2012). The MLL and H3R2me2s share the same binding pocket, and we found that HOTTIP and RbBP5 share a distinct binding pocket. While Oct4 has been reported to bind WDR5, we found no difference in the lincRNA binding mutant for its interaction with Oct4 (data not shown). Nonetheless, these multitudes of interactions suggest that molecular arrangements in this complex may be dynamic and intricate. Our coimmunoprecipitation experiments with concomitant HOTTIP overexpression demonstrate that RNA binding is compatible with the assembly of the intact MLLWDR5-Ash2L-RbBP5 complex. By comprehensive structure-guided mutagenesis, we identified surface residues of WDR5 that are specifically required for HOTTIP binding but do not impact interactions with other protein partners, such as MLL1 or RbBP5. WDR5 protein mutant for these specific residues can no longer be stabilized by HOTTIP, and are unable to activate transcription, suggesting that direct lincRNA-protein interactions are important for WDR5 stabilization and function. Replacement of endogenous WDR5 expression in mouse ESCs with an RNA binding mutant, WDR5 F266A, revealed that this interface is critical for the longevity of nuclear WDR5, as well as accumulation in both the nucleus and on chromatin. The loss of stabilized WDR5 leads to an inability to maintain global H3K4me3 level, loss of expression of pluripotency regulators, and loss of ESC self renewal. Interestingly, the impact of F266A mutation on H3K4me3 level is modest in steady state conditions, but becomes quite striking with transcriptional shutoff of WDR5. We interpret these results to indicate that WDR5 stability is important for the robustness of the active chromatin state, exemplified by H3K4me3. In the absence of stabilized WDR5, H3K4me3 becomes much more sensitive to fluctuations in the transcriptional levels of WDR5 and likely other subunits of the MLL complex. Thus, these results reveal a new facet in the connection between protein half lives and the temporal transmission of 92 epigenetic information via chromatin. As HOTTIP is not expressed in ESCs, our results with WDR5 F266A imply the existence of a family of ESC lincRNAs that stabilize WDR5. The recent discovery of numerous lincRNAs important for ESC self renewal and pluripotency provide plausible candidate for this concept (Chakraborty et al., 2012; Guttman et al., 2011). It is also possible that one or more protein-based regulators may interact with WDR5 in a F266 dependent fashion. Studies of other chromatin modification complexes have shown that one complex (e.g. PRC2) can interact with hundreds of lincRNAs, each of which regulates a subset of genes targeted by the complex (Khalil et al., 2009; Zhao et al., 2010). A similar model likely applies to MLL and other gene activating complexes as well. The MLL complex possesses multiple components with RNA-binding capabilities. While WDR5 binds HOTTIP and potentially other lincRNAs (Wang et al., 2011), MLL1 itself also has domains that bind RNAs in specific and nonspecific fashions (Bertani et al., 2011; Krajewski et al., 2005). Cyp33, an allosteric regulator of MLL, also contains a RNA recognition motif, and its regulation of MLL can be controlled in an RNA-dependent manner (Hom et al., 2010; Wang et al., 2010). These multiple RNA interaction domains may bind to different portions of lincRNAs either specifically or non-specifically, allowing a long RNA to “shrink wrap” the entire complex and stabilize it (Figure 3.7B). The presence of multiple RNA interaction domains may complicate efforts toward systematic identification of specific RNA species that regulate MLL activity (Krajewski et al., 2005). Our identification of specific mutations that abrogate RNA-mediated WDR5 stabilization may facilitate future efforts to discover additional examples of this new class of lincRNAs. 93 3.5 Materials and Methods Cell lines, antibodies, and vectors: HEK293T/17 cells were obtained from the American Type Culture Collection (ATCC). The HEK293T 5XUAS-Luciferase cell line has been previously described (Wysocka et al., 2005). V.6.5 mouse embryonic stem cells were grown on 1% gelatinized plates in Knockout DMEM supplemented with 15% FBS, 1% Glutamax (Invitrogen 35050), 1% nonessential amino acids, 1% penstrep, 0.2% betamercaptoethanol, LIF (1:10000, Millipore ESG 1107), and 2 µg/mL doxycycline (Sigma D9891). To maintain consistent levels of doxycycline, media was changed every two days. Antibodies used were: anti-FLAGM2 (Sigma F1804), anti-β actin (Abcam ab8227), anti-V5 (Sigma V8137), anti-HA (Cell Signaling 6E2 #2367), antiRbBP5 (Bethyl A300-109A), anti-MLL1-N (Bethyl A300-086A), anti-GAL4 (Santa Cruz sc-577), anti-H3K4me3 (Abcam ab8580), anti-β tubulin (Abcam ab6046), antiH3 (Abcam ab1791). Excepting chromatin immunoprecipitation experiments, all other immunoprecipitations were conducted using anti-FLAGM2 (Sigma A2220) or mouse IgG (Sigma A0919) agarose beads, or anti-FLAGM2 magnetic beads (Sigma M8823). Unless noted, all expression vectors were cloned into pcDNA3 or pcDNA3.1+backbones. qRT-PCR primers: HOTTIP: (F:CCTAAAGCCACGCTTCTTTG, R:TGCAGGCTGGAGATCCTACT 5’ HOTTIP (F:TCTTTCCAGGGAAACAGTGG, R:AACAGTGTGGACAGGGAAGG) 3’ HOTTIP (F:TATGGGTTGGGAGAGGGAGT, R:AGCACCTGTAGTTGCCCATT) Mid HOTTIP (F:CAAACTCCGTCCTCCAAAAC, R:CAGTGAAGAGCGATCAGTGG) HOTAIR (F:GGTAGAAAAAGCAACCACGAAGC, R:ACATAAACCTCTGTCTGTGAGTGCC) LacZ: (F:GTCGTTTGCCGTCTGAATTT, R:CCGCCACATATCCTGATCTT) 94 FLAG-WDR5: (F:GACTACAAAGACGATGACGACAA, R:TCCCAGCTTGTGACCAGATA) GAPDH (F:AGGTGGAGGAGTGGGTGTCGCTGTT, R:CCGGGAAACTGTGGCGTGATGG) U1 (F:ATACTTACCTGGCAGGGGAG, R:CAGGGGGAAAGCGCGAACGCA) mβactin (F:GCTGTATTCCCCTCCATCGTG, R:CACGGTTGGCCTTAGGGTTCAG) mEsrrb (F:GGGTAGAGCCCACTTGTTCA, R:AGGTAGCCTGGGTTTTTGCT) mNanog (F:TGGTCCCCACAGTTTGCCTAGTTC, R:CAGGTCTTCAGAGGAAGGGCGA) mOct4 (F:GTGGAGGAAGCCGACAACAATGA, R:CAAGCTGATTGGCGATGTGAG) mSox2 (F:CAGGAGAACCCCAAGATGCACAA, R:AATCCGGGTGCTCCTTCATGTG) ChIP qPCR primers: GAPDH (F:AGGTGGAGGAGTGGGTGTCGCTGTT, R:CCGGGAAACTGTGGCGTGATGG) Luc1 (F: ACCATAGTCCCGCCCCTA, R: AACAGTACCGGAATGCCAAG) Luc2 (F: CCGGTACTGTTGGTAAAATGG, R: AACCAGGGCGTATCTCTTCA) RNA mediated effects on WDR5 protein levels pcDNA3.1+FLAG-WDR5 and increasing amounts of indicated RNA expression vectors were transfected into 293T cells using Lipofectamine 2000 per manufacturer's instructions. pcdna3.1+ was added to normalize the total ng of transfected plasmid. 96 hours after transfection, cells were harvested into two aliquots. One aliquot was lysed in RIPA buffer (50 mM Tris-HCl pH 7.4, 1% NP40, 0.25% Nadeoxycholate, 450 mM NaCl, 1 mM EDTA, supplemented with cOmplete protease inhibitor (Roche)), and analyzed by western blot for protein expression. FLAG-WDR5 protein levels were normalized to β-actin protein levels using ImageJ software (NIH). 95 RNA was extracted from the other aliquot using TRIzol (Invitrogen) and Rneasy columns (Qiagen), treated with Turbo DNAfree (Ambion), and then analyzed by qRTPCR (Stratagene, Brilliant II SYBR Green). FLAG-WDR5 RNA levels were normalized to GAPDH RNA. Normalized protein levels were then divided by normalized RNA levels, and adjusted to 0 ng of RNA expression vector transfected. WDR5 cycloheximide chase pcdna3.1+FLAG-WDR5 and RNA expression vectors were transfected into 293T cells using Lipofectamine 2000. 24 hours after transfection, media was replaced with media containing 4 µg/mL cycloheximide (Sigma). To one sample, MG132 was also added to a final concentration of 10 µM (EMD Chemicals). Cells were harvested at indicated time points, lysed with RIPA buffer, and analyzed by western blot. Relative protein levels were visualized using Treeview software. GAL4-WDR5 Chromatin immunoprecipitation pCMX-GAL4-WDR5 and indicated RNA expression vectors were transfected into 10 cm plates of HEK293T 5XUAS-Luciferase cells using Lipofectamine 2000. 48 and 96 hours after transfection, cells were fixed in 1% formaldehyde (Thermo Pierce). Subsequent pellets were lysed, chromatin was extracted and immunoprecipitated using 1 µg antiGAL4 antibody, as previously described (Squazzo et al., 2006). Resulting ChIP DNA was analyzed by qPCR (Roche Sybr Green). Luciferase assay pCMX-GAL4-WDR5, pRL Renilla Luicferase plasmid and indicated RNA expression vectors were transfected into HEK293T 5XUAS-Luciferase cells using Lipofectamine 2000. 40-48 hours after transfection, cells were harvested and analyzed using the Dual-Luciferase Reporter Assay System (Promega). Luciferase readings were normalized to Renilla readings, and then normalized to no GAL4-WDR5 background control. 96 Ubiquitination assays For in vivo ubiquitination assays, indicated plasmids were cotransfected into 293T cells plated on 10 cm plates using Lipofectamine 2000. After 48h, cells were treated with final concentration 10 µM MG132, harvested after 16h, and snap frozen in liquid nitrogen. Cells were then lysed and immunoprecipitated using extremely high stringency protocols, as previously described (Yong et al., 2010). Cell pellets were resuspended in cold Empigen buffer (20 mM Tris pH 7.5, 500 mM NaCl, 2.5 mM MgCl2, 1% Empigen, 2 mg/mL Heparin), supplemented with RnaseOUT (1:100), 100 mM Nethylmaleimide (1:50 EMD Chemicals), and cOmplete proteinase inhibitor (1:50), and lysed with sonication. Lysates were cleared with centrifugation, then incubated with anti-FLAG M2 magnetic beads for 4 hours. Beads were washed 5 times with cold Empigen buffer, once with room temperature RSB100 buffer (20 mM pH 7.5, 100 mM NaCl, 2.5 mM MgCl2, 0.01% Triton X-100), washed twice more with empigen buffer, then once with room temperature reverse crosslink buffer (20 mM Tris pH 8.0, 500 mM NaCl, 1 mM EDTA, 10 mM βmercaptoethanol). After elution using FLAG peptide in reverse crosslink buffer, eluates were analyzed by western blot. In vitro ubiquitination assays were conducted as previously described (Nakagawa and Xiong, 2011) with indicated RNAs. Differential Scanning Fluorimetry (DSF) DSF experiments were conducted as previously described, with modifications (Niesen et al., 2007). GST-tagged WDR5 protein was expressed and purified from Escherichia coli as previously described (Smith and Johnson, 1988). 300 ng GSTWDR5 was incubated with T7 RNA polymerase in vitro transcribed RNA (Promega, 0, 600 ng, or 1800 ng RNA) in signal buffer (10 mM HEPES pH 7.6, 150 mM NaCl, SYPRO Orange 1:1000 (Invitrogen)), then incubated with a ramped temperature (1C/min) in a Lightcycler 480 instrument (Roche, set to Multicolor hybprobe, Red 610). Data was analyzed by calculating the first derivative between each set of points, graphed, and maximum peaks were noted. In case of multiple peaks, only the first 97 peak was recorded. P values were calculated using unpaired two tail Student’s T test. Native RNA Immunoprecipitation Native cell-based RNA immunoprecipitation was performed as previously described with modifications (Dignam et al., 1983). 293T cells were transfected with pcDNA3.1+FLAG-WDR5 and RNA expression plasmid. After 48-72 hours, cells were harvested by scraping in cold PBS, spun down, and pellets were then snap frozen in liquid nitrogen and stored at -80C. Cells were resuspended in Buffer A (10 mM HEPES pH 7.5, 1.5 mM MgCl2, 10 mM KCl, 0.5 mM DTT, 1.0 mM PMSF, supplemented with RnaseOUT (Invitrogen, 1:100)), and cell membranes were lysed with 0.25% NP40. After centrifugation, resulting nuclei were further lysed in Buffer C (20 mM HEPES pH7.5, 10% glycerol, 0.42M KCl, 4 mM MgCl2, 0.5 mM DTT, 1.0 mM PMSF supplemented with RnaseOUT (1:100)). Resulting whole cell lysates were immunoprecipitatied with mouse anti-FLAG M2 or control mouse IgG agarose beads, and washed four times with wash buffer (50 mM TrisCl pH 7.9, 10% glycerol, 100 mM KCl, 5 mM MgCl2, 10 mM β-mercaptoethanol, 0.1% NP40). To confirm immunoprecipitation, FLAG peptide-eluted protein-RNA complexes were analyzed by western blot. Coimmunoprecipitated RNA was extracted using TriZOL LS (Invitrogen) and Qiagen RNeasy columns, treated with TURBO DNAfree, and then analyzed by qRT-PCR (Brilliant II Sybr Green). Resulting values were defined as fraction of input, then normalized to the positive control sample. CoIP pulldown experiments were conducted as described above, with modifications. pcDNA3.1+FLAG-WDR5, pcDNA3.1+V5-WDR5, pcDNA3.1+HOTTIP and pcDNA3.1+ plasmids were transfected into 293T cells as indicated. Cells were lysed and immunoprecipitated as described above, and resulting pulldowns were analyzed by western blot. 98 In vitro GST pulldown and competition GST pulldown and competition assays were performed as previously described, with modifications (Wang, 2011). GST-tagged WDR5 wild type, mutants, and GST control proteins were expressed and purified from Escherichia coli as previously described (Smith and Johnson, 1988). GST proteins were bound to glutathione sepharose 4B (Amersham/GE Healthcare), blocked with 0.2 mg/mL BSA (Ambion) in 2X binding buffer (40 mM Hepes pH 7.6, 200 mM KCl) at 4C for 1 hour. For peptide competition assays, indicated peptides (Elim Biopharmaceuticals) were also added. 0.04 mg/mL heparin (1:1, Sigma H3149) was added to further block nonspecific RNA binding. Protein-bound beads were then incubated with folded T7-transcribed HOTTIP or histone 1H2BG mRNA for 45 minutes (Promega T7). Beads were washed twice with PB200 (20 mM Hepes pH 7.6, 200 mM KCl, 0.05% NP40) and once with DEPC-treated water to remove detergents. After resuspension in water, beads were directly used in qRT-PCR reactions to determine RNA levels. To confirm lack of degradation, bead-bound proteins were analyzed by Silver Stain Plus (Bio-Rad). Establishment of FLAG-tagged WDR5 Rescue Embyronic Stem Cells Lentiviral constructs were generated that that constitutively knock down mouse WDR5 while allowing expression of a doxycycline-inducible human WDR5. The rescue vector pLKO.tre (Ang et al., 2011) was modified by replacing the shRNAimmune mouse WDR5-cDNA with either wildtype or F266A mutated human WDR5. The resulting pLKO vectors and rtTA (from Marius Wernig, Stanford University) constructs were cotransfected with packaging plasmids (pLKO: second generation pCMV-dR8.2 dvpr and pCMV-VSVG| rTta: third generation pMDLg/pRRE, pCMVVSVG, pRSV-Rev) into 293T cells using Lipofectamine 2000 (Invitrogen). After 16 hours, media was changed. Viruses were harvested after 48 and 72 hours, pooled, and then clarified by centrifugation. Viruses were concentrated using Lenti-X concentrator, incubated with 8 µg/mL polybrene (Sigma H9268) and then used to infect V6.5 ESCs. GFP+ cells were sorted (Stanford Shared FACS Facility) and then cultured as described above. For RNA analysis, 5000 GFP+ cells were plated in media with or 99 without doxycycline, and then harvested after 6 days. For qRT-PCR analysis, resulting values were normalized to control β actin levels, and then normalized to WT+dox. Embryonic stem cell fractionation ESCs were fractionated as previously described (Tee et al., 2010), with modifications. To isolate nuclei and cytoplasm, Pellets were lysed in nuclei isolation buffer (NIB: 10 mM Tris pH7.5, 60 mM KCl, 15 mM NaCl, 1.5 mM MgCl2, 1 mM CaCl2, 250 mM Sucrose, 10% glycerol, and 0.1% NP40. Lysates were centrifuged at low speed to separate cytoplasm (supernatant) from nuclei (pellet). Cells were washed once with NIB without NP40, resuspended in NIB without NP40, and treated with Turbo Dnase (Ambion). EDTA was added to a final concentration of 10 mM to neutralize the Dnase. After centrifugation, the resulting washed pellet was resuspended in NIB, and NaCl was added to a final concentration of 500 mM to lyse the nuclei. For chromatin isolation, cells were lysed with cytoskeleton buffer (CSK: 10 mM PIPES KOH pH7, 100 mM NaCl, 300 mM sucrose, 3 mM MgCl2, 0.5% Triton X100), centrifuged, and the resulting pellet was washed twice more with CSK buffer. The washed pellet was resuspended in CSK buffer, and treated with Turbo Dnase to release chromatin-bound proteins. Alkaline phosphatase staining For colony forming assays, FLAG-tagged WDR5 rescue V6.5 ESCs were counted by flow cytometer, as well as hemocytometer, and 250 cells were plated onto 12 well plates with full media with or without doxycycline. After 6 days, cells were fixed with 1:1 methanol:acetone, then stained using Vector Blue (Vector Laboratories) per the manufacturer’s instructions. 100 Microarray analysis cDNA was synthesized, labeled, and hybridized to Affymetrix Mouse 430 2.0 arrays in biologic duplicates (Stanford Protein and Nucleic Acid Facility). Arrays were normalized by robust multi-array average (RMA), and probes that had an expression value ≥50 in at least 1 sample were filtered out. Gene expression with multiple probes were averaged. Heatmaps show mean-centered gene expression, based on previously described expression patterns (Ang, 2011). Microarray data are available at Gene Expression Omnibus (http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?token=xtydzkwwmqymitm&acc=GS E36513). 101 3.6 Supplementary Figures Figure 3S1. A. Cycloheximide treatment does not affect RNA levels. qRT-PCR of representative experiment of HOTTIP and Flag-WDR5 during cycloheximide treatment, normalized to GAPDH. B. HOTTIP overexpression does not affect in vitro ubiquitination of purified WDR5. 102 Figure 3S2. Two regions of HOTTIP bind WDR5. Successively smaller 5’ (A-C) and 3’ (E-F) fragments of HOTTIP RNA were analyzed by cell-based native RNA immunoprecipitation. A,D. Schematic of fragments. *: estimated region of 5’ and 3’ qPCR primers. B,E. Average fold pulldown of RNA fragments in Flag-WDR5 or representative IgG control immunoprecipitations by qRT-PCR, relative to input levels. 103 All values were normalized to Flag pulldown of full length HOTTIP.C,F. Average relative levels of input HOTTIP fragment RNA by qRT-PCR, relative to GAPDH. G. Representative western blots of immunoprecipitations. H. In vitro transcription of schematized HOTTIP fragments. I. Immunoprecipitation of Flag-WDR5 does not pulldown V5-WDR5 with HOTTIP overexpression, even though HOTTIP stabilizes both proteins. 104 Figure 3S3. HOTTIP bases 1953-2453 (Frag D.3) demonstrate equivalent binding affinity to bases 1953-3760 (Frag D). Successively smaller 5’ (A-C) and 3’ (E-F) fragments of HOTTIP RNA bases 1953-3760 (Frag D) were analyzed by cell-based native RNA immunoprecipitation. A,D. Schematic of fragments. Full length HOTTIP is provided as reference. *: estimated region of mid and 3’ qPCR primers. B,E. Average fold pulldown of RNA fragments in Flag-WDR5 or representative IgG control immunoprecipitations by qRT-PCR, relative to input levels. All values were normalized to Flag pulldown of HOTTIP fragment 1953-3760. C,F. Average relative levels of input HOTTIP fragment RNA by qRT-PCR, relative to GAPDH. G. Representative western blots of immunoprecipitations. H. In vitro transcription of selected schematized HOTTIP fragments not shown on Figure 3S2H. 105 Figure 3S4. A. Electrostatic surface contouring of the WDR5 crystal structure (PDB:3P4F, visualized with PyMol) reveal that mutations abrogating HOTTIP RNA binding define an electropositively charged region. Orange: WDR5 residues important for HOTTIP binding. Cyan: MLL peptide. Grey: RbBP5 peptide B. Silver stain of post-assay glutathione-bound GST-WDR5 mutants indicates no protein degradation or loading bias occurred during in vitro binding assay. C. Average levels of input HOTTIP RNA for cell-based native RNA immunoprecipitation assays of WDR5 mutants by qRT-PCR, relative to GAPDH. 106 3.7 References Ang, Y.S., Tsai, S.Y., Lee, D.F., Monk, J., Su, J., Ratnakumar, K., Ding, J., Ge, Y., Darr, H., Chang, B., et al. (2011). Wdr5 mediates self-renewal and reprogramming via the embryonic stem cell core transcriptional network. Cell 145, 183-197. Avdic, V., Zhang, P., Lanouette, S., Groulx, A., Tremblay, V., Brunzelle, J., and Couture, J.F. (2011). Structural and biochemical insights into MLL1 core complex assembly. Structure 19, 101-108. Bader, M., Benjamin, S., Wapinski, O.L., Smith, D.M., Goldberg, A.L., and Steller, H. (2011). A conserved F box regulatory complex controls proteasome activity in Drosophila. Cell 145, 371-382. Bertani, S., Sauer, S., Bolotin, E., and Sauer, F. (2011). The noncoding RNA Mistral activates Hoxa6 and Hoxa7 expression and stem cell differentiation by recruiting MLL1 to chromatin. Mol Cell 43, 1040-1046. Candeias, M.M., Malbert-Colas, L., Powell, D.J., Daskalogianni, C., Maslon, M.M., Naski, N., Bourougaa, K., Calvo, F., and Fahraeus, R. (2008). P53 mRNA controls p53 activity by managing Mdm2 functions. Nat Cell Biol 10, 10981105. Chakraborty, D., Kappei, D., Theis, M., Nitzsche, A., Ding, L., Paszkowski-Rogacz, M., Surendranath, V., Berger, N., Schulz, H., Saar, K., et al. (2012). Combined RNAi and localization for functionally dissecting long noncoding RNAs. Nat Methods. Chan, C.K., Hubner, S., Hu, W., and Jans, D.A. (1998). Mutual exclusivity of DNA binding and nuclear localization signal recognition by the yeast transcription factor GAL4: implications for nonviral DNA delivery. Gene Ther 5, 12041212. Chu, C., Qu, K., Zhong, F.L., Artandi, S.E., and Chang, H.Y. (2011). Genomic maps of long noncoding RNA occupancy reveal principles of RNA-chromatin interactions. Mol Cell 44, 667-678. Cook, K.B., Kazan, H., Zuberi, K., Morris, Q., and Hughes, T.R. (2011). RBPDB: a database of RNA-binding specificities. Nucleic Acids Res 39, D301-308. Coppotelli, G., Mughal, N., Marescotti, D., and Masucci, M.G. (2011). High avidity binding to DNA protects ubiquitylated substrates from proteasomal degradation. J Biol Chem 286, 19565-19575. Deal, R.B., Henikoff, J.G., and Henikoff, S. (2010). Genome-wide kinetics of nucleosome turnover determined by metabolic labeling of histones. Science 328, 1161-1164. Deng, X., and Meller, V.H. (2006). Non-coding RNA in fly dosage compensation. Trends Biochem Sci 31, 526-532. Dignam, J.D., Lebovitz, R.M., and Roeder, R.G. (1983). Accurate transcription initiation by RNA polymerase II in a soluble extract from isolated mammalian nuclei. Nucleic Acids Res 11, 1475-1489. Dion, M.F., Kaplan, T., Kim, M., Buratowski, S., Friedman, N., and Rando, O.J. (2007). Dynamics of replication-independent histone turnover in budding yeast. Science 315, 1405-1408. 107 Duncan, E.M., Muratore-Schroeder, T.L., Cook, R.G., Garcia, B.A., Shabanowitz, J., Hunt, D.F., and Allis, C.D. (2008). Cathepsin L proteolytically processes histone H3 during mouse embryonic stem cell differentiation. Cell 135, 284294. Gupta, R.A., Shah, N., Wang, K.C., Kim, J., Horlings, H.M., Wong, D.J., Tsai, M.C., Hung, T., Argani, P., Rinn, J.L., et al. (2010). Long non-coding RNA HOTAIR reprograms chromatin state to promote cancer metastasis. Nature 464, 1071-1076. Guttman, M., Amit, I., Garber, M., French, C., Lin, M.F., Feldser, D., Huarte, M., Zuk, O., Carey, B.W., Cassady, J.P., et al. (2009). Chromatin signature reveals over a thousand highly conserved large non-coding RNAs in mammals. Nature 458, 223-227. Guttman, M., Donaghey, J., Carey, B.W., Garber, M., Grenier, J.K., Munson, G., Young, G., Lucas, A.B., Ach, R., Bruhn, L., et al. (2011). lincRNAs act in the circuitry controlling pluripotency and differentiation. Nature 477, 295-300. Guttman, M., Garber, M., Levin, J.Z., Donaghey, J., Robinson, J., Adiconis, X., Fan, L., Koziol, M.J., Gnirke, A., Nusbaum, C., et al. (2010). Ab initio reconstruction of cell type-specific transcriptomes in mouse reveals the conserved multi-exonic structure of lincRNAs. Nat Biotechnol 28, 503-510. Hom, R.A., Chang, P.Y., Roy, S., Musselman, C.A., Glass, K.C., Selezneva, A.I., Gozani, O., Ismagilov, R.F., Cleary, M.L., and Kutateladze, T.G. (2010). Molecular mechanism of MLL PHD3 and RNA recognition by the Cyp33 RRM domain. J Mol Biol 400, 145-154. Hung, T., Wang, Y., Lin, M.F., Koegel, A.K., Kotake, Y., Grant, G.D., Horlings, H.M., Shah, N., Umbricht, C., Wang, P., et al. (2011). Extensive and coordinated transcription of noncoding RNAs within cell-cycle promoters. Nat Genet 43, 621-629. Johnston, J.A., Johnson, E.S., Waller, P.R., and Varshavsky, A. (1995). Methotrexate inhibits proteolysis of dihydrofolate reductase by the N-end rule pathway. J Biol Chem 270, 8172-8178. Khalil, A.M., Guttman, M., Huarte, M., Garber, M., Raj, A., Rivea Morales, D., Thomas, K., Presser, A., Bernstein, B.E., van Oudenaarden, A., et al. (2009). Many human large intergenic noncoding RNAs associate with chromatinmodifying complexes and affect gene expression. Proc Natl Acad Sci U S A 106, 11667-11672. Kim, T.K., Hemberg, M., Gray, J.M., Costa, A.M., Bear, D.M., Wu, J., Harmin, D.A., Laptewicz, M., Barbara-Haley, K., Kuersten, S., et al. (2010). Widespread transcription at neuronal activity-regulated enhancers. Nature 465, 182-187. Kopec, J., and Schneider, G. (2011). Comparison of fluorescence and light scattering based methods to assess formation and stability of protein-protein complexes. J Struct Biol 175, 216-223. Krajewski, W.A., Nakamura, T., Mazo, A., and Canaani, E. (2005). A motif within SET-domain proteins binds single-stranded nucleic acids and transcribed and supercoiled DNAs and can interfere with assembly of nucleosomes. Mol Cell Biol 25, 1891-1899. 108 Lagier-Tourenne, C., Polymenidou, M., and Cleveland, D.W. (2010). TDP-43 and FUS/TLS: emerging roles in RNA processing and neurodegeneration. Hum Mol Genet 19, R46-64. Meyer, C., Kowarz, E., Hofmann, J., Renneville, A., Zuna, J., Trka, J., Ben Abdelali, R., Macintyre, E., De Braekeleer, E., De Braekeleer, M., et al. (2009). New insights to the MLL recombinome of acute leukemias. Leukemia 23, 14901499. Migliori, V., Muller, J., Phalke, S., Low, D., Bezzi, M., Mok, W.C., Sahu, S.K., Gunaratne, J., Capasso, P., Bassi, C., et al. (2012). Symmetric dimethylation of H3R2 is a newly identified histone mark that supports euchromatin maintenance. Nat Struct Mol Biol 19, 136-144. Mito, Y., Henikoff, J.G., and Henikoff, S. (2007). Histone replacement marks the boundaries of cis-regulatory domains. Science 315, 1408-1411. Morey, C., and Avner, P. (2011). The demoiselle of X-inactivation: 50 years old and as trendy and mesmerising as ever. PLoS Genet 7, e1002212. Muratani, M., and Tansey, W.P. (2003). How the ubiquitin-proteasome system controls transcription. Nat Rev Mol Cell Biol 4, 192-201. Nakagawa, T., and Xiong, Y. (2011). X-linked mental retardation gene CUL4B targets ubiquitylation of H3K4 methyltransferase component WDR5 and regulates neuronal gene expression. Mol Cell 43, 381-391. Niesen, F.H., Berglund, H., and Vedadi, M. (2007). The use of differential scanning fluorimetry to detect ligand interactions that promote protein stability. Nat Protoc 2, 2212-2221. Odho, Z., Southall, S.M., and Wilson, J.R. (2010). Characterization of a novel WDR5binding site that recruits RbBP5 through a conserved motif to enhance methylation of histone H3 lysine 4 by mixed lineage leukemia protein-1. J Biol Chem 285, 32967-32976. Orom, U.A., Derrien, T., Beringer, M., Gumireddy, K., Gardini, A., Bussotti, G., Lai, F., Zytnicki, M., Notredame, C., Huang, Q., et al. (2010). Long noncoding RNAs with enhancer-like function in human cells. Cell 143, 46-58. Polymenidou, M., Lagier-Tourenne, C., Hutt, K.R., Huelga, S.C., Moran, J., Liang, T.Y., Ling, S.C., Sun, E., Wancewicz, E., Mazur, C., et al. (2011). Long premRNA depletion and RNA missplicing contribute to neuronal vulnerability from loss of TDP-43. Nat Neurosci 14, 459-468. Rada-Iglesias, A., Bajpai, R., Swigut, T., Brugmann, S.A., Flynn, R.A., and Wysocka, J. (2011). A unique chromatin signature uncovers early developmental enhancers in humans. Nature 470, 279-283. Rando, O.J., and Chang, H.Y. (2009). Genome-wide views of chromatin structure. Annu Rev Biochem 78, 245-271. Rinn, J.L., Kertesz, M., Wang, J.K., Squazzo, S.L., Xu, X., Brugmann, S.A., Goodnough, L.H., Helms, J.A., Farnham, P.J., Segal, E., et al. (2007). Functional demarcation of active and silent chromatin domains in human HOX loci by noncoding RNAs. Cell 129, 1311-1323. Schwanhausser, B., Busse, D., Li, N., Dittmar, G., Schuchhardt, J., Wolf, J., Chen, W., and Selbach, M. (2011). Global quantification of mammalian gene expression 109 control. Nature 473, 337-342. Smith, D.B., and Johnson, K.S. (1988). Single-step purification of polypeptides expressed in Escherichia coli as fusions with glutathione S-transferase. Gene 67, 31-40. Smith, E., Lin, C., and Shilatifard, A. (2011). The super elongation complex (SEC) and MLL in development and disease. Genes Dev 25, 661-672. Song, J.J., and Kingston, R.E. (2008). WDR5 interacts with mixed lineage leukemia (MLL) protein via the histone H3-binding pocket. J Biol Chem 283, 3525835264. Sorokin, A.V., Selyutina, A.A., Skabkin, M.A., Guryanov, S.G., Nazimov, I.V., Richard, C., Th'ng, J., Yau, J., Sorensen, P.H., Ovchinnikov, L.P., et al. (2005). Proteasome-mediated cleavage of the Y-box-binding protein 1 is linked to DNA-damage stress response. EMBO J 24, 3602-3612. Squazzo, S.L., O'Geen, H., Komashko, V.M., Krig, S.R., Jin, V.X., Jang, S.W., Margueron, R., Reinberg, D., Green, R., and Farnham, P.J. (2006). Suz12 binds to silenced regions of the genome in a cell-type-specific manner. Genome Res 16, 890-900. Tee, W.W., Pardo, M., Theunissen, T.W., Yu, L., Choudhary, J.S., Hajkova, P., and Surani, M.A. (2010). Prmt5 is essential for early mouse development and acts in the cytoplasm to maintain ES cell pluripotency. Genes Dev 24, 2772-2777. Trievel, R.C., and Shilatifard, A. (2009). WDR5, a complexed protein. Nat Struct Mol Biol 16, 678-680. Tsai, M.C., Manor, O., Wan, Y., Mosammaparast, N., Wang, J.K., Lan, F., Shi, Y., Segal, E., and Chang, H.Y. (2010). Long noncoding RNA as modular scaffold of histone modification complexes. Science 329, 689-693. Ulitsky, I., Shkumatava, A., Jan, C.H., Sive, H., and Bartel, D.P. (2011). Conserved function of lincRNAs in vertebrate embryonic development despite rapid sequence evolution. Cell 147, 1537-1550. Wang, K.C., and Chang, H.Y. (2011). Molecular mechanisms of long noncoding RNAs. Mol Cell 43, 904-914. Wang, K.C., Yang, Y.W., Liu, B., Sanyal, A., Corces-Zimmerman, R., Chen, Y., Lajoie, B.R., Protacio, A., Flynn, R.A., Gupta, R.A., et al. (2011). A long noncoding RNA maintains active chromatin to coordinate homeotic gene expression. Nature 472, 120-124. Wang, X., Arai, S., Song, X., Reichart, D., Du, K., Pascual, G., Tempst, P., Rosenfeld, M.G., Glass, C.K., and Kurokawa, R. (2008). Induced ncRNAs allosterically modify RNA-binding proteins in cis to inhibit transcription. Nature 454, 126130. Wang, Z., Song, J., Milne, T.A., Wang, G.G., Li, H., Allis, C.D., and Patel, D.J. (2010). Pro isomerization in MLL1 PHD3-bromo cassette connects H3K4me readout to CyP33 and HDAC-mediated repression. Cell 141, 1183-1194. Wysocka, J., Swigut, T., Milne, T.A., Dou, Y., Zhang, X., Burlingame, A.L., Roeder, R.G., Brivanlou, A.H., and Allis, C.D. (2005). WDR5 associates with histone H3 methylated at K4 and is essential for H3 K4 methylation and vertebrate development. Cell 121, 859-872. 110 Yong, J., Kasim, M., Bachorik, J.L., Wan, L., and Dreyfuss, G. (2010). Gemin5 delivers snRNA precursors to the SMN complex for snRNP biogenesis. Mol Cell 38, 551-562. Zhao, J., Ohsumi, T.K., Kung, J.T., Ogawa, Y., Grau, D.J., Sarma, K., Song, J.J., Kingston, R.E., Borowsky, M., and Lee, J.T. (2010). Genome-wide identification of polycomb-associated RNAs by RIP-seq. Mol Cell 40, 939953. Zofall, M., Yamanaka, S., Reyes-Turcu, F.E., Zhang, K., Rubin, C., and Grewal, S.I. (2012). RNA elimination machinery targeting meiotic mRNAs promotes facultative heterochromatin formation. Science 335, 96-100. 111 CHAPTER 4 Concluding remarks 112 4.1 Conclusion The examination of the noncoding “dark matter” of the genome has revealed multiple surprises, most notably, the presence of long noncoding RNA transcripts. Even more surprising, whereas the small noncoding miRNA class simply silences target mRNAs, lncRNAs do not share a single isolated mechanism. Most lncRNAs function through protein interactions, although a few notable exceptions primarily modulate DNA, RNA, or miRNA directly (Rinn and Chang, 2012). lncRNAs can recruit proteins in cis or in trans, modulate protein catalytic activity, or act as decoys to prevent proper protein interaction with their targets (Wang and Chang, 2011). In this manner, lncRNAs have been shown to regulate the spatial and/or functional properties of their corresponding binding proteins. Because many lncRNAs have been found to be important in epigenetic silencing (Rinn and Chang, 2012), we wanted to characterize a lncRNA involved in the opposing epigenetic activation. To this end, we cloned a novel lncRNA, HOTTIP, which is transcribed from the distal end of the HoxA locus. HOTTIP is important in maintaining H3K4 trimethylation and gene expression of the distal HoxA genes – a 40 kilobase region that forms a highly interacting DNA cluster in three dimensional space, as seen by high throughput chromosome conformation capture. Overexpression of HOTTIP does not induce upregulation of distal HoxA gene expression in fibroblasts that do not normally express HOTTIP; thus, we believe that HOTTIP acts in cis, similar to the previously well characterized model mechanism of Xist lncRNA in X chromosome inactivation (Morey and Avner, 2011). HOTTIP binds specifically to WDR5, a component of the MLL methyltransferase complex. Thus, in our initial model, HOTTIP RNA, once transcribed, recruits WDR5 and the MLL complex to the nearby distal HoxA locus, thus creating a localized domain of H3K4 trimethylation to maintain gene expression for positional identity. Since then, our understanding of the HOTTIP-WDR5 function and interaction has greatly increased, revealing that HOTTIP and potential HOTTIP-like lncRNAs may play a larger role than previously appreciated. Using an overexpression system in cells not expressing HOTTIP, exogenous HOTTIP is able to prevent exogenous 113 WDR5 degradation by cycloheximide chase. HOTTIP-mediated stabilization allows increased WDR5 deposition at a specific locus. This HOTTIP-mediated stabilization occurs after ubiquitination, likely by increasing the energy threshold required to unfold WDR5 for proteasomal degradation. To better characterize the physical interaction between HOTTIP and WDR5, we conducted mutagenesis assays to examine both the RNA and protein binding sites. First, by deletion mapping, we found that HOTTIP bases 1953-2453 bind most strongly to WDR5, and that RNA-mediated stability requires that the HOTTIP bases 1953-2453 be fused to an arbitrary long RNA sequence. To find the corresponding binding site on WDR5, we conducted a small scale alanine-scanning mutagenesis screen, and isolated the RNA binding site to the WDR5 protein cleft between blades 5 and 6, encompassing the same binding surface region as the RbBP5 peptide 371-381. Surprisingly, HOTTIP RNA and the RbBP5 peptide 371-381 compete for the same binding pocket in purified component binding assays, further confirming this binding site. One mutation, WDR5 F266A, causes reduced binding to HOTTIP, but does not have any affects on RbBP5 binding. This unique attribute allowed us to separate the effects of lncRNA binding on WDR5 from the protein-mediated effects. As would be expected, WDR5 F266A is unable to be stabilized by HOTTIP, and is ineffective in activating luciferase gene expression. To then test the functional consequences of inability to bind lncRNAs, we then utilized mouse embryonic stem cells, which have been previously shown to be reliant on WDR5 to maintain stem cell identity. By shRNA knockdown of endogenous WDR5 with inducible expression of either WDR5 WT or WDR5 F266A, we could ascertain the importance of lncRNA binding to WDR5 on cell function. Compared with WDR5 WT, WDR5 F266A cells are unable to maintain H3K4me3, show more rapid WDR5 turnover, and accumulate less in the nucleus and chromatin. Furthermore, embryonic stem cells with WDR5 F266A are defective in self renewal, as seen by colony morphology, alkaline phosphatase staining, and gene expression. Thus, binding to lncRNAs via the HOTTIP binding pocket is crucial for WDR5 function in ES cells. 114 Altogether, this body of work reveals a novel function of an activating lncRNA. HOTTIP, the 3.76 kilobase lncRNA transcribed from the distal tip of the HoxA locus, binds WDR5, and also thermodynamically stabilizes the protein to prevent proteasomal degradation. This lncRNA-mediated protein stabilization is crucial for WDR5 and MLL methyltransferase function, revealing a novel ability for lncRNAs to act as “molecular switches.” 115 4.2 Future Directions However, more questions remain regarding both the mechanism and functional role of HOTTIP and HOTTIP-like lncRNAs. First and foremost, the true identity of the prominent HOTTIP RNA isoform still remains a mystery. The full length 3.76 kilobase HOTTIP cDNA was RACE cloned, and found to be both spliced and polyadenylated. Yet, because HOTTIP is present at only ~0.3 copies per cell, Northern analysis was unsuccessful. The exact sequence of the predominant, functional form of HOTTIP may provide useful in understanding its endogenous role in distal fibroblasts and maintaining distal identity. One possible method to address the difficulties in working with low copy transcripts would be to enrich for the transcript using complementary base pairing technology, as previously described (Mercer et al., 2012). This may then allow sufficient enrichment to determine exact isoforms and abundance by Northern blot. As another problem posed by the copy number, if HOTTIP is present at only ~0.3 copies per cell, how can it find, interact, and stabilize ~20,000 copies of WDR5 per cell (Schwanhausser et al., 2011)? A low copy RNA both lowers the likelihood of random interactions with a protein, and ability to thermodynamically stabilize multiple copies of that protein. One possibility is that HOTTIP simply acts in a localized domain or few domains (eg. primarily distal HoxA locus), and regulates WDR5 stability there, similar to the previously described Pc2-binding lncRNAs (Yang et al., 2011). Thus, to maintain WDR5 across the genome, an entire class of HOTTIP-like lncRNAs must interact with WDR5, similar to the recently described PRC2-binding lncRNAs (Guil et al., 2012; Khalil et al., 2009; Zhao et al., 2010). Another possibility is that HOTTIP stabilizes a small population of WDR5 (eg. nascent WDR5 protein). Enzymatic recruitment may also be involved. In this manner, perhaps temporary lncRNA-mediated stabilization is sufficient to allow other mechanisms of stabilization, such as MLL complex assembly. From our mutagenesis work characterizing the RNA-protein interaction, we surprisingly found that HOTTIP RNA and RbBP5 protein compete for the same binding surface on WDR5. Because binding to RbBP5 is required for full catalytic 116 activity, HOTTIP binding to WDR5 appears likely to be inhibitory. How can the low copy HOTTIP (0.3 copies per cell), compete with RbBP5 (7,000 copies per cell (Schwanhausser et al., 2011)) for binding with WDR5 in the first place? Because RbBP5 is found in multiple complexes, the amount of free RbBP5 may be far less in abundance. As another explanation, HOTTIP may act for temporal control, to regulate WDR5 stability until RbBP5 can bind and replace it. Indeed, RNAs and proteins appear to commonly compete for the same binding site, as can be seen in the example of the Cyp33-MLL1 interaction (Hom et al., 2010). Further investigation of the effects of protein-RNA competition on enzyme kinetics and localization may prove fruitful in linking biochemical observations with cellular functions. Ultimately, the HOTTIP RNA presents both a novel example for lncRNAmediated epigenetic activation, as well as for lncRNA-mediated protein stability. Yet, cells that do not express HOTTIP RNA (eg. mouse ESCs, lung fibroblasts) still maintain WDR5 expression and protein levels. Furthermore, in mouse ESCs, the F266A mutation renders WDR5 unstable and ineffective to maintain the ESC self renewal state, in a manner very similar to WDR5 knockdown. Thus, we conclude that other endogenous HOTTIP-like RNAs likely exist in every cell type, comprising a novel class of lncRNAs involved in WDR5 stabilization and gene activation. Discovery and characterization of the various HOTTIP-like lncRNAs may prove insightful in understanding human health and disease processes. 117 4.3 References Guil, S., Soler, M., Portela, A., Carrere, J., Fonalleras, E., Gomez, A., Villanueva, A., and Esteller, M. (2012). Intronic RNAs mediate EZH2 regulation of epigenetic targets. Nat Struct Mol Biol. Hom, R.A., Chang, P.Y., Roy, S., Musselman, C.A., Glass, K.C., Selezneva, A.I., Gozani, O., Ismagilov, R.F., Cleary, M.L., and Kutateladze, T.G. (2010). Molecular mechanism of MLL PHD3 and RNA recognition by the Cyp33 RRM domain. J Mol Biol 400, 145-154. Khalil, A.M., Guttman, M., Huarte, M., Garber, M., Raj, A., Rivea Morales, D., Thomas, K., Presser, A., Bernstein, B.E., van Oudenaarden, A., et al. (2009). Many human large intergenic noncoding RNAs associate with chromatinmodifying complexes and affect gene expression. Proc Natl Acad Sci U S A 106, 11667-11672. Mercer, T.R., Gerhardt, D.J., Dinger, M.E., Crawford, J., Trapnell, C., Jeddeloh, J.A., Mattick, J.S., and Rinn, J.L. (2012). Targeted RNA sequencing reveals the deep complexity of the human transcriptome. Nat Biotechnol 30, 99-104. Morey, C., and Avner, P. (2011). The demoiselle of X-inactivation: 50 years old and as trendy and mesmerising as ever. PLoS Genet 7, e1002212. Rinn, J.L., and Chang, H.Y. (2012). Genome Regulation by Long Noncoding RNAs. Annu Rev Biochem 81, 145-166. Schwanhausser, B., Busse, D., Li, N., Dittmar, G., Schuchhardt, J., Wolf, J., Chen, W., and Selbach, M. (2011). Global quantification of mammalian gene expression control. Nature 473, 337-342. Wang, K.C., and Chang, H.Y. (2011). Molecular mechanisms of long noncoding RNAs. Mol Cell 43, 904-914. Yang, L., Lin, C., Liu, W., Zhang, J., Ohgi, K.A., Grinstein, J.D., Dorrestein, P.C., and Rosenfeld, M.G. (2011). ncRNA- and Pc2 methylation-dependent gene relocation between nuclear structures mediates gene activation programs. Cell 147, 773-788. Zhao, J., Ohsumi, T.K., Kung, J.T., Ogawa, Y., Grau, D.J., Sarma, K., Song, J.J., Kingston, R.E., Borowsky, M., and Lee, J.T. (2010). Genome-wide identification of polycomb-associated RNAs by RIP-seq. Mol Cell 40, 939-953.
© Copyright 2026 Paperzz