Plant Cell Advance Publication. Published on February 13, 2017, doi:10.1105/tpc.16.00718 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 RESEARCH ARTICLE 43 INTRODUCTION 44 Exogenous Transposable Elements Circumvent Identity-Based Silencing Permitting the Dissection of Expression-Dependent Silencing Dalen Fultz1,2 and R. Keith Slotkin1,3,4 1 Department of Molecular Genetics, The Ohio State University Molecular, Cellular and Developmental Biology Graduate Program, The Ohio State University 3 Center for RNA Biology, The Ohio State University 4 Corresponding Author: [email protected] 2 Short Title: Silencing Exogenous Transposable Elements One-sentence summary: Transformation of endogenous and exogenous transposable elements into Arabidopsis distinguishes expression-dependent and identity-based mechanisms responsible for the establishment, progression, and heritability of silencing. The author responsible for distribution of materials integral to the findings presented in this article in accordance with the policy described in the Instructions for Authors (www.plantcell.org) is: R. Keith Slotkin ([email protected]). ABSTRACT The propagation of epigenetic marks has received a great deal of attention, yet the initiation of epigenetic silencing of a new transgene, virus, or transposable element (TE) remains enigmatic. The overlapping and simultaneous function of multiple silencing mechanisms has obscured this area of investigation. Here, we have revealed two broad mechanisms that can initiate silencing independently: identity-based and expression-dependent silencing. We found that identity-based silencing is targeted by 21-22 or 24 nucleotide small interfering RNAs (siRNAs) generated from previously silenced regions of the genome. By transforming exogenous TEs into Arabidopsis thaliana, we circumvented identity-based silencing allowing us to isolate and investigate the molecular mechanism of expression-dependent silencing. We found that several siRNA-generating mechanisms all trigger de novo expression-dependent RNAdirected DNA methylation (RdDM) through RNA Polymerase V. In addition, while full-length TEs quickly progress beyond RdDM to heterochromatin formation and the final maintenance methylation state, TE fragments stall at the RdDM phase. Lastly, we found that transformation into a mutant genotype followed by introgression into wild type does not result in the same level of silencing as direct transformation into wild type. This demonstrates that the plant genotype during a narrow window of time during TE insertion (or transgene transformation) is key for establishing the transgenerational extent of epigenetic silencing. Transgenes, viruses, and transposable elements (TEs) undergo three distinct phases of 45 silencing in plants: initiation, establishment, and maintenance (Bond and Baulcombe, 2015). 46 These pathways are likely linked in a linear progression leading to long-term epigenetic silencing 47 (Panda and Slotkin, 2013); however, how each phase is activated is unclear. The maintenance 48 phase responsible for epigenetic inheritance has been extensively investigated in plants and ©2017 American Society of Plant Biologists. All Rights Reserved 1 49 shown to rely on the replication of symmetrical DNA methylation patterns across cell divisions 50 (Kato et al., 2003; Feng et al., 2010). In addition, DNA methylation and histone 3 lysine 9 51 dimethylation (H3K9me2) reinforce each other to generate a transcriptionally silent 52 heterochromatic state (Du et al., 2012; Stroud et al., 2014). Although the mechanism(s) of 53 silencing initiation and establishment have been investigated on regions of the genome such as 54 endogenous TEs (Nuthikattu et al., 2013), pre-existing DNA methylation, and H3K9me2 directed 55 by maintenance mechanisms confound this analysis. Therefore, the mechanism of de novo 56 triggering (initiation) and placement (establishment) of chromatin modifications has remained 57 enigmatic. 58 Establishment of repressive chromatin modifications (including promoter DNA methylation 59 and H3K9me2) is known to occur through several distinct small RNA-directed DNA methylation 60 (RdDM) pathways in Arabidopsis (reviewed in Cuerda-Gil and Slotkin, 2016). A hallmark of 61 RdDM is the equal targeting of all cytosine contexts (CG, CHG, CHH, where H=A, T, or C), 62 whereas maintenance methylation occurs more efficiently at the symmetric CG context than 63 asymmetric CHH sites (Lister et al., 2008; Zemach et al., 2013). RdDM can be subdivided into 64 several distinct pathways. The canonical ‘Pol IV-RdDM’ pathway acts through the transcription 65 of regions of the genome already associated with H3K9me2 by RNA Polymerase IV (Pol IV) 66 (Law et al., 2013). Pol IV transcription, coupled with the activity of its binding partner, RDR2, 67 produces double-stranded RNA substrates that are degraded by the Dicer protein DCL3 into 24 68 nucleotide (nt) small interfering RNAs (siRNAs)(reviewed in Matzke and Mosher, 2014). These 69 siRNAs are incorporated into either ARGONAUTE 4 (AGO4) or AGO6 and provide targeting 70 specificity to downstream components of the RdDM pathway. Pol V generates nascent RNAs 71 from methylated DNA regions to act as scaffolding transcripts, tethering the siRNA-bound AGO4 72 or AGO6 to this region of chromatin by an RNA:RNA interaction (Wierzbicki et al., 2009), and 73 triggering DNA methylation through DRM1/2 (Cao et al., 2003). Pol IV-RdDM occurs 74 independently of Pol II expression and has been theorized to function as an identity sensor, 75 using the 24 nt siRNAs produced from silenced TEs to target similar sequences across the 76 genome for RdDM (Fultz et al., 2015). 77 The field of RdDM has been recently transformed by the identification of non-canonical 78 (non-Pol IV) mechanisms that are dependent on Pol II (reviewed in Cuerda-Gil and Slotkin, 79 2016). For example, RDR6-RdDM functions on transcriptionally active TEs through the 80 production of 21-22 nt siRNAs directly generated from the degradation of TE mRNAs (Nuthikattu 2 81 et al., 2013; Panda et al., 2016). These 21-22 nt siRNAs guide RdDM through their incorporation 82 into AGO6 (McCue et al., 2015). In addition to TEs, the non-protein coding single copy TAS loci 83 are targets of this pathway in wild-type (wt) TE-silenced plants (Wu et al., 2012; Panda et al., 84 2016). Pol II mRNAs can also be directly degraded by DCL3 into 24 nt siRNAs, which target 85 RdDM in a pathway termed DCL3-RdDM (Panda et al., 2016). Additionally, Dicer-independent 86 small RNAs have been shown to function in RdDM (Ye et al., 2016; Yang et al., 2015). These 87 three ‘non-canonical’ RdDM pathways have only limited roles in wt plants that lack TE activity, 88 but play a considerable role in silencing transcriptionally reactivated TEs (Panda et al., 2016). 89 Consequently, each of these non-canonical RdDM pathways has been hypothesized to function 90 in the original establishment of TE silencing. 91 To focus solely on the initiation and establishment phases of silencing, and separate 92 these mechanisms from the maintenance of epigenetic marks, investigation has concentrated on 93 transformation or infection of epigenetically naïve (chromatin modification-free) DNA into plants 94 (Greenberg et al., 2011; Bond and Baulcombe, 2015; Jackel et al., 2016). Although transgenesis 95 might be considered an artificial approach to investigate silencing, evidence suggests that 96 transgenesis can occur naturally (Kyndt et al., 2015). In addition, TEs undergo widespread 97 horizontal transfer into new genomes (Bartolomé et al., 2009; Walsh et al., 2013) akin to 98 transgenesis. For example, the P element TE was horizontally transferred from Drosophila 99 willistoni to D. melanogaster in the early 20th century after >40 million years of divergence 100 between the two species (Daniels et al., 1990). Plant genomes display evidence of widespread 101 and frequent horizontal transfer of TEs (El Baidouri et al., 2014). TE-transgenes have been 102 previously generated as tools for gene tagging and mutant generation (Raizada et al., 2001; Li 103 et al., 2013). TEs transgenes can be selected for activity, yet without selection for activity 104 evidence suggests that they are recognized by host-defense mechanisms and silenced by 105 RdDM (Hirochika et al., 2000; Pérez-Hormaeche et al., 2008). Like TEs, silencing of newly 106 integrated transgenes can occur through homology to the existing genome (Matzke et al., 2000, 107 1994), yet the detailed mechanism(s) of silencing establishment at the newly integrated TEs or 108 transgenes is currently unknown. 109 To dissect the mechanisms of establishment of TE silencing and model the horizontal 110 transfer of a TE, we used floral dip transformation to stably integrate epigenetically naïve TEs 111 into wt and mutant Arabidopsis genomes. We found that a naïve endogenous TE is subject to 112 two distinct de novo silencing mechanisms: identity-based and expression-dependent silencing. 3 113 We found that identity-based silencing, which can function independent of mRNA production, is 114 targeted through 21-22 or 24 nt siRNAs produced from previously transcriptionally or post- 115 transcriptionally silenced regions of the genome. We were able to circumvent identity-based 116 silencing by introducing exogenous TEs into the Arabidopsis genome. Our system has therefore 117 uniquely permitted the molecular characterization of the de novo establishment of TE chromatin 118 modification by expression-dependent silencing without the obstruction of the identity-based 119 pathway. 120 121 RESULTS 122 Endogenous TEs are the target of distinct identity-based and expression-dependent silencing 123 mechanisms 124 To investigate how epigenetically naïve TEs undergo de novo establishment of DNA 125 methylation, we created a TE-transgene and inserted it into the Arabidopsis genome. The 126 transgene contains a 395 base pair (bp) fragment of the Arabidopsis Athila6 long terminal repeat 127 (LTR), which acts as the transcriptional control region for this TE family (Nuthikattu et al., 2013). 128 Endogenous Arabidopsis Athila family TEs are transcriptionally silenced gypsy retrotransposons 129 that produce abundant Pol IV-dependent 24 nt siRNAs (Nuthikattu et al., 2013). We placed an 130 estradiol-inducible promoter upstream of the transgene LTR fragment (Figure 1A), providing the 131 ability to control Pol II expression of the entire LTR (Figure 1B)(inducible-Athila LTR transgene = 132 iAthila LTR). Upon induction of wt iAthila T1 plants, LTR biological replicate average mRNA 133 levels increase 225-fold compared to the mock-induced control biological replicates (Figure 1B). 134 Since the iAthila LTR includes the native Athila6 transcriptional start site (TSS)(Figure 1A), low 135 but detectible levels of transgene mRNA accumulate from the native Athila TSS in wt Col without 136 estradiol induction (Figure 1C), but not from upstream of the native TSS (Figure 1B). This 137 demonstrates that the Athila LTR promoter within this transgene is both functional and weakly 138 active. To assay silencing, we determined the DNA methylation level of this TE-transgene via 139 bisulfite sequencing and found an intermediate level of DNA methylation on the iAthila LTR 140 transgene in wt Col when it was not induced. This methylation increased in all sequence 141 contexts when expression was induced in wt (Figure 1D). In the ago6 mutant, steady-state 142 mRNA levels were higher from both the native Athila TSS and the induced promoter TSS 143 (Figure 1B-C). However, the expression-dependent increase in DNA methylation found in wt Col 144 did not occur in ago6 mutants (Figure 1D), recapitulating previous results demonstrating AGO6’s 4 145 role in RdDM of expressed TEs (McCue et al., 2015). This analysis demonstrates that TE- 146 transgenes are targeted by an expression-dependent RdDM mechanism in a similar fashion as 147 endogenous TEs (Nuthikattu et al., 2013; McCue et al., 2015) and viruses (Herr et al., 2005). 148 However, this transgene cannot be used to distinguish expression-dependent from expression- 149 independent silencing mechanisms due to its identity to silenced endogenous TEs and the 150 constant activity of its native Athila promoter (Figure 1C). 151 To separate the expression-dependent and independent mechanisms functioning on the 152 iAthila LTR transgene, we created an Athila LTR transgene without the inducible promoter 153 (Figure 1A). This transgene was methylated (Figure 1E) with the sequence hallmarks of RdDM 154 (CHH methylation greater than or equal to CG methylation, see Supplemental Figure 1), and 155 matches abundant 24 nt siRNAs generated from silenced endogenous Athila TEs (Figure 1G). 156 As a control, we cloned the proximal promoter of the constitutively expressed gene AXS2 into 157 the same transgene vector (Figure 1A). When transformed into wt, this region does not 158 accumulate DNA methylation (Figure 1E), demonstrating that we are not investigating general 159 transgene silencing, but rather the methylation of the Athila LTR transgene is specific to its 160 identification as a TE. To assay the role of expression, we created an Athila LTR transgene with 161 a 138 bp deletion of the native Athila TSS and upstream TATA box (Athila LTR Δpro, Figure 1A). 162 This eliminated expression from the Athila LTR region of the transgene (Figure 1F), yet this 163 deletion does not remove the targeting potential of 24 nt siRNAs produced from endogenous 164 Athila TEs (Figure 1G). The level of DNA methylation was significantly reduced (t-test, p<0.05) 165 but not absent, with 10.3% total cytosine methylation still targeted to the Athila LTR Δpro (Figure 166 1E, Supplemental Figure 2A). Thus, for a newly introduced TE integration event that is identical 167 to endogenous TEs, expression-dependent RdDM does not work alone, and there must an 168 expression-independent identity-based pathway that targets DNA methylation even when the 169 transgene lacks a core promoter and expression. 170 171 172 Mechanism of identity-based silencing When transformation into Arabidopsis, a naïve FWA transgene with a TE in its promoter 173 is de novo silenced by the activity of the Pol IV-RdDM pathway (Chan et al., 2004; Greenberg et 174 al., 2011; Kinoshita et al., 2007; Cao and Jacobsen, 2002). To determine and differentiate how 175 the Pol IV-RdDM pathway functions in the silencing of TE transgenes, we transformed the Athila 176 LTR and Athila LTR Δpro transgenes directly into the pol IV (nrpd1a) mutant background. Plants 5 177 with mutated pol IV produce very few siRNAs that match the Athila LTR (Figure 1G), yet the 178 integrated Athila LTR transgene is targeted for DNA methylation with the hallmarks of RdDM 179 (CHH≥CG)(Figure 1E), confirming that Pol IV-RdDM is not solely responsible for the 180 establishment of DNA methylation on an expressed TE (reviewed in Cuerda-Gil and Slotkin, 181 2016). In contrast, the unexpressed Athila LTR Δpro transgene does not accumulate abundant 182 DNA methylation (2.1% CHH)(Figure 1E), demonstrating that expression is required when Pol 183 IV-RdDM fails. Without either expression (the Δpro transgene) or sequence-complementary 184 siRNAs provided by the endogenous silenced TEs (in the pol IV mutant), the TE transgene is not 185 targeted for abundant DNA methylation. Therefore, the Pol IV-RdDM pathway is sufficient to 186 target an expressed or unexpressed naïve TE or transgene based on sequence identity via the 187 abundant Pol IV-dependent 24 nt siRNAs that Athila and other endogenous silenced TEs 188 produce. We refer to this expression-independent pathway as ‘identity-based’ silencing. 189 We next aimed to determine whether 21-22 nt siRNAs could function in identity-based 190 silencing in a similar fashion to Pol IV-dependent 24 nt siRNAs. The endogenous TRANS- 191 ACTING SIRNA 3 A (TAS3A) locus is transcribed by Pol II into an mRNA that is targeted for 192 post-transcriptional degradation triggered by microRNA cleavage and subsequent RDR6- 193 dependent 21-22 nt tasiRNA production (Allen et al., 2005). Since TAS3 is an endogenous 194 target of RDR6-RdDM (Wu et al., 2012; McCue et al., 2015)(Supplemental Figure 1C-D), we 195 sought to determine whether the 21-22 nt siRNAs generated from this region could target RDR6- 196 RdDM in trans to a separate naïve region of the genome and function in expression-independent 197 identity-based silencing. We created a promoterless transgene of the TAS3A region that is 198 internal to the two microRNA target sites which initiate tasiRNA production (Figure 1A)(Axtell et 199 al., 2006). We found that our TAS3 transgene is not expressed (Figure 1H) but is targeted for 200 DNA methylation (Figure 1E). This methylation pattern shows the sequence hallmarks of RdDM 201 (CHH≥CG), demonstrating that the 21-22 nt tasiRNAs can function in trans on a newly 202 introduced transgene that was not expressed. We conclude that identity-based RdDM can 203 function in trans through 21-22 nt siRNAs in addition to 24 nt siRNAs. Therefore, as long as 204 siRNAs are produced, transcriptionally active and silenced regions of the endogenous genome 205 can both trigger identity-based silencing of a newly integrated TE/transgene. 206 207 Exogenous TEs circumvent identity-based RdDM and are targeted solely by expression- 208 dependent mechanisms 6 209 To separate the activities of identity-based RdDM from expression-dependent RdDM, we 210 used the exogenous Tto1 copia LTR retrotransposon from tobacco (Nicotiana tabacum – 211 separated from Arabidopsis by ~118 million years). When a full-length copy of this TE was 212 transferred into the Arabidopsis genome, the element was active and fully capable of 213 transposition (Hirochika et al., 2000; Böhmdorfer et al., 2010). However, this transposition was 214 only observed after selection in tissue culture (where TE silencing is compromised (Tanurdzic et 215 al., 2008)), in a chromatin mutant, or when an inducible promoter is used to force TE expression, 216 demonstrating that there is repression or silencing of this TE in wt Arabidopsis. To determine 217 whether this silencing could be identity-based, we matched Tto1 to the Arabidopsis genome via 218 BLAST and aligned Arabidopsis small RNA deep sequencing libraries to Tto1 via bowtie (see 219 Methods and Supplemental Figure 3). We determined that no portion of the Tto1 sequence is 220 identical to the Arabidopsis genome, and that there is only a slightly higher magnitude of 221 similarity between Tto1 and the Arabidopsis genome compared to control alignments of 222 scrambled DNA (Supplemental Figure 3). The most similar region between Tto and the 223 Arabidopsis genome is 846 nt in length but only 64.7% similar, while the level of sRNA similarity 224 between Tto1 and the Arabidopsis transcriptome is the same as matching to a scrambled 225 sequence of the same length (Supplemental Figure 3). Therefore, our analysis demonstrates 226 that Arabidopsis has only limited DNA sequence and sRNA identity to Tto1. 227 We generated a Tto1 transgene representing the full 574 bp LTR, which includes the Tto1 228 promoter and TSS (Hirochika, 1993; Takeda et al., 1999)(Figure 2A). When transformed into wt 229 Col, this ‘solo Tto LTR’ transgene produces mRNA (Figure 2B) and is targeted for DNA 230 methylation with the sequence context hallmarks of RdDM (CHH≥CG) over multiple generations 231 (Figure 2C-D). To ensure that the solo Tto LTR transgene is targeted for RdDM because of its 232 identification as a TE, and that we are not investigating general transgene silencing, we 233 performed three controls. First, we demonstrated that transformation of an endogenous 234 Arabidopsis genic promoter does not result in DNA methylation (Figure 1E). Second, to ensure 235 that the solo Tto LTR is not targeted for RdDM due to its detection by RNA quality control 236 mechanisms, we added an efficient terminator sequence (creating the transgene solo Tto 237 LTR+term, Figure 2A). We found that the lack of a proper termination sequence in the solo Tto 238 LTR transgene is not responsible for triggering the RdDM of this transgene (Figure 2C). Third, 239 we investigated the NOS bacterial promoter driving the herbicide resistance gene on the same 240 transgene as the solo Tto LTR and found no evidence of DNA methylation (Figure 2C), 7 241 demonstrating that not all foreign transgene or non-Arabidopsis sequences are targeted for 242 RdDM. 243 To determine if RdDM of the solo Tto LTR transgene is expression-dependent, we 244 created a transgene with a 142 bp deletion within the solo Tto LTR (solo Tto LTR Δpro)(Figure 245 2A), which disables mRNA production (Figure 2B). In contrast to the solo Tto LTR, the solo Tto 246 LTR Δpro transgene is not targeted for DNA methylation (Figure 2C, Supplemental Figure 2B). 247 This result is in contrast to the Athila LTR Δpro transgene, which is targeted for DNA methylation 248 via identity-based RdDM even when it is not expressed (Figure 1E, Supplemental Figure 2A). 249 Thus, by using an exogenous TE without preexisting identical copies in the wt Arabidopsis 250 genome, the solo Tto LTR transgene has circumvented identity-based RdDM, resulting in 251 regulation solely by expression-dependent RdDM. 252 To demonstrate that identity-based RdDM can target Tto, we tested whether a previously 253 introduced and silenced Tto element in the Arabidopsis genome could trigger methylation of a 254 new Tto integration event. We stably transformed wt Col with an inducible full-length form of Tto 255 (Böhmdorfer et al., 2010), which is targeted for silencing (iTto, see below, Figures 3,4). We then 256 re-transformed these iTto lines with the solo Tto LTR Δpro transgene from Figure 2A and found 257 that this newly introduced TE is now targeted for identity-based (expression-independent) RdDM 258 (Figure 2E). Therefore, the presence of a preexisting Tto copy in the Arabidopsis genome is 259 sufficient to target this new TE integration event for RdDM, demonstrating that Tto can be 260 subject to identity-based silencing. This correlates with the presence of abundant 24 nt siRNAs 261 from the iTto background that match the Tto LTR, which are absent in the wt Col background 262 without Tto1 (Figure 2F). The function of iTto 24 nt siRNAs in identity-based silencing strongly 263 resembles the abundant 24 nt siRNAs that match the Athila LTR in wt Col (Figure 1G), 264 demonstrating that the preexistence of matching siRNAs enables identity-based methylation of 265 unexpressed TEs. 266 Identity-based and expression-dependent RdDM can occur simultaneously on the same 267 target, producing additive methylation levels. This is observed on both the iAthila LTR and Athila 268 LTR transgenes that are targeted for identity-based silencing and display increased methylation 269 when they are expressed (Figure 1D-E). When the solo Tto LTR was transformed into the iTto 270 line (Figure 2E), increased CHG and CHH methylation occurs compared to either the solo Tto 271 LTR Δpro transgene transformed into iTto (identity-based silencing only)(Figure 2E), or the solo 272 Tto LTR transgene transformed into wt Col (expression-dependent silencing only)(Figure 2C). 8 273 This is in line with the observation that transcriptionally active endogenous TEs are the strongest 274 targets of RdDM (Panda et al., 2016), since both identity-based and expression-dependent 275 mechanisms simultaneously target them. 276 277 278 Multiple siRNA generating mechanisms redundantly target expression-dependent RdDM To identify the mechanism(s) responsible for targeting expression-dependent RdDM 279 without the confounding presence of identity-based RdDM, we directly transformed the solo Tto 280 LTR transgene into a number of mutant Arabidopsis lines. When transformed into wt Col, the 281 solo Tto LTR transgene produces a low level of mRNA as well as 21, 22, and 24 nt siRNAs 282 (Figure 3A-B, Supplemental Figure 4). We found that multiple distinct mechanisms act to 283 generate these siRNAs. We investigated the pol IV mutant to determine the role of Pol IV-RdDM. 284 Pol IV-RdDM directs some, but not all, of the solo Tto LTR RdDM (Figure 3C). This incomplete 285 dependence on Pol IV-RdDM correlated with the incomplete loss of 24 nt siRNAs in the pol IV 286 mutant (Figure 3B). However, all 24 nt siRNAs were dependent on the activity of DCL3 (Figure 287 3B). DCL3 is able to cleave Pol II transcripts into 24nt siRNAs, and this mechanism (termed 288 DCL3-RdDM) has recently been linked to the RdDM of transcriptionally active TEs (Panda et al., 289 2016). We conclude that Tto LTR 24 nt siRNAs are generated from both Pol II and Pol IV 290 transcripts processed by DCL3. Both of these mechanisms (Pol IV-RdDM and DCL3-RdDM) 291 likely contribute to solo Tto LTR RdDM. Complicating this analysis, RdDM of the solo Tto LTR 292 persists in dcl3 mutants (Figure 3C). The RdDM in dcl3 mutants is likely targeted by RDR6- 293 RdDM, as only 21-22 nt siRNAs are present in the dcl3 mutant to perform RdDM (Figure 3B). 294 When these 21-22 nt siRNAs are removed in the dcl2/3/4 triple mutant (Figure 3B), RdDM of the 295 solo Tto LTR does not occur (Figure 3C). We only detected RDR6-RdDM in the dcl3 mutant and 296 not in wt Col (Figure 3C), suggesting that for this transgene, RDR6-RdDM acts as a secondary 297 pathway which functions only when other pathways are disabled. RDR6 does function post- 298 transcriptionally in wt Col to degrade the solo Tto LTR transcripts into 21-22 nt siRNAs, as in the 299 rdr6 mutant the mRNA level is increased (Figure 3A) and 21-22 nt siRNAs are reduced (Figure 300 3B). Lastly, in contrast to recent reports that suggested Dicer-independent siRNAs are 301 responsible for the initiation of TE silencing (Ye et al., 2016; Yang et al., 2015), our data 302 demonstrate that (multiple) Dicer-dependent mechanisms are necessary for TE RdDM, as 303 transformation into the dcl2/3/4 triple mutant resulted in a loss of all siRNAs (Figure 3B) and 304 RdDM (Figure 3C). In summary, Pol IV-RdDM, DCL3-RdDM, and RDR6-RdDM (but not Dicer9 305 independent RdDM) all function in an overlapping manner to direct the methylation of the newly 306 inserted solo Tto LTR transgene in T1 plants. 307 In contrast to the redundant mechanisms responsible for siRNA production, we found that 308 Pol V is responsible for all expression-dependent RdDM (Figure 3C). Using pol V (nrpe1) T1 309 seedlings, we confirmed that Pol V is necessary for DNA methylation from as early as we can 310 assay transgene germinal integration (Figure 3D). The Pol V transcript is likely targeted by both 311 AGO6 and AGO4, as ago6 mutants only display a mild (not statistically significant) reduction in 312 Tto LTR methylation (Figure 3C). Similar to the pol V mutant, the drm1/2 double mutant does not 313 target RdDM to the solo Tto LTR (Figure 3D). Therefore, we conclude that multiple siRNA 314 generating mechanisms redundantly target one downstream RdDM chromatin-bound complex 315 requiring Pol V and DRM1/2. Pol V has been shown to be recruited to methylated DNA and 316 RdDM loci by the SUVH2 and SUVH9 proteins (Johnson et al., 2014). Surprisingly, in a suvh2/9 317 double mutant, solo Tto LTR RdDM occurs at levels similar to wt Col (Figure 3E). This suggests 318 the existence of a yet-to-be-defined SUVH2/9-independent mechanism responsible for recruiting 319 Pol V. 320 We also tested the role of primary (independent of RNA-dependent RNA polymerase 321 activity) small RNAs in targeting solo Tto LTR RdDM, since these small RNAs are thought to 322 initiate the first stages of TE transcript recognition, RDR6 activity and RdDM (Creasey et al., 323 2014). We detected no loss of solo Tto LTR RdDM in ago1, ago2, or dcl1 mutants (Figure 3E), 324 although the ago1 and dcl1 mutations do not result in the complete loss of protein function due 325 to their requirement in embryonic development (Nodine and Bartel, 2010). In addition, we tested 326 AtRRP6L1, a trimmer protein necessary for Dicer-independent small RNA processing (Ye et al., 327 2016). Similar to ago1, ago2 and dcl1, RdDM persists at near wt levels in the atrrp6l1 mutant, 328 suggesting that either primary small RNAs are not driving the recognition of the solo Tto LTR for 329 initial silencing or that redundant mechanisms exist. 330 331 332 Full Tto elements transition to maintenance methylation and heterochromatin formation When a TE invades a new genome by horizontal transfer, evidence suggests that a full- 333 length element is transferred, not simply an LTR-only fragment (solo LTR)(El Baidouri et al., 334 2014; Pace et al., 2008). Although the solo Tto LTR provides a tool to investigate expression- 335 dependent RdDM, it does not recapitulate the silencing dynamics of a full element. In addition, 336 recent data have determined that TE structure plays a key role in targeting specific silencing 10 337 mechanisms (Panda et al., 2016). We therefore aimed to determine how a full-length exogenous 338 TE is silenced using two versions of a full (or nearly full) length Tto1 element. The ‘full Tto’ 339 transgene includes both complete LTRs and the entire Tto1 internal coding sequence (CDS) 340 (Figure 4A), as in the native N. tabacum. Two short introns were added in the CDS as tools to 341 measure RNA processing and retrotransposon-induced cDNA formation (Böhmdorfer et al., 342 2010). Similar to the solo Tto LTR transgene, which shares the same LTR promoter, the full Tto 343 transgene is expressed when transformed into wt Col (Figure 4B). In contrast to the solo Tto 344 LTR, which produces both 21-22 and 24nt siRNAs (Figure 3B and 4C), the full Tto element 345 produces low levels of only 24 nt siRNAs in the inflorescence tissue of T1 plants (Figure 4C, 346 Supplemental Figure 5). A second transgene, iTto, was previously constructed using a full Tto1 347 element with added CDS introns and a truncated 5’ LTR (Böhmdorfer et al., 2010). This element 348 is not able to produce its own coding transcripts, but rather is under the control of an inducible 349 promoter and has a transcriptional terminator (Figure 4A). We confirmed that this transgene 350 does not accumulate abundant mRNA transcripts without induction (Figure 4B). Like the full Tto 351 transgene, the iTto transgene only generates 24 nt LTR siRNAs in T1 inflorescence tissue, 352 however the levels are higher regardless of induction of expression (Figure 4C, Supplemental 353 Figure 5). 354 To examine the DNA methylation targeted to these transgenes we performed bisulfite 355 sequencing. As a control, we used an endogenous Arabidopsis gene, RAS-RELATED 356 NUCLEAR PROTEIN1 (RAN1), cloned into the same inducible vector backbone as iTto (iRAN1 357 (Nuthikattu et al., 2013)). The low level of CHH DNA methylation of iRAN1 (4.7%, Figure 4D) 358 demonstrates that the Tto transgenes are being targeted for silencing due to their TE structure / 359 sequence and not due to the process of transformation. Compared to the DNA methylation of 360 the solo Tto LTR transgene, there is higher methylation of the LTR sequences of both the full 361 Tto and iTto transgenes (Figure 4D). In contrast to the RdDM pattern of solo Tto LTR 362 methylation, the CHH sequence context methylation is low compared to CG for three out of the 363 four full-element LTRs, indicating a maintenance methylation state (Figure 4E)(Supplemental 364 Figure 1). Methylation of the Tto internal CDS region is consistently lower than methylation at 365 the LTRs (Figure 4D), suggesting that the CDS does not cause element recognition and that the 366 LTRs drive silencing. This correlates with published data demonstrating that the ends of 367 endogenous long TEs are preferentially targeted by RdDM mechanisms (Zemach et al., 2013; 368 Böhmdorfer et al., 2016). 11 369 Since we detect maintenance methylation patterns in the T1 generation transformed with 370 full-length elements, we aimed to verify that full-length Tto transgene silencing was not 371 dependent on pre-existing identity to the Arabidopsis genome. First, we tested and confirmed 372 that there is only slight sequence similarity between the Arabidopsis genome or small RNAs and 373 the full Tto1 sequence beyond what is expected by chance (Supplemental Figure 3B,D). Second, 374 we created the iTf1 transgene using a truncated fission yeast (Schizosaccharomyces pombe – 375 separated from Arabidopsis by ~1624 million years) Tf1 LTR retrotransposon placed into the 376 same inducible vector backbone as iTto. We found that the Tf1 TE has no substantial sequence 377 identity to the Arabidopsis genome compared to what is expected by chance (Supplemental 378 Figure 3D), and the top BLAST hit is 87 bp with 71% similarity. The iTf1 transgene displays the 379 same silencing characteristics as full-length Tto elements: higher methylation levels of the LTRs 380 compared to the internal CDS, and a CG≥CHH methylation pattern of the LTRs (Figure 4F). 381 Therefore, the maintenance methylation pattern of full-length Tto elements is not due to identity- 382 based silencing depending on existing similarity to the Arabidopsis genome, but rather a general 383 feature of the silencing of full-length exogenous TEs. 384 To test if heterochromatin formation is occurring on full-length elements, we measured 385 the heterochromatic mark H3K9me2 of the solo Tto LTR and iTto transgenes. The solo Tto LTR 386 element accumulates very little H3K9me2, while iTto accumulates substantial enrichment 387 throughout the element (Figure 4G), suggesting that the structure of a full-length TE is 388 necessary for abundant heterochromatin formation. To test whether the presence of the internal 389 CDS is a critical structural feature triggering maintenance methylation, we created a Tto 390 transgene that has the full 5’ LTR and full CDS, but lacks the 3’ LTR (Figure 4H). This transgene 391 is targeted by RdDM, but fails to activate maintenance methylation (CHH≥CG, Figure 4E,H), 392 similar to the solo Tto LTR transgene. Therefore, the CDS is not critical in the silencing of the 393 full-length elements, and we conclude that either the full TE structure or the presence of two 394 LTRs is critical for the cell to target maintenance methylation and heterochromatin formation. 395 Intrigued by the possibility of the unique regulation of full-length retrotransposons 396 compared to their fragmented derivatives, we searched the Arabidopsis genome for endogenous 397 TE subfamilies with both full-length elements and solo LTRs. Of the 121 TE subfamilies 398 identified with full-length elements, 45 contained at least one solo LTR copy. Of these 45, we 399 found 8 that show the same pattern as the exogenous Tto: the solo LTRs are regulated by 400 RdDM (reduced methylation in pol V mutants compared to wt) and full-length elements are 12 401 regulated by maintenance methylation and not RdDM. Overall, solo LTRs were significantly 402 more likely to be regulated by RdDM compared to full-length elements (t-test, p<0.001)(Figure 403 4I). 404 405 TE detection by RdDM is required for subsequent maintenance methylation and 406 heterochromatin formation 407 Like an endogenous full-length TE, iTto is targeted by maintenance methylation and 408 heterochromatin formation (Figure 4D, E, G). It is therefore an excellent model for identity- 409 independent silencing that progresses to the full maintenance methylation and heterochromatin 410 formation stage. To investigate this mechanism, we transformed iTto directly into pol IV and pol 411 V mutants. Upon induction of expression and accumulation of iTto mRNAs (Figure 5A), iTto 412 produces transposition intermediates (Figure 5B). We used the engineered introns (which are 413 spliced from retrotransposon-induced cDNA) to determine the level of transposition 414 intermediates and found significantly higher levels in pol V mutants (Figure 5B). This result was 415 independently verified by identification of circularized cDNA retrotransposition intermediates in 416 the pol V mutant (Supplemental Figure 6)(Hirochika and Otsuki, 1995), demonstrating that iTto 417 progresses further into the retrotransposition mechanism in pol V mutants. 418 When induced in wt Col, iTto undergoes a slight (1.33-fold) increase in 21-22 nt siRNAs 419 (365 to 486 RPM, Figure 5C,D) and expression-dependent methylation which is observed as 420 increased levels of CHH methylation (Figure 5E-F). This increase in 21-22 nt siRNAs is only 421 detectable by deep sequencing, while not detectible by RNA gel blot (Figure 5C, Supplemental 422 Figure 7). The increase in transposition intermediates in pol V mutants correlates with an 423 increase in 21-22 nt siRNAs produced throughout the iTto element (Figure 5C-D, Supplemental 424 Figure 7) and low methylation levels comparable to the baseline level of the control iRAN1 425 transgene (Figure 5E-F). The low level of methylation in pol V mutants is recapitulated with the 426 iTf1 full-length TE (Figure 5G). This low methylation suggests that full-length exogenous TEs are 427 not recognized as TEs for RdDM without a functional Pol V chromatin-modifying complex. Like 428 pol V mutants, drm1/2 double mutants do not accumulate iTto methylation (Figure 5E-F), 429 confirming that RdDM is required to initiate the silencing of iTto. Therefore, the iTto and iTf1 430 maintenance methylation pattern (CG>CHH) observed in the wt Col T1 inflorescence (Figure 431 5E-G) required the activity of RdDM, even though the RdDM pattern (CHH≥CG) is no longer 432 observed in this T1 inflorescence tissue. 13 433 To determine if iTto methylation can be established in pol V mutants after multiple 434 generations, we self-fertilized the T1 iTto plants and investigated the T2 generation. We 435 propagated both expression-induced and mock-induced plants, however, the induced plants 436 were not re-induced in the T2 generation and did not accumulate iTto mRNAs in the T2 437 generation (Figure 5H). The iTto transgene remains unrecognized as a TE in the T2 of pol V 438 mutants, in contrast to the continued methylation observed in wt Col lines (Figure 5I). To test the 439 formation of heterochromatin (H3K9me2) in the T2 generation, we performed ChIP and found 440 that iTto accumulates H3K9me2 over the entire element in wt Col, although induced expression 441 of iTto in the previous generation results in reduced levels (Figure 5J). In contrast, pol V mutants 442 fail to accumulate H3K9me2 (Figure 5J). Since both iTto DNA methylation levels and H3K9me2 443 are at background levels in pol V (Figure 5I-J), we conclude that pol V is fully responsible for the 444 triggering of silencing on exogenous TE chromatin. In addition, H3K9me2 does not accumulate 445 in drm1/2 double mutants (Figure 5J), proving that the establishment of TE DNA methylation via 446 RdDM is necessary before the establishment of H3K9me2. Since iTto and iTf1 have a 447 maintenance methylation pattern (CG>CHH) in T1 wt Col inflorescence tissue (Figure 5E-G), our 448 results suggest that full-length elements are dependent on RdDM to reach the ultimate silencing 449 state of maintenance methylation and heterochromatin formation. Even though we do not 450 observe RdDM methylation patterns for iTto or iTf1 after they have been transformed into wt Col, 451 these transgenes were likely expressed, identified as a TE, and subjected to RdDM at a 452 developmental time point prior to our analysis. 453 Pol IV is not required to establish silencing or chromatin modification at exogenous full- 454 length TEs, but only to transition to a maintenance methylation phase of silencing. We find that 455 pol IV mutants are able to perform RdDM on the iTto and iTf1 transgenes, demonstrated by the 456 intermediate levels of DNA methylation with the signature of RdDM (CHH≥CG) in this mutant 457 (Figure 5E-G). While the TEs are recognized in pol IV mutants, maintenance methylation fails to 458 activate and establish high CG levels compared to CHH (as in wt Col). Even in the T2 459 generation, iTto remains only partially CG methylated in pol IV mutants compared to wt Col 460 (Figure 5I). In pol IV mutants, H3K9me2 is only detected if iTto was expressed in the previous 461 generation (Figure 5J), demonstrating that Pol IV is not absolutely required to establish 462 H3K9me2, but there are two distinct routes for H3K9me2 deposition: 1) a Pol IV mechanism that 463 is expression-independent and 2) a Pol IV-independent mechanism that requires expression. 464 14 465 466 The genotype of TE insertion/transformation is critical for de novo silencing Since the iTto transgene fails to correctly proceed to the maintenance methylation phase 467 of silencing when integrated into a pol IV genome, we aimed to determine if reintroduction of a 468 functional Pol IV copy would result in this progression of silencing. We crossed mock-induced 469 iTto T1 pol IV plants to wt Col, and then self-fertilized the T2 progeny to produce T3 individuals, 470 which were genotyped and selected for Pol IV/Pol IV wt plants (blue labels, Figure 6A). We 471 found that T3 wt plants in which iTto had been integrated into pol IV mutants had equal levels of 472 DNA methylation in all sequence contexts (CHH≥CG), and thus were never able to transition 473 from RdDM to maintenance methylation (Figure 6B-C). We biologically replicated the T1 cross 474 with a distinct iTto insertion event (Figure 6B, replicate B) and in addition performed exact 475 reciprocal crosses with a third unique insertion site (Figure 6B, replicate C). We consistently 476 found a lack of maintenance methylation across replicates. Therefore, when a functional copy of 477 Pol IV is crossed to the pol IV iTto line, Pol IV does not act on iTto. This is possibly due to the 478 lack of H3K9me2 (the heterochromatic signal that recruits Pol IV (Law et al., 2013)) in pol IV 479 mock-induced plants (Figure 5J). As a control, we crossed wt Col mock-induced iTto T1 plants 480 to pol IV mutants and then self-fertilized the progeny (orange labels, Figure 6A). This results in 481 T3 plants that have the same genotype, but in which the transgene was inserted into a wt rather 482 than mutant genome. These plants displayed a maintenance methylation pattern 483 (CG>CHH)(Figure 6B-C), similar to plants constantly in the wt genotype (Figure 5E, F, I). We 484 also selected pol IV / pol IV homozygous mutants from the T3 generation (purple labels, Figure 485 6A) and surprisingly these plants also demonstrated a maintenance methylation pattern 486 (CG>CHH) similar to wt plants (Figure 6B-C). This indicates that after maintenance methylation 487 occurs in the wt T1 plants (Figure 5E-F), Pol IV is no longer required for full maintenance-level 488 methylation. Pol IV’s function, therefore, is to transition the silencing from the RdDM phase to 489 the maintenance phase in the T1 generation. We conclude that a plant’s current wt or pol IV 490 genotype is not as important as the genotype that was initially transformed. 491 To determine if the lack of H3K9me2 deposition is restored upon addition of a functional 492 Pol IV copy into pol IV mutant lines, we performed ChIP of the T3 wt/wt plants from the pol IV 493 mutant cross (blue label, replicate A, Figure 6A-B). As a control, we compared these plants to 494 the self-fertilized T2 and T3 generation of iTto directly integrated into wt Col or pol IV from Figure 495 5. We found that only low levels of H3K9me2 accumulate if iTto was inserted into a pol IV 15 496 mutant genotype, irrespective of the crossing scheme or current genotype of the T3 plants 497 (Figure 6D). 498 We repeated our experimental design with iTto integrated into pol V mutants (Figure 6A), 499 which do not show iTto methylation in the T1 or T2 generation (Figure 5E,F,I), and found that 500 methylation appears in wt Pol V/Pol V T3 plants (blue labels, Figure 6B). This indicates that, 501 when a functional copy of Pol V is crossed into the T2 generation, Pol V acts on iTto without any 502 previous DNA or H3K9me2 marks. However, the methylation of iTto in these plants remains in 503 the RdDM phase (CHH≥CG) and is unable to progress to maintenance methylation (Figure 6B- 504 C), similar to the results of the pol IV cross. As a control, we performed the control cross with 505 transgene insertion directly into the wt genotype (orange labels, Figure 6A). Again, we found that 506 if transformed into a wt genome, the silencing progresses to the maintenance methylation phase 507 (CG>CHH) regardless of crossing scheme (Figure 6B-C). Similar to our findings with Pol IV, our 508 Pol V data demonstrate that the genotype of genome insertion / T1 is critical for transitioning to 509 maintenance methylation, and when transformed into a pol IV or pol V mutant, the maintenance 510 methylation phase is never obtained. These results confirm that the genotype of element 511 insertion (or at least soon after this event in the T1 generation), and not the crossing scheme or 512 current genotype, is the critical factor responsible for dictating the level of maintenance 513 methylation and heterochromatin formation in the T3 generation. 514 515 DISCUSSION 516 Separation of Identity-Based Silencing from Expression-Dependent Silencing 517 Multiple silencing mechanisms target active endogenous TEs to generate high levels of 518 DNA methylation (Panda et al., 2016). Through the analysis of exogenous TEs integrated into 519 the Arabidopsis genome, we have functionally distinguished two broad mechanisms of silencing 520 initiation. Plant transformation is an imperfect system, as multiple extra-chromosomal copies of 521 all transgenes are likely present within the cell at the time of genome insertion. However, this 522 process will be the same for all the transgenes used in this study, and the comparison of TE 523 transgenes to genic control transgenes has allowed us to dissect not the process of 524 transformation itself, but rather specific TE recognition mechanisms. Identity-based silencing 525 requires small RNAs produced from one transcriptionally or post-transcriptionally silenced TE (or 526 other region of the genome) to match a second incoming TE or transgene. Identity-based 527 silencing may form a cellular ‘immune system’ that is able to store a history of the TEs that the 16 528 plant has previously silenced and use this repository to silence a transposing or horizontally 529 transferred TE with sequence similarity. This mechanism may be the reason why Pol IV 530 transcription and 24 nt siRNA production continues from silenced TEs in most cells (Lu et al., 531 2005), even for TEs that do not require RdDM reinforcement of their silenced state (Zemach et 532 al., 2013; Li et al., 2015; McCue et al., 2015). Identity-based silencing is able to establish DNA 533 methylation that is subject to heterochromatin formation and maintenance methylation. This 534 mechanism is likely more efficient at targeting regions for methylation compared to expression- 535 dependent RdDM, particularly in the CG sequence context (compare Figure 2C to 2E). 536 The mechanism of identity-based silencing broadly resembles the homology-dependent 537 phenomena known to affect non-TE transgenes when they become silenced (Matzke et al., 538 2002, 2000). Identity-based silencing has been utilized as a tool to target silencing of an 539 endogenous gene using a transgene or viral DNA trigger (Jakowitsch et al., 1999; Ruiz et al., 540 1998). However, it is unclear what level of homology (sequence similarity) is required to target 541 this mechanism. The amount of sequence similarity required for identity-based silencing likely 542 follows similar dynamics to the targeting of endogenous mRNAs by siRNAs or microRNAs (Liu 543 et al., 2014). Because we are certain that identical sequences target this mechanism, but we do 544 not know the level of sequence similarity required, we favor describing this pathway as ‘identity- 545 based’ rather than ‘homology-based’ silencing. In contrast to the trans-acting nature of identity- 546 based silencing, expression-dependent silencing occurs in cis – the same region that produces 547 the siRNAs is the target of RdDM, so the siRNA sequence is likely identical to its target 548 transcript. Our investigation determined that both 24 nt and 21-22 nt siRNAs can separately 549 function in identity-based silencing, providing a robust small RNA defense against new copies of 550 DNA that are similar to previously transcriptionally or post-transcriptionally silenced regions of 551 the genome. 552 553 554 Mechanism of Expression-Dependent Silencing Both identity-based and expression-dependent mechanisms function on endogenous TEs, 555 confounding the interpretation of their silencing dynamics. The solo Tto LTR transgene is useful 556 because 1) it is only targeted by expression-dependent RdDM, and 2) it never progresses 557 beyond the RdDM phase, remaining targeted by expression-dependent RdDM over multiple 558 generations. We used the solo Tto LTR transgene to probe the mechanisms expression- 559 dependent silencing and found that AGO2, RDR6-generated 24 nt siRNAs, and high TE copy 17 560 numbers were not required to initiate TE silencing, refuting previous reports (Hirochika et al., 561 2000; Pontier et al., 2012; Marí-Ordóñez et al., 2013). Instead, we identified multiple upstream 562 small RNA-generating mechanisms (including Pol IV-RdDM, DCL3-RdDM, and RDR6-RdDM) 563 that all redundantly function to produce the siRNAs necessary for RdDM, while one downstream 564 silencing complex dependent on Pol V is necessary for all RdDM. The dependency on a single 565 chromatin-bound Pol V complex, yet multiple small RNA biogenesis pathways, is recapitulated in 566 the initiation of viral silencing (Bond and Baulcombe, 2015), which also likely functions through 567 expression-dependent RdDM. Interestingly, expression-dependent RdDM of the solo Tto LTR 568 transgene did not require the SUVH2/9 proteins, which are known to mediate the recruitment of 569 Pol V (Johnson et al., 2014). This suggests that an independent mechanism exists to recruit Pol 570 V to newly inserted or expressed TEs or transgenes. In line with the theory of an alternate route 571 of Pol V recruitment, the addition of a functional Pol V copy into pol V mutant line restores iTto 572 RdDM, suggesting that Pol V is recruited to an unmethylated TE. The only other known 573 recruitment mechanism of Pol V is through double-strand DNA breaks (but Pol V was shown not 574 to function in RdDM at these sites (Wei et al., 2012)). The double-strand break recruitment of 575 Pol V remains an intriguing possibility, particularly for TEs and transgenes that insert into the 576 genome via DNA breaks. 577 578 Progression of Silencing 579 We found that RdDM is necessary for the initiation of silencing on TE fragments and full- 580 length TEs, while only full-length elements (and not solo LTRs or LTR+CDS elements with only 581 one LTR) progress beyond RdDM to heterochromatin formation and a maintenance methylation 582 state within the T1 generation. Our data suggest that full-length TEs are regulated differently 583 than their derived fragments, and that solo LTRs are more likely to be regulated by RdDM 584 genome-wide. Full-length TEs may progress via expression-dependent silencing more efficiently 585 due to the presence of two LTRs (Figure 4), and the ability of siRNAs produced from either LTR 586 (including the 3’LTR that is partially transcribed in the retrotransposon coding transcript) to target 587 the 5’LTR promoter. 588 Transformation of the full-length iTto into mutants demonstrated that both Pol IV and Pol 589 V are required in the T0 or T1 generation to progress beyond RdDM to the maintenance 590 methylation and heterochromatin formation phase. In pol V mutants, RdDM does not occur and 591 the TE cannot accumulate DNA or H3K9 methylation. We find that the role of Pol IV on iTto is to 18 592 transition from the establishment phase via RdDM to the end-point maintenance methylation 593 phase. This suggests that maintenance methylation originally triggered by expression- 594 dependent RdDM occurs through the intermediate step of Pol IV-RdDM, forming a linear 595 ‘initiation-establishment-reinforcement-maintenance’ progression of TE silencing (Figure 7). 596 597 Genotype of Transformation 598 Surprisingly, upon outcrossing a mutant T1 generation and thereby complementing the 599 T2 generation with a functional Pol IV or Pol V, the silencing of the integrated TE was not fully 600 restored to wt Col levels. When inserted into pol IV mutants and outcrossed, the TE DNA 601 methylation remained at intermediate levels established by RdDM, and only minimal levels of 602 H3K9me2 were induced. This demonstrates that the initial signal that recruits Pol IV to a new TE 603 insertion must be transient and limited to a narrow window of time during insertion or within the 604 T1 generation. This signal is likely H3K9me2 itself, as without the formation of H3K9me2 at the 605 TE in the T1 generation of pol IV mutants, functional Pol IV is not recruited to the TE in later 606 generations. In contrast, when iTto was inserted into pol V mutants, no methylation was 607 detected, however an intermediate level returns when a functional Pol V copy is introduced. This 608 intermediate level of methylation has the sequence context hallmarks of RdDM (CHH≥CG), 609 suggesting that Pol V is successfully recruited back to the inserted TE (unlike Pol IV) and 610 functions there to target RdDM. As in the pol IV cross, the silencing cannot progress to the 611 maintenance methylation phase. This is likely due to the lack of H3K9me2 in pol V mutants that 612 again prohibits Pol IV recruitment during a critical window, stalling the activation of maintenance 613 methylation and heterochromatin formation. In addition to the analysis performed in Figure 6 on 614 iTto un-induced for expression, we have not yet determined how inducing expression in the T1 615 generation (or later) affects these trans-generational silencing dynamics. 616 Our genotype-of-transformation finding has implications for crop improvement. In many 617 crops, a specialized genotype is used for transformation, followed by introgression of the 618 transgene into an elite performance line. Our finding suggests that the genotype of first insertion 619 is critical for the establishment of trans-generational silencing. Transformation directly into a 620 silencing mutant (such as pol IV or pol V) followed by segregation away of these mutant alleles 621 during introgression into an elite performance line may be a route to avoid the complete 622 silencing of transgenes. 623 19 624 METHODS 625 Plant Materials 626 Plants were grown in long‐day (18 h cool white (4100K) fluorescent light at 120 μmol m-2 s-1 627 provided by Sylvania Octron Eco bulbs) conditions at 23°C. See Supplemental Table 1 for a list 628 of mutant alleles used and their sources. Biological replicates were grown side-by-side. Plants 629 were stably (germinal) transformed via Agrobacterium floral dip and T1 events selected on 630 appropriate antibiotics. Selection on antibiotics was continued each generation. Unless 631 otherwise noted, all experiments use inflorescence tissue (stage 1 -12 unopened flower buds). 632 633 634 Transgene Generation and Expression The iTto and iRAN transgenes have been previously described (Böhmdorfer et al., 2010; 635 Nuthikattu et al., 2013). For other transgenes, TE sequences were amplified from the 636 Arabidopsis genome (iAthila LTR, Athila LTR, At1g08200 pro, TAS3A internal), the iTto plasmid 637 (solo Tto LTR), or the pHL414 (Levin et al., 1993) plasmid (iTf1) using Phusion High-Fidelity 638 DNA Polymerase (NEB) and primers from Supplemental Table 2. The iAthila LTR and iTf1 TE 639 sequences were cloned into pMDC7 (Curtis and Grossniklaus, 2003) at the restriction sites AscI 640 and PacI using In-Fusion HD cloning (Clontech), removing the ccdB region. The remaining 641 amplicons were cloned into pBGW (Karimi et al., 2002) at the restriction sites XbaI and SpeI 642 using In-Fusion HD cloning (Clontech), removing the ccdB region. The promoter deletion 643 constructs (Athila LTR Δpro, Tto LTR Δpro) were generated by performing inverse PCR on their 644 respective parent constructs with primers upstream of the TATA box and downstream of the 645 TSS (primers in Supplemental Table 2). These amplicons were then blunted and self-ligated 646 using Quick Blunting Kit (NEB) and T4 DNA ligase (NEB). This method was also used to delete 647 the neomycin resistance from the Tf1 sequence derived from pHL414 before cloning the iTf1 648 construct. To generate the Tto LTR+term construct, the 35S terminator from pB2GW7 was 649 amplified using Phusion polymerase and cloned into the PstI site of the solo Tto LTR construct 650 using In-Fusion cloning (Clontech). To generate the full Tto construct, iTto was digested with 651 Bpu10I (NEB) and the resulting TE portion was gel extracted and purified. The solo Tto LTR 652 construct was linearized with Bpu10I and treated with Shrimp Alkaline Phosphatase (NEB). 653 These two products were then ligated together using T4 DNA ligase, and colonies were 654 screened for the appropriate direction of ligation. See Supplemental Table 2 for all primers used 20 655 in cloning. Induction of expression-inducible transgenes was performed as in (Nuthikattu et al., 656 2013). 657 658 Reverse Transcription-quantitative PCR (RT-qPCR) 659 Tissue from multiple sibling individuals was pooled for RNA isolations. At least three distinct 660 biological replicate pools were performed for each genotype. Each replicate consisted of a non- 661 overlapping pool from at least 6 individuals. RT-qPCR was performed and analyzed as 662 described in (Nuthikattu et al., 2013). RT-qPCR primers are listed in Supplemental Table 2. 663 664 Transposition Assay 665 An exon-exon spanning primer (see Supplemental Table 2) was used to amplify DNA directly 666 extracted from the indicated plants in Figure 5B, as spliced DNA is only generated from 667 retrotransposon-induced cDNA formation. 668 669 Bisulfite Sequencing 670 Tissue from at least six T1 individuals per each transgene construct / genotype / treatment 671 combination were pooled for each experiment. DNA extraction was performed on the pooled 672 tissue and bisulfite sequencing was performed and analyzed as in (McCue et al., 2015), with the 673 exception that PCR amplification of the bisulfite converted DNA was performed using MyTaq 674 DNA polymerase (Bioline). See Supplemental Table 2 for all primers used in bisulfite PCR. 675 Bisulfite primers are anchored to non-TE sequences of the chromosomal transgene, and 676 therefore do not detect the extra-chromosomal retrotransposon cDNA generated in Figure 5B or 677 potential transposed copies. PCR products were cloned using the TOPO-TA pCR4 vector 678 (Invitrogen). At least 6 clones were sequenced per sample. 95% Wilson confidence intervals and 679 t-tests were calculated as in (McCue et al., 2015). 680 681 Chromatin Immunoprecipitation 682 Chromatin was isolated from pooled sibling plants of the same genotype. Chromatin 683 Immunoprecipitation was performed as in (McCue et al., 2015). Each ChIP and mock-ChIP was 684 performed on three distinct and non-overlapping biological replicate samples per 685 genotype/treatment. ChIP used the H3K9me2 antibody (Abcam #ab1220). qPCR was performed 686 with primers shown in Supplemental Table 2. Three technical replicate qPCR amplifications 21 687 were performed for each of the biological replicates. The data reported represents the average 688 and standard error between distinct biological replicate pools. 689 690 Small RNA Analysis 691 RNA was extracted from pooled sibling plants of the same genotype. For RNA gel blots, RNA 692 preparation, gel electrophoresis, blotting, and probe production were performed as in (Nuthikattu 693 et al., 2013), with the exception that nine micrograms of small RNA-enriched RNA were loaded 694 in each lane. See Supplemental Table 2 for all primers used for probe production. 695 For the small RNA-seq in Figure 5, small RNA preparation and sequencing were 696 performed as in (McCue et al., 2015). The number of reads used for reads per million 697 normalization was determined by filtering each library for size (18–28 nt), exact matches to the 698 Arabidopsis genome (TAIR10) plus any transgene present in the line, and the removal of tRNA / 699 rRNA reads. Sequencing tracks were generated for iTto allowing up to 3 mismatches using 700 Bowtie (Langmead et al., 2009) and small RNA reads that where only filtered for size (18-28 nt) 701 and not filtered for genome-matching or tRNA/rRNA. For small RNA sequencing in Figure 2F, 702 the wt Col (no transgene) library from GSM2102963 and the wt Col iTto mock-induced library 703 (GSM2308720, described above) were aligned to the respective LTRs using Bowtie and 704 allowing for 3 mismatches. 705 706 Genomic Informatics Analysis 707 To compare full-length TEs versus solo LTRs for dependence on RdDM, we selected TEs with 708 two LTRs and an intact coding sequence (full-length TEs), or containing only a single LTR with 709 no nearby TE coding sequence (solo LTRs). We averaged the total percent cytosine methylation 710 of these TEs by subfamily in both wt Col and pol V (Panda et al., 2016). We took the difference 711 of these averages (wt Col – pol V percent methylation) and performed a two-tailed Student’s t- 712 test between all full-length TEs and all solo LTRs. 713 714 Accession Numbers 715 716 717 718 719 720 RAN1 – AT5G20010; AXS2 – AT1G08200; TAS3A – AT3G17185; AGO6 – AT2G32940; POL IV (NRPD1) – AT1G63020; DCL3 – AT3G43920; POL V (NRPE1) – AT2G40030; RDR6 – AT3G49500; DCL2 – AT3G03300; DCL4 – AT5G20320; DRM1 – AT5G15380; DRM2 – AT5G14620; SUVH2 – AT2G33290; SUVH9 – AT4G13460; AGO1 – AT1G48410; DCL1 – AT1G01040; AGO2 – AT1G31280; RRP6L1 – AT1G54440 22 721 Small RNA-sequencing raw data can be accessed at NCBI GEO as GSE86787 using the 722 following GSM numbers: 723 wt Col inflorescence (non-transgenic) – GSM2308719 724 wt Col inflorescence with T1 iTto, mock induced – GSM2308720 725 wt Col inflorescence with T1 iTto, expression induced – GSM2308721 726 pol IV inflorescence with T1 iTto, expression induced – GSM2308722 727 pol V inflorescence with T1 iTto, expression induced – GSM2308723 728 729 Supplemental Data 730 Supplemental Figure 1 - CHH/CG methylation vs. total methylation scatter plots accurately 731 define the methylation type observed from bisulfite sequencing data (supports Figures 1-2). 732 Supplemental Figure 2 – Bisulfite cytosine methylation results linearly displayed on the Athila 733 LTR and solo Tto LTR sequences, with and without the Δpro deletion (supports Figures 1-2). 734 Supplemental Figure 3 - The Tto1 / Tf1 sequences are not identical to the Arabidopsis genome 735 (Supports Figures 2-4). 736 Supplemental Figure 4 - Full scanned images of the RNA gel blots supporting Figure 3B. 737 Supplemental Figure 5 - Full scanned images of the RNA gel blots supporting Figure 4C. 738 Supplemental Figure 6 - iTto retrotransposon-induced cDNA forms in pol V mutants (supports 739 Figure 5). 740 Supplemental Figure 7 - Full scanned images of the RNA gel blots supporting Figure 5C. 741 Supplemental Table 1 - Mutant alleles used this report 742 Supplemental Table 2 – Primer and probe sequences used this report 743 Supplemental Methods 744 745 ACKNOWLEDGEMENTS 746 The authors thank Andrea McCue, Eric Roose, Josquin Daron, Kaushik Panda, and Shobhana 747 Rajasenan for their data contributions. We thank Mikel Zaratiegui for the Tf1 plasmid, Andreas 748 Bachmair for the iTto plasmid, and Qikun Liu & Steve Jacobsen for the suvh2/9 mutant line. D.F. 749 is supported by the Presidential and University Fellowships and the Cellular, Molecular and 750 Biochemical Sciences T32 Training Program (NIH-GM086252) at The Ohio State University. 751 This work is supported by National Science Foundation grants MCB-1252370 and MCB- 752 1608392 to R.K.S. 23 753 754 AUTHOR CONTRIBUTIONS 755 D.F and R.K.S. designed the research. D.F. performed research. D.F and R.K.S. analyzed data. 756 D.F and R.K.S. wrote the paper. 757 758 759 Figure Legends 760 761 762 763 764 765 766 767 768 769 770 771 772 773 774 775 776 777 778 779 780 781 782 783 Figure 1. Transgenes of endogenous TEs are targeted by identity-based and expressiondependent silencing. A. Transgenes generated and transformed into Arabidopsis. Black arrows indicate transcriptional start sites (TSS), green are PCR primers used in RT-qPCR, and orange are PCR primers used for bisulfite sequencing of DNA methylation levels. B. RT-qPCR detecting mRNA accumulation from the iAthila LTR transgene 5’ of the native Athila TSS in either mock (not) induced or transcription-induced plants (primer set “B” in part A). For all RTqPCR, bar height represents the average calculated relative expression (compared to the constitutively expressed UDP-D-APIOSE/UDP-D-XYLOSE SYNTHASE 2 (AXS2)) from biological replicate samples of non-overlapping pools of sibling plants with the same genotype / treatment. Error bars represent the standard error of the mean between biological replicate samples. C. RT-qPCR of the iAthila LTR transgene 3’ of the native Athila TSS (primer set “C” in part A). For all RT-qPCR, primers only amplify the transgene copy and not the endogenous Athila elements, which are not activated by estradiol-induction (Nuthikattu et al., 2013). D. Bisulfite sequencing of DNA methylation levels of the mock or transcription-induced iAthila LTR transgene in T1 plants. H=A,T or C. For all bisulfite sequencing, bar height represents the average methylation level, and bars represent 95% Wilson score confidence intervals, which display the sequencing depth. For all bisulfite PCR, primers only amplify the transgene copy and not the endogenous Athila elements. E. Bisulfite sequencing of DNA methylation levels of the transgenes depicted in part A in the T1 generation. The TAS3 internal CHG methylation has a large confidence interval due to the low number (1) of CHG context cytosines in this PCR product, while the number of CHH sites is high (38) and is used for all data comparisons. F. RTqPCR of the Athila LTR and Athila LTR Δpro T1 transgenes. G. Endogenous small RNAs from inflorescence tissue without the transgene aligned to the indicated transgene. H. RT-qPCR of the endogenous TAS3a locus and the TAS3 internal region transgene. 784 785 786 787 Figure 2. The exogenous Tto TE circumvents identity-based silencing A. Transgenes generated and transformed into Arabidopsis. Arrows are the same as described in Figure 1. B. RT-qPCR detecting steady-state mRNA accumulation in T1 inflorescence tissue of plants with the indicated transgene. C. Bisulfite sequencing of T1 DNA methylation levels of the indicated 24 788 789 790 791 792 793 794 795 transgenes. The NOS promoter (pro) region is from the same transgene and integration events as the solo Tto LTR T1. D. Categorization of methylation type (no methylation, no CHH, RdDM, or maintenance methylation) based on the CHH/CG methylation context ratio (see Supplemental Figure 1). Results from panels C and E are plotted. E. Bisulfite sequencing of T1 DNA methylation levels of the indicated transgenes transformed into a wt Col line carrying the silenced iTto TE transgene. F. Small RNA deep sequencing reads that match the indicated LTR sequence. The small RNA library used is indicated above each bar, and the LTR used for alignment is labeled below. 796 797 798 799 800 801 802 803 804 Figure 3. Mechanism of expression-dependent RdDM. A. RT-qPCR of mutant T1 plants transformed with the solo Tto LTR transgene. B. Small RNA gel blot of the individuals depicted in part A. For all RNA gel blot assays, lanes are generated from pooled sibling plants. 21 vs 22 nt small RNAs are not distinguishable in this assay. Control Athila LTR siRNAs are absent in dcl3 and pol IV mutants, yet only reduced in the dcl2/3/4 triple mutant due to the activation of the Dicer-independent pathway (Ye et al., 2016). C. Bisulfite sequencing of T1 DNA methylation levels of plants depicted in part A in inflorescence, or in T1 seedlings or juvenile leaves (D). E. Bisulfite sequencing of T1 DNA methylation levels in inflorescence tissue of the solo Tto LTR transgene transformed into various single and double mutant combinations. 805 806 807 808 809 810 811 812 813 814 815 816 817 818 819 820 821 822 823 Figure 4. Full-length transposable elements are targeted for maintenance methylation and heterochromatin formation. A. Transgenes generated and transformed into Arabidopsis. Arrows are the same as described in Figure 1. B. RT-qPCR of the Tto LTR region in T1 plants transformed with the indicated transgene. C. Small RNA gel blot of plants depicted in part B. The iRAN1 transgene serves as a negative control for plants that do not contain Tto1 sequences. D. Bisulfite sequencing of T1 DNA methylation levels of plants depicted in part B. For the full Tto and iTto transgenes, multiple regions are assayed, including both LTRs. The ‘LTRs’ lane is the combination of both 5’ and 3’ LTRs for equal comparison to the solo Tto LTR transgene. E. Categorization of methylation type based on the CHH/CG methylation context ratio. LTR results from panel D and H are plotted. F. Bisulfite sequencing of T1 DNA methylation levels of plants carrying the mock-induced iTf1 fission yeast TE-transgene. G. H3K9me2 ChIP of plants carrying either the solo Tto LTR or iTto transgene. For all ChIP experiments, three biological replicate samples were isolated from non-overlapping pools of sibling plants with the same genotype. ChIP qPCR fold-enrichment is normalized to the constitutively expressed gene AXS2. Bar height represents the average between biological replicate samples, and the error bars represent the standard error of the mean between biological replicates. N/A indicates that the indicated region does not exist in that transgene. H. Bisulfite sequencing of T1 DNA methylation levels of plants carrying the Tto LTR:CDS transgene. I. Comparison of total DNA methylation levels of endogenous LTR retrotransposons in the Arabidopsis genome in wt Col 25 824 825 826 827 and pol V mutants. Each dot represents the average methylation of a single TE family for either full-length copies (open circles) or solo LTR copies (red dots). Text labels indicate solo LTRs and full-length elements that are regulated by RdDM (decrease of at least 4% methylation in pol V compared to wt). 828 829 830 831 832 833 834 Figure 5. Silencing of exogenous TEs requires RdDM to progress to maintenance methylation. A. RT-qPCR of iTto steady-state mRNA levels in T1 plants of the indicated genotype. Plants were either induced for iTto expression or mock-induced. B. qPCR of spliced retrotransposon-induced cDNA formation in the same plants as part A. Relative copy number is calculated based on the amount of retrotransposon-induced cDNA compared to the single copy chromosomal gene AXS2. Bar height represents the average calculated from three biological replicate samples of non-overlapping pools of sibling plants with the same genotype / treatment. 835 836 837 838 839 840 841 842 843 844 845 846 847 848 849 850 Error bars represent the standard error of the mean between biological replicate samples. C. Small RNA gel blot of the plants from part A. D. Results of small RNA deep sequencing aligned to the iTto sequence. The Y-axes are all to the same scale, with the maximum abundance adjusted to the highest peak by genotype, as indicated. Small RNAs have been library-size normalized by calculating the reads per million of genome-matched (including the transgene sequence) 18-28 nt reads. E. Bisulfite sequencing of T1 DNA methylation levels of plants depicted in part A. F. Categorization of methylation type based on the CHH/CG methylation context ratio. LTR results from panel E (orange) and H (blue) are plotted. G. Bisulfite sequencing of T1 DNA methylation levels of plants with the iTf1 TE. H. RT-qPCR of iTto mRNA accumulation from T2 plants, some of which were produced from expression-induced T1 plants. The X-axis labels refer to the induction of the T1 parent of the assayed T2 progeny. I. Bisulfite sequencing of T2 of plants depicted in part H. In parts H-I, PCR primers only detect the transgene copy of iTto and not any potential extra-chromosomal or transposed copies. J. H3K9me2 ChIP of T2 of plants depicted in part H. 5’ and 3’ LTR PCR primers are anchored to the transgene, while the CDS primers could potentially amplify extra-chromosomal or transposed iTto copies. 851 852 853 854 855 856 857 858 Figure 6. The genotype of direct transformation dictates the final level of transgene silencing. A. Experimental crossing scheme. The right side indicates control crosses. Colors denote data labels in parts B-C. B. Bisulfite sequencing of DNA methylation levels of genotyped T3 plant lines depicted in part A. These T3 plants were generated by performing the T1 cross indicated on the X-axis followed by selfing of the T2 generation. Replicates (rep A-C) denote different transgene insertion events used in the crossing scheme. Brackets under plant lines represent reciprocal crosses performed with the same T1 individual. C. Categorization of methylation type based on the CHH/CG methylation context ratio using the results of panel B. D. 26 859 860 861 H3K9me2 ChIP of plants with the iTto transgene directly transformed into the current genotype (wt Col T2, wt Col T3, pol IV T2) compared to plants in which the transgene was transformed into pol IV and introgressed wt Col (replicate A from part B). 862 863 864 865 866 867 868 869 Figure 7. The Initiation-Establishment-Reinforcement-Maintenance model of TE silencing. Incoming unique TEs must be expressed, go through post-transcriptional silencing via RNAi (initiation), and establishment of DNA methylation though either the RDR6-RdDM or DCL3RdDM mechanism of expression-dependent RdDM (Cuerda-Gil and Slotkin, 2016). Incoming TEs that share identity to the pool of siRNAs in the existing genome short-circuit this pathway and enter directly into Pol IV-RdDM. A functional Pol IV-RdDM pathway is necessary in the transformed T0 cells or subsequent T1 generation to reinforce DNA methylation and transition the TE from RdDM to the fully silenced maintenance of DNA methylation. 870 871 References 872 873 874 875 876 877 878 879 880 881 882 883 884 885 886 887 888 889 890 891 892 893 894 895 896 897 898 899 Allen, E. et al. (2005). microRNA-directed phasing during trans-acting siRNA biogenesis in plants. Cell 121: 207–21. Axtell, M.J. et al. (2006). A two-hit trigger for siRNA biogenesis in plants. Cell 127: 565–77. Bao, W. et al. (2015). Repbase Update, a database of repetitive elements in eukaryotic genomes. Mob DNA 6: 11. Bartolomé, C. et al. (2009). Widespread evidence for horizontal transfer of transposable elements across Drosophila genomes. Genome Biol. 10: R22. Bond, D.M. and Baulcombe, D.C. (2015). Epigenetic transitions leading to heritable, RNAmediated de novo silencing in Arabidopsis thaliana. Proc. Natl. Acad. Sci. U.S.A. 112: 917–22. Böhmdorfer, G. et al. (2010). A synthetic biology approach allows inducible retrotransposition in whole plants. Syst Synthetic Biology 4: 133–138. Böhmdorfer, G. et al. (2016). Long non-coding RNA produced by RNA polymerase V determines boundaries of heterochromatin. Elife 5: e19092. Cao, X. et al. (2003). Role of the DRM and CMT3 methyltransferases in RNA-directed DNA methylation. Curr. Biol. 13: 2212–7. Cao, X. and Jacobsen, S.E. (2002). Locus-specific control of asymmetric and CpNpG methylation by the DRM and CMT3 methyltransferase genes. Proc. Natl. Acad. Sci. U.S.A. 99 Suppl 4: 16491–8. Chan, S.W. et al. (2004). RNA silencing genes control de novo DNA methylation. Science 303: 1336. Creasey, K.M. et al. (2014). miRNAs trigger widespread epigenetically activated siRNAs from transposons in Arabidopsis. Nature 508: 411–5. Cuerda-Gil, D. and Slotkin, R.K. (2016). Non-canonical RNA-directed DNA methylation. Nat Plants 2: 16163. Curtis, M.D. and Grossniklaus, U. (2003). A gateway cloning vector set for high-throughput functional analysis of genes in planta. Plant Physiol. 133: 462–9. Daniels, S.B. et al. (1990). Evidence for horizontal transmission of the P transposable element between Drosophila species. Genetics 124: 339–55. 27 900 901 902 903 904 905 906 907 908 909 910 911 912 913 914 915 916 917 918 919 920 921 922 923 924 925 926 927 928 929 930 931 932 933 934 935 936 937 938 939 940 941 942 943 944 945 946 947 Du, J. et al. (2012). Dual binding of chromomethylase domains to H3K9me2-containing nucleosomes directs DNA methylation in plants. Cell 151: 167–80. El Baidouri, M. et al. (2014). Widespread and frequent horizontal transfers of transposable elements in plants. Genome Res. 24: 831–8. Feng, S. et al. (2010). Conservation and divergence of methylation patterning in plants and animals. Proc. Natl. Acad. Sci. U.S.A. 107: 8689–94. Fultz, D. et al. (2015). Silencing of active transposable elements in plants. Curr. Opin. Plant Biol. 27: 67–76. Greenberg, M.V. et al. (2011). Identification of genes required for de novo DNA methylation in Arabidopsis. Epigenetics 6: 344–54. Herr, A. et al. (2005). RNA Polymerase IV Directs Silencing of Endogenous DNA. Science 308: 118–120. Hirochika, H. (1993). Activation of tobacco retrotransposons during tissue culture. The EMBO journal 12: 2521. Hirochika, H. et al. (2000). Silencing of retrotransposons in arabidopsis and reactivation by the ddm1 mutation. Plant Cell 12: 357–69. Hirochika, H. and Otsuki, H. (1995). Extrachromosomal circular forms of the tobacco retrotransposon Tto1. Gene 165: 229–32. Jackel, J.N. et al. (2016). Arabidopsis RNA Polymerases IV and V Are Required To Establish H3K9 Methylation, but Not Cytosine Methylation, on Geminivirus Chromatin. J. Virol. 90: 7529– 40. Jakowitsch, J. et al. (1999). Molecular and cytogenetic characterization of a transgene locus that induces silencing and methylation of homologous promoters in trans. Plant J. 17: 131–40. Johnson, L.M. et al. (2014). SRA- and SET-domain-containing proteins link RNA polymerase V occupancy to DNA methylation. Nature 507: 124–8. Karimi, M. et al. (2002). GATEWAY vectors for Agrobacterium-mediated plant transformation. Trends Plant Sci. 7: 193–5. Kato, M. et al. (2003). Role of CG and non-CG methylation in immobilization of transposons in Arabidopsis. Curr. Biol. 13: 421–6. Kinoshita, Y. et al. (2007). Control of FWA gene silencing in Arabidopsis thaliana by SINErelated direct repeats. Plant J. 49: 38–45. Kyndt, T. et al. (2015). The genome of cultivated sweet potato contains Agrobacterium T-DNAs with expressed genes: An example of a naturally transgenic food crop. Proc. Natl. Acad. Sci. U.S.A. 112: 5844–9. Langmead, B. et al. (2009). Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 10: R25. Law, J.A. et al. (2013). Polymerase IV occupancy at RNA-directed DNA methylation sites requires SHH1. Nature 498: 385–9. Levin, H.L. et al. (1993). Novel gene expression mechanism in a fission yeast retroelement: Tf1 proteins are derived from a single primary translation product. EMBO J. 12: 4885–95. Li, S. et al. (2015). Detection of Pol IV/RDR2-dependent transcripts at the genomic scale in Arabidopsis reveals features and regulation of siRNA biogenesis. Genome Res. 25: 235–45. Li, Y. et al. (2013). Gene tagging with engineered Ds elements in maize. Methods Mol. Biol. 1057: 83–99. Lister, R. et al. (2008). Highly integrated single-base resolution maps of the epigenome in Arabidopsis. Cell 133: 523–36. Liu, Q. et al. (2014). Analysis of complementarity requirements for plant microRNA targeting using a Nicotiana benthamiana quantitative transient assay. Plant Cell 26: 741–53. 28 948 949 950 951 952 953 954 955 956 957 958 959 960 961 962 963 964 965 966 967 968 969 970 971 972 973 974 975 976 977 978 979 980 981 982 983 984 985 986 987 988 989 990 991 992 993 994 995 Lu, C. et al. (2005). Elucidation of the small RNA component of the transcriptome. Science 309: 1567–9. Marí-Ordóñez, A. et al. (2013). Reconstructing de novo silencing of an active plant retrotransposon. Nat. Genet. 45: 1029–39. Matzke, A.J. et al. (1994). Homology-dependent gene silencing in transgenic plants: epistatic silencing loci contain multiple copies of methylated transgenes. Mol. Gen. Genet. 244: 219–29. Matzke, M.A. et al. (2002). Homology-dependent gene silencing and host defense in plants. Adv. Genet. 46: 235–75. Matzke, M.A. et al. (2000). Transgene silencing by the host genome defense: implications for the evolution of epigenetic control mechanisms in plants and vertebrates. Plant Mol. Biol. 43: 401–15. Matzke, M.A. and Mosher, R.A. (2014). RNA-directed DNA methylation: an epigenetic pathway of increasing complexity. Nat. Rev. Genet. 15: 394–408. McCue, A.D. et al. (2015). ARGONAUTE 6 bridges transposable element mRNA-derived siRNAs to the establishment of DNA methylation. EMBO J. 34: 20–35. Nodine, M.D. and Bartel, D.P. (2010). MicroRNAs prevent precocious gene expression and enable pattern formation during plant embryogenesis. Genes Dev. 24: 2678–92. Nuthikattu, S. et al. (2013). The initiation of epigenetic silencing of active transposable elements is triggered by RDR6 and 21-22 nucleotide small interfering RNAs. Plant Physiol. 162: 116–31. Pace, J. et al. (2008). Repeated horizontal transfer of a DNA transposon in mammals and other tetrapods. P Natl Acad Sci Usa 105: 17023–8. Panda, K. et al. (2016). Full-length autonomous transposable elements are preferentially targeted by expression-dependent forms of RNA-directed DNA methylation. Genome Biol. 17: 170. Panda, K. and Slotkin, R.K. (2013). Proposed mechanism for the initiation of transposable element silencing by the RDR6-directed DNA methylation pathway. Plant Signal Behav 8: e25206. Pontier, D. et al. (2012). NERD, a plant-specific GW protein, defines an additional RNAidependent chromatin-based pathway in Arabidopsis. Mol. Cell 48: 121–32. Pérez-Hormaeche, J. et al. (2008). Invasion of the Arabidopsis genome by the tobacco retrotransposon Tnt1 is controlled by reversible transcriptional gene silencing. Plant Physiol 147: 1264–78. Raizada, M.N. et al. (2001). Somatic and germinal mobility of the RescueMu transposon in transgenic maize. Plant Cell 13: 1587–608. Ruiz, MT et al. (1998). Initiation and maintenance of virus-induced gene silencing. Plant Cell 10: 937–46. Stroud, H. et al. (2014). Non-CG methylation patterns shape the epigenetic landscape in Arabidopsis. Nat. Struct. Mol. Biol. 21: 64–72. Takeda, S. et al. (1999). A 13-bp cis-regulatory element in the LTR promoter of the tobacco retrotransposon Tto1 is involved in responsiveness to tissue culture, wounding, methyl jasmonate and fungal elicitors. Plant J Cell Mol Biology 18: 383–93. Tanurdzic, M. et al. (2008). Epigenomic consequences of immortalized plant cell suspension culture. PLoS Biol. 6: 2880–95. Walsh, A.M. et al. (2013). Widespread horizontal transfer of retrotransposons. Proc. Natl. Acad. Sci. U.S.A. 110: 1012–6. Wei, W. et al. (2012). A role for small RNAs in DNA double-strand break repair. Cell 149: 101– 12. 29 996 997 998 999 1000 1001 1002 1003 1004 1005 1006 1007 Wierzbicki, A.T. et al. (2009). RNA polymerase V transcription guides ARGONAUTE4 to chromatin. Nat. Genet. 41: 630–4. Wu, L. et al. (2012). Roles of dicer-like and argonaute proteins in TAS-derived small interfering RNA-triggered DNA methylation. Plant Physiol. 160: 990–9. Yang, D.-L.L. et al. (2015). Dicer-independent RNA-directed DNA methylation in Arabidopsis. Cell Res 26: 66-82. Ye, R. et al. (2016). A Dicer-Independent Route for Biogenesis of siRNAs that Direct DNA Methylation in Arabidopsis. Mol. Cell 61: 222–35. Zemach, A. et al. (2013). The Arabidopsis nucleosome remodeler DDM1 allows DNA methyltransferases to access H1-containing heterochromatin. Cell 153: 193–205. 30 3x10-6 2x10-6 1x10-6 R la L Δp TR ro At hi la hi At 0.10 0.05 0.00 LT ro 0.15 ge re TA T nou gi S A s on 3 S3 Tr int a an er sg na en l e At hi la LT R Δp R LT la hi At LT la hi At 0 H 4x10-6 sRNA size (nt) 25 24 23 22 21 20 do ol tC w ol o6 ag tC ro Δp R LT al in S3 TA la rn pr 0 20 08 1g At F 0 o o6 ag tC w ag tC w ol 0 o6 0 ol 20 Induced w 40 20 Mock Induced ol o6 60 hi 40 p < 0.05 60 ag tC 80 % Methylation 80 Constructs transformed into pol IV Constructs transformed into wt Col te iAthila LTR E100 At CG CHG CHH 200 0.2 Mock Induced Induced Mock Induced Induced 87-224 D 100 % Methylation w promoter deletion Athila LTR Δpro 400 0.4 0 0 Athila6 LTR 0.6 Relative Expression Athila LTR Ttransgene sRNA matches (RPM) At At hi hil la a LT LT R R At Δp hi ro la LT R 0.5 800 wt Col pol IV sRNA sRNA En TAS3a G 600 o6 tC ol ag o6 TAS3 internal region 1.0 0.8 w At1g08200 promoter 1.5 3’ of Athila TSS ag term 1.0 Relative Expression Athila6 LTR Relative Expression C C 2.0 5’ of Athila TSS Relative Expression At1g08200 pro B Native Athila TSS Inducible TSS iAthila LTR B Inducible Promoter R Figure 1 A Figure 1. Transgenes of endogenous TEs are targeted by identity-based and expression-dependent silencing. A. Transgenes generated and transformed into Arabidopsis. Black arrows indicate transcriptional start sites (TSS), green are PCR primers used in RT-qPCR, and orange are PCR primers used for bisulfite sequencing of DNA methylation levels. B. RT-qPCR detecting mRNA accumulation from the iAthila LTR transgene 5’ of the native Athila TSS in either mock (not) induced or transcription-induced plants (primer set “B” in part A). For all RT-qPCR, bar height represents the average calculated relative expression (compared to the constitutively expressed UDP-D-APIOSE/ UDP-D-XYLOSE SYNTHASE 2 (AXS2)) from biological replicate samples of non-overlapping pools of sibling plants with the same genotype / treatment. Error bars represent the standard error of the mean between biological replicate samples. C. RT-qPCR of the iAthila LTR transgene 3’ of the native Athila TSS (primer set “C” in part A). For all RT-qPCR, primers only amplify the transgene copy and not the endogenous Athila elements, which are not activated by estradiol-induction (Nuthikattu et al., 2013). D. Bisulfite sequencing of DNA methylation levels of the mock or transcription-induced iAthila LTR transgene in T1 plants. H=A,T or C. For all bisulfite sequencing, bar height represents the average methylation level, and bars represent 95% Wilson score confidence intervals, which display the sequencing depth. For all bisulfite PCR, primers only amplify the transgene copy and not the endogenous Athila elements. E. Bisulfite sequencing of DNA methylation levels of the transgenes depicted in part A in the T1 generation. The TAS3 internal CHG methylation has a large confidence interval due to the low number (1) of CHG context cytosines in this PCR product, while the number of CHH sites is high (38) and is used for all data comparisons. F. RT-qPCR of the Athila LTR and Athila LTR Δpro T1 transgenes. G. Endogenous small RNAs from inflorescence tissue without the transgene aligned to the indicated transgene. H. RT-qPCR of the endogenous TAS3a locus and the TAS3 internal region transgene. B solo Tto LTR T2 0.6 solo Tto LTR ∆pro 0.4 solo Tto LTR in iTto no CHH 20 30 40 total methylation (%) % Methylation 40 20 50 0 ∆p LT R o Tt Tt o lo so lo so o pr O S N ∆p ro m LT R +t er Tt o LT R so 1000 i sRTto N in A w tC ol F lo Tt o lo so so lo Tt o LT R T2 T1 60 LT 10 Constructs transformed into iTto 100 80 solo Tto LTR ∆pro in iTto 0 E w sR t C N ol A solo Tto LTR+term maintenance 0.2 0.0 solo Tto LTR Transgene sRNA matches (RPM) 0.8 no methylation RdDM 0 0.00 800 600 400 sRNA size (nt) 25 24 23 22 21 20 200 0 so so lo lo Tt Tto o L LT so TR R lo ∆p Tt ro o LT R 1.0 20 R CHH/CG methylation D term 40 LT R Tto1 LTR 0.05 CG CHG CHH 60 Tt o Tto1 LTR promoter deletion solo Tto LTR+term Constructs transformed into wt Col 80 lo soloTto LTR ∆pro 83-225 0.10 so Relative Expression Tto1 LTR C 100 Tto LTR ro TSS % Methylation solo Tto LTR so so lo lo Tt Tt o o LT LT R R ∆p ro Figure 2 A Figure 2. The exogenous Tto TE circumvents identity-based silencing A. Transgenes generated and transformed into Arabidopsis. Arrows are the same as described in Figure 1. B. RT-qPCR detecting steady-state mRNA accumulation in T1 inflorescence tissue of plants with the indicated transgene. C. Bisulfite sequencing of T1 DNA methylation levels of the indicated transgenes. The NOS promoter (pro) region is from the same transgene and integration events as the solo Tto LTR T1. D. Categorization of methylation type (no methylation, no CHH, RdDM, or maintenance methylation) based on the CHH/CG methylation context ratio (see Supplemental Figure 1). Results from panels C and E are plotted. E. Bisulfite sequencing of T1 DNA methylation levels of the indicated transgenes transformed into a wt Col line carrying the silenced iTto TE transgene. F. Small RNA deep sequencing reads that match the indicated LTR sequence. The small RNA library used is indicated above each bar, and the LTR used for alignment is labeled below. E /3 /4 1 6l o2 ol tC w 1/ m dr tC w po tC w /3 l2 dc rd lI po po ag dc ol tC rrp 0 ag 0 l1 0 2 20 ol 20 lV 20 ol 40 /4 40 r6 40 V 60 lV 60 o6 60 l3 80 % Methylation solo Tto LTR in Inflorescence 100 80 w dc l2 w tC ol po lI V rd r6 lV o6 ag solo Tto LTR Leaf Seedling dc 100 o1 80 miR161 D solo Tto LTR in Inflorescence CG CHG CHH 100 endogenous Athila LTR 9 w C Tto LTR antisense ag o dc l l ag 3 po o6 w lV tC po ol lI V dc rdr l2 6 /3 /4 0.0 24nt 21nt 24nt 21nt 21nt 2/ 0.2 Tto LTR sense vh 0.4 24nt 21nt su 0.6 po solo Tto LTR in: w tC ol dc l3 0.8 B solo Tto LTR tC Relative Expression Figure 3 A 1.0 Figure 3. Mechanism of expression-dependent RdDM. A. RT-qPCR of mutant T1 plants transformed with the solo Tto LTR transgene. B. Small RNA gel blot of the individuals depicted in part A. For all RNA gel blot assays, lanes are generated from pooled sibling plants. 21 vs 22 nt small RNAs are not distinguishable in this assay. Control Athila LTR siRNAs are absent in dcl3 and pol IV mutants, yet only reduced in the dcl2/3/4 triple mutant due to the activation of the Dicer-independent pathway (Ye et al., 2016). C. Bisulfite sequencing of T1 DNA methylation levels of plants depicted in part A in inflorescence, or in T1 seedlings or juvenile leaves (D). E. Bisulfite sequencing of T1 DNA methylation levels in inflorescence tissue of the solo Tto LTR transgene transformed into various single and double mutant combinations. engineered introns iTto LTR Tto1 LTR Tto1 CDS Term iTf1 LTR Tf1 LTR Term 0.00 oc k In Ind u d iR AN uce ced d 1 M to , iT to , LT R to Tt o lT o 0.05 24nt 21nt 24nt 21nt Tto LTR sense 21nt miR161 Tto LTR antisense so l o Tf1 CDS 0.10 Tt o LT fu R iT ll T t Ino, M to du o ce ck d inducible promoter in wt Col: iT Tto1 LTR Tto1 CDS C 0.15 fu l full Tto Tto1 LTR LTR Relative Expression B solo Tto LTR Tto1 LTR 0.20 so l Figure 4 A 100 80 % Methylation E CG CHG CHH CHH/CG Methylation D 60 40 20 20 40 20 40 total methylation (%) no CHH 60 Tto1 CDS 50 solo LTRs full-length 40 30 ATCOPIA26 ATCOPIA49 ENDOVIR1 ATLANTYS3 ATGP1 ATCOPIA15 META1 ATCOPIA39 ATRE1 ATCOPIA82 20 10 ATCOPIA31 D :C LT LT R Tt o o Tt lo D Tt so 3' C LT R o Tt Tt o 5’ LT R 2 tin la hi Ac 0 S 0 R N/A LT R 0 N/A 10 S 20 00 maintenance pol V % Methylation 30 60 full Tto, 3'LTR iTto, 5'LTR 0.2 LTRs 3'LTR % Methylation 40 At iTf1 50 82 CDS 5'LTR 20 iTto, 3'LTR 0.4 I solo Tto LTR 100 iTto, Mock Induced 80 60 g0 40 CDS 5'LTR H 400 At 1 60 solo Tto LTR 0.6 iTto, Mock Induced Tto LTR:CDS Tto1 LTR 100 80 % Methylation LTRs full Tto o G 100 H3K9me2 Relative Fold Enrichment F 3’LTR CDS 5’LTR LTR TSS solo Tto LTR Transgene: iRAN full Tto, 5'LTR Tto LTR:CDS 0.8 0.0 0 Target region: 0 no methylation 1.0 RdDM 0 0 10 20 30 40 wt Col % Methylation META1 50 Figure 4. Full-length transposable elements are targeted for maintenance methylation and heterochromatin formation. A. Transgenes generated and transformed into Arabidopsis. Arrows are the same as described in Figure 1. B. RT-qPCR of the Tto LTR region in T1 plants transformed with the indicated transgene. C. Small RNA gel blot of plants depicted in part B. The iRAN1 transgene serves as a negative control for plants that do not contain Tto1 sequences. D. Bisulfite sequencing of T1 DNA methylation levels of plants depicted in part B. For the full Tto and iTto transgenes, multiple regions are assayed, including both LTRs. The ‘LTRs’ lane is the combination of both 5’ and 3’ LTRs for equal comparison to the solo Tto LTR transgene. E. Categorization of methylation type based on the CHH/CG methylation context ratio. LTR results from panel D and H are plotted. F. Bisulfite sequencing of T1 DNA methylation levels of plants carrying the mock-induced iTf1 fission yeast TE-transgene. G. H3K9me2 ChIP of plants carrying either the solo Tto LTR or iTto transgene. For all ChIP experiments, three biological replicate samples were isolated from non-overlapping pools of sibling plants with the same genotype. ChIP qPCR fold-enrichment is normalized to the constitutively expressed gene AXS2. Bar height represents the average between biological replicate samples, and the error bars represent the standard error of the mean between biological replicates. N/A indicates that the indicated region does not exist in that transgene. H. Bisulfite sequencing of T1 DNA methylation levels of plants carrying the Tto LTR:CDS transgene. I. Comparison of total DNA methylation levels of endogenous LTR retrotransposons in the Arabidopsis genome in wt Col and pol V mutants. Each dot represents the average methylation of a single TE family for either full-length copies (open circles) or solo LTR copies (red dots). Text labels indicate solo LTRs and full-length elements that are regulated by RdDM (decrease of at least 4% methylation in pol V compared to wt). M ed V lI ORF TSS LTR Term iTto 5’LTR 80 60 40 20 0 1400 300 500 150 300 LTR ORF J 1000 600 200 60 40 0 At1g08200 LTR Term T2 Actin2 Mock Induced 0.6 0.0 0.4 pol V Ind 0.2 pol V Ind pol V Mock drm1/2 Ind drm1/2 Mock G 100 0 20 80 iTf1 5’LTR 4 60 3 40 2 20 1 0 0 endogenous Athila LTR w t w Col tC I iTto Relative Expression o nd w l I uc po t C ndu ed l I ol ce V M d p In o po ol I duc ck l V V ed M po Ind ock l V uc M ed oc k 1.0 oc k F M 300 50 50 300 oc k 21-22 nt 24 nt M 1000 100 V LTR E V 0 % Methylation 5 po l iR iR AN AN 1 M 1 oc I w ndu k t w Co ced tC l ol Mo In ck po du c l po IV ed lI M V o In ck po duc po l V ed l V Mo c dr Ind k m u dr 1/ ce m 2 d 1/ M 2 o In ck du ce d 50 50 50 50 50 50 300 to to in in w tC o w l iT to t C o in p l le iT to ol I af V in iR p o l AN V 1 in iT w to tC in o w iT to t C l ol in w iT to t C ol in p o le a iT f to l in IV iR p o AN l V 1 in w tC ol iT iT C oc k 150 p < 0.05 po lI 0 0.8 CHH/CG Methylation iTto Relative Expression 10 w wt C tC o iTto Relative Mobile Copy # ol l M In o p o c po l du k l I IV ce V M d In o po pol du ck l V V ce In Mo d du ck ce d w wt C tC o ol l M I po pol ndu ock l I IV ce V M d In o po pol du ck l V V ce In Mo d du ck ce d 3 % Methylation D wt Col no transgene 6 M 300 500 iTto pol V Induced iTto wt Col Mock 9 tC ol iTto wt Col Induced 12 15 w iTto pol IV Induced B H3K9eme2 Relative Fold Enrichment T2 oc k In du c po e lV d M po oc lV k In du ce d po V k oc du c In ol M po lI ol tC oc k M % Methylation 100 w tC 1 AN I w iR Figure 5 A 15 24nt 21nt iTto LTR sense 24nt 21nt iTto CDS sense 21nt miR161 Induced TSS iTto 5’LTR 80 no methylation pol V Mock wt Col Mock 40 60 Total Methylation (%) iTto 5'LTR H 5 iTto CDS CG CHG CHH 60 40 20 0 orange = T1 blue = T2 pol IV Ind RdDM pol IV Mock wt Col Ind pol IV Mock wt Col Ind pol IV Ind wt Col Mock maintenance no CHH 80 T1 T2 wt Col Mock wt Col Induced pol IV Mock pol IV Induced pol V Mock pol V Induced drm1/2 Mock drm1/2 Induced 20 iTto 3'LTR Figure 5. Silencing of exogenous TEs requires RdDM to progress to maintenance methylation. A. RT-qPCR of iTto steady-state mRNA levels in T1 plants of the indicated genotype. Plants were either induced for iTto expression or mock-induced. B. qPCR of spliced retrotransposon-induced cDNA formation in the same plants as part A. Relative copy number is calculated based on the amount of retrotransposon-induced cDNA compared to the single copy chromosomal gene AXS2. Bar height represents the average calculated from three biological replicate samples of non-overlapping pools of sibling plants with the same genotype / treatment. Error bars represent the standard error of the mean between biological replicate samples. C. Small RNA gel blot of the plants from part A. D. Results of small RNA deep sequencing aligned to the iTto sequence. The Y-axes are all to the same scale, with the maximum abundance adjusted to the highest peak by genotype, as indicated. Small RNAs have been library-size normalized by calculating the reads per million of genome-matched (including the transgene sequence) 18-28 nt reads. E. Bisulfite sequencing of T1 DNA methylation levels of plants depicted in part A. F. Categorization of methylation type based on the CHH/CG methylation context ratio. LTR results from panel E (orange) and H (blue) are plotted. G. Bisulfite sequencing of T1 DNA methylation levels of plants with the iTf1 TE. H. RT-qPCR of iTto mRNA accumulation from T2 plants, some of which were produced from expression-induced T1 plants. The X-axis labels refer to the induction of the T1 parent of the assayed T2 progeny. I. Bisulfite sequencing of T2 of plants depicted in part H. In parts H-I, PCR primers only detect the transgene copy of iTto and not any potential extra-chromosomal or transposed copies. J. H3K9me2 ChIP of T2 of plants depicted in part H. 5’ and 3’ LTR PCR primers are anchored to the transgene, while the CDS primers could potentially amplify extra-chromosomal or transposed iTto copies. 100 CG CHG CHH LT R 3' 60 to 20 40 Total Methylation (%) 0 iT 0 no CHH S maintenance 100 LT R wt Col x pol IV wt Col x pol IV pol IV x wt Col wt Col x pol V pol IV x wt Col pol V x wt Col 200 5' pol IV x wt Col, rep A pol V x wt Col, rep B pol IV x wt Col, rep B pol IV x wt Col, rep C to pol V x wt Col, rep A iT 0.5 wt Col x pol IV, rep C wt Col, selfed T2 wt Col, selfed T3 pol IV, selfed T2 T3: pol IV x wt Col, rep A wt/wt 0 1.0 300 2 RdDM D tin wt Col x pol V, rep B wt Col x pol V, rep A Ac no methylation 20 iTto from wild-type grandparent 0 08 CHH/CG Methylation 1.5 screen mutant for mutant 20 1g wt wt iTto from mutant grandparent C wt (T2) mutant (T3) (T3) 40 At screen wt for wt mutant wt mutant (T2) wt 60 mutant mutant H3K9me2 Relative Fold Enrichment wt mutant mutant mutant lI V po x l I wt V po x Col l I wt , re V w x Col p A tC w , ol t C rep x ol, B po re lI p po V, C lV re p w xw C tC t C ol po x ol, l V po re p w x w l V, A tC t re C ol p x ol, A po re lV pB ,r ep w B tC o po l x l I po V l w x w IV tC t C o po l x ol l V po x lV w tC w ol tC ol po x l I po V l x IV w tC ol wt wt (T1) po wt wt wt wt mockinduced % Methylation 80 mock- mutant induced mutant (T1) D iTto T3 pol IV pol IV T3 wt/wt C iTto B to mutant transform wild-type iT Figure 6 A transform Figure 6. The genotype of direct transformation dictates the final level of transgene silencing. A. Experimental crossing scheme. The right side indicates control crosses. Colors denote data labels in parts B-C. B. Bisulfite sequencing of DNA methylation levels of genotyped T3 plant lines depicted in part A. These T3 plants were generated by performing the T1 cross indicated on the X-axis followed by selfing of the T2 generation. Replicates (rep A-C) denote different transgene insertion events used in the crossing scheme. Brackets under plant lines represent reciprocal crosses performed with the same T1 individual. C. Categorization of methylation type based on the CHH/ CG methylation context ratio using the results of panel B. D. H3K9me2 ChIP of plants with the iTto transgene directly transformed into the current genotype (wt Col T2, wt Col T3, pol IV T2) compared to plants in which the transgene was transformed into pol IV and introgressed wt Col (replicate A from part B). Figure 7 Establishment by Expressiondependent RdDM TE Pol II AAAAA RDR6 Reinforcement by Pol IV-RdDM Pol V RD R6 - Rd DM Initiation DCL3-RdDM AGO TE Pol IV DCL3 AGO1 AAAAA DRM2 TE RDR2 Maintenance DRM2 TE Pol V AGO4/6 Pol IV-RdDM 24 nt siRNAs MET1, CMT2, CMT3, and DDM1 Epigenetic inheritance of the transcriptionallysilenced TE 21 & 22 nt siRNAs plant cell DCL2, DCL4 Expressiondependent silencing Identitybased silencing TE Unique Incoming TEs TE Homologous Figure 7. The Initiation-Establishment-Reinforcement-Maintenance model of TE silencing. Incoming unique TEs must be expressed, go through post-transcriptional silencing via RNAi (initiation), and establishment of DNA methylation though either the RDR6-RdDM or DCL3-RdDM mechanism of expression-dependent RdDM (Cuerda-Gil and Slotkin, 2016). Incoming TEs that share identity to the pool of siRNAs in the existing genome short-circuit this pathway and enter directly into Pol IV-RdDM. A functional Pol IV-RdDM pathway is necessary in the transformed T0 cells or subsequent T1 generation to reinforce DNA methylation and transition the TE from RdDM to the fully silenced maintenance of DNA methylation. Parsed Citations Allen, E. et al. (2005). microRNA-directed phasing during trans-acting siRNA biogenesis in plants. Cell 121: 207-21. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Axtell, M.J. et al. (2006). A two-hit trigger for siRNA biogenesis in plants. Cell 127: 565-77. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Bao, W. et al. (2015). Repbase Update, a database of repetitive elements in eukaryotic genomes. Mob DNA 6: 11. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Bartolomé, C. et al. (2009). Widespread evidence for horizontal transfer of transposable elements across Drosophila genomes. Genome Biol. 10: R22. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Bond, D.M. and Baulcombe, D.C. (2015). Epigenetic transitions leading to heritable, RNA-mediated de novo silencing in Arabidopsis thaliana. Proc. Natl. Acad. Sci. U.S.A. 112: 917-22. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Böhmdorfer, G. et al. (2010). A synthetic biology approach allows inducible retrotransposition in whole plants. Syst Synthetic Biology 4: 133-138. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Böhmdorfer, G. et al. (2016). Long non-coding RNA produced by RNA polymerase V determines boundaries of heterochromatin. Elife 5: e19092. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Cao, X. et al. (2003). Role of the DRM and CMT3 methyltransferases in RNA-directed DNA methylation. Curr. Biol. 13: 2212-7. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Cao, X. and Jacobsen, S.E. (2002). Locus-specific control of asymmetric and CpNpG methylation by the DRM and CMT3 methyltransferase genes. Proc. Natl. Acad. Sci. U.S.A. 99 Suppl 4: 16491-8. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Chan, S.W. et al. (2004). RNA silencing genes control de novo DNA methylation. Science 303: 1336. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Creasey, K.M. et al. (2014). miRNAs trigger widespread epigenetically activated siRNAs from transposons in Arabidopsis. Nature 508: 411-5. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Cuerda-Gil, D. and Slotkin, R.K. (2016). Non-canonical RNA-directed DNA methylation. Nat Plants 2: 16163. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Curtis, M.D. and Grossniklaus, U. (2003). A gateway cloning vector set for high-throughput functional analysis of genes in planta. Plant Physiol. 133: 462-9. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Daniels, S.B. et al. (1990). Evidence for horizontal transmission of the P transposable element between Drosophila species. Genetics 124: 339-55. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Du, J. et al. (2012). Dual binding of chromomethylase domains to H3K9me2-containing nucleosomes directs DNA methylation in plants. Cell 151: 167-80. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title El Baidouri, M. et al. (2014). Widespread and frequent horizontal transfers of transposable elements in plants. Genome Res. 24: 831-8. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Feng, S. et al. (2010). Conservation and divergence of methylation patterning in plants and animals. Proc. Natl. Acad. Sci. U.S.A. 107: 8689-94. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Fultz, D. et al. (2015). Silencing of active transposable elements in plants. Curr. Opin. Plant Biol. 27: 67-76. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Greenberg, M.V. et al. (2011). Identification of genes required for de novo DNA methylation in Arabidopsis. Epigenetics 6: 344-54. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Herr, A. et al. (2005). RNA Polymerase IV Directs Silencing of Endogenous DNA. Science 308: 118-120. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Hirochika, H. (1993). Activation of tobacco retrotransposons during tissue culture. The EMBO journal 12: 2521. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Hirochika, H. et al. (2000). Silencing of retrotransposons in arabidopsis and reactivation by the ddm1 mutation. Plant Cell 12: 35769. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Hirochika, H. and Otsuki, H. (1995). Extrachromosomal circular forms of the tobacco retrotransposon Tto1. Gene 165: 229-32. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Jackel, J.N. et al. (2016). Arabidopsis RNA Polymerases IV and V Are Required To Establish H3K9 Methylation, but Not Cytosine Methylation, on Geminivirus Chromatin. J. Virol. 90: 7529-40. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Jakowitsch, J. et al. (1999). Molecular and cytogenetic characterization of a transgene locus that induces silencing and methylation of homologous promoters in trans. Plant J. 17: 131-40. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Johnson, L.M. et al. (2014). SRA- and SET-domain-containing proteins link RNA polymerase V occupancy to DNA methylation. Nature 507: 124-8. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Karimi, M. et al. (2002). GATEWAY vectors for Agrobacterium-mediated plant transformation. Trends Plant Sci. 7: 193-5. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Kato, M. et al. (2003). Role of CG and non-CG methylation in immobilization of transposons in Arabidopsis. Curr. Biol. 13: 421-6. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Kinoshita, Y. et al. (2007). Control of FWA gene silencing in Arabidopsis thaliana by SINE-related direct repeats. Plant J. 49: 38-45. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Kyndt, T. et al. (2015). The genome of cultivated sweet potato contains Agrobacterium T-DNAs with expressed genes: An example of a naturally transgenic food crop. Proc. Natl. Acad. Sci. U.S.A. 112: 5844-9. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Langmead, B. et al. (2009). Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 10: R25. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Law, J.A. et al. (2013). Polymerase IV occupancy at RNA-directed DNA methylation sites requires SHH1. Nature 498: 385-9. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Levin, H.L. et al. (1993). Novel gene expression mechanism in a fission yeast retroelement: Tf1 proteins are derived from a single primary translation product. EMBO J. 12: 4885-95. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Li, S. et al. (2015). Detection of Pol IV/RDR2-dependent transcripts at the genomic scale in Arabidopsis reveals features and regulation of siRNA biogenesis. Genome Res. 25: 235-45. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Li, Y. et al. (2013). Gene tagging with engineered Ds elements in maize. Methods Mol. Biol. 1057: 83-99. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Lister, R. et al. (2008). Highly integrated single-base resolution maps of the epigenome in Arabidopsis. Cell 133: 523-36. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Liu, Q. et al. (2014). Analysis of complementarity requirements for plant microRNA targeting using a Nicotiana benthamiana quantitative transient assay. Plant Cell 26: 741-53. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Lu, C. et al. (2005). Elucidation of the small RNA component of the transcriptome. Science 309: 1567-9. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Marí-Ordóñez, A. et al. (2013). Reconstructing de novo silencing of an active plant retrotransposon. Nat. Genet. 45: 1029-39. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Matzke, A.J. et al. (1994). Homology-dependent gene silencing in transgenic plants: epistatic silencing loci contain multiple copies of methylated transgenes. Mol. Gen. Genet. 244: 219-29. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Matzke, M.A. et al. (2002). Homology-dependent gene silencing and host defense in plants. Adv. Genet. 46: 235-75. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Matzke, M.A. et al. (2000). Transgene silencing by the host genome defense: implications for the evolution of epigenetic control mechanisms in plants and vertebrates. Plant Mol. Biol. 43: 401-15. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Matzke, M.A. and Mosher, R.A. (2014). RNA-directed DNA methylation: an epigenetic pathway of increasing complexity. Nat. Rev. Genet. 15: 394-408. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title McCue, A.D. et al. (2015). ARGONAUTE 6 bridges transposable element mRNA-derived siRNAs to the establishment of DNA methylation. EMBO J. 34: 20-35. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Nodine, M.D. and Bartel, D.P. (2010). MicroRNAs prevent precocious gene expression and enable pattern formation during plant embryogenesis. Genes Dev. 24: 2678-92. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Nuthikattu, S. et al. (2013). The initiation of epigenetic silencing of active transposable elements is triggered by RDR6 and 21-22 nucleotide small interfering RNAs. Plant Physiol. 162: 116-31. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Pace, J. et al. (2008). Repeated horizontal transfer of a DNA transposon in mammals and other tetrapods. P Natl Acad Sci Usa 105: 17023-8. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Panda, K. et al. (2016). Full-length autonomous transposable elements are preferentially targeted by expression-dependent forms of RNA-directed DNA methylation. Genome Biol. 17: 170. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Panda, K. and Slotkin, R.K. (2013). Proposed mechanism for the initiation of transposable element silencing by the RDR6-directed DNA methylation pathway. Plant Signal Behav 8: e25206. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Pontier, D. et al. (2012). NERD, a plant-specific GW protein, defines an additional RNAi-dependent chromatin-based pathway in Arabidopsis. Mol. Cell 48: 121-32. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Pérez-Hormaeche, J. et al. (2008). Invasion of the Arabidopsis genome by the tobacco retrotransposon Tnt1 is controlled by reversible transcriptional gene silencing. Plant Physiol 147: 1264-78. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Raizada, M.N. et al. (2001). Somatic and germinal mobility of the RescueMu transposon in transgenic maize. Plant Cell 13: 1587-608. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Ruiz, MT et al. (1998). Initiation and maintenance of virus-induced gene silencing. Plant Cell 10: 937-46. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Stroud, H. et al. (2014). Non-CG methylation patterns shape the epigenetic landscape in Arabidopsis. Nat. Struct. Mol. Biol. 21: 6472. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Takeda, S. et al. (1999). A 13-bp cis-regulatory element in the LTR promoter of the tobacco retrotransposon Tto1 is involved in responsiveness to tissue culture, wounding, methyl jasmonate and fungal elicitors. Plant J Cell Mol Biology 18: 383-93. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Tanurdzic, M. et al. (2008). Epigenomic consequences of immortalized plant cell suspension culture. PLoS Biol. 6: 2880-95. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Walsh, A.M. et al. (2013). Widespread horizontal transfer of retrotransposons. Proc. Natl. Acad. Sci. U.S.A. 110: 1012-6. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Wei, W. et al. (2012). A role for small RNAs in DNA double-strand break repair. Cell 149: 101-12. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Wierzbicki, A.T. et al. (2009). RNA polymerase V transcription guides ARGONAUTE4 to chromatin. Nat. Genet. 41: 630-4. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Wu, L. et al. (2012). Roles of dicer-like and argonaute proteins in TAS-derived small interfering RNA-triggered DNA methylation. Plant Physiol. 160: 990-9. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Yang, D.-L.L. et al. (2015). Dicer-independent RNA-directed DNA methylation in Arabidopsis. Cell Res 26: 66-82. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Ye, R. et al. (2016). A Dicer-Independent Route for Biogenesis of siRNAs that Direct DNA Methylation in Arabidopsis. Mol. Cell 61: 222-35. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Zemach, A. et al. (2013). The Arabidopsis nucleosome remodeler DDM1 allows DNA methyltransferases to access H1-containing heterochromatin. Cell 153: 193-205. Pubmed: Author and Title CrossRef: Author and Title Google Scholar: Author Only Title Only Author and Title Exogenous Transposable Elements Circumvent Identity-Based Silencing Permitting the Dissection of Expression Dependent Silencing Dalen Fultz and R. Keith Slotkin Plant Cell; originally published online February 13, 2017; DOI 10.1105/tpc.16.00718 This information is current as of June 18, 2017 Supplemental Data /content/suppl/2017/02/13/tpc.16.00718.DC1.html Permissions https://www.copyright.com/ccc/openurl.do?sid=pd_hw1532298X&issn=1532298X&WT.mc_id=pd_hw1532298X eTOCs Sign up for eTOCs at: http://www.plantcell.org/cgi/alerts/ctmain CiteTrack Alerts Sign up for CiteTrack Alerts at: http://www.plantcell.org/cgi/alerts/ctmain Subscription Information Subscription Information for The Plant Cell and Plant Physiology is available at: http://www.aspb.org/publications/subscriptions.cfm © American Society of Plant Biologists ADVANCING THE SCIENCE OF PLANT BIOLOGY
© Copyright 2025 Paperzz