Phylogenetically Close Group I Introns with Different Positions among Paramecium bursaria Photobionts Imply a Primitive Stage of Intron Diversification Ryo Hoshina* and Nobutaka Imamura *Department of Biomedical Science, College of Life Sciences, Ritsumeikan University, Shiga, Japan; and Department of Pharmacy, College of Pharmaceutical Sciences, Ritsumeikan University, Shiga, Japan Group I introns are a distinct RNA group that catalyze their excision from precursor RNA transcripts and ligate the exons. Group I introns have a sporadic and highly biased distribution due to the two intron transfer mechanisms of homing and reverse splicing. These transfer pathways recognize assigned sequences even when introns are transferred beyond the species level. Consequently, introns at homologous gene sites between different host organisms are more related than those at heterologous sites within an organism. We describe the subgroup IE introns of two Chlorella species that are symbiotic green algae (photobionts) of a ciliate, Paramecium bursaria. One strain Chlorella sp. SW1-ZK (Csw.) had two IE introns at S651 and L2449, and the other strain Chlorella sp. OK1-ZK (Cok.) had four IE introns at S943, L1688, L1926, and L2184 (numbering reflects their homologous position in Escherichia coli rRNA gene: S 5 small subunit rRNA, L 5 large subunit rRNA). Despite locating on six heterologous sites, the introns formed a monophyletic clade independent of other groups. Phylogenetic and structural analyses of the introns indicated that Csw.L2449 has an archaic state, and the other introns are assumed to be originated from this intron. Some of the introns shared common internal guide sequences, which are necessary for misdirected transfer (i.e., transposition) via reverse splicing. Other introns, however, shared similar sequence fragments further upstream, after the insertions. We propose a hypothetical model to explain how these intron transpositions may have occurred in these photobionts; they transposed by a combination of homing-like event requiring relaxed sequence homology of recognition sequences and reverse splicing. This case study may represent a key to describe how group I intron explores new insertion sites. Introduction Group I introns are a distinct RNA group that function as enzymes, splicing themselves out of precursor RNA transcripts, and ligating exons (Kruger et al. 1982; Cech 1985). To date, more than 2,000 group I introns have been identified from nuclear, mitochondrial, and plastid DNA (Comparative RNA Web [CRW] site [http://www.rna.ccbb. utexas.edu/]; Cannone et al. 2002). Group I introns have a wide but sporadic and highly biased distribution in nature and are found in seven (i.e., Opisthokonts, Amoebozoa, Plants, Cercozoa, Alveolates, Heterokonts, and Discicristates) of the eight eukaryotic supergroups classified by Baldauf (2003) and Bacteria and viruses (e.g., Kuhsel et al. 1990; Yamada et al. 1994; Edgell et al. 2000; Cannone et al. 2002; Jackson et al. 2002). Group I introns have various lengths (e.g., 200–1,500 nt), but primary and secondary structures, especially enzymatic core regions, are surprisingly well conserved, which suggests that they derived from a common origin (Burke 1988; Cech 1988). In contrast, peripheral elements of the core have various forms; these were used to subdivide introns into four major subgroups, namely IA, IB, IC, and ID (Michel and Westhof 1990). Subsequently, another independent subgroup, IE, was created from a subset of subgroup IB introns (Suh et al. 1999). Nuclear encoding introns belong to either subgroup IC or IE, and for unknown reasons, they are restricted to ribosomal RNA genes (rDNA); this differs from organelles, in which introns are found in rRNA, tRNA, and protein-coding regions (see CRW). Key words: degenerative homing, intron diversification, intron transposition, Paramecium bursaria symbiotic algae, reverse splicing. E-mail: [email protected]. Mol. Biol. Evol. 26(6):1309–1319. 2009 doi:10.1093/molbev/msp044 Advance Access publication March 11, 2009 Ó The Author 2009. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: [email protected] A distinctive character of group I introns is their mobility. Some similar introns are found in distantly related organisms. Some introns within a lineage have phylogenetic relationships that do not correspond to their host phylogeny. This may be due to the lateral transfer (duplication) of introns beyond the species level. That is, group I introns are mobile genetic elements that specialize in invading genomes of different species. They may spread by homing (e.g., Dujon 1989; Jurica and Stoddard 1999; Chevalier and Stoddard 2001) or reverse splicing (Woodson and Cech 1989; Roman and Woodson 1995, 1998). In homing, an intron encodes a homing endonuclease (HE) gene located at the terminus of a peripheral helix (e.g., the tip of P1, P2, or P8). HE recognizes 14–40 specific nucleotide rows of double-stranded DNA and then cleaves the target site, typically resulting in a 3# single-stranded overhang. As one side tries to repair its loss, the intron (with the HE gene) serves as the template for repair synthesis. Reverse splicing is the inverse process of intron splicing at the RNA level. Because additional steps of cDNA synthesis and recombination are necessary, and rDNA, for example, is a multicopy gene, this type of movement likely hinders intron fixation in a population or species (Bhattacharya et al. 2005; Haugen, Simon, and Bhattacharya 2005). Recent intron phylogenetic analyses have provided strong evidence for lateral transfer events via reverse splicing (Bhattacharya et al. 1996, 2002, 2005). Consequently, the highly sporadic distributions of group I introns in presentday organisms may be explained by a combination of ancestral gains, vertical inheritance with continual loss, and new gains by horizontal transfer (e.g., Bhattacharya et al. 2005; Haugen, Simon, and Bhattacharya 2005). However, this explanation further complicates what is known about intron evolution. As mentioned above, group I introns are found in diverse lineages of organisms and at extremely unbalanced frequencies. Most nuclear-encoded group I introns are particularly dense in fungi, red algae, and chlorophyte algae (Haugen, Simon, and Bhattacharya 2005). 1310 Hoshina and Imamura Paramecium bursaria is a ciliate that maintains a symbiotic relationship with Chlorella-like photobionts (for reviews on Chlorella taxonomy, see Huss et al. 1999; Krienitz et al. 2004). Although two exceptional photobionts have been reported (Chlorella vulgaris and Coccomyxa sp., see Hoshina and Imamura 2008a), all other algae whose phylogenetic placements are known belong to ‘‘American’’ (‘‘Southern’’) or ‘‘European’’ (‘‘Northern’’) groups (Hoshina et al. 2005; Summerer et al. 2008). Both the American and European groups belong to the Chlorellaceae (Trebouxiophyceae) and the phylogenetic relationships within this family are still unclear; however, the rather larger genetic dissimilarities between American and European groups compared with their host Paramecium suggest that they are derived from different origins as paramecian photobionts (Hoshina and Imamura 2008a). Symbiotic Chlorella spp. demand organic nitrogen compounds (Reisser 1984; Kamako et al. 2005) and are sensitive to Chlorella viruses, which are abundant in natural freshwater (Van Etten et al. 1991; Yamada et al. 1991). Moreover, they have never been collected from nature as free-living Chlorella. In other words, both the American and European groups comprise distinct species inhabiting the cells of P. bursaria. We previously introduced an American group alga, Chlorella sp. NC64A collected in NY, which was particularly intron-rich, containing eight group I introns (Cnc. introns: four subgroup IC and four IE) in the nuclear rDNA (Hoshina and Imamura 2008b). These had five new insertion sites (S1367, L200, L1688, L2183, and L2437; numbering reflects their homologous position in Escherichia coli rRNA gene: S 5 small subunit [SSU] rRNA, L 5 large subunit [LSU] rRNA) and one occupied a unique site (S943) as subgroup IE. Phylogenetic analyses revealed that the Cnc. IC introns formed a clade with chlorophyte introns with insertions at nine heterologous sites and all Cnc. IE introns made a robust clade with an intron of S651 (Chlorella sp. AN 1-3) independent of other chlorophyte introns. These data could not be rationally explained in terms of evolutionary processes. Therefore, in the present study, we sequenced rDNA of two strains of P. bursaria photobionts: Chlorella sp. SW1-ZK (isolated from German P. bursaria) belonging to European group and Chlorella sp. OK1-ZK (isolated from Japanese P. bursaria) belonging to American group. Our aim is to answer the following questions: 1) Does the SW1-ZK have some introns in the LSU rDNA region? European algae contain one intron in their SSU rDNA (Hoshina et al. 2005; Gaponova et al. 2007; Hoshina and Imamura 2008a), but no analysis on this intron has been published, 2) How does the SW1-ZK introns phylogenetically relate to the Cnc. introns or other introns?, and 3) Does the OK1-ZK also possess the same types of introns as Chlorella sp. NC64A? This last question would help address whether these introns are stable among the algal group even if rather distant habitats (e.g., over 10,000 km). Although group I introns, especially their enzymatic functions, are well understood (see Cech 2002 and references therein), their roles from the perspective of evolutionary biology and the mechanisms of their diversification are poorly understood. We introduce a set of subgroup IE introns that cannot be explained by existing theories of intron spread and discuss their evolutionary context. Materials and Methods Establishment of Algal Strains and Cultures Photobiont strains of Chlorella sp. SW1-ZK and OK1ZK were obtained as follows. Cells of P. bursaria SW1 (German strain; see Hoshina et al. 2005) and OK1 (Japanese strain; see Hoshina et al. 2004) were physically disrupted, and the suspension was spread onto C medium (Ichimura 1971) agar plates. Algal microcolonies that formed were transferred to C þ bacto-peptone (1 g l1) agar plates. These grew into small colonies, which were subsequently transferred to C liquid medium with 2.3 mM casamino acid. The photobionts were maintained under fluorescent illumination (16:8 h light/dark, 100 lmol photons m2 s1) at 25 °C. DNA Extraction, Amplification, and Sequencing For DNA extraction, we used a DNeasy plant mini kit (Qiagen, GmbH, Düsseldorf, Germany) according to the manufacturer’s directions. SSU rDNA (of Chlorella sp. SW1-ZK) was amplified by polymerase chain reaction (PCR) with the three primer pairs SR-1/SR-5, SR-4/SR-9, and SR-8/SR12, and LSU rDNA was amplified with the following five primer pairs: INT4F/HLR3R, HLR0F/ LR5, HLR5F/LR8, HLR7F/HLR9R, and HLR9F/LR12k. The PCR products were confirmed via agarose gel electrophoresis, purified via polyethylene glycol precipitation, and then sequenced directly. A fragment of the posterior HLR9F/LR12k product of Chlorella sp. SW1-ZK could not be read by direct sequencing, and thus, we reamplified this region with the primer pair HLR10F/LR12k, which was subcloned into pGEM-T Easy Vector (Promega, Madison, WI). The subcloning procedure defined several polymorphic sequences with indels at nucleotide position L2629, indicating that the sequence was made unreadable by one of the nucleotide indels. PCR and the sequencing primers are described by Hoshina et al. (2004) and Hoshina and Imamura (2008b). Structure Diagrams of Csw. Introns We found two group I introns in the rDNA of Chlorella sp. SW1-ZK (Csw.) and eight in Chlorella sp. OK1-ZK (Cok.). Secondary structure diagrams of Csw. introns were prepared according to previously reported models (e.g., Lehnert et al. 1996; Haugen et al. 2002). The central catalytic intron cores (e.g., from P3 to P7) were determined and then the remaining helices predicted by the Mfold (Mathews et al. 1999; Zuker 2003) version 3.2 Web server (http://frontend.bioinfo.rpi.edu/applications/mfold/cgi-bin/ rna-form1.cgi/) were appended to the core. Phylogenetic Analyses We used several group I introns belonging to subgroups IC and IE (to which nuclear group I introns belong) to show the rough phylogenetic positions of the Csw. introns. The secondary structure diagrams of all these Primitive Stage of Group I Intron Spread 1311 FIG. 1.—Insertion sites of group I introns on nuclear ribosomal RNA genes of Chlorella sp. SW1-ZK and Chlorella sp. OK1-ZK. Subgroup IE introns are in bold. The numbering reflects their homologous positions in the Escherichia coli rRNA gene. comparative introns are published (e.g., Suh et al. 1999; Mavridou et al. 2000; Müller et al. 2005) or are available at CRW. We manually aligned the highly conserved (ribozymatic core) regions of the P, Q, R, S, and P3 helix elements and appended two Csw. introns and some Cok. introns. Phylogenetic analyses were conducted with these alignment data (supplementary fig. S1, Supplementary Material online) using MEGA version 4 (Tamura et al. 2007). An unrooted tree was obtained by the minimum evolution (ME) method of equally weighted P distance and then bootstrap analyses were calculated (1,000 replicates) using the same evolutionary model. Only subgroup IE introns were used for the posterior analyses. The intron sequences were aligned manually, taking into account the secondary structures, and in total, 268 aligned positions (200 parsimony informative) were selected (supplementary fig. S2, Supplementary Material online). These positions contained not only the catalytic cores of P3–P7 but also some peripheral elements. For accurate alignment, we excluded ambiguous regions, such as the highly diverged P9.2 region and thereafter, additional branching elements of P2.1c (seen in S989 and S1199 introns, see Müller et al. 2005) and P1, the sequences of which highly depend on insertion sites. Unrooted trees were constructed using the ME method of equally weighted P distance and the Neighbor-Joining (NJ) method of Jukes and Cantor in MEGA and maximum likelihood (ML) method in PAUP 4.0b10 (Swofford 2003). The best-fit evolutionary models for the ML analysis were determined via Modeltest 3.6 (Posada and Crandall 1998) and then the best likelihood score (ln L 5 5647.84) was obtained under the general time reversible (GTR) þ G þ I evolutionary model with the following parameters: nucleotide frequencies of A 5 0.229676, C 5 0.234538, G 5 0.301781, and T 5 0.234005; substitution rate matrix of AC 5 0.8384134, AG 5 3.9786037, AT 5 2.8529331, CG 5 1.3359373, CT 5 6.1273343, and GT 5 1; an invariable proportion of sites (0.090251); variable site rates assumed to follow a gamma distribution with a shape parameter of 0.892388; and four rate categories. The bootstrap probabilities in each method were computed for 1,000 replicates. Results and Discussion Ribosomal DNA Sequences and Intron Detection We established two P. bursaria photobiont strains: the European-type Chlorella sp. SW1-ZK (Csw.) and the American-type Chlorella sp. OK1-ZK (Cok.). For the Chlorella sp. SW1-ZK, we obtained a 6,210-bp (or 6,209, see Materials and Methods) product between primers SR1 and LR12k, including SSU rDNA, internal transcribed spacer (ITS) 1, 5.8S rDNA, ITS2, and LSU rDNA (AB437244–56). The SSU rDNA–ITS region was identical to the uncultured algal DNA sequence (AB206547) that we previously obtained directly from P. bursaria SW1 (Hoshina et al. 2005). This sequence included an intron-like (324 nt) insertion at S651. In the LSU rDNA region of the algal isolate SW1-ZK, we detected another long (330 nt) insertion at L2449 (fig. 1). Chlorella sp. OK1-ZK had eight group I introns with three in the SSU rDNA region (S943, S1367, and S1512; AB162912) and five in the LSU rDNA region (L200, L1688, L1926, L2184, and L2437; AB437257). These insertion positions and sequences are virtually identical to those of Chlorella sp. NC64A (Cnc.), an authentic American strain, reported by Hoshina and Imamura (2008b). The only difference between them was in the L200 introns; the 89th residue (P3–P4 junction) of Cok.L200 was A, whereas that of Cnc.L200 was a double composition of A and C. Structure Models of Csw. Introns To confirm the two Csw. insertion identities, we performed a Fasta similarity search on each sequence in the DNA Data Bank of Japan (DDBJ; http://www.ddbj.nig. ac.jp/), which defined their similarities to some group I introns belonging to subgroup IE. Figure 2 shows the secondary structure diagrams of Csw.S651 and Csw.L2449. They were constructed by tracing the well-known core region (as ribozyme) and adding physically folded peripheral loops. The structures contained helices P1–P10 and the P13 loop–loop contact segment (thought to be involved in the tertiary interaction necessary for excising the intron from rRNA, Cech 1990; Suh et al. 1999). Of these, the shorter P5 helices are typical for subgroup IE introns. Rough Phylogenetic Positions of Csw. Introns A ME tree constructed using only the core regions of 60 nt (49 parsimony informative) sites strongly suggested that the Csw. insertions were subgroup IE introns; sets of IE and IC introns were clearly separated (fig. 3). The distinctive structural difference between nuclear rDNA introns, that is, subgroups IC and IE, is the possession (IC) or lack (IE) of an extended P5 domain. However, recent studies have reported derived (inherited for a long period) IC introns with shortened P5 and/or P9 domains (Haugen, Joseph, and Bhattacharya 2004; Simon et al. 2005), resembling IE intron structure. Csw. introns had structures that are typical of the IE subgroup (fig. 2) and 1312 Hoshina and Imamura FIG. 2.—Structure diagrams for the Chlorella sp. SW1-ZK rDNA introns (Csw.S651 and Csw.L2449), folded according to previously reported models (e.g., Lehnert et al. 1996; Haugen et al. 2002). Capital letters indicate intron sequences, and lowercase letters indicate flanking exon sequences. Arrows point to the 5# and 3# splice sites. Pairing segments of P1–P10 and P13 loop–loop contact (this domain is thought to be involved in tertiary interactions necessary for splicing from rRNA; Cech 1990; Suh et al. 1999) locations are indicated. were phylogenetically strongly suggested to be affiliated with the IE group. The Metarhizium anisopliae L2449 intron (Man.L2449) has an extended P5abc helix (Mavridou et al. 2000) typical of subgroup IC introns; however, some studies have treated it as an IE intron (e.g., Bhattacharya et al. 2005; Simon et al. 2005). Our tree indicated that Man.L2449 was located in an intermediate position between IE and IC. With or without Man.L2449, however, fungal L2449 introns have commonly emerged at the base of IE or as IC in previous reports (Nikoh and Fukatsu 2001; Haugen, Wikmark, et al. 2005; Müller et al. 2005; Simon et al. 2005), in agreement with our results. We also confirmed that there is no link between Csw.L2449 and Man.L2449. In IE, although some phylogenetic branches had lower bootstrap support, fungal S516 and foraminiferous L1926 diverged first and then S989 of rhodophyte and fungi branched off. The Csw. introns were included in a moderately supported (71%) clade including L1926 of diatoms, fungal L2066, chlorophyte S516, and Cok.S943 (only one IE Cok. intron was used in this analysis). Phylogenetic Relationships of Csw., Cok., and Other Introns The second tree was constructed of only IE introns, separate from IC and IC-like introns, to more deeply resolve the relationships among closely related IE introns. To execute the comparison, we included wide structural elements of the introns that might include phylogenetic information to the alignment. The IC introns, whose structure varies, were hinder when attempting to align with IE introns with accuracy. These efforts were fully rewarded as the data set included as many as 268 (200 parsimony informative) sites of P2–P9.1 (see supplementary fig. S2, Supplementary Material online). Phylogenetic trees were constructed by ME (fig. 4), NJ (supplementary fig. S3, Supplementary Material online), Primitive Stage of Group I Intron Spread 1313 The Structural Characters of Clade A FIG. 3.—Phylogenetic relationships of subgroup IE and IC inferred from 60 nt sites across highly conserved sequence elements of the P, Q, R, S, and P3 segments. An unrooted tree was constructed by the ME method of equally weighted P distance. IE introns of Chlorella sp. SW1-ZK (Csw.) and Chlorella sp. OK1-ZK (Cok.) are in bold. The bootstrap probabilities of the ME analyses are given; only values above 50% support are shown. Letters in brackets indicate taxonomic affiliations: Vc, Chlorophyceae; Vk, Klebsormidiophyceae; Vt, Trebouxiophyceae; A, ascomycetes; B, basidiomycetes; D, diatoms; F, foraminiferans; and R, rhodophytes. The inserted positions of rRNA sites are given after brackets. and ML (supplementary fig. S4, Supplementary Material online) methods. ME and NJ trees shared the nearly same topology in that discussed below; however, ML tree had some different branches from those in ME and NJ. For short and highly divergent sequences, like our intron alignment, the parameter-rich ML analyses may give incorrect topologies (Nei et al. 1998). The ‘‘simple’’ models such as the single parameter ME or NJ methods are thought to be potentially useful for these cases (Bruno and Halpern 1999; Piontkivska 2004). We, therefore, place more emphasis on simple distance trees rather than parameter-rich ML tree. The ME tree (fig. 4) showed a large but slightly supported clade as a sister to the S989–S1199 clade (not supported in ML, supplementary fig. S4, Supplementary Material online). We termed this ‘‘Clade A,’’ which contained three partially to highly supported clades of chlorophyte S516 (not supported in ML, supplementary fig. S4, Supplementary Material online), fungal L2066 with L1926 of diatoms, and ‘‘Clade B’’ composed of Csw. introns, Cok. introns, and Chlorella sp. AN 1-3 intron (Can.S651) with six heterologous insertion sites. Cok.L1926 did not seem to have a direct relationship to the L1926 of diatoms or foraminiferans. The tree revealed some evolutionary contexts in Clade B: Csw.L2449 diverged first (not supported in ML, supplementary fig. S4, Supplementary Material online) and then Cok.L2184 branched off from the other five introns of Cok.S943, Cok.L1688, Cok.L1926, Csw.S651, and Can.S651. In general, distinct but similar characters, whether morphological or molecular, are indicative of a closely related group. The same is true of introns. Group I introns that occur at the same insertion sites (i.e., position family, as described below) usually share similar structures (Tan 1997; Müller et al. 2005), whereas introns in different positions but with similar structures are often specifically related. In the case of IE introns, S989 and S1199 members commonly possess a branched P2.1 helix (Li and Zhang 2005; Müller et al. 2005), indicative of monophyly (Nikoh and Fukatsu 2001; Bhattacharya et al. 2001; Müller et al. 2005; fig. 4). The IE intron analyses defined a large clade consisting of chlorophyte S516, fungal L2066, L1926 of diatoms, Can.S651, and the P. bursaria photobiont introns (Clade A; fig. 4). This clade would correspond to the minor subgroup IE2 sensu Li and Zhang (2005), although the ML analysis did not support (supplementary fig. S4, Supplementary Material online). Although we excluded the P9.2 helix region from the phylogenetic analyses, this section contained a surprisingly consensual sequence in Clade A members; they shared a GAGGAAAUGC (or highly similar) terminal loop (fig. 2) with the only exceptions being the S516 introns of Heterochlorella luteoviridis and Watanabea reniformis. The monophyly of these two exceptions was partially indicated within the closely related S516 insertion position family, and thus, this common sequence of the P9.2 termini is considered a symplesiomorphic character of Clade A. Such high character conservation may be necessary for ribozyme stability. Heterochlorella luteoviridis and W. reniformis are phylogenetically close taxa (Huss et al. 1999; Krienitz et al. 2004), suggesting that an evolutionary change in the P9.2 terminus occurred in their common ancestor, and thereafter, the intron with a mutated P9.2 was vertically inherited. Six Heterologous Introns in Clade B Subgroup IE introns have only been observed in nuclear rRNA genes, and only about 300 sequences are listed in the CRW. Except for those in P. bursaria photobionts, they have extremely biased insertion position distributions; almost 90% of the introns are in S516, S989, and S1199. Conversely, the introns of P. bursaria photobionts are rich in original insertion positions: L1688 and L2184 are specific to American photobionts, each insertion at S943 and L2449 is specific to American or European alga as IE subgroup, and the S651 site is only seen in European alga and in Chlorella sp. AN 1-3, all of which formed a clade with robust bootstrap support (Clade B in fig. 4). This result is unusual and is discussed in detail below. Assuming that they were truly monophyletic, we measured the variation in their primary and secondary structures. Wikmark et al. (2007) studied fungal S1389 introns that were assumed to have diverged from a common origin; they constructed a consensus secondary structure of six S1389 introns. In the core regions of PQRS and P3, the S1389 introns had 33 invariants and 27 over-50% conserved positions out of 62 juxtaposed positions. Clade B members, on the other hand, had higher consistency: 42 invariants and 20 over-50% conserved positions out of the same 62 1314 Hoshina and Imamura FIG. 4.—Unrooted ME (equally weighted P distance) tree of selected IE introns. Introns of Chlorella sp. SW1-ZK (Csw.) and Chlorella sp. OK1ZK (Cok.) are in bold. Bootstrap values above the internal nodes were inferred from ME (left) and NJ of Jukes and Cantor (right) analyses, whereas ML bootstrap analyses of GTR þ I þ G evolutionary models selected via Modeltest 3.6 (Posada and Crandall 1998) are shown below the nodes; only values above 50% support are shown. Letters in brackets indicate taxonomic affiliations: Vc, Chlorophyceae; Vk, Klebsormidiophyceae; Vt, Trebouxiophyceae; A, ascomycetes; B, basidiomycetes; D, diatoms; F, foraminiferans; and R, rhodophytes. Intron insertion positions are given on the right side of the vertical bars. positions (data not shown). In addition, the P6 region of Clade B members had highly different structures. Some IE introns had a longer P6 with a GNRA tetraloop at the termini. GNRA tetraloops are frequently observed in group I introns and other large self-assembling RNA molecules (e.g., Woese et al. 1990; Bhattacharya et al. 2001), and the tertiary interactions of those that reside at the P5 terminus of IC introns are well studied (e.g., Jaeger et al. 1994; Cate et al. 1996; Costa and Michel 1997). Longer P6 regions were seen in fungal and chlorophyte S516, L1926 of diatoms, and Csw.L2449 (fig. 5). However, Cok.L2184 had a slightly shorter P6, and the other Clade B members had a much shorter conserved P6. Moreover, some termini substituted and lost the GNRA motif (Cok.L2184 and CokL1926). Although the tertiary contact region of the P6 GNRA tetraloop among IE introns is not yet known, this may represent a loss event of a tertiary interaction and, further, may imply the ancestral status of Csw.L2449 among Clade B members. If so, how did the ancestral Csw.L2449 spread into six different positions, as seen in Clade B? Position Family, the Theory of Intron Spread With increasing data on group I intron sequences in GenBank, many phylogenetic analyses have been conducted to better understand intron evolution. One important Primitive Stage of Group I Intron Spread 1315 FIG. 5.—Structural variations in P6 helices. Fungal and chlorophyte S516, L1926 of diatoms, and Csw.L2449 share longer helices with a GNRA tetraloop (boxed), whereas others have shorter helices with or without the GNRA tetraloop. Csw. and Cok. are abbreviations for Chlorella sp. SW1-ZK and Chlorella sp. OK1-ZK, respectively. discovery is that introns at homologous gene sites are related (i.e., belong to the same position family), even among distantly related host organisms (Bhattacharya et al. 1994, 1996; Hibbett 1996; Nikoh and Fukatsu 2001). This phenomenon is linked to intron spreading mechanisms. Lateral transfer of group I introns may be due to homing or reverse splicing. Despite a lack of any direct evidence of intron movement via reverse splicing from genetic crosses, there is no other explanation for transposition (except among neighboring sites; see Haugen, Reeb, et al. 2004), due to sequence recognition differences. HEs require a longer sequence of 14–40 nt, which strongly prevents transfer among heterologous sites, whereas reverse splicing may occur with small sequences of 4–6 nt. This region is called the internal guide sequence (IGS); it constructs the P1 helix (fig. 2), which is spliced from rRNA, andis a targetfor lateral transfer (with or without transposition) via reverse splicing. Most of the 5# flanking sequences of S989 and S1199, which formed a clade in the present study (fig. 4), shared a common IGS of –CAGGT– (see the group I intron sequence and structure Database [GISSD], http://www.rna.whu.edu.cn/gissd/; Zhou et al. 2008). This is in agreement with previous studies (Bhattacharya et al. 1996, 2002, 2005), indicating that they may have been derived from a common ancestor and currently reside at two heterologous positions, caused by reverse splicing events with transposition. Among Clade B members with six heterologous insertion sites, L2184 and L1926 shared a common IGS of – CTCTT–and L1688 and S943 appeared to share a shorter IGS of –CGT– (see Cnc. introns; Hoshina and Imamura 2008b). A part of Clade B introns are, consequently, thought possibly to be incorrect duplications via reverse splicing. However, the others did not have a common IGS. Therefore, reverse splicing alone is not enough to explain the entire intron lineage of Clade B. Scheme of Group I Intron Evolution It is not clear why introns appear to have evolved into five major subgroups. And after the primal diversification, introns have spread into many genic loci (multichannelization). In IE introns, for example, more than 10 (exclusive of the introns of P. bursaria photobionts’) insertion sites have been found (see CRW) and of those most heterologous introns do not share a common 5# flanking sequence. Studies on the evolution of group I introns have been hard to put to reply how the introns gained such various insertion positions (e.g., Nikoh and Fukatsu 2001; Bhattacharya et al. 2002, 2005; Wikmark et al. 2007). Introns with many insertion positions can be understood to have undergone the stages shown in figure 6. In stage I, an ancestral group I intron diversified into the five subgroups currently 1316 Hoshina and Imamura FIG. 6.—Schematic model of group I intron evolution with implied insertion position diversification, showing seven insertion positions. L2184 originated from L2449 via an unknown pathway, and L1926 is thought to be an incorrect duplication via reverse splicing. S287 (Bhattacharya et al. 2002, 2005) and S989 (this study) are also attributed to transpositional movements via reverse splicing, starting from S1199. recognized. In stage II, each intron identified acceptable gene insertion sites (e.g., S516, S1199, and L2449 of rRNA genes); it is possible that some younger introns originated in this stage from older introns (e.g., Csw.L2449 might generate Cok.L2184). Finally, in stage III, incorrect duplication, though rare, occurred via reverse splicing (S1199 spread into S287 [Bhattacharya et al. 2002, 2005] and also into S989, as described above; Cok.1926 might be derived from Cok.L2184). Intron spreading (homing or reverse splicing) without transpositional events is rather ordinary, helping to consolidate position families. Based on the above scheme, Clade B can be explained by stage II and III diversifications. Csw.L2449 explored position L2184 by an unknown mechanism. Cok.L2184 then duplicated within the genome and explored position L1926 via reverse splicing and other sites by an unknown mechanism. Possibility of Degenerative Homing No recognized pathway can explain how introns gain new insertion sites or transposed heterologous sites (e.g., stage II in fig. 6), although Belfort and Perlman (1995) reported an important possibility that allows for illegitimate recombination, namely, intron transposition by a degenera- tive homing event requiring relaxed sequence homology. When considering Csw.L2449 and Cok.L2184 from this perspective, although they do not share a common IGS, we unexpectedly detected similar sequence blocks of – GRAGCT–, –GGYGYCRG–, and –AAA(T/-)TACCAC– in upstream regions (fig. 7). Likewise, similar sequence blocks (–TAGAAYAA– and –GYCGGYRAAAT–) were seen between L2184 and L1688. A shorter region (–GG(A/-)AGTCGGC–), but with higher homology, was also found between Cok.L1688 and Csw.S651. That is, a specific six-nucleotide-long sequence emerges stochastically once every 4 kb, and a seven-nucleotide-long sequence emerges once per 16 kb. Such similar sequence blocks occurring at almost the same position relative to intron insertions are therefore unlikely to be coincidental. More noteworthy is that three (or more) nucleotides following insertions were identical. These data suggest the possibility of degenerative homing events. HE genes promote intron homing, and HE proteins are classified into four major families: LAGLIDADG, His-Cys box, GIY-YIG, and HNH (Jurica and Stoddard 1999 and references therein). The GIY-YIG endonuclease has the most flexible sequence recognition, that is, I-TevI (T4 phage GIY-YIG endonuclease) allows for 15% degenerative substrate (Jurica and Stoddard 1999). Further, the cleavage and intron insertion sites are well separated, resulting in a long sequence recognition (not palindromic) up to 40 bp (Bryk et al. 1993). If an HE has promoted the transpositional transfer of Csw.L2449 into Cok.L2184 and subsequent spreads, it might analogize GIY-YIG type endonuclease with further flexible sequence recognition. The alternation of sequence recognition with HE genes or mutations has been partially simulated (Chevalier et al. 2003; Haugen and Bhattacharya 2004; Haugen, Reeb, et al. 2004). Each member of the HE family, thought to have evolved from a common origin, has a variety of recognition sites. For example, the His-Cys box family is exclusively associated with nuclear rDNA group I introns (more than 20 introns reported) and possesses five different recognition sites (S516, S943 and nearby sites, S1199, S1506, and L1926 and nearby sites), depending on each HE-containing intron (Haugen, Reeb, et al. 2004). This indicates that HE has evolved by dramatically changing its sequence recognition and may involve intron diversification at an early stage. HE genes are selfish genetic elements that invade noncritical regions of introns and regions between introns (Haugen, Reeb, et al. 2004; Haugen, Wikmark, et al. 2005). Moreover, the introns that include the HE gene were FIG. 7.—Sequences near intron insertion sites. Similar sequence fragments (highlighted) are found further upstream and after insertions. Identical nucleotides are marked with asterisks. The sequences of these regions are common in Chlorella sp. SW1-ZK and Chlorella sp. OK1-ZK. Primitive Stage of Group I Intron Spread 1317 FIG. 8.—Evolutionary context of subgroup IE introns found in Paramecium bursaria photobionts (outline characters: Chlorella sp. SW1-ZK; solid characters: Chlorella sp. OK1-ZK) with six heterologous insertion sites. This scheme is based on hypothetical transposition pathways of degenerative homing (DH) and reverse splicing (RS). once fixed in the population; the HE genes no longer have active functions and are ultimately lost (Goddard and Burt 1999; Haugen, Reeb, et al. 2004). Only a tiny minority of introns contain the HE gene. Therefore, the fact that Clade B members do not contain any HE gene-like sequences does not contradict the hypothesis. Rather, Cok.L2184 and Cok.L1926 have long (more than 130 and 180 nt) insertions at P1 and P8 (see Cnc.L2184 and Cnc.L1926, Hoshina and Imamura 2008b). Such long sequence insertions (.50) at peripheral helices can be regarded as HE gene remnants (Haugen, Reeb, et al. 2004). Conclusion Most group I introns that share a common insertion site are phylogenetically more closely related than are the hosts of these introns. However, we identified a deviant group in which six heterologous subgroup IE introns in two algal photobionts (European-type Chlorella sp. SW1-ZK and American-type Chlorella sp. OK1-ZK) of P. bursaria and one in Chlorella sp. AN 1-3 formed an independent clade. This result is of essential importance in understanding the evolution of group I introns. These heterologous introns were likely partly duplicated by a primitive mechanism of intron spread, that is, degenerative homing with a succession of recognition sequences. The introns share similar sequence fragments upstream of the insertions and a few residues after insertions, which could be ideal recognition sequences for HE. Although the introns of photobionts do not contain intact HE or HE-like gene sequences, some retain unusual extra sequences (might be HE gene wreckage). These findings represent the first strong evidence for the multichannelization strategy of group I intron insertion positions (fig. 6). The above hypothesis and the phylogenetic relationships suggest the following scenario for their evolutionary context, as shown in figure 8: an ancestral group I intron infected a highly acceptable L2449 site (Jackson et al. 2002; Simon et al. 2005) of European alga via an unknown pathway; the L2449 intron laterally transferred into the novel site L2184 of American alga via degenerative homing; this L2184 intron was copied and transferred within the genome via degenerative homing (into L1688) and via reverse splicing (into L1926); finally, the L1688 intron was copied to the American S943 site via reverse splicing and laterally back to the European genome via degenerative homing as a novel intron, S651. Neither European nor American algae live freely in natural water sources due to their nutritional requirement and the viral sensitivity (as mentioned above). However, the completion of the hereditary line from L2449 to S651, including two transfer events beyond the species barrier, might have needed a special situation where two algae have frequently contacted each other. It is just conceivable that there was a period during which the two photobionts lived sympatrically, simultaneously in the P. bursaria cell (double-photobiont association in P. bursaria has been known to occur; Nakahara et al. 2004). Such a situation where cell–cell contact within a small space may accelerate the lateral transfer of group I introns (Bhattacharya et al. 1996; Friedl et al. 2000). In the future, these introns may be duplicated into other organisms via mechanisms such as homing or reverse splicing. Indeed, Can.S651 may be a first step in the formation of a new position family of introns. Group I intron has yet to be found in P. bursaria genome. Intron transfer is rather ordinary among ingroup taxa (e.g., among green algae); however, intron transfer straddling different Kingdoms is extremely rare. Lichen photobionts (e.g., Trebouxia species, Trebouxiophyceae) are a treasury of group I introns, and these introns have been suggested their frequent lateral transfer (due to cell–cell contact, Friedl et al. 2000). Although such transmissions have continuously undergone in the small space of filamentous tissue of lichen-forming fungi, any algae-derived intron (in phylogenetic sense) has not been discovered from the host fungi. These facts imply that existing introns are hard to transfer beyond higher biological ranks in nature. Then how have the organisms gained initial introns? Viruses possibly play a part in the intron transfer (Yamada et al. 1994; Bhattacharya et al. 1996; Nishida et al. 1998; Friedl et al. 2000). A few kinds of viruses possess group I intron, of which only eukaryotic virus is Chlorella virus (see GISSD). This virus, surprisingly, infects only the photobionts of P. bursaria. The intron sequences of Chlorella virus are phylogenetically related to eukaryotic introns assigned to subgroup IC (Yamada et al. 1994; Nishida et al. 1998); however, the introns are not close to IC introns of American-type photobiont (contains four IC introns, mentioned above) or of other green algae (Hoshina and Imamura 2008b). IE-type intron has not been documented from viral genomes. Whether or not Chlorella virus truly mediates intron transfer remains to be seen. 1318 Hoshina and Imamura Supplementary Material Supplementary figures S1–S4 are available at Molecular Biology and Evolution online (http://www.mbe. oxfordjournals.org/). Acknowledgments We sincerely thank Mr T. Nishimura (Nagahama Institute of Bio-Science and Technology) for preparing E. coli-competent cells, Dr T. Sasaki (Tokyo Institute of Technology) for kindly providing many copies of articles, and Dr S-i. Kamako (Ritsumeikan University) for help in establishing algal cultures. Literature Cited Baldauf SL. 2003. The deep roots of eukaryotes. Science. 300:1703–1706. Belfort M, Perlman PS. 1995. Mechanisms of intron mobility. J Biol Chem. 270:30237–30240. Bhattacharya D, Cannone JJ, Gutell RR. 2001. Group I intron lateral transfer between red and brown algal ribosomal RNA. Curr Genet. 40:82–90. Bhattacharya D, Friedl T, Damberger S. 1996. Nuclear-encoded rDNA group I introns: origin and phylogenetic relationships of insertion site lineages in the green algae. Mol Biol Evol. 13:978–989. Bhattacharya D, Friedl T, Helms G. 2002. Vertical evolution and intragenic spread of lichen-fungal group I introns. J Mol Evol. 55:74–84. Bhattacharya D, Reeb V, Simon DM, Lutzoni F. 2005. Phylogenetic analyses suggest reverse splicing spread of group I introns in fungal ribosomal DNA. BMC Evol Biol. 5:68. Bhattacharya D, Surek B, Rüsing M, Damberger S, Melkonian M. 1994. Group I introns are inherited through common ancestry in the nuclear-encoded rRNA of Zygnematales (Charophyceae). Proc Natl Acad Sci USA. 91:9916–9920. Bruno WJ, Halpern AL. 1999. Topological bias and inconsistency of maximum likelihood using wrong models. Mol Biol Evol. 16:564–566. Bryk M, Quirk SM, Mueller JE, Loizos N, Lawrence C, Belfort M. 1993. The td intron endonuclease I-TevI makes extensive sequence-tolerant contacts across the minor groove of its DNA target. EMBO J. 12:4040–4041. Burke JM. 1988. Molecular genetics of group I introns: RNA structures and protein factors required for splicing—a review. Gene. 73:273–294. Cannone JJ, Subramanian S, Schnare MN, et al. (14 co-authors). 2002. The Comparative RNA Web (CRW) site: an online database of comparative sequence and structure information for ribosomal, intron, and other RNAs. BMC Bioinform. 3:2. Cate JH, Gooding AR, Podell E, Zhou K, Golden BL, Kundrot CE, Cech TR, Doudna JA. 1996. Crystal structure of a group I ribozyme domain: principles of RNA packing. Science. 273: 1678–1685. Cech TR. 1985. Self-splicing RNA: implications for evolution. Int Rev Cytol. 93:3–22. Cech TR. 1988. Conserved sequences and structures of group I introns: building an active site for RNA catalysis—a review. Gene. 73:259–271. Cech TR. 1990. Self-splicing of group I introns. Annu Rev Biochem. 59:543–568. Cech TR. 2002. Ribozymes, the first 20 years (ribozyme mechanisms and folding). Biochem Soc Trans. 30:1162–1166. Chevalier BS, Stoddard BL. 2001. Homing endonucleases: structural and functional insight into the catalysts of intron/ intein mobility. Nucleic Acids Res. 29:3757–3774. Chevalier B, Turmel M, Lemieux C, Monnat RJ Jr, Stoddard BL. 2003. Flexible DNA target site recognition by divergent homing endonuclease isoschizomers I-CreI and I-MsoI. J Mol Biol. 329:253–269. Costa M, Michel F. 1997. Rules for RNA recognition of GNRA tetraloops deduced by in vitro selection: comparison with in vivo evolution. EMBO J. 16:3289–3302. Dujon B. 1989. Group I introns as mobile genetic elements: facts and mechanistic speculations—a review. Gene. 82:91–114. Edgell DR, Belfort M, Shub DA. 2000. Barriers to intron promiscuity in bacteria. J Bacteriol. 182:5281–5289. Friedl T, Besendahl A, Pfeiffer P, Bhattacharya D. 2000. The distribution of group I introns in lichen algae suggests that lichenization facilitates intron lateral transfer. Mol Phylogenet Evol. 14:342–352. Gaponova IN, Andronov EE, Migunova AV, Vorobyev KP, Chizhevskaja EP, Kvitko KV. 2007. Genomic dactyloscopy of Chlorella sp., symbionts of Paramecium bursaria Protistology. 4:311–317. Goddard MR, Burt A. 1999. Recurrent invasion and extinction of a selfish gene. Proc Natl Acad Sci USA. 96:13880–13885. Haugen P, Bhattacharya D. 2004. The spread of LAGLIDADG homing endonuclease genes in rDNA. Nucleic Acid Res. 32:2049–2057. Haugen P, De Jonckheere JF, Johansen S. 2002. Characterization of the self-splicing products of two complex Naegleria LSU rDNA group I introns containing homing endonuclease genes. Eur J Biochem. 269:1641–1649. Haugen P, Joseph R, Bhattacharya D. 2004. Long-term evolution of the S788 fungal nuclear small subunit rRNA group I intron. RNA. 10:1084–1096. Haugen P, Reeb V, Lutzoni F, Bhattacharya D. 2004. The evolution of homing endonuclease genes and group I introns in nuclear rDNA. Mol Biol Evol. 21:129–140. Haugen P, Simon DM, Bhattacharya D. 2005. The natural history of group I introns. Trends Genet. 21:111–119. Haugen P, Wikmark O-G, Vader A, Coucheron DH, Sjøttem E, Johansen SD. 2005. The recent transfer of a homing endonuclease gene. Nucleic Acids Res. 33:2734–2741. Hibbett DS. 1996. Phylogenetic evidence for horizontal transmission of group I introns in the nuclear ribosomal DNA of mushroom-forming fungi. Mol Biol Evol. 13:903–917. Hoshina R, Imamura N. 2008a. Multiple origins of the symbioses in Paramecium bursaria. Protist. 159:53–63. Hoshina R, Imamura N. 2008b. Eu-Chlorella large subunit rDNA sequences and group I introns in ribosomal DNA of the paramecian symbiotic alga NC64A. Phycol Res. 56:21–32. Hoshina R, Kamako S-i, Imamura N. 2004. Phylogenetic position of endosymbiotic green algae in Paramecium bursaria Ehrenberg from Japan. Plant Biol. 6:447–453. Hoshina R, Kato Y, Kamako S-i, Imamura N. 2005. Genetic evidence of ‘‘American’’ and ‘‘European’’ type symbiotic algae of Paramecium bursaria Ehrenberg. Plant Biol. 7: 526–532. Huss VAR, Frank C, Hartmann EC, Hirmer M, Kloboucek A, Seidel BM, Wenzeler P, Kessler E. 1999. Biochemical taxonomy and molecular phylogeny of the genus Chlorella sensu lato (Chlorophyta). J Phycol. 35:587–598. Ichimura T. 1971. Sexual cell division and conjugation-papilla formation in sexual reproduction of Closterium strigosum. In: Nishizawa K, Arasaki S, Chihara M, Hirose H, Nakamura V, Tsuchiya Y, editors. Proceedings of the Seventh International Primitive Stage of Group I Intron Spread 1319 Seaweed Symposium. Tokyo, Japan: University of Tokyo Press. p. 208–214. Jackson SA, Cannone JJ, Lee JC, Gutell RR, Woodson SA. 2002. Distribution of rRNA introns in the three-dimensional structure of the ribosome. J Mol Biol. 323:35–52. Jaeger L, Michel F, Westhof E. 1994. Involvement of a GNRA tetraloop in long-range RNA tertiary interactions. J Mol Biol. 236:1271–1276. Jurica MS, Stoddard BL. 1999. Homing endonuclease: structure, function and evolution. Cell Mol Life Sci. 55:1304–1326. Kamako S-i, Hoshina R, Ueno S, Imamura N. 2005. Establishment of axenic endosymbiotic strains of Japanese Paramecium bursaria and the utilization of carbohydrate and nitrogen compounds by the isolated algae. Eur J Protistol. 41:193–202. Krienitz L, Hegewald E, Hepperle D, Huss VAR, Rohr T, Wolf M. 2004. Phylogenetic relationship of Chlorella and Parachlorella gen. nov. (Chlorophyta, Trebouxiophyceae). Phycologia. 43:529–542. Kruger K, Grabowski PJ, Zaug AJ, Sands J, Gottschling DE, Cech TR. 1982. Self-splicing RNA: autoexcision and autocyclization of the ribosomal RNA intervening sequence of Tetrahymena. Cell. 31:147–157. Kuhsel MG, Strickland R, Palmer JD. 1990. An ancient group I intron shared by eubacteria and chloroplasts. Science. 250:1570–1573. Lehnert V, Jaeger L, Michel F, Westhof E. 1996. New loop-loop tertiary interactions in self-splicing introns of subgroup IC and ID: a complete 3D model of the Tetrahymena thermophila ribozyme. Chem Biol. 3:993–1009. Li Z, Zhang Y. 2005. Predicting the secondary structures and tertiary interactions of 211 group I introns in IE subgroup. Nucleic Acids Res. 33:2118–2128. Mathews DH, Sabina J, Zuker M, Turner DH. 1999. Expanded sequence dependence of thermodynamic parameters improves prediction of RNA secondary structure. J Mol Biol. 288:911–940. Mavridou A, Cannone J, Typas MA. 2000. Identification of group-I introns at three different positions within the 28S rDNA gene of the entomopathogenic fungus Metarhizium anisopliae var. anisopliae. Fungal Genet Biol. 31:79–90. Michel F, Westhof E. 1990. Modelling of the three-dimensional architecture of group I catalytic introns based on comparative sequence analysis. J Mol Biol. 216:585–610. Müller KM, Ellenor DW, Sherwood AR, Sheath RG, Cannone JJ, Gutell RR. 2005. Evidence for lateral transfer of an IE intron between fungal and red algal small subunit rRNA Genes. J Phycol. 41:380–390. Nakahara M, Handa S, Watanabe S, Deguchi H. 2004. Choricystis minor as a new symbiont of simultaneous twospecies association with Paramecium bursaria and implications for its phylogeny. Symbiosis. 36:127–151. Nei M, Kumar S, Takahashi K. 1998. The optimization principle in phylogenetic analysis tends to give incorrect topologies when the number of nucleotides or amino acids used is small. Proc Natl Acad Sci USA. 95:12390–12397. Nikoh N, Fukatsu T. 2001. Evolutionary dynamics of multiple group I introns in nuclear ribosomal RNA genes of endoparasitic fungi of the genus Cordyceps. Mol Biol Evol. 18:1631–1642. Nishida K, Suzuki S, Kimura Y, Nomura N, Fujie M, Yamada T. 1998. Group I introns found in Chlorella viruses: biological implications. Virology. 242:319–326. Piontkivska H. 2004. Efficiencies of maximum likelihood methods of phylogenetic inferences when different substitution models are used. Mol Phylogenet Evol. 31:865–873. Posada D, Crandall KA. 1998. Modeltest: testing the model of DNA substitution. Bioinformatics. 14:817–818. Reisser W. 1984. The taxonomy of green algae endosymbiotic in Ciliates and Sponge. Brit Phycol J. 19:309–318. Roman J, Woodson SA. 1995. Reverse splicing of the Tetrahymena IVS: evidence for multiple reaction sites in the 23S rRNA. RNA. 1:478–790. Roman J, Woodson SA. 1998. Integration of the Tetrahymena group I intron into bacterial rRNA by reverse splicing in vivo. Proc Natl Acad Sci USA. 95:2134–2139. Simon D, Helms G, Friedl T, Bhattacharya D. 2005. Divergent histories of rDNA group I introns in the lichen family Physciaceae. J Mol Evol. 60:434–446. Suh SO, Jones KG, Blackwell M. 1999. A group I intron in the nuclear small subunit rRNA gene of Cryptendoxyla hypophloia, an ascomycetous fungus: evidence for a new major class of group I introns. J Mol Evol. 48:493–500. Summerer M, Sonntag B, Sommaruga R. 2008. Ciliate-symbiont specificity of freshwater endosymbiotic Chlorella (Trebouxiophyceae, Chlorophyta). J Phycol. 44:77–84. Swofford DL. 2003. PAUP*: phylogenetic analysis using parsimony (* and other methods). Version 4.0b10. Sunderland (MA): Sinauer Associates. Tamura K, Dudley J, Nei M, Kumar S. 2007. MEGA4: molecular evolutionary genetics analysis (MEGA) software version 4.0. Mol Biol Evol. 24:1596–1599. Tan MK. 1997. Origin and inheritance of group I introns in 26S rRNA genes of Gaeumannomyces graminis. J Mol Evol. 44:637–645. Van Etten JL, Lane LC, Meints RH. 1991. Viruses and virus-like particles of eukaryotic algae. Microbiol Rev. 55:586–620. Wikmark O-G, Haugen P, Lundblad EW, Kaugli K, Johansen SD. 2007. The molecular evolution and structural organization of group I introns at position S1389 in nuclear small subunit rDNA of myxomycetes. J Eukaryot Microbiol. 54:49–56. Woese CR, Winker S, Gutell RR. 1990. Architecture of ribosomal RNA: constraints on the sequence of ‘‘tetra- loops’’. Proc Natl Acad Sci USA. 87:8467–8471. Woodson SA, Cech TR. 1989. Reverse self-splicing of the Tetrahymena group I intron: implication for the directionality of splicing and for intron transposition. Cell. 57:335–345. Yamada T, Higashiyama T, Fukuda T. 1991. Screening of natural waters for viruses which infect Chlorella cells. Appl Environ Microbiol. 57:3433–3437. Yamada T, Tamura K, Aimi T, Songsri P. 1994. Self-splicing group I introns in eukaryotic viruses. Nucleic Acids Res. 22:2532–2537. Zhou Y, Lu C, Wu Q-J, Wang Y, Sun Z-T, Deng J-C, Zhang Y. 2008. GISSD: group I intron sequence and structure Database. Nucleic Acid Res. 36:D31–D37. Zuker M. 2003. Mfold web server for nucleic acid folding and hybridization prediction. Nucleic Acids Res. 31:3406–3415. Charles Delwiche, Associate Editor Accepted February 27, 2009
© Copyright 2026 Paperzz