Microevolution of Cryptococcus neoformans Driven by Massive Tandem Gene Amplification Eve W.L. Chow,1 Carl A. Morrow, 1 Julianne T. Djordjevic, 2 Ian A. Wood, 3 and James A. Fraser* ,1 1 Australian Infectious Diseases Research Centre, School of Chemistry and Molecular Biosciences, University of Queensland, Brisbane, Queensland, Australia 2 Centre for Infectious Diseases and Microbiology, Westmead Millennium Institute, University of Sydney at Westmead Hospital, New South Wales, Australia 3 School of Mathematics and Physics, University of Queensland, Brisbane, Queensland, Australia *Corresponding author: E-mail: [email protected]. Associate editor: James McInerney The subtelomeric regions of organisms ranging from protists to fungi undergo a much higher rate of rearrangement than is observed in the rest of the genome. While characterizing these ;40-kb regions of the human fungal pathogen Cryptococcus neoformans, we have identified a recent gene amplification event near the right telomere of chromosome 3 that involves a gene encoding an arsenite efflux transporter (ARR3). The 3,177-bp amplicon exists in a tandem array of 2–15 copies and is present exclusively in strains with the C. neoformans var. grubii subclade VNI A5 MLST profile. Strains bearing the amplification display dramatically enhanced resistance to arsenite that correlates with the copy number of the repeat; the origin of increased resistance was verified as transport-related by functional complementation of an arsenite transporter mutant of Saccharomyces cerevisiae. Subsequent experimental evolution in the presence of increasing concentrations of arsenite yielded highly resistant strains with the ARR3 amplicon further amplified to over 50 copies, accounting for up to ;1% of the whole genome and making the copy number of this repeat as high as that seen for the ribosomal DNA. The example described here therefore represents a rare evolutionary intermediate—an array that is currently in a state of dynamic flux, in dramatic contrast to relatively common, static relics of past tandem duplications that are unable to further amplify due to nucleotide divergence. Beyond identifying and engineering fungal isolates that are highly resistant to arsenite and describing the first reported instance of microevolution via massive gene amplification in C. neoformans, these results suggest that adaptation through gene amplification may be an important mechanism that C. neoformans employs in response to environmental stresses, perhaps including those encountered during infection. More importantly, the ARR3 array will serve as an ideal model for further molecular genetic analyses of how tandem gene duplications arise and expand. Key words: Cryptococcus neoformans, subtelomere, amplification. Introduction The haploid basidiomycete yeast Cryptococcus neoformans is an opportunistic pathogen that causes high morbidity and mortality among the immunocompromised population, particularly in the HIV-infected community (Park et al. 2009). Generally acquired from the environment, infection of a host is accidental and is not required for completion of the C. neoformans lifecycle (Casadevall 1998). A number of key virulence traits that enable infection have been extensively characterized, including the antiphagocytic polysaccharide capsule, production of the pigment melanin, and the ability to grow at mammalian body temperature (Kwon-Chung 1986). Unlike bacterial virulence factors, which are tailored specifically for pathogenesis, fungal virulence factors are considered adaptations to environmental or predatory stresses and are likely co-opted for evasion of the immune system after evolving under the selective pressure of the environmental niche (Steenbergen et al. 2001). Previous studies have shown that extensive karyotypic differences exist between C. neoformans strains collected not only from the natural environment versus clinical isolates but also among isolates obtained from the same patient at different time points during infection (Fries et al. 1996; Boekhout and van Belkum 1997). Variant karyotypes often predominate during later stages of infection and potentially confer a selective advantage in the human host. In support of this theory, emergence of a second karyotype during therapy has been observed coincident with increased virulence and drug resistance (Fries et al. 1996; Blasi et al. 2001; Jain et al. 2006). Analysis of initial and relapse clinical isolates has shown that the vast majority of clinical recurrences are a result of persistence of the original infection rather than reinfection with a new strain (Fries et al. 1996; Desnos-Ollivier et al. 2010). Modern population analyses of C. neoformans employing multilocus sequence typing (MLST) and amplified fragment length polymorphism have demarcated the species into four haploid molecular types: C. neoformans var. grubii (VNI, VNII, and VNB) cause ;95% of infections and C. neoformans var. neoformans (VNIV) approximately 5% (Xu et al. 2000). Studies have shown that while the genomes of C. neoformans var. grubii strain H99 (VNI) and C. neoformans var. neoformans © The Author 2012. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: [email protected] Mol. Biol. Evol. 29(8):1987–2000. 2012 doi:10.1093/molbev/mss066 Advance Access publication February 14, 2012 1987 Research article Abstract MBE Chow et al. · doi:10.1093/molbev/mss066 strain JEC21 (VNIV) are highly syntenic overall, a number of translocations, inversions, and duplications are present (Fraser, Huang, et al. 2005; Kavanaugh et al. 2006; Sun and Xu 2009). Far more examples of karyotypic change have been characterized in Saccharomyces cerevisiae, where they typically correspond with the occurrence of translocations, deletions, insertions, duplications, or changes in copy number of repetitive DNA sequences (Adams et al. 1992; Ibeas and Jimenez 1996; Fischer et al. 2000). These are most commonly found near the chromosome ends, the transitional regions between chromosome-specific sequences and telomeric repeat units (Bosco and Haber 1998; Kuo et al. 2006; Rehmeyer et al. 2006; Fan et al. 2008). Subtelomeres are generally rich in arrays of nonfunctional repetitive sequences, including degenerate telomeric repeats, microsatellites, transposable elements, and members of nonessential gene families (De Bruin et al. 1994; Louis 1995; Flint et al. 1997). Since they undergo high rates of rearrangement and as one telomere can functionally replace another, the subtelomeres are ideal locations for gene amplification events through aberrant DNA repair (Louis 1995; Brown et al. 1998). In Neurospora crassa and S. cerevisiae, amplified gene arrays are found in multiple subtelomeres (Brown et al. 1998; Bergthorsson et al. 2007; Wu et al. 2009; Brown et al. 2010). Over a longer timeframe, this new genetic material can be used in the generation of novel genes through accumulation of mutations and domain accretion. Based on the frequent development of karyotypic variants that occur during cryptococcal infection and the evidence of subtelomeric fluidity in other organisms, we hypothesized that these genomic regions may be subject to a higher rate of karyotypic change. Here, we provide evidence that in contrast to the rest of the genome, the gene content of the subtelomeres of Cryptococcus differs considerably between strains. In addition, we have identified and characterized a specific example of this change as it is underway: a gene amplification event that has occurred in the right subtelomere of chromosome 3 in C. neoformans var. grubii and is now present in all isolates of a defined subclade with worldwide distribution. Through the amplification of an arsenite efflux transporter gene (ARR3), this event confers resistance to high concentrations of arsenite in C. neoformans, producing some of the most arsenic resistant fungi ever reported. This finding is a proof of concept that adaptation through gene amplification occurs in the subtelomeric regions and supports the potential use of this mechanism to promote adaptation during human infection. Materials and Methods Bioinformatic Analyses Analysis of the Cryptococcus subtelomeric regions utilized the whole-genome shotgun sequences of C. neoformans var. grubii VNI strain H99, Cryptococcus gattii VGII strain R265 and C. gattii VGI strain WM276 genome downloaded from the Broad Institute of Harvard and Massachusetts Institute of Technology (http://www.broadinstitute.org/) (D’Souza et al. 1988 2011), the C. neoformans var. neoformans VNIV strain JEC21 downloaded from TIGR (http://www.tigr.org/tdb/ e2k1/cna1/), and VNIV strain B3501A downloaded from Stanford Genome Technology Centre (http://www-sequence. stanford.edu/group/C.neoformans/index.thml) (Loftus et al. 2005). Basic local alignment search tool (BLAST) searches were used to annotate 40-kb regions from each of the 28 chromosome ends, with the homologs identified in closely related species (Altschul et al. 1990). Sequence alignments were generated using ClustalW (Thompson et al. 1994). Sequence traces for MLST analyses were examined using Sequencher 4.7 (Gene Codes Corporation, Ann Arbor, MI) and aligned using ClustalW v1.4 against C. neoformans MLST genotype sequence data (www.mlst.net) (Litvintseva et al. 2006). Statistical Analyses A variation on the procedure described by Brown et al. (2010) was used to look for changes in gene density in the chromosome arms, which might indicate the boundary of the subtelomeric region. The gene positions in a window of size 20 kb were compared against a true uniform distribution, and the window was moved in sequential steps of 1 kb from the telomere. A Kolmogorov–Smornov (KS) P value was calculated for each window. A second method was also employed to look for a change point in gene density using the Pruned Exact Linear Time (PELT) algorithm (Killick et al. 2011) implemented in the R package change point (Killick and Eckley Forthcoming). Intergenic distances were defined as the distances between the centers of consecutive genes in base pairs. The natural logarithm of the intergenic distance values was approximately normally distributed. The PELT algorithm was then applied to the log values to look for change points in the mean of a normal distribution. For both methods, three sequenced genomes of Cryptococcus, (H99, JEC21, and WM276) were pooled to increase sample size. To determine whether the subtelomeres show gene enrichment, a hypothetical subtelomeric cutoff was utilized by starting at 5 kb and increasing in increments of 5 kb. The numbers of hexose transporter genes within and outside the cutoff were compared against the numbers of all other genes using Fisher’s exact test. As the previous two analyses aimed to identify a difference in gene density, the P values were calculated for a range of possible cutoff values, and a graph was plotted of the Fisher’s exact test P value against distance of cutoff from the telomere. In all three analyses, the final cutoff value used was 110 kb, as the shortest chromosome arm is 111 kb in length. Fungal Strains and Growth Conditions Fungal strains used in this study are listed in table 1. Strains were grown on YPD medium (1% yeast extract, 2% peptone, and 2% glucose) or on minimal YNB (Becton Dickinson, Franklin Lakes, NJ) medium (2% glucose as a carbon source) and supplemented to meet auxotroph requirements. Metalloid sensitivity assays were performed using a method modified from Wysocki et al. (2001). Cultures were grown overnight in 100 ml YPD liquid media at 20 °C with MBE Massive Gene Amplification in C. neoformans · doi:10.1093/molbev/mss066 Table 1. Fungal Strains Used in the Study. Strains Cryptococcus H99 CM8 (H99 arr3D) CM48 (H99 arr3D CnARR3) M8A, M8B M8A-11 M8A-13 JEC21 R265 F0, F2 B0, B1, B2, B4 M7A, M7B, M7C, M7F J35a, J35b G0, G1, G2 I51, I52 I55, I56, I60 I61, I63 I66, I73 I68, I74 I82, I83 I91, I95, I96 I107, I110, I111, I112 A5 35-17 C48 C8 A2 102-5 Ug2463 It754 Tn470 Bt104 123.91 124.91 125.91 A7 A7 35-23 C2 C12 C16 C44 C45 8–1 Ug2472 I57, I59 d-27 Bt33 Bt34 Bt63 Saccharomyces cerevisiae BY4739 BY4739 arr3D EC2 (BY4739 arr3D ScARR3) EC3 (BY4739 arr3D CnARR3) Genotype/Description Source MATa VNI MATa arr3::NEO MATa arr3::NEO ARR3:NAT MATa VNI Experimentally evolved M8A Experimentally evolved M8A MATa VNIV MATa VGII MATa VNI MATa VNI MATa VNI MATa VNI MATa VNI MATa VNI MATa VNI MATa VNI MATa VNI MATa VNI MATa VNI MATa VNI MATa VNI MATa VNI MATa VNI MATa VNI MATa VNI MATa VNI MATa VNI MATa VNI MATa VNI MATa VNI MATa VNI MATa VNI MATa VNII MATa VNII MATa VNII MATa VNII MATa VNII MATa VNII MATa VNII MATa VNII MATa VNII MATa VNII MATa VNB MATa VNB MATa VNB Perfect Lab This study This study Fries B. (unpublished data) This study This study Kwon-Chung et al. (1992) Fraser, Giles, et al. (2005) Fries B. (unpublished data) Fries B. (unpublished data) Fries B. (unpublished data) Fries B. (unpublished data) Fries B. (unpublished data) Jain et al. (2005) Jain et al. (2005) Jain et al. (2005) Jain et al. (2005) Jain et al. (2005) Jain et al. (2005) Jain et al. (2005) Jain et al. (2005) Litvintseva et al. (2005) Litvintseva et al. (2005) Litvintseva et al. (2005) Litvintseva et al. (2005) Litvintseva et al. (2005) Litvintseva et al. (2005) Litvintseva et al. (2005) Litvintseva et al. (2005) Lengeler et al. (2000) Lengeler et al. (2000) Lengeler et al. (2000) Litvintseva et al. (2005) Litvintseva et al. (2006) Litvintseva et al. (2006) Litvintseva et al. (2006) Litvintseva et al. (2006) Litvintseva et al. (2005) Litvintseva et al. (2005) Nielsen et al. (2003) Litvintseva et al. (2006) Jain et al. (2005) Litvintseva et al. (2006) Litvintseva et al. (2003) Litvintseva et al. (2006) MATa MATa MATa MATa Heitman Lab Heitman Lab This study This study leu2D0 lys2D0 ura3D0 leu2D0 lys2D0 ura3D0 arr3::KanMX leu2D0 lys2D0 ura3D0 arr3::KanMX CnARR3:URA3 leu2D0 lys2D0 ura3D0 arr3::KanMX ScARR3:URA3 constant agitation. To determine the maximum noninhibitory concentration of arsenic salts, 3 ll of each dilution series were spotted onto solid YPD media with increasing concentrations of NaAsO2 (Sigma-Aldrich, St Louis, MO). Plates were incubated for 3 days at 30 °C. Experimental evolution assays were performed by sequentially plating strains on media containing increasing concentrations of arsenite. Strain M8A (;14 copies of ARR3 amplicon) was initially plated on media containing 10 mM arsenite. Colonies that arose after 3 days of incubation at 30 °C were subcultured onto media containing 15 mM arsenite and further incubated. This process was repeated in steps of 5 mM arsenite until 35 mM arsenite where no further growth was observed. Primers and Sequencing Primers used in this study are listed in supplementary table S1 (Supplementary Material online). Primers were designed using Oligo 6.8 (Molecular Biology Insights, West Casade, CO) and synthesized by Life Technologies (Carlsbad, CA). Polymerase chain reaction (PCR) products were purified using a QIAquick gel extraction kit (Qiagen GmbH, Hilden, DE) 1989 MBE Chow et al. · doi:10.1093/molbev/mss066 Table 2. Plasmids Used in This Study. Plasmid pCR2.1 TOPO YEplac195 pEWC5 pEWC17 pEWC18 pEWC19 pEWC20 pEWC21 pEWC22 Description Escherichia coli cloning vector, Ampicillin-resistant Saccharomyces cerevisiae–E. coli shuttle vector, Ampicillin-resistant, URA3 7.46-kb PCR fragment containing Cryptococcus ARR3 cloned in pCR2.1 TOPO 3.1-kb PCR fragment containing Cryptococcus ARR3 CDS cloned in pCR2.1 TOPO 2.16-kb PCR fragment containing Saccharomyces ARR3, ARR2, and a fragment of ARR1 cloned in pCR2.1 TOPO - kb SacI fragment containing Cryptococcus ARR3 ligated into SacI-digested phosphatased pPZP-NATcc vector 2.27-kb PCR fragment containing Cryptococcus ARR3 CDS flanked by a fragment of Saccharomyces ARR2 and ARR1 cloned in pCR2.1 TOPO 2.37-kb XbaI–SacI fragment from pEWC20 ligated into the XbaI–SacI cut site of YEplac195 2.26-kb XbaI–SacI fragment from pEWC18 ligated into the XbaI–SacI cut site of YEplac195 and sequenced at the Australian Genome Research Facility (Brisbane, AU). To identify strains bearing the ARR3 gene amplification, a diagnostic PCR test was developed using primers UQ426 and UQ427. Primers UQ436 and UQ437 were subsequently used to amplify the entire amplicon region for sequencing. Amplification of 12 MLST loci (IGS1, CAP59, CAP10, MP88, GPD1, TOP1, SOD1, URE1, PLB1, MPD1, TEF1, and LAC1) was performed using standard oligonucleotides (Litvintseva et al. 2006), except for MP88 and TOP1, which were redesigned to avoid the need for step down PCR. PCR products were ethanol precipitated and sequenced. Molecular Techniques Cryptococcus neoformans genomic DNA was prepared using the CTAB method (Pitkin et al. 1996). Total RNA was prepared from C. neoformans strains grown in 100 ml YPD liquid media for 16 h at 30 °C with constant agitation. cDNA was generated using the BioLine cDNA Synthesis Kit (BioLine, London UK) according to the manufacturers’ protocols. Southern hybridizations were performed using Modified Church buffer (Church and Gilbert 1984). Probes were produced by PCR amplification from H99 genomic DNA using primers given in supplementary table S1 (Supplementary Material online) and purified using the QIAquick gel extraction kit (Qiagen, Valencia, CA). Radiolabeled probes were prepared using the GE Healthcare Rediprime II Random Prime Labelling System (GE Healthcare) with 20 lCi a-32P dCTP (Perkin Elmer, Waltham, MA). Hybridizations were performed overnight at 65 °C. Probes were detected by exposing the blots to Fujifilm Super RX medical X-ray film (Fujifilm, Tokyo, JA) at 80 °C and developed using a Konica Minolta SRX-101A Film Processor. Pulsed-Field Gel Electrophoresis Intact C. neoformans chromosomal DNA in agarose plugs was prepared according to the method of Lengeler et al. (2000). Pulsed-field gel electrophoresis (PFGE) plugs were digested with restriction enzymes and electrophoresed in 1% pulsed-field certified agarose (Bio-Rad) using 0.5 Tris-borate-EDTA running buffer using the CHEF DRIII PFGE system (Bio-Rad, Richmond, CA) as described (Lengeler et al. 2000) with minor modifications. Running conditions are as follows: ramped switch time from 1.5 to 10 s, 120°, 6 V/cm, 24 h, performed at 14 °C. Gels were stained with ethidium bromide and visualized with a UV transillu1990 Source Invitrogen This This This This study study study study This study This study This study minator. Southern blotting of pulsed-field gels was performed as previously described (Marra et al. 2004) onto Hybond-XL nylon membranes (GE Healthcare, Chalfont St Giles, UK). Blots were UV crosslinked with 100 mJ UV using a Stratagene UV Stratlinker 2400. Deletion and Complementation of the ARR3 Gene The ARR3 (CNAG_06025.2) deletion in strain H99 was created using overlap PCR and biolistic transformation as described (Davidson et al. 2000) using the selectable NEO marker (Fraser et al. 2003). Recombinants were selected on media containing G418 (100 lg/ml) and absence of growth on media containing 1 mM sodium arsenite. Deletion of the gene was confirmed by Southern hybridization. Plasmids used in this study are listed table 2. To create the deletion construct and complementation plasmids, primers UQ528 and UQ531 were used to amplify a 7.46-kb fragment containing the ARR3 gene along with ;1-kb flanking regions, using H99 genomic DNA as a template. The PCR product was cloned into pCR2.1 TOPO vector (Life Technologies) to create pEWC5 and transformed into Escherichia coli MACH1 cells. The ARR3 construct was then subcloned onto pPZP-NAT to create pEWC19 (Idnurm et al. 2004). Complementation of the H99 arr3n strain was performed using biolistic transformation and selected on media. Complementation of the ARR3 gene was verified by Southern hybridization. To verify the functional role of the ARR3 gene in C. neoformans, cross-species complementation was performed using the S. cerevisiae arr3n strain from the yeast deletion library (Winzeler et al. 1999; Giaever et al. 2002). To create the cross-species complementation plasmid, the protein-coding open reading frame (ORF) of the C. neoformans var. grubii ARR3 gene (CnARR3) was amplified from YPD-grown H99 cDNA using primers UQ476 and UQ477 and cloned into pCR2.1 TOPO, yielding the plasmid pEWC17. The S. cerevisiae ARR3 gene (ScARR3) and flanking regions were amplified from S. cerevisiae strain BY4739 using primers UQ809 and UQ812 and cloned into pCR2.1 TOPO to create pEWC18. The CnARR3 ORF and the ScARR3 promoter and terminator were combined by overlap PCR and cloned into pCR2.1 TOPO to create pEWC20 and subcloned into YEplac195 to create pEWC21. This vector was transformed into the S. cerevisiae arr3n strain using the lithium acetate method (Ghosh et al. 1999). The S. cerevisiae arr3n strain was also complemented with the native ScARR3 gene Massive Gene Amplification in C. neoformans · doi:10.1093/molbev/mss066 under the control of its endogenous promoter and terminator by subcloning the gene from pEWC18 into YEplac195 to create pEWC22 and transforming into the S. cerevisiae arr3n. Recombinants from both transformations were isolated on appropriate selective media, and the complementation of the CnARR3 and ScARR3 genes were verified by growth on media containing 1 mM arsenite. Antifungal Susceptibility Testing The antifungals amphotericin B, flucytosine, fluconazole, and intraconazole were obtained from Sigma-Aldrich. Stock solutions were prepared in dimethyl sulfoxide (amphotericin B and itraconazole) or water (flucytosine and fluconazole). Broth microdilutions were performed in accordance with the CLSI M27-A2 guidelines (Clinical and Laboratory Standards Institute 2002) using YNB medium supplemented with ammonium sulfate, a final inoculum concentration of (1.5 ± 1.0) 103 cells/ml and incubation at 35 °C for 72 h (Ghannoum et al. 1992). Concentrations of the antifungal agents were 0.097–50 lg/ml for fluconazole and flucytosine and 0.0156–8 lg/ml for amphotericin B and itraconazole. Minimal inhibitory concentration (MIC)50 is defined as the concentration that produced .50% growth inhibition and MIC90 is defined as the concentration that produced .90% growth inhibition. Assays were performed in triplicate. Quality control was performed by testing the CLSI-recommended strains ATCC6258 (Candida krusei), ATCC90028 (Candida albicans), and ATCC22019 (Candida parapsilosis). Murine Virulence Assays and Organ Burden Pathogenicity studies were conducted in 6- to 7-week-old BALB/c mice (Animal Resource Centre, Canning Vale, Australia). Infection was performed under methoxyflurane anesthesia. Groups of 11 mice were inoculated intranasally with 1 105 yeast cells in 50 ll phosphate-buffered saline. All mice were weighed and monitored daily for physical signs of sickness or distress and humanely euthanized by CO2 asphyxiation when their body weight had reduced by 20% from their preinfection weight or upon signs of debilitating effects. Kaplan–Meier survival curves were plotted using GraphPad Prism 5.0 and significance was determined using a log-rank test. Values of P , 0.05 were considered statistically significant. Murine virulence assays were carried out in strict accordance with the recommendations in the Australian Code of Practice for the Care and Use of Animals for Scientific Purposes by the National Health and Medical Research Council. The protocol was approved by the Molecular Biosciences Animal Ethics Committee of The University of Queensland (AEC approval number: SCMB/473/09/UQ/NHMRC). MBE lomeric regions are gene poor compared to the rest of the genome (Brown et al. 2010). We performed a statistical analysis of gene density by comparing the number of genes within a window with that of a true uniform gene distribution. Comparison across all chromosome arms of C. neoformans var. grubii strain H99 (VNI), C. neoformans var. neoformans strain JEC21 (VNIV), and C. gattii strain WM276 (VGI) revealed no changes in gene density sufficient to define a subtelomeric boundary cutoff (all KS P values .0.35). We also analyzed intergenic distances between genes as a measure of gene density, moving progressively along the pooled set of chromosome arms toward the centromere to look for a change point in gene density. This analysis also did not show a significantly lower gene density in the subtelomeres compared with the rest of the genomes (results not shown). This suggests that unlike S. cerevisiae, the subtelomeres of Cryptococcus possess a gene density similar to the rest of the genome and that with the exception of the centromeres, gene density is uniform throughout the Cryptococcus genome. As gene density did not reveal any unusual characteristics of the chromosome ends, we decided to investigate the genomic composition of this part of the genome to see if they are conserved between var. grubii strain H99 and var. neoformans strain JEC21. Synteny analysis employing dot plot matrices revealed that while the interstitial chromosomal regions of JEC21 appeared syntenic with H99, the final 2- to 36-kb regions of each chromosome end were commonly rearranged or included sequences that were present only in one or the other strain. To support this surrogate measure of the region we would define as the subtelomere, we elected to investigate the types of gene common to this part of the genome. In other organisms, subtelomeres are enriched in genes involved in niche adaptation (Adams et al. 1992; De Bruin et al. 1994; De Las Penas et al. 2003). Analysis of these regions in C. neoformans using BLAST revealed an unusual number of hexose transporter genes. Whole-genome analysis revealed 34 genes predicted to encode hexose transporters, 12 of which were scattered within the regions we were investigating. We therefore compared the number of hexose transporter genes in the region near the telomeres with those in the rest of the genome to see if there was any statistical evidence of difference in proportion. Based on a range of possible subtelomeric lengths, we found that the lowest P value was at the cutoff of 35 kb from the chromosome end (Fisher’s exact test, P value 5 9.1 107), indicating that hexose transporter encoding genes are enriched in subtelomeric regions (fig. 1 and supplementary table S2, Supplementary Material online). A Focus on the Right Subtelomere of Chromosome 3 Results A Measure of Subtelomere Length in Cryptococcus To help us define the regions we would consider subtelomeric in C. neoformans, we first employed an analytical method based on the observation that S. cerevisiae subte- Although our study of gene density had not revealed anything unique about the C. neoformans chromosome ends, our studies of synteny and hexose transporter gene distribution indicated the final ;40 kb exhibit unique characteristics: a higher rate of rearrangement and an enrichment of certain gene types. We therefore sought to expand this 1991 Chow et al. · doi:10.1093/molbev/mss066 FIG. 1. The chromosome ends of Cryptococcus neoformans are enriched for hexose transporter genes. By comparing the proportion of hexose transporter genes at the chromosome ends to total number of genes in the whole genome, in a 5 kb window and increasing that window size in steps of 5 kb toward the centromere, up to 110 kb, the subtelomere in C. neoformans strain H99 is ;35 kb (dotted line, P value 5 9.1 107). The Fisher’s exact test P value, between the proportions of hexose transporter genes to those of the other genes, is compared with the distance from the telomere. characterization by focusing on one subtelomere across a range of isolates. Our synteny analysis had shown that the right subtelomere of chromosome 3 of H99 shared the least synteny with the corresponding subtelomere in JEC21, making this a good candidate for further characterization (fig. 2). To more closely scrutinize the gene content of the chromosome 3 right subtelomere in C. neoformans var. grubii H99, the final 40 kb of genome sequence from the end of the chromosome was analyzed bioinformatically MBE using BLAST and ClustalW and compared with the homologous telomere from strains JEC21 (VNIV), B3501A (VNIV) and C. gattii strain R265 (VGII), and WM276 (VGI) (Loftus et al. 2005; D’Souza et al. 2011). Although the subtelomeric regions of the three molecular types analyzed were generally syntenic, there were a number of unique sequences (fig. 2). For instance, JEC21 possesses a putative polymerase II transcription elongation factor–encoding gene (CNK00050), which is absent in the other strains, suggesting gene gain via gene duplication in var. neoformans. An interesting finding made in all five genomes is that the region appears to be rich in carbohydrate metabolism related genes; of the 20 genes in the 40-kb subtelomeric region analyzed in chromosome 3 of strain H99, 8 are predicted to play a role in carbohydrate metabolism (fig. 2), whereas the other strains have between 4 and 10 of these genes. The Chromosome 3 End of C. neoformans var. grubii Shows Variation in Length among Isolates A detailed Southern hybridization mapping of the chromosome 3 right subtelomere was performed in a collection of C. neoformans var. grubii using a combination of three restriction enzymes and multiple gene-specific probes. Subtelomeric regions were liberated via in-gel restriction digest using a SfiI site 64 kb from the H99 chromosome end and resolved using PFGE; Southern blotting and subsequent hybridization with a gene-specific probe telomere proximal to the SfiI cut site revealed a significant increase in distance to the chromosomal end in 5 of 22 tested isolates compared with laboratory reference strain H99 (fig. 3). Size differences become less apparent when a more telomere proximal ApaI restriction site (31 kb) was employed in the analysis, FIG. 2. The right telomere of chromosome 3 of var. grubii differs markedly from homologous regions in the different molecular types. Schematic diagram showing the genome content of the right telomere of chromosome 3 in Cryptococcus neoformans var. grubii strain H99 and the corresponding regions in C. neoformans var. neoformans strains JEC21 (Chr 11) and B3501A (Chr 11) and C. gattii strains WM276 (Chr 11) and R265 (Chr 11). The yellow arrows indicate genes predicted to be involved in carbohydrate metabolism. 1992 Massive Gene Amplification in C. neoformans · doi:10.1093/molbev/mss066 MBE FIG. 3. The right telomere of chromosome 3 differs markedly between strains. (A) Schematic diagram showing the cut sites of the three restriction enzymes SfiI, ApaI, and SpeI and the gene-specific probes employed in the Southern blot hybridization. (B) Exposure of the three blots. The SfiI-digested blot was hybridized to the xylulose/fructose phosphoketolase–specific probe (PPK1); the ApaI-digested blot was hybridized to the Ras GTPase activator–specific probe (IRA2); and the SpeI-digested blot was hybridized to the hexose transporter–specific probe (HXT5). The presence of a band in each lane indicates the presence of the gene in each isolate. The size of the band in reference to the band size of Cryptococcus neoformans var. grubii VNI laboratory reference strain H99 indicates the occurrence of subtelomeric length variation. with variation between strains reduced even further when a SpeI site only 18 kb from the end of the chromosome—in this case, almost all strains yielded an equivalent banding pattern. These results therefore suggest that the most significant variation in the right end of chromosome 3 occurs not at the very end but between the SfiI (64 kb) and the ApaI (31 kb) cut sites. A Subtelomeric Gene Amplification Event on Chromosome 3 In order to determine the nature of the observed interstrain variability of chromosome 3, we designed probes specific to the nine genes in the SfiI/ApaI region and sequentially probed genomic digest Southern of these tested strains. Although most probes gave equivalent hybridization intensity in all 22 strains, a probe targeting CNAG_06925.2 displayed markedly stronger hybridization intensity for strains M8A and G0, suggesting an increase in copy number of the probe DNA in these genomes. To test this hypothesis, a Southern blot of genomic DNA from M8A as well as VNI reference strain H99 was performed using a variety of restriction enzymes that cut in a 10-kb region around the probe sequence (fig. 4). Rather than observing the fragment sizes observed in H99, each M8A lane contained either a high molecular weight band of undetermined size or a common ;3.2-kb band when using restric- tion enzymes in close proximity to CNAG_06925.2. These results are consistent with strain M8A bearing tandemly arrayed copies of the region encompassing CNAG_06925.2 (fig. 4). Based on the predicted existence of a tandem repeat array, diagnostic CNAG_06925.2 primers were designed facing away from each other, enabling an amplification product only from strains that carry a tandem array of this gene (fig. 5A). Subsequent PCR revealed that no product was obtained from H99, whereas M8A yielded a product consistent with the presence of a tandem array (fig. 5B). The same amplicon was also observed when DNA from the other strongly hybridizing strain (G0) was used as the PCR template. Sequencing enabled the repeat boundary to be determined, revealing the repeat unit to be precisely 3,177 bp in length, comprising the entire CNAG_06925.2 gene, predicted to be a homolog of S. cerevisiae ARR3 gene, plus a short fragment of the adjacent gene (CNAG_06926.2), predicted to encode a Major Facilitator Superfamily sugar transporter. Isolates with the CNAG_06925.2 Gene Amplification Share a Common Ancestry During our survey of the right telomere of chromosome 3 in 22 isolates, we identified two strains bearing the CNAG_06925.2 amplicon repeat. To determine if this amplification event is common in var. grubii, a collection of 70 1993 Chow et al. · doi:10.1093/molbev/mss066 FIG. 4. The ARR3 gene is part of a tandemly amplified array. (A) Schematic diagram showing the cut sites of the nine restriction enzymes EcoRI, SnaBI, PshAI, BglII, StuI, SacI, SphI, NdeI, and XhoI and the location of the ARR3 gene-specific probe (gray box) used. The strains cut in a ;10-kb region around the probe sequence. (B) Exposure of the two blots. Genomic DNA of Cryptococcus neoformans var. grubii VNI laboratory reference strain H99 and clinical isolate M8A were digested. strains comprising 12 environmental isolates and 58 clinical isolates encompassing all known MLST subclades were tested with our diagnostic PCR for the presence of the tandem array (supplementary fig. S1, Supplementary Material online). Of those examined, seven isolates tested positive for the presence of the amplification. Importantly, all seven isolates shared identical MLST alleles for the 12 loci tested, revealing they belonged to the previously described VNI subclade A5 (ST5) (Litvintseva et al. 2006). The remaining 63 amplicon-negative strains all belonged to other subclades. To determine the approximate amplicon copy number in the positive isolates, whole chromosomal preparations of these isolates were digested in-gel with XhoI, a restriction enzyme that cuts outside the amplicon boundaries and Southern blotted (fig. 5C). As the size of the ARR3 amplicon unit is known, the number of tandem repeat units in the isolates could be determined based on band size. Copy number was found to vary from 3 to 19 copies, suggesting that strains bearing the amplification may undergo expansion and contraction of the repeat (fig. 5C). CNAG_06925.2 Encodes an Arsenite Efflux Transporter Whose Copy Number Correlates with Arsenical Resistance BLASTx analysis suggested that the CNAG_06925.2 ORF encodes an arsenite efflux transporter homologous to S. cerevisiae Arr3p (Wysocki et al. 1997). To explore the function of this gene in C. neoformans, the gene was deleted by homologous recombination in the strain H99 background. Deletion of CNAG_06925.2 resulted in sensitivity to arsenite; subsequent complementation with the wild-type 1994 MBE gene restored arsenical resistance (fig. 6). These phenotypic results are consistent with CNAG_06925.2 encoding an arsenite efflux transporter. To confirm this role, we introduced CNAG_06925.2 into the S. cerevisiae arr3n mutant, which effectively restored arsenite resistance to the mutant (fig. 6). Functional complementation thus confirmed the bioinformatically predicted function of the gene product as an arsenite efflux transporter, and we have therefore named the gene ARR3. Retention of the ARR3 amplicon in all tested strains belonging to clade VNI A5 suggests that it may confer a selective advantage. To determine if increasing copy number of ARR3 correlates with an increased level of arsenical resistance, serial dilution spotting assays were performed on rich media supplemented with increasing concentrations of sodium arsenite (0–20 mM) (fig. 5D). While the reference strain H99 could not grow in the presence of 5 mM arsenite, isolates with 3–5 copies of the repeat unit were generally able to grow in the presence of 5 mM arsenite, isolates with 9–11 copies grew in the presence of 10 mM arsenite and isolates with 13–19 copies grew in the presence of 15 mM arsenite. No strains were able to grow in the presence of 20 mM arsenite. Copy number of the ARR3 amplicon is thus positively correlated with the level of arsenical resistance. To determine if further amplification of the ARR3 repeat was possible, we next performed an experimental evolution assay in an attempt to increase amplicon copy number by repeated selection of colonies displaying increased arsenite resistance. The 14 copy-containing strain M8A, which could grow in concentrations as high as 15 mM arsenite, was subcultured onto media containing increasing concentrations of arsenite in a stepwise fashion. Spontaneous colonies with high resistance were easily identified as the strain was exposed to progressively higher concentrations of arsenite. Using this approach, we isolated derivatives of strain M8A that could withstand up to 30 mM arsenite. Southern blot analysis of the subcultured strains revealed a massive amplification of the ARR3 amplicon in the experimentally evolved strains (supplementary fig. S2, Supplementary Material online). The Presence of ARR3 in High Copy Number Does Not Contribute to the Success of Infection of an Animal Host It has been postulated that the virulence factors C. neoformans employs during host infection have been co-opted from features that promote survival in the environmental niche (Buchanan and Murphy 1998). To determine if ARR3 has been co-opted to play a role during infection, virulence of the arr3n mutant was assessed using the murine inhalation model of cryptococcosis. One hundred percent of mortality was reached in 21 days in BALB/c mice infected with the arr3n strain compared with 23 days for control strain H99 and 19 days for arr3DþARR3 the complemented strain (fig. 7A). The survival rate of mice infected with the arr3n strain was not significantly different to the survival rates of the mice infected with the wild-type or Massive Gene Amplification in C. neoformans · doi:10.1093/molbev/mss066 MBE complemented strains, suggesting that Arr3 does not play a significant role during infection. To ascertain if the amplification of ARR3 could contribute to virulence, two experimentally evolved strains M8A13 (30 copies of the amplicon) and M8A-11 (80 copies) were compared with the original strain M8A in a second virulence assay. Mice infected with the control strain M8A succumbed within 21 days, whereas mice infected with the evolved strains of M8A-13 and M8A-11 did not survive past 22 days and 24 days, respectively, and were not significantly different (fig. 7B). Strains containing increased copy number of the ARR3 amplicon were thus unaffected for virulence (fig. 7A and B). The lack of a detectable role in virulence for Arr3 does not preclude this protein from having a function during infection; as an efflux pump, Arr3 could potentially have substrate specificity that extends beyond arsenite alone, perhaps into the efflux of antifungal agents. Treatment of cryptococcal meningoencephalitis traditionally involves the antifungal agents amphotericin B, flucytosine, and azoles such as fluconazole. To investigate if the arsenite efflux transporter has been co-opted to play a role in multidrug resistance, we tested the sensitivity of M8A and the experimentally evolved strains (M8A-11 and M8A-13), and the H99 arr3n deletion strain to the antifungal drugs fluconazole, itraconazole, amphotericin B, and 5-fluorocytosine using a MIC assay. The MIC50 and MIC90 of the experimentally evolved strains did not differ from the MICs of the wild-type strain (table 3). Similarly, the arr3D mutant displayed identical sensitivity to all antifungals tested. This suggests that massive amplification of the arsenite efflux transporter gene does not contribute to antifungal resistance. Discussion FIG. 5. Presence of ARR3 amplification correlates with increased arsenite resistance Cryptococcus neoformans var. grubii VNI laboratory reference strain H99 was used as a negative control and a strain with ARR3 gene amplification was used as a positive control. (A) Schematic diagram showing the cut sites of restriction enzyme XhoI and the location of the ARR3 amplicon–specific primers. The primers are designed such that a product (1.3 kb in size) is amplified if the amplicon units were tandemly arrayed. The amplicon unit (highlighted by the gray box) is 3,177 bp in size and consists of the promoter and terminal regions of the ARR3 gene and a fragment of the adjacent predicted MFS sugar phosphate permease gene (SEO1). (B) Strains M8A, G0, and NIH 38 showed a 1.3 kb amplified product, indicating that the strains contained that ARR3 amplification. (C) Amplicon copy number was determined based on the size of the Xho1 cleavage band. The number of tandem repeats in the isolates was found to be unstable; copy number varied from 3 to 19 copies. (D) Arsenical resistance spot assay results showed a general correlation between copy number and arsenical resistance. Strains were grown on YPD media. In S. cerevisiae and Homo sapiens, subtelomeres are regions near the telomeres that are repeat rich and gene poor (Pryde et al. 1997). These regions have been extensively studied in S. cerevisiae in particular, where despite being gene depleted, there is an enrichment of genes with known functions in transport and fermentation (Winzeler et al. 2003; Brown et al. 2010). A recent study estimated that these regions are approximately 33 kb in length in the Saccharomycetales (Brown et al. 2010); we sought to determine the subtelomeric length in C. neoformans using the same gene density-based approach. We found that unlike S. cerevisiae, the subtelomeres in Cryptococcus appear to possess a gene density that is equivalent to the rest of the genome. The subtelomeres are therefore yet another genomic feature of C. neoformans that differ in their fundamental structure compared with S. cerevisiae. For instance, the centromeres of S. cerevisiae all possess a defined consensus sequence, whereas the centromeres of C. neoformans are large transposon-dense structures (Hegemann and Fleig 1993; Fleig et al. 1995; Loftus et al. 2005). In S. cerevisiae, MAT is small and has a cassette-based switching system, whereas in C. neoformans, MAT is large and does not change (Fraser et al. 2003; Fraser, Giles, et al. 2005; Morrow and 1995 Chow et al. · doi:10.1093/molbev/mss066 FIG. 6. Arr3 function is conserved across multiple fungal phyla. Deletion of the ARR3 resulted in arsenite sensitivity in the mutant and complementation with the wild-type gene restored arsenite resistance. Complementation of the Saccharomyces cerevisiae arr3n mutant with the protein-coding ORF of the Cryptococcus ARR3 gene restored arsenite resistance strongly indicates that the Cryptococcus ARR3 gene is an ortholog of the ARR3 gene in S. cerevisiae. Serial dilution spotting assay of the strains were grown on YPD media. Fraser 2009). The subtelomeres now join this list as being distinct from yeast, instead sharing more similarity to species like Magnaporthe oryzae, where the gene density within the subtelomeric regions is also similar to that in the rest of the genome (Farman and Leong 1995; Mizuno et al. 2006; Farman 2007). Our synteny analyses revealed that the ;40-kb regions near the ends of the chromosomes of Cryptococcus nevertheless still possess unique characteristics, exhibiting a higher rate of rearrangement and an enrichment of certain gene types. These observations are consistent with the known potential for rapid evolution of genes that reside in the subtelomeric regions that has been particularly well exploited by a number of fungi and protists, many of which have ac- MBE cumulated families of genes involved in niche adaptation (Adams et al. 1992; De Bruin et al. 1994; De Las Penas et al. 2003). Such genes tend to be nonessential but provide the capability to respond to specific ecological challenges. We found that the subtelomeric regions in C. neoformans are enriched in hexose transporters in a similar fashion to S. cerevisiae (Winzeler et al. 2003). We therefore defined the subtelomeric regions in C. neoformans as ;40 kb, based on our synteny analysis and the enrichment of hexose transporter genes in these regions. Studies by Xue et al. (2010) previously described a myo-inositol transporter gene family, with 5 of 11 members lying within 30 kb of the chromosome end. Since C. neoformans can utilize inositol as a sole carbon source and since inositol has been demonstrated to be involved in virulence in multiple fungal species, it is likely that this enrichment arose as an adaptive mechanism in response to the environment (Chen et al. 2008; Bethea et al. 2010; Xue et al. 2010; Wang et al. 2011). We postulate that the enrichment of hexose transporter genes is likely to also play a role in virulence, as acquisition of carbon sources is of critical importance during infection (Perfect 2005; Price et al. 2011). To expand on our analysis of C. neoformans subtelomeres, we focused on one subtelomere in particular, the right subtelomere of chromosome 3, as this region displayed the least synteny between var. grubii and var. neoformans in our synteny analysis. This focus enabled us to identify the presence of a tandemly arrayed amplification involving the ARR3 gene. Studies in a range of both eukaryotes and prokaryotes suggest that tandem gene arrays originate from an initial gene duplication event due to a double-stranded DNA break (Romero and Palacios 1997; Haber and Debatisse 2006; Hastings 2007), representing a possible mechanism for the origin of the ARR3 amplification. The initial duplication may have arisen from a double-stranded break from the stalling or collapse of a replication fork followed by repair through breakinduced replication (Bosco and Haber 1998; Haber et al. 2004; Watanabe and Horiuchi 2005; McEachern and Haber 2006). In such a scenario, the duplication would be in direct FIG. 7. The deletion or amplification of ARR3 does not affect virulence in a murine model. (A) Deletion of ARR3 gene does not affect virulence in a mouse model (log-rank [Mantel–Cox] test P value 5 0.85). H99 median survival 5 16 days; arr3D median survival 5 17 days; and arr3D þ ARR3 median survival 5 16 days. (B) Amplification of the ARR3 gene does not confer an obvious advantage during infection (log-rank [MantelCox] test P value 5 0.47). M8A (14 copies) median survival 5 19 days; M8A-11 (80 copies) median survival 5 19.5 days; and M8A-13 (30 copies) median survival 5 18 days. 1996 Massive Gene Amplification in C. neoformans · doi:10.1093/molbev/mss066 Table 3. Minimal Inhibitory Concentrations of Cryptococcus neoformans Strains Tested. MIC (mg/ml) reading at 72 h AmB Strain name MIC90 MIC50 H99 1 0.25 narr3 1 0.25 narr3 1 CnARR3 1 0.25 M8A 1 0.25 M8A-11 1 0.25 M8A-13 1 0.25 5-FC Flu Itr MIC90 MIC50 MIC90 MIC50 MIC90 MIC50 0.25 0.125 6.25 3.125 1 0.5 0.25 0.125 6.25 3.125 1 0.5 0.25 0.125 6.25 0.125 <0.125 12.5 0.125 <0.125 12.5 0.125 <0.125 12.5 3.125 3.125 3.125 3.125 1 2 2 2 0.5 0.5 0.5 0.5 NOTE.—AmB, amphotericin B; 5-FC, 5-fluorocytosine; Flu, Fluconazole; and Itr, Itraconazole. orientation, consistent with the observed rearrangement of the ARR3 amplicon. Break-induced replication occurs regularly near telomeres of S. cerevisiae and can result in gene amplifications from mismatched templates as replication proceeds to the chromosome end (Haber et al. 2004; Koszul et al. 2004; Reams and Neidle 2004; Schildkraut et al. 2005). After the initial duplication event, subsequent expansion can occur by unequal crossing-over between sister chromatids, rolling-circle replication, double rolling-circle replication, or extrachromosomal amplification and reintegration (Watanabe and Horiuchi 2005). Tandem genomic duplications make further rearrangements, such as those we have observed conferring arsenic hypertolerance, particularly facile through ectopic recombination, as the region is prone to misalignment. Alternatively, unequal crossing-over yielding a change in DNA copy number could occur as a result of replication initiation at an origin of replication and subsequent replication fork breakage, which has been demonstrated during ribosomal DNA amplification (Ganley et al. 2009). While it is not known if the ARR3 gene is located near a replication origin, it is likely subsequent amplification events involving ARR3 in C. neoformans occurred as a consequence of unequal sister chromatid exchange. Our genetic studies confirmed ARR3 encodes an arsenite efflux transporter that appears similar in function to its ortholog in S. cerevisiae. As arsenic is a ubiquitous contaminant of soil and groundwater that is regularly encountered by many organisms, we postulate that the founder strain of subclade A5 may have originated from an environment with high levels of arsenite contamination. High levels of arsenite are genotoxic and cytotoxic, adversely affecting cell signaling and proliferation, and impairing cellular respiration and ATP production (Hosiner et al. 2009). Accordingly, arsenical detoxification mechanisms are ubiquitous in nature, and resistance has been characterized in many organisms ranging from bacterial to mammalian cells (Dabbs and Sole 1988; Broer et al. 1993; Rosen 1995; Wysocki et al. 1997). In S. cerevisiae, three independent pathways for arsenical detoxification have been identified. The first and best studied of these involves the arsenical resistance genes ARR1, ARR2, and ARR3, which are found in a cluster 7.5 kb from the end of chromosome XVI (Rosen 1995, 2002; Wysocki MBE et al. 1997; Mukhopadhyay and Rosen 1998). The transcription factor Arr1p regulates production of arsenate reductase Arr2p required for reducing arsenate As(V) to arsenite As(III) and the plasma membrane arsenite efflux transporter Arr3p. The second pathway involves the sequestration of arsenite in vacuoles as glutathione conjugates via the ATP-binding cassette transporter Ycf1p (Wysocki et al. 1997; Ghosh et al. 1999). In the third mechanism, the extrusion of arsenite is mediated by the bidirectional aquaglyceroporin Fps1p, which is also involved in arsenite uptake (Wysocki et al. 2001; Maciaszczyk-Dziubinska et al. 2010). Tolerance assays showed a positive correlation between copy number and level of arsenical resistance in C. neoformans. This mechanism is analogous to increased copper and cadmium resistance observed in S. cerevisiae when the metallothionein gene CUP1 is amplified (Wysocki et al. 1997; Ghosh et al. 1999). To our knowledge, this is the first example of a naturally occurring gene amplification leading to high-level arsenite tolerance in a fungus. Although triplication of S. cerevisiae ARR3 has been reported in sake yeast, this involves a large region spanning nine genes, with selection for the aquaporin-encoding gene AQY1 proposed to be the likely reason for the amplification (Ogihara et al. 2008). In this instance, the AQY1-ARR3 region in chromosome XVI was translocated onto chromosomes IV and XIII in a nonreciprocal manner. The presence of ARR3 in the amplicon region is probably coincidental as S. cerevisiae is unlikely to encounter arsenite during sake brewing. Of all fungi, why would C. neoformans in particular carry an amplification of an arsenite transporter gene? Insight again comes from elegant studies in S. cerevisiae that have revealed that arsenite uptake can be mediated via hexose transporters (Liu et al. 2004). Interestingly, the genes of both arsenite and hexose transporters are associated with the subtelomeric region in C. neoformans as well as in the sugar-loving S. cerevisiae. Aside from the metabolic requirements for hexose sugars, the polysaccharide capsule of C. neoformans is composed primarily of the complex sugars glucuronoxylomannan and galactoxylomannan, which contain high concentrations of the hexose sugars xylose, mannose, and galactose, plus glucuronic acid, which is derived from glucose. The hexose transporter genes likely represent a key gene family that is important in acquiring resources for capsule biosynthesis and expansion could therefore potentially be related to synthesis of the capsule, as disruption to hexose transporter activity reduces capsule formation (Chikamori and Fukushima 2005). Increased hexose transporter activity characteristic of the sugar-coated killer C. neoformans could unintentionally lead to increased arsenite uptake in contaminated niches; an isolated gene duplication event involving a subtelomeric arsenite efflux gene may have therefore been rapidly selected for and subsequently amplified in response to environmental arsenite levels. In support of this model, we demonstrated that further ARR3 amplification could be selected for through passage on increasing concentrations of arsenite. Expansion of amplicon units did not adversely affect normal growth but mediated the degree of arsenite 1997 Chow et al. · doi:10.1093/molbev/mss066 resistance when the cells were placed under this particular selective pressure. Finally, although the ARR3 gene in C. neoformans does not appear to play a role in virulence or pathogenesis, the occurrence of the amplification, as well as the enrichment of hexose transporter genes in the subtelomeric regions, is evidence that the subtelomeric regions of this fungus are likely important in niche adaptation. Future studies may therefore reveal further gene duplication or amplification events taking place in the subtelomeres of C. neoformans during infection of the human host, providing further insight into mechanisms of virulence in this dangerous pathogen. Supplementary Material Supplementary tables S1 and S2 and figures S1 and S2 are available at Molecular Biology and Evolution online (http:// www.mbe.oxfordjournals.org/). Acknowledgments We would like to thank Drs Joseph Heitman and Anastasia P. Litvintseva (Duke University Medical Centre) for kindly providing strains used in the study. We thank Dr Austen R. D. Ganley (Institute of Natural Sciences, Massey University, New Zealand) for helpful comments on the manuscript. This study was funded by the National Health and Medical Research Council of Australia (Project Grant 455980 and CDA 569673). References Adams J, Puskas-Rozsa S, Simlar J, Wilke CM. 1992. Adaptation and major chromosomal changes in populations of Saccharomyces cerevisiae. Curr Genet. 22:13–19. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. 1990. Basic local alignment search tool. J Mol Biol. 215:403–410. Bergthorsson U, Andersson DI, Roth JR. 2007. Ohno’s dilemma: evolution of new genes under continuous selection. Proc Natl Acad Sci U S A. 104:17004–17009. Bethea EK, Carver BJ, Montedonico AE, Reynolds TB. 2010. The inositol regulon controls viability in Candida glabrata. Microbiology 156:452–462. Blasi E, Brozzetti A, Francisci D, Neglia R, Cardinali G, Bistoni F, Vidotto V, Baldelli F. 2001. Evidence of microevolution in a clinical case of recurrent Cryptococcus neoformans meningoencephalitis. Eur J Clin Microbiol Infect Dis. 20:535–543. Boekhout T, van Belkum A. 1997. Variability of karyotypes and RAPD types in genetically related strains of Cryptococcus neoformans. Curr Genet. 32:203–208. Bosco G, Haber JE. 1998. Chromosome break-induced DNA replication leads to nonreciprocal translocations and telomere capture. Genetics 150:1037–1047. Broer S, Ji G, Broer A, Silver S. 1993. Arsenic efflux governed by the arsenic resistance determinant of Staphylococcus aureus plasmid pI258. J Bacteriol. 175:3480–3485. Brown CA, Murray AW, Verstrepen KJ. 2010. Rapid expansion and functional divergence of subtelomeric gene families in yeasts. Curr Biol. 20:895–903. Brown CJ, Todd KM, Rosenzweig RF. 1998. Multiple duplications of yeast hexose transport genes in response to selection in a glucose-limited environment. Mol Biol Evol. 15:931–942. Buchanan KL, Murphy JW. 1998. What makes Cryptococcus neoformans a pathogen? Emerg Infect Dis. 4:71–83. 1998 MBE Casadevall A. 1998. Ecology of Cryptococcus neoformans. In: Casadevall A, Perfect JR, editors. Cryptococcus neoformans. Washington (DC): ASM Press. p. 41–70. Chen YL, Kauffman S, Reynolds TB. 2008. Candida albicans uses multiple mechanisms to acquire the essential metabolite inositol during infection. Infect Immun. 76:2793–2801. Chikamori M, Fukushima K. 2005. A new hexose transporter from Cryptococcus neoformans: molecular cloning and structural and functional characterization. Fungal Genet Biol. 42:646–655. Church GM, Gilbert W. 1984. Genomic sequencing. Proc Natl Acad Sci U S A. 81:1991–1995. Clinical and Laboratory Standards Institute. 2002. Reference method for broth dilution antifungal susceptibility testing of yeasts; approved Standard-second edition. Wayne (PA): Clinical and Laboratory Standards Institute. Dabbs ER, Sole GJ. 1988. Plasmid-borne resistance to arsenate, arsenite, cadmium, and chloramphenicol in a Rhodococcus species. Mol Gen Genet. 211:148–154. Davidson RC, Cruz MC, Sia RA, Allen B, Alspaugh JA, Heitman J. 2000. Gene disruption by biolistic transformation in serotype D strains of Cryptococcus neoformans. Fungal Genet Biol. 29:38–48. De Bruin D, Lanzer M, Ravetch JV. 1994. The polymorphic subtelomeric regions of Plasmodium falciparum chromosomes contain arrays of repetitive sequence elements. Proc Natl Acad Sci U S A. 91:619–623. De Las Penas A, Pan SJ, Castano I, Alder J, Cregg R, Cormack BP. 2003. Virulence-related surface glycoproteins in the yeast pathogen Candida glabrata are encoded in subtelomeric clusters and subject to RAP1- and SIR-dependent transcriptional silencing. Genes Dev. 17:2245–2258. Desnos-Ollivier M, Patel S, Spaulding AR, Charlier C, GarciaHermoso D, Nielsen K, Dromer F. 2010. Mixed infections and in vivo evolution in the human fungal pathogen Cryptococcus neoformans. MBio 1(1):e00091–10. D’Souza CA, Kronstad JW, Taylor G, et al. (38 co-authors). 2011. Genome variation in Cryptococcus gattii, an emerging pathogen of immunocompetent hosts. MBio 2(1):e00342–10. Fan C, Zhang Y, Yu Y, Rounsley S, Long M, Wing RA. 2008. The subtelomere of Oryza sativa chromosome 3 short arm as a hot bed of new gene origination in rice. Mol Plant. 1:839–850. Farman ML. 2007. Telomeres in the rice blast fungus Magnaporthe oryzae: the world of the end as we know it. FEMS Microbiol Lett. 273:125–132. Farman ML, Leong SA. 1995. Genetic and physical mapping of telomeres in the rice blast fungus, Magnaporthe grisea. Genetics 140:479–492. Fischer G, James SA, Roberts IN, Oliver SG, Louis EJ. 2000. Chromosomal evolution in Saccharomyces. Nature 405:451–454. Fleig U, Beinhauer JD, Hegemann JH. 1995. Functional selection for the centromere DNA from yeast chromosome VIII. Nucleic Acids Res. 23:922–924. Flint J, Bates GP, Clark K, Dorman A, Willingham D, Roe BA, Micklem G, Higgs DR, Louis EJ. 1997. Sequence comparison of human and yeast telomeres identifies structurally distinct subtelomeric domains. Hum Mol Genet. 6:1305–1313. Fraser JA, Giles SS, Wenink EC, et al. (8 co-authors). 2005. Same-sex mating and the origin of the Vancouver Island Cryptococcus gattii outbreak. Nature 437:1360–1364. Fraser JA, Huang JC, Pukkila-Worley R, Alspaugh JA, Mitchell TG, Heitman J. 2005. Chromosomal translocation and segmental duplication in Cryptococcus neoformans. Eukaryot Cell. 4:401–406. Fraser JA, Subaran RL, Nichols CB, Heitman J. 2003. Recapitulation of the sexual cycle of the primary fungal pathogen Cryptococcus neoformans var. gattii: implications for an outbreak on Vancouver Island, Canada. Eukaryot Cell. 2:1036–1045. Fries BC, Chen F, Currie BP, Casadevall A. 1996. Karyotype instability in Cryptococcus neoformans infection. J Clin Microbiol. 34:1531–1534. Massive Gene Amplification in C. neoformans · doi:10.1093/molbev/mss066 Ganley AR, Ide S, Saka K, Kobayashi T. 2009. The effect of replication initiation on gene amplification in the rDNA and its relationship to aging. Mol Cell. 35:683–693. Ghannoum MA, Ibrahim AS, Fu Y, Shafiq MC, Edwards JE Jr, Criddle RS. 1992. Susceptibility testing of Cryptococcus neoformans: a microdilution technique. J Clin Microbiol. 30:2881–2886. Ghosh M, Shen J, Rosen BP. 1999. Pathways of As(III) detoxification in Saccharomyces cerevisiae. Proc Natl Acad Sci U S A. 96:5001–5006. Giaever G, Chu AM, Ni L, et al. (70 co-authors). 2002. Functional profiling of the Saccharomyces cerevisiae genome. Nature 418:387–391. Haber JE, Debatisse M. 2006. Gene amplification: yeast takes a turn. Cell 125:1237–1240. Haber JE, Ira G, Malkova A, Sugawara N. 2004. Repairing a doublestrand chromosome break by homologous recombination: revisiting Robin Holliday’s model. Philos Trans R Soc Lond B Biol Sci. 359:79–86. Hastings PJ. 2007. Adaptive amplification. Crit Rev Biochem Mol Biol. 42:271–283. Hegemann JH, Fleig UN. 1993. The centromere of budding yeast. Bioessays 15:451–460. Hosiner D, Lempiainen H, Reiter W, Urban J, Loewith R, Ammerer G, Schweyen R, Shore D, Schuller C. 2009. Arsenic toxicity to Saccharomyces cerevisiae is a consequence of inhibition of the TORC1 kinase combined with a chronic stress response. Mol Biol Cell. 20:1048–1057. Ibeas JI, Jimenez J. 1996. Genomic complexity and chromosomal rearrangements in wine-laboratory yeast hybrids. Curr Genet. 30:410–416. Idnurm A, Reedy JL, Nussbaum JC, Heitman J. 2004. Cryptococcus neoformans virulence gene discovery through insertional mutagenesis. Eukaryot Cell. 3:420–429. Jain N, Guerrero A, Fries BC. 2006. Phenotypic switching and its implications for the pathogenesis of Cryptococcus neoformans. FEMS Yeast Res. 6:480–488. Jain N, Wickes BL, Keller SM, Fu J, Casadevall A, Jain P, Ragan MA, Banerjee U, Fries BC. 2005. Molecular epidemiology of clinical Cryptococcus neoformans strains from India. J Clin Microbiol. 43:5733–5742. Kavanaugh LA, Fraser JA, Dietrich FS. 2006. Recent evolution of the human pathogen Cryptococcus neoformans by intervarietal transfer of a 14-gene fragment. Mol Biol Evol. 23:1879–1890. Killick R, Eckley IA. Forthcoming. changepoint: an R package for changepoint analysis. Killick R, Fearnhead P, Eckley IA. 2011. Optimal detection of changepoints with a linear computational cost. IoP Publishing. Advance Access published January 2011, arXiv 1101.1438v2 Koszul R, Caburet S, Dujon B, Fischer G. 2004. Eucaryotic genome evolution through the spontaneous duplication of large chromosomal segments. EMBO J. 23:234–243. Kuo HF, Olsen KM, Richards EJ. 2006. Natural variation in a subtelomeric region of Arabidopsis: implications for the genomic dynamics of a chromosome end. Genetics 173:401–417. Kwon-Chung JK. 1986. Encapsulation and melanin formation as indicators of virulence in Cryptococcus neoformans. Infect Immun. 51:218–223. Kwon-Chung KJ, Wickes BL, Stockman L, Roberts GD, Ellis D, Howard DH. 1992. Virulence, serotype, and molecular characteristics of environmental strains of Cryptococcus neoformans var. gattii. Infect Immun. 60:1869–1874. Lengeler KB, Wang P, Cox GM, Perfect JR, Heitman J. 2000. Identification of the MATa mating-type locus of Cryptococcus neoformans reveals a serotype A MATa strain thought to have been extinct. Proc Natl Acad Sci U S A. 97:14455–14460. MBE Litvintseva AP, Kestenbaum L, Vilgalys R, Mitchell TG. 2005. Comparative analysis of environmental and clinical populations of Cryptococcus neoformans. J Clin Microbiol. 43:556–564. Litvintseva AP, Marra RE, Nielsen K, Heitman J, Vilgalys R, Mitchell TG. 2003. Evidence of sexual recombination among Cryptococcus neoformans serotype A isolates in sub-Saharan Africa. Eukaryot Cell. 2:1162–1168. Litvintseva AP, Thakur R, Vilgalys R, Mitchell TG. 2006. Multilocus sequence typing reveals three genetic subpopulations of Cryptococcus neoformans var. grubii (serotype A), including a unique population in Botswana. Genetics 172:2223–2238. Liu Z, Boles E, Rosen BP. 2004. Arsenic trioxide uptake by hexose permeases in Saccharomyces cerevisiae. J Biol Chem. 279: 17312–17318. Loftus BJ, Fung E, Roncaglia P, et al. (51 co-authors). 2005. The genome of the basidiomycetous yeast and human pathogen Cryptococcus neoformans. Science 307:1321–1324. Louis EJ. 1995. The chromosome ends of Saccharomyces cerevisiae. Yeast 11:1553–1573. Maciaszczyk-Dziubinska E, Migdal I, Migocka M, Bocer T, Wysocki R. 2010. The yeast aquaglyceroporin Fps1p is a bidirectional arsenite channel. FEBS Lett. 584:726–732. Marra RE, Huang JC, Fung E, Nielsen K, Heitman J, Vilgalys R, Mitchell TG. 2004. A genetic linkage map of Cryptococcus neoformans variety neoformans serotype D (Filobasidiella neoformans). Genetics 167:619–631. McEachern MJ, Haber JE. 2006. Break-induced replication and recombinational telomere elongation in yeast. Annu Rev Biochem. 75:111–135. Mizuno H, Wu J, Kanamori H, Fujisawa M, Namiki N, Saji S, Katagiri S, Katayose Y, Sasaki T, Matsumoto T. 2006. Sequencing and characterization of telomere and subtelomere regions on rice chromosomes 1S, 2S, 2L, 6L, 7S, 7L and 8S. Plant J. 46:206–217. Morrow CA, Fraser JA. 2009. Sexual reproduction and dimorphism in the pathogenic basidiomycetes. FEMS Yeast Res. 9:161–177. Mukhopadhyay R, Rosen BP. 1998. Saccharomyces cerevisiae ACR2 gene encodes an arsenate reductase. FEMS Microbiol Lett. 168:127–136. Nielsen K, Cox GM, Wang P, Toffaletti DL, Perfect JR, Heitman J. 2003. Sexual cycle of Cryptococcus neoformans var. grubii and virulence of congenic a and alpha isolates. Infect Immun. 71:4831–4841. Ogihara F, Kitagaki H, Wang Q, Shimoi H. 2008. Common industrial sake yeast strains have three copies of the AQY1-ARR3 region of chromosome XVI in their genomes. Yeast 25:419–432. Park BJ, Wannemuehler KA, Marston BJ, Govender N, Pappas PG, Chiller TM. 2009. Estimation of the current global burden of cryptococcal meningitis among persons living with HIV/AIDS. AIDS 23:525–530. Perfect JR. 2005. Cryptococcus neoformans: a sugar-coated killer with designer genes. FEMS Immunol Med Microbiol. 45:395–404. Pitkin JW, Panaccione DG, Walton JD. 1996. A putative cyclic peptide efflux pump encoded by the TOXA gene of the plantpathogenic fungus Cochliobolus carbonum. Microbiology 142(6):1557–1565. Price MS, Betancourt-Quiroz M, Price JL, Toffaletti DL, Vora H, Hu G, Kronstad JW, Perfect JR. 2011. Cryptococcus neoformans requires a functional glycolytic pathway for disease but not persistence in the host. MBio 2(3):e00103-11. Pryde FE, Gorham HC, Louis EJ. 1997. Chromosome ends: all the same under their caps. Curr Opin Genet Dev. 7:822–828. Reams AB, Neidle EL. 2004. Selection for gene clustering by tandem duplication. Annu Rev Microbiol. 58:119–142. Rehmeyer C, Li W, Kusaba M, Kim YS, Brown D, Staben C, Dean R, Farman M. 2006. Organization of chromosome ends in the 1999 Chow et al. · doi:10.1093/molbev/mss066 rice blast fungus, Magnaporthe oryzae. Nucleic Acids Res. 34:4685–4701. Romero D, Palacios R. 1997. Gene amplification and genomic plasticity in prokaryotes. Annu Rev Genet. 31:91–111. Rosen BP. 1995. Resistance mechanisms to arsenicals and antimonials. J Basic Clin Physiol Pharmacol. 6:251–263. Rosen BP. 2002. Transport and detoxification systems for transition metals, heavy metals and metalloids in eukaryotic and prokaryotic microbes. Comp Biochem Physiol. 133:689–693. Schildkraut E, Miller CA, Nickoloff JA. 2005. Gene conversion and deletion frequencies during double-strand break repair in human cells are controlled by the distance between direct repeats. Nucleic Acids Res. 33:1574–1580. Steenbergen JN, Shuman HA, Casadevall A. 2001. Cryptococcus neoformans interactions with amoebae suggest an explanation for its virulence and intracellular pathogenic strategy in macrophages. Proc Natl Acad Sci U S A. 98:15245–15250. Sun S, Xu J. 2009. Chromosomal rearrangements between serotype A and D strains in Cryptococcus neoformans. PLoS One 4:e5524. Thompson JD, Higgins DG, Gibson TJ. 1994. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22:4673–4680. Wang Y, Liu TB, Delmas G, Park S, Perlin D, Xue C. 2011. Two major inositol transporters and their role in cryptococcal virulence. Eukaryot Cell. 10:618–628. 2000 MBE Watanabe T, Horiuchi T. 2005. A novel gene amplification system in yeast based on double rolling-circle replication. EMBO J. 24:190–198. Winzeler EA, Castillo-Davis CI, Oshiro G, Liang D, Richards DR, Zhou Y, Hartl DL. 2003. Genetic diversity in yeast assessed with whole-genome oligonucleotide arrays. Genetics 163:79–89. Winzeler EA, Shoemaker DD, Astromoff A, et al. (49 co-authors). 1999. Functional characterization of the S. cerevisiae genome by gene deletion and parallel analysis. Science 285:901–906. Wu C, Kim YS, Smith KM, Li W, Hood HM, Staben C, Selker EU, Sachs MS, Farman ML. 2009. Characterization of chromosome ends in the filamentous fungus Neurospora crassa. Genetics 181:1129–1145. Wysocki R, Bobrowicz P, Ulaszewski S. 1997. The Saccharomyces cerevisiae ACR3 gene encodes a putative membrane protein involved in arsenite transport. J Biol Chem. 272:30061–30066. Wysocki R, Chery CC, Wawrzycka D, Van Hulle M, Cornelis R, Thevelein JM, Tamas MJ. 2001. The glycerol channel Fps1p mediates the uptake of arsenite and antimonite in Saccharomyces cerevisiae. Mol Microbiol. 40:1391–1401. Xu J, Vilgalys R, Mitchell TG. 2000. Multiple gene genealogies reveal recent dispersion and hybridization in the human pathogenic fungus Cryptococcus neoformans. Mol Ecol. 9:1471–1481. Xue C, Liu T, Chen L, Li W, Liu I, Kronstad JW, Seyfang A, Heitman J. 2010. Role of an expanded inositol transporter repertoire in Cryptococcus neoformans sexual reproduction and virulence. MBio 1(1):e00084–10.
© Copyright 2026 Paperzz