Alu-Mediated Acquisition of Unstable ATTCT Pentanucleotide Repeats in the Human ATXN10 Gene Tatsuaki Kurosaki,* Tohru Matsuura, Kinji Ohno, and Shintaroh Ueda* *Department of Biological Sciences, Graduate School of Science, The University of Tokyo, Tokyo, Japan; and Division of Neurogenetics, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, Nagoya, Japan Spinocerebellar ataxia type 10 is caused by ATTCT repeat expansion in the ATXN10 gene in humans. We studied the evolutionary history of the human genome to determine the time and mechanism of the acquisition of unstable ATTCT repeats in the genome. We found that long interspersed element-1 (LINE-1) was inserted into ATXN10 intron 9; Alu was then inserted in the middle of LINE-1; and endogenous retrovilcus K was lastly retrotransposed in the middle of Alu. The ATTCT repeat was located on the boundary between the 3#-end of the Alu element and the direct repeat arising from LINE-1. We determined nucleotide sequences of the orthologous region of 50 individuals representing 33 primate species and compared them with the human sequence. The analysis revealed that the ATTCT repeat is present only in human and apes. Old World monkeys also possess pentanucleotide repeats, but their motifs are TGTCT and GGTCT. New World monkeys and prosimians are not informative because they lack the corresponding region in ATXN10 intron 9. Our studies dictate two parsimonious scenarios of evolution. First, a TTTCT motif arose from a TTTTT motif at the junction of Alu and LINE-1, which was followed by introduction of A to make an ATTCT motif in hominoids. Second, an ATTCT motif was directly generated from an ancestral ATTTT motif in the common ancestor of catarrhines. We also demonstrate that orangutan uniquely introduced G to make a GTTCT motif and later C to make a GTTCC motif, where newly introduced nucleotides are underlined. Our studies reveal that nucleotide substitutions in a poly(A) tail of the Alu element and the following amplification of pentanucleotides occurred in the lineages of Old World monkeys and hominoids and that unstable ATTCT pentanucleotide repeats originated in the common ancestor of hominoids. These findings also highlight a new aspect of the role of retrotransposons in human disease and evolution, which might be useful in investigating the mystery of human uniqueness. Introduction Spinocerebellar ataxia type 10 (SCA10) is an autosomal dominant neurodegenerative disorder characterized by progressive ataxia and seizure (Grewal et al. 1998; Rasmussen et al. 2001). It is caused by unstable and massive expansion of the ATTCT pentanucleotide repeat in intron 9 of the ATXN10 gene, which is mapped on 22q13.3 (Matsuura et al. 1999, 2000; Zu et al. 1999). SCA10 is the only genetic disease known to be caused by an expansion of a pentanucleotide repeat. A recent study has shown that mutant ATXN10 allele containing the expanded repeat is transcribed at the same level as the wild-type allele and that its transcript is normally spliced, suggesting that the expanded AUUCU repeats potentially exert a gain-of-function disease mechanism (Wakamiya et al. 2006). The SCA10 expansion has unique characteristics of instability in both somatic and germline tissues, which are distinct from other repeat expansions (Matsuura et al. 2004). There is great difference in the ATTCT repeat length between the alleles of normal and affected individuals: Wild-type alleles have 10–29 repeats, whereas mutant alleles have 280–4,500 repeats (Matsuura et al. 2000, 2006). The size of the expanded segment is among the largest found to date in human microsatellite mutations (Matsuura et al. 2000). In other types of repeat expansion disorders, the difference is moderate, and some of their repeat ranges are partially overlapped between the wild-type and mutant alleles, suggesting the presence of an intermediated ‘‘premutation’’ state of the wild-type alleles. It is considered that the premutation repeats are unstable and may expand during transmission Key words: spinocerebellar ataxia, SCA10, pentanucleotide repeat, neurodegenerative disease, Alu. E-mail: [email protected]. Mol. Biol. Evol. 26(11):2573–2579. 2009 doi:10.1093/molbev/msp172 Advance Access publication August 3, 2009 Ó The Author 2009. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: [email protected] from parents to offspring. Repeat interruption is popularly observed in the wild-type alleles of repeat expansion disease genes such as those of spinocerebellar ataxia type 1, spinocerebellar ataxia type 2, fragile X syndrome, and Friedreich ataxia (FRDA) (Chung et al. 1993; Kunst and Warren 1994; Imbert et al. 1996; Pulst et al. 1996; Sanpei et al. 1996; Montermini et al. 1997; Kurosaki et al. 2006). These interruptions are supposed to stabilize the repeat itself or prevent repeat expansion (Sobczak and Krzyzosiak 2004, 2005). However, the role of repeat interruptions in SCA10 is controversial in that multiple interruptions have been recently observed in the expanded ATTCT repeat of SCA10 as well as large normal alleles and they may also influence a disease presentation (Matsuura et al. 2006). To better understand the role of interruptions in SCA10 expansion and disease, we addressed the evolutionary process of the repeat configuration with interruption. Our analyses reveal that Alu and long interspersed element-1 (LINE-1) were retrotransposed into a common ancestor before the divergence of Old World monkeys and the ATTCT motif appeared at the junction of the Alu and LINE-1. Materials and Methods Samples We examined the orthologous region of 14 individuals of 5 species of apes, 28 individuals of 21 species of Old World monkeys, 6 individuals of 6 species of New World monkeys, and 2 individuals of 1 species of prosimians (table 1). Comparative In Silico Analysis of Mammalian ATXN10 Genes The homologous regions of mammalian ATXN10 genes corresponding to the human ATXN10 ATTCT repeats 2574 Kurosaki et al. Table 1 Repeat Configurations of ATXN10 Gene in Primates Species Num. Repeat Configuration Human Apes Chimpanzee Gorilla — (ATTCT)10–29 8 1 Orangutan 2 White-handed gibbon Agile gibbon 1 2 (ATTCT)4ATTTA ATTCC(ATTCT)15/17ATCCT ATTCTTTTAT ATTCTATTCT(GTTCT)2 (GTTCTGTTCC)3ATTCT ATTCT(GTTCT)3(GTTCT GTTCC)3ATTCT (ATTCT)4ATTTA (ATTCT)11/14/15ATTCA Old World monkeys Green monkey Patas monkey Hamadryas baboon Anubis baboon Mandrill Black mangabey Blue monkey 1 1 1 1 1 Barbary macaque Crab-eating macaque Lion-tailed macaque Stump-tailed macaque Pig-tailed macaque Rhesus macaque Bonnet macaque Assam macaque Tibetan macaque Taiwan macaque Japanese macaque Colobus Hanuman langur Guenon 1 1 1 1 1 1 1 1 1 1 6 1 1 1 AGTCT(TGTCT)4GGTCTTGTCT AGTCT(TGTCT)10(GGTCT)2 TGTCT(GGTCT)2TGTCT AGTCT(TGTCT)9(TGTCC)8 GTCTTTTCTTTTGTCT AGTCT(TGTCT)5(TGTCC)9 GTCTTTTCTTTTGTCT AGTCT(TGTCT)4GGTCTTGTCT AGTCT(TGTCT)4GGTCTTGTCT AGTCT(TGTCT)3(GGTCT)2TGTCT AGTCT(TGTCT)3GGTCTTGTCT AGTCTTATCT(TGTCT)3GGTC TTGTCT AGTAT(TGTCT)3(GGTCT)2TGTCT AGTAT(TGTCT)2(GGTCT)3TGTCT AGTAT(TGTCT)3(GGTCT)2TGTCT AGTAT(TGTCT)2(GGTCT)3TGTCT AGTAT(TGTCT)3(GGTCT)2TGTCT AGTAT(TGTCT)3(GGTCT)2TGTCT AGTAT(TGTCT)2(GGTCT)3TGTCT AGTAT(TGTCT)2(GGTCT)3TGTCT AGTAT(TGTCT)2(GGTCT)3TGTCT AGTAT(TGTCT)3(GGTCT)2TGTCT AGTAT(TGTCT)3(GGTCT)2TGTCT AGTCT(TGTCT)3GGTCT AGTCT(TGTCT)2(GGTCT)2 AGTCT(TGTCT)3GGTCTTGTCT 1 — 1 1 — — 1 1 1 — — — 2 — New World monkeys White-throated capuchin Tafted capuchin Long-haired spider monkey Cotton-top tamarin Common marmoset Owl monkey Prosimians Greater galago 1 3 were obtained from the University of California at Santa Cruz Genome Browser. Retrotransposable elements were identified using the RepeatMasker program (http://www. repeatmasker.org) by using the data obtained from the RepBase library of repeats (Jurka et al. 2005). Simple tandem repeats were surveyed using the Tandem Repeats Finder program (Benson 1999). Their sequences were mechanically aligned with ClustalX version 1.83 (Thompson et al. 1997) and further edited manually using BioEdit version 7.0.5 to verify the insertions and deletions. The Alu subfamily was identified using both the RepeatMasker and CENSOR (http://www.girinst.org/censor/index.php) programs. Genomic Polymeric Chain Reaction (PCR) and Sequencing of Nonhuman Primate ATXN10 Genes DNA samples were prepared from blood specimens by the conventional phenol–chloroform methods. The PCR primers and conditions are summarized in supplementary table 1, Supplementary Material online. For direct sequencing of fragments A–D in supplementary table 1, Supplementary Material online, we used KOD-plus-DNA polymerase (Toyobo, Japan). We fractionated the PCR products on an agarose gel, purified the excised fragments using NucleoSpin Extract II (Macherey-Nagel, Germany), and directly sequenced them on an ABI prism 3100 Genetic Analyzer (Applied Biosystems, United States). For samples showing ambiguous sequences of fragments A and B, the PCR products were cloned and sequenced using TArget Clone-plus-cloning system (Toyobo). For cloning and sequencing of fragments C and D, we employed KOD Dash DNA polymerase (Toyobo) and the TOPO TA Cloning Kit (Invitrogen, United States). For direct sequencing of prosimian fragment D, we used the Phusion high-fidelity DNA polymerase (Finnzymes, Finland). The nucleotide sequences were deposited in DDBJ/EMBL/GenBank (accession numbers AB449112–AB449143 and AB450921– AB450926). Results To investigate the evolutionary origin of the unstable ATTCT repeat in human ATXN10, comparative analysis was first performed using the known mammalian ATXN10 ! FIG. 1.—Genomic structure of the human ATXN10 gene and ATTCT repeat surrounding region. (a) Genomic alignment of intron 9 corresponding to the human ATXN10 ATTCT repeats among mammalian ATXN10 genes. The broken square denotes a 7.4-kb block that is absent in nonprimate species. Repetitive sequences are also shown, classified into short interspersed elements (SINEs), LINEs, LTRs, and simple tandem repeats. pTRNT1 is a processed pseudogene of TRNT1. (b) Nucleotide sequence surrounding the ATTCT pentanucleotide repeats in the human ATXN10 gene. SINEs, LINEs, LTRs, and simple tandem repeats are indicated by blue, green, gray, and red colors, respectively. The black-shaded and yellow-shaded nucleotides represent the direct repeats and poly(A) tail as a signature of the blue-shaded Alu insertion into the green-shaded LINE-1 sequence, respectively (due to the reversed direction, the poly(A) tail is designated as poly(T) tail in the coding direction of the ATXN10 gene). The dark gray-shaded nucleotides represent the direct repeats as a signature of the LTR-type transposable element, endogenous retrovilcus K (ERVK) (light gray shaded), insertion into the Alu sequence. The Alu element is separated into two parts by insertion of the ERVK element. The retrotransposons found in the 7.4-kb block absent in nonprimate species are as follows: 1, Alu element (286–112); 2, Alu element (111–1); 3, Alu element (305–139); 4, LINE-1 (3443–3144); 5, LINE-1 (1585–2380); 6, LINE-1 (2368–4773); 7, LINE-1 (4774–4852); 8, LINE-1 (4853–6235); 9, ERV1 (486–27); 10, ERVK (1–1236); and 11, MaLR (1–467). The numbers in parentheses are the nucleotide positions of each retrotransposon (in the direction of the reading frame of the ATXN10 gene), which are according to the notation of each reference sequence in RepeatMasker version open 3.2.5. The putative LINE-1 endonuclease cleavage site is indicated by a vertical black arrow. Origin of ATXN10 ATTCT Pentanucleotide Repeats 2575 2576 Kurosaki et al. genes, including human, chimpanzee, macaque, mouse, rat, rabbit, dog, cow, elephant, and opossum. As shown in figure 1a, the ATXN10 ATTCT repeat is present in human, chimpanzee, and macaque, whereas an approximately 7.4-kb region containing the repeat is absent in the nonprimate mammals. Most—in fact nearly all—of the sequences in this region were positively screened by the RepeatMasker program and the ATTCT repeat was located on a fragment rich in interspersed repeats. Analysis of individual sequences revealed that three retrotransposons are nested in the repeat flanking region (fig. 1b). First, a LINE-1 (green in fig. 1b) was retrotransposed in intron 9 of ATXN10. The subtype of LINE-1 was identified as L1ME2, which belongs to an old LINE-1 subfamily inserted into mammalian lineage before the primate radiation (Smit et al. 1995). Second, an Alu element (blue in fig. 1b) was inserted in the middle of LINE-1. The Alu element was flanked by a 13-bp stretch of direct repeats of 5#-TAKAMTGGATTYT-3# (black in fig. 1b) that originated from LINE-1. The downstream repeat was exactly located at the LINE-1 endonuclease cleavage site of 5#-TTTT/GA-3# (arrow in fig. 1b). Chimpanzee carried the same cleavage site of 5#-TTTT/GA-3#, whereas orangutan and rhesus macaque harbored another well-known LINE-1 cleavage site of 5#-TTTT/AA-3# (Feng et al. 1996; Jurka 1997; Cost and Boeke 1998), indicating that the Alu insertion is likely to have occurred at 5#-TTTT/AA-3#. In addition, poly(A) tail (yellow in fig. 1b) was preserved at the 3# end of the Alu. The Alu element subtype was classified into AluSz or AluSx. AluS is an intermediate-age Alu subfamily and is considered to have explosively expanded 35–50 Ma (Britten 1994; Kapitonov and Jurka 1996; Roy-Engel et al. 2002). We calculated the approximate age of the Alu element by the Kimura-2-parameter model with a mutation rate of 0.0015 substitutions/site/My at non-CpG substitution sites (Labuda and Striker 1989; Batzer et al. 1990; Xing et al. 2004) and estimated that the retrotransposition occurred 50 ± 3.9 (age ± standard deviation) Ma. This age is compatible with the evolutionary divergence of species between platyrrhines (New World monkeys) and catarrhines (Old World monkeys, apes, and human) that occurred around 40–50 Ma (Goodman et al. 1998; Kumar and Hedges 1998). Third, an LTR (long terminal repeat)–type transposable element, endogenous retrovilcus K (ERVK) (light gray in fig. 1b), was inserted in the middle of Alu. The ERVK is flanked by direct repeats (dark gray in fig. 1b) originating from the Alu element. Explosive expansion of the ERVK element is estimated to have occurred 30–45 Ma almost exclusively in catarrhines (Mariani-Costantini et al. 1989; Steinhuber et al. 1995; Sverdlov 2000). The insertion is thus likely to have occurred in the common ancestor of catarrhines after divergence between platyrrhines and catarrhines. The ATTCT repeat was flanked by the 13-bp direct repeat (black in fig. 1b) originating from LINE-1 and by the poly(A) tail (yellow in fig. 1b) of Alu, indicating that the repeat arose from either element. To further dissect the evolutionary origin of the unstable ATTCT repeat, we determined nucleotide sequences of 5 species of apes, 21 of Old World monkeys, 6 of New World monkeys, and 1 of prosimians (supplementary fig. 1, Supplementary Material online). All catarrhine primates (apes and Old World monkeys) carried the orthologous region, whereas New World monkeys and prosimians lacked the region. The orthologous region in catarrhines harbored both Alu and ERVK as in human, but the pentanucleotide repeats widely varied in motif and length. In apes, except for orangutan, the repeat configuration was of ATTCT pentanucleotide as in human (table 1). Orangutan had repeat configurations of GTTCT and GTTCC, flanked by ATTCT at both ends. These results indicated that the ancestral configuration of the repeat was ATTCT in hominoids. In Old World monkeys, the repeat configurations were not of ATTCT pentanucleotide but of TGTCT and GGTCT pentanucleotides. To gain an insight into intraspecific variation, we examined multiple individuals of each of the following species: chimpanzee (N 5 8), orangutan (N 5 2), agile gibbon (N 5 2), patas monkey (N 5 3), and Japanese macaque (N 5 6). We observed a single unique allele in chimpanzee and Japanese macaque, whereas we detected two to three different alleles in orangutan, agile gibbon, and patas monkey (table 1). It is interesting to note that shorter repeat numbers resulted in a unique allele, whereas longer repeats tended to give rise to variable alleles. We also determined nucleotide sequences of six species of New World monkeys (supplementary fig. 1, Supplementary Material online). The missing region in New World monkeys corresponded to a region shown in nucleotide sequences in supplementary fig. 2, Supplementary Material online, which included the ATTCT repeats, Alu, ERVK, and the flanking regions of LINE-1. As the flanking sequences were well conserved among New World monkeys and as the junction did not coincide with the end of LINE-1, partial–complete deletion of the retrotransposons is likely to have occurred in the common ancestor of New World monkeys. In a prosimian, the 7.4-kb region, which was absent in nonprimate mammals, was mostly deleted, but the evolutionary origin of the lack of the retrotransposons remain unelucidated (supplementary fig. 3, Supplementary Material online). Discussion Transposable elements abundantly exist in the human genome, occupying up to 44% of the whole genome (Lander et al. 2001). Although most of the transposable elements have lost their transposition ability long ago, some elements such as LINE-1 and Alu have been observed to sustain this ability in the recent history of primate evolution (Deininger and Batzer 2002). It is known that Alu causes various inherited disorders such as Lesch–Nyhan syndrome and Tay–Sachs disease. In addition, several types of cancer are also associated with Alu-mediated recombination (Deininger and Batzer 1999). The time and order of LINE-1, Alu, and ERVK insertions into ATXN10 intron 9 that we estimated in the current studies well support previous reports indicating when the retrotransposons expanded in primate lineage. Our current studies indicate that a T stretch at the junction of Alu and LINE-1 has gradually evolved to Origin of ATXN10 ATTCT Pentanucleotide Repeats 2577 FIG. 2.—Two parsimonious scenarios for acquisition of ATTCT pentanucleotide repeats in human ATXN10 gene. The LINE-1, Alu, and ERVK retrotransposition events are shown as dark bars on the evolutionary tree with the putative time of each event. The number at each node represents divergence time according to TimeTree (Hedges et al. 2006) in Ma. Repeat motifs are indicated by pentanucleotides and shaded in black. Newly introduced nucleotides are underlined. a pentanucleotide repeat during primate (simian) evolution. One of the most parsimonious scenarios of evolution is as follows (fig. 2a): First, an Alu insertion either before or after divergence of platyrrhines and catarrhines generated a TTTTT motif at the junction with LINE-1. Second, a polymorphic C was introduced into TTTTT to make a TTTCT motif, where a newly introduced nucleotide is underlined. We observe TTTCT only at the 3# end of the pentanucleotide repeats in all the species except chimpanzee and gib- bons. Third, a clade of Old World monkeys introduced G at the second nucleotide to make a TGTCT motif and thereafter another G at the first nucleotide to make a GGTCT motif. Fourth, hominoids introduced A at the first nucleotide to make an ATTCT motif. Fifth, orangutan uniquely introduced G at the first nucleotide to make a GTTCT motif and later C at the fifth nucleotide to make a GTTCC motif. This scenario indicates that ATTCT appeared after divergence of Old World monkeys. An alternative scenario of 2578 Kurosaki et al. evolution is that ATTTT instead of TTTTT has evolved to the unstable ATTCT pentanucleotide (fig. 2b). In the alternative scenario, ATTCT appeared before divergence of Old World monkeys, and Old World monkeys introduced three polymorphic nucleotides to the ATTCT motif. Although three polymorphic nucleotide substitutions in a single clade are less likely, conservation of AGTCT in the first unit of the pentanucleotide repeat in all Old World monkeys except for genus macaca supports the alternative scenario (table 1). A previous study suggested that Alu elements are a source for the genesis of microsatellite repeats (Arcot et al. 1995), and 5.7% of Alu poly(A) tail consists of patterned A-rich repeat sequence (Economou et al. 1990). However, there are few examples that show that the microsatellite repeat derived from an Alu element is responsible for a genetic disorder in human. FRDA is caused by the expansion of GAA triplet repeats in intron 1 of the FXN gene. Normal alleles have 6–36 repeats, whereas mutant alleles have 66–1,700 repeats (Campuzano et al. 1996; Dürr et al. 1996; Epplen et al. 1997; Montermini et al. 1997). Justice et al. (2001) report that the triplet repeats are introduced by retrotransposition of an Alu element into the FXN gene during primate evolution, which is analogous to Alu that we identify in intron 9 of ATXN10. Unlike ATXN10, the triplet repeats in FXN lie in the middle A-rich region of the Alu element (Montermini et al. 1997). To our knowledge, our current studies first demonstrate that the unstable repeats in repeat expansion disease genes have originated from a poly(A) tail of an Alu element and its junction. The human ATXN10 gene acquired pathogenicity due to subsequent mutations, namely, nucleotide substitutions and configuration as repeats, in the poly(A) tract after retrotransposition during primate evolution. This might imply that retrotransposon-mediated mutations could contribute to the diversification of species and possibly to human uniqueness. Supplementary Material Supplementary table 1 and supplementary figures 1–3 are available at Molecular Biology and Evolution online (http://www.mbe.oxfordjournals.org/). Acknowledgments This study was supported by grants from the Ministry of Education, Science, Sports, and Culture of Japan. No conflict of interest is associated with the manuscript. We thank anonymous reviewers for valuable advice. Literature Cited Arcot SS, Wang Z, Weber JL, Deininger PL, Batzer MA. 1995. Alu repeats: a source for the genesis of primate microsatellites. Genomics. 29:136–144. Batzer MA, Kilroy GE, Richard PE, Shaikh TH, Desselle TD, Hoppens CL, Deininger PL. 1990. Structure and variability of recent inserted Alu family members. Nucleic Acids Res. 18:6793–6798. Benson G. 1999. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 27:573–580. Britten RJ. 1994. Evidence that most human Alu sequences were inserted in a process that ceased about 30 million years ago. Proc Natl Acad Sci USA. 91:6148–6150. Campuzano V, Montermini L, Molto MD, et al. (31 co-authors). 1996. Friedreich’s ataxia: autosomal recessive disease caused by an intronic GAA triplet repeat expansion. Science. 271: 1423–1427. Chung MY, Ranum LP, Duvick LA, Servadio A, Zoghbi HY, Orr HT. 1993. Evidence for a mechanism predisposing to intergenerational CAG repeat instability in spinocerebellar ataxia type I. Nat Genet. 5:254–258. Cost GJ, Boeke JD. 1998. Targeting of human retrotransposon integration is directed by the specificity of the L1 endonuclease for regions of unusual DNA structure. Biochemistry. 37:18081–18093. Deininger PL, Batzer MA. 1999. Alu repeats and human disease. Mol Genet Metab. 67:183–193. Deininger PL, Batzer MA. 2002. Mammalian retroelements. Genome Res. 12:1455–1465. Dürr A, Cossee M, Agid Y, Campuzano V, Mignard C, Penet C, Mandel JL, Brice A, Koenig M. 1996. Clinical and genetic abnormalities in patients with Friedreich’s ataxia. N Engl J Med. 335:1169–1175. Economou EP, Bergen AW, Warren AC, Antonarakis SE. 1990. The polydeoxyadenylate tract of Alu repetitive elements is polymorphic in the human genome. Proc Natl Acad Sci USA. 87:2951–2954. Epplen C, Epplen JT, Frank G, Miterski B, Santos EJ, Schols L. 1997. Differential stability of the (GAA)n tract in the Friedreich ataxia (STM7) gene. Hum Genet. 99:834–836. Feng Q, Moran JV, Kazazian HH Jr, Boeke JD. 1996. Human L1 retrotransposon encodes a conserved endonuclease required for retrotransposition. Cell. 87:905–916. Goodman M, Porter CA, Czelusniak J, Page SL, Schneider H, Shoshani J, Gunnell G, Groves CP. 1998. Toward a phylogenetic classification of Primates based on DNA evidence complemented by fossil evidence. Mol Phylogenet Evol. 9: 585–598. Grewal RP, Tayag E, Figueroa KP, Zu L, Durazo A, Nunez C, Pulst SM. 1998. Clinical and genetic analysis of a distinct autosomal dominant spinocerebellar ataxia. Neurology. 51: 1423–1426. Hedges SB, Dudley J, Kumar S. 2006. TimeTree: a public knowledge-base of divergence times among organisms. Bioinformatics. 22:2971–2972. Imbert G, Saudou F, Yvert G, et al. (15 co-authors). 1996. Cloning of the gene for spinocerebellar ataxia 2 reveals a locus with high sensitivity to expanded CAG/glutamine repeats. Nat Genet. 14:285–291. Jurka J. 1997. Sequence patterns indicate an enzymatic involvement in integration of mammalian retrotransposons. Proc Natl Acad Sci USA. 94:1872–1877. Jurka J, Kapitonov VV, Pavlicek A, Klonowski P, Kohany O, Walichiewicz J. 2005. Repbase update, a database of eukaryotic repetitive elements. Cytogenet Genome Res. 110: 462–467. Justice CM, Den Z, Nguyen SV, Stoneking M, Deininger PL, Batzer MA, Keats BJ. 2001. Phylogenetic analysis of the Friedreich ataxia GAA trinucleotide repeat. J Mol Evol. 52: 232–238. Kapitonov V, Jurka J. 1996. The age of Alu subfamilies. J Mol Evol. 42:59–65. Kumar S, Hedges SB. 1998. A molecular timescale for vertebrate evolution. Nature. 392:917–920. Kunst CB, Warren ST. 1994. Cryptic and polar variation of the fragile X repeat could result in predisposing normal alleles. Cell. 77:853–861. Origin of ATXN10 ATTCT Pentanucleotide Repeats 2579 Kurosaki T, Ninokata A, Wang L, Ueda S. 2006. Evolutionary scenario for acquisition of CAG repeats in human SCA1 gene. Gene. 373:23–27. Labuda D, Striker G. 1989. Sequence conservation in Alu evolution. Nucleic Acids Res. 17:2477–2491. Lander ES, Linton LM, Birren B, et al. (256 co-authors). 2001. Initial sequencing and analysis of the human genome. Nature. 409:860–921. Mariani-Costantini R, Horn TM, Callahan R. 1989. Ansestry of a human endogenous retrovirus family. J Virol. 63:4982–4985. Matsuura T, Achari M, Khajavi M, Bachinski LL, Zoghbi HY, Ashizawa T. 1999. Mapping of the gene for a novel spinocerebellar ataxia with pure cerebellar signs and epilepsy. Ann Neurol. 45:407–411. Matsuura T, Fang P, Lin X, et al. (14 co-authors). 2004. Somatic and germline instability of the ATTCT repeat in spinocerebellar ataxia type 10. Am J Hum Genet. 74:1216–1224. Matsuura T, Fang P, Pearson CE, Jayakar P, Ashizawa T, Roa BB, Nelson DL. 2006. Interruptions in the expanded ATTCT repeat of spinocerebellar ataxia type 10: repeat purity as a disease modifier? Am J Hum Genet. 78:125–129. Matsuura T, Yamagata T, Burgess DL, et al. (17 co-authors). 2000. Large expansion of the ATTCT pentanucleotide repeat in spinocerebellar ataxia type 10. Nat Genet. 26:191–194. Montermini L, Andermann E, Labuda M, et al. (14 co-authors). 1997. The Friedreich ataxia GAA triplet repeat: premutation and normal alleles. Hum Mol Genet. 6:1261–1266. Pulst SM, Nechiporuk A, Nechiporuk T, et al. (16 co-authors). 1996. Moderate expansion of a normally biallelic trinucleotide repeat in spinocerebellar ataxia type 2. Nat Genet. 14: 269–276. Rasmussen A, Matsuura T, Ruano L, Yescas P, Ochoa A, Ashizawa T, Alonso E. 2001. Clinical and genetic analysis of four Mexican families with spinocerebellar ataxia type 10. Ann Neurol. 50:234–239. Roy-Engel AM, Carroll ML, El-Sawy M, Salem AH, Garber RK, Nguyen SV, Deininger PL, Batzer MA. 2002. Non-traditional Alu evolution and primate genomic diversity. J Mol Biol. 316:1033–1040. Sanpei K, Takano H, Igarashi S, et al. (25 co-authors). 1996. Identification of the spinocerebellar ataxia type 2 gene using a direct identification of repeat expansion and cloning technique, DIRECT. Nat Genet. 14:277–284. Smit AF, Toth G, Riggs AD, Jurka J. 1995. Ancestral, mammalian-wide subfamilies of LINE-1 repetitive sequences. J Mol Biol. 246:401–417. Sobczak K, Krzyzosiak WJ. 2004. Imperfect CAG repeats form diverse structures in SCA1 transcripts. J Biol Chem. 279: 41563–41572. Sobczak K, Krzyzosiak WJ. 2005. CAG repeats containing CAA interruptions form branched hairpin structures in spinocerebellar ataxia type 2 transcripts. J Biol Chem. 280: 3898–3910. Steinhuber S, Brack M, Hunsmann G, Schwelberger H, Dierich MP, Vogetseder W. 1995. Distribution of human endogenous retrovirus HERV-K genomes in humans and different primates. Hum Genet. 96:188–192. Sverdlov ED. 2000. Retroviruses and primate evolution. BioEssays. 22:161–171. Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG. 1997. The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 25:4876–4882. Wakamiya M, Matsuura T, Liu Y, Schuster GC, Gao R, Xu W, Sarkar PS, Lin X, Ashizawa T. 2006. The role of ataxin 10 in the pathogenesis of spinocerebellar ataxia type 10. Neurology. 67:607–613. Xing J, Hedges DJ, Han K, Wang H, Cordaux R, Batzer MA. 2004. Alu element mutation spectra: molecular clocks and the effect of DNA methylation. J Mol Biol. 344:675–682. Zu L, Figueroa KP, Grewal R, Pulst SM. 1999. Mapping of a new autosomal dominant spinocerebellar ataxia to chromosome 22. Am J Hum Genet. 64:594–599. Norihiro Okada, Associate Editor Accepted July 28, 2009
© Copyright 2026 Paperzz