1997 Oxford University Press Human Molecular Genetics, 1997, Vol. 6, No. 7 1117–1123 Human MSH2 binds to trinucleotide repeat DNA structures associated with neurodegenerative diseases Christopher E. Pearson1,*, Amy Ewel2, Samir Acharya2, Richard A. Fishel2,* and Richard R. Sinden1 1Center for Genome Research, Institute of Biosciences and Technology in the Texas Medical Center, Texas A&M University, 2121 West Holcombe, Houston, TX, 77030, USA and 2DNA Repair and Molecular Carcinogenesis Program, Kimmel Cancer Center, Thomas Jefferson University, Philadelphia, PA 19107, USA Received February 14, 1997; Revised and Accepted April 9, 1997 The expansion of trinucleotide repeat sequences is associated with several neurodegenerative diseases. The mechanism of this expansion is unknown but may involve slipped-strand structures where adjacent rather than perfect complementary sequences of a trinucleotide repeat become paired. Here, we have studied the interaction of the human mismatch repair protein MSH2 with slipped-strand structures formed from a triplet repeat sequence in order to address the possible role of MSH2 in trinucleotide expansion. Genomic clones of the myotonic dystrophy locus containing disease-relevant lengths of (CTG)n·(CAG)n triplet repeats were examined. We have constructed two types of slipped-strand structures by annealing complementary strands of DNA containing: (i) equal numbers of trinucleotide repeats (homoduplex slipped structures or S-DNA) or (ii) different numbers of repeats (heteroduplex slipped intermediates or SI-DNA). SI-DNAs having an excess of either CTG or CAG repeats were structurally distinct and could be separated electrophoretically and studied individually. Using a band-shift assay, the MSH2 was shown to bind to both S-DNA and SI-DNA in a structure-specific manner. The affinity of MSH2 increased with the length of the repeat sequence. Furthermore, MSH2 bound preferentially to looped-out CAG repeat sequences, implicating a strand asymmetry in MSH2 recognition. Our results are consistent with the idea that MSH2 may participate in trinucleotide repeat expansion via its role in repair and/ or recombination. INTRODUCTION The etiology of nine human genetic diseases, including Huntington’s disease, spinocerebellar ataxia type 1 (SCA-1), spino-bulbar muscular atrophy, myotonic dystrophy (DM) and fragile X-syndrome (FRAXA) have been traced to genetic variation in the lengths of (CTG)n·(CAG)n or (CGG)n·(CCG)n trinucleotide repeats within the affected locus (for a review see ref. 1). In normal individuals, these genes contain 5–37 (CTG)n·(CAG)n repeats, while increases in the number of repeats to 50–200 are associated with unstable alleles and disease symptoms (1,2). Explosive expansion to >3000 repeats can occur at the DM and FRAXA loci. The probability of expansion strongly depends upon the number of existing trinucleotide repeats; the longer the repeat tract, the greater the possibility for expansion. This length-sensitive probability of mutation has led to the term ‘dynamic mutation’ to describe this process (3). In SCA-1 and FRAXA, the repeat tracts are normally interrupted with the non-repeat units CAT and AGG respectively (4–7). In each case, the non-repeat interruptions confer increased genetic stability to the repeat tract, such that interrupted tracts are less likely to undergo expansions. Loss of non-repeat interruptions, and hence an enhanced length of the pure repeat tracts, are correlated with instability and disease. In the case of FRAXA, changes in the repeat length are polar and occur primarily in the 3′ end of the (CGG)n tract, implicating replication direction in the process (6). For some of the diseases, there also appear to be specific tissues as well as ‘windows’ for triplet expansion which are believed to occur during gametogenesis and/or early postzygotic development (1). The mechanism(s) of repeat expansion is unknown. Since both the length and the sequence of the genetically unstable tandem repeats are limited to only certain triplet repeats [(CTG)n·(CAG)n, (CGG)n·(CCG)n, (AAG)n·(CTT)n], it is thought that some type of DNA secondary structure may contribute to trinucleotide instability in these neurodegenerative diseases (1–13). Spontaneous mutations in prokaryotic and eukaryotic (including human) cells frequently are associated with simple DNA repeat sequences that can form alternative non-B DNA structures (reviewed in 10 and 11). The formation of slipped-strand structures between repeated trinucleotide sequences is one possible mechanism that may contribute to expansion (Fig. 1). Such slipped-strand structures may be formed during DNA *To whom correspondence should be addressed. C.E.P. Tel: +1 713 677 7676; Fax: +1 713 677 7689; Email: [email protected], or R.F. Tel: +1 215 503 1345; Fax: +1 215 923 1098; Email: [email protected] 1118 Human Molecular Genetics, 1997, Vol. 6, No. 7 via recognition and binding to DNA at the site of a mispair (17–20). The yeast (21) and human (22,23) MSH2 protein also bind specifically to duplex oligonucleotides containing single base pair mismatched nucleotides as well as short (5–16 nucleotide) insertions/deletions loops (IDLs), suggesting that these proteins play a role similar to that of the bacterial MutS. Human tumor cell lines that are deficient in MSH2 display a generalized increase in spontaneous mutation rates, instability in simple repeat (microsatellite) sequences, and yield extracts that are defective for single base pair and IDL mismatch repair in vitro (24–26). Mutations in MSH2 appear to account for ∼50% of hereditary nonpolyposis colon cancers (HNPCCs), suggesting that MSH2 plays a central role in carcinogenesis (27). Furthermore, hMSH2 forms specific mispair binding complexes with two other MutS homologs, MSH3 (28) and MSH6 (GTBP/p160) (28,29). Although the contribution of mismatch repair to trinucleotide repeat stability in human cells remains controversial (30), we have demonstrated that mutations in the bacterial mutS, mutL or mutH genes decrease the frequency of large deletions of plasmid-borne trinucleotide repeats in E.coli (9), implicating a functional mismatch repair system in the mechanism of large deletions in bacteria. To address further a possible role for human mismatch repair in trinucleotide repeat expansion, we have investigated the binding of MSH2 to the slipped-strand structures expected to be intermediates in trinucleotide expansions as a model for the fundamental recognition process that is required in mismatch repair. Figure 1. Replication slippage models. (A) Slippage of the DNA polymerase and the nascent strand backwards on the template strand would result in the formation of structures containing an excess of repeats on the nascent strand, resulting in expansion products. (B) Slippage of the DNA polymerase and the nascent strand forward on the template strand would result in the formation of structures containing an excess of repeats on the template strand, giving rise to deletion products. Both expansion and deletion processes will give rise to replicated duplex DNAs having a stretch of mismatched repeats (a heteroduplex region). The expected products (in the absence of repair) of two rounds of replication are shown. For simplicity, we have assumed that structure formation occurs only in the lagging strand, although it may occur in either strand. (C) Alternative homoduplex repeat structures may form in non-replicating DNA (12). Thin lines represent repeat tracts, broad lines are non-repetitive flanks. replication, and may result in both expansions and deletions of the trinucleotide repeated sequences (Fig. 1A and B) (12,13). In addition, slipped-strand structures may also be formed between complementary strands with identical numbers of repeats (Fig. 1C) (12), as may occur intrachromasomally or during meiotic chromosome synapsis. All of these slipped-strand structures contain unpaired nucleotides and may be substrates for mismatch repair (12,13). Using genomic clones of the DM, SCA-1 and FRAXA loci having normal, pre-mutant and full mutant lengths of repeats, we previously have demonstrated the formation of an apparent slipped-strand structure (12,14, C.E. Pearson unpublished results). In each case, the probability of slipped-strand structure formation increased with increased repeat length. In the case of SCA-1 (CAG)n repeats or FRAXA (CGG)n repeats the presence of CAT or AGG interruptions, respectively, inhibited structure formation (14, C.E. Pearson unpublished results). These results support the participation of a slipped-strand structure in the process of trinucleotide repeat instability. The human homolog of the Escherichia coli MutS protein, MSH2, is a member of the highly conserved family of mismatch repair proteins (15,16). In E.coli, MutS initiates mismatch repair RESULTS Slipped structures Two types of trinucleotide DNA structures were used—homoduplex slipped strand DNA structures (S-DNA) (12), formed between two complementary strands having equal numbers of repeats, and heteroduplex slipped intermediate DNA structures (SI-DNA), in which the strands have different numbers of repeats. Two genomic clones derived from the human DM locus that have identical non-repetitive flanking sequences and 30 or 50 triplet repeats (Fig. 2A) (12), corresponding to the normal and expanded alleles (1), respectively, were used. Plasmids were linearized by HindIII digestion, 32P labeled (*) on the 3′ ends to identical specific activity then treated by denaturation–renaturation (12). For the production of S-DNA, only individual plasmids were reduplexed and for SI-DNA equimolar amounts of the 30 and 50 triplet repeat-containing plasmids were mixed and heteroduplexed. Following renaturation, DNAs were digested with EcoRI, resulting in DM inserts uniquely labeled on the 3′ end of the CTG strand (12). Reduplexing resulted in a reproducible pattern of slow migrating products not present in the non-treated samples (12). The ethidium-stained patterns of the reduplexed n = 30 or n = 50 S-DNAs were identical to their respective autoradiographic patterns (Fig. 2, compare B with C). In the case of reduplexed (CTG)30·(CAG)30-containing plasmid, 39% of the repeat-containing fragment migrated as a series of closely spaced distinct products between 232 and 410 bp, with a single region of greatest intensity migrating as 287 bp (Fig. 2B). For reduplexed (CTG)50·(CAG)50-containing plasmid, 70% of the repeatcontaining fragment migrated as a series of closely spaced distinct products between 295 and 696 bp, with a single region of greatest intensity migrating as 370 bp (Fig. 2B). A larger percentage of the 1119 Human Genetics, 1997, 6, No. NucleicMolecular Acids Research, 1994, Vol. Vol. 22, No. 1 7 1119 tures of the SI-DNAs containing looped-out CTG or CAG strands are sufficient to confer different electrophoretic mobilities to the sister heteroduplexes, with limited overlap. We previously have demonstrated the electrophoretic resolution of two sister threeway junctions of identical sequence composition (31,32). For both reduplexing and heteroduplexing reactions, the pattern and relative proportion of products was highly reproducible. Each of the linear non-treated DNAs, the major S-DNAs and the major SI-DNAs were gel purified (12). We found that the gel-purified products were stable as judged by a minimal interconversion back to linear or other forms (compare Fig. 2C with D) as previously observed (12). However, longer autoradiographic exposures (data not shown) revealed that both the isolated SI-DNAs demonstrated some spontaneous self-association to very slow migrating products (positions indicated by arrowheads in Fig. 2D), whose intensity varied from one preparation to another. It is likely that these represent intermolecular multimeric associations similar to those observed for (CA)n·(TG)n and (GA)n·(TC)n tracts (33,34). Figure 2. Reduplexing/heteroduplexing DM DNA fragments results in slowly migrating DNAs. (A) Map of the HindIII–EcoRI (CTG)n·(CAG)n-containing fragments from DM genomic clones, where n = 30 and n = 50 (12). Human non-repetitive flanking sequences (positions 357–375 and 391–433, as in ref. 2) are in bold, while the thin lines represent plasmid vector sequences. Positions of the unique 3′ CTG radiolabels are indicated (*). LNT, linear non-treated DNAs of n = 30 and n = 50 plasmids (lanes 1 and 4, respectively), reduplexed DNAs (S) from n = 30 and n = 50 plasmids (lanes 2 and 5, respectively), heteroduplexed DNAs (SI) of n = 30 and n = 50 plasmids (lanes 3 and 6, respectively) and the *HindIII–EcoRI pUC19 vector band are indicated. Products were separated on a 4% polyacrylamide gel, stained with ethidium bromide (B) or dried and exposed for autoradiography (C and D). For (B) and (C), samples and lane numbers are identical, and the upper and lower arrows indicate the *(CTG)30·(CAG)50 and *(CTG)50·(CAG)30 SI-DNAs, respectively. Samples in (D) are gel-purified products from their corresponding lanes in (C). These gel-purified products were loaded in equimolar amounts. Positions of the 123 bp ladder are shown. As previously observed (12), a small amount of conversion to the linear form is evident in both gel-purified S-DNA and SI-DNA samples. The three filled arrowheads indicate post-elution multimeric associations of SI-DNAs, which are more evident with longer exposures. DNA containing expanded repeat length (n = 50) formed S-DNA structures than DNA containing repeat lengths found in normal individuals (n = 30) (Fig. 2B and C, compare lanes 2 and 5), confirming a previously observed relationship that increasing length of trinucleotide repeats results in increased slipped-strand DNA products (12). The ethidium-stained products of the *30×50 or *50×30 heteroduplexing reactions revealed that both n = 30 and n = 50 linear and S-DNA products were formed. The two major slower migrating products and the fainter closely spaced products, that only formed in the reactions that contained both the n = 30 and n = 50 parent DNAs, are the heteroduplex SI-DNAs (Fig. 2B, see arrows). The ethidium-stained pattern was identical, regardless of whether n = 30 or the n = 50 was radiolabeled (Fig. 2B). By selectively labeling the (CTG)30- or (CTG)50-containing strands, it is apparent that the uppermost major band represents the *(CTG)30·(CAG)50 heteroduplex (Fig. 2C, lane 3, see arrow), while the faster migrating major band represents the *(CTG)50·(CAG)30 heteroduplex (Fig. 2C, lane 6, see arrow). As with the S-DNA structures formed from reannealing single parent DNAs, there appear to be several structural isomers, with the major product presumably representing the most energetically favorable conformation. Apparently, the difference in the struc- MSH2 binding A band-shift assay was used to detect MSH2 binding to each of the gel-purified substrate DNAs similar to that of Fishel et al. (1994) (22,23). Under these conditions, MSH2 did not bind significantly to the linear *(CTG)30·(CAG)30 or *(CTG)50· (CAG)50 fragments, although, some low level binding was observed (Fig. 3A, lanes 1–3 and 10–12, see open arrowheads). Binding of hMSH2 to either of the S-DNAs resulted in a single and distinct shifted complex (Fig. 3A, lanes 4–6 and 13–15, see arrow). MSH2 also bound to the *(CTG)30·(CAG)50 SI-DNA, resulting in a similarly shifted complex (Fig. 3A, lanes 7–9, see arrow). MSH2 bound poorly to the *(CTG)50·(CAG)30 SI-DNA (Fig. 3A, lanes 16–18; Fig. 3B). These results suggest that purified hMSH2 binds to the slipped-strand S-DNAs and SI-DNAs. Moreover, MSH2 appears to bind the SI-DNAs in a structure- or sequence-specific manner in which there is preferential binding to looped-out CAG repeats (Fig. 3B and C). Binding specificity To test the specificity of MSH2 binding, competition assays were performed (22,23,31). In these studies, the amount of MSH2 bound to *(CTG)30·(CAG)30 S-DNA, *(CTG)50·(CAG)50 S-DNA, or *(CTG)30·(CAG)50 SI-DNA was examined, when excess gel-purified linear non-treated DNAs, the S-DNAs or the SI-DNAs was introduced into the binding reaction. Such a competition reaction, where MSH2 binding to the *(CTG)30·(CAG)50 SI-DNA is examined, is shown in Figure 4A and B. We found that the linear (CTG)30·(CAG)30 DNA was a poor competitor, while the S-DNA and both SI-DNAs were effective competitors of MSH2 binding. At low concentrations (50-fold molar excess), the (CTG)30·(CAG)50 SI-DNA was more effective than the (CTG)50·(CAG)30 SI-DNA as competitor (Fig. 4B–D). The efficiency of each competitor for MSH2 binding to *(CTG)30·(CAG)30 S-DNA (Fig. 4C) or *(CTG)50·(CAG)50 S-DNA (Fig. 4D) was similar to that observed for binding to *(CTG)30·(CAG)50 SI-DNA (Fig. 4B). These results suggest that the binding of MSH2 to S-DNA and SI-DNAs is specific and is similar to the binding of hMSH2 to single base pair and IDL mismatched nucleotides. 1120 Human Molecular Genetics, 1997, Vol. 6, No. 7 *(CTG)30·(CAG)50 SI-DNA (Fig. 3B), which has an excess of 20 CAG repeats, and suggests that there is an intrinsic difference in the recognition of CAG trinucleotide repeat sequences over CTG trinucleotide repeat sequences. These results also suggest that on either of the homoduplex S-DNAs it is the looped-out CAG, and not the looped-out CTG strand that is being bound preferentially by MSH2. DISCUSSION Figure 3. Binding of MSH2 protein to trinucleotide repeat structures. (A) Band-shift assays were performed as described in Materials and Methods. Gel-purified 32P-labeled trinucleotide repeat DNAs in the linear non-treated (LNT), major S-DNA (S) and major SI-DNA (SI) forms, as indicated. The arrows indicate the band-shifted slipped DNA–hMSH2 complexes, while open arrowheads indicate the linear DNA–MSH2 complexes. (B) Binding titration of MSH2 to *(CTG)30·(CAG)50 SI-DNA or *(CTG)50·(CAG)30 SI-DNA, as indicated, using 0, 485 ng (4.6 pmol), 727.5 ng (6.7 pmol) and 970 ng (9 pmol) of MSH2. (C) Quantification of 3–5 binding assays as shown in (A) and (B), error bars represent plus and minus one standard deviation. To address further the possibility that there is an intrinsic difference in the ability of MSH2 to recognize CTG and CAG intrastranded structures, we tested MSH2 binding to synthetic (CTG)15 and (CAG)15 oligonucleotides (Fig. 5). We found that MSH2 bound to the (CAG)15 oligonucleotide 4- to 5-fold more efficiently than to the (CTG)15 oligonucleotide. This result is similar to the observed preferential binding of the The yeast and human MSH2 proteins are required for the recognition of single base pair mismatched nucleotides, duplex oligonucleotides containing short IDLs, palindromic DNA IDL structures and Holliday junctions (21–23,35). Both purified proteins demonstrated a binding preference for the IDL substrates, and binding appeared to increase with increased loop size. Here, we observed significant and specific MSH2 binding to both slippedstrand S-DNAs and SI-DNAs and minimal binding to linear duplex DNAs. There appears to be a binding hierarchy with (CTG)50· (CAG)50 S-DNA>(CTG)30·(CAG)30 S-DNA ∼(CTG)30·(CAG)50 SI-DNA>(CTG)50·(CAG)30 SI-DNA>>(CTG)50·(CAG)50 linear = (CTG)30·(CAG)30 linear. In addition, we found a direct association between MSH2 binding and the number of trinucleotide repeats. This relationship parallels the effect of repeat length on instability in vivo. The KD of hMSH2 binding to these trinucleotide repeat structures was calculated to be in the high nanomolar range (50–200 nM), which is similar to the mispair binding affinity exhibited by the bacterial MutS protein and our previously reported MSH2 mispair binding affinity (36). The structure(s) within the S-DNA and SI-DNA substrates that are recognized by MSH2 are unknown. However, these structures are likely to be somewhat different from the IDL substrates that previously were shown to be recognized efficiently by the human MSH2 protein (22,23). This is because the slipped-strand DNA structures formed within both S-DNA and SI-DNA substrates are capable of branch migration, which can contribute to the production of a wide variety of structural isomers. Additionally, the looped-out region(s) formed by SI-DNAs are significantly larger than that used in our previous experiments (21–23). Several studies have shown that the efficiency and specificity of substrate recognition and correction varies with the mismatch, loop, DNA structure (hairpin, palindrome or chemical modification) and the sequence context surrounding the mismatch (17–23,26,35–38). These results suggest that MSH2 may recognize a variety of mispair and DNA structures with wide-ranging affinity. It is interesting to note that all of the slipped-strand structures tested here gave rise to a single similarly shifted complex. There are a number of possible DNA structures that might be recognized by MSH2 within these slipped-strand substrates and include: (i) the junction of the interstrand region with the looped-out strand; (ii) a CAG or CTG hairpin/loop; (iii) an A·A, T·T, C·C, C·T, A·C, G·G A·G or G·T mismatch(es) contained in perfectly or imperfectly annealed CTG or CAG hairpins; or (iv) the tip of an intrastrand hairpin. The binding to the (CAG)15 and (CTG)15 oligos appears to diminish a loop junction or three-way junction as the only recognition site since these oligonucleotides are not large enough to form such structures. Our studies show a clear MSH2 binding preference for the CAG strand. Presumably some, as yet undefined, difference in the structure of the two looped-out CTG and CAG strands accounts 1121 Human Genetics, 1997, 6, No. NucleicMolecular Acids Research, 1994, Vol. Vol. 22, No. 1 7 1121 Figure 4. Structure-specific binding by human MSH2 protein. DNA competition assays were performed (23, 24) using gel-purified linear non-treated DNAs (LNT), S-DNAs (S) and SI-DNAs (SI) as competitors. Binding was done as in Figure 3 using 0 and 727.5 ng (6.7 pmol) of MSH2. After binding, the quantity of competitor indicated was added and the mixture was incubated for an additional 15 min. Reaction products were separated by gel electrophoresis, the gels were dried and autoradiographed. (A) Autoradiograph of *(CTG)30·(CAG)50 SI-DNA–MSH2 complex competition assay with 100 and 200 molar-fold amount of competitor; no protein (NP), no competitor (NC). The arrow indicates the band-shifted MSH2–DNA complex. (B) Quantification of *(CTG)30·(CAG)50 SI-DNA–MSH2 complex competition assay, with 50, 100 and 200 molar-fold amount of competitor. (C) Quantification of *(CTG)30·(CAG)30 S-DNA–hMSH2 complex competition assay. (D) Quantification of *(CTG)50·(CAG)50 S-DNA–MSH2 complex competition assay. Quantification of 3–5 competition assays as shown; error bars represent plus and minus one standard deviation. Figure 5. Human MSH2 protein binding to (CAG)15 and (CTG)15. (A) Individual oligonucleotides were twice purified by TLC (SurePure, USB) and radiolabeled to identical specific activity. Binding assays were performed as in Figure 3. DNAs were incubated with 0, 485 ng (4.6 pmol) and 727.5 ng (6.7 pmol) of MSH2. Products were separated on a 4% polyacrylamide gel, dried and exposed for autoradiography. (B) Quantification of 3–5 binding assays as shown in (A); error bars represent plus and minus one standard deviation. for the difference in the recognition by MSH2. While both CTG and CAG DNA strands will form hairpins, the stability of the CTG hairpin is greater than that of the CAG hairpin (39–41). We have shown previously that the CAG strand in the S-DNAs is preferentially sensitive to the single-strand-specific mung bean nuclease (12), suggesting a greater single-stranded character of the looped-out/hairpin CAG relative to the CTG strand. Perfect hairpins formed by the CAG and CTG strands would contain an A·A and T·T mispair respectively, flanked by two C·G base pairs. In simian cells, A·A and T·T mismatches are corrected with 64 and 39% efficiency, respectively (37). Thus, if only these mispairs in their respective looped-out stems were recognized, the preferential binding of MSH2 to the slipped-strand intermediate with an excess of CAG correlates with the efficiency of mispair correction in vivo. Several studies have shown that trinucleotide repeat sequences are unstable to a degree greater than that observed for mono- and dinucleotide repeat tracts in tumors or tumor cell lines that display microsatellite instability (42,43). Furthermore, the relative frequency of insertions versus deletions differed depending on the sequence of the repeat tract (43). Interestingly, neither the DM nor the FRAXA repeat loci displayed large repeat expansions in the MSH2-deficient human tumor cell line Lovo, although shorter instabilities of these repeats was observed (30). These results suggested that an MSH2 deficiency alone does not cause the large changes in trinucleotide repeat lengths typical of DM and FRAXA individuals. However, it is possible for the post-replication mismatch repair process to be unrelated to large trinucleotide expansions, while the mismatch repair genes (proteins) may play a fundamental role. For example, the mismatch repair genes have been found to participate in several other DNA metabolic processes, such as transcription-coupled repair (TCR) (44,45) and the elimination of genetic recombination between the divergent DNA sequences (termed: homoeologous recombination) 1122 Human Molecular Genetics, 1997, Vol. 6, No. 7 (46,47). These latter studies have suggested that the mismatch repair proteins may be part of a system that alters or aborts recombination events initiated between divergent repeated sequences, thereby maintaining the genetic distinction of these sequences as well as the global integrity of chromosome containing such repeats. Furthermore, genetic studies suggest that Saccharomyces cerevisiae MSH2-dependent mismatch repair controls the length of a mitotic genetic recombination tract (48,49). Thus, the observed lack of large trinucleotide expansion in mismatch repair-deficient human tumor cell lines may not address directly the possibility of tissue specificity, altered expression and or mutation of MSH2 during development, the contribution(s) of transcription/TCR, or cells undergoing high rates of recombination as a causes of trinucleotide expansion. Furthermore, while HNPCC patients do not appear to be predisposed to the neurodegenerative diseases associated with expansion of trinucleotide repeat sequences, it should be noted that they are heterozygous carriers, while only homozygous or hemizygous mutation(s) of the mismatch repair genes result in a mismatch repair deficiency. Mismatch repair in human cells appears to be far more complicated than in bacteria. It is now clear that MSH2 forms at least two heteromeric protein complexes that enhance the specificity of MSH2-dependent mispair recognition (28,29,50). These two complexes have overlapping and redundant mispair binding activities (28) that appear to account for the lack of observed mutations in MSH6 and, by analogy MSH3, in HNPCC. Furthermore, while the binding of MSH2 alone appears to be somewhat lower than MSH2 complexed with either MSH3 or MSH6, its fundamental mispair binding activity appears to completely recapitulate the types of binding observed when it is complexed with either MSH3 or MSH6. The enhancement of mispair binding observed for the MSH2–MSH6 and MSH2–MSH3 protein complexes over MSH2 alone has suggested that MSH3 and MSH6 are ‘specificity factors’ that increase the stability of the MSH2 mispair binding activity (28). These observations suggest that MSH2 may provide a model system for mispair binding in the absence of clear knowledge of the appropriate protein specificity partner and that the association of MSH3 and/or MSH6 with MSH2 may broaden the number of mismatch repair proteins (genes) that might contribute to trinucleotide repeat expansion. Our results suggest that slipped intermediates having an excess of CTG repeats may escape detection more frequently than those with CAG repeats. This asymmetry may indicate that it is the CTG hairpin, rather than the CAG hairpin, which determines trinucleotide instability at neurodegenerative disease loci. These results appear to parallel the observation that certain palindromic sequences can escape repair in yeast and mammalian cells (51,52). An alternative possibility is that MSH2 either alone or in a complex form may stabilize or protect slipped-strand CAG loops from repair, thus allowing replication to finalize an expansion. It is also possible that MSH2 binding to the slipped-strand CAG loops exposes the looped-out CTG strand to resolvases. Such asymmetric recognition may account for strand-specific differences in the replication fidelity (8,53), inequality in mutation rates between complementary DNA strands (54,55) and perhaps the polar mutability of trinucleotide repeat tracts observed in vivo (5–8). Finally, our observation that MSH2 binding increases with increasing length of the slipped-strand structure suggests that the role of MSH2 in trinucleotide expansion may become more important with longer repeat tracts. Whether MSH2 or other mismatch repair factors are intimately involved in the biology of trinucleotide expansion clearly requires further study. MATERIALS AND METHODS Plasmids and DNA treatments The DM clones have been described previously (12) (see also Fig. 2). Isolation, radiolabeling and gel purification were as described (12,31,32). Denaturation and renaturation for reduplexing and heteroduplexing DNAs was by alkali treatment (pH 13) and neutralization (pH 8) with T.E. at 68C, as described in detail (12). hMSH2 expression and purification MSH2 protein was expressed in a baculovirus system and purified by a modification of the procedure described by Fishel et al. (28). Band-shift assay Band-shift assays were performed (23,24) using 1.9 pmol/reaction and 2.4 pmol/reaction of gel-purified 32P-labeled 30×30 or 50×50 DNAs, or 1.4 pmol/reaction of oligonucleotide. DNAs were incubated with 0–970 ng (0–9 pmol) of purified MSH2 (97 ng/µl) in binding buffer [20 mM potassium phosphate (pH 7.5), 50 mM KCl, 0.1 mM dithiothreitol (DTT), 1 mM EDTA, and 23% glycerol], plus 150 pmol of double-stranded poly(dI–dC) (Pharmacia) in 20 µl reactions, and incubated at 25C for 15 min. Reaction products were separated on a 4% polyacrylamide gel, dried and exposed for autoradiography as described (23,24). All quantification was by PhosphorImage analysis using ImageQuant software (Molecular Dynamics) and represent 3–5 experiments. ACKNOWLEDGEMENTS We thank Tetsuo Ashizawa, Fred Gimble, David L. Nelson, Vladimir Potaman, Stephen T. Warren and Samuel Wilson for critical comments on the manuscript. This work was funded in part by funds from the Institute of Biosciences and Technology and from NIH grant GM52982 (R.R.S.) and CA67007 (R.F.). REFERENCES 1. Ashley, C.T.,Jr and Warren, S.T. (1995) Trinucleotide repeat expansion and human disease. Annu. Rev. Genet., 29, 703–728. 2. Mahadevan, M., Tsilfidis, C., Sabourin, L., Shutler, G., Amemiya, C., Jansen, G., Neville, C., Narang, M., Barceló, J., O’Hoy, K., LeBlond, S., Earle-MacDonald, J., DeJong, P.J., Wieringa, B. and Korneluk, R.G. (1992) Myotonic dystrophy mutation: an unstable CTG repeat in the 3′ untranslated region of the gene. Science, 255, 1253–1255. 3. Richards, R.I. and Sutherland, G.R. (1992) Heritable unstable DNA sequences. Nature Genet., 1, 7–9. 4. Chung, M.-y., Ranum, L.P.W., Duvick, L.A., Servadio, A., Zoghbi, H.Y. and Orr, H.T. (1993) Evidence for a mechanism predisposing to intergenerational CAG repeat instability in spinocerebellar ataxia type I. Nature Genet., 5, 254–258. 1123 Human Genetics, 1997, 6, No. NucleicMolecular Acids Research, 1994, Vol. Vol. 22, No. 1 7 5. Eichler, E.E., Holden, J.J.A., Popovich, B.W., Reiss, A.L., Snow, K., Thibodeau, S.N., Richards, C.S., Ward, P.A. and Nelson, D.L. (1994) Length of uninterrupted CGG repeats determines instability in the FMR1 gene. Nature Genet., 8, 88–94. 6. Kunst, C.B. and Warren, S.T. (1994) Cryptic and polar variation in the fragile X repeat could result in predisposing normal alleles. Cell, 77, 853–861. 7. Snow, K., Tester, D.J., Kruckeberg, K.E., Schaid, D.J. and Thibodeau, S.N. (1994) Sequence analysis of the fragile X trinucleotide repeat: implications for the origin of the fragile X mutation. Hum. Mol. Genet., 9, 1543–1551. 8. Kang, S., Jaworski, A., Oshima, K. and Wells, R.D. (1995) Expansion and deletion of CTG repeats from human disease genes are determined by the direction of replication in E.coli. Nature Genet., 10, 213–218. 9. Jaworski, A., Rosche, W.A., Gallibolian, R., Kang, S., Shimizu, M., Bowater, R.P., Sinden, R.R. and Wells, R.D. (1995) Mismatch repair in Escherichia coli enhances instability of (CTG)n triplet repeats from human hereditary diseases. Proc. Natl Acad. Sci. USA, 92, 11019–11023. 10. Sinden, R.R. and Wells, R.D. (1992) DNA structure, mutations, and human genetic disease. Curr. Opin. Biotechnol., 3, 612–622. 11. Wells, R.D. and Sinden, R.R. (1993) In Davies, K.E. and Warren, S.T. (eds), Genome Analysis, Genome Rearrangement and Stability. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, vol. 7, pp. 107–138. 12. Pearson, C.E. and Sinden, R.R. (1996) Alternative structures in duplex DNA formed within the trinucleotide repeats of the myotonic dystrophy and fragile X loci. Biochemistry, 35, 5041–5053. 13. Richards, R.I. and Sutherland, G.R. (1994) Simple repeat DNA is not replicated simply. Nature Genet., 6, 114–116. 14. Pearson, C.E. Eichler, E.E., Lorenzetti, D., Acharya, S., Kramer, P.R., Kramer, S.F., Nelson, D.L., Zoghbi, H.Y. and Sinden, R.R. (1996) Slipped strand structures in neurodegenerative disease-associated trinucleotide repeats: a role for human mismatch repair and cryptic interruptions. Am. J. Hum. Genet., 59 (suppl.), A48, 244. 15. Fishel, R., Lescoe, M.K., Rao, M.R.S., Copeland, N.G., Jenkins, N.A., Garber, J., Kane, M. and Kolodner, R. (1993) The human mutator gene homolog MSH2 and its association with hereditary nonpolyposis colon cancer. Cell, 75, 1027–1038. 16. Leach, F.S. et al., (1993) Mutations of a mutS homolog in hereditary nonpolyposis colorectal cancer. Cell, 75, 1215–1225. 17. Su, S.-S. and Modrich, P. (1986) Escherichia coli mutS-encoded protein binds to mismatched DNA basepairs. Proc. Natl Acad. Sci. USA, 83, 5057–5061. 18. Su, S.-S., Lahue, R.S., Au, K.G. and Modrich, P. (1988) Mispair specificity of methyl-directed DNA mismatch correction in vitro. J. Biol. Chem., 263, 6829–6835. 19. Jiricny, J., Su, S.-S., Wood, S.G. and Modrich, P. (1988) Mismatch-containing oligonucleotide duplexes bound by the E.coli mutS-encoded protein. Nucleic Acids Res., 16, 7843–7853. 20. Parker, B.O. and Marinus, M.G. (1992) Repair of DNA heteroduplexes containing small heterologous sequences in Escherichia coli. Proc. Natl Acad. Sci. USA, 89, 1730–1734. 21. Alani, E., Chi, N.-W. and Kolodner, R.D. (1995)The Saccharomyces cerevisiae Msh2 protein specifically binds to duplex oligonucleotides containing mismatched DNA base pairs and insertions. Genes Dev., 9, 234–247. 22. Fishel, R., Ewel, A. and Lescoe, M.K. (1994) Purified human MSH2 protein binds to DNA containing mismatched nucleotides. Cancer Res., 54, 5539–5542. 23. Fishel, R., Ewel, A., Lee, S., Lescoe, M.K. and Griffith, J. (1994) Binding of mismatched microsatellite DNA sequences by the human MSH2 protein. Science, 266, 1403–1405. 24. Bhattacharyya, N.P., Skandalis, A., Ganesh, A., Groden, J. and Meuth, M. (1994) Mutator phenotype in human colorectal carcinoma cell lines. Proc. Natl Acad. Sci. USA, 91, 6319–6323. 25. Parsons, R., Li, G.-M., Longely, M.J., Fang, W.-H., Papadopoulos, N., Jen, J., de la Chapelle, A., Kinzler, K.W., Vogelstein, B. and Modrich, P. (1993) Hypermutability and mismatch repair deficiency in RER+ tumor cells. Cell, 75, 1227–1236 26. Umar, A., Boyer, J.C. and Kunkel, T.A. (1994) DNA loop repair by human cell extracts. Science, 266, 814–816. 27. Fishel, R. and Kolodner, R.D. (1995) Identification of mismatch repair genes and their role in the development of cancer. Curr. Opin. Genet. Dev., 5, 382–395. 28. Acharya, S., Wilson, T., Gradia, S., Kane, M.F., Guerrette, S., Marsischky, G.T., Kolodner, R. and Fishel, R. (1996) hMSH2 forms specific mispair-binding complexes with hMSH3 and hMSH6. Proc. Natl Acad. Sci. USA, 93, 13629–13634. 29. Drummond, J.T., Li, G.-M., Longley, M.J. and Modrich, P.(1995) Isolation of an hMSH2–p160 heterodimer that restores DNA mismatch repair to tumor cells. Science, 268, 1909–1912. 1123 30. Kramer, P.R., Pearson, C.E. and Sinden, R.R. (1996) Stability of triplet repeats of myotonic dystrophy and fragile X loci in human mutator mismatch repair cell lines. Hum. Genet., 98, 151–157. 31. Pearson, C.E., Ruiz, M.T., Price, G.B. and Zannis-Hadjopoulos, M. (1994) Cruciform DNA binding protein in HeLa cell extracts. Biochemistry, 33, 14185–14196. 32. Pearson, C.E., Zannis-Hadjopoulos, M., Price, G.B. and Zorbas, H. (1994) A novel type of interaction between a cruciform DNA structure and a cruciform binding protein from HeLa cells. EMBO J., 14, 1571–1580. 33. Gaillard, C. and Strauss, F. (1994) Association of poly(CA)·(poly(TG) DNA fragments into four-stranded complexes bound by HMG1 and 2. Science, 264, 433–436. 34. Belotserkovskii, B.P. and Johnston, B.H. (1996) Polypropylene tube surfaces may induce denaturation and multimerization of DNA. Science, 271, 222–223. 35. Alani, E., Lee, S., Kane, M.F., Griffith, J. and Kolodner, R. (1997) Saccharomyces cerevisiae MSH2, a mispair recognition protein, also recognizes Holliday junctions. J. Mol. Biol., 265, 289–301 36. Mello, J.A., Acharya, S. Fishel, R. and Essigman, J.M. (1996) The mismatch repair protein hMSH2 binds selectively to DNA adducts of the anticancer drug cisplatin. Chem. Biol., 3, 579–589. 37. Brown, T.C. and Jiricny, J. (1988) Different base/base mispairs are corrected with different efficiencies and specificities in monkey kidney cells. Cell, 54, 705–711. 38. Jones, M., Wagner, R. and Radman, M. (1987) Repair of a mismatch is influenced by the base composition of the surrounding nucleotide sequence. Genetics, 115, 605–610. 39. Gacy, A.M., Goellner, G., Juranic, N., Macura, S. and McMurray, C.T. (1995) Trinucleotide repeats that expand in human disease form hairpin structures in vitro. Cell, 81, 533–540. 40. Mitas, M., Yu, A., Dill, J., Kamp, T.J., Chambers, E.J. and Haworth, I.S. (1995) Hairpin properties of single-stranded DNA containing a GC-rich triplet repeat: (CTG)15. Nucleic Acids Res., 23, 1050–1059. 41. Smith, G.K., Jie, J., Fox, G.E. and Gao, X. (1995) DNA CTG triplet repeats involved in dynamic mutations of neurobiologically related gene sequences form stable duplexes. Nucleic Acids Res., 23, 4303–4311. 42. Wooster, R., Cleton-Jansen, A.-M., Collins, N., Mangion, J., Cornelis, R.S., Cooper, C.S., Gusterson, B.A., Ponder, B.A.J., von Demling, A., Wiestler, O.D., Cornelisse, C.J., Devilee, P. and Stratton, M.R. (1994) Instability of short tandem repeats (microsatellites) in human cancers. Nature Genet., 6, 152–156. 43. Shibata, D., Peinado, M.A., Ionov, Y., Malkhosyan, S. and Perucho, M. (1994) Genomic instability in repeated sequences is an early somatic event in colorectal tumorigenesis that persists after transformation. Nature Genet., 6, 273–281. 44. Mellon, I. and Champe, G.N. (1995) Products of DNA mismatch repair genes mutS and mutL are required for transcription-coupled nucleotide excision repair of the lactose operon in Escherichia coli. Proc. Natl Acad. Sci. USA, 93, 1292–1297. 45. Mellon, I., Deepak, K.R., Koi, M.C., Boland, R.C. and Champe, G.N. (1996) Transcription-coupled repair deficiency and mutations in human mismatch repair genes. Science, 272, 557–560. 46. Rayssiguier, C., Thaler, D.S. and Radman, M. (1989) The barrier to recombination between Escherichia coli and Salmonella typhimurium is disrupted in mismatch-repair mutants. Nature, 342, 396–401. 47. Bailis, A.M. and Rothstein, R. (1990) A defect in mismatch repair in Saccharomyces cerevisiae stimulates ectopic recombination between homeologous genes by an excision repair dependent process. Genetics, 126, 535–547. 48. Alani, E., Reenan, R.A.G. and Kolodner, R. (1994) Interaction between mismatch repair and genetic recombination in Saccharomyces cerevisiae. Genetics, 137, 19–39. 49. Datta, A., Adjiri, A., New, L., Crouse, G.F. and Jinks-Robertson, S. (1996) Meiotic crossovers between divergent sequences are regulated by mismatch repair proteins in Saccharomyces cerevisiae. Mol. Cell. Biol., 16, 1085–1093. 50. Palombo, F., Gallinari, P., Iaccarino, I., Lettieri, T., Hughes, M., D’Arrigo, A., Truong, O., Hsuan, J.J. and Jiricny, J. (1995) GTBP, a 160-kilodalton protein essential for mismatch-binding activity in human cells. Science, 268, 1912–1914. 51. Nag, D.K., White, M.A. and Petes, T.D. (1989) Palindromic sequences in heteroduplex DNA inhibit mismatch repair in yeast. Nature, 340, 318–320. 52. Bollag, R.J., Elwood, D.R., Tobin, E.D., Godwin, A.R. and Liskay, R.M. (1992) Formation of heteroduplex DNA during mammalian intrachromasomal gene conversion. Mol. Cell. Biol., 12, 1546–1552. 53. Bohr, V.A. (1991) Gene specific DNA repair. Carcinogenesis, 12, 1983–1992. 54. Trinh, T.Q. and Sinden, R.R. (1991) Preferential DNA secondary structure mutagenesis in the lagging strand of replication in E.coli. Nature, 352, 544–547. 55. Wu, C.-I. and Maeda, N. (1987) Inequality in mutation rates of the two strands of DNA. Nature, 327, 169–170.
© Copyright 2026 Paperzz