Plant Biotechnology Journal (2011), pp. 1–14 doi: 10.1111/j.1467-7652.2011.00628.x Research article Transcript-specific, single-nucleotide polymorphism discovery and linkage analysis in hexaploid bread wheat (Triticum aestivum L.) Alexandra M. Allen1, Gary L.A. Barker1, Simon T. Berry2, Jane A. Coghill1, Rhian Gwilliam3, Susan Kirby3, Phil Robinson3, Rachel C. Brenchley4, Rosalinda D’Amore4, Neil McKenzie5, Darren Waite5, Anthony Hall4, Michael Bevan5, Neil Hall4 and Keith J. Edwards1,* 1 School of Biological Sciences, University of Bristol, Bristol, UK 2 Limagrain UK Limited, Woolpit Business Park, Suffolk, UK 3 KBioscience, Unit 7, Maple Park, Hertfordshire, UK 4 School of Biological Sciences, University of Liverpool, Liverpool, UK 5 John Innes Centre, Norwich Research Park, Norwich, UK Received 5 April 2011; accepted 24 April 2011. *Correspondence (fax 44 117 925 7374; email [email protected]) Keywords: wheat, next-generation sequencing, KASPar genotyping, single-nucleotide polymorphism. Summary Food security is a global concern and substantial yield increases in cereal crops are required to feed the growing world population. Wheat is one of the three most important crops for human and livestock feed. However, the complexity of the genome coupled with a decline in genetic diversity within modern elite cultivars has hindered the application of marker-assisted selection (MAS) in breeding programmes. A crucial step in the successful application of MAS in breeding programmes is the development of cheap and easy to use molecular markers, such as single-nucleotide polymorphisms. To mine selected elite wheat germplasm for intervarietal single-nucleotide polymorphisms, we have used expressed sequence tags derived from public sequencing programmes and next-generation sequencing of normalized wheat complementary DNA libraries, in combination with a novel sequence alignment and assembly approach. Here, we describe the development and validation of a panel of 1114 single-nucleotide polymorphisms in hexaploid bread wheat using competitive allele-specific polymerase chain reaction genotyping technology. We report the genotyping results of these markers on 23 wheat varieties, selected to represent a broad cross-section of wheat germplasm including a number of elite UK varieties. Finally, we show that, using relatively simple technology, it is possible to rapidly generate a linkage map containing several hundred single-nucleotide polymorphism markers in the doubled haploid mapping population of Avalon · Cadenza. Introduction With the world’s population forecasted to reach nine billion by 2050, food security has become a critical global challenge for the 21st century. It has been estimated that cereal production needs to increase by 50% by 2030 (Foresight: The Future of Food and Farming, 2011). Wheat is the dominant cereal crop grown in temperate countries, and globally, one of the three most important crops for human and livestock feed (Shewry, 2009). Consequently, increasing wheat yields is now one of the top priorities for agricultural research (Reaping the benefits: Science and the sustainable intensification of global agriculture, 2009). When compared with other crops, the increase in wheat yields has slowed since the ‘green revolution’ of the 20th century (Alston et al., 2010). While diploid crops such as rice and barley have benefited from extensive genetic analysis and molecular breeding programmes, the complexity of the wheat genome has hindered these types of studies. The allohexaploid genome of modern bread wheat (AABBDD) is derived from the hybridization of the diploid DD genome of Aegilops tauschii with the AABB tetraploid genome of Triticum turgidum (Dubcovsky and Dvorak, 2007). The genetic bottleneck caused by this polyploid speciation event approximately 10 000 years ago has been intensified by the domestication of wheat, resulting in a decline of genetic diversity within modern inbred, elite cultivars (Haudry et al., 2007). To increase the genetic diversity of bread wheat, it has been suggested that greater use could be made of the various wild relatives of wheat and that by introducing genes from these relatives it might be possible to tap novel sources of stress, pest and disease resistance (Reynolds et al., 2011). However, such strategies, sometimes referred to as pre-breeding, can be resource intensive (Valkoun, 2001). A crucial step in making wheat prebreeding more efficient is the development of molecular markers capable of tracking the introduced genomic regions in large numbers of lines. There are several challenges to developing molecular markers in a polyploid species such as wheat, for instance, the molecular markers must be capable of distinguishing between the relatively large numbers of polymorphisms seen in homoeologous and paralogous genes compared with ª 2011 The Authors Plant Biotechnology Journal ª 2011 Society for Experimental Biology, Association of Applied Biologists and Blackwell Publishing Ltd 1 2 Alexandra M. Allen et al. the relatively infrequent varietal polymorphisms (Barker and Edwards, 2009). Homoeologous ⁄ paralogous and varietal singlenucleotide polymorphisms (SNPs) have previously been studied and used in polyploid crops (Akhunov et al., 2009; Bernardo et al., 2009; Bundock et al., 2009; Edwards et al., 2009; Imelfort et al., 2009); however, these studies have also shown that distinguishing intervarietal markers from intergenomic polymorphisms is complicated and prone to error. For example, a previous in silico study by our group showed that of the 71 000 putative SNPs identified in the wheat EST database, only 3500 appear to be intervarietal (5%) with the majority being SNPs in homoeologous and paralogous genes (Barker and Edwards, 2009). These results, together with the large size of the wheat genome (17 300 Mb), mean that despite the global importance of wheat, there are still relatively few validated varietal SNP markers in regular use. While this situation remains, it is likely that, when compared with other crops such as maize and rice, wheat will continue to lag behind in terms of markerassisted selection in breeding programmes (Mochida and Shinozaki, 2010). The development of SNP-based genotyping platforms has lead to an increase in the number of protocols available for analysing the genetic variation in numerous species (Perkel, 2008). However, as with the development of SNPs, large-scale genotyping in polyploid species is still a significant challenge because of the presence of the homoeologous and paralogous genes. Despite these challenges, a number of platforms have recently been developed to perform high-density genotyping (large numbers of SNPs, with small numbers of individual plants), and these have been successfully employed to genotype wheat (Akbari et al., 2006; Akhunov et al., 2010; Bérard et al., 2009; Chao et al., 2010). However, these technologies can be difficult to optimize and as such they have yet to be generally adopted by the wheat community leaving few options for wheat breeders and geneticists who wish to carry out mediumto low-density genotyping on large or very large numbers of individual plants. In the present study, we undertook to mine the UK wheat germplasm for varietal SNPs. To do this, we used both the available wheat expressed sequence tags (ESTs) present in the public database and next-generation sequencing (NGS), together with a novel sequence alignment and assembly approach to identify varietal SNPs in lines of interest to European wheat breeders. We then investigated whether these SNPs could be validated by a relatively new, high-throughput genotyping procedure, the KBioscience Competitive Allele-Specific polymerase chain reaction (KASPar) assay (Orrù et al., 2009). Finally, we made use of the existing Avalon · Cadenza mapping population to examine whether the validated SNPs could be efficiently placed onto the existing linkage groups identified within this doubled haploid population. whole-seedling cDNA from five wheat varieties (Avalon, Cadenza, Rialto, Savannah, Recital) on the Illumina GAIIx platform (Table 1). For each line, we generated between 24 and 45 million, 75-base pair (bp) paired-end reads. SNP discovery, carried out as described in experimental procedures, resulted in the identification of 14 078 putative SNPs in 6255 distinct reference sequences covering 2.7 megabases in total. On average, this equates to five varietal SNPs per kilobase in the reference sequences containing one or more SNP (Table S2). SNP validation and characterization A subset of 1659 putative SNPs were selected for validation with genomic DNA. Of these, 213 (from 199 contigs covering 25 773 bases) were derived from sequences obtained from the NCBI database and 1446 (1237 contigs covering 5 59 059 bases) from the NGS carried out as part of this study. For the SNPs mined from the NGS data, these were selected on the basis of their predicted polymorphism level in the sequenced varieties. In most cases, loci were limited to one SNP per cDNA contig, to maximize the genome coverage. SNPs were validated using the KASPar genotyping platform on 21 hexaploid wheat varieties, a diploid and a tetraploid wheat (Table 2). The wheat varieties were selected following a survey of UK academics and wheat breeders to ensure that material of use to the whole community was represented, e.g. the inclusion of parents of other published mapping populations. Of the 1659 putative varietal SNPs, 1114 (67%) were polymorphic between the different varieties, 70 (4%) were monomorphic in the hexaploid varieties and 475 (29%) failed to generate a useful amplification signal (Table S3). Where a SNP probe failed to generate a useful signal, no attempt was made to redesign the primer. To compare the different SNP generation techniques, these statistics were assessed separately. Of the 213 SNPs generated from the EST database, 174 (82%) were polymorphic between the 23 varieties, four were monomorphic (2%) and 35 failed to generate a useful signal (16%; Table S1), while of the 1446 SNPs from NGS data, 940 (65%) were polymorphic, 66 were monomorphic (4.5%) and 440 failed to generate a useful signal (30%; Table S2). Based upon the SNP genotyping data of the 23 varieties screened, the polymorphism information content (PIC) values of the validated SNPs varied from 0.080 to 0.375 with an average value of 0.300 (Figure 1). The PIC values of the SNPs generated from the EST database (average 0.306) were not significantly different from those generated from NGS data (average 0.298). To test whether the non-polymorphic SNPs were homoeologous rather than homologous SNPs, we screened 28 on the Chinese Spring nullisomic lines (Sears, 1954; Devos et al., Table 1 Summary of wheat lines used and next-generation sequences obtained Results No. sequences SNP discovery Two complimentary approaches were taken to identify putative varietal SNPs. First, screening of the publicly available wheat EST database (generated from numerous cDNA libraries, from a random collection of varieties relating to different stages of grain development and various stress treatments) yielded 3500 putative varietal SNPs in 8668 sequences (Table S1). These data were supplemented by sequencing normalized uniquely assigned No. of SNPs Variety No. of sequences to reference (compared to Avalon) Avalon 33 199 506 17 419 590 N⁄A Cadenza 34 296 300 16 938 920 3251 Rialto 25 522 334 13 241 931 2944 Savannah 24 349 192 12 271 061 3492 Recital 43 961 514 22 151 006 3792 ª 2011 The Authors Plant Biotechnology Journal ª 2011 Society for Experimental Biology, Association of Applied Biologists and Blackwell Publishing Ltd, Plant Biotechnology Journal, 1–14 Wheat SNP markers 3 Table 2 Details of wheat varieties used in this study Wheat Line Breeder ⁄ Source Ploidy Planting season Pedigree Avalon PBIC Hexaploid Winter Maris-Ploughman · Bilbo Alchemy Limagrain UK Ltd Hexaploid Winter Claire · (Consort · Woodstock) Alcedo NIAB Hexaploid Winter (Record, AUT · Poros) · Carstens-VIII Bacanora CIMMYT Hexaploid Spring (Jupateco-73 · BlueJay) · Ures-81 Brompton Elsoms Seeds Ltd Hexaploid Winter CWW 92.1 · Caxton Cadenza PBIC Hexaploid Spring Axona · Tonic Chinese Spring (L 42) JIC Hexaploid Spring Chinese Land Race Claire Limagrain UK Ltd Hexaploid Winter Wasp · Flame Hereward RAGT seeds Ltd Hexaploid Winter Norman ‘sib’ · Disponent Opata CIMMYT Hexaploid Spring Bluejay · Jupateco-73 Recital C.C. Benoist Hexaploid Winter? Mexique-267 (R-267) · 9369 Renan INRA Hexaploid Winter (Mironovskaya-808 · Maris-Huntsman) · Courtot Rialto PBIC Hexaploid Winter Haven · Fresco Robigus KWS UK Ltd Hexaploid Winter Z836 · 1366 (PUTCH) Riband · Brigadier Savannah Zeneca Seeds UK Ltd. Hexaploid Winter Shamrock Verneuil Recherche Hexaploid Winter COMPL TIG 323-1-3m · CWW 4899 ⁄ 25 Soissons Maison Florimond Desprez, Franc Hexaploid Winter Iena (Jena) · HN35 Synthetic JIC Hexaploid Spring Unknown Weebil CIMMYT Hexaploid Spring Unknown Xi19 Limagrain UK Ltd Hexaploid Winter (Cadenza · Rialto) · Cadenza TA1618 WGGRC Diploid Spring Aegilops Tauschii (D genome) Croc WGGRC Tetraploid Spring Triticum turgidum subsp. Dicoccoides (AB genome) Synthetic 39 WGGRC Hexaploid Spring TA1618 · Croc PBIC, Plant Breeding International Cambridge, Ltd. (UK); NIAB, National Institute of Agricultural Botany; CIMMYT, International Maize and Wheat Improvement Centre; JIC, John Innes Centre; INRA, French National Institute for Agricultural Research; WGGRC, Wheat Genetic and Genomic Resources Center. 250 Hereward and Shamrock. The synthetic varieties formed a clear outlying group, with the other varieties falling into two main groupings. The first included subgroups of Renan, Robigus and Avalon and Hereward, Shamrock, Brompton, Rialto and Alchemy. The second group contained more diverse material including the French lines Recital and Soissons, and the German variety Alcedo. 200 Genetic map construction 150 An Avalon · Cadenza doubled haploid population comprising 190 individuals was scored for 548 loci (53 from the EST data set and 495 from the NGS data set) using KASPar genotyping. Of these, 38 loci showed significant segregation distortion and were removed from the data set before constructing the linkage groups. Using the information generated, in conjunction with the 574 non-SNP markers previously available for this population, we were able to map 480 SNP markers (50 from the EST data set and 430 from the NGS data set) to 21 linkage groups representing chromosomes. A further 20 loci mapped to unassigned linkage groups and 10 loci were unlinked (Table 3, Figure 3, Table S4). The linkage groups ranged from 53.0 to 240.5 centiMorgans (cM) in size, with 9–108 markers. The total map length was 2999 cM, with an average spacing of 2.8 cM between loci. The total map lengths of each of the three genomes were approximately similar: 1131 cM for the A genome, 1075 cM for the B genome and 792 cM for the D genome (Table 3). However, there was a significant difference in the distribution of SNP marker loci between the three genomes with the A genome having 354 markers (163 SNPs), the B genome having 526 markers (267 SNPs) and the D genome having 174 450 400 Number of markers 350 300 100 50 0 0.0–0.10 0.10–0.15 0.15–0.20 0.20–0.25 0.25–0.30 0.31–0.35 0.35–0.40 Polymorphism information content (PIC) Figure 1 Distribution of polymorphism information content (PIC) of SNPs among the 23 wheat varieties. The PIC was calculated for each marker according to Botstein et al. (1980), using Excel Microsatellite Toolkit add-in software (Park, 2001). 1999); of these, we were able to confirm that 26 (93%) were in fact homoeologous SNPs and were therefore specific for either the A, B or D genome, but not a specific variety. To investigate the genetic relationship between the 23 lines, we carried out a hierarchical cluster analysis using all 1114 validated SNPs between the varieties used in this study (Figure 2). Several varieties clustered tightly together including the pairs of Cadenza and Xi19; Claire and Alchemy; Opata and Weebil and ª 2011 The Authors Plant Biotechnology Journal ª 2011 Society for Experimental Biology, Association of Applied Biologists and Blackwell Publishing Ltd, Plant Biotechnology Journal, 1–14 4 Alexandra M. Allen et al. Figure 2 Indicator of genetic relationships between the of 23 wheat lines screened with 1114 KASPar probes. Hierarchical cluster analysis was performed in PASW (SPSS Inc.) using the Squared Euclidean Distance method. markers (50 SNPs). The homoeologous chromosome groups also showed variation in size and marker density; the combined group 5 chromosome linkage maps measured the largest at 613 cM, while the group 4 linkage groups measured a total of 267 cM. Marker density was highest in the group 1 linkage maps (average 1.7 cM between loci) and lowest in the group 7 linkage maps (average 3.8 cM between loci). Our mapped SNP data set contained 48 loci that we had provisionally mapped to specific chromosome regions using the Kansas deletion lines (Endo and Gill, 1996; Qi et al., 2003). Comparison of the map locations determined using both the Kansas deletion lines and the genetic mapping confirmed the map location for 29 of the 48 SNP markers (60%). A further 10 SNP loci mapped to the same homoeologous group, which gives a total of 81% of the loci in common mapping to the same homoeologous group. To extend this analysis, we also BLASTN searched the 500 mapped SNP marker reference sequences against the deletion-mapped ESTs produced by Qi et al. (2004). Of the 155 SNP markers contained within these two data sets, 107 (69%) mapped to the same homoeologous group. The same 500 mapped sequences were also BLASTN searched against the Brachypodium distachyon genome sequence (International Brachypodium Initiative., 2010) resulting in 266 matches. A syntenic match was assumed where two or more consecutive wheat mapped SNPs matched the same B. distachyon chromosome. These criteria resulted in 205 of the 266 mapped wheat SNPs being placed on the B. distachyon genome. SNP coverage was generally too low to explore synteny in detail, but where sufficient markers existed, as for wheat linkage group 2 versus B. distachyon chromosome 5, the syntenic relationship was consistent with known chromosome relationships (Figure 5). The putative functions of 500 sequences containing validated and mapped SNP markers were determined, via BLASTX searches, against the National Centre for Biotechnology Information (NCBI) non-redundant protein sequence database using an e-value cut-off of 10)5 (Table S5). This search indicated that 213 sequences (43%) had detectable similarity to sequences in the database, while 287 sequences (57%) failed to detect any sequences with a similarity at the level at or above the cut of value. Of the 213 sequences with detectable sequence similarities, we were able to identify 184 unique hits, as determined by the SNP having either a unique hit in the database or having a similar hit, but with a different map location. The remaining 29 SNPs were distributed between 13 contigs; one contig having four SNPs, one contig having three SNPs and eleven contigs having two SNPs each. Optimization of primer design for homoeologous-specific KASPar markers Although no attempt was made to design homoeologousspecific probes, 111 of the 1114 validated KASPar probes (10%) appeared to be specific for a single homoeologous gene. In practical terms, these probes can be used to routinely screen for heterozygotes in segregating populations, as well as establishing the purity of inbred lines. Because such probes might be of considerable interest to wheat breeders, we examined whether it was possible to convert generic KASPar probes to homoeologous-specific assays. To do this, we used the publicly available 5 · sequence coverage of the Chinese Spring wheat genome (http://www.cerealsdb.uk.net/search_reads.htm) to design homoeologous-specific KASPar reverse primers near the existing varietal SNP ‘BS00000329’. In its original form, KASPar probe BS00000329 detects either an ‘A’ or an ‘AG’ ª 2011 The Authors Plant Biotechnology Journal ª 2011 Society for Experimental Biology, Association of Applied Biologists and Blackwell Publishing Ltd, Plant Biotechnology Journal, 1–14 Wheat SNP markers 5 Table 3 Summary of linkage groups and mapped loci Linkage Group Chromosome Total Number Average spacing Size Number of Bristol between loci (cM) of Loci SNP loci (cM) SNPs, a technique that yielded 13239 putative varietal SNPs in 6255 sequences, with an average of five varietal SNPs per kilobase across the five varieties sequenced. Both SNP discovery techniques identified a similar number of varietal SNPs per kilobase and are in agreement with previous estimates for varietal SNPs in wheat (Ravel et al., 2007; Barker and Edwards, 2009). 1 1A 104.9 52 28 2.0 2 1B 148.9 108 58 1.4 SNP validation and characterization 3 1D 108.9 50 20 2.2 4 2A 180.5 53 22 3.4 5 2B 181.1 85 43 2.1 6 2D 96.9 35 11 2.8 7 3A 155.5 37 18 4.2 8 3B 224.6 74 33 3.0 9 3D 53.0 15 1 3.5 10 4A 143.1 40 18 3.6 11 4B 53.7 26 16 2.1 12 4D 70.2 9 1 7.8 13 5A 179.5 51 23 3.5 14 5B 240.5 104 53 2.3 15 5D 193.1 20 8 9.7 16 6A 155.7 67 35 2.3 17 6B 132.0 82 37 1.6 18 6D 57.9 10 2 5.8 19 7A 212.3 54 19 3.9 20 7B 94.2 47 27 2.0 21 7D 212.3 35 7 6.1 2.8 A subset of putative varietal SNPs (1659) were selected for validation using the KASPar genotyping platform. The KASPar system gave a conversion rate of 67% (1114 SNPs) using a standard set of PCR conditions, which is extremely important for routine MAS applications. However, this figure of 67% could probably be increased with primer design and PCR optimization if absolutely necessary. This conversion rate is lower than previous studies on diploid species such as mouse (77%; Petkov et al., 2004), but relatively high for a complex polyploid such as wheat (Edwards et al., 2009). Of the putative varietal SNPs, 4% were monomorphic in the hexaploid varieties, but polymorphic between the hexaploid varieties and the diploid and ⁄ or the tetraploid. This pattern suggests that these markers represent intravarietal homoeologous SNPs. To test this theory, we screened 28 markers from this category against the Chinese Spring nullisomic lines and confirmed that 26 (93%) were in fact homoeologous SNPs. These markers are likely to have been misidentified as varietal SNPs in the discovery pipeline. Nevertheless, 4% is low when considering that intravarietal homoeologous SNPs account for 74% of all SNPs in wheat (Barker and Edwards, 2009) and clearly demonstrates that our data sets have been significantly enriched for varietal SNPs. The failure of the remaining SNP markers to validate (29% failed to produce a defined or useful cluster) is likely to be because of the presence of paralogous sequences, primer design issues (such as primers spanning intron ⁄ exon boundaries or incorporating different homoeologous SNPs) and ⁄ or the need to optimize PCR conditions. We made no attempts to optimize any failed assays, and therefore, it is possible that these SNPs could be validated with alternative primers. The PIC values of the validated markers ranged from a minimum of 0.08–0.375, with an average value of 0.300 (Figure 1), a figure comparable to other estimates in wheat (Chao et al., 2009; Edwards et al., 2009). No significant difference was observed between the PIC values of markers from different genomes or different homoeologous groups. A hierarchical cluster analysis performed on the genotyping data at the SNP loci used in this study indicates possible genetic relationships between the 23 lines (Figure 2). Although the SNP markers were selected based upon their polymorphism between just a few lines (Avalon, Cadenza, Rialto, Savannah and Recital), the expected clustering of varieties was observed, suggesting that the SNP markers were representative of the genome. In particular, the close relationship between the varieties Cadenza and Xi19 (derived from a Cadenza cross) and Claire and Alchemy (derived from a Claire cross) is illustrated in the dendrogram. The different varieties are split into two main clades with the synthetics as an outlying group. The first clade includes subgroups of Renan, Robigus and Avalon and Hereward, Shamrock, Brompton, Rialto and Alchemy; all winter wheat varieties. The second clade contains more divergent material including the French lines Recital and Soissons, and the German variety Alcedo. A close relationship in this grouping is indicated between Opata, Weebil and Bacanora, all spring wheat lines Total 2998.8 1054 480 A genome 1131.4 354 163 3.2 B genome 1075.0 526 267 2.0 D genome 792.3 174 50 4.6 Group 1 362.8 210 106 1.7 Group 2 458.4 173 76 2.6 Group 3 433.1 126 52 3.4 Group 4 267.0 75 35 3.6 Group 5 613.1 175 84 3.5 Group 6 345.6 159 74 2.2 Group 7 518.7 136 53 3.8 genotype. Using the available 5 · Chinese Spring genomic sequences, we were able to design a new KASPar reverse primer that converted the original generic probe into a homoeologous-specific KASPar probe detecting either an ‘A’ or a ‘G’ genotype (Figure 5b,c). Discussion SNP discovery This study represents the first large scale assembly of genotyping and genetic map information for elite UK wheat varieties based on individual SNP markers. The SNPs described in this study were mined from two sources; the NCBI wheat EST database and next-generation cDNA sequencing data. Both sources of data specifically targeted the identification of varietal SNPs in coding regions, to generate molecular markers potentially linked to genes of interest and include both synonymous and nonsynonymous point mutations. The EST data yielded 3500 putative varietal SNPs from 8668 sequences, with an average of 4.3 varietal SNPs per kilobase (Barker and Edwards, 2009). The existing EST data were supplemented by mining NGS data for varietal ª 2011 The Authors Plant Biotechnology Journal ª 2011 Society for Experimental Biology, Association of Applied Biologists and Blackwell Publishing Ltd, Plant Biotechnology Journal, 1–14 6 Alexandra M. Allen et al. 1A 1B Xgdm033 Xpsp2999 Xgwm136 Xcfd015 XBS00004382 XBS00001635 XBS00010039 XBS00012226 Xwmc336 XBS00010054 XBS00003761 XBS00003572 Xbarc119 XBS00009926 XBS00009885 XBS00003851 XBS00009866 XBS00010004 XBS00010452 XBS00009495 XBS00009444 XBS00010416 XBS00010595 XBS00010481 XBS00003758 XBS00009603 Xpsp3027 Xgwm164 Xgwm498 XBS00000853 XBS00009559 XBS00003829 XBS00003656 Glu-A1 Xwmc093 XBS00009459 XBS00003743 XBS00009700 XBS00003863 Xgwm099 0.0 12.1 15.0 23.0 23.6 28.4 29.0 34.8 50.0 50.6 56.5 57.5 58.1 59.2 63.2 64.3 64.9 65.4 66.6 70.2 70.7 75.7 78.0 84.7 19.3 24.9 39.8 47.8 53.6 56.8 57.3 59.7 61.4 63.7 65.0 67.1 67.7 69.3 69.8 75.7 78.4 80.0 81.1 81.6 86.5 88.8 93.1 106.6 116.4 117.0 143.3 145.6 2A 0.0 Xgwm614 Xbarc124 XBS00003735 XBS00010624 XBS00004385 48.8 56.1 62.0 Xgwm296 XBS00014736 XBS00003845 XBS00003867 XBS00016676 Xgwm359 XBS00009933 74.7 XBS00009730 88.2 90.5 91.1 91.6 92.2 92.7 93.3 98.9 123.5 135.7 138.9 141.0 146.9 160.0 Xgwm445 Xbarc011 XBS00003779 XBS00003574 XBS00003898 XBS00004004 XBS00009256 XBS00003878 XBS00002994 XBS00010672 Xgwm122 Xwmc453 Ppd-A1 Xwmc181 XBS00001260 XBS00003663 XBS00004090 XBS00009959 XBS00009295 180.5 Xgwm526 93.8 0.0 0.5 1.8 6.7 16.9 17.4 25.1 27.3 28.4 43.5 46.4 62.8 73.4 73.9 77.7 78.7 79.3 79.8 80.9 81.4 83.6 90.2 94.1 95.3 125.2 126.3 139.8 140.3 153.2 162.7 169.4 170.0 XBS00003806 0.0 16.9 Xcfd015 24.5 26.9 38.0 43.1 56.0 57.1 57.6 58.8 64.2 64.7 65.3 69.1 Xwmc336 XBS00003751 Xcfd021 XBS00010661 XBS00010486 Xgwm458 Xbarc240 Xcfd019 Glu-D1 XBS00004282 XBS00003558 XBS00003559 XBS00003816 Xgdm111 XBS00009412 XBS00010402 XBS00010498 XBS00003725 XBS00003573 XBS00010669 XBS00009375 XBS00003687 XBS00009625 XBS00010713 XBS00000424 XBS00009389 107.2 108.4 108.9 2B 24.7 25.2 30.2 51.2 1D Xgwm033 Xpsp3000 Xgwm264 XBS00003930 XBS00010592 XBS00003831 XBS00001634 XBS00010076 XBS00003880 XBS00003917 XBS00010508 XBS00010032 XBS00003684 XBS00021052 XBS00003619 XBS00009381 XBS00014413 XBS00003839 XBS00003822 XBS00000496 Xgwm011 Xgwm018 Xbarc240 XBS00009675 XBS00009977 XBS00003575 XBS00003844 XBS00009921 XBS00009502 XBS00009952 XBS00010549 XBS00009547 XBS00009270 XBS00003567 XBS00003634 XBS00003677 XBS00003600 XBS00003627 XBS00009945 XBS00001224 XBS00012056 Xwmc419 XBS00010779 XBS00010475 XBS00012456 XBS00010888 XBS00001414 XBS00010625 XBS00010536 Glu-B1 XBS00003934 XBS00009706 XBS00010009 XBS00004328 XBS00012502 XBS00000651 XBS00010436 Xcfd002 XBS00003633 XBS00002250 XBS00004129 XBS00009909 XBS00010610 XBS00009709 Xwmc044 XBS00003702 Xbarc080 XBS00001128 0.0 13.2 14.4 15.8 18.6 2D Xbarc035 Xgwm210 XBS00010734 Xwmc489 XBS00009972 XBS00010318 XBS00002660 XBS00003833 XBS00010637 XBS00012209 XBS00010700 Xwmc243 XBS00009801 XBS00003719 XBS00009848 Xbarc010 Xgwm257 XBS00010055 XBS00004433 XBS00009305 XBS00003714 XBS00003810 XBS00009722 XBS00010012 XBS00003788 XBS00009989 Xwmc175 Xgwm501 XBS00003585 XBS00009247 XBS00009461 Xcfd073 Xgwm526 XBS00004405 XBS00003597 XBS00010361 XBS00010640 XBS00009791 XBS00009864 XBS00003589 XBS00003673 XBS00009437 XBS00009519 XBS00010081 XBS00009460 Xbarc159 XBS00010685 XBS00003584 XBS00010323 XBS00010442 XBS00009901 XBS00009533 XBS00009637 XBS00010393 0.0 5.2 15.3 17.6 22.6 25.1 27.3 Xbarc124 Xcfd056 XBS00009771 XBS00003791 XBS00009976 Xcfd036 Xcfd044 Xgwm261 Xgwm503 Xgwm296 Xgwm132 65.9 69.6 72.3 76.3 76.9 78.1 81.7 83.9 87.2 87.7 92.0 96.9 Xcfd161 Xgwm539 Xbarc011 Xwmc245 XBS00000905 XBS00021483 Xwmc018 XBS00009932 XBS00000002 XBS00003804 XBS00010514 XBS00010681 XBS00003660 Xgwm102 Xwmc453 0.0 Xgwm349 10.1 Xgwm320 Figure 3 Genetic linkage maps of wheat derived from 190 Avalon · Cadenza doubled haploid lines. Each linkage group was assigned to a chromosome indicated above the linkage group and chromosomes are arranged with the short arm above the long arm. Simple sequence repeat (SSR) markers are shown in red, known genes are shown in green and SNP loci mapped in this study are shown in black (designated XBS). Map distances, calculated using the Kosambi mapping function (Kosambi, 1944), are shown in centiMorgans (cM). ª 2011 The Authors Plant Biotechnology Journal ª 2011 Society for Experimental Biology, Association of Applied Biologists and Blackwell Publishing Ltd, Plant Biotechnology Journal, 1–14 Wheat SNP markers 7 3A XBS00003774 XBS00009793 XBS00010059 XBS00003733 0.0 1.1 23.6 XBS00000628 37.9 48.0 60.8 61.4 63.6 66.5 68.7 69.3 71.0 75.4 77.1 77.6 102.8 Xgwm369 Xcfd079 Xwmc505 XBS00003801 Xbarc019 Xwmc489 XBS00003837 XBS00003964 XBS00009465 XBS00011012 Xwmc264 XBS00010022 XBS00003778 XBS00003768 XBS00003924 XBS00010414 XBS00000646 Xgwm155 128.1 XBS00009649 155.5 XBS00003849 78.7 0.0 12.0 14.8 18.5 26.1 26.6 27.7 32.3 46.8 47.3 49.0 49.6 50.2 51.3 73.4 83.3 89.3 90.9 92.0 92.6 93.2 94.8 95.4 3B 95.9 97.5 98.6 99.7 100.3 103.0 114.1 138.8 156.7 157.8 178.0 181.4 183.0 4A Xcfd079 XBS00004108 XBS00009839 Xgwm493 XBS00003596 XBS00001335 XBS00012316 Xgwm389 Xbarc075 XBS00009992 XBS00012531 XBS00003814 Xbarc133 XBS00009393 XBS00004074 XBS00003902 XBS00003871 XBS00003864 XBS00003708 Xgwm285 XBS00010715 XBS00009289 Xbarc164 XBS00009475 XBS00009651 XBS00009579 XBS00000793 XBS00012452 XBS00003781 XBS00003727 XBS00012388 XBS00009892 XBS00001242 XBS00020861 XBS00000978 Xgwm131 XBS00010572 Xwmc326 XBS00003836 XBS00003931 XBS00003717 XBS00003884 Xgwm547 5.3 12.0 20.2 Xgwm052 Xcfd004 Xgwm456 Xcfd062 Xgdm072 Xwmc505 Xgwm645 Xgwm314 50.3 57.3 Xcfd009 XBS00003688 0.0 4.6 4B 4D XBS00003914 Xcfd002 0.0 XBS00009480 XBS00010115 13.2 14.3 XBS00003776 XBS00001207 10.8 11.4 12.5 13.7 16.5 17.6 18.2 20.3 26.3 27.9 XBS00010339 XBS00009974 XBS00001241 23.0 24.7 XBS00009253 XBS00003646 Xbarc020 Xwmc089 XBS00009530 XBS00009915 XBS00003858 XBS00009672 Xgwm513 XBS00010409 XBS00003759 XBS00010067 XBS00003765 XBS00004407 33.4 36.1 38.4 XBS00010455 Xdupw004 XBS00003623 Xbarc170 34.5 Xwmc349 40.6 Xgwm538 43.9 45.0 XBS00003674 XBS00010504 51.2 Xwmc258 53.7 56.1 Xwmc161 XBS00003704 XBS00003927 XBS00009333 59.9 XBS00010582 0.0 1.2 3D 0.0 Xgwm350 5.0 5.6 6.7 7.2 XBS00003692 XBS00003587 XBS00009703 XBS00009680 13.7 Xbarc078 17.5 Xgwm160 31.9 XBS00000346 XBS00009882 XBS00009590 Xbarc070 22.5 0.0 Rht-D1 11.2 12.5 15.7 Xcfd023 Xwmc089 XBS00003854 56.3 Xgwm194 70.3 Xgwm624 Figure 3 Continued ª 2011 The Authors Plant Biotechnology Journal ª 2011 Society for Experimental Biology, Association of Applied Biologists and Blackwell Publishing Ltd, Plant Biotechnology Journal, 1–14 8 Alexandra M. Allen et al. 5A 5B 0.0 6.8 Xbarc010 XBS00004373 30.8 XBS00003696 55.6 58.5 69.5 76.0 77.1 81.8 82.9 84.0 89.8 92.5 95.3 Xgwm595a Xgwm129 Xwmc489 Xwmc150 Xgwm293 XBS00005311 XBS00000645 Xgwm156 XBS00015653 XBS00000373 Xgwm595b Xgwm186 XBS00009658 XBS00003746 XBS00009531 XBS00000435 XBS00000648 Xgwm617 104.0 107.2 113.2 118.7 Vrn-A1 XBS00011360 XBS00010702 XBS00010419 XBS00001735 0.0 0.6 10.3 Xgwm126 Xgwm595c Xgwm291 XBS00010048 XBS00001953 XBS00003693 XBS00000037 XBS00000028 XBS00009274 XBS00003501 XBS00000006 0.0 5.9 18.1 23.2 23.7 97.3 97.9 108.9 109.4 110.2 117.0 118.2 121.8 129.7 130.8 131.9 134.6 136.2 140.5 149.9 152.7 153.2 154.3 6A 6B 0.0 1.7 0.0 6.2 6.7 Xgwm334 XBS00012414 XBS00012150 XBS00003632 XBS00004278 XBS00009759 2.8 3.4 4.6 36.5 38.6 49.9 51.6 62.9 67.1 67.6 68.2 68.7 70.3 81.0 5D XBS00009264 Xgwm234 XBS00003739 Xdupw115 XBS00010491 XBS00003900 XBS00003876 Xbarc074 XBS00009719 XBS00003705 XBS00003681 Xgwm213 XBS00003726 XBS00003715 XBS00003789 XBS00001817 XBS00009607 XBS00009311 XBS00010636 XBS00010675 XBS00010033 XBS00000865 XBS00003736 XBS00009851 XBS00011782 XBS00010341 XBS00003879 XBS00010542 XBS00009931 XBS00010581 XBS00009373 XBS00009581 XBS00009382 XBS00003724 XBS00000701 XBS00010590 XBS00009414 XBS00004427 XBS00003565 Xgwm408 XBS00003904 XBS00011453 XBS00003770 XBS00003637 XBS00003615 XBS00003740 XBS00000592 XBS00010573 XBS00003592 XBS00009621 XBS00010802 XBS00011183 XBS00003655 Xbarc142 XBS00009388 0.0 12.9 16.6 28.8 38.5 56.1 58.3 64.7 66.4 70.0 70.5 71.1 72.2 72.7 84.7 86.5 92.4 93.8 95.7 96.8 XBS00009584 XBS00003635 XBS00003861 Xbarc023 XBS00009746 XBS00009664 XBS00001037 XBS00009376 Xbarc107 Xbarc171 XBS00003581 XBS00001132 XBS00009871 XBS00004377 XBS00003881 XBS00009988 XBS00009783 XBS00003616 XBS00010586 XBS00003916 XBS00003877 XBS00009261 XBS00010730 XBS00001405 XBS00003676 Xgwm570 XBS00003835 104.2 Xgwm169 115.6 Xgwm617 129.0 129.5 130.1 130.6 131.2 136.8 XBS00003339 XBS00010526 XBS00010408 XBS00003818 XBS00003958 XBS00003905 XBS00003723 Xdupw167 40.3 41.0 58.3 59.4 61.9 63.1 63.6 64.1 67.8 70.0 70.5 71.6 83.1 0.0 Xgwm190 43.1 48.6 51.3 Xgwm293 Xgwm358 Xgwm159 68.9 Xgwm174 125.4 Xgdm063 137.8 Xbarc110 162.3 169.3 169.9 170.9 172.5 174.2 XBS00003711 XBS00000622 XBS00003826 XBS00009294 XBS00003783 XBS00004065 XBS00011769 193.1 XBS00003563 XBS00010693 XBS00003772 XBS00009825 XBS00003812 XBS00003659 XBS00003593 XBS00012481 XBS00003785 XBS00009795 XBS00010580 XBS00009967 XBS00003792 XBS00003622 XBS00003748 XBS00010551 XBS00003786 Xcfd013 XBS00003852 XBS00010420 XBS00003893 XBS00010415 XBS00012491 Xwmc105 Xgwm680 Xgwm608 XBS00010537 XBS00003641 XBS00010328 XBS00010602 XBS00004016 XBS00009844 XBS00003620 XBS00009695 XBS00003897 XBS00019726 XBS00014363 XBS00003671 XBS00003562 Xgwm219 102.1 Xgwm285 Xwmc089 Xbarc134 117.7 XBS00010618 131.5 132.0 XBS00010627 XBS00009548 Pinb-D1 11.9 6D 0.0 1.2 2.3 Xbarc183 XBS00009806 XBS00010742 14.9 Xbarc173 43.6 45.9 47.2 Xgwm325 Xbarc054 Xcfd019 56.7 Xbarc175 Figure 3 Continued ª 2011 The Authors Plant Biotechnology Journal ª 2011 Society for Experimental Biology, Association of Applied Biologists and Blackwell Publishing Ltd, Plant Biotechnology Journal, 1–14 Wheat SNP markers 9 7A 0.0 2.9 7B XBS00009543 XBS00010006 11.2 Xgwm635 21.0 Xwmc388 27.5 XBS00003665 XBS00009404 0.0 1.8 2.4 Xgwm130 Xbarc127 XBS00002556 16.7 17.2 17.7 XBS00009978 XBS00010307 XBS00010605 26.9 29.6 32.1 Xbarc121 Xbarc108 Xgwm631 Xbarc029 Xbarc049 0.0 2.3 2.9 17.8 18.9 20.0 22.1 XBS00003350 XBS00009290 XBS00009556 Xwmc517 31.5 XBS00009398 36.4 XBS00009861 Xdupw398 51.0 51.6 60.4 62.6 XBS00009464 XBS00010660 Xgwm577 XBS00010327 XBS00009879 XBS00003672 XBS00005062 Xbarc032 XBS00009245 XBS00009379 XBS00004376 XBS00003756 XBS00003649 XBS00004171 XBS00003703 XBS00010142 64.5 66.2 67.4 69.9 70.4 74.7 48.8 50.4 52.0 XBS00009550 XBS00009975 XBS00003894 87.4 88.5 96.3 97.9 99.5 101.8 XBS00003621 XBS00001450 XBS00014199 XBS00004348 XBS00009886 XBS00003865 Xgwm344 XBS00000663 XBS00003720 119.8 Xcfa2040 Xbarc172 Xbarc176 XBS00003830 XBS00003494 XBS00003760 78.5 79.1 80.2 94.2 22.7 24.0 24.5 27.8 7D Xpsp3113 Xbarc172 XBS00009457 Xgwm437 Xcfd021 39.1 Xwmc150 0.0 Xbarc070 XBS00010454 XBS00003664 XBS00010560 XBS00009619 XBS00004403 15.4 15.9 0.0 1.2 2.4 2.9 7.8 17.7 XBS00009565 XBS00010064 Xgwm428 XBS00003945 XBS00010071 XBS00009623 XBS00003749 Xbarc097 Xcfa2040 Xbarc076 Figure 3 Continued developed by CIMMYT. Savannah does not appear to cluster with the other hexaploid varieties, reflecting the exotic genetic background to this line. The hierarchical cluster analysis illustrates the genetic relationship between the varieties used in this study, but also indicates the reduction in germplasm variation within elite lines and the need for introducing genetic diversity into breeding programmes (Haudry et al., 2007). Genetic map A linkage map is often the first step towards understanding genome assembly and evolution and provides an essential framework for mapping agronomic traits of interest (Xia et al., 2010). We present here the first SNP-based linkage map for wheat, constructed using 500 transcript-linked varietal SNPs generated in this study in combination with 574 previously developed markers (Figure 3). The total map length was 2999 cM, similar in length to previous framework linkage maps produced in wheat using microsatellite markers (2569 cM; Somers et al., 2004), DArT markers (2937 cM; Akbari et al., 2006) and a combination of markers (3522 cM; Quarrie et al., 2005). While each of the three genomes had similar map lengths (1131 cM for the A genome; 1075 cM for the B genome and 792 cM for the D genome), there was a significant difference between the distribution of markers (i.e. SSR, SNP, AFLP and DArT loci) between the three genomes (A genome 354 mark- ers; B genome 526 markers; D genome 174 markers), a ratio of 33 : 50 : 17, and this difference was more apparent when calculated for the 480 SNP markers alone (34 : 56 : 10). The ratio of total markers on the three genomes is similar to that observed by Quarrie et al. (2005) who found the distribution of markers between the three genomes at a ratio of 40 : 40 : 20 for the A, B and D genomes, respectively. Other comparable ratios from wheat linkage maps for the A ⁄ B ⁄ D genomes were 33 : 41 : 25 (Röder et al., 1998), 30 : 39 : 31 (Somers et al., 2004) and 28 : 60 : 12 (Bernardo et al., 2009). As might be expected, the distributions of markers on each of the several homoeologous chromosome groups were not uniform with large gaps of more than 30 cM on D genome chromosomes and concentrations of markers around the centromeres that were particularly noticeable on A and B chromosomes. Our SNP marker set contained a considerably lower number of D genome markers than previously developed marker sets; this is a likely consequence of SNP markers targeting genic regions, while other marker types are distributed more evenly throughout the genome. The lack of D genome markers compared with the A and B genomes has been attributed to the loss of polymorphism in coding regions during the genetic bottleneck that accompanied the development of modern elite cultivars and indicates that future SNP discovery efforts will be required to specifically target the D genome to offset the ª 2011 The Authors Plant Biotechnology Journal ª 2011 Society for Experimental Biology, Association of Applied Biologists and Blackwell Publishing Ltd, Plant Biotechnology Journal, 1–14 effects of lower levels of genetic diversity in this genome (Caldwell et al., 2004; Chao et al., 2009). Marker distribution between the homoeologous linkage groups ranged from 7 to 20% of the total markers, with group 4 containing the lowest number of markers, a pattern also seen in the linkage map of Somers et al. (2004). In a genome-wide study of nucleotide diversity by Akhunov et al. (2010) the group 4 chromosomes were found to have low levels of diversity, with chromosomes 4A and 4B having similar diversity to the D genome. It has been hypothesized that homoeologous group 4 chromosomes have a lower number of genes than the remaining six homoeologous groups (Qi et al., 2004) and hence a lower level of recombination and resulting genetic diversity (Akhunov et al., 2010). The genetic map locations of the 500 SNP loci generated in this study were compared with their predicted map location from physical mapping experiments. Of 48 genetically mapped loci, 81% mapped to the same homoeologous group predicted by the Kansas deletion lines. A similar comparison of 155 SNP loci with deletion-mapped ESTs revealed that 69% mapped to the same homoeologous group (Qi et al., 2004). While this is a relatively high level of correlation, discrepancies in the data are not unexpected because of the numerous secondary and tertiary deletions seen in the deletion lines (Qi et al., 2004). Genetic map locations of SNP loci that can also be placed on physical maps are a potentially useful resource for identifying and characterizing these additional deletions. The same 500 mapped sequences were also BLASTN searched against the B. distachyon genome sequence resulting in 205 wheat SNPs being placed on the B. distachyon genome. Although SNP coverage was generally low when mapped markers were divided between wheat chromosomes, where sufficient markers existed, as for wheat linkage group 2 versus B. distachyon chromosome 5, the relationship was consistent with known chromosomal relationships (International Brachypodium Initiative., 2010; Figure 4). We found that the synteny between wheat group 2 and B. distachyon chromosome 5 was generally in line with expectations; however, we did find that two SNPs (BS0010442 and BS00009533) that mapped to the same point in linkage group 2 were placed at opposite ends of B. distachyon chromosome 5. To investigate this further, we BLAST searched both the rice and Sorghum bicolor genomes with the sequences flanking BS0010442 and BS00009533. We found no match against the rice genome, but both SNPs mapped to within 135 kilobases of each other on S. bicolor chromosome 6. From these results, we tentatively conclude that the ancestral state of these markers is to lie in close proximity and that a chromosomal rearrangement, such as a reciprocal telomeric translocation, has altered their relative positions in the lineage leading to B. distachyon. Screening the 500 sequences against the NCBI non-redundant protein sequence database generated 213 hits, of which 184 were unique. Owing to the nature of the normalized cDNA libraries used, the unique sequences identified did not show a significant bias to sequences associated with either leaf or root gene expression (the tissues used to generate the cDNA libraries); instead, they reveal an interesting mix of sequences, such as various disease resistance-like genes, sequences involved in DNA repair and various genes involved in numerous metabolic pathways. While this analysis indicated that a number of sequences (13) had multiple SNPs, it also revealed that 17 SNPs showed similarity to the same database entries, but mapped to different regions of the genome. For instance, three SNPs Position in brachypodium chr 5 (fl) 10 Alexandra M. Allen et al. 1.2 1 0.8 0.6 0.4 0.2 0 0 50 100 150 Position in wheat group 2 (cM) 200 Figure 4 Scatterplot showing the syntenic relationship between the wheat SNP map positions generated in this study for linkage group 2 and the corresponding location in Brachypodium distachyon, shown as fraction of length (fl) along chromosome 5. The wheat SNP flanking sequences were matched to the B. distachyon genome using BLASTN with an e-value cut-off of 1e)10. Wheat linkage group 2 is shown here as it has the highest number of markers mapped onto the B. distachyon genome. The point marked in red is SNP BS0010442, and that in black is SNP BS00009533, both of which map to 181 cM in wheat linkage group 2. showed sequence similarity to rice sequence H0322F07.7; one of these SNPs, BS00009929, mapped to an unassigned linkage group, while the second SNP, BS00010640, mapped to 2B and the third SNP, BS00014363, mapped to 6B. The presence of homoeologous and paralogous sequences throughout the wheat genome has been documented (Gupta et al., 2008). However, the specific example shown here and the remaining 16 examples, in which similar sequences could be mapped to individual locations, suggest that KASPar-based genotyping is sufficiently sensitive and robust to map individual paralogous genes in the wheat genome. Homoeologous versus non-homoeologous-specific KASPar probes In wheat, the majority (90%) of the KASPar probes detect both the polymorphic and non-polymorphic homoeologous loci. While such probes are suitable for screening inbred wheat varieties, screening heterozygous material is more problematical. To overcome this, we investigated the possibility of converting a standard KASPar probe (BS0000329; Figure 5b) to a homoeologous-specific KASPar probe. Using the 5 · Chinese Spring wheat genome sequence, we were able to design three homoeologous-specific reverse primers, which when used in conjunction with the original varietal SNP-specific KASPar primers generated one of two possible results; a non-polymorphic probe (when the homoeologous copy being amplified did not contain a varietal SNP; Figure 5c) or a polymorphic probe (when the homoeologous copy being amplified did contain a varietal SNP; Figure 5d). Use of the redesigned homoeologous-specific primers to screen an F2 population confirmed that they were capable of discriminating between homozygotes and heterozygotes (Figure 5e). Examination of the genomic sequence around several other KASPar loci suggested that it should be possible to design homoeologous-specific primers for the majority of the SNPs generated in this investigation. However, in the absence of both suitable primer design software and a more complete genome sequence, this process is time-consuming taking approximately 30 min per probe and requires further experi- ª 2011 The Authors Plant Biotechnology Journal ª 2011 Society for Experimental Biology, Association of Applied Biologists and Blackwell Publishing Ltd, Plant Biotechnology Journal, 1–14 Wheat SNP markers 11 (b) (c) (d) (e) (a) Homoeolog 1v1 Homoeolog 2v1 Homoeolog 3v1 Homoeolog 1v2 Homoeolog 2v2 Homoeolog 3v2 K ASP ar pri mer A K ASP ar Pri mer G #1 #1 #1 #1 #1 #1 AAAATTCATG AAAATTCATG AAAATTCATG AAAATTCATG AAAATTCATG AAAATTCATG AATGCCCGCA AATGCCCGCA AATGCCCGCG AATGCCCGCA AATGCCCGCA AATGCCCGCG CGC G GC G GATACCTGTA GATACCTGTA GATACCTGTA GATACCTGTA GATACCTGTA GATACCTGTA G ATA CCT GTA G ATA CCT GTA GGCATGTACA GGCATGTACA GGCATGTGCA GGCATGTACA GGCATGTACA GGCATGTACA GG CAT GTA GG CAT GTG Homoeolog 1v1 #41 Homoeolog 2v1 #41 Homoeolog 3v1 #41 Homoeolog 1v2 #41 Homoeolog 2v2 #41 Homoeolog 3v2 #41 Reverse primer 1 Homoeolog 1 SRP Homoeolog 2 SRP Homoeolog 3 SRP ATCGTTATCA ATCGTTATCA ATCGTTATCA ATCGTTATCA ATCGTTATCA ATCGTTATCA TAGCAATAGT GTAAATTCTG GTAAATTCTG GTAAATTCTG GTAAATTCTG GTAAATTCTG GTAAATTCTG CATTTAAGAC CCGTCCTAGA CCGTCCTAGA CCGTCCTAGA CCGTCCTAGA CCGTCCTAGA CCGTCCTAGA CTCCATCGCA CTCCATCACA CTCCATGACA CTCCATCGCA CTCCATCACA CTCCATGACA Homoeolog Homoeolog Homoeolog Homoeolog Homoeolog Homoeolog Homoeolog Homoeolog Homoeolog TGTCTTGTTT TGTCTTGTTT TGTCTTGTTT TGTCTTGTTT TGTCTTGTTT TGTCTTGTTT ACAGAACAAA ACAGAACAAA ACAGAACAAA 1v1 2v1 3v1 1v2 2v2 3v2 1 SRP 2 SRP 3 SRP #81 #81 #81 #81 #81 #81 GCGT GTGT CTGT CCTAGCTTCC CCTAGCTTCC CCTAGCTTCC CCTAGCTTCC CCTAGCTTCC CCTAGCTTCC GGATC GGATC GGATC TGGA TGGA TGGA TGGA TGGA TGGA Figure 5 Effect of primer design of specificity of the KASPar reaction. (a) Sequence of region around SNP BS00000329 in the three wheat homoeologous genes and across two varieties (v1 and v2) together with the six probes designed for this region. The varietal SNP (A ⁄ G) and the homoeologous SNPs are highlighted. The reverse primer 1 and the three homoeologous-specific primers (SRP) are shown as designed in the reverse compliment to the BS00000329 sequence. (b-d) KASPar plots following amplification with the various primer combinations. For b–c, the standard 20 hexaploid varieties were used together with a negative control (E3, E6 and E9) and a diploid, tetraploid and resulting synthetic line. In all cases, the KASPar primers A and G were used. In (b), the generic reverse primer was used; in (c), Homoeolog 1 SPR was used while in (d) Homoeolog 3 SPR was used. No results are presented for Homoeolog 2 SPR as this gave the same result as Homoeolog 1 SPR. (e) KASPar plot of F2-derived material following amplification with the KASPar primers A and G together with the Homoeolog 3 SRP primer. In (e), both homozygotes (Blue and red spots) and heterozygotes (green spots) are shown as discrete clusters. mental work to validate the appropriate primer combination. Therefore, it is our belief that currently it is more productive to design nonspecific KASPar probes and convert these to homoeologous-specific probes only when they are found to be useful for a specific purpose, for instance, when they are found to be tightly linked to a locus of interest. Conclusions To the best of our knowledge, this is the first report of a public linkage map for hexaploid wheat containing several hundred individual SNP markers. Bernardo et al. (2009) have reported a linkage map of wheat consisting of 923 single feature polymorphisms (SFPs) obtained by using Affymetrix arrays on 71 recombination inbred lines from the cross Ning 7840 · Clark. However, as reported, attempts to convert the array-based SFP markers to useful SNP markers that can be used in single locus assays were time-consuming and only partially successful with 33 working SNaPshot markers being generated from 58 SFPs tested. In addition, we believe this report is also the first demonstration of KASPar-based technology to both genotype wheat varieties and generate a linkage map. To generate the SNPbased linkage map described, we carried out 102 220 individual KASPar reactions (538 probes · 190 plants). These reactions were carried out within a 24-h period, using simple microplate technology and a relatively inexpensive and widely available fluorescence resonance energy transfer plate reader. Using this and similar technology, we believe that it will be possible for wheat breeders to achieve one of their most important goals, to rapidly and cheaply genotype thousands of plants with a large and flexible number of markers. Although we believe that the work described here meets the requirements of both wheat breeders and academics, it is likely that other technologies will be developed, which will increase the rate of SNP-based genotyping ⁄ mapping and decrease the cost of the individual data points. However, our studies have also shown that there is need for further SNPs, especially for the D genome and the homoeologous group 4 chromosomes, ª 2011 The Authors Plant Biotechnology Journal ª 2011 Society for Experimental Biology, Association of Applied Biologists and Blackwell Publishing Ltd, Plant Biotechnology Journal, 1–14 12 Alexandra M. Allen et al. but with continued SNP development genome-wide association studies will soon become possible in wheat. The extensive genomics resources developed by whole-genome shotgun sequencing of Chinese Spring, coupled to re-sequencing multiple breeding lines, promises to dramatically increase the number of informative SNPs, permitting unprecedented levels of precision genetic analysis in wheat breeding. Our work has also highlighted the need to capture all of the information associated with the SNP markers, for instance, the sequence surrounding the SNP being assayed. This information will allow the SNP to be adapted and used in a variety of genotyping platforms as and when they are developed. It is for this reason that via this publication, we have made all the data associated with the SNP, including the available flanking sequences (usually 120 base pairs or greater) available via the associated supplementary data (Tables S1 and S2). We hope that by making this information available, it will encourage other wheat geneticists to ensure that such markers are made public and free to use. Only by following this principle will we help to ensure that wheat breeders across the world have the tools required to breed new varieties. Experimental procedure Plant material The wheat lines included in this study were selected to give the best representation of the UK wheat germplasm. Twenty-three different wheat varieties were grown for nucleic acid extraction (for details see Table 2). The Avalon · Cadenza population (supplied by the John Innes Centre) of doubled haploid (DH) individuals, derived from F1 progeny of a cross between cvs Avalon and Cadenza, was developed by Clare Ellerbrook, Liz Sayers and the late Tony Worland (John Innes Centre), as part of a Defra funded project led by ADAS. The parents were originally chosen (to contrast for canopy architecture traits) by Steve Parker (CSL), Tony Worland and Darren Lovell (Rothamsted Research, West Common, Harpenden, Hertfordshire, AL5 2JQ, UK). All plants were grown in pots in a peat-based soil and maintained in a glasshouse at 15–25 C under a light regime of 16 h light, 8 h dark. Root and leaf tissues were harvested from 6-week-old plants. All harvested tissues were immediately frozen on liquid nitrogen and stored at )80 C until nucleic acid extraction. Preparation of normalized cDNA libraries Total RNA was extracted from root and leaf tissue of five wheat varieties (Avalon, Cadenza, Rialto, Savannah, Recital) using the TRIzol RNA extraction protocol (Invitrogen Ltd., Paisley, UK) according to the manufacturer’s instructions and purified using the RNeasy MinElute Kit (QIAGEN Ltd., Crawley, UK). Complementary DNA (cDNA) was synthesized from total RNA using the MINT kit (Evrogen, Moscow, Russia) according to the manufacturer’s instructions. Double-stranded cDNA was purified using the QiaQuick PCR purification kit (QIAGEN Ltd.). The purified cDNA samples were then normalized using the TRIMMER kit (Evrogen) according to the manufacturer’s instructions and purified using the QiaQuick PCR purification kit (QIAGEN Ltd.). Next-generation sequencing For each variety, 5 lg of normalized cDNA was processed for sequencing by the University of Bristol Transcriptomics Facility. Independent sequence libraries were generated by following the manufacturer’s protocol for sequencing of genomic DNA (Illumina Inc., San Diego, CA), with a modification to allow multiplexing of samples. Five sets of custom adapters were designed based upon the Illumina PCR primers plus an extra 4-base barcoding tag. Sequencing was carried out on a pairedend flowcell, using Illumina’s v.4 Cluster Generation and multiple v.4 36-Cycle Sequencing Kits. The Illumina Genome Analyzer IIx, was run for 2 · 76 bases of data acquisition. Image analysis and base calling was performed with SCS 2.3 software (Illumina Inc, San Diego, CA). SNP discovery Putative varietal SNPs were mined from two sources. First, public wheat Expressed Sequence Tag (EST) data held at NCBI was used as described by Barker and Edwards (2009). In this data set, 3500 SNPs were deemed to have a high probability of being varietal (as opposed to homoeologous), and from these, we selected 213 for validation. In the second approach, SNPs were mined from NGS data as follows. A reference sequence data set comprising 91 368 sequences was produced by combining the data from NCBI wheat unigene build 38 with the NGS data used in this study. The NCBI sequences were sampled as pseudoreads of 75 bases, combined with the NGS data and de-novo assembled using ABYSS (Simpson et al., 2009) on the Bristol Bluecrystal HPC cluster. NGS from the five UK varieties were mapped to our custom reference using ELAND (Illumina Inc.) with a seed length of 32 bases and the resulting sorted export files used for downstream analysis. Uniquely mapped reads were analysed using a series of custom PERL scripts designed to identify only differences between varieties as opposed to those between each variety, and the reference sequence. In this way, homoeologous SNPs (which are not useful markers) were excluded from our SNP discovery pipeline. SNPs were called where at least two alternative bases were found at a reference position, each represented by 2 or more independent reads or 5% of all reads examined (whichever was the greater). Only bases at the centre of a three base window of PHRED quality ‡20 were included in the analysis. Sequences were discarded if they showed more than 5% sequence variation from the reference over their length or if they mapped equally well to more than one locus as in either case they were deemed to have uncertain mapping. Finally, where multiple reads started at the same position in the reference, all but one were ignored to guard against clonal reads being sampled more than once. The NGS data will be made available at: http://www.cerealsdb.uk.net/NGSdata/AllenSupplement. SNP validation Genomic DNA was prepared from leaf tissue using a phenol– chloroform extraction method (Sambrook et al., 1989). Genomic DNA samples were treated with RNase-A (New England Biolabs UK Ltd. Hitchin, UK) according to the manufacturer’s instructions and purified using the QiaQuick PCR purification kit (QIAGEN Ltd.). For each putative varietal SNP, two allele-specific forward primers and one common reverse primer (Table S6) were designed (KBioscience, Hoddesdon, UK). Genotyping reactions were performed in a Gene Pro Thermal cycler (Bioer Technology, Hangzhou, China) in a final volume of 5 lL containing 1X KASP Reaction Mix (KBioscience), 0.07 lL Assay mix (containing 12 lM each allele-specific forward primer and 30 lM reverse pri- ª 2011 The Authors Plant Biotechnology Journal ª 2011 Society for Experimental Biology, Association of Applied Biologists and Blackwell Publishing Ltd, Plant Biotechnology Journal, 1–14 Wheat SNP markers 13 mer) and 10–20 ng genomic DNA. The following cycling conditions were used: 15 min at 94 C; 10 touchdown cycles of 20 s at 94 C, 60s at 65–57 C (dropping 0.8 C per cycle); and 26–35 cycles of 20 s at 94 C, 60 s at 57 C. Fluorescence detection of the reactions was performed using an Omega Fluorostar scanner (BMG LABTECH GmbH, Offenburg, Germany), and the data were analysed using the KlusterCaller 1.1 software (KBioscience). The polymorphic information content (PIC) was calculated for each marker according to Botstein et al. (1980), using Excel Microsatellite Toolkit add-in software (Park, 2001). Hierarchical cluster analysis was performed in PASW (SPSS Inc. IBM Corporation, Somers, NY). Genetic map construction The software program MapDisto v. 1.7 (Lorieux, 2007) was used to place the SNP markers in the previously established genetic map for Avalon · Cadenza (http://www.wgin.org.uk/ resources/MappingPopulation/TAmapping.php). A chi-square test was performed on all loci to test for segregation distortion from the expected 1 : 1 ratio of each allele in a DH population, and any loci showing significant distortion were removed from the data set before constructing the linkage groups. Loci were assembled into linkage groups using likelihood odds (LOD) ratios with a LOD threshold of 6.0 and a maximum recombination frequency threshold of 0.40. The linkage groups were ordered using the likelihoods of different locus-order possibilities and the iterative error removal function in MapDisto and drawn in MapChart (Voorrips, 2002). The Kosambi mapping function (Kosambi, 1944) was used to calculate map distances (cM) from recombination frequency. Comparative mapping of SNPs to ESTs and the Brachypodium distachyon and Sorghum bicolor genomes The consensus sequence surrounding mapped SNPs were matched to published, mapped wheat EST sequences (http:// wheat.pw.usda.gov/cgi-bin/westsql/map_locus.cgi) and to the B. distachyon genome sequence (http://www.brachypodium. org/) using NCBI BLASTN (Altschul et al., 1990) with an e-value cut-off of 1e)10. Only the top BLASTN hit was used for comparative mapping. Selected wheat SNP flanking sequences were also BLASTN searched to the S. bicolor genome at http://www.phytozome. net using Phytozome version 6. Acknowledgements We are grateful to the Biotechnology and Biological Sciences Research Council, UK, for providing the funding for this work (awards BB ⁄ F007523 ⁄ 1, BB ⁄ F010370 ⁄ 1, BB ⁄ G012865 ⁄ 1, BB ⁄ I003207 ⁄ 1). We thank Dr Peter Jack from RAGT, Dr Ian MacKay from NIAB and Dr Simon Griffiths from the JIC for numerous valuable discussions throughout this project. The population of doubled haploid individuals, derived from F1 progeny of a cross between cvs Avalon and Cadenza, was developed by Clare Ellerbrook, Liz Sayers and the late Tony Worland (John Innes Centre), as part of a Defra funded project led by ADAS. The parents were originally chosen (to contrast for canopy architecture traits) by Steve Parker (CSL), Tony Worland and Darren Lovell (Rothamsted Research). We are grateful to the Wheat Genetic Improvement Network for making the mapping data relating to the cvs Avalon x Cadenza population public. For further details of the Avalon x Cadenza mapping population, please refer to the Wheat Genetic Improvement Network web site at: http://www.wgin.org.uk/ resources/MappingPopulation/TAmapping.php. References Akbari, M., Wenzl, P., Caig, V., Carling, J., Xia, L., Yang, S., Uszynski, G., Mohler, V., Lehmensiek, A., Kuchel., H., Hayden, M.J., Howes, N., Sharp, P., Vaughan, P., Rathmell, B., Huttner, E. and Kilian, A. (2006) Diversity arrays technology (DArT) for high-throughput profiling of the hexaploid wheat genome. Theor. Appl. Genet. 113, 1409–1420. Akhunov, E., Nicolet, C. and Dvorak, J. (2009) Single nucleotide polymorphism genotyping in polyploidy wheat with the Illumina GoldenGate assay. Theor. Appl. Genet. 119, 507–517. Akhunov, E.D., Akhunova, A.R., Anderson, O.D., Anderson, J.A., Blake, N., Clegg, M.T., Coleman-Derr, D., Conley, E.J., Crossman, C.C., Deal, K.R., Dubcovsky, J., Gill, B.S., Gu, Y.Q., Hadam, J., Heo, H., Huo, N., Lazo, G.R., Luo, M.C., Ma, Y.Q., Matthews, D.E., McGuire, P.E., Morrell, P.L., Qualset, C.O., Renfro, J., Tabanao, D., Talbert, L.E., Tian, C., Toleno, D.M., Warburton, M.L., You, F.M., Zhang, W. and Dvorak, J. (2010) Nucleotide diversity maps reveal variation in diversity among wheat genomes and chromosomes. BMC Genomics, 11, 702. Alston, J.M., Babcock, B.A. and Pardey, P.G. eds. (2010) The Shifting Patterns of Agricultural Production and Productivity Worldwide (2010) CARDMATRIC on-line volume, Ames: Iowa State University. Altschul, S.F., Gish, W., Miller, W., Myers, E.W. and Lipman, D.J. (1990) Basic local alignment search tool. J. Mol. Biol. 215, 403–410. Barker, G.L.A. and Edwards, K.J. (2009) A genome-wide analysis of single nucleotide polymorphism diversity in the world’s major cereal crops. Plant Biotechnol. J. 7, 318–325. Bérard, A., Le Paslier, M.C., Dardevet, M., Exbrayat-Vinson, F., Bonnin, I., Cenci, A., Haudry, A., Brunel, D. and Ravel, C. (2009) High-throughput single nucleotide polymorphism genotyping in wheat (Triticum spp.). Plant Biotechnol. J. 7, 364–374. Bernardo, A.N., Bradbury, P.J., Ma, H., Hu, S., Bowden, R.L., Buckler, E.S. and Bia, G. (2009) Discovery and mapping of single feature polymorphisms in wheat using Affymetrix arrays. BMC Genomics, 10, 251. Botstein, D., White, R.L., Skolnick, M. and Davis, R.W. (1980) Construction of a genetic linkage map in man using restriction fragment length polymorphisms. Am. J. Hum. Genet. 32, 314–331. Bundock, P.C., Eliott, F.G., Ablett, G., Benson, A.D., Casu, R.E., Aitken, K.S. and Henry, R.J. (2009) Targeted single nucleotide polymorphism (SNP) discovery in a highly polyploid plant species using 454 sequencing. Plant Biotechnol. J. 7, 347–354. Caldwell, K.S., Dvorak, J., Lagudah, E.S., Akhunov, E., Luo, M.C., Wolters, P. and Powell, W. (2004) Sequence polymorphism in polyploid wheat and their D-genome diploid ancestor. Genetics, 167, 941–947. Chao, S., Zhang, W., Akhunov, E., Sherman, J., Ma, Y., Luo, M.C. and Dubcovsky, J. (2009) Analysis of gene-derived SNP marker polymorphism in US wheat (Triticum aestivum L.) cultivars. Mol. Breed. 23, 23–33. Chao, S., Dubcovsky, J., Dvorak, J., Luo, M.C., Baenziger, S.P., Matnyazov, R., Clark, D.R., Talbert, L.E., Anderson, J.A., Dreisigacker, S., Glover, K., Chen, J., Campbell, K., Bruckner, P.L., Rudd, J.C., Haley, S., Carver, B.F., Perry, S., Sorrells, M.E. and Akhunov, E.D. (2010) Population- and genome-specific patterns of linkage disequilibrium and SNP variation in spring and winter wheat (Triticum aestivum L.). BMC Genomics, 11, 727. Devos, K.M., Sorrells, M.E., Anderson, J.A., Miller, T.E., Reader, S.M., Lukaszewski, A.J., Dubcovsky, J., Sharp, P.J., Faris, J. and Gale, M.D. (1999) Chromosome aberrations in wheat nullisomic-tetrasomic and ditelosomic lines. Cereal Res. Commun. 27, 231–239. Dubcovsky, J. and Dvorak, J. (2007) Genome plasticity: a key factor in the success of polyploid wheat under domestication. Science, 316, 1862–1866. Edwards, K.J., Reid, A.L., Coghill, J.A., Berry, S.T. and Barker, G.L.A. (2009) Multiplex single nucleotide polymorphism (SNP)-based genotyping in allohexaploid wheat using padlock probes. Plant Biotechnol. J. 7, 375–390. ª 2011 The Authors Plant Biotechnology Journal ª 2011 Society for Experimental Biology, Association of Applied Biologists and Blackwell Publishing Ltd, Plant Biotechnology Journal, 1–14 14 Alexandra M. Allen et al. Endo, T.R. and Gill, B.S. (1996) The deletion stocks of common wheat. J. Hered. 87, 295–307. Foresight (2011) The Future of Food and Farming, Final Project Report. London: The Government Office for Science. Gupta, P.K., Mir, R.R., Mohan, A. and Kumar, J. (2008) Wheat genomics: present status and future prospects. Int. J. Plant Genomics, 2008, Article ID 896451 Haudry, A., Cenci, A., Ravel, C., Bataillon, T., Brunel, D., Poncet, C., Hochu, I., Poirier, S., Santoni, S., Glémin, S. and David, J. (2007) Grinding up wheat: a massive loss of nucleotide diversity since domestication. Mol. Biol. Evol. 24, 1506–1517. Imelfort, M., Duran, C., Batley, J. and Edwards, D. (2009) Discovering genetic polymorphisms in next-generation sequencing data. Plant Biotechnol. J. 7, 312–317. International Brachypodium Initiative. (2010) Genome sequencing and analysis of the model grass Brachypodium distachyon. Nature, 463, 763–767. Kosambi, D.D. (1944) The estimation of map distances from recombination values. Ann. Eugen. 12, 172–175. Lorieux, M. (2007) MapDisto: a free user-friendly program for computing genetic maps. In: Plant and Animal Genome XV Conference, San Diego, CA Mochida, K. and Shinozaki, K. (2010) Genomics and bioinformatics resources for crop improvement. Plant Cell Physiol. 51, 497–523. Orrù, L., Catillo, G., Napolitano, F., De Matteis, G., Scatà, M.C., Signorelli, F. and Moioli, B. (2009) Characterization of a SNPs panel for meat traceability in six cattle breeds. Food Control, 20, 856–860. Park, S.D.E. (2001) Trypanotolerance in West African Cattle and the Population Genetic Effects of Selection [Ph.D. thesis] University of Dublin. Perkel, J. (2008) SNP genotyping: Six technologies that keyed a revolution. Nat. Methods, 5, 447–453. Petkov, P.M., Ding, Y., Cassell, M.A., Zhang, W., Wagner, G., Sargent, E.E., Asquith, S., Crew, V., Johnson, K.A., Robinson, P., Scott, V.E. and Miles, M.E. (2004) An efficient SNP system for mouse genome scanning and elucidating strain relationships. Genome Res. 14, 1806–1811. Qi, L., Echalier, B., Friebe, B. and Gill, B.S. (2003) Molecular characterization of a set of wheat deletion stocks for use in chromosome bin mapping of ESTs. Funct. Integr. Genomics. 3, 39–55. Qi, L.L., Echalier, B., Chao, S., Lazo, G.R., Butler, G.E., Anderson, O.D., Akhunov, E.D., Dvorak, J., Linkiewicz, A.M., Ratnasiri, A., Dubcovsky, J., Bermudez-Kandianis, C.E., Greene, R.A., Kantety, R., La Rota, C.M., Munkvold, J.D., Sorrells, S.F., Sorrells, M.E., Dilbirligi, M., Sidhu, D., Erayman, M., Randhawa, H.S., Sandhu, D., Bondareva, S.N., Gill, K.S., Mahmoud, A.A., Ma, X.F., Miftahudin, Gustafson, J.P., Conley, E.J., Nduati, V., Gonzalez-Hernandez, J.L., Anderson, J.A., Peng, J.H., Lapitan, N.L.V., Hossain, K.G., Kalavacharla, V., Kianian, S.F., Pathan, M.S., Zhang, D.S., Nguyen, H.T., Choi, D.W., Fenton, R.D., Close, T.J., McGuire, P.E., Qualset, C.O. and Gill, B.S. (2004) A chromosome bin map of 16,000 expressed sequence tag loci and distribution of genes among the three genomes of polyploid wheat. Genetics, 168, 701–712. Quarrie, S.A., Steed, A., Calestani, C., Semikhodskii, A., Lebreton, C., Chinoy, C., Steele, N., Pljevljakusić, D., Waterman, E., Weyen, J., Schondelmaier, J., Habash, D.Z., Farmer, P., Saker, L., Clarkson, D.T., Abugalieva, A., Yessimbekova, M., Turuspekov, Y., Abugalieva, S., Tuberosa, R., Sanguineti, M.C., Hollington, P.A., Aragués, R., Royo, A. and Dodig, D. (2005) A high density genetic map of hexaploid wheat (Triticum aestivum L.) from the cross Chinese Spring x SQ1 and its use to compare QTLs for grain yield across a range of environments. Theor. Appl. Genet. 110, 865–880. Ravel, C., Praud, S., Canaguier, A., Dufour, P., Giancola, S., Balfourier, F., Chalhoub, B., Brunel, D., Linossier, L., Dardevet, M., Beckert, M., Rousset, M., Murigneux, A. and Charmet, G. (2007) DNA sequence polymorphisms and their application to bread wheat quality. Euphytica, 159, 331–336. Reaping the benefits (2009) Science and the sustainable intensification of global agriculture Royal Society Report. London. Reynolds, M., Bonnett, D., Chapman, S.C., Furbank, R.T., Manes, Y., Mather, D.E. and Parry, M.A.J. (2011) Raising yield potential of wheat I. Overview of a consortium approach and breeding strategies. J. Exp. Bot. 62, 439– 452. Röder, M.S., Korzun, V., Wandehake, K., Planschke, J., Tixier, M.H., Leroy, P. and Ganal, M.W. (1998) A microsatellite map of wheat. Genetics, 149, 2007–2023. Sambrook, J., Fritsch, E.F. and Maniatis, T. (1989) Molecular Cloning: A Laboratory Manual, 2nd edn. Cold Spring Harbor: Cold Spring Harbor Laboratory Press. Sears, E.R. (1954) The aneuploids of common wheat. Research Bull. 572, Missouri Ag. Exp. Sta. 57 p. Shewry, P.R. (2009) Wheat. J. Exp. Bot. 60, 1537–1553. Simpson, J.T., Wong, K., Jackman, S.D., Schein, J.E., Jones, S.J. and Birol, I. (2009) ABySS: a parallel assembler for short read sequence data. Genome Res. 19, 1117–1123. Somers, D.J., Isaac, P. and Edwards, K.J. (2004) A high-density microsatellite consensus map for bread wheat (Triticum aestivum L.). Theor. Appl. Genet. 109, 1105–1114. Valkoun, J.J. (2001) Wheat pre-breeding using wild progenitors. Euphytica, 119, 17–23. Voorrips, R.E. (2002) MapChart: software for the graphical presentation of linkage maps and QTLs. J. Heredity, 93, 77–78. Xia, J.H., Liu, F., Zhu, Z.Y., Fu, J.J., Feng, J.B., Li, J.L. and Yue, G.H. (2010) A consensus linkage map of the grass carp (Ctenopharyngodon idella) based on microsatellites and SNPs. BMC Genomics, 11, 135. Supporting information Additional Supporting information may be found in the online version of this article: Table Table Table Table Table Table S1 S2 S3 S4 S5 S6 EST-derived SNPs. NGS-derived SNPs. Genotyping calls. Mapping data. BLAST results. KASPar assays and primer sequences. Please note: Wiley-Blackwell are not responsible for the content or functionality of any supporting materials supplied by the authors. Any queries (other than missing material) should be directed to the corresponding author for the article. ª 2011 The Authors Plant Biotechnology Journal ª 2011 Society for Experimental Biology, Association of Applied Biologists and Blackwell Publishing Ltd, Plant Biotechnology Journal, 1–14
© Copyright 2026 Paperzz