Chapter 11 DNA Barcoding Methods for Land Plants Aron J. Fazekas, Maria L. Kuzmina, Steven G. Newmaster, and Peter M. Hollingsworth Abstract DNA barcoding in the land plants presents a number of challenges compared to DNA barcoding in many animal clades. The CO1 animal DNA barcode is not effective for plants. Plant species hybridize frequently, and there are many cases of recent speciation via mechanisms, such as polyploidy and breeding system transitions. Additionally, there are many life-history trait combinations, which combine to reduce the likelihood of a small number of markers effectively tracking plant species boundaries. Recent results, however, from the two chosen core plant DNA barcode regions rbcL and matK plus two supplementary regions trnH–psbA and internal transcribed spacer (ITS) (or ITS2) have demonstrated reasonable levels of species discrimination in both floristic and taxonomically focused studies. We describe sampling techniques, extraction protocols, and PCR methods for each of these two core and two supplementary plant DNA barcode regions, with extensive notes supporting their implementation for both low- and high-throughput facilities. Key words: DNA barcoding, Plant field collecting, Plant DNA extraction, PCR amplification, Cycle sequencing, rbcL, matK, trnH–psbA, Internal transcribed spacer 1. Introduction The land plants encompass an enormous diversity of form and function. They consist of the seed plants (angiosperms and gymnosperms), along with the bryophytes (mosses, hornworts, and liverworts), ferns, and fern allies. Estimates of total species numbers vary greatly among authors (1–3), but a recent estimate has suggested that there are approximately 380,000 species of land plants, comprising ca. 352,000 species of angiosperms, ca. 1,300 species of gymnosperms, and ca. 13,000 species each of bryophytes and ferns/fern allies (4). The standard animal DNA barcode comprising a portion of the mitochondrial gene CO1 evolves too slowly in plants to serve as a W. John Kress and David L. Erickson (eds.), DNA Barcodes: Methods and Protocols, Methods in Molecular Biology, vol. 858, DOI 10.1007/978-1-61779-591-6_11, © Springer Science+Business Media, LLC 2012 223 224 A.J. Fazekas et al. useful DNA barcode (5). This has led to the search for an equivalent DNA barcode for land plants. The primary focus of this search has been the plastid genome, with many authors recognizing that multiple regions are required (5–12). Selecting a standard plant DNA barcode has been difficult, as all of the various candidate loci have different strengths and weaknesses, with no clear-cut front runners. In a community-authored paper, the combination of portions of the plastid regions rbcL and matK was suggested as the core DNA barcode for land plants (13) and subsequently provisionally adopted by the Consortium for the Barcode of Life. In addition to this core DNA barcode, other loci are often required to increase the levels of species resolution. At the 2009 International Barcode of Life Conference in Mexico City, it was recommended that the community continue to gather data from additional DNA barcoding loci to establish whether other loci should be formally incorporated into the plant DNA barcode. The two most widely used supplementary loci are the plastid intergenic spacer trnH–psbA (one of the leading contenders for the core plant DNA barcode) and the nuclear ribosomal internal transcribed spacers (ITS). The nuclear ribosomal ITS regions had previously been discounted as a standard DNA barcode due to concerns over paralogy and the presence of putative pseudogenes which lead to sequencing difficulties in many plant groups (e.g., refs. 14–18). However, the increased resolution of ITS over plastid DNA barcodes in many studies (e.g., ref. 19) suggests that it should continue to be explored as part of the plant DNA barcode, and some authors have noted that just using a subset of the ribosomal cassette (ITS2) can lead to greater amplification and sequencing success compared to the entire ITS region (20). We, therefore, include methods for all four of these regions [rbcL, matK, trnH–psbA, and ITS (including ITS2)] to provide the maximum utility to users of plant DNA barcoding. Details of other loci that have been used in plant DNA barcoding studies can be found elsewhere (e.g., refs. 5, 11, 13, 21). It should be noted that levels of species discrimination in plants with standard DNA barcoding loci are in general lower than those obtained by CO1 in many animal groups (22). This is in part due to the lower rate of nucleotide substitution in the plastid genome, but also due to other reasons, including hybridization, polyploidy, speciation via breeding system transitions, species defined on very narrow taxon concepts, large ancestral population sizes, and low levels of intraspecific gene flow for plastid markers (23, 24). These issues are not evenly distributed among all plant groups; therefore, it is expected that resolution at the species level will be reasonably good in some groups and quite poor in others. In floristic contexts where geographical limitation usually restricts the number of closely related species, rates of species discrimination are expected to be greater (e.g., refs. 25, 26). Methods are invariably open to improvement from a variety of sources, and there are often many ways to achieve the same result. 11 DNA Barcoding Methods for Land Plants 225 For example, the reader may have a different way of drying plant samples or prefer to do PCR in larger reaction volumes. Where multiple methods are commonly in use, we attempt to provide details for each. The notes provided in the last section illuminate some of the principles that the methods we have provided aim to achieve. Some of the methods provided have been optimized to be cost-efficient, and are those currently in use at the Canadian Centre for DNA Barcoding (http://www.ccdb.ca/pa/ge/ research/protocols). 2. Materials 2.1. Field Collecting 1. Field press with blotting paper and spacers for voucher preparation. 2. Jewelry tags for labeling. 3. Silica gel (with 10–30% indicating silica beads). 4. Waterproof markers or pens. 5. Container(s) for silica drying of tissue, e.g., 20-ml scintillation vials, sealable plastic whirl-packs, zip-lock bags, or coin envelopes/tea bags that can be placed in a sealable container. 2.2. Tissue Sample Storage 1. Use of a climate-controlled facility if available or airtight containers filled with silica gel desiccant to archive tissue samples. 2.3. Tissue Subsampling for DNA Extraction 1. Grinding beads: for example, stainless steel 440C 3.17 mm beads. 2. Small forceps. 3. Latex or nitrile disposable gloves. 4. Ethanol: 100%. 5. ELIMINase®, DNA AWAY®, or a similar product. 6. Alcohol burner. 7. For single tube-based extractions: 2-ml screw-cap tubes with O-ring seals that are strong enough to withstand the homogenization process without breaking. 8. For plate-based extractions: Racked sterile mini tube strips with cap strips (e.g., PROgene® Mini Tube System 1.1 ml 8 Strip Pre-sterilized Mini Tube and sterile cap strips). 2.4. DNA Extraction: Single Sample-Based Extraction: Commercial Kits 1. Equipment for tissue grinding: for example, FastPrep® or TissueLyser with tube adaptor. 2. Microcentrifuge with a rotor for 2-ml tubes. 3. Vortex. 226 A.J. Fazekas et al. 4. Ethanol: 100%. 5. Heating block/incubator capable of heating to 70°C. 6. Pipettes and pipette tips. 7. 1.5- or 2-ml microcentrifuge tubes. 8. Individual tube-based DNA extraction kit. 9. Latex or nitrile disposable gloves. 10. ELIMINase®, DNA AWAY®, or a similar product. 2.5. DNA Extraction: Single Sample Extraction: Non-kitBased Method (Adapted from Ref. 26) 1. ELIMINase®, DNA AWAY®, or a similar product. 2. Silica-membrane spin columns (e.g., EconoSpin® mini spin columns, Epoch Life Science Inc.). 3. Equipment for tissue grinding: FastPrep® or TissueLyser with tube adaptor. 4. Microcentrifuge with a rotor for 2-ml tubes. 5. Vortex. 6. Ethanol: 100%. 7. Molecular biology grade water. 8. Heating block/incubator capable of heating to 70°C. 9. Pipettes and pipette tips. 10. Latex or nitrile disposable gloves. 11. 1.5- and 2-ml microcentrifuge tubes. 12. CTAB lysis buffer: 2% cetyltrimethylammonium bromide (CTAB), 100 mM Tris–HCl pH 8.0, 20 mM EDTA, and 1.4 M NaCl. 13. Binding buffer: 5 M guanidine thiocyanate, 20 mM EDTA pH 8.0, 10 mM Tris–HCl pH 6.4, and 4% Triton® X-100. 14. First wash buffer: 50% ethanol, 3 M GuSCN, 10 mM EDTA pH 8.0, 5 mM Tris–HCl pH 6.4, and 2% Triton® X-100. 15. Second wash buffer: 60% ethanol, 50 mM NaCl, 10 mM Tris– HCl pH 7.4, and 0.5 mM EDTA pH 8.0. 2.6. DNA Extraction: Plate-Based Extraction (96 Samples): Commercial Kits 1. Equipment for tissue grinding (e.g., TissueLyser with plate adaptor). 2. Centrifuge with a deep-well swinging bucket rotor capable of achieving 5,600–6,000 × g force. 3. Ethanol: 100%. 4. Incubator capable of heating to 70°C. 5. Pipettes and pipette tips. 6. Latex or nitrile disposable gloves. 7. ELIMINase®, DNA AWAY®, or a similar product. 11 DNA Barcoding Methods for Land Plants 227 8. 96-well microplate. 9. Reagent reservoirs (100 ml). 10. Plate-based DNA extraction kit. 2.7. DNA Extraction: Plate-Based Extraction (96 samples): Non-kitBased Method (Adapted from Ref. 26) 1. 96-well microplate. 2. AcroPrep™ 96 1 ml filter plate with 1.0 μm Glass Fiber media (PALL Life Sciences). 3. Equipment for tissue grinding: FastPrep® or TissueLyser with tube adaptor. 4. Centrifuge with a deep-well swinging bucket rotor capable of achieving 5,600–6,000 × g force. 5. Vortex. 6. Orbital Shaker for microplates. 7. Laboratory tape. 8. Molecular biology grade water. 9. Ethanol: 100%. 10. Incubator capable of heating to 70°C. 11. Pipettes and pipette tips. 12. Latex or nitrile disposable gloves. 13. ELIMINase®, DNA AWAY®, or a similar product. 14. CTAB lysis buffer: 2% cetyltrimethylammonium bromide (CTAB), 100 mM Tris–HCl pH 8.0, 20 mM EDTA, and 1.4 M NaCl. 15. Binding buffer: 5 M guanidine thiocyanate, 20 mM EDTA pH 8.0, 10 mM Tris–HCl pH 6.4, and 4% Triton® X-100. 16. First wash buffer: 50% ethanol, 3 M GuSCN, 10 mM EDTA pH 8.0, 5 mM Tris–HCl pH 6.4, and 2% Triton® X-100. 17. Second wash buffer: 60% ethanol, 50 mM NaCl, 10 mM Tris– HCl pH 7.4, and 0.5 mM EDTA pH 8.0. 18. Square-well block PALL collar (PALL Life Sciences). 19. Square-well block. 2.8. PCR 1. D-(+)-Trehalose dehydrate: 10 and 20% solutions. 2. 10× Polymerase Chain Reaction (PCR) Buffer, without Mg (Invitrogen). 3. Magnesium chloride: 50 mM solution. 4. Molecular biology grade water. 5. Latex or nitrile disposable gloves. 6. Pipettes and pipette tips. 7. Deoxynucleotide solution mix: 10 mM. 8. Oligonucleotide primers (Table 1). a Trehalose buffer 10× Buffer MgCl2 dNTPs Forward primer Reverse primer Polymerase Second Third Fourth Fifth Sixth Seventh Eighth Total volume of reaction DNA (30–50 ng/μl) Recommended amount to mix for a 96-well plate Last Molecular-grade water First Total volume of PCR mix Component Order to add PCR components 0.05 mM 0.1 μM 0.1 μM 0.025 U/μl 10 μM 10 μM 5 U/μl 2.5 mM 50 mM 10 mM 1× 5% Final concentration 10× 10% Stock concentration Table 1 General PCR mix for rbc L, ITS, ITS2, and trn H–psb A 12.5 2 10.5 0.0625 0.125 0.125 0.0625 0.625 1.25 6.25 2 Volume for 1 reaction (ml) 1,050 6.25 12.5 12.5 6.25 62.5 125 625 200 Volume for 100 reactionsa (ml) 228 A.J. Fazekas et al. 11 DNA Barcoding Methods for Land Plants 229 9. Platinum Taq DNA Polymerase (Invitrogen). 10. PCR 96-well microplate. 11. Aluminum Sealing Film (Axygene Scientific, VWR). 12. Clear Sealing Film (Axygene Scientific, VWR). 13. Thermocycler. 14. Microcentrifuge. 15. Centrifuge with a swinging bucket rotor for microplates. 16. PCR workstation. 2.9. PCR Product Determination: Precast E-gel Method 1. Precast agarose gel (e.g., 2% E-gel, Invitrogen). 2. E-Base. 3. Reagent reservoir. 4. Molecular biology grade water. 5. Latex or nitrile disposable gloves. 6. Pipette and pipette tips. 7. Gel imaging system. 2.10. PCR Product Determination: Routine Agarose Gels 1. Gel rig and combs. 2. Agarose. 3. Latex or nitrile disposable gloves. 4. Pipette and pipette tips. 5. Gel imaging system. 6. 1× TBE buffer: 90 mM Tris base, 90 mM boric acid, 2 mM EDTA. 7. DNA stain: Ethidium bromide or equivalent (e.g., SYBR® Safe DNA gel stain, Invitrogen). 8. Gel loading solution (e.g., Gel loading solution Sigma G7654) * if not already in the PCR mixture. 9. Size standard (e.g., 1 kb DNA ladder). 10. Power supply. 2.11. Cycle Sequencing 1. D-(+)-Trehalose dehydrate: 10% solution. 2. 5× Sequencing Buffer: 400 mM Tris–HCl pH 9.0, 10 mM MgCl2. 3. Molecular biology grade water. 4. Latex or nitrile disposable gloves. 5. Pipette and pipette tips. 6. 96-well PCR microplate. 7. Aluminum sealing foil. 230 A.J. Fazekas et al. 8. Clear sealing film. 9. Microcentrifuge. 10. Thermocycler. 11. Centrifuge with a swinging bucket rotor for microplates. 12. PCR workstation. 13. Oligonucleotide primer: 10 μM. 14. BigDye™ Terminator v3.1 Cycle Sequencing Kit (Applied Biosystems). 2.12. Cycle Sequencing Reaction Cleanup and Processing for an ABI 3730xl Capillary Sequencer 1. Sephadex® G50 (Sigma). 2. Acroprep™ 96 Filter plate, 0.45 μM GHP (PALL Corporation Catalog No. 5030). 3. Molecular biology grade water. 4. Latex or nitrile disposable gloves. 5. Pipette and pipette tips. 6. Septum (Applied Biosystems). 7. Black plate base (Applied Biosystems). 8. White plate retainer (Applied Biosystems). 9. Pop-7™ Polymer for 3730xl DNA Analyzers (Applied Biosystems). 10. 3730xl DNA Analyzer Capillary Array, 50 cm (Applied Biosystems). 11. 10× Running buffer for 3730xl DNA Analyzers (Applied Biosystems). 12. MicroAmp 96-well reaction plate (Applied Biosystems). 3. Methods 3.1. Field Collecting 1. Prior to going to the field, dispense the silica gel into scintillation vials (~2/3–3/4 full), plastic bags (~15 ml of silica), or a 1-L container (~15% full) for coin envelopes or tea bags. 2. Harvest the plant: whole plant if small, or a branch with leaves from woody shrubs or trees. 3. Place the voucher in the field press such that identifying features (flowers, fruits, both sides of leaves) can be easily inspected when dried. 4. Identify the voucher with a unique collecting number, either with a jewelry tag attached to the voucher or by writing on the paper the sample is pressed in. 11 DNA Barcoding Methods for Land Plants 231 5. Take a small amount of leaf tissue (3–10 cm2; see Notes 1–6), and place in either: the scintillation vial containing silica gel, the plastic bag containing silica gel, or a coin envelope/tea bag, which is placed in the 1-L container with silica gel. 6. Label the container or coin envelope with the same collection number as the voucher. 3.2. Sample Storage 1. Store the tissue samples in a dry location or retain in silica until ready to subsample for DNA extraction (see Note 7). 3.3. Tissue Subsampling: For DNA Extraction Using Single Tubes 1. Clean the bench working area with ELIMINase®, DNA AWAY®, or a similar product. 2. With clean gloves and forceps, add one clean grinding bead to each tube and recap tubes. 3. Sterilize the forceps by dipping them in alcohol and flaming them. 4. Open a container with the sample, break off a piece of leaf or find a piece of the right size (see Notes 8–13), and insert it into a tube. 5. Label the tube with the collection number. 6. Clean the forceps by dipping them in alcohol and flaming, and then repeat step 4 for the remaining samples. 7. Change gloves often (or any time, you feel that they may have become contaminated). 3.4. Tissue Subsampling: For DNA Extraction Using 96-Well Plate Format 1. At a computer, organize the sample names in a spreadsheet in the plate format (8 rows × 12 columns). A good practice is to organize samples such that different genera are in adjacent wells. This facilitates the detection of cross contamination. 2. In the lab, clean the bench working area with ELIMINase®, DNA AWAY®, or a similar product. 3. With clean gloves and forceps, add one clean grinding bead to each tube in the plate, and add the strip caps to the tubes. 4. Organize the physical tissue samples in silica gel (vials, bags, or coin envelopes) on the bench, in columns and rows corresponding with the spreadsheet created in step 1. 5. Work with one strip of eight tubes (each corresponding to a numbered column) at a time. Remove one set of eight tubes to a new holder to physically separate the eight tubes being filled from the others. 6. Remove the lids from the strip of eight tubes and put them somewhere where they will not be contaminated by any flying plant material (e.g., between two kimwipes or on a kimwipe covered with a plastic lid). 232 A.J. Fazekas et al. 7. Sterilize the forceps by dipping them in alcohol and flaming them. 8. Open the container with the sample, break off a piece of leaf or find a piece of the right size (see Notes 8–13), and insert it into the correct tube. Pieces of plant tissue that are linear in shape (e.g., grass leaves and stems, conifer needles) need to be broken into smaller pieces to achieve proper homogenization using the grinding beads. 9. Clean the forceps by dipping them in alcohol and flaming, and wipe the gloves with a kimwipe moistened with ethanol in order to remove any plant tissue. 10. Repeat steps 8 and 9 for the remaining seven samples in the column. 11. Once the column of eight tubes is loaded, discard the gloves and put on new ones. 12. Attach the clean strip cap to the tubes, making sure that the lids are on tightly (they may pop off if not pushed all the way on). 13. Repeat the process from step 5, changing gloves after each set of eight tubes (or any time, you feel that they may have become contaminated). 3.5. Tissue Disruption 1. Homogenize the plant material with the grinding bead using a FastPrep®, TissueLyser, or a similar instrument: for the TissueLyser, apply 28 Hz for 30 s, then rotate the adaptors, and repeat once (or a maximum of two more times if necessary to obtain good disruption) (see Note 14). 2. Briefly centrifuge the tubes or the plate of strip tubes after homogenization to limit the amount of material stuck to the cap (see Note 15). 3.6. DNA Extraction: Kit-Based Protocols 1. For kit-based instructions. methods, follow the manufacturer’s 3.7. DNA Extraction: For Non-kit, Single Sample-Based Methods (Adapted from Ref. 26) 1. Carefully remove the screw caps from each tube and discard the caps. Powderized plant tissue will be adhered to the cap and will easily dislodge if the caps are not handled carefully (see Note 15). 2. Dispense 200 μl of CTAB lysis buffer to each tube and recap the tubes with new caps. 3. Gently invert each tube in order to mix the powderized plant material with the lysis buffer, and briefly centrifuge the tubes for 1,000 × g force for 1 min to collect the sample to the bottom. 4. Incubate the samples for 1 h at 65°C with occasional mixing by inversion. 11 DNA Barcoding Methods for Land Plants 233 5. Centrifuge the tubes at 1,500 × g force for 1 min. 6. Remove the caps and transfer 50 μl of lysate from each sample to a new 1.5-ml microcentrifuge tube (see Note 16). 7. Add 100 μl of binding buffer to each tube with lysate. 8. Immediately after addition of the binding buffer, carefully and slowly mix three to four times by aspirating and dispensing 100 μl. 9. Transfer 150 μl of each lysate into a spin column, placed in a 1.5-ml microcentrifuge tube, and close the cap on the spin column. 10. Centrifuge at 5,000 × g force for 5 min to bind the DNA to the membrane of the spin column. 11. Add 200 μl of the first wash buffer to each spin column. 12. Centrifuge at 5,000 × g force for 2 min. 13. Remove the spin column from the tube, discard the flow through, and replace the spin column in the tube. 14. Add 500 μl of the second wash buffer to the spin column. 15. Centrifuge at 5,000 × g force for 5 min. 16. Remove the spin column from the tube and discard the tube and contents. 17. Open the cap of the spin column, place the spin column on the lid of a tip box, and incubate at 56°C for 30 min to evaporate residual ethanol. 18. Place the spin column in a new 1.5-ml microcentrifuge tube. 19. Add 50 μl of ddH2O (at 56°C) to the center of the spin column. 20. Incubate at room temperature for 1 min. 21. Centrifuge at 5,000 × g force for 5 min to collect the DNA eluate. 22. Remove the spin column and discard it. 23. Store the DNA at 4°C for short-term storage or at −20°C (preferably at −80°C) for long-term storage. 3.8. DNA Extraction: For Non-kit, PlateBased Methods (Adapted from Ref. 26) 1. Remove one set of strip tubes to a separate holder for cap removal and addition of CTAB lysis buffer. 2. Carefully remove the strip of caps using each individual cap tab to pull the cap off the tube, and discard the strip caps. Powderized plant tissue will be adhered to the cap and will easily dislodge if the caps are not handled carefully (see Note 15). 3. Dispense 200–350 μl of CTAB lysis buffer to each tube (depending on the amount of sample) and recap the tubes with a new strip cap. 234 A.J. Fazekas et al. 4. Repeat steps 2 and 3 for the remaining 11 sets of strip tubes. 5. Use tape to tightly seal the caps on the tubes (which may otherwise pop off during incubation). 6. Gently invert the rack of tubes once to mix the powderized plant material with the lysis buffer. 7. Briefly centrifuge the tubes at 1,000 × g force for 1 min to collect the sample to the bottom. 8. Incubate the samples for 1 h at 65°C using shaker (80–100 rpm). Do not invert rack. 9. Centrifuge the plate at 1,500 × g force for 1 min. 10. Remove the strip caps and transfer 50 μl of lysate from each sample to the corresponding position of a 96-well microplate (see Note 16). 11. Add 100 μl of binding buffer to each well. 12. Immediately after addition of the binding buffer, carefully and slowly mix three to four times by aspirating and dispensing 100 μl. 13. Transfer 150 μl of each lysate into a well in a 1 ml Acroprep™ 96-well glass fiber plate, placed on a 2-ml square-well block (see Note 17). 14. Seal the glass fiber plate with clear PCR film. 15. Centrifuge at 5,000 × g force for 5 min to bind the DNA to the glass fiber membrane. 16. Remove the PCR film and add 200 μl of the first wash buffer to each well of the glass fiber plate. 17. Seal the plate with clear PCR film and centrifuge at 5,000 × g force for 2 min. 18. Remove the PCR film and add 750 μl of the second wash buffer to each well of the glass fiber plate. 19. Seal the plate with clear PCR film and centrifuge at 5,000 × g force for 5 min. 20. Remove the seal, place the glass fiber plate on the lid of a tip box, and incubate at 56°C for 30 min to evaporate residual ethanol. 21. Position a collar on the collection microplate (optional) and place the glass fiber plate on top. 22. Add 50 μl of ddH2O (at 56°C) to each well of the glass fiber plate. 23. Seal the glass fiber plate with clear PCR film. 24. Incubate at room temperature for 1 min. 25. Place the assembled glass fiber plate and microplate on top of a square-well block to prevent cracking of the collection plate and centrifuge at 5,000 × g force for 5 min to collect the DNA eluate. 11 DNA Barcoding Methods for Land Plants 235 26. Remove the glass fiber plate and retain it at −20°C as a backup until the extraction is determined to be successful, after which it can be discarded. 27. Cover the DNA plate with aluminum sealing film and store at 4°C for short-term storage or at −20°C (preferably at −80°C) for long-term storage. 3.9. PCR: PCR Mixture 1. Prepare and label a 1.5-ml microcentrifuge tube for the PCR cocktail of 100 reactions (Table 1). This number of reactions is recommended when using a 96-well plate to accommodate pipetting error. 2. Defrost all components of the cocktail at room temperature, except the polymerase which has to be kept at −20°C at all times prior to use. 3. Prepare the PCR cocktail adding the components in order listed in Tables 1–3 (see Notes 18–21). See also Table 4 for the standard primers for amplification of rbcL, matK, ITS, ITS2, and trnH–psbA. 4. Vortex the mix and centrifuge at 1,000 × g force briefly. 5. Dispense 10.5 μl of the PCR cocktail in each well using the same tip [replace tip occasionally (every 16 wells) to reduce pipetting error]. 6. Add 2 μl of the sample DNA (30–50 ng/μl) to each well. Leave one or two wells blank as a negative control. Use a fresh tip for each DNA sample. 7. Seal the plate tightly with aluminum foil (using a roller to seal) or thermo-seal cover (apply heat to seal) (see Note 22). 8. Centrifuge the plate at 1,000 × g force for 1 min (see Note 23). 9. Place the plate into the thermo-cycling block, close it, and apply the appropriate PCR program. 3.10. PCR Thermal Cycling Programs 1. rbcL, trnH–psbA (see Notes 24 and 25): 94°C for 4 min; 35 cycles of 94°C for 30 s, 55°C for 30 s, 72°C for 1 min; final extension 72°C for 10 min. 2. trnH–psbA for ferns and allies, and bryophytes (see Note 25): 94°C for 4 min; 2 cycles of 94°C for 45 s, 50°C for 45 s, 72°C for 1 min; 35 cycles of 94°C for 45 s, 45°C for 45 s, 72°C for 1 min; final extension 72°C for 10 min. 3. trnH–psbA using Phusion polymerase (see Note 26, Table 3): 98°C for 45 s; 35 cycles of 98°C for 10 s, 64°C for 30 s, 72°C for 40 s; final extension 72°C for 10 min. 4. matK first round (matK-KIM1R/matK-KIM3F) (see Note 27): 94°C for 1 min; 35 cycles of 94°C for 30 s, 52°C for 20 s, 72°C for 50 s; final extension 72°C for 5 min. Polymerase Eighth 0.5 μM 0.5 μM 0.1 U/μl 10 μM 10 μM 5 U/μl 0.15 0.375 0.375 0.15 7.5 Reverse primer Seventh 0.2 mM 10 mM 0.225 Total volume of reaction Forward primer Sixth Recommended amount to mix for a 96-well plate a dNTPs Fifth 1.5 mM 50 mM 0.75 1.875 1 MgCl2 Fourth 1× 5% 10× 20% 2.60 Volume for 1 reaction (ml) DNA (3–5 ng/μl) 10× Buffer Third Last Trehalose buffer Second Final concentration 6.5 Molecular-grade water First Stock concentration Total volume of PCR mix Component Order to add PCR components Table 2 PCR mix for mat K 650 15.0 37.5 37.5 15.0 22.5 75 187.5 260 Volume for 100 reactionsa (ml) 236 A.J. Fazekas et al. Polymerase Seventh 0.025 U/μl 2 U/μl b Recommended amount to mix for a 96-well plate Note that in limited trials HF buffer does not appear to be compatible with trehalose a 0.1 μM 10 μM 0.125 0.1 0.1 10 Reverse primer Sixth 0.1 μM 10 μM 0.056 2 Total volume of reaction Forward primer Fifth 0.056 mM 10 mM 1× 1 dNTPs Fourth 5× 6.32 0.3 Volume for 1 reaction (ml) DNA (30–50 ng/μl) HF buffer (containing 1.5 mM MgCl2)b Third Last Molecular-grade water Second 3% Final concentration 9 DMSO First Stock concentration Total volume of PCR mix Component Order to add PCR components Table 3 PCR mix for use with Phusion polymerase 900 12.5 10 10 5.6 200 632 30 Volume for 100 reactionsa (ml) 11 DNA Barcoding Methods for Land Plants 237 AB101 AB102 psbAF trnH2 psbA trnH(GUG) psbA501f ITS trnH–psbA GTTATGCATGAACGTAATGCTC CGCGCATGGTGGATTCACAATCC CGAAGCTCCATCTACAAATGG ACTGCCTTGATCCACTTGGC TTTCTCAGACGGTATGCC ACGAATTCATGGTCCGGTGAAGTGTTCG TAGAATTCCCCGGTTCGCTCGCCGTTAC ATGCGATACTTGGTGTGAAT TCCTCCGCTTATTGATATGC R F R F R F R F R F R F R F R F R R R F Direction Sang et al. (29) Tate and Simpson (28) Hamilton (30) Hamilton (30) Cox et al. (31) Sun et al. (39) Sun et al. (39) Chen et al. (20) White et al. (38) Ki-Joong Kim, personal communication Ki-Joong Kim, personal communication Cuenoud et al. (37) Cuenoud et al. (37) Damon Little, personal communication Damon Little, personal communication Fazekas et al. (5) Fazekas et al. (5) Fazekas et al. (5) Levin et al. (35), modified from Soltis et al. (34) Kress and Erickson (24), modified from Fofana et al. (36) References See Notes 24–27 for primer usage and alternatives. As different authors use different conventions as to what constitutes “forward” and what constitutes “reverse” primers, the notation of F and R on primer names can mean different things. This is particularly problematic for matK and trnH–psbA. The “Direction” column indicates primer orientation with reference to the direction of the reading frame of rbcL and matK and following the convention of clockwise orientation for trnH–psbA ITS-S2F ITS4 ITS2 rbcLajf634R ACCCAGTCCATCTGGAAATCTTGGTTC CGTACAGTACTTTTGTGTTTACGAG CGATCTATTCATTCAATATTTC TCTAGCACACGAAAGTCGAAGT CTGGATYCAAGATGCTCCTT GGTCTTTGAGAAGAACGGAGA CCCTATTCTATTCAYCCNGA CGTATCGTGCTTTTRTGYTT GAAACGGTCTCTCCAACGCAT rbcLa-R matK-KIM1R matK-KIM3F matK-390f matK-1326r NY552F NY1150R matKpkF4 matKpkR1 GTAAAATCAAGTCCACCRCG rbcLa-F rbcL matK ATGTCACCACAAACAGAGACTAAAGC Primer name Region Sequence (5′–3′) Table 4 Primers commonly used for DNA barcoding in plants 238 A.J. Fazekas et al. 11 DNA Barcoding Methods for Land Plants 239 5. matK second-round failure tracking (matK-390f/matK-1326r) (see Note 27): 94°C for 1 min; 35 cycles of 94°C for 30 s, 50°C for 40 s, 72°C for 40 s; final extension 72°C for 5 min. 6. ITS (AB101/AB102) (see Note 28): 94°C for 5 min; 30 cycles of 94°C for 1 min, 55°C for 1 min, 72°C for 1 min, 45 s; final extension 72°C for 10 min. 7. ITS2 (ITS-S2F/ITS4): 94°C for 5 min; 35 cycles of 94°C for 30 s, 56°C for 30 s, 72°C for 45 s; final extension 72°C for 10 min. See Note 29 in situations, where results from PCR are unsuccessful or poor. 3.11. PCR Product Determination: Electrophoresis with Precast E-gels 1. Open the package with precast agarose gel (see Note 30), remove the plastic comb, and place the gel on the mother E-base. 2. Set the mother E-base at “EG” program and a runtime of 4 min. 3. Load 14 μl of molecular-grade water into each well of the 96-well precast agarose gel. 4. Load 3–4 μl of each PCR product into the corresponding E-gel well. 5. Slide E-gel into electrode connections of mother E-base and start electrophoresis. A green light indicates the beginning of run. A red light and beeping indicate the end of run. Stop the current by pressing pwr/prg button. 6. Remove E-gel from base and capture a digital image with the imaging documentation system. 3.12. PCR Product Determination: Electrophoresis with Routine Agarose Gels There is a large selection of gel combs and trays on the market designed to accommodate different numbers of samples. Please refer to the manufacturer’s notes for the recommended volume of agarose to be used. 1. Select the appropriate gel tray and combs for the number of samples to be run (leaving an appropriate number of wells free for size standards). Seal the ends of the tray with masking tape or use a gel-forming cassette. 2. Weigh out the agarose and place in a glass conical flask. To check PCR success, a 1% agarose gel is used; 1% agarose gel = 1 g of agarose per 100 ml of 1× TBE buffer. 3. Add the appropriate volume of 1× TBE buffer to the agarose and gently swirl. 4. Heat the solution in a microwave on maximum heat setting for approx. 30 s, remove flask from the microwave, and gently swirl to mix. Continue to heat, mixing occasionally. Carefully 240 A.J. Fazekas et al. remove the solution from the microwave, gently swirl, and check that all the agarose has dissolved. 5. Place the gel on the bench and leave to cool (or cool under cold running water), until it is comfortable to touch the side of the conical flask. 6. Add the appropriate volume of DNA gel stain (see Note 30). For Sybrsafe, this is 1 μl/10 ml of agarose gel; for ethidium bromide, this should be to a final concentration of 0.5 μg/ml of agarose gel. Gently swirl the solution to mix. 7. Pour the gel into the gel tray and leave to set for approx. 30 min. 8. If gel-loading solution is not already in the PCR mixture, prepare your samples for gel electrophoresis by mixing the gelloading solution with the PCR product (3 μl gel-loading solution plus 5 μl PCR product). 9. Carefully remove the masking tape or undo the clamp of the gel-forming cassette and gently remove the comb. 10. Place the agarose gel in the electrophoresis tank containing 1× TBE buffer, making sure that the gel is totally immersed in buffer. The buffer should just be covering the surface of the gel. 11. Load the recommended volume of size standard into the assigned lanes (typically, 0.1 μg of standard per millimeter lane width). Then, load the samples into the subsequent wells. 12. Run gel for 30 min to 1 h at 80 V. 13. Transfer the gel to the imaging documentation system and capture a digital image. 3.13. Cycle Sequencing 1. Dilute the PCR product: (a) For rbcL, ITS, ITS2, trnH–psbA: one part of PCR product/two parts of water. (b) For matK: one part of PCR product/nine parts of water. 2. Cover the plate with plastic seal, and spin at 1,000 × g force for 1 min. 3. Defrost sequencing reagents (Table 5) at room temperature. Keep BigDye™ away from light exposure prior to use. 4. Prepare sequencing mix adding components in the order listed in Table 5 (see Note 31). After adding BigDye™, mix components gently by inverting the tube several times. Do not vortex. Add one primer. Mix gently with tip. Note that separate reactions are carried out using the forward or reverse primers. 5. Dispense 9.0 μl of sequencing mix into each well of 96-well plate. Primer Fifth Last BigDye™ Fourth 11 Total volume of reaction 9.0 1 0.25 1.875 0.875 5 Volume for 1 reaction (ml) 2 10 μM 5× Final concentration Diluted PCR product Total volume of sequencing mix Sequencing buffer Third 10% Stock concentration b Recommended amount to mix for a 96-well plate Sequencing buffer: for 50 ml: 20 ml of 1 M Tris–HCl pH 9, 500 μl of 1 M MgCl2, 29.5 ml of molecular-grade water a Molecular-grade water Second b Trehalose (Sigma-Aldrich, No. T9531-100 g) Component First Order to add components Table 5 General cycle-sequencing mix 936 104 26 195 91 520 Volume for 104 reactionsa (ml) 11 DNA Barcoding Methods for Land Plants 241 242 A.J. Fazekas et al. 6. Add 2 μl of diluted PCR product to each well (use fresh tip for each PCR product). 7. Place aluminum foil or heat-seal cover over the top of the 96-well plate. Apply heat for heat-seal cover, and use roller to close the plate tightly (see Note 22). 8. Spin the plate using centrifuge at 1,000 × g force for 1 min (see Note 23). 9. Place the plate into the thermocycler block and apply the program (see Note 32): 96°C for 2 min; 30 cycles of 96°C for 30 s, 55°C for 15 s, 60°C for 4 min; hold at 4°C. 10. After cycle sequencing reaction is complete, keep the plate in a dark box at 4°C to avoid degradation of BigDye™. 3.14. Cycle Sequencing Cleanup and Processing for an ABI 3730xl Capillary Sequencer 1. Measure dry Sephadex G-50 (Sigma-Aldrich, Cat. No. G5080500 g) with the MultiScreen Column Loader (Millipore, Cat. No. MACL09645) into the Acroprep 96 Filter plate with 0.45 μm GHP membrane (PALL, Cat. No. PN5030). This loader adds the specific amount of Sephadex required (see Note 33). 2. Hydrate each well with 300 μl of molecular-grade water using a pipette. 3. Let the Sephadex hydrate overnight at 4°C or for 3–4 h at room temperature before use. 4. Assemble the Sephadex plate onto the collection plate and secure with two rubber bands. 5. Centrifuge at 750 × g force for 3 min to drain the water from wells. Discard water from the collection plate (when centrifuging two plates, make sure that both sets have equal weight which can be achieved by using additional rubber bands). The collection plate can be reused without autoclaving. 6. Add the entire volume of the sequencing reaction to the centre of the Sephadex columns using a pipette. 7. Add 25 μl of 0.1 mM EDTA to each well of the Sephadex plate. 8. Elute clean sequencing reaction by attaching a 96-well plate to the bottom of Sephadex plate and secure with rubber bands. 9. To balance two plates, attach additional rubber bands as needed. 10. Centrifuge at 750 × g force for 3 min. Remove Sephadex plate. 11. Cover the top of the collection plate with a septum. 12. Place 96-well plate into black plate bases and attach white plate retainer. 11 DNA Barcoding Methods for Land Plants 243 13. Stack assembled plate in ABI 3730xl capillary sequencer and import plate record using Plate manager module of the Data Collection software (Applied Biosystems). 14. Begin sequencing run with Run Scheduler. 3.15. Sequence Editing Careful and consistent editing of the raw sequence data is a critical component of generating a high-quality dataset. There are a number of software programs (e.g., Sequencher, CodonCode Aligner, etc.) that allow the import of raw trace files and include a variety of editing features. Since each sequence editing program is different, we cannot include a software-specific detailed editing procedure. We present instead the chain of events involved in going from the output of the sequencer to a useable sequence. 1. Retrieve electropherogram trace files from sequencer. 2. Import trace files into a sequence editing software package. 3. Generate sequence-quality scores for individual trace files. 4. Trim primer sequences from the sequences. 5. Trim sequences from both ends based upon minimum quality threshold (e.g., mean QV > 20 and no more than 2 bp QV < 20 in any 20-bp window). 6. Assemble forward and reverse sequence traces for each individual sample to create a sequence contig. 7. Manually edit individual sequences: pay particular attention to bases with low-quality scores or ambiguous calls (see Notes 34–37). 8. Acquire sequence-quality statistics for individual forward and reverse sequences (e.g., length of read, proportion of bases with QV > 20). 9. Generate consensus sequence. 10. Acquire consensus sequence quality statistics (e.g., length of consensus, percentage of bidirectional coverage, proportion of bases with QV > 20 for unidirectional and bidirectional portions of the consensus). 11. Export consensus sequence for downstream analysis. 4. Notes 1. Properly collected plant tissue is essential for maximizing PCR and sequencing success. Key to this process is that material from which DNA is extracted must be dried as quickly as possible to prevent the degradation of the DNA. Field collections 244 A.J. Fazekas et al. of specimens must be immediately split into two components: (a) the voucher and (b) a portion of the voucher (typically, leaf tissue) which is placed in a container with silica gel or similar drying agent. It is important that the portion taken for DNA is put into silica gel as rapidly as possible after harvesting from the field. This should ideally be done immediately, but if impractical, it should be done no later than at the end of the collecting day. Delays to drying material in silica gel can result in samples with reduced DNA quality and lower PCR success. 2. It is important to keep the freshly collected tissue samples in separate containers. Pooling different samples into a single Ziploc bag, for example, increases the chances of cross contamination. 3. We describe three types of containers that we have used in various settings, each with relative advantages and disadvantages. (a) Scintillation vials provide a separate enclosed environment for each sample. This can be useful in humid conditions, in which coin envelopes may absorb some moisture from the air, slowing the tissue drying process, or for tissue that has a high water content and dries more slowly inside a coin envelope rather than when in direct contact with the silica. (b) Coin envelopes are probably the simplest medium for sampling plant tissue. It is easier to insert a sample into an envelope than into the narrow opening of a scintillation vial. Multiple coin envelopes can be stored in an airtight container with silica gel, requiring less space than scintillation vials. The envelopes also keep the silica gel separate from the tissue, facilitating tissue subsampling. Tea bags can also be used in place of envelopes; they are more porous, facilitating the drying process, but are also slightly more fragile. (c) Small (~10 × 15 cm) plastic bags with silica can work well in the field, but are prone to punctures from thorns or prickles, and are somewhat permeable which does exhaust the silica over time. When the samples are dry, the plastic bags need to be handled carefully to prevent excessive breakage of the plant tissue. 4. In the case of specimens that are likely to take a long time to dry (such as samples with waxy leaves), tear the leaf sample into smaller fragments or chop with a sterile blade to increase the surface area available for contact with the silica gel. 5. The best samples for plant DNA extraction typically come from actively growing plant tissues; senescing, damaged, or infected tissues should be avoided. The usual choice of plant tissue is leaves, but shoot tips or flower buds or petals can also be used. For canopy tree species in which reaching leaf or flower material 11 DNA Barcoding Methods for Land Plants 245 is logistically challenging, an alternative approach is to use a leather punch to obtain samples of cambium tissue which avoids the need for tree climbing (27). 6. Sampling herbarium material for DNA extraction can be successful, but success is often variable and unpredictable. The quality of the extraction is most likely a function of the age of the specimen, the species in question, and the speed with which samples have been dried, which is often unknowable. The priority should be given to samples not much greater than 10 years old. However, the most critical criterion is that the samples should still be green in color. Brown coloration of the herbarium sample indicates that the tissue quickly oxidized after collection or was infected by mold, indicating that the DNA is most likely degraded and/or contaminated by fungal DNA. 7. Tissue samples from which DNA extractions are made should be prevented from rehydrating from the atmosphere. This can be achieved through a climate-controlled facility or in airtight containers (refresh the silica as necessary). Long-term experiments are still needed to provide empirical data on optimum storage procedures for tissue samples. 8. The sampling of silica-dried material into tubes for DNA extraction and the extraction process are probably the most important steps in the process of generating good-quality DNA barcode data. It is the step that is the easiest for contamination or sample mix-up to occur. Thus, it is very important to follow the steps outlined in Subheading 3.3 to prevent this. A poor-quality extraction will result in inefficient or failed PCR reactions. 9. The appropriate amount of plant material to sample for DNA extraction is 10–15 mg dry tissue. In this case, more is not better; using more than this amount of tissue will result in a poorly ground sample, overwhelm the buffers used in the extraction process, and result in low-yield or poor-quality DNA. This amount usually corresponds to ~0.5 cm2, but may be smaller depending on the leaf thickness. Plastic materials (such as sampling tubes) often have a static charge that will attract small particles of plant tissue. Fragments of plant material literally jump from one well to another, so care must be exercised when placing bits of leaves into the tubes. 10. Plant tissues that are linear in shape (e.g., grass leaves and stems, conifer needles) need to be broken into smaller pieces to achieve proper pulverization using the grinding beads. 11. When sampling plant tissue from herbarium samples in areas where an alcohol burner is prohibited, it is good practice to wipe the forceps after each sample with a kimwipe moistened with ethanol. 246 A.J. Fazekas et al. 12. It is important to keep freshly collected silica-dried material and older herbarium-sampled tissue in separate extraction plates, as they may require different extraction protocols. 13. A note on bryophytes: Extreme care is required when sampling bryophytes due to the common occurrence of mixed species samples being collected from the field. Tissue subsampling is best done at the same time as determinations are made. 14. A frequency higher than 28 Hz can destroy the tubes. We do not recommend homogenization for longer than a total of 1 min, with the exception of samples with very tough tissue in which case an additional run of 30 s can be applied. 15. After the plant tissue is ground to a fine powder, the tubes require careful handling. Centrifuging does not help significantly in removing powderized plant tissue from the lids or caps as the static charge is strong enough to keep them adhered to the interior surface of the tube’s walls and caps. Opening the caps should be done with extreme care to avoid cross contamination prior to addition of the lysis buffer. 16. In the non-kit-based protocols are provided, the entire volume of the CTAB lysate is not used. Unused lysate can be stored at −20°C as a backup until the extraction is determined as being successful as indicated by the results of first PCR reaction. The lysate can also be used as a source for additional extractions if more testing of the DNA is necessary. 17. The square-well blocks that are specified in the protocol have enough volume to collect all the wash buffers without needing to discard between washes. However, if a block with a smaller volume is used, it may be necessary to discard the wash buffer between steps 16 and 17 of Subheading 3.8. 18. Trehalose (which is also a potent PCR enhancer) acts as a cryoprotectant for Taq polymerase when PCR mixes are prepared in large volume batches and frozen for future use. 19. Many available PCR protocols for matK include 4% DMSO. Experiments based on several hundred reactions have demonstrated that a 5% Trehalose solution can replace DMSO without any significant difference in PCR success or sequence quality. 20. After DNA extraction, it is recommended to begin the first round of PCR for the rbcL DNA barcoding marker using the nearly universal primers rbcLa-F/rbcLa-R; a greater degree of PCR success and quality is obtained in bryophytes with the reverse primer rbcLajf634R. These primers generate a high rate of PCR success with DNA of good quality. Hence, this first PCR for rbcL acts as a test for DNA quality for a broad variety of taxa among angiosperms, gymnosperms, ferns, and mosses. 11 DNA Barcoding Methods for Land Plants 247 21. PCR cleanup is both expensive and time consuming, but can be avoided through use of the low concentrations of primers and dNTPs in the PCR mix and the subsequent dilution of the PCR product prior to cycle sequencing reaction. This protocol provides a high success rate for PCR and sequences for regions that are amplified by universal, highly conserved primers (plastid rbcL, trnH–psbA, and nuclear ribosomal ITS2). In contrast, the matK DNA barcoding region needs distinct conditions for successful PCR amplification. For matK, the concentration of the primers, dNTPs, and Taq polymerase cannot be significantly reduced. Based on experiments optimizing the PCR conditions for matK, we recommend a protocol with diluted DNA (0.3–0.5 ng/μl) and a smaller PCR reaction volume (7.5 μl). These conditions have yielded a higher rate of PCR success and increased sequence quality over the general PCR mix. 22. The volumes of the PCR and cycle sequencing reactions recommended here are very small. Thus, it is very important to follow the instructions in Subheadings 3.9 and 3.14 carefully. The foil or thermal-seal cover should be placed evenly and tightly over the PCR plate without wrinkles or holes to prevent evaporation during PCR cycling. 23. Centrifuging is required to collect the PCR components at the bottom of the well and eliminate any air bubbles that might have been trapped. It also aids in mixing the PCR components with the DNA sample, or cycle sequencing mix with PCR product. 24. Although rbcL is present in the vast majority of land plants, there are some groups, such as holoparasites, that no longer have a functioning copy of this gene. As a result, the primers most commonly used typically do not work in these groups. 25. The primers most widely used for PCR amplification of the plastid trnH–psbA intergenic spacer for DNA barcoding are those recommended by Kress et al. (7) or Kress and Erickson (10) (Table 4). They are, respectively, trnH2 (originally from ref. 28) and psbAF (originally from ref. 29) or trnH(GUG) and psbA (originally from ref. 30). In bryophytes, this region is often short (<200 bp) and an alternative primer, psbA501f (31) located further back in the psbA gene can be used to obtain additional characters in combination with the primer trnH2. 26. The trnH–psbA region often contains homopolymer runs that are known to reduce sequence quality after the run. Fazekas et al. (32) have shown that the use of alternative polymerases, such as Phusion (Finnzymes) or Herculase II Fusion (Agilent), can improve quality for runs of up to 13–14 bases. 248 A.J. Fazekas et al. 27. The matK gene region is more difficult to amplify and sequence than rbcL for a number of reasons. First, it is approximately 300 bp longer than rbcL, and thus more sensitive to DNA degradation. Second, the presence of mononucleotide repeats in some groups dramatically affects the quality of the sequence reaction, resulting in a contig that is primarily supported by two unidirectional sequences with only a small amount of overlap. Finally, matK requires different primer combinations for different taxonomic groups. Therefore, it is important to estimate the taxonomic composition of the plate prior to amplification. If the plate contains angiosperms from different genera, families, and even orders, the combination of matK-KIM1R/ matK-KIM3F is the optimal first choice. This primer combination was recommended as the first choice by the CBOL Plant Working Group (13), and has been confirmed in thousands of PCRs from floristic projects in biodiversity hot spots. Those samples, which failed in the first round, are collected into a new plate, and subjected to a second round of PCR using the primers matK390f/matK1326r (failure tracking). Usually, the combination of these two sets of matK primers yields around 80% successful sequences in floristic projects focusing on angiosperms. However, if the project is represented by one specific group like a genus or family that does not work well with any of these primer combinations, it is best to search for appropriate primers for this group. A selection of order-specific primers has been published by Dunning and Savolainen (33). Two alternate primer pairs for gymnosperms (NY552F/NY1150R and matKpkF4/matKpkR1) are given in Table 4. Routine amplification of this region is difficult in non-seed plants and the development of primers for ferns/fern allies and bryophytes is currently underway. 28. ITS and ITS2 offer higher levels of species discrimination in some groups. However, one risk with ITS is that of fungal contamination. Even the cleanest leaf sample will likely have fungal hyphae associated with it, and in some groups this can be a serious source of contamination. For the entire ITS region, the use of the angiosperm-specific primers AB101 and AB102 can reduce this problem for flowering plants. 29. In situations where PCR is unsuccessful or patchy in its success, some optimization of PCR conditions can improve success. An often-successful first approach is to dilute the DNA ten times. DNA is often not limiting in PCR, but the extraction process occasionally does not remove all PCR inhibitors sufficiently. A dilution often reduces inhibitors to the point, where PCR can succeed. Other steps to improve success are as follows: (a) for faint or absent PCR products: a decrease in primer-annealing temperatures, an increase in primer concentration, an increase 11 DNA Barcoding Methods for Land Plants 249 in MgCl2, or an increase in the number of cycles and (b) when multibanded PCR products are produced: an increase in primer-annealing temperatures, a decrease in primer concentration, or a decrease in MgCl2. These various approaches of course depend on the initial PCR conditions one starts with; there are minimums and maximums, which, if exceeded, will usually result in failed PCR. 30. Ethidium bromide is highly toxic and a mutagen. Handling agarose gels and/or buffers containing ethidium bromide requires a designated work area isolated from other space in the lab, dedicated pipettes, and water reservoirs. Nitrile gloves should be worn when handling any components, and discarded when the PCR check is completed to avoid contamination by ethidium bromide. There are numerous alternatives to ethidium bromide that are less toxic. SYBR® Safe DNA gel stain, for example, is not classified as hazardous waste under the US federal regulations but has a comparable sensitivity to ethidium bromide. 31. The use of 5.5% trehalose in the sequencing reaction mix allows sequencing mixes to be premade in primer specific batches, aliquoted directly into plates, covered with PCR film, and frozen for future use. The mix with the primer can be stored at −20°C for up to 3 months. However, one must avoid unnecessary thawing of the mix before use, which can cause degradation of the Dye Terminator. The mix should be thawed only just prior to use. 32. The annealing temperature of the sequencing reaction can also be adjusted to the specific primer conditions. For example, for those primers which require a 50°C annealing temperature during PCR, the annealing temperature in the cycle sequencing program also can be set at 50°C. However, it is not recommended to use an annealing temperature for the sequencing reaction lower than 50°C. 33. For higher throughput, the semiautomated AutoDTR™ method from EdgeBio® (http://www.edgebio.com/catalog/ dye-terminator-removalproducts-AutoDTR™-96-c-28_1005. html) has been used at the Canadian Centre for DNA Barcoding. This sequencing cleanup method is less sensitive to PCR product concentration and allows longer high-quality reads and a further reduction of BigDye™ in the sequencing reaction due to increased sensitivity. 34. Check and maintain the proper orientation of the sequences. Sequences that are in the reverse complement orientation disrupt proper alignment, causing subsequent analytical problems if not corrected. 35. Look for odd gaps or insertions in the sequence, especially at the ends of the read. The trace file you view is a result of an 250 A.J. Fazekas et al. algorithm that interprets the fluorescence pattern and compression artifacts often result at the ends of a read. These are particularly problematic with multiple repeats, where the algorithm may have difficulty distinguishing whether there are 4 or 5 Adenine nucleotides in a row for example. 36. Check for an open reading frame (ORF) in coding regions, such as rbcL and matK. This is an important first step in identifying stop codons and errors in editing. In the correct reading frame, there should be no internal stop codons present in coding regions. There are two potential reasons for observing stop codons in the sequence. First, it may be a real stop codon indicating that the region that has been sequenced is no longer functional, and likely a pseudogene. These cases are quite rare in both rbcL and matK genes, and usually observed in only a few specific groups of plants. Second, it can be evidence of errors due to editing, e.g., miscalls, or a frameshift of the ORF due to missed or extra base calls. Each instance needs to be carefully investigated and addressed. 37. Check for chimeric sequences (a result of nonspecific PCR amplification from multiple templates), bacterial, virus, fungal, and algal contaminations, and remove them from the consensus sequence. Acknowledgments We are grateful to Michelle Hollingsworth, Alan Forrest, David Erickson, and John Kress for helpful comments on this manuscript. References 1. Govaerts R (2001) How many species of seed plants are there? Taxon 50:1085–1090 2. Thorne RF (2002) How many species of seed plants are there? Taxon 51:511–522 3. Scotland RW, Wortley AH (2003) How many species of seed plants are there? Taxon 52:101–104 4. Paton AJ, Brummitt N, Govaerts R, Kehan H, Hinchcliffe S, Allkin B, Lughadha EN (2008) Towards target 1 of the global strategy for plant conservation: a working list of all known plant species: progress and prospects. Taxon 57: 602–611 5. Fazekas AJ, Burgess KS, Kesanakurti PR et al (2008) Multiple multilocus DNA barcodes from the plastid genome discriminate plant species equally well. PLoS One 3:e2802 6. Chase MW, Salamin N, Wilkinson M et al (2005) Land plants and DNA barcodes: shortterm and long-term goals. Phil Trans Lond B 360:1889–1895 7. Kress WJ, Wurdack KJ, Zimmer EA et al (2005) Use of DNA barcodes to identify flowering plants. Proc Natl Acad Sci USA 102:8369–8374 8. Newmaster SG, Fazekas AJ, Ragupathy S (2006) DNA barcoding in the land plants: an evaluation of rbcL in a multigene tiered approach. Can J Bot 84:335–341 9. Cowan RS, Chase MW, Kress WJ, Savolainen V (2006) 300,000 species to identify: problems, progress, and prospects in DNA barcoding of land plants. Taxon 55:611–616 10. Kress WJ, Erickson DL (2007) A two-locus global DNA barcode for land plants: the coding 11 11. 12. 13. 14. 15. 16. 17. 18. 19. 20. 21. 22. 23. rbcL gene complements the non-coding trnHpsbA spacer region. PLoS One 2:e508 Chase MW, Cowan RS, Hollingsworth PM, van den Berg C, Madriñan S, Petersen G, Seberg O, Jørgsensen T, Cameron KM, Carine M, Pedersen N, Hedderson TAJ, Conrad F, Salazar GA, Richardson JE, Hollingsworth ML, Barraclough TE, Kelly L, Wilkinson M (2007) A proposal for a standardised protocol to barcode all land plants. Taxon 56:295–299 Newmaster SG, Fazekas AJ, Steeves RAD, Janovec J (2008) Testing candidate plant barcode regions with species of recent origin in the Myristicaceae. Mol Ecol Notes 8:480–490 CBOL Plant Working Group (2009) A DNA barcode for land plants. Proc Natl Acad Sci USA 106:12794–12797 Álvarez I, Wendel JF (2003) Ribosomal ITS sequences and plant phylogenetic inference. Mol Phylogenet Evol 29:417–434 Bailey CD, Carr TG, Harris SA, Hughes CE (2003) Characterization of angiosperm nrDNA polymorphism, paralogy, and pseudogenes. Mol Phylogenet Evol 29:435–455 Moeller M (2000) How universal are universal rDNA primers? A cautionary note for plant systematists and phylogeneticists. Edinburgh J Bot 57:151–156 Gonzalez MA, Baraloto C, Engel J et al (2009) Identification of Amazonian trees with DNA barcodes. PLoS One 4:e7483 Razafimandimbison SG, Kellogg EA, Bremer B (2004) Recent origin and phylogenetic utility of divergent ITS putative pseudogenes: a case study from Naucleeae (Rubiaceae). Syst Biol 53:177–192 Okuyama Y, Kato M (2009) Unveiling cryptic species diversity of flowering plants: successful biological species identification of Asian Mitella using nuclear ribosomal DNA sequences. BMC Evol Biol 9:105 Chen S, Yao H, Han J et al (2010) Validation of the ITS2 region as a novel DNA barcode for identifying medicinal plant species. PLoS One 5:e8613 Taberlet P, Coissac E, Pompanon F et al (2007) Power and limitations of the chloroplast trnL (UAA) intron for plant DNA barcoding. Nucleic Acids Res 35:e14 Fazekas AJ, Kesanakurti PR, Burgess KS et al (2009) Are plant species inherently harder to discriminate than animal species using DNA barcoding markers? Mol Ecol Res 9:130–139 Hollingsworth PM, Graham SW, Little DP (2011) Choosing and using a plant DNA bar- DNA Barcoding Methods for Land Plants 24. 25. 26. 27. 28. 29. 30. 31. 32. 33. 34. 35. 36. 37. 251 code. PLoS ONE 6(5):e19254. doi:10.1371/ journal.pone.0019254 Kress WJ, Erickson DL, Jones FA et al (2009) Plant DNA barcodes and a community phylogeny of a tropical forest dynamics plot in Panama. Proc Natl Acad Sci USA 106:18621–18626 Burgess KS, Fazekas AJ, Kesanakurti PR et al (2011) Discriminating plant species in a local temperate flora using the rbcL + matK DNA barcode. Methods Ecol Evol 2:333–340 Ivanova NV, Fazekas AJ, Hebert PDN (2008) Semi-automated, membrane-based protocol for DNA isolation from plants. Plant Mol Biol Rep 26:186–198 Colpaert N, Cavers S, Bandou E et al (2005) Sampling tissue for DNA analysis of trees: trunk cambium as an alternative to canopy leaves. Silvae Genet 54:265–269 Tate JA, Simpson BB (2003) Paraphyly of Tarasa (Malvaceae) and diverse origins of the polyploid species. Syst Bot 28:723–737 Sang T, Crawford DJ, Stuessy TF (1997) Chloroplast DNA phylogeny, reticulate evolution and biogeography of Paeonia (Paeoniaceae). Am J Bot 84:1120–1136 Hamilton MB (1999) Four primer pairs for the amplification of chloroplast intergenic regions with intraspecific variation. Mol Ecol 8:513–525 Cox CJ, Goffinet B, Shaw AJ, Boles SB (2004) Phylogenetic relationships among mosses based on heterogeneous Bayesian analysis of multiple genes from multiple genomic compartments. Syst Bot 29:234–250 Fazekas AJ, Steeves R, Newmaster SG (2010) Improving sequencing quality from PCR products containing long mononucleotide repeats. Biotechniques 48:277–285 Dunning LT, Savolainen V (2010) Broad-scale amplification of matK for DNA barcoding plants, a technical note. Bot J Linn Soc 164:1–9 Soltis PS, Soltis DE, Smiley CJ (1992) An rbcL sequence from a Miocene Taxodium (bald cypress). Proc Nat Acad Sci USA 89:449–451 Levin RA, Wagner WL, Hoch PC, Nepokroeff M, Pires JC, Zimmer EA, Sytsma KJ (2003) Family-level relationships of Onagraceae based on chloroplast rbcL and ndhF data. Am J Bot 90:107–115 Fofana B, Harvengt L, Jardin P, Baudoin JP (1997) New primers for the polymerase chain amplification of cpDNA intergenic spacers in Phaseolus phylogeny. Belg J Bot 129:118–122 Cuenoud P, Savolainen V, Chatrou LW, Powell M, Grayer RJ, Chase MW (2002) Molecular 252 A.J. Fazekas et al. phylogenetics of Caryophyllales based on nuclear 18S rDNA and plastid rbcL, atpB, and matK DNA sequences. Am J Bot 89: 132–144 38. White TJ, Bruns T, Lee S, Taylor J (1990) Amplification and direct sequencing of fungal ribosomal RNA genes for phylogenetics. In: Innis MA, Gelfand DH, Sninsky JJ, White TJ (eds) PCR protocols: a guide to methods and applications. Academic Press, New York, pp 315–322 39. Sun Y, Skinner DZ, Liang GH, Hulbert SH (1994) Phylogenetic analysis of Sorghum and related taxa using internal transcribed spacers of nuclear ribosomal DNA. Theor Appl Gen 89:26–32
© Copyright 2026 Paperzz