NimbleGen Sequence Capture Guidelines for Working with Developer Products Introduction NimbleGen Sequence Capture products, including microarrays and solution-based SeqCap EZ probe libraries, are tested and validated for use with human genome reference sequences (e.g. hg18, hg19) and human DNA samples. However, many researchers are interested in developing their own custom applications and protocols for use with the core Sequence Capture technology and materials. Some examples include the following: Sequence Capture of genomic DNA targets from non-human, traditional genetic research organisms (e.g. mouse, zebrafish, Drosophila, C. elegans, maize) Sequence Capture of genomic DNA targets from non-human organisms with less wellcharacterized genomes (e.g. cetaceans, poplar trees, mosquitoes) Sequence Capture of non-nuclear nucleic acid targets from humans or other organisms (e.g. cDNA, mitochondrial DNA, chloroplast DNA, viruses) Sequence Capture of human genomic DNA targets using probes designed from non-reference sequences (e.g. individual genome sequences) Sequence Capture of nucleic acid targets of any type using non-standard capture probe design criteria requested by the researcher To accommodate the needs for these custom applications from researchers, Roche NimbleGen offers the following Developer products: Sequence Capture Developer 385K/2.1M Arrays SeqCap EZ Developer Libraries Because protocols for use in non-standard applications like those listed above have not been tested by NimbleGen researchers, we cannot provide users of Developer products with the same performance assurances or level of technical support that we can for customers who use our standard Sequence Capture products in fully validated applications. To help address the needs of the broader scientific community, this document provides some general guidelines intended to help Developer users achieve better results in their novel research applications. Recommendations are provided for the following phases of a Sequence Capture experiment: Selecting targets and designing probes Performing Hybridization Performing Enrichment QC by qPCR For life science research only. Not for use in diagnostic procedures. Selecting Capture Targets and Designing Probes The most important factor in designing a successful Sequence Capture Developer experiment is the quality of the input sequence used to select the target and capture probes. Even when working with a well-characterized dataset like the human genome, there are still gaps in the reference assembly and regions that are subject to hypervariability and copy number variation that can introduce problems at the design stage. These problems increase significantly when creating designs for species where the amount and quality of the beginning sequence data are much lower. NimbleGen can design Sequence Capture Developer probes from a complete genome sequence or from incomplete sequences such as genome contigs, genome scaffolds, and even expressed sequence tag (EST) and mRNA datasets. However, the ability to determine whether any given probe is contiguous along its length with the actual genome target, or whether a probe is targeting a single copy sequence rather than a repetitive element, can be compromised when using such data. Although we do not allow probes from multiple species to be included on one design, when the sequence of the targeted species is minimal it is sometimes possible to use the more complete sequence of an evolutionarily related species, if available, to determine whether probes selected for the targeted species are likely to be unique or repetitive. One approach that has been used to assist in the Sequence Capture Developer design process is to perform a preliminary Comparative Genomic Hybridization (CGH) experiment targeting the region intended for capture. The results of the CGH experiment can provide useful data on the hybridization properties of individual probes and permit design modifications before attempting the capture experiment. Similarly, one may consider an iterative design process in which data from a pilot Sequence Capture experiment can be used to create an empirically optimized second-generation design. If you are considering either option, it will be important to discuss your plans in advance with NimbleGen scientists so that they can provide additional recommendations to facilitate the process. When the planned Sequence Capture Developer experiment entails using a previously untested species or capture design, or the input sequence used to select the target and probes is suspect or minimal, it is always recommended that an initial, small scale, pilot experiment be performed before proceeding with larger scale studies. Performing Hybridization The inclusion of human Cot-1 DNA as a blocking reagent in a human sequence capture (microarray or SeqCap EZ probe library-based) experiment reduces the capture of non-targeted sequences. Part of this reduction is likely due to the blocking of non-specific DNA binding by the glass (microarray) or the biotin/streptavidin (SeqCap EZ probe library) complexes, but the remainder is due to competitive inhibition of secondary capture. Secondary capture occurs when a capture probe hybridizes to its intended target on a sample library DNA fragment, but a different part of that captured fragment contains repetitive sequences. Those repetitive sequences, since they are single-stranded, can act as secondary capture ‘‛probes‛ and subsequently hybridize to any DNA fragment in the sample library containing a homologous repetitive sequence. For Sequence Capture Developer experiments targeting species for which the appropriate Cot-1 DNA is commercially available (e.g. mouse, rat, cow), we recommend substituting that species’ Cot-1 DNA in place of human Cot-1 in the hybridization reaction. For experiments targeting species for which the appropriate Cot-1 DNA is not commercially available, it is possible to prepare your own custom Cot-1 DNA. Examples of published methods include the following: Britten, et al. (1974) Analysis of repeating DNA sequences by reassociation. Methods Enzymol., 29, 363–418. Zwick et al. (1997). A rapid procedure for the isolation of Cot-1 DNA from plants. Genome, 40(2): 138–142. In some cases, it may be possible to obtain adequate results by using Cot-1 DNA derived from a closely related species that shares similar repetitive sequences. The necessity of including a species-specific Cot1 blocker in the hybridization reaction will vary depending on the frequency and type of genomic 2| NimbleGen Sequence Capture: Guidelines for Working with Developer Products v1.0 repetitive elements in the genome of the species targeted for capture. The absolute amount of Cot-1 DNA that is required to reduce non-specific or secondary capture in a Developer experiment may differ from the amount of Cot-1 recommended in human sequence capture protocols due to the frequency of different repeat types, their hybridization kinetics and overall genome complexity. In some cases, it may be sufficient to use a general nucleic acid blocker (e.g. sonicated yeast genomic DNA, herring sperm DNA) to reduce non-specific capture, or no blocker at all. Performing Enrichment QC by qPCR This section provides general guidelines for developing novel quantitative PCR (qPCR) assays to estimate Sequence Capture enrichment from non-human genomes using custom Sequence Capture Developer Arrays or SeqCap EZ Developer probe libraries. Use these guidelines in combination with the NimbleGen Sequence Capture User’s Guide that is most similar to your intended application. qPCR provides a fast, reliable and low-cost means of estimating the success of a Sequence Capture experiment prior to sequencing captured DNA fragments. All NimbleGen Sequence Capture arrays and SeqCap EZ probe libraries include capture probes targeting a set of human control loci. Following a capture experiment, standardized qPCR assays are used to compare the pre-captured amplified sample library with the captured amplified sample library to estimate the relative enrichment at these control loci and to detect potentially failed capture experiments. Because the utility of this particular set of control loci and qPCR assays is limited to experiments with human samples, we strongly recommend the design and implementation of similar, species specific, qPCR assays and controls for use in custom Developer experiments by following the general guidelines provided below: Step 1. Identifying Candidate Control Loci for Your Species 1. We recommend developing at least 4 control loci/qPCR assays for use with NimbleGen Sequence Capture Developer experiments. Use of greater than 4 control loci may produce more accurate and consistent results, but may be impractical for low-cost and/or high-throughput applications. 2. From the entire target for which capture probes were successfully designed, choose a subset that is representative of the larger target in both size and G+C content. Choose targets that are less likely to be affected by copy number variation or population genetic heterogeneity. 3. Keep in mind that of the loci initially chosen, not all may have the optimal probe coverage required for use as a control locus, or the sequence that permits the design of effective qPCR assays. You may have to begin with an initial set of >20 targets to ultimately obtain 4 useful control loci. Step 2. Developing qPCR Assays Specific to the Candidate Control Loci 1. Design qPCR assays for each of the selected loci. Either intercalating (e.g. SYBR Green) or probebased (e.g. TaqMan® probes) assays may be developed, but the inherent advantages and disadvantages of both types should be considered. Use similar assay design parameters (e.g. primer melting temperatures, product lengths) to ensure that the resulting assays can be performed under the same reaction conditions. 2. Test the qPCR assays. The assays should be: Specific (i.e. produce a single peak at the predicted melting temperature) Consistent (i.e. produce similar results in replicated reactions and experiments) Sensitive (i.e. able to detect the target at a wide range of concentrations) We recommend testing the qPCR assays with genomic DNA of all the species and substrains that will be used in planned Sequence Capture experiments. The amount of DNA to use as template will vary depending on genome size and complexity, and should be determined empirically. For NimbleGen Sequence Capture: Guidelines for Working with Developer Products v1.0 |3 developing mammalian qPCR assays, we recommend testing each assay using 0.04 ng, 0.2 ng, 1 ng, 5 ng, and 25 ng of mammalian genomic DNA as template, in triplicate reactions. 3. Use the resulting data to rank the assays by specificity, sensitivity, and efficiency of each qPCR assay. The PCR efficiency number is used later in calculations to determine target enrichment (for more details, see the chapter on ‚Measurement of Enrichment Using qPCR‛ in one of the NimbleGen Sequence Capture User’s Guides). Discard any assays that generate multiple products or show a low efficiency (<1.5). Step 3. Evaluating Candidate Control Loci Assays 1. Candidate control loci assays should be functionally tested by performing a Sequence Capture experiment and comparing the qPCR enrichment estimates directly with sequence from the enriched library. Refer to the chapter on ‚Measurement of Enrichment Using qPCR‛ in one of the NimbleGen Sequence Capture User’s Guides for general instructions on performing control loci assays, substituting your candidate control loci for the standard set of human loci. Assays that predict high or low enrichment are both useful as long as the prediction is confirmed by the sequence analysis. Note: A Sequence Capture enrichment experiment may be simulated for the purpose of developing QC-assays, by adding known concentrations of PCR products from the candidate control loci into genomic DNA at different ratios. 2. Select a set of control loci that are likely to well represent the larger set of target loci and have a distribution of capture efficacy (i.e. some loci enrich poorly, some loci enrich very well, and most loci enrich somewhere in between). This QC assay set should be tested in multiple Sequence Capture experiments to generate dependable performance data. Alternative Method 06465528001 11/11 One possible alternative to developing qPCR assays using the approach described above is to use the Roche Universal ProbeLibrary. The probes in this library were developed from transcribed sequences and intended primarily for use in gene expression analysis, but they can be adapted for use in qPCR assays to estimate Sequence Capture enrichment. For more information about the Roche Universal ProbeLibrary, go to https://www.roche-applied-science.com/sis/rtpcr/upl/ezhome.html. For life science research only. Not for use in diagnostic procedures. NIMBLEGEN, SEQCAP and TAQMAN are trademarks of Roche. Exiqon and ProbeLibrary are registered trademarks of Exiqon A/S, Vedbaek, Denmark. SYBR is a registered trademark of Molecular Probes, Inc. All other product names and trademarks are the property of their respective owners. 4| Published by Roche NimbleGen, Inc. 504 S. Rosa Rd Madison, WI 53719 USA © 2011 Roche NimbleGen, Inc. All rights reserved. NimbleGen Sequence Capture: Guidelines for Working with Developer Products v1.0
© Copyright 2026 Paperzz