(This is a sample cover image for this issue. The actual cover is not yet available at this time.) This article appeared in a journal published by Elsevier. The attached copy is furnished to the author for internal non-commercial research and education use, including for instruction at the authors institution and sharing with colleagues. Other uses, including reproduction and distribution, or selling or licensing copies, or posting to personal, institutional or third party websites are prohibited. In most cases authors are permitted to post their version of the article (e.g. in Word or Tex form) to their personal website or institutional repository. Authors requiring further information regarding Elsevier’s archiving and manuscript policies are encouraged to visit: http://www.elsevier.com/copyright Author's personal copy Journal of Microbiological Methods 91 (2012) 38–44 Contents lists available at SciVerse ScienceDirect Journal of Microbiological Methods journal homepage: www.elsevier.com/locate/jmicmeth Improved real-time PCR estimation of gene copy number in soil extracts using an artificial reference T.J. Daniell a,⁎, J. Davidson a, C.J. Alexander b, S. Caul a, D.M. Roberts a a b Ecological Sciences, The James Hutton Institute, Invergowrie, Dundee, DD2 5DA, UK Biomathematics & Statistics Scotland, Invergowrie, Dundee, DD2 5DA, UK a r t i c l e i n f o Article history: Received 1 May 2012 Received in revised form 4 July 2012 Accepted 10 July 2012 Available online 17 July 2012 Keywords: Gene copy quantification Internal standard Quantitative PCR Relative estimation Soil ecology a b s t r a c t Application of polymerase chain reaction (PCR) techniques has developed significantly from a qualitative technology to include powerful quantitative technologies, including real-time PCR, which are regularly used for detection and quantification of nucleic acids in many settings, including community analysis where culture-based techniques are not suitable. Many applications of real-time PCR involve absolute quantification which is susceptible to inaccuracies caused by losses during DNA extraction or inhibition caused by co-extracted compounds. We present here an improvement to this approach involving the addition of an artificial internal standard, prior to nucleic acid extraction. The standard was generated by in-situ mutagenesis from an E. coli template to ensure it both did not amplify with bacterial primers used for quantification and was short enough to minimise possible interference with other analyses. By estimating gene target copies by relative abundance, this approach accounts for both loss during extraction and inhibition effects. We present a novel application of relative real time PCR, using the internal standard as a reference, allowing accurate estimation of total bacterial populations both within and across a wide range of soils and demonstrate its improvement over absolute quantification by comparison of both approaches to ester linked fatty acid analysis of the same soils. © 2012 Elsevier B.V. All rights reserved. 1. Introduction Analysis of soil communities is affected by the huge observed levels of microbial diversity (Curtis et al., 2002) combined with low level of culturability (Torsvik and Øvreås, 2002) leading to the wide application of molecular techniques for the analysis of community structure. This often involves the application of techniques such as denaturing gradient gel electrophoresis (DGGE) or terminal restriction fragment length polymorphism (T-RFLP) which rely on polymerase chain reaction (PCR) amplification of diagnostic genomic regions to target groups either phylogenetically or functionally and produce data at a presence/absence or relative abundance level (for a review see Hirsch et al., 2011). These techniques are limited to the analysis of community structure and are incapable of estimating community size which has led to the application of real-time PCR methods using gene copy number as a proxy of community size (Hermansson and Lindgren, 2001; Hirsch et al., 2011; Macdonald et al., 2011). Alternatively a wide range of techniques including lipid based methods such as phospholipid-derived fatty acid (PLFA) and ester linked fatty acid (ELFA) (Frostegård and Bååth, 1996), direct counting (Bogosian et al., 1996) or chloroform fumigation (Anderson ⁎ Corresponding author. Tel.: +44 1382 568821; fax: +44 1382 568502. E-mail addresses: [email protected] (T.J. Daniell), [email protected] (J. Davidson), [email protected] (C.J. Alexander), [email protected] (S. Caul), [email protected] (D.M. Roberts). 0167-7012/$ – see front matter © 2012 Elsevier B.V. All rights reserved. doi:10.1016/j.mimet.2012.07.010 and Domsch, 1978) have been used to estimate population size or biomass. Typically, application of real-time PCR methods in soil systems targets DNA in order to estimate target-gene copy as a proxy for population size, for example bacterial quantification using fragments of the 16S gene (Stubner, 2002), or quantifying genes to target functional groups such as the ammonium monooxygenase A subunit to estimate nitrifier population size (Okano et al., 2004). The application of real-time PCR and other molecular methods for the quantification of functional groups is reviewed in Sharma et al. (2007). The majority of applications have applied absolute methods where a standard curve is generated from a serial dilution of a clone of the target amplicon allowing estimation of copy number in reactions using the same conditions. Whilst population estimation using molecular approaches have many advantages over culture-based or microscopy-based methods, they do have limitations. They can, for example, suffer from inhibition of any PCR amplification by co-extracted compounds such as humic acids that inhibit enzymatic reactions, including PCR amplification (Jackson et al., 1997; Okubara et al, 2005; Zhou et al., 1996), or inadequacies in the DNA extraction that can lead to losses that are difficult to predict or quantify. Losses can be expected at every step of extraction including cell lysis and losses during purification, for instance where phenol:chloroform extraction and ethanol precipitation steps occur in the DNA extraction process (Mumy and Findlay, 2004; Tsai et al., 1991). For example, losses during ethanol precipitation of between 70% and 20% dependent on a number of factors including initial DNA levels, temperature and length of centrifugation (Zeugin and Hartley, 1985) and phenol:chloroform extraction efficiencies Author's personal copy T.J. Daniell et al. / Journal of Microbiological Methods 91 (2012) 38–44 of 49–59% have been estimated for community extractions (Weinbauer et al., 2002). These potential issues make accurate estimation of copy number difficult with absolute real-time PCR quantification, especially if experiments include soil from different origins or with treatments likely to alter the level of inhibitors. Many applications of real-time PCR, predominantly in relatively clean systems such as culture based experiments or large organism studies where samples are taken from single individuals, estimate expression levels of genes of interest using housekeeping genes to normalise results (Pfaffl et al., 2002). This option is not available to estimate gene copy number in soil systems since sample size, in terms of organism number or biomass, is hard if not impossible to standardise. A similar approach has been suggested for molecular approaches including PCR from soil (using DGGE), for example Park and Crowley (2005) who used Escherichia coli JM 109 cells containing plasmid which were added to soil and detected using primers specific for the plasmid in real-time PCR analyses to normalise results. Additionally the presence of the E. coli band in DGGE was used to normalise intensity across this analysis. The issue here is that the presence of E. coli genomic DNA potentially skews estimates of any target contained within this genome, complicating analysis. The ideal situation would be to utilise a normalisation target and methodology that can both assess extraction and PCR efficiency whilst not interfering in any way with normal analysis of the samples. To achieve this any spike has to be independent of any intended analysis methodology which precludes the use of a natural DNA template. Here we aimed to improve the accuracy of real-time PCR estimation of gene-copy number in DNA extracted from soil samples through the application of an artificial reference spike and relative real-time PCR. To achieve this aim we generated the spike through in-situ mutation of an E. coli DNA template for amplification of a fragment of the 16S ribosomal gene altering the three 3′ terminal bases of the PCR primer target sequences to their complementary bases. This spike would thus not be amplified by the normal primer set used to target the 16S gene or, due to its short length, primers normally used for community structure analysis allowing extracts to be used for normal analysis with no interference from the added reference spike. The mutated spike was then added to a wide range of soil types and DNA extracted and used to test the efficiency of relative real-time PCR targeting the bacterial 16S gene as a proxy for cell number or biomass using the spike as reference for relative quantification. We have further contrasted results against estimates generated using the same amplification but applying absolute quantification and comparing both with an established method to estimate soil biomass. The application of this methodology should allow far more accurate control of technical variability recently recognised by Bustin (2010) as a primary cause for inconsistency and irregularity in quantitative PCR results. 2. Methods 2.1. Soil collection/storage The location of each sample site is given in Supplementary Table 1. These sites were selected to cover a wide range of soil types. Six cores were randomly taken at each site, to a depth of ten centimetres using a grass plot sampler (Van Walt Ltd, Haslemere, UK), and combined to obtain a homogeneous sample from each site. Samples were sieved to 4 mm to remove large debris, mixed and eight replicate sub-samples taken. Four were used to allow the assessment of the accuracy of estimation of each soil type by providing measurements from each sub-sample using both real-time PCR techniques. The other set of four sub-samples were used to measure soil bacteria using the ELFA technique as a ‘gold standard’. Samples were snap-frozen in liquid nitrogen and aliquots stored at −80 °C prior to DNA or lipid extraction. 39 2.2. ELFA Ester linked fatty acids (ELFA) were extracted as described by Schutter and Dick (2000). Briefly, frozen aliquots of soil were extracted with KOH in methanol, neutralised with acetic acid and fatty acid methyl esters extracted with hexane. Following drying, fatty acid methyl esters were re-dissolved in 100 μl isohexane and analysed by gas chromatography on a Agilent 6890N Network system fitted with a flame ionization detector, Agilent HP-5 column (Agilent Technologies UK Ltd, Stockport, UK). Hydrogen was used as carrier gas and helium as the make-up gas, with the following programme: 160 °C for 2 min, an increase by 4 °C per min to 270 °C, hold for 10 min. Fatty acid 23:0 was added as an internal standard. Double bonds of fatty acids were referred to the methyl end (ω) of the molecule and the nomenclature for ELFA was used as described previously (Zogg et al., 1997). Bacterial biomass was calculated as the concentration of fatty acids that could be related to bacterial groups (Zelles, 1999). 2.3. Preparation of calibration controls and of reference spike The 16S wild type calibrator control was generated by PCR amplification of a 16S gene fragment from E. coli genomic DNA generated from a lysate generated by boiling a colony suspended in 100 μl water for 5 min. 1 μl of this preparation was then used as template for PCR using Primer-1 and Primer-2 (all primer sequences in Table 1) in a final reaction volume of 25 μl containing: 2.5 U DNA polymerase (Expand High Fidelity enzyme mix. Roche Diagnostic Systems, Burgess Hill, UK), 5 pmol of each primer, 0.5 mM of each nucleotide, 1× Expand High Fidelity buffer (Roche Diagnostic Systems, Burgess Hill, UK) and 10 μg of bovine serum albumin (BSA) (Roche Diagnostic Systems, Burgess Hill, UK). PCR was performed with a DNA Engine DYAD thermocycler (MJ Research, UK) with an initial denaturation step of 95 °C for 5 min followed by 30 cycles of 95 °C for 1 min, 54 °C for 1 min and 72 °C for 30 s.; cycling was completed by a final extension period of 72 °C for 5 min. The reference spike was generated by PCR in situ mutagenesis (Vallette et al., 1989) to form a product in which the three 3′ terminal recognition bases of both PRIMER-1 and PRIMER-2 were altered to the requisite complementary bases providing a template suitable for amplification with the Mut-F and Mut-R primers. This was performed in a two stage PCR process using 342FMut and Primer 2 in the first reaction and Mut-F and 534RMut in the second using the PCR conditions given above. 1 μl E. coli genomic DNA was used as template in round one and 1 μl of 100 fold diluted round one product in the second reaction. PCR success was assessed by 1.5% agarose gel electrophoresis. Products were cloned into pGEM-T Easy (Promega, Southampton, UK), following the manufacturer's instructions and transformed into E. coli DH10B electrocompetent cells prepared following the method of Tung and Chow (1995). Colonies were screened for mutation success by PCR with relevant PCR primers and conditions outlined above. Sequence identity of selected clones for both the mutated and wild-type products was confirmed by sequencing in a total volume of 10 μl using 1:8 dilution of BigDye® Terminator v3.1 Cycle Sequencing kit (Applied Biosystems, Warrington, UK) with vector primers directed against the SP6 or T7 promoter regions and following manufacturer's instructions. Sequencing reactions were purified Table 1 Primers used in this study, all primers supplied by Thermo Fisher (Loughborough, UK). Primer name Sequence (5′ 3′) Reference Primer-1 Primer-2 Mut-F Mut-R 342FMut 534RMut CCTACGGGAGGCAGCAG ATTACCGCGGCTGCTGG CCTACGGGAGGCAGGTC ATTACCGCGGCTGCACC CCTACGGGAGGCACGTCTGGGGAATAT ATTACCGCGGCTGGACCCACGGAGTTA (Muyzer et al., 1993) (Muyzer et al., 1993) (this study) (this study) (this study) (this study) Author's personal copy 40 T.J. Daniell et al. / Journal of Microbiological Methods 91 (2012) 38–44 by ethanol precipitation and run on an ABI Prism 3700 DNA Analyser (Applied Biosystems, Warrington, UK). Transformants with the correct insert were placed into long term storage at −80 °C in 20% glycerol and plasmid generated from a single selected colony by large scale plasmid prep using Plasmid Midiprep System (Promega, Southampton, UK), following manufacturer's instructions. Plasmid was quantified by absorption at 260 nm using a DU 640 spectrophotometer with a 50 μl cuvette (Beckman Coulter, High Wycombe, UK) to give a mass concentration and converted to copies using the following formula: ðTotal number of A=T bases in the insert þ plasmid 601Þ þ ðTotal number of G=C bases in the insert þ plasmid 618Þ ¼ molecular weight ðg=moleÞ of the plasmid þ insert: The number of copies of plasmid=μl ¼ ðμg plasmid DNA calculated from 260 nm absorption= weight of plasmid þ insertÞ Avogadro’s number After linearization with NotI (Promega, Southampton, UK), dilution series of both wild-type and mutant plasmid were used to generate standard curves and/or as reference spike for real-time experiments. included in each PCR run as a calibrator to allow comparison to the wild-type calibration curve. For relative quantification, a second PCR was performed on each soil-DNA extract using the reference spike primers (Mut-F and Mut-R) under the above amplification conditions and using mutated plasmid as a calibrator with the reference calibration curve. 2.7. Calibration curve generation Mutated and wild-type plasmids were diluted to produce standards of known copy-number to allow the generation of curves containing 109–10 1 copies per reaction in a tenfold dilution series. Triplicate replicates of these standards (mutated and wild-type) were amplified, in the presence of DNA free soil extract to give an equivalent amount of co-extracted material, using the appropriate primers and the supplied software (Roche Diagnostic Systems, Burgess Hill, UK) to produce and store calibration curves for mutated and wild-type templates in a soil extract background. 2.8. Absolute and relative estimation An additional sample of soil from Bullion field (Location: 56:27.3397N3:4.6401W, freely drained brown forest soil, same soiltype as sample 2) was processed as described in Section 2.4 and the resulting DNA-elution was digested with RQ1 RNase-Free DNase (Promega, Southampton, UK), using the manufacturer's protocol, to remove DNA. Briefly, extract was incubated at 37 °C for 30 min followed by inactivation by the addition of RQ1 stop solution and heating at 65 °C for 10 min. Extract was then phenol:chloroform extracted and precipitated with isopropanol/NaOAc as performed for DNA extraction. Efficient removal of DNA was confirmed by PCR. Following the initial testing on a selection of samples for optimal amplification (see Section 2.6 below) soil extracts were diluted 1:10 in water and 1 μl per reaction added to the PCR master mix when producing calibration curves for real time PCR to provide a soil extract background. Absolute estimation using real-time PCR is based on direct estimation of copy number by comparison with a standard curve of diluted template. Relative estimation involves the separate amplification of the target gene and a reference of known copy number, here in the form of a ‘spike’ added at the DNA extraction stage. Both were assessed using the supplied software (Roche Diagnostic Systems, Burgess Hill, UK) applying the second derivative maximum method to quantify template DNA. To estimate the number of 16S copies per gram of dry soil using absolute quantification it is necessary to take into account losses expected during DNA extraction. With the DNA extraction method applied here the following factors have to be taken into consideration: the actual dry-weight of soil extracted, the proportion of water in the soil extracted, losses involved in avoiding carry-over during the phenol:chloroform extraction and any dilution of the DNA during precipitation or post extraction. These factors were calculated to produce a multiplication factor specific for each sample. This factor was used to convert the absolute estimation of 16S copies into an estimation of 16S copies per gram dry weight of soil extracted. As it is impossible to measure the exact loss during the precipitation or the phenol:chloroform stage, we were forced to assume full recovery leading to an inevitable underestimation of copy number. Relative quantification of the target DNA was assessed using the supplied software (Roche Diagnostic Systems, Burgess Hill, UK) for advanced relative quantification using default settings comparing it to stored standard curves for both the PCR reference and target. Relative Quantification compares the levels of two different target sequences in a single sample (16S and mutated 16S spike) and expresses the final result as a ratio of these targets. This ratio is used to calculate the number of gene copies per gram dry weight soil using (Spike copies ×ratio)/dry weight soil added. 2.6. Real-time cycling conditions 2.9. Statistical analysis Amplification of 16S DNA and mutated reference plasmid was performed using Roche Lightcycler 480 SYBR Green I master Kit (Roche Diagnostic Systems, Burgess Hill, UK) following manufacturer's instructions. Following initial tests 2 μl of a 1/10 dilution template DNA was amplified using a Lightcycler 480 (Roche Diagnostic Systems, Burgess Hill, UK) and the following conditions: an initial denaturing step at 95 °C for 15 min was followed by 40 cycles of 95 °C for 10 s, 54 °C for 10 s, 72 °C for 10 s and a single acquisition step at 81 °C for 5 s. This was followed by a melt curve: 95 °C for 10 s, 65 °C for 15 s then heating to 95 °C at a ramp rate of 0.1 °C/s, with continuous acquisition. A reaction containing 107 copies of the wild-type plasmid was The data to be analysed comprised measurements of bacterial population size on samples of soil taken from locations with differing soil types. Four sub-samples were taken from each location's material and, for each of these sub-samples, a measurement of copy number per gram d.w. calculated using both the absolute and relative real-time PCR techniques. These eight estimates from fourteen locations gave a total of 112 real-time PCR measurements. Four further sub-samples from each location were used to provide measurements of soil bacteria using ELFA. The mean of these four values at each location provided 14 values used as a gold standard against which comparisons were made. Both ELFA and real-time PCR data were transformed to log base 10 to 2.4. DNA extraction DNA was extracted from soil samples by the method of Deng et al. (2010). Briefly, 1 g of soil was suspended in a solution of 0.12 M Na2HPO4 1% SDS to form a slurry, 1 ml of which was subjected to bead beating using 1 mm glass beads (TissueLyser, Qiagen, Hilden, Germany). The resultant suspension was clarified by centrifugation and the supernatant extracted with phenol:chloroform and chloroform. DNA was precipitated using isopropanol and sodium acetate and further purified with polyvinylpolypyrrolidone using a spin column. Mutated reference-spike was added to the lysis buffer prior to addition to soil to give a final spike of 109 copies per extracted sample. 2.5. Production of DNA-free soil extract Author's personal copy T.J. Daniell et al. / Journal of Microbiological Methods 91 (2012) 38–44 stabilise variance prior to analysis. Initially separate tests of the relationship of each real-time PCR measurement versus ELFA were performed using Analysis of Variance (ANOVA). Measurements from each technique were analysed with a term fitted for ELFA as a covariate and resolving the residual sum of squares into lack of fit and pure error (Draper and Smith, 1998). Deviations among the four copy number values for the same soil about their mean represented ‘pure error’ and were the result of variation within the same experimental material and a lack of repeatability of the measurement procedure. Deviations of the observed mean copy number for each soil sample from the line fitted to the covariate, ELFA, were referred to as ‘observed lack of fit’ and measured the goodness of fit of this line. This lack of fit variation gave a measure of validity of the technique in its agreement with the assumed ‘gold standard’ of the ELFA values. The estimated variances for pure error and observed lack of fit were then compared between the two real-time PCR approaches with an F-test to determine if there were any significant differences between the two molecular quantification approaches. Since the measurements of the absolute and relative techniques were obtained from the same material in each of the four technical replicates for each soil sample, these did not provide independent estimates of the pure error variance. Similarly, there was no independence in the estimates of variance of the soil sample means about their respective fitted lines. This problem was addressed by using a mixed modelling approach on the combined datasets. In this approach pure error had the same interpretation as with the ANOVAs; however the observed lack of fit variation was decomposed into contributions due to the pure error and the further variation of soil sample means not attributable to pure error ‘underlying lack of fit’. Both pure error and underlying lack of fit were modelled as random effects but with covariance structures chosen in order to contain the relevant variance components and to address the lack of independence in the samples. In this way, we could formally test for differences in the variance components between the two techniques. Additional benefits of this approach were formal statistical tests for differences in the fixed effects of mean copy number recovered between the techniques and also differences in the slope of the fitted linear relationship to ELFA. Three models were fitted. The full model had, within each random effect, separate variance components estimated for each technique as well as a covariance term. Two nested models could then be fitted, each testing one of the two hypotheses: that there is no difference between absolute and relative techniques in (i) the pure error variance components or (ii) the underlying lack of fit variance components. In each sub-model this was achieved by constraining the variance components for the two techniques to be equal. The difference in deviance could then be tested against that of the full model and its statistical significance determined against a chi-squared distribution with 1 degree of freedom since the number of parameters in the model for random variation differed by one. The ANOVAs were fitted using GenStat for Windows, 14th Edition (VSN International Ltd., Hemel Hempstead, UK) as were the mixed models using the Residual Maximum Likelihood (REML) facility. 41 Griffiths et al., 2007). ELFA was performed on each soil type to obtain an estimation of total biomass based on lipid content and this data was used as a baseline to assess the performance and accuracy of real-time data. There was excellent reproducibility in lipid data over four replicates from each soil sample (Table 4), sufficient to justify the amalgamation of the replicates into a mean-value to investigate the variability within and between the two real-time quantification approaches and confirming the uniformity of the replicated soil samples. There was significant variation between soils in biomass estimation by ELFA (see Table 4), ranging from 10.20 to 623.29 nmol g−1 dry weight soil (dws). Variation in biomass related to bacterial population sizes has been reported by others. For example, Hinojosa et al. (2005) in a single soil type observed similar variation when comparing polluted, reclaimed and pristine soils with both ELFA and PLFA methods with extracted ELFA between 22 and 184 nmol g−1 dws with pristine soils exhibiting concentrations at the higher end of the range. There are few studies comparing across soil types; Frostegård and Bååth (1996) analysed a range of soils from forest (Spruce and Beach), grassland, garden and arable fields and using marker PLFA representing the bacterial populations estimated biomass to be between 164 and 1739 nmol g−1 organic matter in the different soils. Taking account of the measured % organic matter in each of these soils, these values convert to 34.4– 516.1 nmol g −1 dws. These estimates are within the range determined by this study but direct comparison is difficult due to their application of PLFA as opposed to ELFA techniques. Additionally, Frostegård and Bååth (1996) could find no correlation either between the amounts of bacterial biomass as estimated by PLFA and pH or between the amounts of bacterial PLFA and the organic matter content of the soils studied similar to the patterns found here. 3.2. Real-time PCR Calibration curves for both the target and spike were produced using a linearised plasmid containing a PCR fragment of either E. coli 16S or a mutated version respectively. The PCR efficiency, in a DNA-free soil extract background, of both the wild-type and mutated PCR targets was high confirming the robustness of both PCRs. The calibration curves for the mutated template was linear over nine orders of magnitude (101–109), the calibration curves for the wild-type template was linear over eight orders of magnitude (102–10 9). This difference is likely due to absence of template from the PCR kit as the variation did not increase with decreasing template copy number. The sensitivity of the calibration curves reliably attained 102 copies per reaction (Table 2). 3.3. Efficiency of recovery of DNA from soil as estimated by real-time PCR A number of groups have used real-time or competitive PCR to investigate the efficiency of recovery of DNA from soil or sediment extractions (Table 3). These studies used a range of added targets to estimate the efficiency of recovery and estimation. Targets include genomic DNA, bacterial cultures containing a high copy number plasmid and bacterial cells seeded into sterilised soil samples, and vary in extraction methodologies 3. Results and discussion 3.1. ELFA To assess the suitability of absolute and relative quantification of 16S copy number to estimate soil bacterial populations it was necessary to compare these molecular-based estimations against an established technique. The use of ELFA to estimate total bacterial populations in soil has previously been established (Schutter and Dick, 2000). ELFA analysis was used in preference to the more routinely employed phospholipid fatty analyses (PLFA) because ELFA analysis is more rapid and both PLFA and ELFA approaches are comparable in their ability to discriminate between microbial communities (Hinojosa et al., 2005; Table 2 Showing the mean crossing point (±SE) for the wild-type and the mutated calibration curves generated from linearised plasmid containing wild type or mutated insert respectively. Means are based on three replicates for each concentration. Copies Wild-type mean Cp Mutated mean Cp 108 107 106 105 104 103 102 101 10.30 ± 0.04 13.74 ± 0.05 16.01 ± 0.01 20.21 ± 0.04 22.46 ± 0.16 25.95 ± 0.03 28.28 ± 0.05 27.87 ± 0.09 9.63 ± 0.15 13.48 ± 0.06 17.30 ± 0.26 20.66 ± 0.17 24.43 ± 0.01 27.65 ± 0.01 30.90 ± 0.29 35.0 ± 0.01 Author's personal copy 42 T.J. Daniell et al. / Journal of Microbiological Methods 91 (2012) 38–44 Table 3 Comparison of DNA extraction/detection efficiencies reported by previous groups and this study for different soil types. Details of soil types (this study) are given in Supplementary Table 1. Efficiency (this study) is calculated as average % of spike recovered over 4 replications and standard deviation. Reference Extraction method Quantification Target method Lee et al., Chemical 1996 Competitive quantitative PCR Competitive touchdown PCR Rose et al., 2003 Chemical Wizard® Tsai et al., Chemical 1991 Mumy and Findlay, 2004 This study Membrane hybridisation UltraClean QC-PCR FastDNA SoilMaster Chemical Sybr green real-time PCR 100 ng−1 E. coli DNA Soil type Sterilised pasture soil 180 ng g−1 Sterilised E. coli DNA silt loam 1.8 pg g−1 E. coli DNA 1.6 × 109 cells Sterile southern California soil Lake E. coli + sediment pBR322-λ 9 1 × 10 cells 1 × 109 linear Soil 1 DNA Soil 2 Soil 3 Soil 4 Soil 5 Soil 6 Soil 7 Soil 8 Soil 9 Soil 10 Soil 11 Soil 12 Soil 13 Soil 14 Efficiency of recovery (%) 3.68 ± 0.11 34 ± 7 6.1 ± 1.3 60 ± 5 14.9 ± 16.0 28.3 ± 10.5 2.4 ± 0.1 10.93 ± 2.87 9.13 ± 5.29 4.10 ± 1.01 7.08 ± 6.7 2.77 ± 1.9 5.84 ± 2.04 4.09 ± 2.44 11.48 ± 5.40 5.64 ± 4.39 5.67 ± 2.93 13.66 ± 4.44 2.99 ± 1.21 2.61 ± 2.17 8.19 ± 3.51 with one study comparing several methods. The majority of these studies do not speculate on the likely source of variation in the estimation or cause of losses although the choice of extraction method (Mumy and Findlay, 2004) and/or the amount of added target (Rose et al., 2003) can clearly play a role. Soil, when compared to cultures, as a starting material also negatively affects both the recovery and variability of extraction of both DNA and RNA (Mumy and Findlay, 2004; Okubara et al., 2005; Tsai et al., 1991) highlighting the need to understand issues with variability or performance of extraction and subsequent PCR inhibition both between and within soil types. Our studies found recovery efficiencies of the spike of between 2.61% and 13.66%, dependent on, but with no clear link to, soil-type, which is of a similar range to recoveries observed in other studies (Table 3). Analysis of our spike-recovery data suggests that limitations in the efficiency of the soil extraction driven by differential effects of different soils play a part. Fig. 1 shows a significant variation in recovery of spike between soil types (pb 0.001) but suggests that variation was not driven by different perceived organic matter content between soil types. For example the two peat soils (13 and 14) vary significantly. A high degree of variation is found in the recovery of this internal standard for the wide range of soil-types included in this study as well as the high level of variation in recovery between replicates. There was also no correlation between efficiency of recovery and the amount of soil extracted (data not shown). This difference in performance is likely due to variation in the distribution of inhibitors between aliquots or variation in the performance of the extraction between aliquots. The observed variability in recovery as estimated by an absolute real time PCR reinforces the improved performance of the relative tool discussed below since, unlike an unspiked soil, a fixed amount of template for the test PCR reaction was added to the extraction removing the possibility that variation was driven by natural variation in population size. Where applying absolute quantification we attempted to account for all measurable losses in the DNA extraction process and yet this variation of DNA recovery efficiency demonstrates the limitations of simple absolute-quantification for estimation of DNA copies present in samples of different soil-types. For relative quantification this variation is countered by the internal standard used, as demonstrated by the significantly reduced variation between replicates. This reduced variation allows a far more precise estimation of population size. 3.4. Estimation of soil-bacterial population size ELFA and real-time PCR data were converted to log base 10 to ensure residual variance remained constant prior to comparing the absolute (Fig. 2a) and relative (Fig. 2b) real-time quantifications against the ELFA data. The results of the separate ANOVAs for each technology are shown in Table 5 and demonstrate that there was evidence of significant differences between both lack of fit and pure error variation. Also shown in Fig. 2a and b are the fitted lines from the REML analysis using the full model with separate variance components estimated for each of the respective estimation techniques. This analysis demonstrates the improved performance of the relative technique on a number of criteria. Total copy number per gram d.w. recovered were at least an order of magnitude higher for all soil types, ranging from a factor of 11.4 for soil type 6 to a factor of 23.0 for soil type 3 (intercept for the relative line was 8.61 compared to 7.70 for absolute). The slope of the fitted relationship for the relative technique was also significantly steeper (pb 0.001) — slope parameter was 0.91 for relative technique and 0.75 for absolute. This steeper slope demonstrates the greater ability to discriminate amongst soil types of the relative technique compared to absolute estimates. For example, the mean detected copy-number for samples of soil type 3 was greater than for soil type 1 when using relative estimation (which agrees with the gold standard) but this ranking is reversed when using the absolute approach. These differences are likely due to differences in extracted inhibitors between soil types. Since REML estimates variance components for (i) pure error, or, the repeatability of each technique and (ii) underlying lack of fit we fitted models which tested for the statistical significance of any difference in performance. The pure error variance component was tested for significant differences between the techniques by comparing deviance between models where pure error was identical (Model II in Table 6) and where different variances were fitted (Full Model, Table 6). This showed a highly significant difference (pb 0.001) with the best fit model estimating the absolute technique's variance component at 0.173 compared to 0.011 for the relative; there was also a high correlation of 0.68 between the estimates from the two techniques performed on the same technical replicates. This evidence shows the relative Fig. 1. Efficiency of recovery of added template, showing the recovered copy-number (E+06) of the spike for each soil-type estimated by absolute quantification (1 × E+09 added prior to extraction). Bars indicate ±1 standard error, letters indicate significant classes generated by post-hoc testing using a Fisher LSD test following ANOVA where soil types was significantly different (p b 0.001). Soil types as described in Supplementary Table 1. Author's personal copy T.J. Daniell et al. / Journal of Microbiological Methods 91 (2012) 38–44 43 Fig. 2. Fitted linear terms from REML analysis showing: (a) copy number from 16S absolute quantification and (b) copy number from 16S relative quantification versus the same ELFA measurements. For the four replicates from each soil type, ELFA values were set to the mean ELFA value. All axes are scaled on log base 10. technique's ability to achieve more consistent results on repeated analysis of the same soil samples. In the comparison of sample mean squares for observed lack of fit in the ANOVAs, although the stratum variances were highly different (as shown in Table 5, F ratio was 5.33), this included a contribution from the pure error variance in the lower stratum. REML decomposed this variance and allowed the testing of the contribution from underlying lack of fit alone. This was done by testing a model with separate variance components for underlying lack of fit for the two techniques (the Full Model in Table 6) against one constrained to have the same variance component for both (Model I in Table 6). The Full Model gave variance components of 0.0766 for the absolute and 0.0197 for the relative techniques but there was no evidence for a difference when compared with Model I which fitted a single variance component of 0.0228. The reduction of ‘noise’ coupled with the more significant positive correlation between the ELFA data and the relative quantification estimation of population numbers also would allow accurate comparison of populations between soil types which would otherwise have significant error attached due to differences in efficiency of estimation. This type of analysis would not be reliable with the absolute quantification approach due to the variance between repeats of the same soil type. The lack of fit between the ELFA data and the absolute quantification makes this approach less appropriate when working across soil types. The relative quantification approach is far more suitable for this type of analysis. Table 4 Data summary of bacterial gene copy number for the soils used in this study. Estimations are by absolute and relative real-time PCR and are ranked by the relative estimate. Also shown is the estimate of ELFA. Details of soil types (this study) are given in Supplementary Table 1. Land use Soil number ELFA nmol g−1 dws mean, (%CV) Absolute copies Relative copies (108) (108) per gram mean per gram mean (%CV) (%CV) Non-arable Non-arable Arable Arable Arable Non-arable Non-arable Arable Arable Arable Non-arable Arable Arable Arable 13 14 10 11 9 12 5 6 7 8 4 2 3 1 Average 532.67 (7.1) 623.29 (3.3) 92.87 (4.7) 107.14 (6.5) 88.59 (11.4) 156.70 (7.6) 58.24 (2.7) 62.65 (6.6) 66.70 (4.2) 67.88 (6.8) 58.65 (3.7) 35.85 (3.5) 38.11 (4.5) 10.20 (15.6) 146.29 47.27 (92.9) 129.32 (47.1) 40.65 (63.4) 100.26 (52.3) 31.64 (99.4) 9.47 (99.3) 8.60 (93.9) 13.64 (53.3) 12.48 (89.4) 27.67 (81.3) 12.56 (103.2) 9.33 (64.6) 3.08 (14.3) 3.43 (41.1) 32.10 1110.47 (33.9) 1030.87 (23.6) 539.17 (15.6) 415.85 (24.4) 313.27 (24.2) 306.28 (17.0) 230.82 (17.0) 205.09 (24.4) 200.55 (7.5) 192.50 (25.5) 119.35 (19.8) 90.39 (28.7) 84.89 (36.0) 26.17 (25.7) 347.55 Published estimates of bacterial population size in soil are variable, even when targeting the same gene. For example, estimates of gene copy number in agricultural soil based on 16S-targeted real-time PCR range from 6 × 108 to 2.2 × 109 copies per gram d.w. soil (Dandie et al., 2007; Okano et al., 2004). As described in Section 3.3, this range can be due to different soil-types supporting different population sizes of bacteria, effect of soil-type on efficiency of DNA extraction and/or PCR amplification or on limitations inherent in the measurement approach. The contrasting results between absolute and relative estimations observed in this study suggest that the application of relative methods may control for a number of these causes of error. Our estimations for soil-bacterial populations are 3.2× 109 and 3.5× 1010 for absolute and relative quantification respectively (see Table 4), averaged over all soil types. Our estimation by absolute quantification is higher than that published previously, due, we believe, to our having made stringent attempts to account for all known losses during the DNA extraction (Section 2.8). Our estimation by relative quantification is above the published range, due to the relative quantification approach accounting for losses and inhibitory affects and therefore allowing a far more precise estimation of population size. 4. Conclusion The application of relative quantification methods is standard in expression studies where housekeeping genes are used to control for differences in extraction efficiency or assay performance. Here we demonstrate that the addition of an artificial spike to soil extractions allows relative quantification methods to be applied in such a non-defined starting material by providing a fixed term of reference. This allows both a significant improvement in the estimation of gene copy number and a decrease in the variability in the estimation. The difference between relative and absolute methods of estimation suggests that the application of absolute measures, commonly applied in soil studies, can Table 5 ANOVA table of results for separate analyses on data from absolute and relative techniques with ELFA as covariate. Absolute Relative Source of variation df ss ms df Soil sample stratum log10(ELFA) Fitted line Lack of fit ss ms F-ratio Sig. 13 1 12 11.745 5.998 5.746 0.903 5.998 0.479 13 1 12 9.886 8.808 1.078 0.760 8.808 0.090 5.33 0.003 Technical rep stratum Pure error 42 Total 55 7.248 18.993 0.173 42 55 0.465 10.351 0.011 15.58 b0.001 Author's personal copy 44 T.J. Daniell et al. / Journal of Microbiological Methods 91 (2012) 38–44 Table 6 Comparison of models for testing equality of variances between absolute and relative techniques. The full model has no constraints on its parameters. Variance components constrained to be equal (shown in italics) are: soil sample for Model I and the technical replicates for Model II. Variance components are labeled vaa for the variance of the absolute technique, vrr the variance for the relative and var the covariance between the two. LRS is the likelihood ratio test statistic (equal to the change in deviance relative to the full model), df = degrees of freedom and Sig. = significance. Soil sample Technical rep Model vaa vrr var vaa vrr var Deviance LRS df Sig. Full I II 0.0766 0.0228 0.0973 0.0197 0.0228 0.0040 0.0189 0.0098 0.0194 0.173 0.199 0.090 0.011 0.012 0.090 0.029 0.033 0.028 −206.62 −204.05 −125.03 2.57 81.59 1 1 0.109 b0.001 underestimate the gene copy count by factors ranging from 11.4 to 23.0. Additionally, the generation of an artificial spike negates concerns relating to erroneous detection of added spike as part of the soil community under analysis. Supplementary data to this article can be found online at http:// dx.doi.org/10.1016/j.mimet.2012.07.010. Acknowledgements This work was financially supported by the Scottish Government Rural and Environment Science and Analytical Services Division. We would like to thank both internal and external reviewers for comments which have improved the manuscript. References Anderson, J.P.L., Domsch, K.H., 1978. A physiological method for the quantitative measurement of microbial biomass in soils. Soil Biol. Biochem. 10, 215–221. Bogosian, G., Sammons, L.E., Morris, P.J.L., O'Neil, J.P., Heitkamp, M.A., Weber, D.B., 1996. Death of the Escherichia coli K-12 strain W3110 in soil and water. Appl. Environ. Microbiol. 62, 4114–4120. Bustin, S.A., 2010. Why the need for qPCR publication guidelines? The case for MIQE. Methods 50, 217–226. Curtis, T.P., Sloan, W.T., Scannell, J.W., 2002. Estimating prokaryotic diversity and its limits. Proc. Natl. Acad. Sci. U. S. A. 99, 10494–10499. Dandie, C.E., Miller, M.N., Burton, D.L., Zebarth, B.J., Trevors, J.Y., Goyer, C., 2007. Nitric oxide reductase-targeted real-time PCR quantification of denitrifier populations in soil. Appl. Environ. Microbiol. 73, 4250–4258. Deng, H., Zhang, B., Yin, R., Wang, H.-L., Mitchell, S.M., Griffiths, B.S., Daniell, T.J., 2010. Long-term effect of re-vegetation on the microbial community of a severely eroded soil in sub-tropical China. Plant Soil 328, 447–458. Draper, N.R., Smith, H., 1998. Applied Regression Analysis, 3rd edition. John Wiley & Sons, Inc., New York, U.S.A. Frostegård, A., Bååth, E., 1996. The use of phospholipid analysis to estimate bacterial and fungal biomass in soil. Biol. Fertil. Soils 22, 59–65. Griffiths, B.S., Caul, S., Thompson, J., Birch, A.N.E., Cortet, J., Andersen, M.N., Krogh, P.H., 2007. Microbial and microfaunal community structure in cropping systems with genetically modified plants. Pedobiologia 51, 195–206. Hermansson, A., Lindgren, P.-E., 2001. Quantification of ammonia-oxidizing bacteria in arable soil by real-time PCR. Appl. Environ. Microbiol. 67, 972–976. Hinojosa, M.B., Carreira, J.A., Garcia-Ruíz, R., Dick, R.P., 2005. Microbial response to heavy metal-polluted soils: community analysis from phospholipids-linked fatty acids and ester-linked fatty acids extracts. J. Environ. Qual. 34, 1789–1800. Hirsch, P.R., Mauchline, T.H., Clark, I.M., 2011. Culture-independent molecular techniques for soil microbial ecology. Soil Biol. Biochem. 42, 878–887. Jackson, C.R., Harper, J.P., Willoughby, D., Rosen, E.E., Churchill, P.F., 1997. A simple, efficient method for separation of humic substances and DNA from environmental samples. Appl. Environ. Microbiol. 63, 4993–4995. Lee, S.Y., Bollinger, J., Bezdicek, D., Ogram, A., 1996. Estimation of the abundance of an uncultured soil bacterial strain by a competitive quantitative PCR method. Appl. Environ. Microbiol. 62, 3787–3793. Macdonald, C.A., Clark, I.M., Hirsch, P.R., Zhao, F.-J., McGrath, S.P., 2011. Development of a real-time PCR assay for detection and quantification of Rhizobium leguminosarum bacteria and discrimination between different biovars in zinc-contaminated soil. Appl. Environ. Microbiol. 77, 4626–4633. Mumy, K.L., Findlay, R.H., 2004. Convenient determination of DNA extraction efficiency using an external DNA recovery standard and quantitative–competitive PCR. J. Microbiol. Methods 57, 259–268. Muyzer, G., Waal, E.C.D., Uitterlinden, A.G., 1993. Profiling of complex microbial populations by denaturing gradient gel electrophoresis analysis of polymerase chain reaction-amplified genes coding for 16SrRNA. Appl. Environ. Microbiol. 59, 695–700. Okano, Y., Hristova, K.R., Leutenegger, C.M., Jackson, L.E., Denison, R.F., Gebreyesus, B., Lebauer, D., Scow, K.M., 2004. Application of real-time PCR to study effects of ammonium on population size of ammonia-oxidizing bacteria in soil. Appl. Environ. Microbiol. 70, 1008–1016. Okubara, P.A., Schroeder, K.L., Paulitz, T.C., 2005. Real-time polymerase chain reaction: applications to studies on soil-borne pathogens. Can. J. Plant. Pathol. 27, 300–313. Park, J.W., Crowley, D.E., 2005. Normalization of soil DNA extraction for accurate quantification of target genes by real-time PCR and DGGE. Biotechniques 38, 579–585. Pfaffl, M.W., Horgan, G.W., Dempfle, L., 2002. Relative expression software tool (REST©) for group-wise comparison and statistical analysis of native expression results in real-time PCR. Nucleic Acids Res. 30, e36. Rose, P., Harkin, J.M., Hickey, W.J., 2003. Competitive touchdown PCR for estimation of Escherichia coli DNA recovery in soil DNA extraction. J. Microbiol. Methods 52, 29–38. Schutter, M.E., Dick, R.P., 2000. Comparison of fatty acid methyl ester (FAME) methods for characterizing microbial communities. Soil Sci. Soc. Am. J. 64, 1659–1668. Sharma, S., Radl, V., Hai, B., Kloos, K., Fuka, M.M., Engel, M., 2007. Quantification of functional genes from procaryotes in soil by PCR. J. Microbiol. Methods 68, 445–452. Stubner, S., 2002. Enumeration of 16S rDNA of Desulfotomaculum lineage 1 in rice field soil by real-time PCR with SybrGreen™ detection. J. Microbiol. Methods 50, 155–164. Torsvik, V., Øvreås, L., 2002. Microbial diversity and function in soil: from genes to ecosystems. Curr. Opin. Microbiol. 5, 240–245. Tsai, Y.L., Park, M.J., Olson, B.H., 1991. Rapid method for direct extraction of mRNA from seeded soils. Appl. Environ. Microbiol. 57, 765–768. Tung, W.L., Chow, K.C., 1995. A modified medium for efficient electrotransformation of E. coli. Trends Genet. 11, 128–129. Vallette, F., Mege, E., Reis, A., Adesnik, M., 1989. Construction of mutant and chimeric genes using the polymerase chain reaction. Nucleic Acids Res. 17, 723–733. Weinbauer, M.G., Fritz, I., Wenderoth, D.F., Höfle, M.G., 2002. Simultaneous extraction from bacterioplankton of total RNA and DNA suitable for quantitative structure and function analyses. Appl. Environ. Microbiol. 68, 1082–1087. Zelles, L., 1999. Fatty acid patterns of phospholipids and lipopolysaccharides in the characterisation of microbial communities in soil: a review. Biol. Fertil. Soils 29, 111–129. Zeugin, J.A., Hartley, J.L., 1985. Ethanol precipitation of DNA. Focus, 7 (4). Bethesda Research Laboratories/Life Technologies, Inc., Maryland, U.S.A. Zhou, J., Bruns, M.A., Tiedje, J.M., 1996. DNA recovery from soils of diverse composition. Appl. Environ. Microbiol. 62, 316–322. Zogg, G.P., Zak, D.R., Ringelberg, D.B., MacDonald, N.W., Pregitzer, K.S., White, D.C., 1997. Compositional and functional shifts in microbial communities due to soil warming. Soil Sci. Soc. Am. J. 61, 475–481.
© Copyright 2026 Paperzz