ARTICLE Pattern-Mapped Multiple Detection of 11 Pathogenic Bacteria Using a 16S rDNA-Based Oligonucleotide Microarray Byeong Hee Hwang, Hyung Joon Cha National Research Laboratory of Molecular Biotechnology, Department of Chemical Engineering, Pohang University of Science and Technology, Pohang 790-784, Korea; telephone: þ82-54-279-2280; fax: þ82-54-279-2699; e-mail: [email protected] Received 26 October 2009; revision received 30 December 2009; accepted 11 January 2010 Published online 20 January 2010 in Wiley InterScience (www.interscience.wiley.com). DOI 10.1002/bit.22674 Introduction ABSTRACT: Pathogen detection is an important issue in human health due to the threats posed by severe communicable diseases. In the present study, to achieve efficient and accurate multiple detection of 11 selected pathogenic bacteria, we constructed a 16S rDNA oligonucleotide microarray containing doubly specific capture probes. Many target pathogens were specifically detected by the microarray with the aid of traditional perfect match-based analysis using our previously proposed two-dimensional visualization plot tool. However, some target species or subtypes were difficult to discriminate by perfect match analysis due to nonspecific binding of conserved 16S rDNA-derived capture probes with high sequence similarity. We noticed that the patterns of specific spots for each strain were somewhat different in the two-dimensional gradation plot. Therefore, to discriminate subtle differences between phylogenetically related pathogens, a pattern-mapping statistical model was established using an artificial neural network algorithm trained by experimental repeats. The oligonucleotide microarray system harboring doubly specific capture probes combined with the pattern-mapping analysis tool resulted in successful detection of all target pathogens including even subtypes of two closely related species showing strong nonspecific binding. Collectively, the results indicate that our novel combined system of a 16S rDNA-based DNA microarray and a pattern-mapping statistical analysis tool is a simple and effective method for detecting multiple pathogens. Biotechnol. Bioeng. 2010;106: 183–192. ß 2010 Wiley Periodicals, Inc. KEYWORDS: DNA chip; oligonucleotide microarray; 16S rDNA; pathogenic bacteria; multiple detection; pattern mapping Correspondence to: H.J. Cha Contract grant sponsor: Ministry for Flood, Agriculture, Forestry and Fisheries Contract grant number: 109187-3 Contract grant sponsor: Young-Woong Satellite Laboratory Grant Contract grant sponsor: Ministry of Education, Science and Technology (Brain Korea 21 Program) ß 2010 Wiley Periodicals, Inc. To date, numerous studies have investigated the detection and diagnosis of pathogens to prevent food-poisoning outbreaks. The well-known conventional methods have several limitations, such as culture dependency, long detection time (2–7 days), labor-intensive protocols, and difficulties in parallel strain detection (de Boer and Beumer, 1999). Therefore, molecular methods such as polymerase chain reaction (PCR), fluorescent in situ hybridization, immunoassay, and microarrays have been developed. In particular, the microarray is one of the most efficient methods owing to the possibility of parallel identification using several detection components including DNA (Call et al., 2003; Guschin et al., 1997; Schena et al., 1995). Short oligonucleotide probes are useful for pathogen detection because subtle discrimination of single-nucleotide polymorphisms is possible (Call, 2005; Yershov et al., 1996). Therefore, several studies have demonstrated the applicability of oligonucleotide arrays for environmental microbial detection (Eom et al., 2007; Guschin et al., 1997; Loy et al., 2005; Rudi et al., 2000; Sergeev et al., 2004; Wu et al., 2003). In particular, several microarrays based on 16S rDNA information have been used in phylogenetic discrimination (Chandler et al., 2003; Eom et al., 2007; Guschin et al., 1997; Liu et al., 2001; Peplies et al., 2006; Rudi et al., 2000). The use of 16S rDNA as a detection modality has several advantages such as an abundance of complete sequence information and the availability of reported specific capture probes and universal primers (Woese, 1987). Previously, we also proposed the use of a 16S rDNA-based oligonucleotide microarray for detection of seven pathogens using a single specific capture probe (from the first variable region) and a two-dimensional visualization strategy (Eom et al., 2007). However, most studies on 16S rDNA-based detection, including our previous research, showed inevitable nonspecific binding because 16S rDNA sequences offer very low sequence diversity; thus, difficulties arise in unambiguous Biotechnology and Bioengineering, Vol. 106, No. 2, June 1, 2010 183 discrimination of phylogenetically close bacteria or subspecies. In addition, the relatively low sensitivity of our previous system resulted from use of the statistical array format involving repeated redundant spots (Eom et al., 2007). After experimental 16S rDNA sequence analysis, we also found that sequence dissimilarity (percentage of mismatched sequence between capture probe and target) was the most important factor affecting probe specificity. In the present study, we constructed an oligonucleotide microarray system for detecting 11 selected pathogenic bacteria (pathogenic Escherichia coli, Escherichia coli H7:O157, Listeria monocytogenes, Salmonella enterica serotype Choleraesuis, Salmonella enterica serotype Enteritidis, Shigella dysenteriae, Staphylococcus aureus, Vibrio cholerae, Vibrio vulnificus, Vibrio parahaemolyticus, and Yersinia enterocolitica) by strategic optimal design and experimental selection of doubly specific capture probes from characteristic regions of total 16S rDNA sequence. To improve detection specificity, we considered sequence dissimilarity to be the most important design factor and used an artificial standard probe strategy, as in our previous report (Hwang and Cha, 2008). For easy, rapid, and accurate classification of phylogenetically close pathogens, we also developed a pattern-mapping analysis tool, featuring a probability estimation by an artificial neural network (ANN) algorithm, that could distinguish the spot pattern specific to each pathogen. Additional 16S rDNA sequences were downloaded from the RDP II database and the NCBI gene bank. Because 16S rDNA sequences show high similarity between closely related species, we employed a closest comparison method that checks species-specific regions by alignment of the closest two 16S rDNA sequences. To this end, we employed BioEdit software (Ibis Biosciences, Carlsbad, CA). Next, these regions were compared to all 16S rDNA sequences of target and target-related species. Based on the identified regions, capture probe candidates were designed with sequence dissimilarities of over 10–15%, and with similar melting temperatures, using Primer Premier 5 (Premier Biosoft International, Palo Alto, CA). Finally, capture probe candidates were inspected using the RDP II database and NCBI Blast searches. We designed 22 new specific capture probe candidates based on the above criteria. Next, the probes were chemically synthesized with a 50 amino linker modification to contain a six-atom spacer composed of ethylene glycol units between the oligonucleotide and amine (BioBasic, Ontario, Canada). After experimental selection of probe candidates (data not shown), we chose 16 specific capture probes from the newly designed candidates and 4 specific capture probes from previously designed molecules (Eom et al., 2007) (Table I). Positive control and artificial standard capture probes were as in our previous reports (Eom et al., 2007; Hwang and Cha, 2008) (Table I). Materials and Methods Format Design and Preparation of DNA Microarray Microbial Strains and Genomic DNA Isolation The 11 microbial strains are pathogenic E. coli (American Type Culture Collection (ATCC; Manassas, VA) 25922), E. coli O157:H7 (ATCC 29425), L. monocytogenes (ATCC 15313), S. enterica serotype Choleraesuis (ATCC 13311), S. enterica serotype Enteritidis (Institute for Fermentation (IFO; Osaka, Japan) 3313), S. dysenteriae (ATCC 13313), S. aureus (ATCC 6538), V. cholerae (ATCC 14035), V. vulnificus (ATCC 27562), V. parahaemolyticus (ATCC 17802), and Y. enterocolitica (ATCC 23715). All species were cultured in nutrient broth (Difco, Kansas, MO) at 30 or 378C, except V. vulnificus in trypticase soy broth (Difco) with 1% NaCl at 308C and two E. coli strains in LB medium at 378C. Genomic DNA was extracted and purified using the DNeasy1 Tissue Kit (Qiagen GmbH, Hilden, Germany). Purified bacterial genomic DNA was used as template for PCR, followed by fluorescence labeling of each target. DNA concentrations and purities were determined with a UV/vis spectrometer (Mecasys, Daejeon, Korea). Design and Synthesis of Capture Probes Eleven 16S rDNA sequences were directly obtained by sequencing from the chromosome of each target strain. 184 Biotechnology and Bioengineering, Vol. 106, No. 2, June 1, 2010 We designed a capture probe arrangement (Fig. 1) in a manner similar to that of our previous report (Hwang and Cha, 2008). Black, gray, and white spots represent artificial standard, positive control, and specific capture probes, respectively. Four repeat spots of each specific capture probe were surrounded by five replicate spots of artificial standard capture probe, in a rectangular shape. Consequently, each oligonucleotide microarray contained 22 5 spots of the artificial standard capture probe, 1 4 spots of the positive control capture probe on the first line, and 1 4 spots of each specific capture probe (to include a total of 20 probes). Thus, each oligonucleotide chip contained a double array of sets, with 194 spots in all. The gap between spots within an array set was 566 mm, sufficient to minimize cross-talk. The total dimension of the spotted area was 3.2 mm 16.8 mm, and the whole array was duplicated at intervals of 19.3 mm to permit two independent experiments to be carried out using a single chip. Each NH2-modified oligonucleotide (20 mM) was dissolved in 3X SSC spotting buffer (450 mM NaCl, 3 mM tri-sodium citrate, and 1.5 M N,N,N-trimethyl glycine [betaine; Sigma, St. Louis, MO], pH 6.6, final c ¼ 1.5 M). The capture probe solutions were spotted onto aldehydecoated slides (Super Aldehyde; Telechem International, Sunnyvale, CA) using a Microsys 5100 microarrayer (Cartesian Technologies, Ann Arbor, MI) with the Chip Table I. Oligonucleotide sequences employed as capture probes and standard target, and their thermodynamic properties.a Probe name Antisense sequences (50 –30 , 30 -amine-spacer, spacer:C6) Length (bp) Tm (8C) Rating RSD Species All bacteria (positive control) Escherichia coli Escherichia coli O157:H7 Listeria monocytogenes Salmonella enterica serotype Cholerasuis Salmonella enterica serotype Enteritidis Shigella dysenteriae Staphylococcus aureus Vibrio cholerae Vibrio parahaemolyticus Vibrio vulnificus Yersinia enterocolitica Artificial standard Artificial standard target POCO ESCO ESCOO1 ESCOO2 LIMO1 LIMO2 SACH1 SACH2 SAEN SHDY1 SHDY2b STAU1 STAU2 VICH1 VICH2 VIPA1 VIPA2b VIVU1b VIVU2b YEEN1 YEEN2 ARST ARSTT GCCGCCAGCGTTCAATCTGA GAAGGCACATTCTCATCTCTGAAAAC CAGCAAAGAAGCAAGCTTCTTCCT ACTCGTCAGCAAAGAAGCAAGCT GCATGCGCCACACTTTATCATT CCATCTTTCAAAAGCGTGGCAT TGCTGCGGTTATTAACCACAACA GACTCAAGCCTGCCAGTTTCGA AGGCACAAATCCATCTCTGGATTC AGGCACCCTCGTATCTCTACAAGG CCGCCACTCGTCAGCAAAGCA AACTAGCTAATGCAGCGCGGAT AGATGTGCACAGTTACTTACACATATGTTCT CCTCTACCGGGCAATTTCC CTCTACCGGGCAATTTCCCA CCCGAAGGTTCAGATAACTCGTTT CGTTATCGTTCCCCGAAGTTCAGAT AAACAAGTTTCTCTGTGCTGCCGC TGAGCCGAAGCTATCATGCGG GTTATTGGCCTTCCTCCTCGCT TGCGAGTAACGTCAATCCAACAA CCCAAGGGAACCCAAGGGAAA TTTCCCTTGGGTTCCCTTGGG-Alexa flour 647 20 26 24 23 22 22 23 22 24 24 21 22 31 19 20 24 25 24 21 22 23 21 21 66.5 62.9 63.8 62.7 62.6 64.0 63.2 64.2 63.5 63.1 68.9 63.4 63.0 63.5 62.3 63.1 66.9 66.9 65.8 63.8 63.1 66.8 66.8 91 91 73 86 80 90 86 87 74 89 100 81 74 79 79 88 93 91 83 84 89 85 85 1.9 5.9 4.1 1.6 2.1 8.6 4.5 3.7 33.3 7.9 10.8 1.6 3.6 4.0 4.0 8.4 10.2 4.0 7.2 0.6 2.4 8.3 a All thermodynamic properties were calculated by Primer Premier. From the previously designed capture probes (Eom et al., 2007). b Figure 1. Schematic diagram of the repeated array format. The layout for the specific capture probes of each target strain is indicated above. Hwang and Cha: Pattern-Mapped Multiple Pathogen Detection Chip Biotechnology and Bioengineering 185 Maker 2 pin (Telechem International) at 74% humidity in a class 10,000 clean room. After spotting and incubation overnight in low (30%) humidity conditions, microarray slides were incubated in a solution containing 1.3 g NaBH4 dissolved in 375 mL phosphate-buffered saline (PBS; pH 7.4) and 125 mL ethanol for 5 min, followed by washing twice in 0.2% (w/v) sodium dodecyl sulfate (SDS) for 1 min each time, and twice with ddH2O. Slides were dried by centrifugation at 1,500 rpm for 3 min, and stored at room temperature under vacuum until further use. Target Preparation and Hybridization 16S rDNA sequences were amplified as targets by asymmetric PCR at a 50:1 primer ratio. The universal PCR primer set (16S-43F: 50 -TGGCTCAGATTGAACGCTGGCGGC-30 16S-1392R: 50 -ACGGGCGGTGTGTAC-30 ; Genotech, Daejeon, Korea) for each strain was based on the first and ninth conserved regions. The PCR mixture was composed of 2U Taq polymerase (Takara, Otsu, Japan), 3.5 mM forward universal primer, 0.07 mM reverse universal primer, 0.5 mM dATP, 0.5 mM dCTP, 0.5 mM dGTP, 0.3 mM dTTP, 0.15 mM amine-modified dUTP, and 1X Taq buffer. PCR was performed in a DNA thermal cycler (Eppendorf, Hamburg, Germany) under the following conditions: 958C for 5 min, 30 cycles of 958C for 1 min, 608C for 1 min, and 728C for 1 min 24 s, followed by 728C for 5 min. The QIAquick1 PCR kit (Qiagen GmbH) was employed for purification of amine-modified amplicons, followed by ethanol precipitation. Next, labeling was performed using the ARESTM Alexa Fluor1 647 DNA labeling kit (Molecular Probes, Eugene, OR). Labeled target DNA samples included both conserved and variable (first to ninth) regions ranging from 1,385 to 1,400 bp in length. For multiple detection of two pathogens, a chromosome mixture at a 1:1 ratio was used as PCR template. Alexa Fluor1 647-labeled artificial standard target was the same as in our previous reports (Eom et al., 2007; Hwang and Cha, 2008) (Table I). The fabricated oligonucleotide microarray was prehybridized in buffer containing 3X SSC solution (450 mM NaCl, 3 mM tri-sodium citrate, pH 7.0) with 1% (w/v) bovine serum albumin (BSA; Sigma) and 0.1% (w/v) SDS for 30 min at 50, 55, or 608C. After washing four times (0.2% SDS, w/v, twice, distilled water twice), the array was dried by centrifugation at 1,500 rpm for 3 min. The array was hybridized with a DNA mixture of 9 mL PCR-amplified target (50–150 mg/mL) and 1 mL artificial standard target (1 mM) in a fresh 10 mL aliquot of 2X hybridization solution (6X SSC, 0.2% [w/v] SDS, 0.4% [w/v] BSA) under a supported coverslip at the appropriate temperature for 12 h. Next, the array was washed three times, first with lowstringency buffer (1X SSC, 0.2% [w/v] SDS) preheated to the hybridization temperature for 3 min, second with highstringency wash buffer (0.1X SSC, 0.2% [w/v] SDS) for 3 min at room temperature, and finally with 0.1X SSC for 3 min at room temperature. Next, the hybridized microarray was dried by centrifugation at 1,500 rpm for 3 min. 186 Biotechnology and Bioengineering, Vol. 106, No. 2, June 1, 2010 Fluorescence Intensity Scanning and Data Analysis A commercial confocal laser scanner (ScanArray Lite; GSI Lumonics, Wilmington, MA) and quantitative microarray analysis software (QuantArray; GSI Lumonics) were used for image acquisition and fluorescence intensity analysis. Scanning parameters were set at 95% of laser intensity and 85% of photomultiplier tube gain. Fluorescence intensities of spots in scanned images were numbered by the analysis software. Acquired experimental raw intensity data were converted into two-dimensional plots for standard and specific spots, with a gray gradation, using the Matlab m-file as in our previous work (Eom et al., 2007; Hwang and Cha, 2008). For classification of spot patterns peculiar to each target strain, a pattern-mapping statistical analysis program was constructed using an ANN algorithm. ANN is consisted of many processing elements (PEs) which can process data by summarizing and transforming them using a series of mathematical functions like neurons in the brain (Groth, 1999). PEs are interconnected and each connections have the weight which means connection strength. Therefore, iterative training continue to change the weight of connections to fit the data until the outcome values by ANN model match the known outcome values within a specified accuracy level. With the acquired numerical data, all specific spots were normalized by summation of 20 specific averages. Next, the 20 specific normalized spots were assigned as an input set to one of 11 pre-specified classes, as an output. The pattern-mapping analysis tool was programmed by Visual Cþþ. At least five independent experimental repeats for each pathogen were employed to train the pattern-mapping model. In addition, extra experimental sets were used for verification of the model, with an error rate under 5%. Results Design and Assessment of Oligonucleotide Probes In the present study, we used sequence dissimilarity greater than 10–15% as main considering factor for probe design to ensure specificity. Full 16S rDNA sequences were compared only between the two phylogenetically closest species. By such direct sequence comparison, we easily obtained specific sequences for any two species, with two or more mismatches. In addition, the specific sequences were verified to be more then 10–15% dissimilar to all aligned sequences of the 11 target pathogens. Next, the thermodynamic properties of several probe candidates with the unique signature sequences were calculated (Table I). A similar melting temperature was regarded as the most important thermodynamic property to ensure similar stringencies during hybridization. The second probe design principle was match analysis using the RDP II and NCBI databases. Next, two or three candidate probes for each target bacterium were selected, in order of specificity. Finally, after experimental selection of probe candidates from the first to the ninth variable regions of the 16S rDNA sequence, 20 specific capture probes with high specificity were selected for 11 target pathogens (Table I). While we obtained two specific capture probes for nine pathogens, only one capture probe was chosen for each of E. coli and S. enterica serotype Enteritidis due to the limited specific region of 16S rDNA. The positive control capture probe was based on the first conserved region sequence, to check for successful PCR labeling (Eom et al., 2007). The artificial standard capture probe and target were designed to have no significant similarity with any bacterial or eukaryote sequence in the NCBI database for verification of regional and chip-to-chip variations as well as quantitative analysis (Hwang and Cha, 2008). Analyses of Artificial Standard and Control Spots In the present work, we used asymmetric PCR and an artificial standard as the reference control (Hwang and Cha, 2008). Asymmetric PCR amplifies more sense target sequence than antisense, and each product length was close to 1,400 bp and contained variable sequences from the first to the ninth regions of 16S rDNA. After assessment of the concentration effect (0- to 10,000-fold dilution) of the artificial standard target (final concentrations from 5 mM to 0.5 nM were used), we found that the fluorescence intensity of artificial standard spots was saturated when the artificial target was 0- to 10-fold diluted (thus, at 5 and 0.5 mM) (data not shown). Therefore, the artificial standard target was used at a 100-fold (50 nM) dilution in subsequent experiments. The hybridization temperature was set at 508C owing to low sensitivity at 55 and 608C (data not shown). At first glance, all labeled positive control and artificial standard spots were clearly detected after hybridization (Fig. 2A). The uniformly strong fluorescence was observed on the positive control spots (first four spots in Fig. 2A). Because it was difficult to distinguish intensity differences between spots on scanned raw images, the intensities of artificial standard spots were represented by a two-dimensional visualization (Eom et al., 2007; Hwang and Cha, 2008) using a gray gradation plot (Fig. 2B). In this system, white indicates the highest spot intensity and black the lowest. With this technique, almost all artificial standard spot intensities appeared uniform except for those of S. enterica serotype Enteritidis and V. vulnificus (Fig. 2B-e and B-j). gradation (Fig. 2C). Generally, analysis of pathogen detection is based on a perfect match between the target and specific capture probes (Keramas et al., 2003; Warsen et al., 2004). We found that our microarray system clearly discriminated seven strains among 11 target pathogens (Fig. 2C), showing perfect matches on specific spots. However, it was impossible to discriminate between S. enterica serotypes Choleraesuis and Enteritidis (Fig. 2C-d vs. C-e). In addition, with the two E. coli pathogens, strong nonspecific hybridization to Shigella spots was observed, even though specific spots for each E. coli subspecies were also seen (Fig. 2C-a vs. C-b). Nonspecific binding to Shigella spots was also noted when the two Salmonella serotypes were tested, although the nonspecific intensities were not strong (Fig. 2C-d vs. C-e). We tested the sensitivity of our microarray system using S. aureus as a representative strain. In the range of 1–10 cell numbers, noticeable specific spots were not shown (Fig. 3A-a and A-b) while double specific spots (STAU1 and STAU2) were clearly observed for 102–104 cells (Fig. 3A-c to A-e). When we plotted fluorescence intensity values of two specific spots with cell numbers, we confirmed that low cell numbers (1–10 cells) had almost similar intensity values with negative control spots, but the intensity values of the high cell number samples (102–104 cells) were all clearly distinguishable (Fig. 3B). Interestingly, almost linear correlations were revealed for both specific spots of STAU1 and STAU2 in the range of 102–104 cell numbers. Thus, we determined that the sensitivity of the specific capture probes is around 10–102 cells, equivalent to 50– 500 fg of pathogen chromosome, based on the presence of 4.5 1015 g of DNA in a single bacterial cell (Call et al., 2003). The relative standard deviation (RSD) of each specific capture probe was calculated as the standard deviation for the average number of perfect matches multiplied by 100 in at least five repeated independent experiments. The factors affecting RSD values are spot-to-spot or chip-to-chip variation during chip making, labeling, hybridization, and washing steps. In bioanalytical assays, the acceptable upper limit for the RSD is generally 10–25 and a low RSD indicates that each experiment yielded accurate and reliable data (Braggio et al., 1996; Findlay et al., 2000). Among the specific capture probes, that for S. enterica serotype Enteritidis had a high RSD of 33.3 resulting from very low and fluctuated intensity of specific matching (Table I). All other probes showed RSD values less than 11%, demonstrating appropriate precision and reliability. Analysis of Specific Spots Based on Perfect Matches With the specific spots, each line of four spots represents specific capturing of individual pathogen DNA and perfect match positions are indicated by white boxes in the scanned raw images (Fig. 2A). As with the artificial standard spots discussed above, it was difficult to distinguish intensity differences between spots on the simple scanned raw images, so the intensities of specific spots were represented by a gray Statistical Analysis of Detection Results Based on Pattern Mapping We noticed that the patterns of fluorescent spots varied somewhat according to the target pathogen even with the two Salmonella serotypes and the two E. coli subspecies (Fig. 2C). Therefore, to classify a unique spot pattern for Hwang and Cha: Pattern-Mapped Multiple Pathogen Detection Chip Biotechnology and Bioengineering 187 Figure 2. A: Raw hybridization data obtained when each asymmetric amplified target was exposed to the probe array in hybridization buffer. Each rectangular box indicates a perfect match. Two-dimensional visualization plots for (B) the artificial standard and (C) specific spots. Intensities are represented by gray gradations. The maximum intensity is white, whereas the minimum intensity is black. (a) E. coli, (b) E. coli O157:H7, (c) L. monocytogenes, (d) S. enterica serotype Choleraesuis, (e) S. enterica serotype Enteritidis, (f) S. dysenteriae, (g) S. aureus, (h) V. cholerae, (i) V. vulnificus, (j) V. parahaemolyticus, and (k) Y. enterocolitica. Figure 3. A: Two-dimensional visualization plots for specific spots according to S. aureus cell numbers. (a) 1 cell, (b) 10 cells, (c) 102 cells, (d) 103 cells, and (e) 104 cells. B: Semi-log scattered plot for fluorescence intensity values of two specific spots according to cell numbers. 188 Biotechnology and Bioengineering, Vol. 106, No. 2, June 1, 2010 Figure 4. Predicted detection of each target pathogen using the pattern-mapping statistical analysis program with an artificial neural network algorithm. The upper bar graph is the average of the 20 spot intensities, with standard deviations, for each pathogen, and the lower bar graph shows the detection probabilities for the 11 pathogens. each target strain, we used statistical analysis involving a pattern-mapping strategy. We constructed the patternmapping model using an ANN algorithm (Fig. 4). The pattern-mapping analysis model was developed using experimental datasets from more than five independent repeat tests for each pathogen. The graphic user interface for the pattern-mapping analysis program was designed to show detection probability (Fig. 4). Numerical data on the 20 specific spot intensities from scanned raw images were directly inserted into the model as inputs and the analyzed outputs were plotted as two two-dimensional bar graphs; 20-spot intensity averages with standard deviations in the upper plot and the detection probabilities for the 11 species in the lower plot. By this method, all pathogens were clearly discriminated by signature pattern analysis (Fig. 4). Importantly, the two Salmonella serotypes were clearly distinguished. The two E. coli subspecies were also clearly discriminated by pattern mapping. The probability of specific pathogen detection was almost 100% for all 11 pathogens, and the probability of nonspecific detection was less than 10%. We checked interference of our microarray system using nontarget bacterium (data not shown). In results, Bacillus cereus, different species with 11 target pathogens, was not detected in any specific spots and pattern-mapping analysis result. Next, to evaluate the possibility of simultaneous pathogen detection using pattern-mapping analysis, we performed detection experiments employing two-chromosome mixtures. As shown in Figure 5, the mixed species pairs of E. coli and E. coli O157:H7, S. aureus and V. parahaemolyticus, and V. cholerae and V. vulnificus were clearly detected. Detection probabilities for E. coli and E. coli O157:H7, S. aureus and V. parahaemolyticus, and V. cholerae and V. vulnificus were 1 and 0.999 (Fig. 5A-d), 1 and 0.523 (Fig. 5B-d), and 1 and 0.60 (Fig. 5C-d), respectively. Discussion Most microarrays using the 16S rDNA detection principle suffer from nonspecific binding caused by low sequence diversity (Chandler et al., 2003; Eom et al., 2007; Guschin et al., 1997; Liu et al., 2001; Peplies et al., 2006; Rudi et al., 2000). Thus, several specific genes have been used as Hwang and Cha: Pattern-Mapped Multiple Pathogen Detection Chip Biotechnology and Bioengineering 189 Figure 5. Simultaneous detection of (A) E. coli and E. coli O157:H7, (B) S. aureus and V. parahaemolyticus, and (C) V. cholerae and V. vulnificus. (a) Raw hybridization data. Two-dimensional visualization plots for (b) the artificial standard and (c) specific spots. (d) Predicted detection of the two target pathogens using the pattern-mapping program. detection principles for multiple target pathogens to achieve accuracy in the absence of nonspecific hybridization (Bodrossy et al., 2003; Volokhov et al., 2003; Wilson et al., 2002). However, multiple detection principles require complicated preparation (such as multiplex PCR) of multiple target DNAs, limiting the number of targets available for efficient simultaneous detection, and are associated with difficulties in linear amplification of targets and subsequent quantitative analysis. On the contrary, a single detection principle, such as that based on 16S rDNA, can yield simple and easy detection of multiple target pathogens. 16S rDNA also has further advantages such as resolution of species discrimination level, involvement of phylogenetic information, and detection feasibility of unknown phylogenetically close species. To reduce nonspecific binding in 16S rDNA-based microarray systems, various strategies have been used for the design of specific capture probes (Loy et al., 2005; Peplies et al., 2003; Reyes-Lopez et al., 2003). Reyes-Lopez et al. (2003) used multiple alignment for probe design from within the two conserved regions flanking a highly variable region. Peplies et al. (2003) achieved specific probe design by considering secondary structures in 16S rRNA that could affect hybridization between capture probe and target. Loy et al. (2005) designed 16S rDNA probes using melting temperature (Tm) and DG values as probe design criteria. Previously, we have used multiple alignment, Tm, DG values, and secondary structure when considering the design of specific capture probes (Eom et al., 2007). In the present study, we were of the view that sequence dissimilarity greater than 10–15% was the most important probe characteristic 190 Biotechnology and Bioengineering, Vol. 106, No. 2, June 1, 2010 ensuring specificity as described in previous studies (Call et al., 2003; Eom et al., 2007). After the present experimental analyses, we also confirmed that sequence dissimilarity was the most important factor affecting probe specificity. To greatly reduce the time and effort traditionally required for the design of specific capture probes for target species sharing similar sequences, we directly compared full 16S rDNA sequences between the two phylogenetically closest species. By such design strategy, we easily obtained specific sequences for any two species, with two or more mismatches. Thus, our design strategy efficiently found specific capture probes for target microorganisms. However, although we attempted to design two or more specific capture probes for each target pathogen to improve detection specificity and precision, we chose only one specific capture probe for each of E. coli and S. enterica serotype Enteritidis because, except for one variable region, these strains share very similar sequences with their close relatives (Table I). Uniformity of artificial standard spot intensity indicates reproducibility of the hybridization and washing steps. Intensity differences in artificial standard spots between chips can result from chip-to-chip variation. In addition, differences can be due to higher fluorescence intensity at initial loading of the hybridization mixture onto the slide, heterogeneous hybridization, and/or irregular spot size. Therefore, using this artificial standard probe strategy (Hwang and Cha, 2008), regional and chip-to-chip variations can be verified, and quantitative analysis may also be possible. From the uniformly bright intensities of the four positive control spots, it was clear that PCR labeling of the target probe was successful and that hybridization data would be reliable. Usually, analysis based on the scanned raw images is used to distinguish intensity differences. However, because it was difficult to distinguish subtle intensity differences between spots on scanned raw images, here, we used a two-dimensional visualization analysis with gray gradation (Eom et al., 2007, Hwang and Cha, 2008). The gray gradation visualization tool enables facile and minute discrimination between spot intensities, through enlargement of the intensity scale. Our microarray system clearly discriminated 7 strains among 11 target pathogens (Fig. 2C). Thus, the specificities of the capture probes used in the present work were significantly better than those achieved with previous probes (Eom et al., 2007) due to the advanced probe design and selection strategies. However, because S. enterica serotypes have very conservative 16S rDNA sequences, it was impossible to discriminate between two serotypes using simple observation of specific spots (Fig. 2C). With the two E. coli pathogens, strong nonspecific hybridization to Shigella spots was also observed. We surmise that the nonspecific fluorescence of Shigella spots may be attributed to the low (4.8%) level of sequence dissimilarity afforded by a single base pair mismatch between S. dysenteriae and E. coli or Salmonella serotypes, and/or because the Shigella-specific probe had the highest melting temperature of 68.98C (GC content of 61.9%) among the 20 specific capture probes (Table I). Thus, we determined that some pathogens could not be clearly discriminated by perfect match analysis when a conserved sequence such as 16S rDNA was used for detection. Therefore, to provide clear discrimination between very closely related pathogens, multiple detection principles or other analytical strategies may be required. Based on the observation of somewhat different fluorescent spot patterns according to the target pathogen even with the two S. enterica serotypes and the two E. coli subspecies (Fig. 2C), we constructed the statistical patternmapping model to classify a unique spot pattern for each target strain. As a well-known statistical decision-making tool, the ANN has been used to model nonlinear relationships between inputs and outputs or to discover patterns in data (Jain et al., 1996, 2000). This algorithm has mainly been applied to cancer diagnosis (Khan et al., 2001; Wei et al., 2004); variable spots are detected in a two-color comparison using a cDNA microarray and several cancer genotypes have been discriminated by clear differences in spot patterns. Importantly, the two Salmonella serotypes and the two E. coli subspecies were clearly distinguished by pattern mapping (Fig. 4); this had not been possible by perfect match analysis (Fig. 2). Thus, our pattern-mapping statistical analysis strategy successfully provided clear discrimination between very close species or subspecies, as detected by a microarray system using conservative 16S rDNA information as the sole detection principle. From the evaluation of simultaneous pathogen detection possibility using pattern-mapping tool, we found that detection probabilities of second target strains were relatively lower than first target strains (Fig. 5). This result showed that the pattern-mapping model developed using only single pathogen detection data might not be fully sufficient for more reliable detection of mixed species. Therefore, to achieve more accurate mixed detection with a high probability, it might be necessary to use multiple pathogen detection data for training of the pattern-mapping model. The sensitivity value of our microarray system was close to the 10 cells reported to be the lower detection limit of a similar method using a PCR-amplified fluorescence-labeled microarray (Call, 2005) and was much better than the 103– 104 cells acquired in our previous study (Eom et al., 2007). The sensitivity improvement might result from the lower hybridization temperature and/or greatly reduced repeat levels in specific capture spots. The detectable cell number is close to the infectious dose (10 cells) of certain pathogens such as S. dysenteriae and V. vulnificus (Kothary and Babu, 2001). In conclusion, we designed and generated a 16S rDNA information-based oligonucleotide microarray system using doubly specific capture probes for efficient and accurate detection of 11 selected pathogenic bacteria. Many target pathogens were clearly differentiated by perfect match analysis using two-dimensional visualization plots for standard and specific spots. However, some phylogenetically close strains, including E. coli subspecies and Salmonella serotypes, showed high nonspecific binding due to conserved 16S rDNA sequences, resulting in difficulties in detection based on only perfect match analysis. As we observed that spot patterns were somewhat different according to the target pathogen used (even between phylogenetically closely related strains), we employed a pattern-mapping statistical analysis strategy using an ANN model. After development of the pattern-mapping model using at least five sets of independently obtained data for each pathogen, all target pathogens were easily and clearly discriminated. Thus, the proposed novel system combining a 16S rDNA-based microarray and pattern-mapping analysis can reliably classify species and even some subtypes of target pathogens easily, rapidly, accurately, and simultaneously. In addition, this total solution system can be successfully expanded using the easy ‘‘add-up’’ concept to detect many other microorganisms in foods or clinical environments. This work was supported by Technology Development program for Agriculture and Forestry (#109187-3) from the Ministry for Flood, Agriculture, Forestry and Fisheries, Young-Woong Satellite Laboratory grant, and the Brain Korea 21 program from the Ministry of Education, Science and Technology, Korea. We thank Ms. Jae-Won Lee (POSTECH) and Mr. Jung-Soo Lee (Young-Woong) for help with pathogenic bacteria preparation and Dr. Jin-Hyun Park (P&I Consulting) for help with microarray data analysis. References Bodrossy L, Stralis-Pavese N, Murrell JC, Radajewski S, Weilharter A, Sessitsch A. 2003. Development and validation of a diagnostic microbial microarray for methanotrophs. Environ Microbiol 5:566–582. Hwang and Cha: Pattern-Mapped Multiple Pathogen Detection Chip Biotechnology and Bioengineering 191 Braggio S, Barnaby RJ, Grossi P, Cugola M. 1996. A strategy for validation of bioanalytical methods. J Pharm Biomed Anal 14:375–588. Call DR. 2005. Challenges and opportunities for pathogen detection using DNA microarrays. Crit Rev Microbiol 31:91–99. Call DR, Borucki MK, Loge FJ. 2003. Detection of bacterial pathogens in environmental samples using DNA microarrays. J Microbiol Methods 53:235–243. Chandler DP, Newton GJ, Small JA, Daly DS. 2003. Sequence versus structure for the direct detection of 16S rRNA on planar oligonucleotide microarrays. Appl Environ Microbiol 69:2950–2958. de Boer E, Beumer RR. 1999. Methodology for detection and typing of foodborne microorganisms. Int J Food Microbiol 50:119–130. Eom HS, Hwang BH, Kim DH, Lee IB, Kim YH, Cha HJ. 2007. Multiple detection of food-borne pathogenic bacteria using a novel 16S rDNAbased oligonucleotide signature chip. Biosens Bioelectron 22:845–853. Findlay JW, Smith WC, Lee JW, Nordblom GD, Das I, DeSilva BS, Khan MN, Bowsher RR. 2000. Validation of immunoassays for bioanalysis: A pharmaceutical industry perspective. J Pharm Biomed Anal 21:1249– 1273. Groth R. 1999. Data mining: Building competitive advantage. Upper Saddle River: Prentice Hall, Inc. Guschin DY, Mobarry BK, Proudnikov D, Stahl DA, Rittmann BE, Mirzabekov AD. 1997. Oligonucleotide microchips as genosensors for determinative and environmental studies in microbiology. Appl Environ Microbiol 63:2397–2402. Hwang BH, Cha HJ. 2008. Quantitative oligonucleotide microarray data analysis with an artificial standard probe strategy. Biosens Bioelectron 23:1738–1744. Jain AK, Duin RPW, Mao J. 2000. Statistical pattern recognition: A review. IEEE Trans Pattern Anal Machine Intell 22:4–37. Jain AK, Mao J, Mohiuddin KM. 1996. Artificial neural networks: A tutorial. Computer 29:31–44. Keramas G, Bang DD, Lund M, Madsen M, Rasmussen SE, Bunkenborg H, Telleman P, Christensen CB. 2003. Development of a sensitive DNA microarray suitable for rapid detection of Campylobacter spp. Mol Cell Probes 17:187–196. Khan J, Wei JS, Ringner M, Saal LH, Ladanyi M, Westermann F, Berthold F, Schwab M, Antonescu CR, Peterson C, Meltzer PS. 2001. Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks. Nat Med 7:673–679. Kothary MH, Babu US. 2001. Infectious dose of foodborne pathogens in volunteers: A review. J Food Saf 21:49–73. Liu WT, Mirzabekov AD, Stahl DA. 2001. Optimization of an oligonucleotide microchip for microbial identification studies: A non-equilibrium dissociation approach. Environ Microbiol 3:619–629. Loy A, Schulz C, Lucker S, Schopfer-Wendels A, Stoecker K, Baranyi C, Lehner A, Wagner M. 2005. 16S rRNA gene-based oligonucleotide 192 Biotechnology and Bioengineering, Vol. 106, No. 2, June 1, 2010 microarray for environmental monitoring of the betaproteobacterial order ‘‘Rhodocyclales’’. Appl Environ Microbiol 71:1373–1386. Peplies J, Glockner FO, Amann R. 2003. Optimization strategies for DNA microarray-based detection of bacteria with 16S rRNAtargeting oligonucleotide probes. Appl Environ Microbiol 69:1397– 1407. Peplies J, Lachmund C, Glockner FO, Manz W. 2006. A DNA microarray platform based on direct detection of rRNA for characterization of freshwater sediment-related prokaryotic communities. Appl Environ Microbiol 72:4829–4838. Reyes-Lopez MA, Mendez-Tenorio A, Maldonado-Rodriguez R, Doktycz MJ, Fleming JT, Beattie KL. 2003. Fingerprinting of prokaryotic 16S rRNA genes using oligodeoxyribonucleotide microarrays and virtual hybridization. Nucleic Acids Res 31:779–789. Rudi K, Skulberg OM, Skulberg R, Jakobsen KS. 2000. Application of sequence-specific labeled 16S rRNA gene oligonucleotide probes for genetic profiling of cyanobacterial abundance and diversity by array hybridization. Appl Environ Microbiol 66:4004–4011. Schena M, Shalon D, Davis RW, Brown PO. 1995. Quantitative monitoring of gene expression patterns with a complementary DNA microarray. Science 270:467–470. Sergeev N, Distler M, Courtney S, Al-Khaldi SF, Volokhov D, Chizhikov V, Rasooly A. 2004. Multipathogen oligonucleotide microarray for environmental and biodefense applications. Biosens Bioelectron 20:684–698. Volokhov D, Chizhikov V, Chumakov K, Rasooly A. 2003. Microarraybased identification of thermophilic Campylobacter jejuni, C. coli, C. lari, and C. upsaliensis. J Clin Microbiol 41:4071–4080. Warsen AE, Krug MJ, LaFrentz S, Stanek DR, Loge FJ, Call DR. 2004. Simultaneous discrimination between 15 fish pathogens by using 16S ribosomal DNA PCR and DNA microarrays. Appl Environ Microbiol 70:4216–4221. Wei JS, Greer BT, Westermann F, Steinberg SM, Son CG, Chen QR, Whiteford CC, Bilke S, Krasnoselsky AL, Cenacchi N, Catchpoole D, Berthold F, Schwab M, Khan J. 2004. Prediction of clinical outcome using gene expression profiling and artificial neural networks for patients with neuroblastoma. Cancer Res 64:6883–6891. Wilson WJ, Strout CL, DeSantis TZ, Stilwell JL, Carrano AV, Andersen GL. 2002. Sequence-specific identification of 18 pathogenic microorganisms using microarray technology. Mol Cell Probes 16:119–127. Woese CR. 1987. Bacterial evolution. Microbiol Rev 51:221–271. Wu CF, Valdes JJ, Bentley WE, Sekowski JW. 2003. DNA microarray for discrimination between pathogenic 0157:H7 EDL933 and non-pathogenic Escherichia coli strains. Biosens Bioelectron 19:1–8. Yershov G, Barsky V, Belgovskiy A, Kirillov E, Kreindlin E, Ivanov I, Parinov S, Guschin D, Drobishev A, Dubiley S, Mirzabekov A. 1996. DNA analysis and diagnostics on oligonucleotide microchips. Proc Natl Acad Sci USA 93:4913–4918.
© Copyright 2026 Paperzz