Mol. Cells, Vol. 18, No. 1, pp. 53-62 M olecules and Cells / KSMCB 2004 Analysis of the Root Nodule-enhanced Transcriptome in Soybean Hyoungseok Lee, Cheol-Goo Hur1, Chang Jae Oh, Ho Bang Kim, Sun-Yong Park1, and Chung Sun An* School of Biological Sciences, Seoul National University, Seoul 151-742, Korea; 1 Korea Research Institute Bioscience and Biotechnology, Genome Research Center, Daejeon 305-333, Korea. (Received March 5, 2004; Accepted May 25, 2004) For high throughput screening of root nodule-enhanced genes, cDNA libraries specific for three different developmental stages of soybean root nodules were constructed after inoculation with Bradyrhizobium japonicum USDA110. 5,469 cDNA clones were sequenced and grouped into 2,511 non-redundant (nr) ESTs consisting of 769 contigs and 1,742 singletons. Using similarity searches against several public databases we constructed a functional classification of the ESTs into root nodule-specific nodulin genes, stress-responsive genes and genes related to carbon and nitrogen metabolism. We also constructed a cDNA microarray with 382 selected clones that appeared to be up-regulated in the root nodule. Using the microarray we compared the transcript levels of uninfected roots and root nodules from four developmental stages. We identified 81 genes that were differentially expressed, and grouped them into seven clusters according to the similarity of their expression profiles, using a hierarchical clustering algorithm. Clusters 1, 2, 3, and 6, comprised of 58 genes, showed root nodule-enhanced expression. The information from this study will be used to analyze the roles of root nodule-specific genes and signaling pathways during root nodule development. Keywords: EST; Microarray; Root Nodule; Soybean. Introduction Nitrogen-fixing nodules are formed on the roots of legumes as a result of infection by the genus Rhizobium. The establishment of a root nodule and its subsequent development involve complex interactions between the two organisms resulting in structural and biochemical changes * To whom correspondence should be addressed. Tel: 82-2-880-6678; Fax: 82-2-872-6881 E-mail: [email protected] in both partners (Beringer et al., 1979). Several events are involved in this plant-bacterium interaction, including invasion of the root hair followed by development of the infection thread, release of bacteria into cortical cells, development of the peribacteroid membrane, and differentiation of the bacteria into bacteroids. A group of host gene products, termed nodulins, play essential roles in nodule formation and functional symbiosis (Legocki and Verma, 1980). The nodulin genes characterized so far are divided into early and late genes according to the time of appearance of their mRNAs (Nap and Bisseling, 1990). It has been suggested that the early nodulins are involved in the infection process and/or nodule morphogenesis (Gloudemans and Bisseling, 1989), and the late ones in nodule function and maintenance (Richter et al., 1991). In the last two decades random sequencing of expressed sequence tags (ESTs) has become a valuable method for gene discovery (Hillier et al., 1996; Marra et al., 1999; Sterky et al., 1998). There have been multitissue EST projects in several plant species, as well as more specialized, tissue-specific projects on root hairenriched Medicago truncatula tissue and root nodules (Covitz et al., 1998; Fedorova et al., 2002), woodforming tissues of poplar (Sterky et al., 1998), and hairy root of ginseng (Chung et al., 2003). cDNA microarray technology is also widely used to identify global gene expression patterns during organ development. Its sensitivity and reliability have been demonstrated in the analysis of a variety of phenomena including fruit ripening (Aharoni et al., 2002), the hypersensitive response to pathogens (Schenk et al., 2000), and the wound response (Cheong et al., 2002). To date, more than 330,000 expressed sequence tags (ESTs) from various organs of soybean have been registered in dbEST soybean (Shoemaker et al., 2002), and three studies on root nodule-specific ESTs have been reported in the model legumes, M. truncatula and Lotus japonicus (Fedorova et al., 2002; Journet et al., 2002; Szczyglowski et al., 1997). Although several groups have 54 The Root Nodule-enhanced Transcriptome in Soybean / identified novel nodulins in early and late stage soybean nodules by gene cloning (Jacobs et al., 1987; Kouchi and Hata, 1993; Richter et al., 1991; Sandal et al., 1987; Sengupta-Gopalan et al., 1986), and by proteomic approaches (Legocki and Verma, 1980; Panter et al., 2000), there are no reports of large-scale nodule-specific EST or microarray experiments in soybean. We have used EST and cDNA microarray technologies to identify large numbers of genes specifically involved in this symbiosis in soybean. We describe here the isolation of numerous genes that are up-regulated during root nodule development many of which encode functions that have not previously been implicated in the symbiosis. Materials and Methods Bacterial strain and plant material To nodulate soybean [Glycine max (L.)] cultivar Backtae, seeds were imbibed on paper towels moistened with distilled water. After three days, the sprouted seeds were inoculated with 100 ml of approximately 5 × 107 colony-forming units/ml (cfu/ml) of Bradyrhizobium japonicum strain USDA 110, which had been cultured in YEM (Vincent, 1970), and suspended in buffered nodulation medium (BNM, Ehrhardt et al., 1992). The seedlings were grown in pots filled with vermiculite at 27°C in a 16 h/8 h light/dark cycle in a growth chamber, and watered with 0.5× BNM during growth. To isolate root nodules, plants were removed from the vermiculite, washed in tap water and the nodules removed with forceps. As a control, total RNA was isolated from the roots of uninoculated plants. The harvested tissues were frozen in liquid nitrogen and stored at –80°C. RNA Isolation and cDNA library construction Nodule-specific soybean cDNA libraries were created from RNA prepared from root nodules 10 days after inoculation (DAI), 5 weeks after inoculation (WAI), and at 10 WAI. Total RNA was isolated as previously described (Uhde-Stone et al., 2003), and poly(A)enriched RNA was prepared using an Oligotex mRNA mini kit (Qiagen, USA). RNase-free DNase (Promega, USA) was used to remove any genomic DNA contamination in the RNA samples during RNA purification. cDNA libraries were constructed as previously described (Kwon and An, 2003). Briefly, cDNA was prepared using a ZAP-cDNA synthesis kit, directionally ligated to ZAP II, and packaged with Gigapack III Gold packaging extracts (Stratagene, USA). Plasmids containing cDNA inserts were excised with Ex-Assist helper phage and propagated in SOLR cells according to the manufacturer’s instructions (Stratagene, USA). The libraries were designated AN01, AN02, and AN03. EST analysis ESTs were generated by 5′ end sequencing and the resulting sequences were automatically edited to remove vector and bacterial sequences, as well as ambiguous regions. Individual ESTs were assembled into groups of contigs representing unique transcripts using the CAP3 program. Consensus sequences of all the contigs were generated based on 75% homology over a minimum of 30 bp. The individual ESTs were searched against the GenBank nr database using the BLASTX algorithm. EST hits on the same target gene were manually assembled into single contigs. On the basis of the BLASTX results, we classified ESTs into three categories: known, unknown, and no hit. A known EST was declared when the score was greater than 80 and the E-value less than 10−14. Unknown was defined as a score greater than 40 and less than 80, and an E-value of less than 10−14 but greater than 10−2. No hit was defined as a score of less than 40 with an E-value greater than 10−2. The soybean EST database is available at http://kropbase.snu.ac.kr/ cgi-bin/soybean_db_idx.cgi. The functional assignment of ESTs was based on comparisons with the Arabidopsis thaliana annotation database in the Munich Information Center for Protein Sequences (MIPS) using the BLASTX algorithm. Preparation of the cDNA microarray To prepare a 382-cDNA microarray, plasmid DNA was isolated using a Millipore plasmid kit (Millipore, USA) and inserts were PCR-amplified using T3 and T7 promoter primers. The PCR cycles were 94°C for 2 min of initial denaturation, followed by 94°C for 45 s, 55°C for 45 s, and 72°C for 2 min (for a total of 30 cycles), ending with a 10 min extension at 72°C. The PCR products were examined by 1% agarose gel electrophoresis, purified on Sephadex G-50 columns, dried and resuspended in 50% DMSO. The DNAs were spotted with an OmniGrid Microarrayer (GeneMachines, USA) onto silanized glass slides (CMT-GAPS, Corning, USA) which were crosslinked with 300 mJ/cm2 of short wave ultraviolet irradiation (Stratalinker, Stratagene, USA) and stored in a desiccator until use. Preparation of probes for microarray hybridization cDNA probes were prepared from total RNA extracted from triplicate sets of uninoculated soybean roots, roots harvested at 4 DAI, and root nodules at 10 DAI, 5 WAI, and 10 WAI. The cDNA from 4 DAI roots and 10 DAI, 5 WAI, and 10 WAI root nodules was labeled with Cy5, and the cDNA from the uninoculated roots with Cy3 (to be used as a control in all the hybridizations). In each case 80 µg of total RNA was reverse transcribed in the presence of Cy3- or Cy5-dUTP (Hegde et al., 2000). In brief, 24 µl of the RNA and 4.5 µg of oligo(dT) (Bionics, Korea) were warmed at 65°C for 10 min and cooled on ice. Thereafter we set up the following reaction in a total of 40 µl: 24 µl of the RNA oligo(dT) mixture, 1× first strand reaction buffer, 10 mM dithiothreitol, 0.5 mM dATP, dCTP, and dGTP, 0.3 mM dUTP, 50 µm Cy3- or Cy5-dUTP (Perkin Elmer, USA), and 400 units of SuperscriptIII (Invitrogen, USA) and incubated it for 2 h at 50°C. The resulting Cy3 and Cy5 probes were paired according need, and unincorporated nucleotides were removed with a PCR cleaning kit (Qiagen, USA), and the cleaned probes precipitated with absolute EtOH. Microarray hybridization and analysis The concentrated cDNA probes were suspended in 10 µl of nuclease free water Hyoungseok Lee et al. 55 / Table 1. Primers used in the semi-quantitative RT-PCR. PCR primers (5′ → 3′) Genes 1-deoxy-D-xylulose 5-phosphate reductoisomerase Polyamine oxidase Tyrosine aminotransferase Succinoaminoimidazole carboximide ribonucleotide synthetase Polygalacturonase Nicotianamine aminotransferase B Thiamine biosynthetic enzyme UDP-glucose:salicylic acid glucosyl transferase Integral membrane protein Forward CAACCTCCTAGGCTCCTATT TGTAACCGTGTACACCAAAA ACACCAGATCCGTTGTTCGT Reverse CAAGACTCTTCCAGCTTCAC CACCATCTGGGAACAAGTAT GTTTGCTCGCAATCTCAGCA GATATATACGACGCTGGGGA CAATATGAGGCCATGCTCCA ATTATGGTGCCACGGGCAA AAGCTACTGATTCTGTTTC CTTGACATGAACAAGGCAGA CATGCATCAGTTCTCCAAGCT TCTCTATGTTCTCACCAGCA TGCTGATGAAGGTGGATCCAT ATGGATTGCTAGGATTGATG GAGACAAAGGTGTGGATGGT GAGGAGAGGAATGAGGGCAT ACAAGTGGAAGTGCCTGGAC / Table 2. Soybean nodule cDNA libraries from this study. CDNA library AN01 AN02 AN03 Total Developmental stage of nodule 2 WAI 5 WAI 10 WAI Total No. of ESTs No. of Singletons No. of contigs No. of NR sequence 2,021 1,824 1,624 5,469 1,690 1,777 1,745 1,993 205 516 209 815 1,895 1,293 1,954 2,748 BLAST search result Known 1,331 1,048 1,744 1,904 Unknown 319 184 159 607 No Hit 245 061 051 0237 / and 10 µl of preheated (55°C) formamide-based hybridization buffer (50% formamide, 8× SSC, 1% SDS, 4× Denhardt’s solution). They were layered on a cDNA microarray slide, covered with 22 × 22 cover slip and incubated in a water bath at 42°C in a hybridization chamber (GenomicTree, Korea) for 16 h. The slides were washed once in 2× SSC for 5 min, twice in 0.2× SSC for 3 min, then scanned in a Genepix 4000B scanner (Axon Instruments, USA). We performed twelve hybridizations, consisting of three sets of four experiments. To identify genes differentially expressed in the nodules and roots, we performed background intensity subtraction and printtip normalization. Using image analysis software, signal intensities were determined for each dye at each element of the array, and the ratio of Cy5 to Cy3 intensity was calculated. Signal intensities were averaged from three independent experiments and the Cy5/Cy3 fluorescence ratios were log transformed (base 2 for simplicity) in order to treat inductions and repressions of identical magnitude as numerically equal but of opposite sign. For clustering analysis of the microarray data, we used Quintet (KRIBB, Korea) to generate hierarchical clusters and selforganizing maps (SOM) for these clusters. Reverse transcription-PCR Semi-quantitative RT-PCR was performed to analyze the expression of selected genes during nodule development using appropriate PCR primers (Table 1). Primers for amplifying actin were used as a quantitative control (Lee et al., 2004). Total RNA (1 µg) was used as template for reverse transcription after treatment with RNase-free DNase (Promega, USA). The PCR cycles were 95°C for 5 min of initial denaturation, followed by 94°C for 15 s, 52°C for 50 s, and 72°C for 50 s (for a total of 25 cycles), ending with a 5 min extension at 72°C. The amplified products were separated on a 1% agarose gel. Results Generation of ESTs from soybean nodules at different developmental stages Three cDNA libraries were constructed from the soybean nodules at 2 weeks after inoculation (WAI) with B. japonicum, at 5 WAI, and at 10 WAI, and named AN01, AN02, and AN03, respectively. Singlepass 5′ sequencing of these yielded 5,469 sequences. The non-redundant sequences among these were clustered into a total of 815 contigs and 1,993 singletons, giving a total of 2,748 non-redundant (nr) ESTs (Table 2). Using the BLASTX algorithm (Altschul et al., 1990), these nr ESTs were translated into their corresponding amino acid sequences and searched against the nr protein database of GenBank. 1,904 ESTs were matched to known or predicted genes and gene products. Each contig sequence is clustered with overlapping sequences and therefore presumably represents a unique transcript. The variability in the number of ESTs comprising each contig therefore probably reflects differences in the abundance of the transcripts of the corresponding genes. As seen in Table 3, 13 of the 20 most abundant contigs in terms of numbers of ESTs, are known nodulins, as expected. 56 The Root Nodule-enhanced Transcriptome in Soybean / Table 3. The 20 most abundant contigs from the root nodule ESTs. No. of ESTs Contig ID cl6ct776cn825 a In this work Nodulea Roota 637 59 0 E-Value Score e-110 399 Strongest BLASTX Hit Nodulin 44 (G. max, P04672) cl12ct292cn313 131 44 4 3e-74 281 Leghemoglobin A (G. max, P02238) cl12ct291cn312 102 22 0 5e-75 282 Leghemoglobin C1 (G. max, P02235) cl6ct775cn824 59 28 0 5e-92 339 Nodulin C51 (G. max, P04671) cl12ct290cn310 48 22 0 7e-76 285 Leghemoglobin C3 (G. max, P02237) cl12ct289cn309 46 39 0 9e-76 285 Leghemoglobin C2 (G. max, P02236) cl15ct30cn31 41 6 0 4e-94 346 Nodulin 20 (G. max, P08960) cl15ct29cn30 37 10 0 e-101 370 Nodulin 22 (G. max, P08961) cl113ct137cn145 35 4 0 4e-96 353 Nodulin 26B (G. max, P08863) cl55ct72cn75 35 2 2 e-127 457 Hypersentive-induced protein (Z. mays, AF236375) cl77ct98cn103 23 21 22 e-148 525 Phosphatase, putative (G. max, CAD57680) cl112ct133cn141 17 38 9 0 1170 Nodulin 100, sucrose synthase (G. max, AAC39323) cl66ct79cn82 16 18 4 5e-49 197 Nodulin 36B (G. max, Q02919) cl77ct97cn101 15 21 21 e-132 472 Hydrolase, putative (G. max, AAM94615) cl122ct146cn154 13 5 1 7e-62 239 Nodulin 16 (G. max, P23233) cl12ct288cn306 13 8 1 e-41 171 Nodulin 93 (G. max, Q02921) cl566ct632cn674 13 81 102 e-161 571 Sali3-2 protein, aluminium-induced (G. max,T08896) cl127ct159cn167 12 1 1 0 679 ABC transporter family (A. thaliana, NP190357) cl27ct32cn34 12 0 0 e-103 378 Unknown protein (A. thaliana, BAB10214) cl349ct406cn437 12 0 0 0 1151 Cytochrome P450 83D1p (G. max, T05940) Frequencies Values represent frequencies of ESTs transcribed in the nodule and the root from the soybean gene index (GmGI). In addition to BLASTX using the nr protein database of GenBank, we identified the protein in the Arabidopsis database (MATDB) in the Munich Institute of Protein Sequences (MIPS) using the highest BLASTX score to identify the functional category of each nr EST. A total of 402 nr ESTs could be automatically assigned to 13 functional classes in the MIPS classification system, while known nodulins were manually assigned (Fig. 1). 15% of the classified ESTs were assigned to the category of metabolism, 10% each to cell rescue, defense, cell death, and aging, and 8% to known nodulins. Construction of cDNA microarrays To select candidate genes that might be primarily regulated in the root nodule, we investigated the numbers of sequences homologous with our nr ESTs represented in public soybean EST database. For this, we made two sub-databases, one the nodule EST sub-database comprising 7,733 ESTs, the other the root EST sub-database comprising 13,750 ESTs. Afterward, homology searches were done using the 2,748 nr ESTs from our root nodule ESTs as queries, and both nodule and root EST sub-databases as subjects using the BLASTN algorithm. By this process, the frequency of the corresponding ESTs in root and nodule sub-databases was Fig. 1. Functional classification according to the MIPS classification scheme. 402 ESTs were grouped into 14 groups. counted for each nr EST in our EST database; some of the results are shown in Table 3. 264 ESTs were represented only in the nodule and 72 in both organs, but with significantly enhanced expression in the nodule. In addition to these 336 ESTs, 46 ESTs with annotations involving nodule functions were selected manually. In the end a total of Hyoungseok Lee et al. 57 / compasses many of the nodulin genes. In general the first significant increases in transcript levels were detected at 10 DAI and were maintained or augmented up to 10 WAI. We identified 34 known nodulin genes in our EST analysis; 25 genes in soybean, six in other legumes, and three in non-legume species. Of the 25 soybean genes 18 were in cluster 2. Beside the nodulin genes, nine genes were assigned to metabolism. In addition, we identified three cluster 2 genes, remorin, peroxidase, and a RING-H2 zinc finger protein, whose homologues showed noduleenhanced expression in the nodules of M. truncatula (Fedorova et al., 2002). Cluster 3 contained only one gene, a WRKY transcription factor, expressed specifically at 5 WAI, and cluster 6 also has only one gene, a nodulin 3like gene, expressed at 10 WAI. Fig. 2. Cluster analysis of 58 cDNA clones shown to be upregulated during root nodule development. Classification of the clones was based on the similarity of their expression profiles using the hierarchical clustering technique. The following experiments are on the x axis. 1, root versus 4 DAI (days after inoculation) root; 2, root versus 10 DAI nodule; 3, root versus 5 WAI (weeks after inoculation) nodule; 4, root versus 10 WAI nodule. 382 cDNA inserts from the selected ESTs were arrayed at low density on microarray slides to identify genes whose expression is differentially regulated during root nodule development. Differential gene expression during root nodule development From the cDNA microarray analysis, 81 of the 382 genes showed significant changes in expression during nodule development. Their expression patterns were grouped into seven clusters using the hierarchical clustering method. Two large clusters and five small clusters, each with distinct expression patterns, were apparent (data not shown). Among them, the genes in clusters 1, 2, 3, and 6 were up-regulated in the nodule (Fig. 2). The expression ratios of the 58 genes in these four clusters are shown in Table 4. Cluster 1 consists of seven genes whose transcript levels were not changed in 4 DAI and 10 DAI nodules, but increased in 5 and 10 WAI nodules, hence specifically in mature and senescent nodules. Of four genes whose functions were assigned to metabolism, two cysteine synthases are known to be involved in metabolism related to nitrogen. Also one transcription factor, ethylene-responsive element binding protein 1, and one cysteine proteinase in this cluster, are expressed abundantly in late stage soybean nodules. Cluster 2, the largest, has 49 genes and en- Expression of selected genes assessed by semi-quantitative RT-PCR To confirm the microarray data, we performed some semi-quantitative RT-PCR analyses with gene-specific primer sets (Table 1). We analyzed two genes in cluster 1 and seven in cluster 2, and used actin as a quantitative control (Fig. 3). The results of the RT-PCR analysis agreed in most cases with the profiles derived from the microarray data. / Discussion We used three successive procedures to select noduleenhanced transcripts. In the first step we constructed nodule-specific cDNA libraries and EST databases. Second, by in silico analysis comparing our database with public soybean EST database, we selected candidate genes that seemed to be more highly expressed in nodules than in control roots. Third, by cDNA microarray analysis, we examined the expression patterns of selected genes during nodule development. As a result, 58 genes were identified as nodule-enhanced genes; among them, 20 are known nodulin genes, nine are of unknown function, and six including a cysteine proteinase, coproporphyrinogen oxidase, and remorin, were identified as nodule-enhanced genes by EST and expression analysis in nodules of M. truncatula and L. japonicus (Fedorova et al., 2002; Szczyglowski et al., 1997). On the other hand, many genes had different expression pattern in the different legume species; for example, the transcripts of phytochelatin synthetase, bark agglutinin precursor, and B12D protein were up-regulated exclusively in M. truncatula, while those encoding polyamine oxidase, ethylene-responsive element binding protein, and RpoB transcripts were upregulated only in soybean. These differences may result from natural variation between species or from differences in the techniques used in the two studies. For example we analyzed expression patterns by both computational approaches and cDNA microarrays, while in the 58 The Root Nodule-enhanced Transcriptome in Soybean / Fig. 3. Semi-quantitative RT-PCR of selected genes that showed enhanced expression during root nodule development. Gene identities are indicated on the right side of each panel. Lane R, root as quantitative control; lane 1, root 4 DAI; lane 2, nodule 10 DAI; lane 3, nodule 5 WAI; lane 4, nodule 10 WAI. study of M. truncatula only computational methods were employed (Fedorova et al., 2002). Finally, we have identified 23 nodule-enhanced genes that are new at least to the field of large-scaled transcriptome analysis. Although ESTs and cDNA microarrays have been used on a limited scale to identify nodule-enhanced soybean genes (Jacobs et al., 1987; Kouchi and Hata, 1993; Richter et al., 1991; Sandal et al., 1987; Sengupta-Gopalan et al., 1986), this is the first report to employ these approaches on a large scale to identify genes related to soybean-Rhizobium symbiosis. Recently, four publications on the proteome of the peribacteroid membrane of symbiosomes from various legume species, have reported the presence of a number of proteins functioning as transporters, molecular chaperones, signaling proteins, and various metabolic enzymes (Catalano et al., 2004; Panter et al., 2000; Saalbach et al., 2002; Wienkoop and Saalbach, 2003). We found several of these proteins including the F1-ATPase delta subunit, sugar transporter, integral membrane protein, protein disulfide isomerase, chaperonin 60 alpha chain, and malate dehydrogenase in our soybean root nodule EST database and used them in the 382cDNA microarray. Although their expression levels in nodules were not as high as some nodulins, they were enhanced as much as two to five fold (data not shown). Although we did not identify any novel ESTs from the soybean root nodules, these results along with our data for the frequencies of the ESTs (Table 3) and the RT-PCR analysis of selected genes (Fig. 3) demonstrate that our EST and microarray approach to identifying noduleenhanced genes was reasonably successful. The following is a brief discussion of the noduleenhanced genes in the three major functional categories. Genes involved in metabolism The bacterium produces the nitrogenase complex to reduce nitrogen to ammonia, while the host plants provide energy from photosynthesis and various other metabolites. It follows that soybean genes related to various metabolic processes are expressed during symbiosis (Beringer et al., 1979). The functions of 13 of the 58 up-regulated genes were assigned to metabolism. Among them, five, i.e., two cysteine synthases, tyrosine aminotransferase, nicotianamine aminotransferase B, and succinoaminoimidazole carboximide ribonucleotide synthetase, are classified in nitrogen metabolism. In particular two of them, the cysteine synthases, are the terminal enzymes of cysteine synthesis. Cysteine cluster proteins are known to be present in nodules of pea (Scheres et al., 1990), broad bean (Kato et al., 2002), and M. truncatula (Fedorova et al., 2002). It is possible that the function of the cysteine synthases of soybean root nodules may be to supply the cysteine for various cysteine cluster proteins. In addition to the genes involved in nitrogen metabolism, two genes encoding polygalacturonase and glycosyl transferase, which are assigned to carbon metabolism, two encoding coproporphyrinogen oxidase and a thiamin biosynthetic enzyme, assigned to the metabolism of vitamins and cofactors, and three encoding cytochrome P450 monooxygenase, UDPglucose:salicylic acid glucosyltransferase, and polyamine oxidase functioning in secondary metabolism, were also up-regulated in the soybean root nodule. These results Hyoungseok Lee et al. 59 / Table 4. Functional classification of transcripts up-regulated in the soybean root nodule (Cluster 1, 2, 3, and 6). EST ID Cluster 1 Metabolism AN02016F07 AN02018G07 AN02002D03 AN03001F03 Transcription cl157ct199cn218 Strongest BLASTX Hit Nodule developmental stage 4 DAI 10 DAI 5 WAI 10 WAI Cysteine synthase (Spinacia oleracea, BAA01279) Cysteine synthase (G. max, AF452451) 1-deoxy-D-xylulose 5-phosphate reductoisomerase (Z. mays, CAC03581) Polyamine oxidase (A. thaliana, BAC43225) -0.17527 -0.43315 0.819503 0.278987 -0.1591 -1.17178 3.653666 2.386178 2.688039 3.736222 1.97382 1.227413 0.073529 0.057025 2.450013 1.271627 Ethylene-responsive element binding protein 1 (G. max, AAM45475) -0.11394 -0.18533 2.969707 0.89363 Protein destination AN02006E07 Cysteine proteinase (G. max, BAA06030) Unclassified, unknown proteins AN03008D06 At5g16360.1 1.76716 0.276625 3.908089 3.094974 -0.45339 0.032898 2.61737 3.283059 Cluster 2 Metabolism cl649ct724cn773 cl604ct677cn723 0.410262 -0.48492 4.069916 2.87653 2.455869 2.426556 5.534338 2.632074 Tyrosine aminotransferase (A. thaliana, NP_200208) Succinoaminoimidazolecarboximide ribonucleotide synthetase (Vigna unguiculata, AAL48317) Beta-1,4-N-acetylglucosaminyltransferase (A. thaliana, NP_172759) Polygalacturonase (A. thaliana, NP_195292) Thiamin biosynthetic enzyme (G. max, BAA88226) Coproporphyrinogen oxidase (G. max, CAA50400) Cytochrome P450 monooxygenase (Cicer arietinum, CAD31843) Nicotianamine aminotransferase B (Hordeum vulgare, BAA87053) UDP-glucose:salicylic acid glucosyltransferase (Nicotiana tabacum, AF190634) 0.727592 2.065705 2.423111 1.092563 -2.47727 -0.22257 0.461948 0.197091 7.782655 5.471685 4.387613 4.204318 5.217395 1.745241 4.300804 3.989846 5.504981 3.966187 4.141745 5.31058 -0.16523 2.274022 0.696865 3.134205 -0.21944 1.553638 2.190294 5.313119 Glycolate oxidase (Lens culinaris, 1803516A) 0.251307 2.539481 0.608502 2.718349 0.770395 5.529406 4.895271 5.023331 0.36232 0.125743 -0.17425 3.91691 5.763927 1.177731 2.093165 4.807397 2.370692 3.433199 4.94928 1.296266 Cellular rescue, defense, cell death and ageing AN01006C12 Peroxidase (G. max, AF145350) 0.528484 2.873938 3.12138 2.957388 Plant hormonal regulation cl632ct706cn754 Gibberellin 20-oxidase (Phaseolus vulgaris, T11848) 1.855501 2.315797 2.501222 2.874458 0.061328 2.65016 2.902327 3.393847 AN02011G10 cl309ct363cn389 cl1ct1cn1 cl552ct623cn665 AN01017F01 AN02009G09 cl299ct353cn379 Energy cl287ct341cn366 Transcription AN03013A02 RpoB (G. max, AAL07334) Cellular communication/signal transduction cl253ct305cn330 Remorin 1 (L. esculentum, AF123265) AN02002A11 14-3-3-like protein (Pisum sativum, CAB42547) AN03006G06 Inositol 1,3,4-trisphosphate 5/6-kinase-like protein (A. thaliana, NP_567334) Unclassified, unknown protein AN01010D06 (Continued) Auxin-induced protein (A. thaliana, NP_180016) 60 The Root Nodule-enhanced Transcriptome in Soybean / Nodule developmental stage EST ID Strongest BLASTX Hit 4 DAI 10 DAI 5 WAI 10 WAI AN03008F09 cl198ct245cn267 cl547ct618cn660 cl247ct299cn323 AN03007B05 AN02019H11 cl230ct281cn303 cl591ct664cn709 AN01005E05 cl336ct391cn420 AN02002C07 AN02005B02 AN03015F04 Resistant specific protein-1(4) (Vigna radiata, BAC22499) F-box protein family (A. thaliana, NP_189030) RING-H2 zinc finger protein (A. thaliana, AAM66032) At1g19180 (A. thaliana, AAL87391) At1g21460.1 At4g39235.1 Hypothetical protein (A. thaliana, NP_176128) Unknown protein (Saccharomyces cerevisiae, AAL79278) Unknown protein (A. thaliana, AAL60024) Unknown protein (A. thaliana, AAN13116) Unknown protein (O. sativa, AC098566) Prolyl 4-hydroxylase, alpha subunit (O. sativa, AC068923) Integral membrane protein (A. thaliana, AAD17424) 2.447672 0.707786 0.787688 1.675975 0.089254 -0.32142 0.090157 -0.36468 0.552937 -0.45606 -1.10649 -0.88452 1.837438 2.559374 5.123451 2.567572 1.564894 4.925796 3.047452 5.16215 1.411966 6.79795 2.965997 3.205291 4.255725 2.660995 1.519024 6.142547 5.656819 4.806285 2.353765 5.107818 3.453145 2.081727 4.661285 4.035498 2.41315 2.889657 1.175476 2.303462 7.322282 5.484843 1.502134 2.956531 4.408153 3.375886 3.386288 4.919842 3.996326 4.102879 3.315073 2.736692 Leghemoglobin C1 (G. max, P02235) Leghemoglobin C2 (G. max, P02236) Leghemoglobin C3 (G. max, P02237) Hemoglobin, 2-on-2 (A. thaliana) Nodulin 20 (G. max, P08960) Nodulin 21 (G. max, P16313) Nodulin 22 (G. max, P08961) Nodulin 22 (G. max, P08961) Nodulin 24 (G. max, P04145) Nodulin 26B (G. max, P08863) Nodulin 35, uricase (G. max, BAA13184) Nodulin 36B, early (G. max, Q02919) Nodulin 44 (Nodulin E27) (G. max, P04672) Nddulin 55-2, early (G. max, Q02917) Nodulin 6l (G. max, AF434718) Nodulin 93 Early (G. max, Q02921) Nodulin C51 (G. max, P04671) Nodulin Nlj16, late (Lotus japonicus, AAC49692) Nodulin-16 (G. max, CAA38204) -0.21868 -0.07827 0.064438 0.786141 -0.06405 0.848704 -0.56838 -0.77968 -0.06684 -0.14529 -0.24232 -0.05228 1.204219 0.03798 -0.55615 -1.14944 -0.10639 -0.58216 0.557936 6.072787 6.314265 6.724354 2.327818 8.108586 4.867635 9.400088 5.41842 6.566009 5.817806 4.222555 5.7062 6.421188 3.680623 4.855964 4.354611 10.23526 2.359448 7.613594 5.810749 5.882737 6.33176 2.99174 6.228092 5.343719 7.025001 4.455443 5.929285 5.891865 4.176018 0.893629 5.96872 1.724324 4.445715 5.97144 7.170326 3.819925 7.177962 5.82704 6.169947 6.74856 2.831268 6.145062 4.901672 6.603087 4.463273 5.38838 6.222952 4.869505 2.823086 5.91081 2.910271 6.051122 6.000292 7.574951 4.40942 8.919058 1.07141 -1.29268 2.7729 -1.31867 0.353109 2.818226 -0.30378 -2.91668 Known nodulins cl12ct109cn119 cl12ct109cn115 cl12ct110cn116 AN03007H02 cl15ct18cn18 cl584ct656cn700 cl63ct74cn77 cl15ct17cn17 cl23ct25cn25 cl113ct143cn157 cl34ct39cn41 cl66ct78cn81 cl6ct484cn521 cl302ct356cn382 cl172ct215cn235 AN02016F12 cl6ct483cn520 AN02021A09 cl102ct130cn143 Cluster 3 Transcription AN02002D08 Transcription factor WRKY4 (Petroselinum crispum, AF204925) Cluster 6 Known nodulin cl446ct507cn544 Nodulin 3 like protein (M. truncatula, CAA69976) reflect the nature of the active metabolism in developing nodules. Genes involved in protein degradation Protein degradation has an important role in the process of senescence and in the regulation of various plant hormone responses. We found that the expression of a cysteine proteinase was dramatically increased at 5 WAI and at 10 WAI during root nodule senescence, whereas it was down-regulated in the developing root nodules. Cysteine proteinases are particularly abundant among proteinases and well-known senescence-associated proteins (Noh and Amasino, 1999). However, the microarray data showed that only AN03008D06 of the seven cysteine proteinase genes was signifi- Hyoungseok Lee et al. / cantly enriched in late stage nodules. Thus, AN03008D06 encoding a vacuolar-located protease appears to play a role during root nodule senescence. Also, transcripts of an F-box family protein and a RING-H2 zinc finger protein accumulated during root nodule development, particularly at 5 WAI and 10 WAI, late stages of root nodule development. These proteins are components of an SCF (for SKP1, Cullin, and F box protein) E3 ubiquitin ligase complex, and the F box is important for recognition of the cognate SCF complex. The subunit containing the RINGH2 motif (Rbx1/Hrt1/Roc1) has also been identified as an essential SCF component. The F-box subunit interacts directly with targets for ubiquitination via a C-terminal protein–protein interaction domain. Formation of a polyubiquitin chain on the substrate targets it for destruction by the 26S proteasome, which regulates responses to six major plant hormones, i.e., auxin, jasmonic acid, GA, cytokinin, brassinosteroid, ABA, and ethylene (Itoh et al., 2003; Potuschak et al., 2003). During root nodule development, protein degradation and various effects of plant hormones may well regulate the development of the root nodule and associated metabolic pathways. The F-box and zinc finger proteins may be essential components of a 26S proteasome complex functioning specifically in the soybean root nodule. Known nodulin genes As originally defined, nodulin genes are those expressed exclusively in nodules (Legocki and Verma, 1980). However that definition has been modified recently because a number of nodulin genes are expressed to a limited extent in other plant organs (Coba de la Pena et al., 1997; Fedorova et al., 2002; Kapranov et al., 1997; Mathesius et al., 2001). Also in our in silico analysis (data not shown) soybean nodulin 26 was detected in flower and hypocotyl, and soybean early nodulin N93 in root, shoot, and seedlings. Although Nlj16 was up-regulated with other soybean nodulin genes, expression of another five of the nine nodulin genes with homologues in other legume species did not increased significantly in our microarray analysis. Furthermore, homologues of ENOD8.3 and MtN3 of M. truncatula and of a nodulin-like protein in Arabidopsis were actually down-regulated during soybean nodule development (data not shown). These results suggest that the genes needed for nodule development and the network of interactions between them may differ depending on the legume species. In conclusion, we have extended our understanding of the soybean gene products involved in symbiotic nitrogen fixation by identifying 58 genes that appear to be upregulated in root nodules, including 20 nodulin genes. Our analyses also revealed 23 genes with enhanced expression in nodules that had been previously overlooked, such as UDP-glucose:salicylic acid glucosyltransferase, polygalacturonase, and polyamine oxidase related to me- 61 tabolism, and an F-box protein and RING-H2 zinc finger protein related to the SCF complex. Additional experiments, such as larger cDNA microarray analyses, in situ hybridization, genomic clones and promoter elements analysis, will certainly further our understanding of the coordinated gene expression in the soybean root nodule. The information from this study will be useful in further analysis of the roles of nodule specific genes and their signaling network during root nodule development. Acknowledgments This work was supported by a grant from the Crop Functional Genomics Center (Project No. M101-KG01000103K0701-01810, Korea) to C. S. An. The Brain Korea 21 Project of the Ministry of Education supported H. Lee and C. J. Oh. References Aharoni, A., Keizer, L. C., Van Den Broeck, H. C., BlancoPortales, R., Munoz-Blanco, J., Bois, G., Smit, P., De Vos, R. C., and O’Connell, A. P. (2002) Novel insight into vascular, stress, and auxin-dependent and -independent gene expression programs in strawberry, a non-climacteric fruit. Plant Physiol. 129, 1019−1031. Altschul, S. F., Gish, W., Miller, W., Meyers, E. W., and Lipman, D. (1990) Basic local alignment search tool. J. Mol. Biol. 215, 403−410. Beringer, J. E., Brewin, N., Johnston, A. W., Schulman, H. M., and Hopwood, D. A. (1979) The Rhizobium--legume symbiosis. Proc. R. Soc. Lond. B. Biol. Sci. 204, 219−233. Catalano, C. M., Lane, W. S., and Sherrier, D. J. (2004) Biochemical characterization of symbiosome membrane proteins from Medicago truncatula root nodules. Electrophoresis 25, 519−531. Cheong, Y. H., Chang, H. S., Gupta, R., Wang, X., Zhu, T., and Luan, S. (2002) Transcriptional profiling reveals novel interactions between wounding, pathogen, abiotic stress, and hormonal responses in Arabidopsis. Plant Physiol. 129, 661−677. Chung, H., Cho, I. S., Kim, J., In, D. S., Hur, C. G., Song, J. S., Woo, S. S., Choi, D. W., and Liu, J. R. (2003) Changes in gene expression during hairy root formation by Agrobacterium rhizogenes infection in ginseng. J. Plant Biol. 46, 187−198. Coba de la Pena, T., Frugier, F., McKhann, H. I., Bauer, P., Brown, S., Kondorosi, A., and Crespi, M. (1997) A carbonic anhydrase gene is induced in the nodule primordium and its cell-specific expression is controlled by the presence of Rhizobium during development. Plant J. 11, 407−420. Covitz, P. A., Smith, L. S., and Long, S. R. (1998) Expressed sequence tags from a root-hair-enriched Medicago truncatula cDNA library. Plant Physiol. 117, 1325−1332. Ehrhardt, D. W., Atkinson, E. M., and Long, S. R. (1992) Depolarization of alfalfa root hair membrane potential by Rhizobium meliloti Nod factors. Science 256, 998−1000. Fedorova, M., van de Mortel, J., Matsumoto, P. A., Cho, J., Town, C. D., VandenBosch, K. A., Gantt, J. S., and Vance, C. P. (2002) Genome-wide identification of nodule-specific transcripts in the model legume Medicago truncatula. Plant Physiol. 130, 519−537. 62 The Root Nodule-enhanced Transcriptome in Soybean / Gloudemans, T. and Bisseling, T. (1989) Plant gene expression in early stages of Rhizobium-legume symbiosis. Plant Sci. 65, 1−14. Hegde, P., Qi, R., Abernathy, K., Gay, C., Dharap, S., Gaspard, R., Hughes, J. E., Snesrud, E., Lee, N., and Quackenbush, J. (2000) A concise guide to cDNA microarray analysis. Biotechniques 29, 548−550. Hillier, L. D., Lennon, G., Becker, M., Bonaldo, M. F., and Chiapelli, B. (1996) Generation and analysis of 280,000 human expressed sequence tags. Genome Res. 6, 807−828. Itoh, H., Matsuoka, M., and Steber, C. M. (2003) A role for the ubiquitin-26S-proteasome pathway in gibberellin signaling. Trends Plant Sci. 8, 492−497. Jacobs, F. A., Zhang, M., Fortin, M. G., and Verma, D. P. (1987) Several nodulins of soybean share structural domains but differ in their subcellular locations. Nucleic Acids Res. 15, 1271− 1280. Journet, E. P., van Tuinen, D., Gouzy, J., Crespeau, H., Carreau, V., Farmer, M. J., Niebel, A., Schiex, T., Jaillon, O., Chatagnier, O., Godiard, L., Micheli, F., Kahn, D., Gianinazzi-Pearson, V., and Gamas, P. (2002) Exploring root symbiotic programs in the model legume Medicago truncatula using EST analysis. Nucleic Acids Res. 30, 5579−5592. Kapranov, P., de Bruijn, F. J., and Szczyglowski, K. (1997) Novel, highly expressed late nodulin gene (LjNOD16) from Lotus japonicus. Plant Physiol. 113, 1081−1090. Kato, T., Kawashima, K., Miwa, M., Mimura, Y., Tamaoki, M., Kouchi, H., and Suganuma, N. (2002) Expression of genes encoding late nodulins characterized by a putative signal peptide and conserved cysteine residues is reduced in ineffective pea nodules. Mol. Plant-Microbe Interact. 15, 129−137. Kouchi, H. and Hata, S. (1993) Isolation and characterization of novel nodulin cDNAs representing genes expressed at early stages of soybean nodule development. Mol. Gen. Genet. 238, 106−119. Kwon, S. I. and An, C. S. (2003) Cloning and expression of mitochondrial MnSOD from the small radish (Raphanus sativus L.). Mol. Cells 16, 194−200. Lee, H., Kim, H., and An, C. S. (2004) Cloning and expression analysis of 2-on-2 hemoglobin from soybean. J. Plant Biol. 47, 92−98. Legocki, R. P. and Verma, D. P. (1980) Identification of “nodulespecific” host proteins (nodoulins) involved in the development of Rhizobium-legume symbiosis. Cell 20, 153−163. Marra, M., Hillier, L., Kucaba, T., Allen, M., and Barstead, R. (1999) An encyclopedia of mouse genes. Nat. Genet. 21, 191−194. Mathesius, U., Keijzers, G., Natera, S. H., Weinman, J. J., Djordjevic, M. A., and Rolfe, B. G. (2001) Establishment of a root proteome reference map for the model legume Medicago truncatula using the expressed sequence tag database for peptide mass fingerprinting. Proteomics 1, 1424−1440. Nap, J. P. and Bisseling, T. (1990) Nodulin function and nodulin gene regulation in root nodule development; in Molecular Biology in Symbiotic Nitrogen Fixation, Gresshoff, P. M. (ed.), pp. 181−229, CRC Press, Boca Raton. Noh, Y. S. and Amasino, R. M. (1999) Identification of a promoter region responsible for the senescence-specific expression of SAG12. Plant Mol. Biol. 41, 181−194. Panter, S., Thomson, R., de Bruxelles, G., Laver, D., Trevaskis, B., and Udvardi, M. (2000) Identification with proteomics of novel proteins associated with the peribacteroid membrane of soybean root nodules. Mol. Plant Microbe Interact. 13, 325− 333. Potuschak, T., Lechner, E., Parmentier, Y., Yanagisawa, S., Grava, S., Koncz, C., and Genschik, P. (2003) EIN3-dependent regulation of plant ethylene hormone signaling by two arabidopsis F box proteins: EBF1 and EBF2. Cell 115, 679−689. Richter, H. E., Sandal, N. N., Marcker, K. A., and SenguptaGopalan, C. (1991) Characterization and genomic organization of a highly expressed late nodulin gene subfamily in soybeans. Mol. Gen. Genet. 229, 445−452. Saalbach, G., Erik, P., and Wienkoop, S. (2002) Characterisation by proteomics of peribacteroid space and peribacteroid membrane preparations from pea (Pisum sativum) symbiosomes. Proteomics 2, 325−337. Sandal, N. N., Bojsen, K., and Marcker, K. A. (1987) A small family of nodule specific genes from soybean. Nucleic Acids Res. 15, 1507−1519. Schenk, P. M., Kazan, K., Wilson, I., Anderson, J. P., Richmond, T., Somerville, S. C., and Manners, J. M. (2000) Coordinated plant defense responses in Arabidopsis revealed by microarray analysis. Proc. Natl. Acad. Sci. USA 97, 11655−11660. Scheres, B., van Engelen, F., van de Knaap, E., van de Wiel, C., van Kammen, A., and Bisseling, T. (1990) Sequential induction of nodulin gene expression in the developing pea nodule. Plant Cell 2, 687−700. Sengupta-Gopalan, C., Pitas, J. W., Thompson, D. V., and Moffman, L. M. (1986) Expression of host genes during root nodule development in soybeans. Mol. Gen. Genet. 203, 410−420. Shoemaker, R., Keim, P., Vodkin, L., Retzel, E., Clifton, S. W., Waterston, R., Smoller, D., Coryell, V., Khanna, A., Erpelding, J., Gai, X., Brendel, V., Raph-Schmidt, C., Shoop, E. G., Vielweber, C. J., Schmatz, M., Pape, D., Bowers, Y., Theising, B., Martin, J., Dante, M., Wylie, T., and Granger, C. (2002) A compilation of soybean ESTs: generation and analysis. Genome 45, 329−338. Sterky, F., Regan, S., Karlsson, J., Hertzberg, M., Rohde, A., Holmberg, A., Amini, B., Bhalerao, R., Larsson, M., Villarroel, R., Van Montagu, M., Sandberg, G., Olsson, O., Teeri, T. T., Boerjan, W., Gustafsson, P., Uhlen, M., Sundberg, B., and Lundeberg, J. (1998) Gene discovery in the wood-forming tissues of popular: analysis of 5,692 expressed sequence tags. Proc. Natl. Acad. Sci. USA 95, 13330−13335. Szczyglowski, K., Hamburger, D., Kapranov, P., and de Bruijn, F. J. (1997) Construction of a Lotus japonicus late nodulin expressed sequence tag library and identification of novel nodule-specific genes. Plant Physiol. 114, 1335−1346. Uhde-Stone, C., Zinn, K. E., Ramirez-Yanez, M., Li, A., Vance, C. P., and Allan, D. L. (2003) Nylon filter arrays reveal differential gene expression in proteoid roots of white lupin in response to phosphorus deficiency. Plant Physiol. 131, 1064− 1079. Vincent, J. M. (1970) A Manual for the Practical Study of RootNodule Bacteria, I.B.P. Handbook, No. 15, Blackwell, Oxford. Wienkoop, S. and Saalbach, G. (2003) Proteome analysis. Novel proteins identified at the peribacteroid membrane from Lotus japonicus root nodules. Plant Physiol. 131, 1080−1090.
© Copyright 2026 Paperzz