Journal of Experimental Botany, Vol. 57, No. 11, pp. 2887–2897, 2006 doi:10.1093/jxb/erl057 Advance Access publication 12 July, 2006 RESEARCH PAPER Divergence in spatial expression patterns and in response to stimuli of tandem-repeat paralogues encoding a novel class of proline-rich proteins in Oryza sativa Rong Wang1,2, Kang Chong1 and Tai Wang1,* 1 Key Laboratory of Photosynthesis and Environmental Molecular Physiology, Institute of Botany, Chinese Academy of Sciences, 20 Nanxincun, Xiangshan, Haidianqu, Beijing 100093, China 2 Graduate School, Chinese Academy of Sciences, Beijing 100049, China Received 13 March 2006; Accepted 12 May 2006 Abstract Gene duplication has been recognized as a major route to supply raw sequences for genome novelty in evolution, but the mechanism underlying the retention of duplicate genes is not fully understood yet. Divergence in spatial expression patterns was investigated here and in response to stimuli of the four members of a rice proline-rich protein gene family (OsPRP1), which encode a class of proline-rich proteins. The four paralogues are tandemly organized within a 20 kbp range of chromosome 10 without any interval of other open reading frames and with a median KS value of 0.474. These paralogues showed little similarity in their regulatory regions but high conservation in coding regions. Search of an intergenomic cis-element database predicted their promoter regions with divergent cis-element fingerprints. Further expression analyses involving different tissues/organs and nine types of stimuli by a promoter::GUS-fusion strategy revealed that the four paralogues were expressed mainly in vascular cylinders of different organs and showed diversity in tissue/organ specificity and in response to these stimuli, with some overlapping expression. Furthermore, these data show that OsPRP1.2 appeared to inherit most of the functions from their multifunctional progenitor, whereas the other three genes diverged after duplication events. Thus, the retention of paralogues in a multigene family seems to require a more complicated diversification process than originally thought. In addition, the promoter::GUS strategy is a powerful way to explore function divergence of a tandem-repeat gene family. Key words: Divergence, gene duplication, gene expression, Oryza sativa, proline-rich proteins (PRPs), tandem duplication. Introduction The widespread existence of duplicated genes in all sequenced genomes (Zhang, 2003) has supported the notion that gene duplication is an important source of new genes and hence plays a vital role during the evolution of organisms (Gilbert et al., 1997; Bowers et al., 2003; Gu et al., 2003, 2005). Duplication of individual genes (tandemly repeated or dispersedly duplicated), of large chromosomal regions (segmentally duplicated), or of complete genomes (polyploidization) accounts for the generation of sequence redundant copies (see review by Adams and Wendel, 2005; Hurles, 2004). In rice, the model plant of grasses, and significant grain species, duplicated segments cover more than half of the whole genome (Guyot and Keller, 2004; Yu et al., 2005). Functional divergence between duplicates is generally accepted as being required for their long-term retention in a genome, yet the evolutionary mechanism underlying the divergence is not well understood. Ohno (1970) has proposed the fate of two duplicates: one gene maintains the ancestor’s function and the other can freely mutate and evolve. Therefore pure retention, gain of new function (neofunctionalization) and loss of function (pseudogenization or non-functionalization) are possible fates for duplicates. An alternative model, the duplication–degeneration– complementation (DDC) model (Force et al., 1999; Lynch and Force, 2000), suggests another fate (subfunctionalization), whereby all duplicates partition the ancestral functions, * To whom correspondence should be addressed. E-mail: [email protected] ª The Author [2006]. Published by Oxford University Press [on behalf of the Society for Experimental Biology]. All rights reserved. For Permissions, please e-mail: [email protected] 2888 Wang et al. complement each other, and together carry out the same functions of the ancestral gene. In addition, the functional preservation of all duplicates without diversification can provide more expression products (Gu et al., 2003). Recent expression analyses at the genome scale have revealed that, in yeast, humans, and Arabidopsis, most duplicated gene pairs have diverged in expression and the divergence seems to be correlated positively with evolution time (with KS used as a proxy of evolution time) (Li, 1997; Gu et al., 2002; Makova and Li, 2003; Blanc and Wolfe, 2004), which supports the notion that expression divergence might be a key step toward the retention of duplicated genes (Ohno, 1970; Ferris and Whitt, 1979; Force et al., 1999). Further studies in yeast and Arabidopsis genomes showed that these duplicated pairs diverging in transcriptional regulation regions take up a higher proportion than those diverging in coding sequences (Conant and Wagner, 2003). Therefore, the effect of selection pressure on coding regions is probably greater than that on regulatory sequences (Haberer et al., 2004). The mutations in regulatory regions of duplicates may alter temporal and/or spatial expression patterns or transcriptional responses to internal and external stimuli, thus resulting in partitioning the ancestral functions between the duplicates (Moore and Puruggannan, 2005). Therefore, the expression divergence of duplicates represents an important aspect of their functional diversification. Genome-scale analyses have depicted a general picture of expression divergence between duplicate gene pairs, but the expression datasets may underestimate the proportion of divergently expressing duplicates because (i) these studies mainly involved two members or duplicate pairs of multigene families (Makova and Li, 2003; Hughes and Friedman, 2005; Haberer et al., 2004); (ii) tandem-repeated genes share high similarity over their transcripts, so completely excluding cross-hybridization is difficult (Blanc and Wolfe, 2004); and (iii) datasets used in these studies were possibly from partial tissues (Gu et al., 2002) and did not include the expression information of duplicates responding to environmental stimuli and plant growth substances (Haberer et al., 2004). Therefore, studies of expression divergence between members of multigene families, which represent a large proportion of duplicates in higher eukaryotes, is essential to understand how duplicates were retained. However, present knowledge of divergent expression in multigene families is derived only from several limited examples, such as the Arabidopsis MADS box family of 105 genes, with retention involving divergent expression and putative silencing (Kofuji et al., 2003; Parenicova et al., 2003; Irish and Litt, 2005). Expression divergence in spatial specificity has been investigated here and in response to plant growth substances and environmental stresses among the four members of the OsPRP1 of rice family exhibiting tandem duplication, which encodes a class of proline-rich proteins (PRPs). The four paralogues show little similarity in their regulatory regions, but their coding regions are highly conserved. These data revealed non-uniform expression in tissue/organ specificity and in response to additional stimuli. These results, combined with the preferential expression of the genes in the vascular cylinders of the tissues/organs examined, indicated the possibly important role of this family in plant development. These data also suggest a multiform and/or mixed divergence mode (degeneration, subfunctionalization, and part functional redundancy) for these duplicates, and the functional divergence of multigene families may be more complicated than the theoretical models suggest. Materials and methods Plant materials Rice cultivar Zhonghua10 (Oryza sativa L .cv japonica) was used for all genetic transformations. For seed germination and callus inducement, seeds were husked manually and then surface-sterilized by soaking in 70% ethanol for 30 s and 0.1% HgCl2 for 10 min, then rinsed several times with sterile distilled water. The seedlings were planted in a greenhouse and grown under 12/12 h light/dark at 28 8C. T-DNA construction The promoter regions of OsPRP1.1, OsPRP1.2, OsPRP1.3, and OsPRP1.4 (from OSJNBa0031A07, chromosome 10) were amplified with the primer pairs P1F/R P2F/R, P3F/R, and P4F/R (Table 1), Table 1. Primer pairs used for PCR The adopted Restriction Enzyme (RE) sites are underlined. Primer pair Sequence RE P1F P1R P2F P2R P3F P3R P4F P4R PGF PGR 59>AAGCTTGAAGCTAGACAGCGAGATG<39 59>CCCGGGTTTCCAGTGGCCTTGCTCTC<39 59>CTGCAGTTTGACCACACGGGATAATCG<39 59>TCTAGACCACCTGGAGATCTGCACAA<39 59>TTAAGCTTTAGTTGCTTTCTTGCTTAGTGGG<39 59>AATCCCGGGAGGTCTGCATAGTAAAATTCC<39 59>ATCAAGCTTACCACAATAGTGCACGTGCCTTC<39 59>TTACCCGGGCACTTGATCGCCACCTGGAG<39 59>CAAAGCCAGTAAAGTAGAACGGT<39 59>GCATGTTACGTCCTGTAGAAACCC<39 HindIII SmaI PstI XbaI HindIII SmaI HindIII SmaI Spatial expression patterns in rice 2889 Fig. 1. Characterization of the OsPRP1 family. (A) Tandem arrangement of OsPRP1 family in chromosome 10 and its flanking regions. Arrows indicate the transcription direction of the respective paralogues. (B) Multiple alignments of the amino acid sequences. Arrowhead marks the cleavable sites of signal peptidase, and the two 18 amino acid repeats are marked by ‘stars’. (C) Multiple alignments of 50 bp 59 and 39 termini of the introns show the introns spliced out at the canonical GT-AG site (marked by ‘stars’) and flanking exons (italics). (D) Pairwise comparison of similarity among the promoters (a), 59 UTRs (b), coding region (c), introns (d), and 39 UTRs (e). (E) Synonymous divergence (KS) computed by pairwise combinations of OsPRP1 members. respectively, and ligased into pGEM-T vector (Promega) through T/A cloning after being gel purified. Because recent studies have indicated that introns may be implicated in the regulation of gene expression (Majewski and Ott, 2002; Rose, 2004; Seoighe et al., 2005), these amplified fragments containing predicted promoter regions, 59 untranslated regions, and the first exon and the intron, covered 2.5 kbp upstream of nucleotide 62 of the second exon for OsPRP1.1, 2.7 kbp upstream of nucleotide 19 of the second exon for OsPRP1.2, 2.5 kbp upstream of nucleotide 1 of the second exon for OsPRP1.3, and 2.6 kbp upstream of nucleotide 24 of the second exon for OsPRP1.4. After being confirmed by sequencing, each of the cloned fragments in T-vectors was cut and then ligased into pBI121 (Clonetech, San Francisco) to replace the CaMV 35S promoter, generating an immediate vector containing the OsPRP1.n promoter::GUS::TerN8S fusion sequence. Finally, this fusion sequence was inserted into the HindIII–EcoRI site of pCAMBIA1300 (CAMBIA, Canberra, Australia), resulting in the expected constructs, designated as pPRP1.1P-GUS, pPRP1.2P-GUS, 2890 Wang et al. pPRP1.3P-GUS, and pPRP1.4P-GUS. All these constructs were confirmed by DNA sequencing. Agrobacterium-mediated transformation of rice Each resulting construct was introduced into Agrobacterium tumefaciens strain EHA105 by a freeze–thaw method described previously (An, 1987), and then into rice embryonic calli to generate transgenic plants by following the methods of Hiei et al. (1994) and Toki (1997). The desired transgenic lines were determined by two methods: (i) PCR with the GUS-specific primer pair PGF/R (Table 1) and the genome DNA templates of T0 plants; and (ii) GUS activity detected in T1 plants by histochemical staining (see below). T1 seeds from T0 plants were germinated and grown on a solid half- strength MS medium containing 30 mg lÿ1 hygromycin (Hyg) for the selection of transgenic progeny. T2 seeds were harvested from independent T1 plants and put on plates on the same medium for screening of homozygous T2 plants. Homozygous T2 seeds were used for further experiments. GUS histochemical staining Different tissues from transgenic lines and wild-type plants were used for histochemical detection of GUS as described by Jefferson et al. (1987). Staining reactions were conducted at 37 8C for 1–4 h, then the stained materials were dipped into 95% ethanol to clear the chlorophyll and observed under a stereomicroscope. To define the type of cells showing GUS signals, the stained root materials were embedded into Spurr resin and sectioned for optical microscopy. Treatment of plant growth substances and stresses T2 generation seedlings harbouring each construct were grown on half-strength MS salt solution under 12/12 h light/dark at 28 8C. Seedlings at the 2-leaf stage were transferred into a half-strength MS salt solution containing either 0.1 lM IAA (indole acetic acid), 1 lM GA3 (gibberellin A3), 1 lM ABA (abscisic acid), 1 lM KT (kinetin), 300 lM SA (salicylic acid), 10 lM ET (ethephon), 50 mM NaCl (sodium chloride), or 200 mM Sor (sorbitol), with only roots immersed completely in this solution and cultured for 12 h at 28 8C. For cold treatment, seedlings were exposed to 10 8C, with controls exposed to 28 8C for 12 h. Treated seedlings were used to extract total proteins for the GUS fluorogenic assay described next. For each treatment, the experiments were repeated three times. Fig. 2. Homology tree of OsPRP1 and other known or annotated PRPs. A multiple alignment by the full alignment method (default parameters). Sequences include OsPRP1.1;OsPRP1.4 (this study, Oryza sativa, accession no. NP_919846; NP_919847; NP_919848; NP_919849), annotated proteins (accession no. NP_919831, NP_919834, NP_919837, NP_919838, NP_919839, NP_919840, NP_919842, NP_919843, NP_919845 and AAD29802), a maize PRP (Zea mays, accession no. CAB65536), AtPRP2 and AtPRP4 (Arabidopsis thaliana, accession no. AAF64549 and AAF64551), NtGPP1 (Nicotiana tabacum, accession no. AAF28387), and StGPP1 (Solanum tuberosum, protein_id: CAA04449). GUS fluorogenic assay GUS fluorogenic assay was performed as described by Jefferson (1987). Briefly, total soluble proteins were extracted by grinding the seedlings into a fine powder in liquid N2 and homogenating with GUS extraction buffer (100 mM sodium phosphate buffer, pH 7.0, 10 mM EDTA, 10 mM DTT, 0.1% (v/v) Triton X-100, 0.1% sodium lauryl sarcosine). Supernatant was collected by centrifuging at 15 000 g for 5 min at 4 8C and stored at ÿ80 8C in aliquots after being quickfrozen in liquid N2 until the enzyme activity assay. For enzyme activity assay, 5 mM 4-methylumbelliferyl glucuronide (4-MUG) in the GUS extraction buffer was added to the enzyme samples to a final concentration of 1 mM. All the reactions were carried out on 96-well plates at 37 8C. Relative fluorescence units (RFU) were read by the use of a fluorometer (DTX 880 Multimode Detector, Beckman Coulter) with a 360 nm Exfilter and 465 nm Emfilter every 30 min. The total soluble protein was determined by Bradford assay (Bradford, 1976). Table 2. Classification of known PRPs in higher plants on the basis of motifs, domains and biochemical characteristics Class Motif type Distribution and number of motif Proline-rich domains I PPXZ(K/T) (X, Z=V, Y, H or E) Tandem or in cluster C-terminal II PPYV and PPTPRPS Dispersed or in cluster or no N-terminal III IV PPV, P(V/I)YK, KKPCPP PEPK Dispersed or in cluster Tandem or dispersed V PKPE, P(V/E)PPK Dozens of repeats Dispersed Several repeats C-terminal Whole protein chain C-terminal Other characteristics Examples and references SbPRP1/2 (Hong et al., 1989) C-terminal hydrophobic domain Containing Cys No distinct proline-rich/poor domain MsPRP5 (Györgyey et al., 1997) ZmHyPRP1 (Josè-Estanyol et al., 1992) AtPRP2/4 (Fowler et al., 1999) WPRP1 (Raines et al., 1991) OsPRP1.1;4 (This study) Spatial expression patterns in rice Statistical analysis Experiments were conducted in triplicate. ANOVA was used to analyse relative activity of the GUS enzyme. P <0.05 was considered significant. 2891 regions. Signal peptides were predicted by use of SMART (Schultz et al., 1998; Letunic et al., 2004). Results Sequence analysis TBLASTN in the dbEST of GenBank and BLASTP (Altschul et al., 1997) were tried using OsPRP1.1 amino acid sequence as a query to search for similar sequences of the OsPRP1 family. DNA and amino acid sequence alignments, hydrophobicity profiles, and secondary structures of proteins were analysed by use of DNAMAN5.2.2 (Lynnon Biosoft Corp.), with which a homology tree was developed, with a distance matrix following the UPGMA method (Sneath and Sokal, 1973). The putative cis-elements in each promoter were predicted by use of the PLACE database (Higo et al., 1999; Prestridge, 1991) to estimate the divergence degree in promoter Organization of the OsPRP1 families A previous study revealed that OsPRP1 is present in the rice genome as a small gene family of four members (Wu et al., 2003). Interestingly, the members are constrained within a 20 kbp range, and arranged tandemly, without any interval of other open-reading frames (ORFs). They have an identical structure of two exons interrupted by one intron (Fig. 1A), with relatively identical sizes of exons (154 kbp for exon 1; 536 kbp for exon 2, except 521 kbp for Fig. 3. The cis-element fingerprints in each promoter of the paralogues. Cis-elements were predicted by use of the PLACE database to estimate the variation of the promoters during evolution. Their relevant functions and species origin are listed. Ruler marks the location upstream of transcription initiation sites (+1). Listed is the frequency (a) of each type of element per 1.5 kbp random sequence and number of each type of matched elements within the promoters of OsPRP1.1 (b), OsPRP1.2 (c), OsPRP1.3 (d), and OsPRP1.4 (e). Ar, Agrobacterium; As, Avena sativa; At, Arabidopsis thaliana; Bn, Brassica napus; Dc, Daucus carota; Gm, Glycine max; Hv, Hordeum vulgare; Le, Lycopersicon esculentum; Nt, Nicotiana tabacum; Os, Oryza sativa; Pc, Petroselinum crispum; Ph, Petunia hybrida; Ps, Pisum sativum; Pv, Phaseolus vulgaris; So, Spinacia oleracea; Vf, Vicia faba; Zm, Zea mays. 2892 Wang et al. Table 3. A summary of the expression characteristics of the four paralogues detected by GUS staining of transgenic rice lines ‘+’ and ‘ÿ’ indicate positive and negative staining results, respectively. The more ‘+’, the relatively denser the colour blue is. Young leaf Mature leaf Stamen/anther Pistil Glume Lodicule Stem/shoot Node/internode Root Germination Seed maturation OsPRP1.1 OsPRP1.2 OsPRP1.3 OsPRP1.4 ÿ ÿ ÿ ÿ ÿ + + + ÿ ÿ ÿ +++ +++ ++ + +++ ÿ +++ +++ + +++ + + ÿ + ÿ ÿ + ++ ++ ÿ + ÿ + + + ÿ + ÿ ++ ++ ÿ + + OsPRP1.1). The intron of each is spliced at a canonical GT-AG site and the splicing site occurs in the same amino acid position (K51-N52) of the four deduced polypeptides (Fig. 1B). But the introns showed great variation in similarity and size (114, 148, 353, and 114 kbp for OsPRP1.1, OsPRP1.2, OsPRP1.3, and OsPRP1.4, respectively). All but OsPRP1.1 share the same transcriptional direction (Fig. 1A). All these data indicate that this family originated by gene duplication. Pairwise analysis of synonymous substitutions per synonymous site (KS), which has been accepted as the best proxy of evolution time (Gu et al., 2002), revealed that OsPRP1.2 is distant from the other three genes in evolution (Fig. 1E). Furthermore, the promoter sequences (1.5 kbp upstream of transcription initiation site), 59 UTR, intron, 39 UTR, and coding region of each paralogue was extracted manually. Pairwise comparison revealed high identity (>88.2%) between coding sequences (Fig. 1D), which is consistent with high identity values (>90%) between amino acid sequences (Fig. 1B). By contrast, promoters, introns, or 39 UTRs showed less sequence similarity (Fig. 1D), which suggests that these sequences diverged more than did the coding regions after gene duplication. Further analysis revealed that proximal to the OsPRP1 family is one retroviral gag-pol-like gene, one potential transposable element similar to Wanderer_110 type miniature inverted-repeat transposable element (MITE) (Turcotte et al., 2001), and several simple repeats, including (ATGC)n, G340, and (AT)n (Fig. 2A). Simple repeats and transposable elements can help with unequal crossing over, which results in DNA rearrangement or duplication (Walker et al., 1995; Parniske et al., 1997). Accordingly, illegal crossing over and inversion may be involved in OsPRP1 members being produced from the same progenitor. A novel class of small PRPs The four deduced polypeptides with their molecular mass of 24 kDa shared high amino acid identity (>90%) and had an identical primary structure, including one hydrophobic head lacking proline at the N terminus (;40 amino acids), one transition domain in the middle region and one hydrophilic tail rich in proline at the C terminus (;70 amino acids) (Fig. 1B). In the N terminus, a signal peptide of 22 amino acids was a probable target of signal peptidase (Fig. 1B). Within the C terminus, a length of ;100 amino acids contains some proline-rich motifs, including five PKPE, three P(V/E)PPK, and two 18-amino acid repeats of DHFHKKPVPPKPEPKPEP (except in OsPRP1.2, where the first E in the first repeat and the sixth P in second are replaced by D and S, respectively) (Fig. 1B). These prolinerich motifs are distinct from those identified from the known four classes of plant PRPs (Table 2). In addition, it was found that all EST (top100 hits) are from rice. If the number of alignments is set as 1000, all the EST (1168 hits) are from poaceous species, of which 49% are from rice and the rest is from other cereal plants. It seemed that the OsPRP1 family was restricted to rice, at least to cereal species, so far. Moreover, a BLASTP search in nrNCBI (non-redundant polypeptide sequence database) indicated only 15 matches (E-value <7e-05) with partial similarity. These hits included 10 unknown proteins annotated from the rice genome and five known PRPs from other genomes. Homology analysis revealed that these sequences can be sorted into different groups (Fig. 2). The four OsPRP1 members, along with a maize PRP (accession no. CAB65536), were grouped into a clade, which suggests that the maize PRP is a maize homologue of OsPRP1. But the maize PRP has a much higher molecular mass (54.5 kDa) and more complicated repetitive proline-rich motifs both in type and number (e.g. 7 times PEPK, 8 times KPEP, and 5 times TPIYHPP) (Table 2) than OsPRP1 proteins. The four paralogues display non-uniform tissue/organ specificity 1.5 kbp promoter regions were analysed upstream of the putative transcriptional initiation sites predicted by a comparison with cDNA sequences (Wu et al., 2003). These promoters showed different cis-element fingerprints (Fig. 3) with 51 predicted elements for OsPRP1.1, 61 for OsPRP1.2, 50 for OsPRP1.3, and 62 for OsPRP1.4. All these elements were matched to 31 consensus sequences, of which seven were identified from rice and the other 24 from other plant species. This finding suggests potential non-uniformity of their expression, although some, possibly most, of these elements may be false positive or functionless. Because the four paralogues show high similarity in their cDNA sequences, it is almost impossible to distinguish their expression sites in different tissues by the use of RNAbased strategies such as in situ hybridization. Therefore, four fusions of OsPRP1.n promoter::GUS (also see Materials and methods) were constructed to examine their Spatial expression patterns in rice 2893 Fig. 4. GUS staining of transgenic rice lines carrying each promoter::GUS construct. The expression specificity of OsPRP1.1 (A, B), OsPRP1.2 (C–X), OsPRP1.3 (Y–AC), and OsPRP1.4 (AD–AJ) in leaves (C, D, E, Y, AD), shoots (F, X, AJ), nodes (G, H, I, J, Z), stems (G, I, J, Z), roots (K–N), and flowers (O, P, Q, R, S, AA, AB, AE, AF, AG). (M), (N), (Q), (R), and (S) were observed under an optical microscope and the others under a stereomicroscope. Preparation of transverse sections of roots involved resin Spurr-embedded tissues (N). The materials in (M), (J), and (Z) were sectioned manually. The glumes were removed before staining (P, AB, AG). The mature pollen grains were not stained (Q). (V), (W), (X), (AC), (AI), and (AJ) show the germinating seeds of T1 generation. (T), (U), and (AH) indicate the seed maturing process of T2 generation. An, anther; AR, adventitious root; Co, coleoptile; Cx, cortex; Ep, epidermis; Ex, exodermis; EZ, elongation zone; Fi, filament; Le, lemma; Lo, lodicule; LR, lateral root; LS, leaf sheath; MZ, meristematic zone; Ov, ovule; Pa, palea; Pe, peduncle; PG, pollen grain; PR, primary root; Sc, scutellum; Se, seed; Sh, shoot; VC, vascular cylinder. expression patterns by GUS histochemical staining. The organs examined included shoots/stems, young leaves, nodes/internodes and roots from seedlings, and flowers and mature leaves from flowering plants. All the four paralogues were expressed in young vegetable organs, including shoots and nodes/internodes (Table 3). In detail, OsPRP1.2 was expressed in almost all the organs examined (Fig. 4C–X), with OsPRP1.1 in lodicules (Fig. 4B); OsPRP1.3 in young leaves (Fig. 4Y), stamens (Fig. 4A, B) and the interconnecting sites of stems and new adventitious 2894 Wang et al. roots (Fig. 4Z); and OsPRP1.4 in mature leaves (not shown), young leaf sheaths (Fig. 4A, D), glumes (Fig. 4A, F), expanding lodicules (Fig. 4A, G), and stamens (Fig.4AG). None of these paralogues was expressed in mature pollen grains and mature embryos (data not shown). Only OsPRP1.2 was expressed in roots (Fig. 4K–N). Members of OsPRP1 also showed differential expression profiles during seed maturation and germination. Within 5 d after pollination, no GUS signal was detected for all the four paralogues. As this phase proceeded, OsPRP1.2 (Fig. 4U) and OsPRP1.4 (Fig. 4A, H) were expressed in the upper part of young seeds. In mature seeds, no GUS signal was detected. OsPRP1.1 and OsPRP1.3 were not expressed in developing seeds. In germinating seeds, OsPRP1.2 (Fig. 4V–X), OsPRP1.3 (Fig. 4AC), and OsPRP1.4 (Fig. 4AI, AJ) were expressed in scutella and the first true leaves. Both OsPRP1.2 (Fig. 4V–X) and OsPRP1.4 (Fig. 4AI, AJ) were expressed in coleoptiles. In addition, only OsPRP1.2 was expressed in primary (Fig. 4K, W), adventitious (Fig. 4L, X) and lateral roots (Fig. 4L, X) and only OsPRP1.4 in the peduncles of inflorescences and solitary flowers (Fig. 4AE). Of the four paralogues, OsPRP1.1 showed expression at the lowest levels while OsPRP1.2 showed expression at the highest levels on the basis of relative intensity of GUS staining. Finally, the transverse sections of GUS-stained Fig. 5. The relative proportion (%) of GUS enzyme activity induced by different treatments compared with their respective controls. Fluorescent units of three aliquots of each sample were read. Error bars show the standard deviations (SDs) of three replicates. roots (Fig. 4N) revealed much denser GUS signals in the vascular tissues than in other parts. This similar situation was also observed in leaves/leaf sheaths (Fig. 4D, H, I), styles (Fig. 4S), and anther filaments (Fig. 4S), which suggests that OsPRP1 was expressed predominantly in vascular tissues. The four paralogues respond to additional stimuli differently The predicted cis-elements from each promoter contain those potentially responsible for responding to plant growth substances and environmental stresses, such as PBF-core, DRE2-core, MART-box, MYB/C, and PYRIMIDINEbox for ABA; ASF1, and CATATG for IAA; GARE, PYRIMIDINE-box, and WRKY71 for GA; ASF1, ELRE, GT1-consensus, and W-box for SA; ARR1 for KT; GCCcore for ET; CDTDRE and LTRE for cold stress; GT1GMSCAM4 for salt stress; and MYB/C and MYBcore for water stress (Fig. 3). Different cis-element fingerprints in the four promoters (Fig. 3) implied their diverse abilities to respond to these additional stimuli. The responses of the four paralogues to nine different additional stimuli were examined here by the use of the fluorogenic assay of GUS. The four paralogues had differential responses to these stimuli (Fig. 5; Table 4): (i) OsPRP1.1 was up-regulated by SA, and coldness, while down-regulated by NaCl and Sor. (ii) SA and IAA promoted the expression of OsPRP1.2, while GA3, ABA, NaCl, Sor, and coldness depressed its expression; (iii) KT, NaCl, and Sor up-regulated OsPRP1.3 whereas GA3, ABA, SA, ET, and coldness down-regulated it. (iv) OsPRP1.4 was down-regulated by IAA, GA3, SA, ET, NaCl, and coldness, and no stimulus up-regulated this gene (Fig. 5; Table 4). Obviously, a given factor had different or even reverse effects on the expression of the paralogues; for example, SA promoted the expression of OsPRP1.1 and OsPRP1.2 but repressed that of OsPRP1.3 and OsPRP1.4. Discussion PRPs have been identified in several plants including the cereals wheat and maize (Hong et al., 1989; Raines et al., Table 4. Generalization of the effects of the different plant growth substances and environmental stresses on expression of each paralogue The ‘+’ means up-regulation while ‘ÿ’ indicates down-regulation (P <0.05). The symbol ‘n’ represents that there is no prominent difference (P >0.05) between a given treatment and its control. OsPRP1.1 OsPRP1.2 OsPRP1.3 OsPRP1.4 IAA GA ABA KT SA ET NaCl Sor Cold n + n ÿ n ÿ ÿ n n ÿ ÿ ÿ n n + n + + ÿ ÿ n n ÿ ÿ ÿ ÿ + ÿ ÿ ÿ + n + ÿ ÿ ÿ Spatial expression patterns in rice 1991; Sheng et al., 1991; Fowler et al., 1999; Vignols et al., 1999) and proposed to be implicated in the integrity of the cell wall, the structural maintenance of organs (Cheung et al., 1993), and defence reaction to pathogen infection (Brisson et al., 1994). Thus, they are generally accepted as a type of cell wall proteins (Showalter, 1993). Several data also suggest that these proteins have other functions (Cheung et al., 1993; Fowler et al., 1999). But their determinate functions are ambiguous so far. OsPRP1 reported here is a tandemly duplicated family of four members encoding PRPs in rice. It is believed that this is the first report of a PRP gene family in monocots. Our previous comparisons of primary structures between OsPRP1 and other known PRPs indicated that OsPRP1 represents a novel class of PRPs (Wu et al., 2003). The joint expression sites of the four paralogues covered almost all the examined tissues, with preferential expression in the vascular cylinder, and suggests the importance of the family in regulating plant development. Furthermore, their ability to respond to multiple internal and/or external stimuli indicates that this family may play roles in regulating the adaptation of plants to environmental conditions, such as defence reactions to adverse environmental conditions or/and pathogen infection elicited by SA, which up-regulated OsPRP1.1 and OsPRP1.2 by 3fold and 6-fold, respectively. Obviously, OsPRP1.2 is a main member of this family and may perform most of the physiological functions of the family. The other three can complement OsPRP1.2 in some expression sites and responses to some internal and/or external stimuli. For instance, OsPRP1.3 may function in responses to salt or osmotic stress, and OsPRP1.1 may protect plants from chilling injury on exposure to lower temperatures. The tandem arrangement, identical structures, and high conservation in both coding region and polypeptide sequences suggest that the four paralogues originated by tandem duplication from one common ancestor. Gene duplication and translocation events could be responsible for the birth of this gene family, because tandem duplication may be introduced by unequal crossing over (paralogue recombination) or non-homologous replication error (Hurles, 2004), and the gene inversion may be the result of some translocation-like events. The presence of simple repeats and MITE sequences, which have been considered potential illegal recombination sites (Fedoroff, 2000; Mao et al., 2000; Turcotte et al., 2001), in the flanking regions of the family provides support for this suggestion. A pairwise comparison indicated that KS values between OsPRP1.2 and any one of the other three was much higher than those between the other OsPRP1s, which suggests that OsPRP1.2 might be ‘older’ than the other three genes. In contrast to highly conserved coding sequences, the predicted promoter regions, as well as other possible regulatory sequences such as introns, showed less similarity. Further cis-element analysis revealed divergence in 2895 the predicted cis-elements in the four promoter regions. This observation suggests that these promoters accumulated mutations after the occurrence of the family, perhaps because the regulatory sequences were subjected to lower selection pressure than the coding regions (Haberer et al., 2004). Compared with the more ubiquitously expressed OsPRP1.2, the other three paralogues appeared to be expressed in limited tissues and tended towards specialization in functions. Seemingly, partial degradation in functions occurred during the evolutionary processes of OsPRP1.1, OsPRP1.3, and OsPRP1.4, whereas OsPRP1.2 preserved the most original functions of their progenitor. These data support, in part, the DDC model, which proposes that duplicated genes were retained by the complementary loss of separate subfunctions of their multifunctional ancestor. The responses of the four paralogues to the nine stimuli showed obvious degenerate mutations, with OsPRP1.1 responding only to four, OsPRP1.2 to seven, OsRPR1.3 to eight, and OsPRP1.4 to six stimuli, which seems to be compatible with the DDC model. However, besides the functional degeneration of response to GA, ABA, KT, and ET, the four members showed reverse response (up- versus down-regulation) to the remaining factors. Because of the complexity of transcriptional regulation, it is difficult to explain the meaning of the phenomena in evolutionary terms. Further experimental identification of functional cis-elements involved in the responses is necessary. Proper functional redundancy in duplicates is necessary for an organism’s fitness by the increase in dosage of gene products (Gu et al., 2003). During the evolution of the duplicated OsPRP1, besides expression divergence, functional overlap between the four paralogues was seen from their tissue/organ specificity and/or responses to different stimuli. The redundancy in expression sites mainly occurred in fast-growing young tissues, such as shoots, internodes, and young leaves (Table 3). This fact, together with the expression preferential in vascular cylinders detected by GUS staining (Fig. 4) and by in situ hybridization (Wu et al., 2003), indicates that their functional redundancy may be essential for the normal development of plants. Acknowledgement The study is supported by NSFC (Grant numbers 30370138 and 30270144). References Adams KL, Wendel JF. 2005. Polyploidy and genome evolution in plants. Current Opinion in Plant Biology 8, 135–141. Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ. 1997. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Research 25, 3389–3402. 2896 Wang et al. An G. 1987. Binary Ti vectors for plant transformation and promoter analysis. Methods in Enzymology 153, 292–305. Blanc G, Wolfe KH. 2004. Widespread paleopolyploidy in model plant species inferred from age distributions of duplicate genes. The Plant Cell 16, 1667–1678. Bowers JE, Chapman BA, Rong JK, Paterson AH. 2003. Unraveling angiosperm genome evolution by phylogenetic analysis of chromosomal duplication events. Nature 422, 433–438. Bradford MM. 1976. A rapid and sensitive assay for the quantitation of microgram quantitites of protein utilizing the principle of protein–dye binding. Analytical Biochemistry 72, 248–254. Brisson LF, Tenhaken R, Lamb C. 1994. Function of oxidative cross-linking of cell wall structural proteins in plant disease resistance. The Plant Cell 6, 1703–1712. Cheung AY, May B, Kawata EE, Gu Q, Wu HM. 1993. Characterization of cDNAs for stylar transmitting tissue-specific proline-rich proteins in tobacco. The Plant Journal 3, 151–160. Conant GC, Wagner A. 2003. Asymmetric sequence divergence of duplicate genes. Genome Research 13, 2052–2058. Fedoroff N. 2000. Transposons and genome evolution in plants. Proceedings of the National Academy of Sciences, USA 97, 7002–7007. Ferris SD, Whitt GS. 1979. Evolution of differential regulation of duplicate genes after polyploidization. Journal of Molecular Evolution 12, 267–317. Force A, Lynch M, Pickett FB, Amores A, Yan Y-L, Postlethwait J. 1999. Preservation of duplicate genes by complementary, degenerative mutations. Genetics 151, 1531–1545. Fowler TJ, Bernhardt CB, Tierney ML. 1999. Characterization and expression of four proline-rich cell wall protein genes in Arabidopsis coding two distinct subsets of multiple domain proteins. Plant Physiology 121, 1081–1091. Gilbert W, de Souza SJ, Long M. 1997. Origin of genes. Proceedings of the National Academy of Sciences USA 94, 7698–7703. Gu X, Zhang Z, Huang W. 2005. Rapid evolution of expression and regulatory divergences after yeast gene duplication. Proceedings of the National Academy of Sciences, USA 102, 707–712. Gu Z, Cavalcanti A, Chen FC, Bouman P, Li WH. 2002. Extent of gene duplication in the genomes of Drosophila, nematode, and yeast. Molecular Biology and Evolution 19, 256–262. Gu Z, Steinmetz LM, Gu X, Scharfe C, Davis RW, Li WH. 2003. Role of duplicate genes in genetic robustness against null mutations. Nature 421, 63–66. Guyot R, Keller B. 2004. Ancestral genome duplication in rice. Genome 47, 610–614. Györgyey J, Németh K, Magyar Z, Kelemen Z, Alliotte T, Inzé D, Dudits D. 1997. Expression of a novel-type small proline-rich protein gene of alfalfa is induced by 2,4-dichlorophenoxyacetic acid in dedifferentiated callus cells. Plant Molecular Biology 34, 593–602. Haberer G, Hindemitt T, Meyers BC, Mayer KFX. 2004. Transcriptional similarities, dissimilarities, and conservation of cis-elements in duplicated genes of Arabidopsis. Plant Physiology 136, 3009–3022. Hiei Y, Ohta S, Komari T, Kumashiro T. 1994. Efficient transformation of rice (Oryza sativa L.) mediated by Agrobacterium and sequence analysis of the boundaries of the T-DNA. The Plant Journal 6, 271–282. Higo K, Ugawa Y, Iwamoto M, Korenaga T. 1999. Plant cis-acting regulatory DNA elements (PLACE) database. Nucleic Acids Research 27, 297–300. Hong JC, Nagao RT, Key JL. 1989. Developmentally regulated expression of soybean proline-rich protein genes. The Plant Cell 1, 937–943. Hughes AL, Friedman R. 2005. Expression patterns of duplicate genes in the developing root of Arabidopsis thaliana. Journal of Molecular Evolution 60, 247–256. Hurles M. 2004. Gene duplication: the genomic trade in spare parts. PLoS Biology 2, e206. Irish VF, Litt A. 2005. Flower development and evolution: gene duplication, diversification and redeployment. Current Opinion in Genetics and Development 15, 1–7. Jefferson RA. 1987. Assaying chimeric genes in plants: the GUS gene fusion system. Plant Molecular Biology Reporter 5, 387–405. Jefferson RA, Kavanagh TA, Bevan MW. 1987. GUS fusions: bglucuronidase as a sensitive and versatile gene fusion marker in higher plants. EMBO Journal 6, 3901–3907. Josè-Estanyol M, Ruiz-Avila L, Puigdomènech P. 1992. A maize embryo-specific gene encodes a proline-rich and hydrophobic protein. The Plant Cell 4, 413–423. Kofuji R, Sumikawa N, Yamasaki M, Kondo K, Ueda K, Ito M, Hasebe M. 2003. Evolution and divergence of the MADS-box gene family based on genome-wide expression analyses. Molecular Biology and Evolution 20, 1963–1977. Letunic I, Copley RR, Schmidt S, Ciccarelli FD, Doerks T, Schultz J, Ponting CP, Bork P. 2004. SMART 4.0: towards genomic data integration. Nucleic Acids Research 32, 142–144. Li WH. 1997. Molecular evolution. Sunderland, MA: Sinauer Associates. Lynch M, Force A. 2000. The probability of duplicate gene preservation by subfunctionalization. Genetics 154, 459–473. Majewski J, Ott J. 2002. Distribution and characterization of regulatory elements in the human genome. Genome Research 12, 1827–1836. Makova KD, Li WH. 2003. Divergence in the spatial pattern of gene expression between human duplicate genes. Genome Research 13, 1638–1645. Mao L, Wood TC, Yu Y, Budiman MA, Woo SS, Sasinowski M, Goff S, Dean RA, Wing RA. 2000. Rice transposable elements: a survey of 73 000 sequence-tagged-connectors. Genome Research 10, 982–990. Moore RC, Purugganan MD. 2005. The evolutionary dynamics of plant duplicate genes, Current Opinion in Plant Biology 8, 122–128. Ohno S. 1970. Evolution by gene duplication. New York, NY: Springer-Verlag. Parenicová L, de Folter S, Kieffer M, et al. 2003. Molecular and phylogenetic analyses of the complete MADS-box transcription factor family in Arabidopsis: new openings to the MADS world. The Plant Cell 15, 1538–1551. Parniske M, Hammond-Kosack KE, Golstein C, Thomas CM, Jones DA, Harrison K, Wulff BBH, Jones JDG. 1997. Novel disease resistance specificities result from sequence exchange between tandemly repeated genes at the Cf-4/9 locus of tomato. Cell 91, 821–832. Prestridge DS. 1991. SIGNAL SCAN: a computer program that scans DNA sequences for eukaryotic transcriptional elements. Computer Applications in the Biosciences 7, 203–206. Raines CA, Lloyd JC, Chao SM, John UP, Murphy GJ. 1991. A novel proline-rich protein from wheat. Plant Molecular Biology 16, 663–670. Rose AB. 2004. The effect of intron location on intron-mediated enhancement of gene expression in Arabidopsis. The Plant Journal 40, 744–751. Schultz J, Milpetz F, Bork P, Ponting CP. 1998. SMART, a simple modular architecture research tool: identification of signaling domains. Proceedings of the National Academy of Sciences, USA 95, 5857–5864. Spatial expression patterns in rice Seoighe C, Gehring C, Hurst LD. 2005. Gametophytic selection in Arabidopsis thaliana supports the selective model of intron length reduction. PLoS Genetics 1, e13. Sheng J, D’Ovidio R, Mehdy MC. 1991. Negative and positive regulation of a novel proline-rich protein mRNA by fungal elicitor and wounding. The Plant Journal 1, 345–354. Showalter AM. 1993. Structure and function of plant cell wall proteins. The Plant Cell 5, 9–23. Sneath PH, Sokal RR. 1973. Numerical taxonomy. San Francisco: Freeman. Toki S. 1997. Rapid and efficient Agrobacterium-mediated transformation in rice. Plant Molecular Biology Reporter 15, 16–21. Turcotte K, Srinivasan S, Bureau T. 2001. Survey of transposable elements from rice genomic sequences. The Plant Journal 25, 169–179. 2897 Vignols F, José-Estanyol M, Caparrós-Ruiz D. 1999. Involvement of a maize proline-rich protein in secondary cell wall formation as deduced from its specific mRNA localization. Plant Molecular Biology 39, 945–952. Walker EL, Robbins TP, Bureau TE, Kermicle J, Dellaporta SL. 1995. Transposon-mediated chromosomal rearrangements and gene duplications in the formation of the maize R-r complex. EMBO Journal 14, 2350–2363. Wu XH, Mao AJ, Wang R, Wang T, Song Y, Tong Z. 2003. Cloning and characterization of OsPRP1 involved in anther development in rice. Chinese Science Bulletin 48, 2458–2465. Yu J, Wang J, Lin W, et al. 2005. The genomes of Oryza sativa: a history of duplications. PLoS Biology 3, e38. Zhang J. 2003. Evolution by gene duplication: an update. Trends in Ecology & Evolution 18, 292–298.
© Copyright 2026 Paperzz