JOURNAL OF GENETICS AND GENOMICS J. Genet. Genomics 37 (2010) 219−230 www.jgenetgenomics.org A comparative genomic analysis of plant hormone related genes in different species Zhiqiang Jiang, Hongwei Guo* Laboratory of Protein Engineering and Plant Genetic Engineering, College of Life Sciences, Peking University, Beijing 100871, China Received for publication 26 October 2009; revised 5 March 2010; accepted 6 March 2010 Abstract Plant hormones are small molecules that play important roles throughout the life span of a plant, known as auxin, gibberellin, cytokinin, abscisic acid, ethylene, jasmonic acid, salicylic acid, and brassinosteroid. Genetic and molecular studies in the model organism Arabidopsis thaliana have revealed the individual pathways of various plant hormone responses. In this study, we selected 479 genes that were convincingly associated with various hormone actions based on genetic evidence. By using these 479 genes as queries, a genome-wide search for their orthologues in several species (microorganisms, plants and animals) was performed. Meanwhile, a comparative analysis was conducted to evaluate their evolutionary relationship. Our analysis revealed that the metabolisms and functions of plant hormones are generally more sophisticated and diversified in higher plant species. In particular, we found that several phytohormone receptors and key signaling components were not present in lower plants or animals. Meanwhile, as the genome complexity increases, the orthologue genes tend to have more copies and probably gain more diverse functions. Our study attempts to introduce the classification and phylogenic analysis of phytohormone related genes, from metabolism enzymes to receptors and signaling components, in different species. Keywords: comparative genomics; Arabidopsis hormone related gene; plant hormone; orthologue Introduction Hormones are small organic molecules, synthesized by living organisms, and function in small doze in almost every aspect of growth and developmental processes. During the last century, eight types of phytohormones had been discovered using physiological, biochemical and geAbbreviations: IAA, auxin; GA, gibberellin; CK, cytokinin; ABA, abscisic acid; ET, ethylene; JA, jasmonic acid; SA, salicylic acid; BR, brassinosteriod; UPGMA, unweighted pair group method with arithmetic mean; MCO, most conserved orthologues; AHRG, Arabidopsis hormone related genes. * Corresponding author. Tel: +86-10-6276 7823; Fax: +86-10-6275 1526. E-mail address: [email protected] DOI: 10.1016/S1673-8527(09)60040-0 netic approaches. They are auxin, gibberellin, cytokinin, abscisic acid, ethylene, jasmonic acid, salicylic acid, and brassinosteroid. In the past decades, forward and reverse genetic methods had been used to identify molecular components functioning in each hormone’s action (Alonso and Ecker, 2006). Usually a simple and relatively specific phenotype was employed as readout to uncover genes involved in the actions of a given phytohormone (metabolism, transport or perception). By screening for mutants with enhanced or reduced response to the given hormone compared to wild type plants, a large number of mutants that either positively or negatively modulate specific hormone response pathways had been isolated. Subsequent molecular genetics and biochemical studies revealed more 220 Zhiqiang Jiang et al. / Journal of Genetics and Genomics 37 (2010) 219−230 details on the identity and the function of the corresponding genes, and eventually a genetic pathway was thus established based on the knowledge of these molecular components. The illustration of these hormone response pathways had remarkably improved our understanding of how plant hormones are synthesized, metabolized, transported, and distributed, as well as how plants sense and respond to different types and levels of phytohormones in myriad biological processes. As a large amount of genome sequencing projects had been completed or are underway, we are able to obtain massive genomic information from a wide variety of species with genome sequence available. This information enables us to possibly carry out a comparative study of genes related to a specific pathway or process, and eventually to uncover the common features or divergent mechanisms of such pathway or process in an evolutionary scale. By looking for orthologues in simple organisms, it is possible to find out when and probably how a relatively complex pathway discovered in higher reference organisms (usually angiosperms) was evolved. For example, by analyzing the genome sequence of the moss Physcomitrella patens, a relative comprehensive ABA signaling pathway had been found, implying that ABA was used extensively in this lower plant species, probably to protect moss (an early land plant) from dehydration and water stress (Marella et al., 2006). In addition, a functional GA perception and signaling pathway was found to present in Selaginella kraussiana (a lycophyte), but not in Physcomitrella patens, illuminating the earlier events on the establishment of GA signaling (Yasumura et al., 2007). On the other hand, based on a recent phylogenetic study of the action of phytohormone auxin in algae, it was found that an auxin-dependent mechanism was originated as early as in algal lineages, but some major signaling components began to emerge later than microalgae (Lau et al., 2009). It seems that each type of hormone exhibited a quite distinctive origin and adaptation during the evolutionary course of land plants to suit plants for survival and fitness in various ecological environments. To obtain a systematic and comprehensive view of genes participating in plant hormonal regulation, as well as morphological phenotypes controlled by plant hormones, we have previously developed an Arabidopsis Hormone Database (AHD) (http://ahd.cbi.pku.edu.cn) (Peng et al., 2009). In the database, we have collected 1,043 genes related to Arabidopsis hormone response pathways, includ- ing hormone biosynthesis and metabolism, hormone transports, hormone perception and signal transduction (Peng et al., 2009). In this study, we selected 479 genes that were convincingly associated with various hormone actions based on genetic evidence. Of them 423 genes were supported by mutant analysis while 56 genes were derived from transgenic studies (Supplemental Table 1). By using these 479 genes as queries, a genome-wide search for their orthologues in several species was performed. Meanwhile, a comparative analysis was conducted to evaluate their evolutionary relationship. Our analysis revealed that the metabolisms and functions of plant hormones are generally more sophisticated and diversified in higher plant species, as there are more phytohormone related genes in higher plants than in lower plants or animals. In particular, we found that several phytohormone perception systems were not present in lower plants or animals. Meanwhile, as the genome complexity increases, the orthologue genes tend to have more copies and probably more diverse functions. Material and methods Searching for phytohormone related gene orthologues in different species Phytohormone related genes were derived from Arabidopsis Hormone Database, and the whole genome orthologue clusters were downloaded from INPARANOID 7.0 current version (http://inparanoid.sbc.su.se/download/ 7.0_current/). For a meaningful comparison, whole genome orthologue cluster for some animals and A. thaliana were also downloaded. Using phytohormone related genes as queries, orthologue genes were derived from the dataset of each species. The algorithm was as described in the website, and a cut-off pairwise similarity score of 50 (the BLAST program report asymmetric scores between sequence pairs) was set and the matching segment must cover at least 50% of its total protein length. Most analyses were based on deriving common orthologue gene set between each two species. The group of orthologue genes identified in all selected species was named MCO (Most Conserved Orthologues). And comparison of gene orthologues numbers revealed gene duplication events in different species. By calculating orthologue number variance between two species, we were Zhiqiang Jiang et al. / Journal of Genetics and Genomics 37 (2010) 219−230 able to elucidate tandem duplication events. Phylogenetic analysis In order to compare an evolutionary relationship of hormone related genes in different species, we compared component similarity in hormone action, with analogy to comparison of amino acid similarity in protein sequence, and constructed phylogenetic trees using PHYLIP software. The scoring matrix was substituted from sequence similarity to orthologue gene similarity between each two species. Neighbour-Joining method was chosen for generating orthologue phylogenetic tree. In order to determine relationships between orthologue phylogenetic tree and sequence phylogenetic tree, we generated two protein sequence phylogenetic trees as well. The genes selected for generating protein phylogenetic tree were randomly chosen from MCO. One was AT3G28860, encoding an ATP binding cassette gene and the other one was AT3G57530, encoding a calcium dependent protein kinase family gene. Orthologues were also downloaded from INPARANOID webset. Protein sequence phylogenetic trees were generated using MEGA version 4.0 and UPGMA method with a bootstrap of 10,000 replicates. Orthologue comparison in different species We classified orthologue clusters according to their functions (metabolism related, transport, signaling related) in A. thaliana. A histogram was constructed to visualize their function distribution in different species. The percentage of orthologue genes in AHRG (Arabidopsis Hormone Related Genes) of all the selected species was shown in histogram. Heat map was used to present the number of receptor orthologue genes and the map was generated using Gene Cluster 3.0. The data was clustered by complete hierarchical method and an uncentered Pearson correlation algorithm was used. The result was visualized as heat map generated by TreeView-1.1.3. Result Phylogenetic comparison by gene similarity and protein sequence similarity In order to compare phylogeny difference of hormone 221 related genes and find out if the evolution of hormone related gene is correlated with species evolution, we chose a wide array of species, covering from microorganisms to lower and higher plants, as well as a number of animal species. We selected six types of plants whose whole genome had been sequenced: A. thaliana, O. sativa, S. bicolor, P. trichocarpa, P. patens, C. reinhardtii. Several microorganisms were selected including two plant fungal pathogens: M. grisea, R. oryzae, a yeast S. pombe and an archeabacterium T. thermophila. To make a comprehensive comparison, we also chose seven animal species: C. elegans, C. japonica, X. tropicalis, D. melanogaster, B. taurus, M. domestica, and H. sapiens. Because the method in identifying orthologues has not been standardized, we decided to take the most commonly used method supported by INPARANOID program (O’Brien et al., 2005). In the INPARANOID algorithm, BLASTN program is used for all-versus-all pairwise alignment between two species. A minimal similarity score of 50 bit and matching segments exceeding 50% of the total length were set as our cutoff between and within species all-versus-all BLAST search. In general, this program is commonly used in orthologue identification due to its efficacy and measures to avoid false-positive results. Thus, we used the previously defined 479 AHRGs as queries, collected orthologue information from INPARANOID orthologues searching result in different species respectively. In constructing phylogenetic trees, similarity percentage of amino acids in protein sequences was used to calculate distances. As an attempt, similarity percentage of orthologue genes was simply defined as the percentage of orthologue genes in non-Arabidopsis species versus those in Arabidopsis. And thus the similarity could be used to represent the relative distance between the non-Arabidopsis species and Arabidopsis. Calculation of the similarity percentage of the gene list between each two species (Supplemental Table 2) allowed us to construct a matrix similar to constructing sequence phylogenetic trees. Using this matrix we generated a phylogenetic Neighbour-Joining tree with all orthologue genes (Fig. 1A). For comparison, a phylogenetic tree of above-mentioned species was provided (Fig. 1B). In order to compare the accuracy of the gene phylogenetic tree we constructed, protein sequence phylogenetic tree of the gene AT3G28860 and AT3G57530 orthologues was also generated (Fig. 1, C and D). These two genes were randomly chosen from a set of orthologue genes identified in all selected species (MCOs), and we 222 Zhiqiang Jiang et al. / Journal of Genetics and Genomics 37 (2010) 219−230 found that both of the two genes are commonly identified in various biological processes and in various species. Generally these trees were of the same style that plant, microoganism and animal genes are clustered separately. However, there were several differences between the orthologue phylogenetic tree and the other three trees. First, different from species phylogenetic tree, plant genes were not clustered into one clade. It can be explained that gene variance of different plants is greater than the sequence variance. Different from plant orthologue clusters, animal orthologue clusters showed little variation. This could be due to that AHRG orthologues in animals are limited, which curtails the phylogenetic distance among animals in orthologue gene phylogenetic tree. It was a little surprising to observe that A. thaliana is located between P. patens and C. reinhardtii, but not closer to higher plant in orthologue gene phylogenetic tree. One possible explanation is that all orthologues were obtained from similarity comparison with the genes in A. thaliana, so that the orthologue clusters would have higher similarity compared with A. thaliana, which breaks the “Triangle Rule” topologically. Just as mentioned above, the distance of the orthologue phylogenetic tree is a relative distance as compared to Arabidopsis orthologue genes, which is a disadvantage and cause the problem. The sequence phylogenetic trees, on the other hand, seemed to fit well with Fig. 1. Phylogenetic tree generated by orthologue similarity and protein sequence similarity. A: NJ-phylogenetic tree generated by orthologue genes, using orthologue gene similarity as compared to A. thaliana hormone related genes to show the relative distance to A. thaliana. The number in brackets shows the quantity of AHRGs in the particular organism. B: species systematic phylogenetic tree provided by INPARANOID. C and D: protein sequence phylogenetic trees of two random selected genes AT3G28860 and AT3G57530, respectively. Zhiqiang Jiang et al. / Journal of Genetics and Genomics 37 (2010) 219−230 the systematic phylogenetic tree of species. All of the three trees showed several common patterns: first, P. trichocarpa and A. thaliana were clustered into one subclade. Second, those animal species were clustered into the same clade. Third, the microorganism clade was the most distant 223 to other clades. By selecting the common set of orthologues identified in all selected species, we were able to collect a list of the MCOs of AHRG in all examined species (Table 1). The majority of the MCOs were related to protein kinases or protein degradation, which were Table 1 List of MCOs in all searched species Locus identifier AT3G16650 (PRL2) AT5G45620(RPN9) Hormone ABA IAA, BR Gene description PP1/PP2A phosphatases pleiotropic regulator 2 (PRL2) 26S proteasome regulatory subunit, putative (RPN9) AT3G28860 (PGP19, MDR11, MDR1, ATPGP19, ABCB19) IAA Belongs to the family of ATP-binding cassette (ABC) transporters. Also known as AtMDR11 and PGP19. Possibly regulates auxin-dependent responses by influencing basipetal auxin transport in the root. Acts upstream of phyA in regulating hypocotyl elongation and gravitropic response. Exerts nonredundant, partially overlapping functions with the ABC transporter encoded by AtPGP1. AT3G51260 (PAD1) JA 20S proteosomal alpha subunits. Interacts with SnRK, SKP1/ASK1 during proteasomal binding of an SCF ubiquitin ligase. AT4G01370 (MPK4) ET, JA Encodes a nuclear and cytoplasmically localized MAP kinase involved in mediating responses to pathogens. Its substrates include MKS1. AT2G47000 (MDR4) CK, IAA Multidrug resistance P-glycoprotein (MDR/PGP) subfamily of ABC transporters. Functions in the basipetal redirection of auxin from the root tip. Exhibits apolar plasma membrane localization in the root cap and polar localization in tissues above. AT1G05180 (AXR1) IAA, JA Encodes a subunit of the RUB1 activating enzyme that regulates the protein degradation activity of Skp1-Cullin-Fbox complexes, primarily, but not exclusively, affecting auxin responses. Acts alongside AS1 to exclude BP expression from leaves. AT2G32410 (AXL) IAA AXR1-LIKE (AXL) AT5G04870 (PAL, CPK1) ABA A calcium-dependent protein kinase that can phosphorylate phenylalanine ammonia lyase (PAL), a key enzyme in pathogen defense. AT1G18890 (CPK10) ABA Encodes a calcium-dependent protein kinase whose gene expression is induced by dehydration and high salt. Kinase activity could not be detected in vitro. AT1G74740 (CPK30) ABA Member of calcium-dependent protein kinase AT3G57530 (CPK32) ABA Calcium-dependent protein kinase. ABA signaling component that regulates the ABA-responsive gene expression via ABF4. AtCPK32 has autophosphorylation activity and can phosphorylate ABF4 in vitro. AT3G48750 (CDC2, CDKA1) ABA A-type cyclin-dependent kinase. Together with its specific inhibitor, the Kip-related protein KRP2, both of them regulate the mitosis-to-endocycle transition during leaf development. Dominant negative mutations abolish cell division. Loss of function phenotype has reduced fertility with failure to transmit via pollen. Pollen development is arrested at the second mitotic division. Expression is regulated by environmental and chemical signals. Part of the promoter is responsible for expression in trichomes. Functions as a positive regulator of cell proliferation during development of the male gametophyte, embryo and endosperm. Phosphorylation of threonine 161 is required for activation of its associated kinase. AT1G75820 (CLV1) CK Putative receptor kinase with an extracellular leucine-rich domain. Controls shoot and floral meristem size, and contributes to establish and maintain floral meristem identity. Negatively regulated by KAPP (kinase-associated protein phosphatase). CLV3 peptide binds directly CLV1 ectodomain. AT1G56070 (LOS1) ABA Encodes a translation elongation factor 2-like protein that is involved in cold-induced translation. Mutations in this gene specifically blocks low temperature-induced transcription of cold-responsive genes but induce the expression of CBF genes and mutants carrying the recessive mutations fail to acclimate to cold and is freezing sensitive. AT4G15900 (PRL1) ABA, CK, ET, IAA Mutations confer hypersensitivity to glucose and sucrose and augments sensitivity to cytokinin, ethylene, ABA and auxin. Encodes a nuclear WD40 protein that is imported into the nucleus. Essential for plant innate immunity. Interacts with MOS4 and AtCDC5. It is also predicted to have two DWD motifs. It can bind to DDB1a in Y2H assays, and DDB1b in co-IP assays, and may be involved in the formation of a CUL4-based E3 ubiquitin ligase, and may affect the stability of AKIN10. AT1G64520 (RPN12a) IAA, CK Regulatory particle non-ATPase 12a (RPN12a) AT1G02500 (SAM1, MAT1) ET Encodes an S-adenosylmethionine synthetase. SAM1 is regulated by protein S-nitrosylation. The covalent binding of nitric oxide (NO) to the Cys114 residue inhibits the enzyme activity. AT1G71830 (SERK1) BR Plasma membrane LRR receptor-like serine threonine kinase expressed during embryogenesis in locules until stage 6 anthers, with higher expression in the tapetal cell layer. SERK1 and SERK2 receptor kinases function redundantly as an important control point for sporophytic development controlling male gametophyte production. 224 Zhiqiang Jiang et al. / Journal of Genetics and Genomics 37 (2010) 219−230 commonly found in various signal transduction pathways. Additionally, some genes may have altered functions in different species. For examples, SAM1 was known to be a key enzyme in ethylene biosynthesis in A. thaliana, but in animals it is widely used as one-carbon unit carrier. Intriguingly, the number of ABA related genes was over-represented, indicating that certain aspects of ABA response pathways were shared by a wide range of species, which is coincided with the finding that some human diseases are influenced by ABA (Nagamune et al., 2008; Bodrato et al., 2009). Besides ABA-related genes, the number of IAA-relate genes was also over-represented, consistent with the hypothesis that some auxin-dependent mechanisms emerged before land plants occurred (Lau et al., 2009). However, only a small portion of IAA-relate genes showed ancient orthologues, suggesting that an intact auxin response pathway was not evolved prior to land plants. Comparison of Arabidopsis hormone related components in different species To categorize the function of orthologues in different hormone pathways, the percentage of the function distribution of AHRG orthologue in all selected species was shown in histogram (Fig. 2 and Supplemental Fig. 1). Different colors represent distinct function category (metabolism, transport, signaling). In general, the overall distribution of the orthologues in different species was largely similar. However, the histogram revealed that the numbers of hormone related components were increased with the evolution of plants, but remained almost the same in different animals. The expanded orthologues were mainly related to signal transduction, especially transcription factors. The increasing repertoire of transcription factors indicate more sophisticated and multi-leveled regulations evolved in the higher plants. Compared with higher plants, animals have much limited orthologues, suggesting that much of the functions and processes operated by the AHRG orthologues in plants were not present in animals. The hormone receptors represent the first perception site and probably the most critical step in plant hormone signaling pathways. In A. thaliana, several receptors have been identified in 7 hormones except for SA. Orthologues of these receptors have been searched in all selected species, and the number of receptor orthologues of each hormone was clustered by species in complete hierarchical method and in uncentered Pearson algorithm. The result was visualized in a heat map (Fig. 3). By clustering species, we found that all selected plants have most of hormone receptors genes (except BR receptors in P. patens), supporting the conserved feature of plant hormone responses over the evolution course. Not surprisingly, two microorganisms M. grisea and T. thermophila have numerous cytokinin receptors-like orthologues (a type of histidine kinases) that are common in prokaryotic two-component systems but absent in animals. Whether phytohormone Fig. 2. Distribution of orthologues in function category of hormone related genes in different species. The height of each bar showing in different color represents the percentage of orthologue genes in AHRG of selected plants as compared to that of A. thaliana. Color in blue shows orthologues belonging to hormone metabolism related genes; color in purple shows orthologues belonging to hormone transport genes; color in light yellow shows orthologues belonging to genes related to signal transduction. The number in brackets shows the quantity of AHRGs in the particular organism. Zhiqiang Jiang et al. / Journal of Genetics and Genomics 37 (2010) 219−230 225 Fig. 3. Heat map to visualize the complete hierarchy clustering of the species based on the number of hormone receptor orthologue genes. Uncentered Pearson algorithm is used. Different color represents the numbers of orthologues each species possesses. Species names are listed on the right side of the map; hormones names are listed above the map. cytokinin can influence the physiology of these two microorganisms remains unknown yet. Although GA, ABA and BR receptor orthologues were found to exist in animals, their authentic functions might differ dramatically from their counterparts in plants. In general, the clustering clades of species in receptor clustering heat map also show an evolutionary pattern. Together with previous function distribution of AHRG orthologues, the results revealed that the identification of receptor orthologue genes is accompanied by identification of more downstream component orthologues. It indicates that the expansion of signaling components may depend on the evolution of perception systems. Comparison of orthologues in plants Further study was performed to compare the difference between higher and lower plants, as well as monocot and dicot plant (Fig. 4 and Supplemental Table 3). The difference between higher and lower plants was pronounced, because several hormone response pathways were absence in lower plants. For instance, two lower plant species (C. reinhardtii and P. patens) did not contain a key component in Arabidopsis ethylene signaling pathway namely EIN3 (Chao et al., 1997; Alonso et al., 1999). By contrast, the difference between monocot and dicot plant was marginal. Most of divergent genes between monocot and dicot plants are from gene family, with more family members found in dicot plants. Some of the new-evolved members in dicot plants might carry on new gene functions. The increase of complexity in dicots may integrating more hormone responses and gene regulation in biological processes, which may as well enhance the robustness of hormone response network, making the network fine-tunable and flexible enough for numerous intrinsic or environmental disturbances. The orthologues shared by all selected plants were limited to certain signaling related genes, including protein kinases, and some hormone metabolism related genes. However, the number of orthologues shared by higher plants was increased (Supplemental Table 4). The number of orthologues in a specific function category (in the mean of percentage relative to the number in A. thaliana) was presented in broken line chart (Fig. 5). From the chart we can see that higher plants have more orthologues than lower plants. Besides, orthologues in higher plants are more conserved than lower plants. One possible explanation is that genome complexity is increasing from lower to higher plants, and higher plants have evolved more 226 Zhiqiang Jiang et al. / Journal of Genetics and Genomics 37 (2010) 219−230 Fig. 4. Venn graph of AHRG orthologues uniquely and commonly identified in higher plants. A: orthologue genes between monocot and dicot plants. Color in red represents the orthologue genes uniquely identified in dicot plants, color in yellow represents the orthologue genes uniquely identified in S. bicolor, and color in blue represents orthologue genes uniquely identified in rice. The overlapping part represents the common orthologue genes shared by S. bicolor and/or rice together with dicot plants. B: a detailed analysis of 278 higher plant commonly shared orthologues (gray part in A) as compared to lower plants. Color in red represents orthologues uniquely identified in higher plants, color in yellow represents orthologue genes uniquely identified in algae, and color in blue represents orthologue genes uniquely identified in moss. The overlapping part represents commonly shared orthologues. Fig. 5. Breaking line chart representing component changes in different species. Each color represents particular function category of hormone related component. The height of each point represents the percentage derived from the numbers of AHRG orthologues in a given non-Arabidopsis species versus the number of AHRGs. Zhiqiang Jiang et al. / Journal of Genetics and Genomics 37 (2010) 219−230 complicated and diverse regulation thus possess more genes participating in hormone responses. In general, the number of genes in each function category increases from lower to higher plants. An exception was that orthologues involved in protein degradation remains quite constant in number, suggesting the importance of this regulatory mechanism during the evolution of hormone actions. Comparison of higher plant orthologue tandem duplications We next sought to study the variation of the copy numbers of hormone related orthologue genes among species. To test orthologue copy-number variation, we determined the copy numbers of each orthologue gene in O. sativa, S. bicolor, P. trichocarpa, and compared them with that of A. thaliana. The result presenting in bar chart showed the distribution of each kind of copy-number variation by percentage (Fig. 6). We concluded that in monocot plants, single-copy orthologues are the majority, whereas in P. trichocarpa (a dicot), nearly half of genes have two orthologues. This can be explained by the fact that P. trichocarpa is a hybrid species whose genome contains allele polymorphism that is absent in A. thaliana. It was reported that at the genome level, each A. thaliana gene has an average of 1.4–1.6 orthologues in P. trichocarpa (yet no such study reported in rice or S. bicolor), except for certain type of genes such as F-Box family genes. In a 227 previous study, 4DTV test was conducted to evaluate the genome duplication of P. trichocarpa as compared to A. thaliana (Tuskan et al., 2006; Jansson and Douglas, 2007). The result showed that no genome duplication occurred between P. trichocarpa and A. thaliana. At the same time, the relative frequency of domains represented in the protein database (Prints, Prosite, Pfam, ProDom and SMART) in these two genomes is similar. In conclusion, tandem duplication predominately contributes to the observation of coding gene expansion (Tuskan et al., 2006; Jansson and Douglas, 2007). Our study revealed that each hormone related gene in A. thaliana had an average of 2.67 orthologues in P. trichocarpa, which is a little higher than the average of global genes. This finding is consistent with the previous study in P. trichocarpa that some hormone related gene family are greatly expanded, mainly due to independent expansion of these family genes (Tuskan et al., 2006). As a woody plant with a much longer life time, P. trichocarpa has to face a significantly more environmental disturbance throughout its life span. A higher level of allele polymorphism may provide genetic basis to boost the complexity of the regulatory networks through the process of gene divergent evolution or gene redundant regulation, which is also consists with previous finding. Interestingly, we found only one of the two key F-Box genes (EBF1/2) in ethylene signaling pathway in P. trichocarpa genome, suggesting that not every gene family was expanded in this woody species (Guo and Ecker, 2003). Fig. 6. Comparison of the copy numbers of AHRG orthologues in two cereals (rice and S. bicolor), P. trichocarpa, and A. thaliana. Different colors represent ratios that are calculated by the number of orthologue genes in selected species versus the number of AHRG in A. thaliana. 228 Zhiqiang Jiang et al. / Journal of Genetics and Genomics 37 (2010) 219−230 Discussion Plant hormones play important roles during a plant life span. Biosynthesized hormones are perceived by hormone receptors and the signal passes downstream through protein kinases, transcription factors together with other signaling components. Meanwhile, F-boxes, ubiquitins and other protein/nucleic acid degradation related genes might provide negatively-regulating brakes to the pathways, making responses in a controllable manner. Recently, study of the responses to phytohormones in lower plants or even animals became attractive (Marella et al., 2006; Yasumura et al., 2007; Nagamune et al., 2008; Bodrato et al., 2009; Lau et al., 2009). The release of massive genome information enables us to identify orthologues of hormone related genes within species beyond A. thaliana. Comparison of orthologues in different species would no doubt provide us with a more comprehensive understanding on the action of phytohormones from an evolutionary view. In the present study, we used 479 Arabidopsis hormone related genes as queries, searching for orthologues in several species across from animals to plants, monocot plants to dicot plants, lower plants to higher plants. In general, species phylogenetically close to each other would have a high similarity of their orthologue clusters. It indicated that an evolutionary process had been initiated in AHRG evolution. Meanwhile, MCOs of all selected species were collected (Table 1). These orthologue genes in Arabidopsis are mostly involved in auxin and ABA signaling pathways, consistent with previous studies that auxin-dependent mechanism was early evolved, and that ABA might have a biological role in both plants and animals. Most of the MCOs are protein kinases, indicating an important role of protein kinases in hormone perception. Histograms of orthologue function distribution in different species were also generated to visualize the difference between different species (Fig. 2 and Supplemental Fig. 1). The number of orthologue genes in lower plants was limited than higher plants. However, there are even fewer hormone related gene orthologues identified in H. sapiens than that of C. reinhardtii. It did not indicate a simple regulation used by H. sapiens. These orthologues, on the other hand, may interact with other genes in human genome to carry out different functions or regulations. Different from animals, failure to find key components orthologues in lower plants implied that some signaling pathways might be not actu- ally functional in those organisms. Receptor is the starting point of a signaling pathway. Comparison of hormone receptor gene orthologues in different species has several implications. First, orthologue genes involved in downstream signal transduction tend to expand as the receptor gene evolved. Second, higher plants have all hormone receptors genes, supporting the conserved feature of plant hormone perception over the evolution course. Third, although some receptor-like genes were found in animals, for instance, GA, ABA and BR receptor orthologues, their authentic functions might differ dramatically from their counterparts in plants. Forth, microorganisms also contain plant hormone receptors-like orthologues (e.g., histidine kinase receptors), but the physiological function of these genes might be distinctive from that in plants given that prokaryotes frequently use two-component systems to sense environmental changes. In a detailed analysis of plant orthologue clusters, in comparison to higher plants, lower plants contain fewer number of component orthologues. By contrast, the majority of orthologue genes were shared by both monocot plants and dicot plants. It seemed that components related to hormone responses tend to be convergent as species became complex. Variance in higher plants orthologue clusters was mainly copy number difference. As a decadeslong life span genus organism, P. trichocarpa has more gene copies than A. thaliana and rice. The larger gene pools might enable P. trichocarpa population to gain enhanced viability and ability to detect and respond to deleterious stresses and thus adapt to various environmental conditions. However, tandem duplication events of hormone related genes were not observed in monocot plants such as rice and S. bicolor, although in four higher plants assayed gene contents are quite similar to each other. It indicates that after monocot plants diverged from dicot plants, the genome sequences of monocot plants had changed in a slower evolutionary pace. The genome sequences of dicot plants, on the other hand, evolved much faster. We found that in P. trichocarpa, which is a woody dicot plant, the number of hormone-related genes more than doubles that in A. thaliana, although certain classes of genes such as F-Box genes were lost. Thus, both as a Dicotyledoneae species, P. trichocarpa and A. thaliana chose different evolutionary routes. Many eukaryotes duplicate their genomes and the resultant duplicated genes would either produce redundant or new functions (Soyer and Bonhoeffer, 2006; Ma et al., Zhiqiang Jiang et al. / Journal of Genetics and Genomics 37 (2010) 219−230 2009). At the same time, divergent evolutionary, de novo gene evolution events take place anywhere (Lynch and Conery, 2003). The intrinsic tendency of increasing hormone regulation network size may result from an increasing complexity of genome. Our analyses also revealed that when a perception system (indicated by the appearance of receptors) is formed in a given organism, the numbers of components downstream of the receptors will increase dramatically and a primitive pathway would thus occur. During the evolutionary process, the initial members of the pathway may recruit more partners. This process gradually increases the complexity and amount of genes participating in the hormone actions. Under this circumstance, the hormone-related pathways will expand even more significantly along with the whole genome duplication. It is thus conceivable that the primary causes in phytohormone related gene evolution should be the diversification of the pathway functions as well as the increasing complexity of the genome. The evolution of receptors should be regarded as the first event for the most hormone signal pathways to form. Afterward, during the evolutionary process, existing genes may be recruited to join the pathway. Otherwise, the existing genes may be lost or became functional divergent. The next step of the work is to concentrate on a more detailed analysis of each hormone pathway components. A deep comparison of eight hormones would tell us additional information about the natural force shaping the formation and evolution of all hormone related genes. Acknowledgements This work was supported by the National Science Foundation of China (No. 30625003, and 30730011) and the China Ministry of Education (No. 20060047). We thank Professor Jingchu Luo, He Zhang, and Ming Ni at Center of Bioinformatics of Peking University for the helpful discussion. We thank Pengpeng Li for manuscript revision, and Wenyang Li, Mingzhe Li for their technical assistance. Supplemental data Supplemental Fig. 1 and Tables 1–4 associated with this article can be found in the online version at www.jgenetgenomics.org. 229 References Alonso, J.M., and Ecker, J.R. (2006). Moving forward in reverse: genetic technologies to enable genome-wide phenomic screens in Arabidopsis. Nat. Rev. Genet. 7: 524−536. Alonso, J.M., Hirayama, T., Roman, G., Nourizadeh, S., and Ecker, J.R. (1999). EIN2, a bifunctional transducer of ethylene and stress responses in Arabidopsis. Science 284: 2148−2152. Bodrato, N., Franco, L., Fresia, C., Guida, L., Usai, C., Salis, A., Moreschi, I., Ferraris, C., Verderio, C., Basile, G., Bruzzone, S., Scarfi, S., De Flora, A., and Zocchi, E. (2009). Abscisic acid activates the murine microglial cell line N9 through the second messenger cyclic ADP-ribose. J. Biol. Chem. 284: 14777−14787. Chao, Q., Rothenberg, M., Solano, R., Roman, G., Terzaghi, W., and Ecker, J.R. (1997). Activation of the ethylene gas response pathway in Arabidopsis by the nuclear protein ETHYLENE-INSENSITIVE3 and related proteins. Cell 89: 1133−1144. Guo, H., and Ecker, J.R. (2003). Plant responses to ethylene gas are mediated by SCF(EBF1/EBF2)-dependent proteolysis of EIN3 transcription factor. Cell 115: 667−677. Jansson, S., and Douglas, C.J. (2007). Populus: a model system for plant biology. Annu. Rev. Plant Biol. 58: 435−458. Lau, S., Shao, N., Bock, R., Jurgens, G., and De Smet, I. (2009). Auxin signaling in algal lineages: fact or myth? Trends Plant Sci. 14: 182−188. Lynch, M., and Conery, J.S. (2003). The origins of genome complexity. Science 302: 1401−1404. Ma, W., Trusina, A., El-Samad, H., Lim, W.A., and Tang, C. (2009). Defining network topologies that can achieve biochemical adaptation. Cell 138: 760−773. Marella, H.H., Sakata, Y., and Quatrano, R.S. (2006). Characterization and functional analysis of ABSCISIC ACID INSENSITIVE3-like genes from Physcomitrella patens. Plant J. 46: 1032−1044. Nagamune, K., Hicks, L.M., Fux, B., Brossier, F., Chini, E.N., and Sibley, L.D. (2008). Abscisic acid controls calcium-dependent egress and development in Toxoplasma gondii. Nature 451: 207−210. O’Brien, K.P., Remm, M., and Sonnhammer, E.L. (2005). Inparanoid: a comprehensive database of eukaryotic orthologs. Nucleic Acids Res. 33: D476−D480. Peng, Z.Y., Zhou, X., Li, L., Yu, X., Li, H., Jiang, Z., Cao, G., Bai, M., Wang, X., Jiang, C., Lu, H., Hou, X., Qu, L., Wang, Z., Zuo, J., Fu, X., Su, Z., Li, S., and Guo, H. (2009). Arabidopsis Hormone Database: a comprehensive genetic and phenotypic information database for plant hormone research in Arabidopsis. Nucleic Acids Res. 37: D975−D982. Soyer, O.S., and Bonhoeffer, S. (2006). Evolution of complexity in signaling pathways. Proc. Natl. Acad. Sci. USA 103: 16337−16342. Tuskan, G.A., Difazio, S., Jansson, S., Bohlmann, J., Grigoriev, I., Hellsten, U., Putnam, N., Ralph, S., Rombauts, S., Salamov, A., Schein, J., Sterck, L., Aerts, A., Bhalerao, R.R., Bhalerao, R.P., Blaudez, D., Boerjan, W., Brun, A., Brunner, A., Busov, V., Campbell, M., Carlson, J., Chalot, M., Chapman, J., Chen, G.L., Cooper, D., Coutinho, P.M., Couturier, J., Covert, S., Cronk, Q., Cunningham, R., Davis, J., Degroeve, S., Dejardin, A., Depamphilis, C., Detter, J., Dirks, B., Dubchak, I., Duplessis, S., Ehlting, 230 Zhiqiang Jiang et al. / Journal of Genetics and Genomics 37 (2010) 219−230 J., Ellis, B., Gendler, K., Goodstein, D., Gribskov, M., Grimwood, J., Groover, A., Gunter, L., Hamberger, B., Heinze, B., Helariutta, Y., Henrissat, B., Holligan, D., Holt, R., Huang, W., Islam-Faridi, N., Jones, S., Jones-Rhoades, M., Jorgensen, R., Joshi, C., Kangasjarvi, J., Karlsson, J., Kelleher, C., Kirkpatrick, R., Kirst, M., Kohler, A., Kalluri, U., Larimer, F., Leebens-Mack, J., Leple, J.C., Locascio, P., Lou, Y., Lucas, S., Martin, F., Montanini, B., Napoli, C., Nelson, D.R., Nelson, C., Nieminen, K., Nilsson, O., Pereda, V., Peter, G., Philippe, R., Pilate, G., Poliakov, A., Razumovskaya, J., Richardson, P., Rinaldi, C., Ritland, K., Rouze, P., Ryaboy, D., Schmutz, J., Schrader, J., Segerman, B., Shin, H., Siddiqui, A., Sterky, F., Terry, A., Tsai, C.J., Uberbacher, E., Unneberg, P., Vahala, J., Wall, K., Wessler, S., Yang, G., Yin, T., Douglas, C., Marra, M., Sandberg, G., Van de Peer, Y., and Rokhsar, D. (2006). The genome of black cottonwood, Populus trichocarpa (Torr. & Gray). Science 313: 1596−1604. Yasumura, Y., Crumpton-Taylor, M., Fuentes, S., and Harberd, N.P. (2007). Step-by-step acquisition of the gibberellin-DELLA growth-regulatory mechanism during land-plant evolution. Curr. Biol. 17: 1225−1230.
© Copyright 2026 Paperzz