A peer-reviewed version of this preprint was published in PeerJ on 4 May 2017. View the peer-reviewed version (peerj.com/articles/3232), which is the preferred citable publication unless you specifically need to cite this preprint. Ning P, Liu C, Kang J, Lv J. (2017) Genome-wide analysis of WRKY transcription factors in wheat (Triticum aestivum L.) and differential expression under water deficit condition. PeerJ 5:e3232 https://doi.org/10.7717/peerj.3232 Genome-wide analysis of WRKY transcription factors in wheat (Triticum aestivum L.) and differential expression under water deficit treatment Pan Ning 1 2 3 4 1 , Congcong Liu 2 , Jingquan Kang 3 , Jinyin Lv Corresp. 4 College of Science, Northwest Agriculture and Forestry University, Yangling, Shannxi, China College of Animal Science and Technology, Huazhong Agricultural University, Wuhan, Hubei, China College of Life Science, Northwest Agriculture and Forestry University, Yangling, Shannxi, China College of Life Science, Northwest Agriculture and Forestry University, Yangling, Shaanxi, China Corresponding Author: Jinyin Lv Email address: [email protected] WRKY proteins, comprising one of the largest transcription factor families in plant kingdom, play crucial roles in the plant development and stress responses. Despite several studies on WRKYs in wheat (Triticum aestivum L.) were investigated, functional annotation information about wheat WRKYs was limited. Here, 171 TaWRKY transcription factors (TFs) were identified from the whole wheat genome and compared with proteins from other 19 species representing nine major plant lineages. A phylogenetic analysis, coupled with gene structure analysis and motif determination, divided these TaWRKYs into seven subgroups (Group I, IIa-e, III). Chromosomal location showed that the most TaWRKY genes were enriched on four chromosomes, especially on chromosome 3B, and 85 (49.7%) genes were either tandem (5) or segmental duplication (80), which suggested that though tandem duplication has contributed to the expansion of TaWRKY family, segmental duplication probably played a more pivotal role. The cis-acting elements analysis revealed putative functions of WRKYs in wheat during development as well as under numerous biotic and abiotic stresses. Finally, the expression of TaWRKY genes in flag leaves, glumes and lemmas under water stress were analyzed, and we found different TaWRKY genes preferentially express in specific tissue during the grain-filling stage. Our results provide a more extensive knowledge on TaWRKYs, which helps to complete the information of WRKY gene family in wheat, and also contribute to screen more candidate genes for further investigation on function characterization of WRKYs under various stresses. PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 1 Genome-wide analysis of WRKY transcription factors in wheat (Triticum 2 aestivum L.) and differential expression under water deficit treatment 3 Pan Ning1, Congcong Liu3, Jingquan Kang2, Jinyin Lv2* 4 1 College of Science, Northwest A&F University, Yangling, Shaanxi, China 5 2 College of Life Science, Northwest A&F University, Yangling, Shaanxi, China 6 3 College of Animal Science and Technology, Huazhong Agricultural University, Wuhan, 7 Hubei, China 8 9 10 Corresponding Author: Jinyin Lv 11 12 E-mail address: [email protected] 13 1 PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 14 ABSTRACT 15 WRKY proteins, comprising one of the largest transcription factor families in plant kingdom, 16 play crucial roles in the plant development and stress responses. Despite several studies on 17 WRKYs in wheat (Triticum aestivum L.) were investigated, functional annotation information 18 about wheat WRKYs was limited. Here, 171 TaWRKY transcription factors (TFs) were 19 identified from the whole wheat genome and compared with proteins from other 19 species 20 representing nine major plant lineages. A phylogenetic analysis, coupled with gene structure 21 analysis and motif determination, divided these TaWRKYs into seven subgroups (Group I, 22 IIa-e, III). Chromosomal location showed that the most TaWRKY genes were enriched on four 23 chromosomes, especially on chromosome 3B, and 85 (49.7%) genes were either tandem (5) 24 or segmental duplication (80), which suggested that though tandem duplication has 25 contributed to the expansion of TaWRKY family, segmental duplication probably played a 26 more pivotal role. The cis-acting elements analysis revealed putative functions of WRKYs in 27 wheat during development as well as under numerous biotic and abiotic stresses. Finally, the 28 expression of TaWRKY genes in flag leaves, glumes and lemmas under water stress were 29 analyzed, and we found different TaWRKY genes preferentially express in specific tissue 30 during the grain-filling stage. Our results provide a more extensive knowledge on TaWRKYs, 31 which helps to complete the information of WRKY gene family in wheat, and also contribute 32 to screen more candidate genes for further investigation on function characterization of 33 WRKYs under various stresses. 34 Keywords: wheat, WRKY, water deficit, expression 35 INTRODUCTION 36 Plants have evolved a wide range of unique strategies to cope with various biotic and abiotic 37 stresses through physical adaption, molecular and cellular changes (Ahuja et al., 2010; Knight 38 & Knight, 2001). Transcription regulation of gene expression in response to developmental 39 and environment changes, mediated by the DNA-binding transcription factors, is one of the 40 most important regulatory mechanisms in plants (Ahuja et al., 2010; Buscaill & Rivas, 2014). 41 WRKYs, one of the largest families of regulators, have been proven to play key roles in 2 PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 42 numerous stress responses and some development processes(Rushton et al., 2010). Since the 43 first report of WRKY TFs identified in sweet potato(Ishiguro & Nakamura, 1994), WRKY 44 proteins have been found throughout the plant lineage and also in some diplomonads, social 45 amoebae, fungi incertae seais, and amoeboza in succession(Rinerson et al., 2015). 46 WRKY TFs were defined by the presence of one or two highly conserved WRKY 47 domains (WDs) of 60 amino acid residues, including the almost invariant WRKYGQK 48 heptapeptide at the N-terminus, followed by a C2H2 (C-X4-5-C-X22-23- H-X-H) or C2HC 49 (C-X7-C-X23-H-X-C) zinc-finger structure at the C-terminus (Eulgem et al., 2000; Rushton et 50 al., 2010). The WRKY family members are classified into three groups (I, II and III) based on 51 the number of WDs and the features of their zinc-finger-like motif (Eulgem et al., 2000; 52 Rushton et al., 2010). Group I typically contains two WDs including a C2H2 zinc-finger 53 structure, whereas Group II and III are characterized by a single WD, including a C2H2 and 54 C2HC zinc-finger motif, respectively. Group II can be further divided into five subgroups 55 (IIa-IIe) based on phylogenetic analysis of the WDs (Eulgem et al., 2000; Rushton et al., 56 2010). Members of WRKY family regulate gene expression through exclusively binding to 57 the W-box (TTGACC/T), a cis-element in the promoter region of target genes (Bakshi & 58 Oelmüller 2014; Ulker & Somssich, 2004) 59 Recently, accumulating evidence has demonstrated that WRKY TFs, as important 60 components of plant signaling web, regulate specific transcriptional programs during plant 61 development, as well as in response to a variety of biotic and abiotic stimuli (Ahuja et al., 62 2010; Bakshi & Oelmüller, 2014; Rushton et al., 2012; Rushton et al., 2010). For example, 61 63 of the PtrWRKY genes in Populus are induced by biotic and abiotic treatments, such as 64 Marssonina bruuea, salicylic acid (SA), methyl jasmonate (MeJA), wounding, cold and 65 salinity (Jiang et al., 2014). In rice, expression of OsWRKY71 gene is induced by cold 66 stress(Kim et al., 2016), while it also encodes a transcriptional repressor of GA signaling in 67 aleurone cells (Zhang et al., 2004). Five transgenic broccoli lines over-expressing BoWRKY6 68 demonstrated significant increased resistance to downy mildew, with resistant levels from 69 low to very high(Jiang et al., 2016). In addition, WRKY TFs were also implicated to 70 modulate plant development, such as seed development and germination (Raineri et al., 2016; 71 Xie et al., 2007; Zhang & Wang, 2011), root growth (Ding et al., 2015; Ranjan & Sawant, 3 PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 72 2014) , stem elongation (Yu et al., 2012; Zhang et al., 2011), embryogenesis (Jimmy & Babu 73 2015; Lagace & Matton 2004), senescence (Ricachenevsky et al., 2010; Sakuraba et al., 74 2016), trichome development (Johnson, 2002). 75 Wheat (Triticum aestivum L.), one of the world’s three main cereal with the highest 76 monetary(Keating et al., 2014), is affected by multi-environment stresses, such as salinity, 77 extreme temperature and especially drought, which is a principal constraint to global 78 production of wheat (Mwadzingeni et al., 2016; Wang et al., 2015). However, the mechanism 79 by which wheat responds to abiotic stress has been still poorly understood, which might due 80 to its large genome (approximately 17GB). The identification and functional characterization 81 of the WRKY family in wheat will contribute to clarify the mechanism of stress response. 82 Several studies on wheat WRKY identification have been reported in succession. Forty-three 83 and 92 putative TaWRKYs were early identified from publicly available EST by Niu et al. 84 (2012) and Zhu et al. (2013), respectively. Okay et al. (2014) characterized 160 TaWRKYs 85 and their expression profiling in RNA-Seq libraries. Recently, Zhang et al. (2016) identified 86 116 WRKYs, and 13 of them were characterized as senescence-associated genes. Here, we 87 provide a more extensive knowledge on TaWRKYs based on the whole genome sequence of 88 wheat. A total of 1113 WRKY TFs were identified in 20 plants representing the nine major 89 evolutionary lineages to gain preliminary insight into the evolution of the WRKY family in 90 Plantae. In addition, we identified 171 TaWRKYs from wheat, and detailed analysis, 91 including gene classification, physical and chemical parameters prediction, phylogenetic 92 analysis, chromosomal location, duplication events, conserved motif determination, 93 exon-intron structure and cis-acting element analysis were employed. Finally, gene 94 expression patterns of TaWRKY genes in flag leaf, glume and lemma tissues under water 95 deficit treatment were further determined using qRT-PCR. These results will helps to 96 complete the information of WRKY gene family in wheat, and also contribute to screen more 97 candidate genes for future functional investigation of TaWRKYs under various stresses. 98 Materials AND Methods 99 Database search and identification of WRKYs 4 PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 100 The protein sequences of 20 plants from nine different major taxonomic lineages were 101 downloaded from several public databases. All of the amino acid sequences were obtained 102 from the following sources: the Eudicots Arabidopsis thaliana (At), Populus trichocarpa and 103 the Monocots Brachypodium distachyon, Oryza sativa, Sorghum bicolor, Triticum aestivum 104 (Ta, IWGSC1+popseq.31.pep) and Zea mays, and Basal Magnoliophyta Amborella 105 trichopoda and the Bryophyte Physcomitrella patens and the Lycophyte Selaginella 106 moellendorffii and the Chlorophytes Ostreococcus lucimarinus and the Rhodophytes 107 Cyanidioschyzon merolae 108 info/website/ftp/index.html); the Eudicot Cucumis sativus and Chlorophytes Coccomyxa 109 subellipsoidea C-169, Micromonas pusilla CCMP1545 and Volvox carteri 110 Cyanophora paradoxa 111 OrganismDownload.jsf?organism= PhytozomeV9); the Chlrophyte Ostreococcus tauri, 112 Gymnosperms Picea sitchensis and Rhodophyte Galdieria sulphuraria from NCBI 113 (http://www.ncbi.nlm.nih.gov/protein/). The evolutionary relationship of these 20 species 114 were obtained from NCBI (https://www.ncbi.nlm.nih.gov/Taxonomy/CommonTree/ 115 wwwcmt.cgi), and visually displayed by phylogenetic tree using FigTree v1.4.3 program 116 (http://tree.bio.ed.ac.uk/software/figtree/). from the Ensembl Plants (http://plants.ensembl.org/ and Glaucophyte from Phytozome (http://genome.jgi.doe.gov/pages/dynamic 117 To identify the WRKY TFs in various species, the HMM profile of the WRKY domain 118 (PF03106) downloaded from the Pfam database (http://pfam.xfam.org) (Finn et al., 2016) was 119 applied as a query to search against the local protein database using HMMsearch program 120 (HMMER3.0 software: http://hmmer.janelia.org/) (Finn et al., 2011) with an E-value cutoff of 121 1.0. The sequences obtained were then submitted to the Pfam database to detect the presence 122 of WRKY domains. The protein sequences containing complete or partial WRKY domains, 123 which may be pseudogenes, incomplete assemblies, sequencing errors or mispredictions 124 (Rinerson et al., 2015), were both considered as putative WRKYs. The physical and chemical 125 properties including number of amino acids (NA), molecular weight (MW), theoretical pI 126 (pI) , grand average of hydropathicity (GRAVY), aliphatic index (AI) and instability index (II) 127 of putative TaWRKY proteins were calculated using the online ExPASy-ProtParam tool 128 (http://web.expasy.org/ protparam/). 5 PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 129 Phylogenetic analysis 130 MEGA7.0 program was employed to construct the unrooted phylogenetic tree of identified 131 WRKY protein domains in Triticum aestivum L. and Arabidopsis thaliana L. using the 132 maximum likelihood method(Kumar et al., 2016). The parameters of the constructed trees 133 were: test of phylogeny: bootstrap (1000 replicates), gaps/missing data treatment: partial 134 deletion, model/method LG model, rates among sites: gamma distributed with invariant sites 135 (G). Only the value of bootstrap more than 60 could be displayed on the tree. 136 Chromosomal location of TaWRKY genes 137 To map the locations of WRKY gene transcripts in Triticum aestivum L., MapInspect software 138 (http://www.softsea.com/download/ MapInspect. html) was employed to visualize the 139 chromosomal distribution of deduced TaWRKY genes according to their initial position and 140 length of chromosome. The chromosomal location information of TaWRKYs was obtained 141 from Ensembl Plants website (http://archive.plants.ensembl.org/Triticum_aestivum/Info/ 142 Index). 143 To detect the gene duplication, the CDS sequences of WRKY genes in wheat were blasted 144 against each other (e-value<1e-10, identity > 90%) (Song et al., 2014). Tandem duplicated 145 TaWRKY genes were defined as two or more adjacent homologous genes located on a single 146 chromosome, while homologous genes between different chromosomes were defined as 147 segmental duplicated genes (Bi et al., 2016). 148 Characterization of gene structure, conserved motif and putative cis-acting elements 149 The exon-intron structures of TaWRKY genes were obtained by mapping the CDS to DNA 150 sequences using the Gene Structure Display Server2.0 (http://gsds.cbi.pku.edu.cn/) (Hu et al., 151 2015). CDS (IWGSC1+popseq.31.cds) and genomic sequences (IWGSC1+popseq.31.dna) in 152 Triticum aestivum L. were downloaded from Ensembl Plants website 153 (ftp://ftp.Ensemblgenomes.org/pub/release-31/plants/fasta/triticum_aestivum/). 154 To discover motifs in TaWRKY protein sequences, the online tool MEME 4.11.2 155 (Multiple Expectation Maximization for Motif Elication: http://meme-suite.org/) was utilized 156 to identify the conserved motifs in full length TaWRKYs (Bailey et al., 2009). The optimized 6 PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 157 parameters were as follows: distribution of motifs, zero or one occurrence per sequence; 158 maximum number of motifs, 10; minimum sites, 6; maximum width 60. 159 The 1.5kb upstream of the transcription start site (-1) of all identified TaWRKY transcripts 160 was extracted as promoter to predict cis-acting elements using the PlantCARE online 161 (http://bioinformatics.psb.ugent.be/webtools/plantcare/html/) (Lescot M, 2002). Then 162 statistics derived from hits of various cis-acting elements in all TaWRKY transcripts were 163 made, and displayed by diagram. 164 Plant materials, drought treatment and qRT-PCR 165 A hexaploid winter wheat (Triticum aestivum L.) cv. Zhengyin1 (St1472/506) was taken in 166 our pot experiment, which was carried out from October, 2015 to June, 2016 in a greenhouse. 167 Seeds were sown in each plastic pot filled with 7 kg of soil, earth-cumuli-orthic Anthrosols 168 collected in northwest China. An equivalent of 0.447 g (urea)/kg-1 (soil) and 0.2 g (K2HPO3)/ 169 kg-1 (soil) were mixed in soil with a net water content of 29.2% at the largest field water 170 capacity. Water control was carried out from anthesis (April 17, 2016). Normal water supply 171 and artificial soil desiccation were implemented with 70-75(control group)and 45-50% 172 ( moderate water stress) of the largest field capacity respectively. Spikes and flag leaves of 173 wheat collected at 0, 1, 3, 5, 10, 15, and 25 days after anthesis (DAA), were immediately 174 frozen in liquid nitrogen and then stored at -80℃ for subsequent analysis. Spikes were 175 separated as glume, lemma, grain, palea, and rachis, and only glume and lemma were used in 176 experiment. 177 Total RNA was extracted from wheat tissues using the Trizol reagent (Tiangen, China) 178 following the manufacturer’s instruction, and then digested with RNase-free DNaes I. The 179 quantity and concentration of RNA was evaluated by UV spectrophotometry. The first strand 180 cDNA synthesis was generated using PrimeScriptTM RT Reagent Kit (TaKaRa, China), and 181 the synthesized cDNA products were diluted 1:9 with nuclease-free water to use in qRT-PCR. 182 Primer Primer 5.0 and AllelelID 6.0 (http://www.premierbiosoft.com/index.html) were used 183 to design gene-specific primers (Table S1). Wheat Tublin was taken as the reference gene. 184 185 The qRT-PCR was carried out using based on SYBR GreenSYBR® Premix Ex TaqTM (TaKaRa, Japan) according to the manufacturer’s instructions with Bio Rad CFX96TM 7 PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 186 real-time PCR detection system (BioRad, USA). Reaction parameters for thermal cycling 187 were: 95℃ for 30 s, followed by 39 cycles of 95℃ for 5 s and 60℃ for 30 s, and at last a 188 melting curve (65-95℃, at increments of 0.5℃) generated to check the amplification. The 189 gene expression levels were calculated with the 2-ΔΔCT methods (Livak & Schmittgen, 2001), 190 and three biological replicates were used. 191 Results 192 Identification of WRKYs in wheat and comparative analysis 193 To comprehensively analyze and identify WRKY TFs in plants, 20 plants representing the 194 nine major evolutionary lineages were chosen for analysis. After searched by HMMER and 195 detected WRKY domains by Pfam database, a total of 1113 WRKY TFs were obtained (Table 196 S2). The evolutionary relationships of various species and the number of WRKY TFs were 197 shown in Fig.1. Most of terrestrial plants, including Monocots, Eudicots and Bryophytes, 198 contained 86 to 171 WRKY proteins, while Picea sitchenis belonging to the Gymnosperm 199 carried only eight WRKY TFs, which could be due to sequencing unfinished. However, six or 200 less WRKY proteins were found in aquatic algaes, of which the number of WRKY TFs in 201 Rhodophytes and Glaucophytes showed zero. In general, the number of WRKY TFs in many 202 higher plants was more than that in lower plants, which suggested that the WRKY TFs may 203 play an important role in the process of plant evolution. The number of WRKY proteins 204 increased as plants evolved, possibly because of genome duplication. 205 In wheat, a total of 174 WRKY proteins were searched using HMM search program. 206 Subsequently, all obtained sequences were verified by Pfam database, which resulted in the 207 identification of 171 WRKY TFs (Table S3). There were 115 TaWRKYs named as 208 TaWRKY1 to TaWRKY115 based on their chromosome location, and other 56 sequences 209 called TaWRKY116 to TaWRKY171 as they were anchored in the scaffolds. Among the 171 210 TaWRKYs, manual inspection showed that some of them are partial sequences, in which 211 WRKY domains or zinc-finger structures were incomplete or even did not exist. 212 The parameters used to describe the TaWRKY proteins were shown in Table S3. 213 Molecular weight, theoretical pI and aliphatic index could not be computed in sequences 214 containing several consecutive undefined amino acids. The lengths of TaWRKY proteins 8 PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 215 ranged from 44 (TaWRKY121) to 1,482 residues (TaWRKY78), whereas the PI ranged from 216 4.96 (TaWRKY164) to 10.73 (TaWRKY40). This suggested that different TaWRKYs might 217 operate in various microenvironments (Wang et al., 2014). The values of grand average of 218 hydropathicity were all negative, which indicated that TaWRKY proteins were all hydrophilic. 219 Almost all TaWRKYs were defined as unstable proteins, and only 30 TaWRKYs with 220 instability index less than 40 were considered to be stable proteins. 221 Classification and phylogenetic analysis of TaWRKYs 222 To categorize and investigate the evolutionary relationship of the TaWRKY proteins in detail, 223 we constructed an unrooted maximum-likelihood phylogenetic tree with 262 putative WDs in 224 Arabidopsis and wheat (Fig. 2). Based on the classification of AtWRKYs and primary amino 225 acid structure feature of WRKY (Eulgem et al., 2000), TaWRKYs were classified into three 226 major groups (Groups Ⅰ, Ⅱ and Ⅲ). The 30 TaWRKYs possessing two WDs and C2H2-type 227 zinc finger motifs (C-X3-4-C-X22-23- H-X1 -H) were classified into group I. Group II was 228 comprised of 95 sequences, and each protein contained a single WD and C2H2-type zinc 229 finger structure (C-X4-5-C-X23-H-X1-H). We further divided Group II into five subgroups, 230 including Ⅱa, Ⅱb, Ⅱc, Ⅱd and Ⅱe with 11, 7, 50, 17 and 10 members, respectively. Finally, 45 231 TaWRKYs with a single WD were assigned to Group Ⅲ because of their C2HC zinc-finger 232 structure (C-X6-7-C-X23-28-H-X1-C). 233 As shown in Table S3, besides the highly conserved WRKYGQK motifs, we found three 234 variants in TaWRKYs, namely WRKYGKK (10), WRKYGEK (11) and WSKYGQK (1), 235 which were distributed in subgroup Ⅱc, Ⅲ and TaWRKY157, respectively. In addition, two 236 zinc-finger form variants, C-X6-P-X23-H-X-C and C-X6-F-X23-H-X-C were identified in 237 TaWRKY80 and TaWRKY166, respectively. The unique one in all putative TaWRKYs is 238 TaWRKY157, which contained two WDs but with C2HC-type zinc finger structure 239 (C-X7-C-X23--H-X1-C). The “Group I Hypothesis” sees all WRKY genes evolving from 240 Group I C-terminal WRKY domains (Rinerson et al., 2015). Therefore, TaWRKY157 could 241 be taken as an intermediate type of Group I-III member preliminary, though it was classified 242 into Group Ⅲ in phylogenetic tree. 9 PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 243 In this study, the Group Ⅱ was found to be the largest group of WRKY TFs family in 244 wheat. The members in Group Ⅱ accounted for approximately 55.6% of all putative 245 TaWRKYs, which was consistent with Musa balbisiana (Goel et al., 2016), pepper(Diao et 246 al., 2016) and soybean(Song et al., 2016a). Subgroups Ⅱa and Ⅱb were separated from one 247 clade, and Ⅱd and Ⅱe clustered to a branch, which is similar to previous study in wheat (Okay 248 et al., 2014; Zhu et al., 2013) 249 Chromosomal location of TaWRKY genes 250 Among the 171 TaWRKY genes, 115 were mapped onto the 21 wheat chromosomes, and the 251 other 56 were anchored in the scaffolds (TaWRKY116-171) (Table S3, Fig. 3). There were 252 relatively more TaWRKY genes distributed in Chromosomes 3B (18, 15.7%), 5B (11, 9.57%), 253 2A (9, 7.8%) and 5D (9, 7.8%). In contrast, chromosome 6B, 7A and 7B contained only one 254 TaWRKY genes (0.870%). In general, most identified TaWRKY genes were observed in the 255 top and bottom sections of chromosomes and only less in central sections. It suggested that 256 the TaWRKY genes were mapped on the all chromosomes with a significantly non-random 257 and uneven distribution. The TaWRKY genes density in each chromosome was ranged from 258 0.004/Mb (7B) to 0.056/Mb (5D) (Fig. S1). 259 Duplication events of WRKY genes have been found universally in a number of plants, 260 such as peanut (Song et al., 2016b), white pear (Huang et al., 2015) and Brassica napus (He 261 et al., 2016). In this study, we identified 79 TaWRKY gene duplication pairs which 262 corresponded to 85 genes (Table S4, Fig.3). It indicated that some of the TaWRKY genes have 263 more than one duplicated gene, which could be due to the multiple rounds of whole genome 264 duplication in wheat. As shown in Fig. 3, two WRKY tandem duplication clusters 265 (TaWRKY59/TaWRKY60, TaWRKY113/TaWRKY114/TaWRKY115) were identified on 266 chromosome 3B and 7D, respectively. In addition, there were 80 genes have undergone 267 segmental duplication, which were paralogs of WRKY genes on different chromosomes (Bi et 268 al., 2016). 269 Gene structure analysis of WRKY genes in wheat 270 The exon-intron distribution was analyzed to further detect structural features of TaWRKY 271 genes. Figure S2 showed that the number of introns in TaWRKY family genes varied from 0 10 PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 272 to 5, while 0 to 8 in rice (Xie et al., 2005) and 0 to 22 in Musa acuminate (Goel et al., 2016), 273 respectively. It suggested that WRKYs in wheat show lower gene structure diversity. A total of 274 72 (42.11%) TaWRKY genes with two introns accounted for the largest proportion, followed 275 by 44 (25.73%), 23 (13.45%), 15 (8.77%), 14 (8.19%) and 3 (1.75%) genes, possessing 1, 3, 276 4, 0 and 5 introns, respectively. The distribution pattern of introns and exons was 277 group-specific, which is similar to cassava (Wei et al., 2016) and carrot (Li et al., 2016), and 278 TaWRKY gens belonging to the same subfamily shared similar exon-intron structure. For 279 example, TaWRKYs in Group Ⅲ contained 0-5 introns, approximately 91.11% (41/45) of 280 which possessed 1-2 introns. 281 There were two types of introns (V-type and R-type) located in WRKY domain, which 282 were characterized based on their splice site (Bi et al., 2016; Wang et al., 2014; Xie et al., 283 2005). V-type introns (phase 0) have a splice site before the V (Valine) residue in C2H2 zinc 284 finger structure, and R-type introns (phase 2) on the R (Arginine) residue of the WRKY 285 domain (Bi et al., 2016; Wang et al., 2014; Xie et al., 2005). In our study, all of the TaWRKY 286 genes (17) in Group IIa and IIb only contained V-type introns except TaWRKY165, which had 287 no intron. However, R-type introns were mostly observed in all the other groups (Group I, IIc, 288 IId, IIe and III) (Fig. S2). It indicated that the intron phases were significantly conserved 289 within the same group but remarkably different between groups (Chen, 2014). These results 290 provided an additional evidence to support the phylogenetic groupings and TaWRKYs 291 classification. 292 Motif composition analysis of TaWRKYs 293 The conserved motifs of WRKY proteins in wheat were predicted to explore the similarity 294 and diversity of motif compositions. A total of 10 distinct motifs, named motif 1-10, were 295 detected using MEME online program (Fig. 4). Of these 10 motifs, motif 1 and 4 contained a 296 WRKYGQK sequence, which is basic feature of TaWRKYs. At least one of them contained 297 in almost all deduced TaWRKYs, except several incomplete proteins, such as TaWRKY11, 298 43 and 162. Motif 1 was observed almost in all groups, whereas motif 4 dispersed in Group I 299 mostly. 11 PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 300 As displayed schematically in Fig. 4, TaWRKYs within the same group or subgroup 301 shared similar motif compositions. For instance, motif 6 and 10 were unique to Group I, 302 whereas motif 7 is specific to Group Ⅲ. The motif unique to a particular group is likely to be 303 involved in specific biological process in plants. Therefore, it could be speculated that each 304 family or subfamily of WRKY genes might be responsible for the specific biological process 305 (Goel et al., 2016; Lippok, 2003). Furthermore, members in subgroup IIa and IId showed 306 almost the identical motif distribution pattern, indicating functional similarity among them. 307 Interestingly, these two subgroups were also clustered to a branch in phylogenetic tree. 308 Likewise, the same phenomenon was also observed in subgroup IId and IIe. These results 309 further validated the categorization of TaWRKYs and phylogenetic relationships. 310 Variety of cis-acting elements in the promoter regions of wheat WRKY genes 311 It was reported that cis-acting elements in the promoter of are crucial to gene expression, 312 which is an en essential part of its function (Dehais, 1999; Lescot M, 2002). The 1.5kb 313 upstream promoter regions of all TaWRKYs were used to predict cis-acting elements using the 314 online database PlantCARE. Here we found various cis-acting elements in 142 out of 171 315 TaWRKY genes, while the rest of WRKYs could not be detected because of short sequence in 316 their upstream regions (Table S5, Fig.5). A great deal of cis-acting elements were related to 317 response of hormones and biotic stresses, including methyl jasmonate (MeJA), abscisic acid 318 (ABA), salicylic acid (SA), gibberellins(GA), auxin (IAA), zein and fungus. 319 MeJA-responsive elements with largest portion were found in the promoter regions of 109 320 TaWRKY genes. Additionally, some elements involved in various abiotic stresses, such as 321 light, wound, cold, heat, anaerobic induction and drought, were identified in a large number 322 of TaWRKY genes. A total of 44 light-responsive elements were almost distributed in all of 323 the TaWRKYs. There were also some elements observed in genes may regulate expression of 324 different tissues (seed, root, shoot, leaf, phloem/xylem, endosperm and meristem) in wheat 325 development. Interestingly, a total of 76 TaWRKYs contained W-box (TTGACC), which 326 regulates gene expression by binding WRKY, indicating these genes may auto-regulated by 327 itself or cross-regulated with others (Chi et al., 2013; Jiang et al., 2014). MBSI, a MYB 328 binding site involved in flavonoid biosynthetic genes regulation, only existed in TaWRKY87 12 PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 329 and TaWRKY142, which suggested that these two genes may regulate flavonoid metabolism. 330 There were two unique genes, TaWRKY58 and TaWRKY94, which might response cold and 331 drought stresses for containing a cold and dehydration responsive elements, C-repeat/DRE. 332 Another special MYB binding site MBS, participated in drought response, were identified in 333 103 genes, indicating that most of TaWRKYs seem to be involved in drought stress response 334 (Table S5). It was noteworthy that all members analyzed contained more than one cis-element. 335 Our analysis and previous studies both suggested that TaWRKY genes are involved in 336 transcriptional regulation of plant growth and stress responses (Bakshi & Oelmüller 2014; 337 Ding et al., 2015; Raineri et al., 2016; Rushton et al., 2010). 338 Expression profiles of TaWRKY genes under drought stress. 339 With the exception of two shorter sequences (TaWRKY122 and 169), 12 out of 171 transcripts 340 were selected as candidate drought responsive genes according to their orthologous WRKYs in 341 Arabidopsis, which are involved in water deprivation, using the Biomart (http://plants. 342 ensembl.org/index.html) (Table S6). The AtWRKYs responding to water stress were obtained 343 based on function annotation in TAIR database (http://www.arabidopsis.org/index.jsp). To 344 validate these candidate 12 drought-response genes, we determined their expression pattern in 345 flag leaves, glumes and lemmas using qRT-PCR. In our study, expression of genes could be 346 detected at the transcript level almost in all selected tissues during the grain-filling period 347 except TaWRKY8 (Table S7, Fig. 6). 348 As shown in Fig. 6, we found that TaWRKY genes in glumes and lemmas share more 349 similar expression pattern compared with in flag leaves. A relatively large group of genes, 350 including TaWRKY1, 20, 31, 112, 123, 142, and 149 were significantly up-regulated in flag 351 leaves at 0, 3 or 5 DAA, which suggested that these genes were highly induced at the early 352 grain-filling stage (0-8 DAA). Among them, some genes, like TaWRKY123 and 142, 353 maintained slightly up-regulated initially and then were restrained followed by an increase in 354 the last point of the drought treatment in glumes. In addition, peaks in the expression of 355 several members (TaWRKY1, 20, 123 and 142) were mostly found at 5DAA in lemma, which 356 lags behind other two tissues. Furthermore, water stress induced the most rapid upregulation 357 of some genes, and showed differences in three tissues. For example, both TaWRKY31 and 13 PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 358 TaWRKY149 were induced quickly in flag leaves after the onset of the drought treatment (0 359 DAA), approximately increasing up 6.67 and 8.22-fold, respectively. However, later 360 induction was observed in glumes and lemmas. The immediate transcription response 361 observed upon water stress appeared to be related to a more rapid perception of the drought 362 (Eulgem et al., 2000; Rushton et al., 2010). Genes in another interesting cluster, composed of 363 TaWRKY90, 97, 120 and 133, were down-regulated or slightly changed in glumes and 364 lemmas during the early grain-filling stage, and induced during the middle (9-15 DAA) or 365 late grain-filling stage (16-25 DAA) under drought stress. Whereas, two genes (TaWRKY120, 366 133) of them in flag leaves were strongly induced 3 days after water stress. It indicated that 367 TaWRKY120 and TaWRKY 133 genes were predominantly expressed in flag leaves at the 368 early-filling stage. Our data suggested that the tissues-specific expression of TaWRKYs was 369 existed in wheat, and it appeared to be consistent with their role in tissues. 370 Discussion 371 WRKY TFs are one of the largest families of transcriptional regulators in plants, and form 372 integral parts of signalling webs that regulate many plant process (Rushton et al., 2010). 373 Although some investigation on wheat WRKYs have been reported in sucession, 374 characterization and functional annotation information about TaWRKYs was still sufficient. 375 Niu et al. (2012) identified 43 putative TaWRKYs, named TaWRKY1 to TaWRKY43, which 376 were represented with the same names in another study carried out by Okay et al. (2014). Zhu 377 et al. (2013) identified 92 TaWRKYs from the NCBI dbEST and/or the DFCI gene index, and 378 constructed a phylogeny map. A total of 160 TaWRKYs were characterized in terms of their 379 HMM profiles, conserved domains, distribution among WRKY groups and phylogenetic 380 relationships, and discovered some drought responsive members validated in leaf and root 381 tissues (Okay et al., 2014). Recently, Zhang et al. (2016) identified 116 WRKYs, and 13 of 382 them were characterized as senescence-associated genes. 383 Our present study improves the information of WRKYs in wheat, and provides a more 384 comprehensive knowledge based on the wheat genome. We identified WRKYs in 20 species 385 and characterized 171 wheat WRKYs in terms of gene classification, physical and chemical 386 parameters prediction, phylogenetic analysis, chromosomal location, duplication events, 14 PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 387 conserved motif determination, exon-intron structure and cis-acting element analysis, which 388 might help to screen candidate stress-responsive genes in wheat for further study. 389 The whole genome duplication can result in divergence and formation of species over 390 time, accompanied with retention or loss of some duplicated genes (Dehal P, 2005). To 391 understand the evolution of WRKY transcription factors family, we identified a total of 1113 392 WRKY proteins in wheat and other 19 species representing the nine major plant lineages. 393 Interestingly, the number of WRKY TFs in many higher plants was more than that in lower 394 plants, which implied that the WRKY TFs might play significant roles during evolution from 395 simpler unicellular to more complex multicellular forms. Thus, we could preliminary 396 speculate that multiple rounds of duplication during evolution might be the main reason for 397 the different numbers of the WRKY proteins in diverse species. 398 Compared with the species analyzed in this study, the wheat (Triticum aestivum L.) 399 genome contained the highest number of WRKY TFs (171). The expansion of WRKY gene 400 family in wheat might be due to the following reasons. (1) Triticum aestivum L. is an 401 allohexaploid, and originated from two recent hybridizations between three diploid 402 progenitors, donors of the A, B and D subgenomes (Glover et al., 2015). Triticum aestivum L. 403 genome experienced the whole genome duplication events after two hybridizations at 404 approximately 0.8 and 0.4 million years ago (MYA), respectively (Glover et al., 2015), which 405 would produce a large number of paralogs (Conant & Wolfe 2008). (2) Likewise, small-scale 406 gene duplication, including segmental and tandem duplication, might be also significant in 407 the evolution of WRKY gene family in wheat. The origin of new genes during evolution was 408 dependent on gene duplication. Gene duplication allows essential genes to undergo mutations 409 in the duplicated copy, suggesting that similar genes would diverge over the long 410 evolutionary time period, and then improving the expansion and evolution of gene family 411 (Conant & Wolfe 2008; De et al., 2008). The current investigate showed that 85 of 171 412 (49.7%) TaWRKY genes evolved from either tandem duplication or segmental duplication. 413 Interestingly, 80 of WRKY genes were segmentally duplicated and only five were tandemly 414 duplicated, implying that high segmental and low tandem duplications existed in TaWRKY 415 genes, consistent with white pear (Huang et al., 2015), grapevine (Wang et al., 2014) and 416 soybean (Song et al., 2016a). Our results showed that number of duplicated genes was mainly 15 PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 417 determined by segmental events, because genes generated from segmental duplication have 418 more chance to be retained due to subfunctionalization or neofunctionalization (Huang et al., 419 2015; Lynch, 2000; Moore & Purugganan, 2005; Wang et al., 2005). Therefore, it was 420 inferred that though tandem duplication has contributed to the expansion of TaWRKY family, 421 segmental duplication probably played a more pivotal role (Zhu Y, 2014). (3) The expansion 422 of gene families along a specific lineage can be due to chance or the result of natural selection. 423 Adaptive expansion of gene families occurs when natural selection would favor additional 424 duplicated genes (Demuth & Hahn, 2009). 425 Increasing research suggests that the WRKY transcription factors are involved in various 426 biological processes, including plant development, responses to biotic and abiotic stresses 427 (Eulgem & Somssich, 2007; Liu et al., 2015; Luo et al., 2013; Rushton et al., 2010; Zhao et 428 al., 2015). Plant hormones, as essential endogenous signal molecules within the plant, can 429 regulate cellular processes, plant growth and development under severe stress conditions 430 (Grove, 1979; Kermode, 2005; Ryu & Cho, 2015). Many evidence indicated that the 431 expression of WRKY genes was affected after hormone treatment (Jiang et al., 2014; Yang et 432 al., 2009). In wheat, a large amount of cis-acting elements responding to phytohormones, 433 such as MeJA, ABA, SA, GA etc., were detected in TaWRKY genes. It suggested that these 434 WRKY genes might regulate growth and development of wheat by functioning as key factors 435 in regulating specific signalling pathways. In addition, WRKY transcription factors were 436 involved in responses to abiotic stresses. For instance, TaWRKY44 in transgenic tobacco 437 confers multiple abiotic stress tolerances, including drought, salt and osmotic stress (Wang et 438 al., 2015). Twelve GmWRKY genes were differentially expressed under salt stress (Song et al., 439 2016a). In this work, a large number of TaWRKY genes contained several cis-acting elements 440 associated with abiotic stresses, such as light, wound, cold, heat, anaerobic induction and 441 drought, implying that a number of WRKY genes in wheat participate in various abiotic 442 stresses. In general, the results indicated that most of TaWRKY genes were involved in 443 multiple biotic and abiotic stresses, which was consistent with previous study (Eulgem & 444 Somssich, 2007; Jiang et al., 2014). 445 446 Drought is one of the most significant stresses resulting in reduction of wheat production (Keating et al., 2014). The grain-filling is mainly sustained by photosynthesis of flag leaves 16 PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 447 and spikes under drought treatment, and photosynthesis of spike is less sensitive to drought 448 than that in flag leaves (Jia et al., 2015; Tambussi et al., 2005). However, the investigation on 449 TaWRKY genes was mostly focus on root and leaves, and limited in spikes. In this study, we 450 determined relative expression of TaWRKY genes in flag leaves, glumes and lemmas during 451 the grain-filling period upon water deficit using qRT-PCR. Obvious differences of gene 452 expression pattern between flag leaves and spikes (glumes and lemmas) were observed in 453 wheat. For example, TaWRKY142 were up regulated at 3 DAA in flag leaves, but induced in 454 glumes and lemmas since the imposition of the water stress and maintained upregulation 455 during the early grain-filling stage, which suggested the putative role of TaWRKY142 gene in 456 spike tissues. In addition, the majority of TaWRKYs in a group shared similar expression 457 pattern, but some were not. For instance, TaWRKY1, 120 and 133, the orthologs to AtWRKY3, 458 all belong to Group I. Both TaWRKY120 and TaWRKY133 were up regulated at 3DAA and 459 the middle-late grain filling stage in flag leaves, while induction of TaWRKY1 were only 460 observed at 3DAA. The structure analysis showed that TaWRKY120 and TaWRKY133 have 461 similar exon-intron structure and same motif composition, while TaWRKY1 showed 462 difference with them, which may help to explain why TaWRKYs in a group do not necessarily 463 shared the similar expression pattern. The similar phenomenon was also reported in Salvia 464 miltiorrhiza. Five genes (SmWRKY2, 24, 39, 54 and 55), belonging to Group I, were 465 predominantly expressed in roots, whereas the other Group I members, including SmWRKY42, 466 13 and 60 were mainly expressed in stems, leaves and flowers, respectively (Li et al., 2015). 467 Conclusion 468 In this study, our results provide a more extensive knowledge on WRKYs in wheat. With the 469 whole wheat genome sequence, 171 TaWRKYs were identified, and compared with proteins 470 from other 19 species representing nine major plant lineages to gain preliminary insight into 471 the evolution of the WRKY family in Plantae. A phylogenetic analysis, coupled with gene 472 structure analysis and motif determination, divided these TaWRKYs into seven subgroups 473 (Group I, IIa-e, III). Chromosomal location showed that the most TaWRKY genes were 474 enriched on four chromosomes, especially on chromosome 3B, and 85 (49.7%) genes were 475 either tandem or segmental duplication. The cis-acting elements analysis revealed putative 17 PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 476 functions of WRKYs in wheat during development as well as under numerous biotic and 477 abiotic stresses. Finally, the differential expression of TaWRKY genes in flag leaves, glumes 478 and lemmas under drought stress were analyzed, and we found different TaWRKY genes 479 preferentially express in specific tissue during the grain-filling stage. Taken together, our 480 results will help to complete the information of WRKY gene family in wheat, and also 481 contribute to screen more candidate genes for further investigation on function 482 characterization of WRKYs under various stresses. 483 ACKNOWLEDGEMENTS 484 We are grateful to Mr. Dangxiao Liu and Mr. Huawei Liu for providing valuable suggestions. 485 ADDITIONAL INFORMATION AND DECLARATIONS 486 Funding 487 This study was supported by the projects of the National Natural Science Foundation of 488 China (grant number 31271624). The funders had no role in study design, data collection and 489 analysis, decision to publish, or preparation of the manuscript. 490 Grant Disclosures 491 The following grant information was disclosed by the authors: 492 National Natural Science Foundation of China: 31271624. 493 Competing financial interests 494 The authors declare no competing financial interests. 495 Author Contributions 496 Pan Ning conceived and designed the experiments, performed the experiments, analyzed 497 the data, wrote the paper, prepared the figures and/or tables, reviewed drafts of the paper. 498 499 Congcong Liu conceived and designed the experiments, performed the experiments, analyzed the data, prepared figures and /or tables. 500 Jingquan Kang contributed materials, performed the experiments. 501 Jinyin Lv conceived and designed the experiments, contributed regents, reviewed drafts of 502 the paper. 18 PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 503 Data Availability 504 The following information was supplied regarding data availability: 505 The raw data has been supplied as Supplemental Information. 506 REFERENCES 507 508 509 510 511 512 513 514 515 516 517 518 519 520 521 522 523 524 525 526 527 528 529 530 531 532 533 534 535 536 537 538 539 540 541 542 543 Ahuja I, de Vos RC, Bones AM, and Hall RD. 2010. Plant molecular stress responses face climate change. Trends in Plant Science 15:664-674. DOI:10.1016/j.tplants.2010.08.002. Bailey TL, Boden M, Buske FA, Frith M, Grant CE, Clementi L, Ren J, Li WW, and Noble WS. 2009. MEME SUITE: tools for motif discovery and searching. Nucleic Acids Research 37:W202-208. DOI: 10.1093/nar/gkp335. Bakshi M, and Oelmüller R. 2014. WRKY transcription factors. Plant Signaling & Behavior 9:e27700. DOI: 10.4161/psb.27700. Bi C, Xu Y, Ye Q, Yin T, and Ye N. 2016. Genome-wide identification and characterization of WRKY gene family in Salix suchowensis. PeerJ 4:e2437. DOI: 10.7717/peerj.2437. Buscaill P, and Rivas S. 2014. Transcriptional control of plant defence responses. Current Opinion in Plant Biology 20:35-46. DOI: 10.1016/j.pbi.2014.04.004. Chen WLXWRLWLK. 2014. Genome-Wide Analysis of the NADK Gene Family in Plants. PLoS One 9:e101051-e101051. DOI: 10.1371/journal.pone.0101051. Chi Y, Yang Y, Zhou Y, Zhou J, Fan B, Yu JQ, and Chen Z. 2013. Protein-protein interactions in the regulation of WRKY transcription factors. Molecular Plant 6:287-300. DOI: 10.1093/mp/sst026. Conant GC, and Wolfe KH. 2008. Turning a hobby into a job: how duplicated genes find new functions. Nature reviews. Genetics 9:938-950. DOI: 10.1038/nrg2482. De GA, Lanave C, and Saccone C. 2008. Genome duplication and gene-family evolution: the case of three OXPHOS gene families. Gene 421:1-6. DOI: 10.1016/j.gene.2008.05.011. Dehais SRMVPRP. 1999. PlantCARE, a plant cis-acting regulatory element database. Nucleic Acids Research 27:295-296. Dehal P BJ. 2005. Two Rounds of Whole Genome Duplication in the Ancestral Vertebrate. PLoS Biology 3. DOI: 10.1371/journal.pbio.0030314.g001. Demuth JP, and Hahn MW. 2009. The life and death of gene families. BioEssays 31:29-39. 10.1002/bies.080085 Diao WP, Snyder JC, Wang SB, Liu JB, Pan BG, Guo GJ, and Wei G. 2016. Genome-Wide Identification and Expression Analysis of WRKY Gene Family in Capsicum annuum L. Froniter in Plant Sience 7:211. DOI: 10.3389/fpls.2016.00211. Ding ZJ, Yan JY, Li CX, Li GX, Wu YR, and Zheng SJ. 2015. Transcription factor WRKY46 modulates the development of Arabidopsis lateral roots in osmotic/salt stress conditions via regulation of ABA signaling and auxin homeostasis. The The Plant Journal: for Cell and Molecular Biologyournal: for Cell and Molecular Biology 84:56-69. DOI: 10.1111/tpj.12958. Eulgem T, Rushton. PJ, Robatzek. S, and Somssich IE. 2000. The WRKY superfamily of plant transcription factors. Trends in Plant Science 5:199 - 206. DOI: 10.1016/S1360-1385(00)01600-9. Eulgem T, and Somssich IE. 2007. Networks of WRKY transcription factors in defense signaling. Current Opinion in Plant Biology 10:366-371. DOI: 10.1016/j.pbi.2007.04.020. Finn RD, Clements J, and Eddy SR. 2011. HMMER web server: interactive sequence similarity searching. Nucleic Acids Research 39:W29-37. DOI: 10.1093/nar/gkr367. 19 PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 544 545 546 547 548 549 550 551 552 553 554 555 556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 Finn RD, Coggill P, Eberhardt RY, Eddy SR, Mistry J, Mitchell AL, Potter SC, Punta M, Qureshi M, Sangrador-Vegas A, Salazar GA, Tate J, and Bateman A. 2016. The Pfam protein families database: towards a more sustainable future. Nucleic Acids Research 44:D279-285. DOI: 10.1093/nar/gkv1344. Glover NM, Daron J, Pingault L, Vandepoele K, Paux E, Feuillet C, and Choulet F. 2015. Small-scale gene duplications played a major role in the recent evolution of wheat chromosome 3B. Genome Biologyogy 16:188. DOI: 10.1186/s13059-015-0754-6. Goel R, Pandey A, Trivedi PK, and Asif MH. 2016. Genome-Wide Analysis of the Musa WRKY Gene Family: Evolution and Differential Expression during Development and Stress. Froniter in Plant Sience 7:299. DOI: 10.3389/fpls.2016.00299. Grove MDS, G. F.; Rohwedder, W. K.; Mandava, N.; Worley, J. F.; Warthen, J. D.; Steffens, G. L.; Flippen-Anderson, J. L.; Cook, J. C. 1979. Brassinolide, a plant growth-promoting steroid isolated from Brassica napus pollen. Nature 281:216-217. He Y, Mao S, Gao Y, Zhu L, Wu D, Cui Y, Li J, and Qian W. 2016. Genome-Wide Identification and Expression Analysis of WRKY Transcription Factors under Multiple Stresses in Brassica napus. PLoS One 11:e0157558. DOI: 10.1371/journal.pone.0157558. Hu B, Jin J, Guo AY, Zhang H, Luo J, and Gao G. 2015. GSDS 2.0: an upgraded gene feature visualization server. Bioinformatics 31:1296-1297. DOI: 10.1093/bioinformatics/btu817. Huang X, Li K, Xu X, Yao Z, Jin C, and Zhang S. 2015. Genome-wide analysis of WRKY transcription factors in white pear (Pyrus bretschneideri) reveals evolution and patterns under drought stress. BMC Genomics 16:1104. DOI: 10.1186/s12864-015-2233-6. Ishiguro S, and Nakamura K. 1994. Characterization of a cDNA encoding a novel DNA-binding protein, SPF1, that recognizes SP8 sequences in the 50 upstream regions of genes coding for sporamin and beta-amylase from sweet potato. Molecular & General Genetics 244:563 - 571. DOI: 10.1007/BF00282746. Jia S, Lv J, Jiang S, Liang T, Liu C, and Jing Z. 2015. Response of wheat ear photosynthesis and photosynthate carbon distribution to water deficit. Photosynthetica 53:95-109. 10.1007/s11099-015-0087-4. Jiang M, Liu QE, Liu ZN, Li JZ, and He CM. 2016. Over-expression of a WRKY transcription factor gene BoWRKY6 enhances resistance to downy mildew in transgenic broccoli plants. Australasian Plant Pathology 45:327-334. DOI: 10.1007/s13313-016-0416-5. Jiang Y, Duan Y, Yin J, Ye S, Zhu J, Zhang F, Lu W, Fan D, and Luo K. 2014. Genome-wide identification and characterization of the Populus WRKY transcription factor family and analysis of their expression in response to biotic and abiotic stresses. Journal of Experimental Botany 65:6629-6644. DOI: 10.1093/jxb/eru381. Jimmy JL, and Babu S. 2015. Role of OsWRKY transcription factors in rice disease resistance. Tropical Plant Pathology 40:355-361. DOI: 10.1007/s40858-015-0058-0. Johnson CS. 2002. TRANSPARENT TESTA GLABRA2, a Trichome and Seed Coat Development Gene of Arabidopsis, Encodes a WRKY Transcription Factor. The Plant Cell Online 14:1359-1375. DOI: 10.1105/tpc.001404. Keating BA, Herrero M, Carberry PS, Gardner J, and Cole MB. 2014. Food wedges: Framing the global food demand and supply challenge towards 2050. Global Food Security 3:125-132. DOI: 10.1016/j.gfs.2014.08.004. Kermode AR. 2005. Role of Abscisic Acid in Seed Dormancy. Journal of Plant Growth Regulation 24:319-344. DOI: 10.1007/s00344-005-0110-2. Kim CY, Vo KTX, Nguyen CD, Jeong DH, Lee SK, Kumar M, Kim SR, Park SH, Kim JK, and Jeon JS. 2016. 20 PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 588 589 590 591 592 593 594 595 596 597 598 599 600 601 602 603 604 605 606 607 608 609 610 611 612 613 614 615 616 617 618 619 620 621 622 623 624 625 626 627 628 629 630 631 Functional analysis of a cold-responsive rice WRKY gene, OsWRKY71. Plant Biotechnology Reports 10:13-23. DOI: 10.1007/s11816-015-0383-2. Knight H, and Knight MR. 2001. Abiotic stress signalling pathways: specificity and cross-talk. Trends in Plant Science 6:262-267. DOI: 10.1016/S1360-1385(01)01946-X. Kumar S, Stecher G, and Tamura K. 2016. MEGA7: Molecular Evolutionary Genetics Analysis Version 7.0 for Bigger Datasets. Molecular Biology and Evolution 33:1870-1874. DOI: 10.1093/molbev/msw054. Lagace M, and Matton DP. 2004. Characterization of a WRKY transcription factor expressed in late torpedo-stage embryos of Solanum chacoense. Planta 219:185-189. DOI: 10.1007/s00425-004-1253-2 Lescot M DP, Thijs G, Marchal K, Moreau Y, Van de Peer Y, Rouzé P, Rombauts S. 2002. PlantCARE, a database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequences. Nucleic Acids Research 30:325-327. DOI: 10.1093/nar/30.1.325. Li C, Li D, Shao F, and Lu S. 2015. Molecular cloning and expression analysis of WRKY transcription factor genes in Salvia miltiorrhiza. BMC Genomics 16:200. DOI: 10.1186/s12864-015-1411-x. Li MY, Xu ZS, Tian C, Huang Y, Wang F, and Xiong AS. 2016. Genomic identification of WRKY transcription factors in carrot (Daucus carota) and analysis of evolution and homologous groups for plants. Scientific Reports 6:23101. DOI: 10.1038/srep23101. Lippok MKMBISB. 2003. Members of the Arabidopsis WRKY Group III Transcription Factors Are Part of Different Plant Defense Signaling Pathways. MOLECULAR PLANT-MICROBE INTERACTIONS : MPMI 16:295–305. DOI: 10.1094/MPMI.2003.16.4.295. Liu H, Sun M, Du D, Pan H, Cheng T, Wang J, and Zhang Q. 2015. Whole-Transcriptome Analysis of Differentially Expressed Genes in the Vegetative Buds, Floral Buds and Buds of Chrysanthemum morifolium. PLoS One 10:e0128009. DOI: 10.1371/journal.pone.0128009. Livak KJ, and Schmittgen TD. 2001. Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method. Methods 25:402-408. DOI: 10.1006/meth.2001.1262. Luo X, Sun X, Liu B, Zhu D, Bai X, Cai H, Ji W, Cao L, Wu J, Wang M, Ding X, and Zhu Y. 2013. Ectopic expression of a WRKY homolog from Glycine soja alters flowering time in Arabidopsis. PLoS One 8:e73295. DOI: 10.1371/journal.pone.0073295. Lynch M. 2000. The Evolutionary Fate and Consequences of Duplicate Genes. Science 290:1151-1155. DOI: 10.1126/science.290.5494.1151. Moore RC, and Purugganan MD. 2005. The evolutionary dynamics of plant duplicate genes. Current Opinion in Plant Biology 8:122-128. DOI: 10.1016/j.pbi.2004.12.001. Mwadzingeni L, Shimelis H, Dube E, Laing MD, and Tsilo TJ. 2016. Breeding wheat for drought tolerance: Progress and technologies. Journal of Integrative Agriculture 15:935-943. DOI: 10.1016/s2095-3119(15)61102-9. Niu CF, Wei W, Zhou QY, Tian AG, Hao YJ, Zhang WK, Ma B, Lin Q, Zhang ZB, Zhang JS, and Chen SY. 2012. Wheat WRKY genes TaWRKY2 and TaWRKY19 regulate abiotic stress tolerance in transgenic Arabidopsis plants. Plant Cell Environ 35:1156-1170. DOI: 10.1111/j.1365-3040.2012.02480.x. Okay S, Derelli E, and Unver T. 2014. Transcriptome-wide identification of bread wheat WRKY transcription factors in response to drought stress. Molecular Genetics and Genomics: MGG 289:765-781. DOI: 10.1007/s00438-014-0849-x. Raineri J, Hartman MD, Chan RL, Iglesias AA, and Ribichich KF. 2016. A sunflower WRKY transcription factor stimulates the mobilization of seed-stored reserves during germination and post-germination growth. Plant Cell Reports 35:1875-1890. DOI: 10.1007/s00299-016-2002-2. Ranjan A, and Sawant S. 2014. Genome-wide transcriptomic comparison of cotton (Gossypium herbaceum) leaf 21 PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 632 633 634 635 636 637 638 639 640 641 642 643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661 662 663 664 665 666 667 668 669 670 671 672 673 674 675 and root under drought stress. 3 Biotech 5:585-596. DOI: 10.1007/s13205-014-0257-2. Ricachenevsky FK, Sperotto RA, Menguer PK, and Fett JP. 2010. Identification of Fe-excess-induced genes in rice shoots reveals a WRKY transcription factor responsive to Fe, drought and senescence. Molecular Biology Reports 37:3735-3745. DOI: 10.1007/s11033-010-0027-0. Rinerson CI, Rabara RC, Tripathi P, Shen QJ, and Rushton PJ. 2015. The evolution of WRKY transcription factors. BMC Plant Biology 15:66. DOI: 10.1186/s12870-015-0456-y. Rushton DL, Tripathi P, Rabara RC, Lin J, Ringler P, Boken AK, Langum TJ, Smidt L, Boomsma DD, Emme NJ, Chen X, Finer JJ, Shen QJ, and Rushton PJ. 2012. WRKY transcription factors: key components in abscisic acid signalling. Plant Biotechnology Journal 10:2-11. DOI:10.1111/j.1467-7652.2011.00634.x. Rushton PJ, Somssich IE, Ringler P, and Shen QJ. 2010. WRKY transcription factors. Trends in Plant Science 15:247-258. DOI: 10.1016/j.tplants.2010.02.006. Ryu H, and Cho Y-G. 2015. Plant hormones in salt stress tolerance. Journal of Plant Biology 58:147-155. DOI: 10.1007/s12374-015-0103-z. Sakuraba Y, Han SH, Yang HJ, Piao W, and Paek NC. 2016. Mutation of Rice Early Flowering3.1 (OsELF3.1) delays leaf senescence in rice. Plant Molecular Biology 92:223-234. DOI: 10.1007/s11103-016-0507-2. Song H, Wang P, Hou L, Zhao S, Zhao C, Xia H, Li P, Zhang Y, Bian X, and Wang X. 2016a. Global Analysis of WRKY Genes and Their Response to Dehydration and Salt Stress in Soybean. Froniter in Plant Sience 7:9. DOI: 10.3389/fpls.2016.00009. Song H, Wang P, Lin JY, Zhao C, Bi Y, and Wang X. 2016b. Genome-Wide Identification and Characterization of WRKY Gene Family in Peanut. Froniter in Plant Sience 7:534. DOI: 10.3389/fpls.2016.00534. Song XM, Huang ZN, Duan WK, Ren J, Liu TK, Li Y, and Hou XL. 2014. Genome-wide analysis of the bHLH transcription factor family in Chinese cabbage (Brassica rapa ssp. pekinensis). Molecular Genetics and Genomics: MGG 289:77-91. DOI: 10.1007/s00438-013-0791-3. Tambussi EA, Nogues S, and Araus JL. 2005. Ear of durum wheat under water stress: water relations and photosynthetic metabolism. Planta 221:446-458. DOI: 10.1007/s00425-004-1455-7. Ulker B, and Somssich IE. 2004. WRKY transcription factors: from DNA binding towards biological function. Current Opinion in Plant Biology 7:491-498. DOI: 10.1016/j.pbi.2004.07.012. Wang M, Vannozzi A, Wang G, Liang YH, Tornielli GB, Zenoni S, Cavallini E, Pezzotti M, and Cheng ZM. 2014. Genome and transcriptome analysis of the grapevine (Vitis vinifera L.) WRKY gene family. Horticulture Research 1:14016. DOI: 10.1038/hortres.2014.16. Wang X, Shi X, Hao B, Ge S, and Luo J. 2005. Duplication and DNA segmental loss in the rice genome: implications for diploidization. The New Phytologist 165:937-946. 10.1111/j.1469-8137.2004.01293.x Wang X, Zeng J, Li Y, Rong X, Sun J, Sun T, Li M, Wang L, Feng Y, Chai R, Chen M, Chang J, Li K, Yang G, and He G. 2015. Expression of TaWRKY44, a wheat WRKY gene, in transgenic tobacco confers multiple abiotic stress tolerances. Froniter in Plant Sience 6:615. DOI: 10.3389/fpls.2015.00615. Wei Y, Shi H, Xia Z, Tie W, Ding Z, Yan Y, Wang W, Hu W, and Li K. 2016. Genome-Wide Identification and Expression Analysis of the WRKY Gene Family in Cassava. Froniter in Plant Sience 7:25. DOI: 10.3389/fpls.2016.00025. Xie Z, Zhang ZL, Hanzlik S, Cook E, and Shen QJ. 2007. Salicylic acid inhibits gibberellin-induced alpha-amylase expression and seed germination via a pathway involving an abscisic-acid-inducible WRKY gene. Plant Molecular Biology 64:293-303. DOI: 10.1007/s11103-007-9152-0. Xie Z, Zhang ZL, Zou X, Huang J, Ruas P, Thompson D, and Shen QJ. 2005. Annotations and functional analyses of the rice WRKY gene superfamily reveal positive and negative regulators of abscisic acid signaling in aleurone cells. Plant Physiology 137:176-189. DOI: 10.1104/pp.104.054312. 22 PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 676 677 678 679 680 681 682 683 684 685 686 687 688 689 690 691 692 693 694 695 696 697 698 699 700 701 Yang B, Jiang Y, Rahman MH, Deyholos MK, and Kav NN. 2009. Identification and expression analysis of WRKY transcription factor genes in canola (Brassica napus L.) in response to fungal pathogens and hormone treatments. BMC Plant Biology 9:68. DOI: 10.1186/1471-2229-9-68. Yu F, Huaxia Y, Lu W, Wu C, Cao X, and Guo X. 2012. GhWRKY15, a member of the WRKY transcription factor family identified from cotton (Gossypium hirsutum L.), is involved in disease resistance and plant development. BMC Plant Biology 12. DOI: 10.1186/1471-2229-12-144. Zhang CQ, Xu Y, Lu Y, Yu HX, Gu MH, and Liu QQ. 2011. The WRKY transcription factor OsWRKY78 regulates stem elongation and seed development in rice. Planta 234:541-554. DOI: 10.1007/s00425-011-1423-y. Zhang H, Zhao M, Song Q, Zhao L, Wang G, and Zhou C. 2016. Identification and function analyses of senescence-associated WRKYs in wheat. Molecular cell biology research communications: MCBRC 474:761-767. DOI: 10.1016/j.bbrc.2016.05.034. Zhang Y, and Wang Z. 2011. Photosynthetic diurnal variation characteristics of leaf and non-leaf organs in winter wheat under different irrigation regimes. Acta Ecologica Sinica 31:1312-1322. Zhang ZL, Xie Z, Zou X, Casaretto J, Ho TH, and Shen QJ. 2004. A rice WRKY gene encodes a transcriptional repressor of the gibberellin signaling pathway in aleurone cells. Plant Physiology 134:1500-1513. DOI: 10.1104/pp.103.034967. Zhao JL, Wang YL, Yao DQ, Zhu WY, Chen L, He HL, Pan JS, and Cai R. 2015. Transcriptome profiling of trichome-less reveals genes associated with multicellular trichome development in Cucumis sativus. Molecular Genetics and Genomics: MGG 290:2007-2018. DOI: 10.1007/s00438-015-1057-z. Zhu X, Liu S, Meng C, Qin L, Kong L, and Xia G. 2013. WRKY Transcription Factors in Wheat and Their Induction by Biotic and Abiotic Stress. Plant Molecular Biology Reporter 31:1053-1067. DOI: 10.1007/s11105-013-0565-4. Zhu Y WN, Song W, Yin G, Qin Y, Yan Y, Hu Y. 2014. Soybean (Glycine max) expansin gene superfamily origins: segmental and tandem duplication events followed by divergent selection among subfamilies. BMC Plant Biology 14. DOI: 10.1186/1471-2229-14-93. 702 23 PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 Figure 1(on next page) Evolutionary relationship of 20 species among nine lineages within the Plantae. PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 Species Number of WRKYs Brachypodium distachyon 86 Triticum aestivum 171 Oryza sativa 88 Zea mays 163 Sorghum bicolor 96 Populus trichocarpa 122 Cucumis sativus 88 Arabidopsis thaliana 91 Basal Magnoliophytas Amborella trichopoda 33 Gymnosperms Picea sitchensis Lycophytes 8 Selaginella moellendorffii 35 Bryophytes Physcomitrella patens 117 Monocots Eudicots Chlorophytes Rhodophytes Glaucophytes Ostreococcus lucimarinus 2 Ostreococcus tauri 6 Micromonas pusilla CCMP1545 2 Coccomyxa subellipsoidea C-169 1 Volvox carteri 4 Galdieria sulphuraria 0 Cyanidioschyzon merolae 0 Cyanophora paradoxa 0 PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 Figure 2(on next page) Phylogenetic tree of WRKY domains from wheat and Arabidopsis. PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 AtWRKY16 AtWRKY16 AtWRKY29 AtWRKY22 AtWRKY27 TaWRKY16 3 AtWRKY35 TaWRKY5 TaWRK 1 Y 3 TaW TaW RKY100 TaW RKY107 TaW RKY119 R TaW KY15 AtW RKY10 5 Ta RKY1 3 AtWWRKY5 4 At RKY 5 At WRK 69 Ta WRK Y69 TaWWRKYY65 A R 54 Ta tWR KY89 AtWWRKKY21 AtW 60 RK Y22 Y T T aW RKY 15 T aW RKY 15 A aW RK 80 Ta tWR RKY Y91 A W K 1 T tW R Y3 58 T aW R KY 9 T aW R KY 14 Ta aW RK KY1 7 8 W RK Y1 59 RK Y2 54 Y3 7 7 94 82 65 73 65 60 RKY 8 AtW RKY118 AtW RKY 4 AtWRKY1486 Y TaWaWRKY156 T RK Y87 K TaW R Y40 TaWWRKY1145 At RK Y16 7 TaWWRK Y16 13 Ta WRK KY1 170 Ta WR KY 115 1 Ta aWR RKY 16 6 T aW RKY KY3 Y9 R K 7 T W Ta tW R KY4 Y6 W A At WR RK Y42 At AtW RK W At 74 65 5 Y3 1 RK Y 1 9 W RK Y 3 3 Ta tW RK KY7 81 A tW R KY 1 A aW R KY7 3 T aW R Y9 7 T aW RK Y1 T aW RK Y67 T tW RK 11 A aW RKY 74 T tW RKY 151 A tW KY 9 A aWR KY5 46 T tWR KY1 1 A WR KY12 2 Ta WR Y12 Ta WRK Y169 80 Ta WRK Y112 Ta WRK 46 Ta WRKY 57 72 At WRKY 8 Ta RKY5 8 91 TaW RKY7 7 TaW RKY13 TaW RKY70 AtW RKY41 AtW RKY41 71 AtW Y53 AtWRK Y30 AtWRK Y110 82 TaWRK 13 TaWRKY 73 TaWRKY46 TaWRKY126 TaWRKY134 TaWRKY29 AtWRKY55 TaWRKY95 TaWRKY153 TaWRKY24 TaWRKY92 TaWRKY TaWRK 84 TaWRK Y40 AtWRK Y157 AtW YX TaW RKY55 95 TaWRRKY101 TaW KY59 93 TaW RKY13 78 TaW RKY60 6 RKY7 TaW T a RKY 6 96 Ta WRKY 72 94 TaWWRKY665 T a RK 6 68 Ta WRK Y74 Ta WR Y70 98 Ta WR KY10 92 61 TaWWRKKY5 T R Y4 T aW KY T aW RKY 14 Ta aWR RKY5 9 9967 T W K 6 T aW RK Y44 A aW RK Y6 A tW R Y1 4 At tW RK KY13 32 A W R Y 5 At tW RK KY6 67 W RK Y 6 7 RK Y 6 4 Y6 3 6 62 T A aW A tW R Ta tW RK KY1 R Y 9 W Ta AtW RK KY6 45 W A R RK Y52 8 Ta tWR KY1 Y28 Ta WR KY 30 A WR KY5 48 At tWR KY4 0 AtWWRK KY718 At R Y4 T WR KY4 3 70 Ta aWR KY1 3 TaWWRKYKY212 TaW RKY 102 106 R K TaW Y1 TaW RKY305 TaW RKY3 8 TaW RKY454 82 TaW RKY49 RK TaW Y129 AtWR RKY7 82 TaWR KY50 KY TaW 139 TaWRKRKY2 Y4 2 TaWRKY6 63 3 TaWRKY1 TaWRKY535 AtWRKY51 AtWRKY8 TaWRKY31 AtWRKY57 TaWRKY97 TaWRKY12 TaWRKY123 TaWRKY90 TaWRKY149 TaWRKY207 AtWRKY5 23 AtWRKY Y6 TaWRKY16 K TaWRKY117 TaWR RKY17 TaW RKY18 TaW KY116 R TaW tWRKYX6 A KY16 8 R TaW RKY1124 TaW RKY1 TaW 69 8 Y3 2 RK Y 6 2 5 W RK Y1 40 At tW RK KY1 143 A aW R KY 43 T W R 7 Ta aW RKY Y14 5 T aW RK Y8 0 T aW RK Y16 2 T aW RK 14 T W KY 4 714 9 Ta aWRRKY5 11 T tW KY 2 A aWR KY6 27 R Y1 T 80 TaWWRKKY49 8 Ta tWR KY12 A WR Y61 Ta WRK Ta Ta T WR At aWR KY1 Ta TaWWRK KY4 31 W At RK RKY Y31 7 W 6 Ta AtW RK Y10 8 Ta WR RKY Y72 8 68 Ta WRK KY1 61 WR Y 64 1 At K 0 AtWWRKY1624 Y A RKY 19 At tWR 19 60 At WRK KY AtWWRKYY4 TaW RK 3 AtW RKY6Y4 TaW RKY3 9 T a RKY 2 TaWWRKY975 4 R TaW KY150 TaWR RKY82 AtWRKY141 K Y 5 TaWR 8 AtWR KY30 TaWRKKY10 Y7 AtWRKY27 TaWRKY2 TaWRKY263 TaWRKY96 AtWRKY33 TaWRKY33 TaWRKY88 TaWRKY39 AtWRKY26 AtWRKY34 TaWRKY133 AtWRKY25 97 TaWRKY145 83 TaWRKYY9 9 TaWRK Y44 AtWRKKY44 92 AtWR RKY1 TaW Y120 K 71 TaWRWRKY8 a T 11 RKY1 TaW RKY1098 79 TaW RKY1671 TaW RKY1Y20 TaWtWRK Y20 A RK 26 AtW RKY152 AtWRKY Y98 K TaWaWR KY791 T WR KY4 2 Ta WR KY3 5 73 a T WR KY2 Y1 Ta WR RK Y1 Ta AtW RK 56 Y 4 W At RK KY2 13 W At tWR RKY 138 5 A tW KY Y7 8 A WR RK Y2 36 K Ta AtWWR RKY Ta aW T PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 Figure 3(on next page) Chromosome distribution of TaWRKY genes. The chromosomal position of each TaWRKY was mapped according to the wheat genome. The chromosome numbers were shown at the top of each chromosome. Fifty six TaWRKYs on the scaffold (TaWRKY115-171) could not be anchored onto any specific chromosome. The location of each WRKY gene was pointed out by a line. The scale is in mega bases (Mb). The black boxes indicated the tandem duplication genes, and the segmental duplicated genes were shown with red dots. PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 Chr 1A Chr 1D Chr 1B 25 TaWRKY1 TaWRKY2 75 TaWRKY3 TaWRKY14 TaWRKY15 TaWRKY16 TaWRKY17 TaWRKY18 Mb 125 150 TaWRKY29 TaWRKY22 TaWRKY30 TaWRKY38 TaWRKY39 TaWRKY40 TaWRKY41 Chr 3A Chr 3D Chr 4A Chr 4B TaWRKY65 TaWRKY63 TaWRKY42 Chr 3B TaWRKY69 TaWRKY70 TaWRKY71 TaWRKY64 TaWRKY43 TaWRKY66 TaWRKY67 TaWRKY45 TaWRKY46 TaWRKY72 TaWRKY47 TaWRKY48 TaWRKY44 TaWRKY68 TaWRKY33 TaWRKY4 TaWRKY5 TaWRKY6 TaWRKY7 TaWRKY27 TaWRKY9 TaWRKY10 TaWRKY11 275 TaWRKY34 TaWRKY35 TaWRKY12 TaWRKY13 Chr 4D 0 Chr 5A Chr 5B 75 Mb 100 125 150 175 Chr 6A Chr 6B Chr 6D Chr 7A Chr 7B TaWRKY108 TaWRKY101 25 50 Chr 5D Chr 7D TaWRKY73 TaWRKY74 TaWRKY81 TaWRKY75 TaWRKY82 TaWRKY83 TaWRKY76 TaWRKY77 TaWRKY78 TaWRKY79 TaWRKY80 TaWRKY106 TaWRKY94 TaWRKY95 TaWRKY96 TaWRKY111 TaWRKY84 TaWRKY85 TaWRKY86 TaWRKY87 TaWRKY88 TaWRKY110 TaWRKY92 TaWRKY93 TaWRKY107 TaWRKY97 TaWRKY98 TaWRKY99 TaWRKY100 TaWRKY109 TaWRKY102 TaWRKY103 TaWRKY104 TaWRKY105 TaWRKY112 200 225 TaWRKY89 TaWRKY90 TaWRKY91 TaWRKY113 TaWRKY114 TaWRKY115 Group I Subgroup IIa Subgroup IIb Subgroup IIc Subgroup IId Subgroup Group PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec:IIe 15 Feb 2017,III publ: 15 Feb 2017 TaWRKY49 TaWRKY50 TaWRKY51 TaWRKY52 TaWRKY53 TaWRKY54 TaWRKY55 TaWRKY56 TaWRKY57 TaWRKY58 TaWRKY59 TaWRKY60 TaWRKY61 TaWRKY62 0 25 50 75 100 125 150 175 200 225 250 275 300 325 350 375 400 425 450 475 500 525 550 575 600 625 650 675 700 725 750 Mb TaWRKY23 TaWRKY24 TaWRKY25 TaWRKY26 200 250 TaWRKY20 TaWRKY21 Chr 2D TaWRKY36 TaWRKY37 TaWRKY31 TaWRKY32 175 225 Chr 2B TaWRKY28 TaWRKY8 50 100 Chr 2A TaWRKY19 0 Figure 4(on next page) Schematic diagram of conserved motifs in TaWRKYs. PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 (A) Name p-value Name Motif Location p-value Motif Location IIa IIb I IId IIe III IIc Motif 2 Motif 10 Motif 7 Motif 6 PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 Motif 9 Motif 8 Motif 5 Motif 3 Motif 4 Motif 1 (B) Figure 5(on next page) The number of TaWRKY genes containing various cis-acting elements. The cis-acting elements were identified with the online PlantCARE program using the 1.5kb upstream from the transcription start site of TaWRKY genes. The graph was generated based on the presence of cis-acting element responsive to specific elicitors/conditions/processes (xaxis) in WRKY gene family members (y-axis). PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 0 TATA-box CAAT-box A-box OBP-1 site ATGCAAAT motif 5UTR Py-rich stretch AT-rich element circadian MSA-like E2Fa E2Fb Skn-1_motif GCN4_motif CCGTCC-box CAT-box OCT dOCT NON-box RY-element HD-Zip 1 HD-Zip 2 as1 motif I AC-I AC-II ABRE motif IIb CE3 CE1 GARE-motif P-box TATC-box TCA-element SARE TGA-element AuxRR-core TGA-box AuxRE O2-site ERE CGTCA-motif TGACG-motif TC-rich repeats ARE GC-motif ELI-box3 Box-W1 box E EIRE AT-rich sequence W box box S GCC box WUN-motif JERE LTR C-repeat/DRE HSE MBS CCAAT-box MBSI MRE G-box Sp1 G-Box GAG-motif I-box GT1-motif MNF1 ACE Box I Box 4 GATA-motif GA-motif CATT-motif AE-box TCT-motif TCCC-motif ATCT-motif 3-AF1 binding site rbcS-CMA7a LAMPelement ATC-motif Box III box II TGG-motif chs-Unit 1 m1 chs-CMA2a Box II L-box GTGGC-motif CG-motif C-box Gap-box CAG-motif AAAC-motif 4cl-CMA2b ATCC-motif GATT-motif chs-CMA2c Pc-CMA2c chs-CMA1a chs-CMA2b LS7 CGT-motif as-2-box Number of TaWRKY genes 150 140 130 120 110 100 90 80 70 60 50 40 30 20 10 Transcription Cell cycle Development Hormone Abiotic / biotic stress PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 Type of cis-acting element Figure 6(on next page) The expression pattern of TaWRKY genes in flag leaves, glumes and lemmas during the grain filling stage under drought stress. (A) Bar graphs showing the relative expression values of each TaWRKY genes after drought treatment. Samples were collected 0, 1, 3, 5, 10, 15, and 25 days after anthesis, from which water control was carried out. The mean ± SE of three biological replicates are presented. Relative fold changes were obtained by qRT-PCR using the 2-ΔΔCT method. (B) Heat map showing the expression profile of TaWRKYs in flag leaves, glumes and lemmas under drought stress. Heat map was generated based on log2 transformed count value from three replicates of qRT-PCR data using R language. Red and green boxes indicated high and low expression levels of genes, respectively. Blank indicated the gene was not detected. PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017 15 10 10 Flag leaf 5 5 Glume 5 0 1 3 5 10 15 25 TaWRKY 90 20 15 10 5 0 4 0 1 3 5 10 15 25 Ta WRKY 120 1 3 5 10 15 25 TaWRKY 97 0 20 15 15 10 10 5 5 0 10 0 1 3 5 10 15 25 TaWRKY 123 0 6 0 1 3 5 10 15 25 10 15 25 10 15 25 TaWRKY 112 0 1 3 5 TaWRKY 133 4 6 2 4 1 8 20 0 8 3 0 0 2 2 0 1 3 5 10 15 25 TaWRKY 142 6 0 15 0 1 3 5 10 15 25 0 0 1 3 5 Days after anthesis / d TaWRKY 149 10 4 5 2 0 0 1 3 5 10 15 25 0 0 Days after anthesis / d 1 3 5 10 15 25 Color Key and Histogram 35 Relative expression TaWRKY 31 Lemma 10 25 Relative expression 15 TaWRKY 20 20 0 Relative expression 15 TaWRKY 1 Count Relative expression 25 Days after anthesis / d 0 15 (A) −2 (B) Glume Flag leaf 0d 1d 3d 5d 10d 15d 25d 0d 1d 3d 5d −1 0 Row Z−Score 1 2 Lemma 10d 15d 25d 0d 1d 3d 5d 10d 15d 25d TaWRKY001 TaWRKY020 TaWRKY142 TaWRKY123 TaWRKY031 TaWRKY149 TaWRKY112 TaWRKY097 TaWRKY090 TaWRKY120 TaWRKY133 PeerJ Preprints | https://doi.org/10.7287/peerj.preprints.2809v1 | CC BY 4.0 Open Access | rec: 15 Feb 2017, publ: 15 Feb 2017
© Copyright 2026 Paperzz