Published July 7, 2016 o r i g i n a l r es e a r c h Identification and Expression Profiling of the Lectin Gene Superfamily in Mulberry Bushra Saeed, Vinay K. Baranwal, and Paramjit Khurana* Abstract Core Ideas Lectins are a diverse group of ubiquitously present, highly specific sugar-binding proteins. Members of this large gene family have been assigned broad biological functions from defense to acting as storage proteins. Despite possessing several interesting characteristics, their functions remain essentially undefined. Mulberry (Morus spp.) known for its medicinal benefits is also a rich source of lectins. Using an exhaustive hidden Markov model (HMM)-based search, we identified the lectin gene complement in M. notabilis C.K. Schneid with around 197 members. These putative lectin genes were classified into 12 distinct gene families based on the presence of characteristic sugar-binding domains. Members of this superfamily were assigned varied gene ontologies (GOs) to identify putative functions and determine cellular localizations. Interestingly, characteristic expression patterns were observed across the lectin superfamily in response to a variety of environmental cues. This is suggestive of specialized functions under diverse conditions possibly by linking the specificity of sugar recognition with mediating precise stress responses in plants. The identification of putative gene family members from the genus Morus developed in this study can find wide applicability in lectin gene identification and characterization. It can also contribute immensely in the understanding of lectins from mulberry with potential medicinal uses. • • • Identification of lectin catalogues in Morus notabilis genome Genome-wide expression profiling of lectins Role of lectins in stress responses in mulberry L ectins can be broadly defined as proteins that contain at least one noncatalytic sugar-binding domain and bind a specific sugar reversibly. Lectins have been mostly associated with binding glycans for recognition and defense; however, with recent advances, a new role for plant lectins in stress responses is emerging. The presence of domains other than the sugar-binding domain in lectins suggests that these proteins are not mere sugar-binding proteins but have diverse uncharacterized functions (Van Damme et al., 2008). Previous studies have suggested that lectins are synthesized in response to abiotic and biotic stimulus and insect herbivory (Lannoo and Van Damme, 2010), involved in mediating stress responses (Zhang et al., 2000), symbiosis (Roberts et al., 2013), hormone induction (Cammue et al., 1989; Lannoo et al., 2007; Lannoo and Van Damme, 2010), or signaling (Jin et al., 2009). Lectins are also implicated in interactions with histones in response to stress (Schouppe et al., 2011), providing viral resistance (Yamaji et al., 2012), and having anticancerous properties (Fu et al., 2011; Ochoa-Alvarez Published in Plant Genome Volume 9. doi: 10.3835/plantgenome2015.10.0107 B. Saeed, V.K. Baranwal, and P. Khurana, Dep. of Plant Molecular Biology, Univ. of Delhi South Campus, New Delhi 110021, India. Received 28 Oct. 2015. Accepted 12 Apr. 2016. *Corresponding author ([email protected]). © Crop Science Society of America 5585 Guilford Rd., Madison, WI 53711 USA This is an open access article distributed under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). Abbreviations: ABA, abscissic acid; AD, aerial drying; cDNA, complementary DNA; CS, cold stress; GO, gene ontology; HMM, hidden Markov model; JA, jasmonic acid; RO, reverse osmosis; SA, salicylic acid. the pl ant genome ju ly 2016 vol . 9, no . 2 1 of 13 et al., 2012). Recently, a lectin receptor kinase 1.9 has been identified as a critical receptor for extracellular ATP (Choi et al., 2014). Studies like these indicate the potential involvement of lectins in diverse responses and are suggestive of functions that have not been yet well elucidated. Apart from its critical role in the sericulture industry, mulberry is widely popular for its medicinal properties and has been long exploited in traditional medicine. Mulberry has also been recognized as a good source of lectins with potent biological activities (Khurana and Checker, 2011). For instance, two lectins have been identified in M. alba L. that bind sialic acid present on the surfaces of pathogenic bacteria and cancerous cells (Ratanapo et al., 1998). These lectins were later shown to have antibacterial activity against Pseudomonas syringae pv mori (Ratanapo et al., 2001). Interestingly, two Jacalins, galactose specific MornigraG and a mannose specific MornigraM, have been suggested to act as major storage proteins in the bark of M. nigra L. (Van Damme et al., 2002). Similarly, galactose-specific lectins (Yeasmin et al., 2001) and N-terminal sequences of a mannosespecific lectin (Absar et al., 2005) have also been isolated from mulberry seeds. An antiproliferative lectin, mulberry leaf lectin specific to galactose, galactosamine, and N-acetyl galactosamine was identified in M. alba leaves (Deepa and Priya, 2012). Mulberry leaf lectin has been found to induce apoptosis in breast and colon cancer cells in humans (Deepa et al., 2012). Recently, galactosebinding lectins have also been found to accumulate in mulberry laticifers, which is suggestive of a defense related role (Kitajima et al., 2013). The biosynthesis, structure, and roles of different lectin domains have been extensively discussed elsewhere (Van Damme et al., 1998). Lectins have been recently described in Arabidopsis thaliana (L.) Heynh., soybean [Glycine max (L.) Merr.], and rice (Oryza sativa L.) as a large superfamily containing ~12 families based on the presence of characteristic sugar-binding domains (Jiang et al., 2010). Thus, to understand this complex gene superfamily in mulberry, we identified the putative members from the recently sequenced genome of M. notabilis (He et al., 2013). By using HMM-based search centered on the presence of lectin domain, we isolated ~197 putative lectin genes from the M. notabilis genome. Recent trends have suggested a possible association between lectin genes and stress responses. Accordingly, their expression pattern was also investigated under different stress treatments. The induction of lectin encoding genes provides experimental evidence that corroborates with the previous reports of a potential intersection between lectins and stress responses in plants. Materials and Methods Identification of Lectin Genes Hidden Markov model profiles for domains corresponding to the lectin gene families (Jiang et al., 2010) were retrieved from Pfam database (December, 2014 version). 2 of 13 A HMM search was performed using hmmsearch (using default E-value of 10) from HMMER package 3.1b1 (http:// hmmer.org) on mulberry proteome downloaded from Morus genome database (Li et al., 2014; http://morus. swu.edu.cn/morusdb) and sequences were retrieved using custom written Perl script (Supplemental Data S1). The sequences from mulberry and corresponding sequences identified in Arabidopsis, soybean, and rice (Jiang et al., 2010) were also merged with the previously built HMM profiles. These sequences were used for domain search in NCBI Conserved Domain Database (NBCI-CDD) (Marchler-Bauer et al., 2011) and Simple Modular Architecture Research Tool (SMART) (Schultz et al., 1998; Letunic et al., 2004) databases; domains were fetched and aligned using ClustalX and a custom HMM was build. Finally, the HMM thus generated was used for further searching the mulberry proteome till no new accessions could be identified. To remove redundancy, only those sequences with unique accessions were retrieved. Structure Prediction, Domain Identification, and Phylogenetic Analysis For the identification of gene structure, genomic and messenger RNA sequences of all putative lectins were downloaded from Morus genome database (Li et al., 2014; http://morus.swu.edu.cn/morusdb). Coordinates of corresponding transcripts for all lectin genes identified were fetched from Morus genome database (Li et al., 2014; http://morus.swu.edu.cn/morusdb) and a custom written GFF file prepared. This GFF file along with the genomic sequences of lectin genes were used in CLC Genomics Workbench version 6.0.3 for the construction of gene structures. The lectin genes identified were checked for the presence of the domains by NCBI-CDD (MarchlerBauer et al., 2011) and SMART (Schultz et al., 1998; Letunic et al., 2004) searches. The domain sequences corresponding to the lectin families were employed for multiple sequence alignment using ClustalX version 2.1 (Larkin et al., 2007) with manual editing. To predict the evolutionary dynamics, DIVEIN (Deng et al., 2010) web server was used with the following parameters: substitution model of LG and equilibrium frequencies as optimized. Best of the NNI and subtree pruning and regrafting type of tree improvement was applied. Bootstrap assessment with 100 replications was performed. Gene Expression Analysis Briefly, all the five transcriptomes of M. notabilis representing different tissues (bark, leaf, male flower, root, and winter bud) were aligned together using de novo aligner Trinity 20140413p1 (Grabherr et al., 2011). Trinity script align_and_estimate_abundance.pl included in the same package was used, which rely on RSEM package (Li and Dewey, 2011) of R, to estimate the abundance of the transcripts. RSEM-estimated abundance values were used to construct a matrix with cross sample fragments per kilobase of transcript per million mapped reads (Haas et al., 2013) normalization. Expression analysis using this matrix the pl ant genome ju ly 2016 vol . 9, no . 2 was conducted using run_DE_analysis.pl script from Trinity, which uses edgeR package from bioconductor release 3.0 (Gentleman et al., 2004). Heat maps showing the expression profiles (log2) for these genes were plotted using PtR script included in the Trinity package. BLASTX from blastall version 2.2.26 was used to identify the corresponding transcripts from the assemblies for lectin genes. Stress Treatments For expression analysis, mature leaves of M. indica ‘K2’ plants maintained in University of Delhi South Campus were employed. Detached leaves were immersed in water solution for various treatments. Second and third leaves from the apex were employed for various treatments. For aerial drying (AD), detached leaves were air-dried on a blotting sheet in a culture room maintained at 28°C for 6 h. For cold stress (CS), detached leaves were placed in reverse osmosis (RO) water in an incubator at a temperature of 6 2C for 4 h. For control (Con) treatment, detached leaves were mock treated by immersing in RO water at 28C for required durations. For hormonal treatments, detached leaves were placed in 100 μM abscissic acid (ABA), jasmonic acid (JA), and salicylic acid (SA) at 28C for 4 h. For simulated dehydration stress (DS), detached leaves were placed in 0.5 M mannitol for 6 h. For wounding stress, detached leaves were pricked by needle and placed in RO water for 6 h. Quantitative Real-Time Polymerase Chain Reaction For the quantification of transcripts, quantitative realtime polymerase chain reaction (PCR) was employed as described previously (Saeed et al., 2015). Briefly, total RNA was extracted from stress-treated mulberry leaves following a modified GITC procedure (Chomczynski and Sacchi, 1987) and DNase treated using RNeasy plant mini kit (Qiagen) according to manufacturer’s instructions. First-strand complementary DNA (cDNA) was prepared from 1 μg total RNA in a reaction volume of 25 μL using high-capacity cDNA archive kit (Applied Biosystems). Because of the nonavailability of sequences from M. indica, the available transcriptomes of mulberry, that is, five transcriptomes of M. notabilis and two leaf transcriptomes of M. laevigata Wall. ex Brandis and M. serrata Roxb. (Saeed et al., 2016) were assembled together, and the sequences thus obtained were used for primer designing. The primers were designed by using Primer Express 2.0 software (PE Applied Biosystems) using default parameters and checked by BLAST tool against M. notabilis genome and melting curve analysis following PCR. The PCR was performed in Stratagene Mx3005P (Agilent Technologies). The data represents average of two biological replicates with three technical replicates each. The data was normalized using elongation factor 1α as an internal control and relative expression values were calculated using ΔΔCT method. List of primers used in this study can be found in Supplemental Data S2. saeed et al .: lectin gene su perfa mi ly in mu lberry Table 1. Identification of lectin superfamily in mulberry. Family B_lectin Calreticulin Chitin_bind_1 Gal_lectin Gal-bind_lectin Jacalin Lectin_C Lectin_legB LysM PP2 (phloem protein 2 ) Ricin_B_lectin RicinB_lectin_2 Pfam_ID Members pfam01453 pfam00262 pfam00187 pfam02140 pfam00337 pfam01419 pfam00059 pfam00139 pfam01476 pfam14299 pfam00652 pfam14200 76 4 9 10 6 23 1 31 21 14 1 1 Analysis of Regulatory Elements in the Putative Promoters of Lectins For the promoter analysis, a 1-kb upstream region (from translation start site) of the 15 lectins analyzed by quantitative real-time PCR was extracted from Morus genome database (Li et al., 2014; http://morus.swu.edu. cn/morusdb). The putative cis-acting elements in the 1-kb upstream region were analyzed by scanning in PLACE database (Higo et al., 1999). Different stress-responsive motifs thus identified were classified as ABA responsive, MYB binding site, MYC binding site, dehydrationresponsive elements, and cold-responsive elements. The motifs responsive to JA, SA, and wounding were clubbed together as biotic-stress-responsive elements. The positions of these elements were marked and custom GFF file was prepared that was used for construction of promoter structures using CLC Genomics Workbench version 6.0.3. Results and Discussion Identification of Lectin Genes in Mulberry Genome and Phylogenetic Analysis An HMM search was employed for the identification of members of the lectin superfamily in mulberry. Multiple searches were performed until no new members could be identified. Domain analysis performed by NCBI-CDD (Marchler-Bauer et al., 2011) and SMART (Schultz et al., 1998; Letunic et al., 2004) searches confirmed the presence of the respective domains. This rigorous search led to identification of a total of 197 putative lectin genes that were categorized into 12 families (Table 1) following a previously suggested method of classification (Jiang et al., 2010). Pfam domains of all but two families, that is, phloem protein 2 and EEA families, have been previously reported (Jiang et al., 2010). However, with recent curation, we were able to identify Pfam domains for these families, that is, PP2 (phloem protein 2 , Pfam14299) for phloem lectins and RicinB_lectin_2 (Pfam14200) for EEA family. The EEA family has been referred to as RicinB_lectin_2 family. 3 of 13 Fig. 1. Phylogenetic analysis of lectin gene superfamily in mulberry. Respective domains of the different families of lectins from Morus notabilis were aligned using ClustalX version 2.1 and the phylogenetic tree was constructed using DIVEIN webserver. Five transcriptomes (bark, leaf, male flower, root, and winter bud) of M. notabilis (He et al., 2013) were downloaded from Morus genome database (Li et al., 2014; http://morus.swu.edu.cn/morusdb) and used to check the expression evidence of all the members of the lectin gene superfamily in the respective tissues. Expression evidence for all the putative lectin genes was found in the M. notabilis transcriptomes possibly because of the high throughput and wider range of the developmental stages used to generate the transcriptome data. The 4 of 13 respective domains from all the 12 families were used for performing the phylogenetic analysis of the lectin gene superfamily in mulberry. All the 197 putative lectins separated well into 12 clades (Fig. 1) representing each family as described for rice, soybean, and Arabidopsis (Jiang et al., 2010). Maximum number of members was identified in B_lectin family that also happens to be the largest lectin gene family in soybean and rice (Jiang et al., 2010). Ricin_B_lectin, Lectin_C, and RicinB_lectin_2 families came out to be monotypic gene family in M. notabilis. the pl ant genome ju ly 2016 vol . 9, no . 2 Likewise, Ricin_B_lectin family was also found to have a single member in rice, soybean, and Arabidopsis (Jiang et al., 2010). However, Lectin_C family has been reported to have a single member in rice and Arabidopsis and two in soybean. Interestingly, in the RicinB_lectin_2 family, seven members in rice, two in soybean, and one in Arabidopsis have been reported (Jiang et al., 2010). Even though both Ricin_B_lectin and RicinB_lectin_2 adopt ricin B-fold, the two have been classified separately. Based on the phylogenetic analysis performed on the members of the two families, these members were found to cluster separately in soybean (Van Holle and Van Damme, 2015). Several lectins with tandemly arrayed sugar-binding domains were also identified in Jacalin and LysM families (data not shown). Such repetitive domain arrangements have also been observed in other organisms (Van Holle and Van Damme, 2015). Details of members from each family along with the sequence information (obtained from Morus genome database), subcellular localization predictions (Emanuelsson et al., 2000), and their corresponding domains (start and end positions) can be found in Supplemental Data S3. Gene Structure Analysis Upon examination of gene structure, it was observed that majority of lectins contain introns. The maximum number of introns (20) were identified in Morus008787, which belongs to the Gal_lectin family (Fig. 2). Out of 76 members in B_lectin family, 35 members were found to be intronless. Among the B_lectin family members, Morus008727 was found to have the maximum number of exons (16 exons and 15 introns). Interestingly, the longest intron of 27,965 bp was found in Morus006136 from this family. In the Calreticulin family, all the four members contain introns, with Morus004111 containing 14 exons and 13 introns. Likewise, out of nine members comprising the Chitin_bind_1 gene family members, Morus026586 was found to be intronless. Six members of this family contain two exons, while rest of the members contain three exons. All the 10 members of Gal_lectin family contain a large number of exons per gene ranging from 13 to 21. Morus008787 from this family contains the largest number of exons (21) among all the predicted lectin genes. Similarly, in the Gal_binding_lectin family containing six members, Morus000677 was found to be intronless. Two members (Morus000472 and Morus021051) from this family contain six and eight exons, respectively, while the rest of the members contain seven exons each. Of the total 23 Jacalins predicted in this analysis, 10 were found to have three exons, while six have two exons. The majority (~65%) of the members from Lectin_legB are intronless genes, while Morus021495 from this family contains 10 exons. The three monotypic families, that is, Lectin_C, Ricin_B_ lectin, and RicinB_lectin_2, have seven, 14, and three exons, respectively; however, the length of introns varies considerably. Five genes from the LysM family were found to be intronless, thus a considerable heterogeneity saeed et al .: lectin gene su perfa mi ly in mu lberry in terms of gene length and exon–intron organization was observed in all the lectin families. Identification of Other Functional Domains Lectins containing domains other than the carbohydrate-binding domain possessing an entirely different biological activity have been identified (Van Damme et al., 1998). As described previously (Jiang et al., 2010), on analysis of domains in the putative lectins, several other co-occurring domains were identified (Fig. 3). We observed that the S_Locus_Glyco_protein domain was associated with the B_lectin family (55.26% members). This domain is also found in receptor kinases and has been implicated in mediating self-incompatibility responses in Brassica spp. (Cui et al., 2000). Kinase domain (90.78% members) has also been found to be associated with the B_lectin family along with a Pan Apple domain (68.42% members) that can mediate protein and sugar interactions (www.ebi.ac.uk). Interestingly, OsSIK2 containing B_lectin, S-Locus Glycoprotein, PAN domain, and an intracellular Ser/Thr kinase domain has been shown to enhance tolerance to salt and drought, promote leaf emergence, and delay leaf senescence (Chen et al., 2013). Association of F-box domain (57.14% members) with the phloem protein 2 family was also observed as described (Jiang et al., 2010) in rice (46.4% members), Arabidopsis (56.7% members), and soybean (82.1%). Phloem protein 2 is abundantly present in the phloem sap and its association with the F-box domain suggests a putative role in mediating protein degradation via sugar–protein interactions (Dinant et al., 2003). O-Glycosyl hydrolase domains, such as Glyco_hydro_42 and Glyco_hydro_35, were identified in 90 and 100% members, respectively, of the Gal_lectin family. These are involved in hydrolysis of the glycosidic bonds (www.smart.embl.de). Likewise, Glyco_hydro_35 domain was reported to be present in all the members in rice and soybean and in 69.2% members in Arabidopsis of Gal_lectin family (Jiang et al., 2010). Another β-galactosidase domain, betaGal_dom4_5, was identified in 30% of Gal_lectin members in mulberry. Lastly, a Galactosyl_T domain was identified in five out of six members of the Gal-binding lectin family. This domain was reportedly found in all the members of the respective family in rice, soybean, and Arabidopsis (Jiang et al., 2010). However, any of these functional domains could not be identified in the rest of the lectin gene families. Association of these domains with the lectin family members across all the four genera probably indicates a conserved function, which might be dependent on the sugar specificity of the lectin. Gene Ontology Assignment Gene ontology term assignment, a convenient tool for annotating novel transcripts, was used for annotating lectin genes from mulberry. Out of 197 putative lectin genes in M. notabilis, 179 were assigned GO terms: biological process, cellular component, and molecular 5 of 13 Fig. 2. Gene structure of lectin genes from Morus notabilis showing exon–intron organization. Members from each family have been represented in different colors. Boxed arrows represent exons and lines represent introns. function (Fig. 4). Of the 129 genes assigned the biological process terms, the maximum number of annotations involved metabolic processes, suggesting possible higher representation in cellular metabolism. In this category, lectin genes show an association with primary metabolism processes including carbohydrate metabolism, N compound metabolism, and other macromolecular mechanisms. Interestingly, the GO term, response to stress, was assigned to approximately three genes. Ten genes were assigned cellular processes GO terms, which 6 of 13 were associated with cytoplasm, endoplasmic reticulum, and membrane bound organelles (Fig. 4). This is in agreement with existing literature, which suggests that apart from the secretory pathway, lectins are also localized in the nuclear and cytoplasmic compartments (Van Damme et al., 2008). Of the 140 genes assigned molecular function terms, maximum representation was in catalytic activity, binding, kinase activity, ion binding, and transferase activity (Fig. 4). This observation corresponds well with our data on the occurrence of other functional the pl ant genome ju ly 2016 vol . 9, no . 2 Fig. 3. Identification of other functional domains and their abundance in lectins from Morus notabilis. Domain analysis was performed using NCBI-CDD and SMART searches. The different families have been represented at the x-axis and the types of domains with the number of members containing those domains have been represented on the y-axis. domains in mulberry lectins (discussed elsewhere). Thus, our GO assignment analysis indicates a wide range of functions, processes, and organeller localizations that lectins are possibly involved in. Several GOs identified in this study will pave the way for finding new functions of these lectin genes. Gene Expression Analysis Despite the diversity and abundance, the functions of lectins in plants remain largely unknown. To comprehend the biological roles of a protein, it is essential to understand its expression dynamics spanning a wider array of tissues. Thus, the expression profiles of the identified mulberry lectins were studied across five transcriptomes of M. notabilis representing different developmental stages. Several distinct clusters of genes with similar expression pattern were observed across the 12 families (Fig. 5). A varied pattern of expression was observed in the largest lectin gene family, the B_lectin family. Maximum number of genes in this family showing upregulation was observed in the leaf tissue followed by male flower and root tissues. Some root preferential genes were also identified. In the Lectin_legB family, genes having relatively higher expression were detected in root, male flower, and bark, while the largest number of upregulated genes was identified in the leaf. The Jacalin family members, in general, were found to be highly expressed saeed et al .: lectin gene su perfa mi ly in mu lberry in the winter bud and bark tissue. Likewise, in the LysM family, a greater number of genes were expressed in roots followed by leaf, bark, and male flowers. The single Ricin_B_lectin family member, Morus027674, was expressed in the bark tissue. Preferential accumulation of lectins in the bark suggests a crucial role as a defense related storage protein. Black locust (Robinia pseudoacacia L.) and elder (Sambucus nigra L.) are known to accumulate high amounts of lectins in the bark and are preferentially avoided by rodents and other animals (Peumans and Van Damme, 1995). Maximum expression was observed in the leaf followed by male flower and bark and lowest expression in winter bud in the Gal_lectin family. A similar pattern was observed in the phloem protein 2 family. Interestingly, in Chitin_bind_1 family, a very distinct pattern of expression was observed; the maximum upregulation was observed in the male flower and almost all were downregulated in the winter bud. This might be suggestive of a specific role of these lectins in the male flowers. The Gal-binding_lectin family members were found to be root preferential with a very low expression in other tissues. The expression was enhanced in leaf tissue and male flowers and was downregulated in bark tissue and winter buds in the Calreticulin family. The expression patterns of these lectins can aid in understanding their function by identifying the probable tissues and conditions in which they are expressed. Though we were able to identify several families showing tissue preferential expression, none of the families showed a distinct tissue specific expression pattern. This might suggest functional diversification and specialization of these genes in different tissues. Thus, our analysis collectively shows the possible differences in expression patterns of these lectins in the various tissues analyzed. Expression Profile of Lectin Gene Family Members Under Various Stress Conditions Plants have evolved complex expression patterns of genes in response to various environmental cues. Since recent studies have shown a possible stress mediated regulation of lectins, we quantified the expression of several lectins under abiotic and biotic stress treatments. Of the 15 genes analyzed, four belong to the B_lectin family; two to Lectin_legB, Gal-binding_lectin, Chitin_bind_1, and Calreticulin families; and a single member from Gal_lectin, Jacalin, and phloem protein 2 families. Quantitative real-time PCR showed that these genes are differentially regulated under most of the abiotic stress simulations such as ABA application, dehydration stress (simulated by aerial drying and treatment with mannitol), and cold stress (Fig. 6). Since, lectins have been long associated with a role in defense responses, the expression was checked under biotic stress simulating conditions such as JA and SA application and wounding (Fig. 6). Two members of Lectin_legB family (Morus021495 and Morus025486) showed responsiveness to both abiotic and biotic stresses treatments. Morus021495 was found to be significantly upregulated by both abiotic and biotic stresses, while 7 of 13 Fig. 4. Gene ontology term assignment of putative lectin genes in Morus notabilis. Gene ontology terms were assigned across the three categories: cellular component (CC), biological process (BP), and molecular function (MF). Morus025486 showed significant upregulation in cold and dehydration stresses. Similarly, members of Chitin_ bind_1 (Morus026586), Gal_lectin family (Morus013376), and Calreticulin (Morus004111) families showed significant upregulation under dehydration and ABA application, while the other two members of Chitin_bind_1 (Morus014362) and Calreticulin (Morus005581) families showed significant upregulation in both dehydration and biotic stress simulations. The expression level of Gal-binding_lectins (Morus015660 and Morus021051) 8 of 13 was significantly induced by ABA application; however, Morus021051 also showed a significant upregulation in response to dehydration stress. Morus020123 (Jacalin family) showed an abiotic stress-specific response with significant upregulation under dehydration and cold stresses. The expression of Morus006458 from the phloem protein 2 family was induced significantly under cold stress with slight induction under ABA treatment and dehydration stresses. Lastly, of the four members analyzed from the B_ lectin family, Morus004886, Morus013708, Morus016962, the pl ant genome ju ly 2016 vol . 9, no . 2 Fig. 5. Heat maps showing the expression profiles of putative Morus notabilis lectin genes in five transcriptomes representing five diverse tissues (bark, leaf, male flower, root, and winter bud). and Morus022540 showed a very significant and distinct responsiveness to dehydration stress treatments. A modest overlap between dehydration and coldstress signal transduction pathways has been observed (Yamaguchi-Shinozaki and Shinozaki, 2006). Interestingly, approximately nine genes were induced by dehydration stress (AD and DS) and CS, suggesting a role in ABA-independent stress responses, while six genes showed significant upregulation under ABA application and dehydration and cold stress, suggesting a role saeed et al .: lectin gene su perfa mi ly in mu lberry in ABA dependent stress responses. Consequently, from the expression analysis, it can be comprehended that lectins might act in both ABA-dependent and ABAindependent pathways. Similarly, an accumulation of wheat germ agglutinin in osmotic-stress-treated wheat seedlings, which was inhibited by fluridone (inhibitor of ABA biosynthesis), was reported (Cammue et al., 1989). The expression of reporter gene driven by upstream regulatory sequence of a B_lectin domain-containing protein was induced by salt and desiccation stress and 9 of 13 Fig. 6. Expression analysis of lectin genes under different environmental conditions in Morus indica. Con, treated with reverse osmosis water for required durations, biotic (red bars) stress simulations (treatment with 100 μM JA and SA for 4 h; WS, wounding stress for 6 h, abiotic stress (blue bars) simulations (treatment with 100 μM abscissic acid for 4 h; AD, aerial drying for 6 h; DS, dehydration stress by treatment with mannitol for 6 h; CS, cold stress for 4 h. Genes from different families have been highlighted using different backgrounds. The locus id of the best Blast hit from M. notabilis has been shown in the figure. Data represents average of two biological replicates (with three technical replicates each) standard deviation. For the determination of significance (p ≤ 0.05), Student’s t-test (two-sided tests for homoscedastic matrices) was performed using Excel software. not by ABA, indicating a role in ABA-independent stress-signaling pathway (Ray et al., 2012). Upregulation of 25 and 45% lectin genes by abiotic stresses such as cold, drought, and salinity in rice and Arabidopsis, respectively, has been reported (Jiang et al., 2010). 10 of 13 Significant upregulation of four genes was observed in biotic stress simulations (JA and SA application and wounding). A SA-responsive Jacalin domain containing protein LEM2 has been identified in barley (Hordeum vulgare L.) (Abebe et al., 2005). Similarly, induced the pl ant genome ju ly 2016 vol . 9, no . 2 Fig. 7. Cis-regulatory elements in the promoters of lectin genes used for expression analysis. 1 kb upstream region (from translation start site) of the promoters of all lectins analyzed by quantitative real-time PCR were analyzed using PLACE database. Different elements such as ABA responsive, Biotic stress responsive (wounding and JA/SA application), cold responsive (LTRE), and dehydration responsive elements such as DRE, MYB and MYC binding sites were identified. expression of Nictaba from tobacco (Nicotiana tabacum L.) was observed after methyl jasmonate treatment and insect herbivory (Lannoo et al., 2007). The role of lectin receptor-like kinases (LecRLKs) has been well documented under both abiotic and biotic stresses (Vaid et al., 2013). In fact, CERK1 (containing both LysM and kinase domains) has been suggested to be a critical player in fungal perception (Miya et al., 2007). It was later shown that CERK1 is a major chitin-binding protein in Arabidopsis and the three LysM domains are important for chitin binding (Petutschnig et al., 2010). Interestingly, introgression of Bph3 cluster containing three lectin receptor kinases (OsLecRLK1-3) into susceptible rice varieties led to enhanced resistance to both brown planthopper and white back planthopper (Liu et al., 2015). Likewise, 34 and 32% of the members of rice and Arabidopsis lectin gene families, respectively, were regulated by fungal and bacterial pathogens (Jiang et al., 2010). saeed et al .: lectin gene su perfa mi ly in mu lberry Together, these findings suggest a crucial role of lectins under different stress-responsive pathways in plants. Analysis of Cis-Acting Regulatory Elements of Stress Inducible Lectins Since distinct upregulation of lectins was observed under abiotic and biotic stress treatments, we analyzed the 1-kb upstream region (from translation start site) of the 15 lectin genes analyzed by quantitative real-time PCR (Fig. 7). Different regulatory elements involved in dehydration and cold stress signaling, such as ABA-responsive elements, dehydration-related element, low-temperatureresponsive element, and MYB- and MYC-binding sites were identified (details presented in Supplemental Data S4) by searching in PLACE database (Higo et al., 1999). The ABA responsive elements were found in three promoters, two (Morus004111 and Morus021051) of which showed significant upregulation after ABA treatment. Likewise, the promoters of the genes showing significant 11 of 13 upregulation under dehydration stress harbored dehydration-responsive cis-acting elements such as dehydration-related element and MYB- and MYC-binding sites . Also, low-temperature-responsive elements were found in the promoters of Morus21495, Morus025486, and Morus006458, which showed significant induction under cold stress. Several lectins (Morus21495, Morus14362, Morus005581, and Morus22540) were found to be significantly upregulated by biotic stress simulations such as JA and SA application and wounding. Correspondingly, biotic-stress-responsive cis-acting elements were identified in the promoters of these genes. Presence of these cis-acting elements further rationalizes the induction of these genes by the corresponding treatments. Collectively, our expression data and in silico promoter analysis strongly suggests a probable role of lectins in both abiotic and biotic stress signaling. Conclusions Despite the extensive research on plant lectins, their biological roles are hitherto vaguely defined. Nevertheless, given the ubiquitous occurrence and the structural diversity of lectins, it is discernable that they might play numerous decisive roles. Thus, identification and classification of the lectin complement at the wholegenome level can play a critical role in advancing our understanding of this very diverse superfamily. In the present study, we identified 197 putative lectin genes from mulberry, which is recognized as a rich source of lectins. These putative lectin genes were classified into 12 families based on the presence of corresponding lectin domains. Out of 197 putative lectin genes, 64 were found to be intronless, while genes with up to 20 introns were also identified, implying a high heterogeneity at the level of gene organization. Association of other functional domains with some families was also observed, which might reflect a wider range of biological functions. Correspondingly, GO assignment was performed to annotate the newly identified members, which implies that lectins are involved in a plethora of biological roles in different cellular compartments. This also suggests that sugar specificity lies at the center of these cellular responses. As stated earlier, lectins have been suggested to be responsive to different stresses; our analysis confirms the upregulation of lectins in response to abiotic and biotic stress treatments. Collectively, the information generated in this analysis can prove beneficial in determining the functional roles of lectins in plants, in general, and mulberry in particular. Supplemental Information Available Supplemental Data S1: Perl script used for sequence retrieval. Supplemental Data S2: List of primers used in the present study. Supplemental Data S3: Details of lectin gene family members. 12 of 13 Supplemental Data S4: Details of cis-acting elements found in the promoters. Author Contributions BS and VKB performed the in silico analysis. BS performed the experiments. BS, VKB, and PK analyzed the data. BS and PK wrote the manuscript. All authors read and approved the manuscript. Acknowledgments BS acknowledges Council for Scientific and Industrial Research (CSIR) for Senior Research Fellowship. This work was supported by grants from the Department of Biotechnology, Government of India. The authors thank the reviewers for improvement of the manuscript. References Abebe, T., R.W. Skadsen, and H.F. Kaeppler. 2005. A proximal upstream sequence controls tissue-specific expression of Lem2, a salicylate-inducible barley lectin-like gene. Planta 221:170–183. doi:10.1007/s00425-0041429-9 Absar, N., T. Yeasmin, M.S. Raza, S.K. Sarkar, and F. Arisaka. 2005. Single step purification, characterization and N-terminal sequences of a mannose specific lectin from mulberry seeds. Protein J. 24:369–377. doi:10.1007/s10930-005-7590-6 Cammue, B.P., W.F. Broekaert, J.T. Kellens, N.V. Raikhel, and W.J. Peumans. 1989. Stress-induced accumulation of wheat germ agglutinin and abscisic acid in roots of wheat seedlings. Plant Physiol. 91:1432–1435. doi:10.1104/pp.91.4.1432 Chen, L.J., H. Wuriyanghan, Y.Q. Zhang, K.X. Duan, H.W. Chen, Q.T. Li, et al. 2013. An S-domain receptor-like kinase, OsSIK2, confers abiotic stress tolerance and delays dark-induced leaf senescence in rice. Plant Physiol. 163:1752–1765. doi:10.1104/pp.113.224881 Choi, J., K. Tanaka, Y. Cao, Y. Qi, J. Qiu, Y. Liang, et al. 2014. Identification of a plant receptor for extracellular ATP. Science 343:290–294. doi:10.1126/science.343.6168.290 Chomczynski, P., and N. Sacchi. 1987. Single-step method of RNA isolation by acid guanidinium thiocyanate-phenol-chloroform extraction. Anal. Biochem. 162:156–159. doi:10.1016/0003-2697(87)90021-2 Cui, Y., Y.M. Bi, N. Brugière, M. Arnoldo, and S.J. Rothstein. 2000. The S locus glycoprotein and the S receptor kinase are sufficient for selfpollen rejection in Brassica. Proc. Natl. Acad. Sci. USA 97:3713–3717. doi:10.1073/pnas.97.7.3713 Deepa, M., and S. Priya. 2012. Purification and characterization of a novel anti-proliferative lectin from Morus alba L. leaves. Protein Pept. Lett. 19:839–845. doi:10.2174/092986612801619516 Deepa, M., T. Sureshkumar, P.K. Satheeshkumar, and S. Priya. 2012. Purified mulberry leaf lectin (MLL) induces apoptosis and cell cycle arrest in human breast cancer and colon cancer cells. Chem. Biol. Interact. 200:38–44. doi:10.1016/j.cbi.2012.08.025 Deng, W., B.S. Maust, D.C. Nickle, G.H. Learn, Y. Liu, L. Heath, et al. 2010. DIVEIN: A web server to analyze phylogenies, sequence divergence, diversity, and informative sites. Biotechniques 48:405. doi:10.2144/000113370 Dinant, S., A.M. Clark, Y. Zhu, F. Vilaine, J.C. Palauqui, C. Kusiak, et al. 2003. Diversity of the superfamily of phloem lectins (phloem protein 2) in angiosperms. Plant Physiol. 131:114–128. doi:10.1104/pp.013086 Emanuelsson, O., H. Nielsen, S. Brunak, and G. von Heijne. 2000. Predicting subcellular localization of proteins based on their N-terminal amino acid sequence. J. Mol. Biol. 300:1005–1016. doi:10.1006/ jmbi.2000.3903 Fu, L., C. Zhou, S. Yao, J. Yu, B. Liu, and J. Bao. 2011. Plant lectins: Targeting programmed cell death pathways as antitumor agents. Int. J. Biochem. Cell Biol. 43:1442–1449. doi:10.1016/j.biocel.2011.07.004 Gentleman, R.C., V.J. Carey, D.M. Bates, B. Bolstad, M. Dettling, S. Dudoit, et al. 2004. Bioconductor: Open software development for the pl ant genome ju ly 2016 vol . 9, no . 2 computational biology and bioinformatics. Genome Biol. 5:R80. doi:10.1186/gb-2004-5-10-r80 Grabherr, M.G., B.J. Haas, M. Yassour, J.Z. Levin, D.A. Thompson, I. Amit, et al. 2011. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 29:644–652. doi:10.1038/ nbt.1883 Haas, B.J., A. Papanicolaou, M. Yassour, M. Grabherr, P.D. Blood, J. Bowden, et al. 2013. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat. Protoc. 8:1494–1512. doi:10.1038/nprot.2013.084 He, N., C. Zhang, X. Qi, S. Zhao, Y. Tao, G. Yang, et al. 2013. Draft genome sequence of the mulberry tree Morus notabilis. Nat. Commun. 4:2445. doi:10.1038/ncomms3445 Higo, K., Y. Ugawa, M. Iwamoto, and T. Korenaga. 1999. Plant cis-acting regulatory DNA elements (PLACE) database: 1999. Nucleic Acids Res. 27:297–300. doi:10.1093/nar/27.1.297 Jiang, S.Y., Z. Ma, and S. Ramachandran. 2010. Evolutionary history and stress regulation of the lectin superfamily in higher plants. BMC Evol. Biol. 10:79. doi:10.1186/1471-2148-10-79 Jin, H., Z. Hong, W. Su, and J. Li. 2009. A plant-specific calreticulin is a key retention factor for a defective brassinosteroid receptor in the endoplasmic reticulum. Proc. Natl. Acad. Sci. USA 106:13612–13617. doi:10.1073/ pnas.0906144106 Khurana, P., and V.G. Checker. 2011. The advent of genomics in mulberry and perspectives for productivity enhancement. Plant Cell Rep. 30:825– 838. doi:10.1007/s00299-011-1059-1 Kitajima, S., Y. Yamamoto, K. Hirooka, C. Taki, and S. Hibino. 2013. Laticifers in mulberry exclusively accumulate defense proteins related to biotic stresses. Plant Biotechnol. 30:399–402. doi:10.5511/ plantbiotechnology.13.0326a Lannoo, N., and E.J. Van Damme. 2010. Nucleocytoplasmic plant lectins. Biochim. Biophys. Acta, Gen. Subj. 1800:190–201. doi:10.1016/j.bbagen.2009.07.021 Lannoo, N., G. Vandenborre, O. Miersch, G. Smagghe, C. Wasternack, W.J. Peumans, et al. 2007. The jasmonate-induced expression of the Nicotiana tabacum leaf lectin. Plant Cell Physiol. 48:1207–1218. doi:10.1093/ pcp/pcm090 Larkin, M.A., G. Blackshields, N. Brown, R. Chenna, P.A. McGettigan, H. McWilliam, et al. 2007. Clustal W and Clustal X version 2.0. Bioinformatics 23:2947–2948. doi:10.1093/bioinformatics/btm404 Letunic, I., R.R. Copley, S. Schmidt, F.D. Ciccarelli, T. Doerks, J. Schultz, et al. 2004. SMART 4.0: Towards genomic data integration. Nucleic Acids Res. 32:D142–D144. doi:10.1093/nar/gkh088 Li, B., and C.N. Dewey. 2011. RSEM: Accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinformatics 12:323. doi:10.1186/1471-2105-12-323 Li, T., Qi, X., Zeng, Q., Xiang, Z., and He, N. 2014. MorusDB: A resource for mulberry genomics and genome biology. Database 2014:bau054. doi:10.1093/database/bau054 Liu, Y., H. Wu, H. Chen, Y. Liu, J. He, H. Kang, et al. 2015. A gene cluster encoding lectin receptor kinases confers broad-spectrum and durable insect resistance in rice. Nat. Biotechnol. 33:301–305. doi:10.1038/ nbt.3069 Marchler-Bauer, A., S. Lu, J.B. Anderson, F. Chitsaz, M.K. Derbyshire, C. DeWeese-Scott, et al. 2011. CDD: A Conserved Domain Database for the functional annotation of proteins. Nucleic Acids Res. 39:D225– D229. doi:10.1093/nar/gkq1189 Miya, A., P. Albert, T. Shinya, Y. Desaki, K. Ichimura, K. Shirasu, et al. 2007. CERK1, a LysM receptor kinase, is essential for chitin elicitor signaling in Arabidopsis. Proc. Natl. Acad. Sci. USA 104:19613–19618. doi:10.1073/pnas.0705147104 Ochoa-Alvarez, J.A., H. Krishnan, Y. Shen, N.K. Acharya, M. Han, D.E. McNulty, et al. 2012. Plant lectin can target receptors containing sialic acid, exemplified by podoplanin, to inhibit transformed cell growth and migration. PLoS One 7:e41845. doi:10.1371/journal.pone.0041845 Petutschnig, E.K., A.M. Jones, L. Serazetdinova, U. Lipka, and V. Lipka. 2010. The lysin motif receptor-like kinase (LysM-RLK) CERK1 is a saeed et al .: lectin gene su perfa mi ly in mu lberry major chitin-binding protein in Arabidopsis thaliana and subject to chitin-induced phosphorylation. J. Biol. Chem. 285:28902–28911. doi:10.1074/jbc.M110.116657 Peumans, W.J., and E. Van Damme. 1995. Lectins as plant defense proteins. Plant Physiol. 109:347. doi:10.1104/pp.109.2.347 Ratanapo, S., W. Ngamjunyaporn, and M. Chulavatnatol. 1998. Sialic acid binding lectins from leaf of mulberry (Morus alba). Plant Sci. 139:141– 148. doi:10.1016/S0168-9452(98)00185-X Ratanapo, S., W. Ngamjunyaporn, and M. Chulavatnatol. 2001. Interaction of a mulberry leaf lectin with a phytopathogenic bacterium, P. syringae pv mori. Plant Sci. 160:739–744. doi:10.1016/S0168-9452(00)00454-4 Ray, S., S. Kapoor, and A.K. Tyagi. 2012. Analysis of transcriptional and upstream regulatory sequence activity of two environmental stressinducible genes, NBS-Str1 and BLEC-Str8, of rice. Transgenic Res. 21:351–366. doi:10.1007/s11248-011-9535-5 Roberts, N.J., G. Morieri, G. Kalsi, A. Rose, J. Stiller, A. Edwards, et al. 2013. Rhizobial and mycorrhizal symbioses in Lotus japonicus require lectin nucleotide phosphohydrolase, which acts upstream of calcium signaling. Plant Physiol. 161:556–567. doi:10.1104/pp.112.206110 Saeed, B., V.K. Baranwal, and P. Khurana. 2016. Comparative transcriptomics and comprehensive marker resource development in mulberry. BMC Genomics 17:1. doi:10.1186/s12864-016-2417-8 Saeed, B., M. Das, and P. Khurana. 2015. Overexpression of β-carotene hydroxylase1 (BCH1) in Indian mulberry, Morus indica cv. K2, confers tolerance against UV, high temperature and high irradiance stress induced oxidative damage. Plant Cell Tissue Organ Cult. 120:1003– 1014. doi:10.1007/s11240-014-0654-6 Schouppe, D., B. Ghesquière, G. Menschaert, W.H. De Vos, S. Bourque, G. Trooskens, et al. 2011. Interaction of the tobacco lectin with histone proteins. Plant Physiol. 155:1091–1102. doi:10.1104/pp.110.170134 Schultz, J., F. Milpetz, P. Bork, and C.P. Ponting. 1998. SMART, a simple modular architecture research tool: Identification of signaling domains. Proc. Natl. Acad. Sci. USA 95:5857–5864. doi:10.1073/pnas.95.11.5857 Vaid, N., A. Macovei, and N. Tuteja. 2013. Knights in action: Lectin receptor-like kinases in plant development and stress responses. Mol. Plant 6:1405–1418. doi:10.1093/mp/sst033 Van Damme, E.J., B. Hause, J. Hu, A. Barre, P. Rougé, P. Proost, et al. 2002. Two distinct jacalin-related lectins with a different specificity and subcellular location are major vegetative storage proteins in the bark of the black mulberry tree. Plant Physiol. 130:757–769. doi:10.1104/pp.005892 Van Damme, E.J., N. Lannoo, and W.J. Peumans. 2008. Plant lectins. Adv. Bot. Res. 48:107–209. doi:10.1016/S0065-2296(08)00403-5 Van Damme, E.J., W.J. Peumans, A. Barre, and P. Rougé. 1998. Plant lectins: A composite of several distinct families of structurally and evolutionary related proteins with diverse biological roles. Crit. Rev. Plant Sci. 17:575–692. doi:10.1016/S0735-2689(98)00365-7 Van Holle, S., and E.J. Van Damme. 2015. Distribution and evolution of the lectin family in soybean (Glycine max). Molecules 20:2868–2891. doi:10.3390/molecules20022868 Yamaguchi-Shinozaki, K., and K. Shinozaki. 2006. Transcriptional regulatory networks in cellular responses and tolerance to dehydration and cold stresses. Annu. Rev. Plant Biol. 57:781–803. doi:10.1146/annurev. arplant.57.032905.105444 Yamaji, Y., K. Maejima, K. Komatsu, T. Shiraishi, Y. Okano, M. Himeno, et al. 2012. Lectin-mediated resistance impairs plant virus infection at the cellular level. Plant Cell 24:778–793. doi:10.1105/tpc.111.093658 Yeasmin, T., M.A.K. Tang, A. Razzaque, and N. Absar. 2001. Purification and characterization of three galactose specific lectins from mulberry seeds (Morus sp.). Eur. J. Biochem. 268:6005–6010. doi:10.1046/j.00142956.2001.02470.x Zhang, W., W.J. Peumans, A. Barre, C.H. Astoul, P. Rovira, P. Rougé, et al. 2000. Isolation and characterization of a jacalin-related mannosebinding lectin from salt-stressed rice (Oryza sativa) plants. Planta 210:970–978. doi:10.1007/s004250050705 13 of 13
© Copyright 2026 Paperzz