2/3/2015 Agriculture and Agri-Food Canada Agriculture et Agroalimentaire Canada Introduction Evolution of RNA polyadenylation Sites Xiu-Qing Li Agriculture and Agri-Food Canada Fredericton, NB, Canada [email protected]; • Polyadenylation of RNA is at the centre of gene expression. • Knowledge about RNA poly(A) site may be valuable for designing new genes, distinguishing gene alleles, and regulating gene expression. • However, the characteristics and evolution of poly(A) site selection, poly(A) site region base composition, signal motifs, and base abundance patterns around mRNA poly(A) sites are still largely uncharacterized in most species. (Plant and Animal Genome XXII, San Diego January 10-14, 2015, Abstract W744) Agriculture and Agri-Food Canada Approaches • After comparing the sequence quality between mRNA database and Sequence Read Archive (SRA) datasets, we analyzed the poly(A) site regions of 31 species or phyla by mapping NCBI mRNA sequences to their reference genomes. • Redundant mRNA sequences or poly(A) sites were eliminated. Agriculture et Agroalimentaire Canada The poly(A) site region 5’ CDS UTR Site COR DNA: ____________________UC_________ Pre-RNA: ____________________UC_________ 3’ Attachment↕↕Starting position RNA (cleaved): ____________________U RNA (polyA tailed):____________________UAAAAAAAAAA… Agriculture and Agri-Food Canada Agriculture et Agroalimentaire Canada Results The alignment of 3′ end sequences of potato Kunitz-type protease inhibitor mRNAs. Note that each polyadenylation [poly(A)] tail likely starts from an adenosine. (Li and Du 2013) 1 2/3/2015 Agriculture and Agri-Food Canada Agriculture et Agroalimentaire Canada Agriculture and Agri-Food Canada Agriculture et Agroalimentaire Canada Adenosine content for the first six bases of 3’COR (Li, 2014, PloS ONE) (Li and Du, 2014) Agriculture and Agri-Food Canada Agriculture et Agroalimentaire Canada Agriculture and Agri-Food Canada Agriculture et Agroalimentaire Canada Base U contents of the poly(A) tail regions (3’COR) in different subkingdoms C/G ratios at the poly(A) tail attachment position of non-A-type poly(A) transcripts can clearly separate the three groups: animals < dicots < monocots. In order: dog, rabbit, rat, zebrafish, mouse, cattle, zebra finch, orangutan, chicken, human, pig, and fruit fly. The three dicot plants are, in order, M. truncatula, A. thaliana, and poplar. The three monocot plants are, in order, rice, maize, and sorghum. Agriculture and Agri-Food Canada Agriculture et Agroalimentaire Canada Region and subkingdom 3′UTR_Dicots 3′COR_Dicots 3′UTR_Monocots 3′UTR_Non-mammals 3′COR_Monocots 3′COR_Non-mammals Genome_Dicots 3′UTR_Mammals 3′COR_Mammals Genome_Non-mammals Genome_Mammals Genome_Monocots Duncan testa A B BC C C CD CDE DEF DEF EF FG G a:Means with the same letter are not significantly different (P < 0.05). ANOVA-Duncan’s multiple range tests (Li, 2014, PloS ONE) Agriculture and Agri-Food Canada 3′UTR base composition and the 3′UTR/genome ratio No. of Mean of uracil (U) species contents (%) 4 39.88 4 37.45 3 35.34 6 34.93 3 34.47 6 33.92 4 32.84 5 31.87 5 31.46 6 30.78 5 29.48 3 27.63 Agriculture et Agroalimentaire Canada 3′COR base composition and the 3′COR/genome ratio (Li, 2014, PloS ONE) (Li, 2014, PloS ONE) 2 2/3/2015 Agriculture and Agri-Food Canada Average base abundance at each position of the 45-nucleotide polyadenylation [poly(A)] site regions in protists. Agriculture et Agroalimentaire Canada Protist and potato polyA site seqlogs. Potato (Li and Du 2014). (Li and Du 2014 BMC Evol Biol) Average base abundance at each position of the 201-nucleotide polyadenylation site region, showing the U-A-U-SiteA-U base abundance pattern in plants and animals Type and location of the predominant motif in the poly(A) site region in different species (Li and Du 2014 BMC Evol Biol) Agriculture and Agri-Food Canada Agriculture et Agroalimentaire Canada (Li and Du 2014 BMC Evol Biol) Agriculture and Agri-Food Canada Agriculture et Agroalimentaire Canada Most frequent 20 motifs in the poly(A) site region in 30,499 unique poly(A) sites mapped to the human genome Poly(A) site A maize mRNA showing that both the AAUAAA motif and the poly(A) sites are on loops. (Li and Du 2014 BMC Evol Biol) (Li, unpublished) 3 2/3/2015 Agriculture and Agri-Food Canada Agriculture et Agroalimentaire Canada Agriculture and Agri-Food Canada Agriculture et Agroalimentaire Canada Conclusions • The poly(A) site is usually an A in most (30/31) species or phyla • The base composition on the poly(A) tail attachment position can separate animals, dicots, and monocots. • Lower species (protists) placed the polyadenylation motif upstream of, on, or downstream of the poly(A) site, but fungi, animals, and most plants, place the signal motif upstream. • There is a mechanism operating in both animals and plants which is biased toward U and against C in the 3′COR and 3′UTR. • The 3′UTR and 3′COR, as functional units, minimized the difference between dicotyledonous and monocotyledonous plants, while the dicotyledonous and monocotyledonous genomes evolved into two extreme groups in terms of base composition. • Poly(A) site essayed several options during evolution prior to establishing the U-A-U-A-U pattern of the poly(A) site region. RNA poly(A) sites in microorganisms, plants and animals: Characteristics and evolution Xiu-Qing Li 李 修 庆 Agriculture and Agri-Food Canada Fredericton, NB, Canada [email protected]; 4
© Copyright 2026 Paperzz