From www.bloodjournal.org by guest on June 18, 2017. For personal use only. Blood First Edition Paper, prepublished online June 5, 2014; DOI 10.1182/blood-2014-04-512756 A Tour Through the Transcriptional Landscape of Platelets Sebastian Schubert1, Andrew S. Weyrich1,2 , and Jesse W. Rowley1,2 The Molecular Medicine Program1 and the Department of Internal Medicine2, University of Utah School of Medicine, Salt Lake City, UT, USA Running Title: mRNA in Platelets Corresponding Author: Jesse W. Rowley, PhD Department of Internal Medicine University of Utah School of Medicine Eccles Institute of Human Genetics Building 533, Room 4260 15 North 2030 East Salt Lake City, Utah 84112 [email protected] Phone: 801-585-0706 Fax: 801-585-0701 1 Copyright © 2014 American Society of Hematology From www.bloodjournal.org by guest on June 18, 2017. For personal use only. Abstract The RNA code found within a platelet, and alterations of that code, continue to shed light onto the mechanistic underpinnings of platelet function and dysfunction. It is now known that features of messenger RNA (mRNA) in platelets mirror those of nucleated cells. This review serves as a tour guide for readers interested in developing a greater understanding of platelet mRNA. The tour provides an in-depth and interactive examination of platelet mRNA, especially in the context of next-generation RNA-sequencing (RNA-seq). At the end of the expedition the reader will have a better grasp of the topography of platelet mRNA and how it impacts platelet function in health and disease. Introduction Despite their small size and anucleate status, platelets have a rich repertoire of RNAs including messenger RNAs (precursor-mRNA [pre-mRNA] and mature mRNA), structural and catalytic RNAs (ribosomal RNA [rRNA], transferRNA [tRNA], and small nucleolar RNA [snRNA]), and regulatory RNAs (microRNA [miRNA], long intergenic non-coding RNA [lincRNA], pseudogenes, and anti-sense RNA). Examination of RNA expression patterns in platelets have been used to identify biomarkers of disease, explain genetically or environmentally induced alterations in platelet function, and determine if genes are conserved between humans and mice1–13. The last decade has also revealed that platelets can translate mRNA into protein or transfer RNA to recipient cells where it regulates functional processes14–17. Because platelets are anucleate, transcriptional production of RNA (with the notable exception of mitochondria derived 2 From www.bloodjournal.org by guest on June 18, 2017. For personal use only. transcripts) presumably occurs exclusively in the megakaryocyte. Therefore, the RNA profile in platelets also provides an accessible window into the transcriptional status of megakaryocytes, and the bone-marrow signals involved, at the time of platelet production. Several recent reviews have elegantly discussed the predictive value and functions of platelet RNAs2,18,19. This review explores the landscape of platelet RNA as mapped by next generation RNA sequencing (RNA-seq). Specifically, we examine what a platelet mRNA looks like at the molecular level and how specific features of the mRNA impact biological responses. Along the way, we provide a series of web links and access instructions (see Table I) for readers to follow as they explore key landmarks of platelet mRNA. Each stage of the tour (Figure 1) is discussed with reference to platelet function and in a broader biological context. While screenshots are shown in figure format, this review is meant to be interactive and dynamic, with the goal of providing the tools necessary for researchers to explore the innumerable transcript features in platelets in the context of their own research interests. I. How to navigate the landscape: an RNA-seq overview To appreciate key features of platelet mRNA, one first has to understand the fundamentals of next-generation RNA-sequencing (RNA-seq). RNA-seq has revolutionized medical research and, as shown in Table II, several groups have recently used this powerful technique to profile platelet mRNA1,20–22. RNA-seq libraries are prepared from either total RNA or a sub-fraction of RNA (i.e. poly-adenylated, ribosomal RNA depleted, 5 cap containing, small RNA, etc.) by ′ 3 From www.bloodjournal.org by guest on June 18, 2017. For personal use only. random shearing of RNA into 100-400 base pair fragments. Barcodes and sequencing primer binding sites are then added, and the RNA is briefly amplified, followed by size selection of 100-400 bp fragments (also called inserts). This size selection captures most protein coding RNAs and long non-coding RNAs. For small RNA sequencing, which captures mature microRNAs, piwi-interacting RNAs, and other small RNAs, fragmentation is skipped and inserts < 40 bp are selected. Following insert selection, around 30 bp up to >150 bp of one end (single end read) or both ends (paired-end reads) of each fragment are sequenced. Sequences (reads) are then computationally mapped by finding unique (usually) matches between the annotated genome/transcriptome and the sequenced end of the RNA fragment. If paired-ends are sequenced, then each end is matched with the expected distance constrained to the approximate expected fragment size. If libraries are stranded, then the read can be mapped directionally to either the positive or negative strand of DNA. RNA-seq methodology is reviewed in more detail elsewhere23–26. Four long RNA sequencing datasets (i.e. protein coding and long non-coding RNA) from human platelets and one from mouse platelets have been published1,20–22. Other small RNA sequencing datasets (i.e. mature microRNAs) have also been published but are not discussed here21,27. As shown in Table II, each RNA-seq dataset offers some complementary features not provided by the other. For example, in our experience shorter (36 bp) reads map better to poorly annotated transcripts or novel small exons whereas longer reads (101 bp) aid in isoform discrimination and map better to lower complexity regions. Paired reads are useful for determining splicing and exon exclusion patterns. Stranded reads discriminate between genes overlapping in the genome, but 4 From www.bloodjournal.org by guest on June 18, 2017. For personal use only. transcribed from opposite strands. Oligo-dT enriches for protein coding transcripts, whereas ribosomal depletion of total RNA will capture more non-coding transcripts. In one of their samples, Kissoupoulou et. al.20 use a 5 end tagging strategy that marks the ′ extreme 5 terminus of transcripts. The aligned reads from three of these datasets are ′ available for rapid navigation via web links. Instructions to access and visualize the aligned data in UCSC genome browser28 are given in Table I. II. A panoramic view of a platelet transcript as depicted by RNA-seq RNA-seq creates snapshots of mRNAs that provide a wealth of information, from gene expression to regional features and detailed sequence23. Before exploring subdomains of platelet mRNA, it is helpful to understand the topography of snapshots that are used to represent RNA-seq results. Figure 2A represents a visualization of RNA-seq alignments to glycoprotein IX (GP9). In this example, GP9 is annotated as it appears in platelets – a mature message that does not contain intronic sequences. Note that individual paired reads, which in this case are ~36 bp in length, map across the entire GP9 annotated transcript. The number of reads that cover each nucleotide along the transcript are summed, and the count (y-axis) is displayed in the histogram-like graph (coverage or read depth graph) above the reads. Various methods, ranging in robustness, for calculating expression exist, and are reviewed in29,30. One simple and commonly used index of abundance is the RPKM or FPKM (Read (or Fragment) Per Kilobase normalized to a Million)26. This is calculated from the sum of all the reads across a transcript, normalized to the transcript length and to the total number of reads mapping to the entire transcriptome. Variability in fragmentation, sequencing, and 5 From www.bloodjournal.org by guest on June 18, 2017. For personal use only. mapping efficiency for sub-sequences within the transcript affect the read depth at each loci, hence the “hills” and “valleys” of expression across the transcript30. Thus, RNA-seq can quantify per-base, per-exon, per annotated region, or per-transcript expression on a genome-wide scale without prior knowledge of the transcript23. Unlike the depiction in Figure 2A, RNA-seq data is typically visualized in genomic space using a genome browser such as the UCSC genome browser28. Figure 2B depicts the same transcript, GP9, in a genome browser. Note now that the exons are separated by the introns encoded in the genome, and like canyons between peaks of expression, the introns are devoid of aligned reads. This is because GP9 is completely spliced. Reads connected by thin lines represent either connected paired or split reads. Although the intron spanned by paired reads is much longer than the expected 200 bp fragment size, when the transcript is spliced the paired reads map at the expected distance. III. Hidden structures: the 5′ cap and poly-A tail Although anucleate, platelet mRNAs mirror mRNAs found in nucleated cells. Eukaryotic RNAs are prepared for nuclear export and translation by the addition of a 5 ′ 7-methylguanosine cap, the addition of a 3 poly-A tail of variable length, and the ′ splicing out of introns31. The 5 cap facilitates ribosome loading for protein translation ′ via recognition and recruitment by eukaryotic initiation factor 4E (eIF4E)32,33. The polyA tail helps circularize mRNAs through interaction with the scaffold protein eIF4G, which initiates cap recognition by eIF4E32. Both ends protect mRNA from degradation32. The poly-A tail can be extended or shortened over the life of the RNA, but gradual poly-A shortening followed by decapping is the eventual fate of most RNAs34,35. In other 6 From www.bloodjournal.org by guest on June 18, 2017. For personal use only. mammalian cells, typically only 5 capped, 3 polyadenylated, spliced mRNA will be ′ ′ exported from the nucleus, and used for translation31. The majority of unspliced cytoplasmic message is rapidly degraded by non-sense mediated decay (NMD)36. The customary use of oligo-dT to prime reverse transcription leaves little doubt that the majority of platelet transcripts are polyadenylated, consistent with descriptions of poly-A+ RNA in platelets reported by Roth and colleagues in 198937. Platelets contain poly-A binding proteins such as APP-138 that may support poly-A stimulated protein translation. Evidence that a subset of transcripts are 5 capped comes from reports that ′ protein translation can be initiated in platelets39. The poly-A tail is not genomically encoded and stretches of A’s cannot be effectively aligned from RNA-seq reads. Therefore, poly-A reads are hidden in typical RNA-seq workflows. The 5’ cap is also not detected by normal RNA sequencing reactions. To characterize capping and polyadenylation in platelets, we have performed RNAseq on transcripts captured with oligo-dT conjugated beads or by a hyper-affinity mutant eIF4E protein that enriches for transcripts with a 7-methylguanosine cap40. Figure 3A offers a pictorial of this strategy. When comparing RNA-seq of poly-A isolated mRNA with cap-captured RNA, poly-A isolation captures the majority of annotated protein coding transcripts, while some transcripts with a very short or no poly-A tail are missed. Such is the case for many histone coding transcripts with characteristically short or no poly-A tails41 which are missed by poly-A capture, yet readily observed when isolated by the 5 cap-based strategy (Figure 3B). Conversely, a handful of mRNAs detected by ′ poly-A based methods are not captured by 5 -based methods (not shown). Aside from ′ these exceptions, the bulk of annotated platelet transcripts that are captured by Oligo7 From www.bloodjournal.org by guest on June 18, 2017. For personal use only. dT are also captured by cap binding strategies (unpublished data). A prime example is PAR1, whose mRNA was captured by both 3 and 5 based methods (Figure 3B). Other ′ ′ transcripts, in particular non-coding RNA, are undoubtedly present in platelets that contain neither a 5’ cap nor a 3’ poly-A tail21. The beginning of the poly-A tail can be inferred from the reads by a sharp decline in read depth at the end of the 3 UTR accompanied by 3 end mapped reads containing ′ ′ strings of A’s20,42. This type of analysis was used by Osman and coworkers to identify two poly-adenylation start sites for transcript PPBP20. On the other hand, the length of poly-A tails in platelets and whether tail shortening occurs over time is not known. This is in part because new and old populations of platelets are difficult to separate in vivo. In vitro studies suggest that some platelet mRNAs are not stable over time. Reticulated platelet counts potentially decrease after platelet storage43,44. mRNA for Pselectin and GAPDH disappear in stored platelets at 22oC degrees, but decline only 40% over 5 days when stored at 4oC degrees45. Sult2B1b mRNA, which affects platelet functional responses via catalysis of the sulfonation of cholesterol46, also diminishes rapidly at 37oC47. Sult2B1b mRNA levels decrease by 40% within just 30 minutes at 37oC, but are stable at 4oC. Interestingly, addition of high density lipoprotein to the platelets slowed the decay. Of note, Sult2B1b protein levels and its product level of cholesterol sulfate tracked with mRNA levels suggesting continued protein translation and functional implications of mRNA stability in platelets. Whether the poly-A tail and cap are involved in the stability and translation of these and other transcripts in platelets merits further investigation. 8 From www.bloodjournal.org by guest on June 18, 2017. For personal use only. IV. Peaks and canyons: pre-mRNAs, splicing, and intron retention Surprisingly platelets contain a subset of precursor mRNAs that have escaped the nucleus and NMD without splicing48. Platelet activation induces splicing and enables protein production from the newly formed mRNA. Unspliced pre-mRNA can be observed in RNA-seq data visually or bioinformatically by examining the ratio of intronic to exonic reads. Interleukin-1β (IL-1β), a prototypical unspliced mRNA in platelets48, contains intronic reads in relatively high proportion to exonic reads2. According to our own analysis of platelet RNA-seq datasets, transcripts like IL-1β, with a large fraction of completely unspliced pre-mRNA, are infrequent. Despite this, additional examples of pre-mRNA splicing followed by protein production in platelets have surfaced in the literature49,50. In addition to these, using RNA-seq we have bioinformatically identified a handful of putatively unspliced transcripts in platelets that retain one or more introns. One example shown in Figure 3C is FOSB. FOSB is a transcription factor that regulates a wide array of cellular functions51. Whether FOSB can be spliced within platelets remains to be determined. V. Regulatory regions: 5′ and 3′ UTRs Serial analysis of gene expression (SAGE) in platelets previously demonstrated that the set of transcripts in platelets have longer untranslated regions (UTRs) than the set of transcripts found in other cells52. UTRs can also be assessed at the individual level. Figure 4A and 4B are examples of transcripts in platelets where the actual UTR does not match the predicted UTR. The 5 and 3 UTRs regulate RNA stability and ′ ′ protein translation53–55. Within the 5 UTR, secondary structure and primary sequence ′ 9 From www.bloodjournal.org by guest on June 18, 2017. For personal use only. motifs direct ribosome scanning and initiation of translation56. A kozak sequence within the 5 UTR that contains the AUG start codon flags the ribosome to start translation of ′ the open reading frame (ORF). Premature upstream ORFs (uORFs) regulate the efficiency of initiation at the coding ORF and secondary structures in 5 UTR’s further ′ affect ribosome loading and scanning efficiency57. Proteins and small RNAs (i.e. microRNAs) bind to sequence motifs within the 5 UTR and 3 UTR to regulate ′ ′ translation or degradation53,54,56,58. In platelets for example, the 5 UTR of B cell ′ lymphoma 3 (BCL3) controls translation of the mRNA in an mTOR (mammalian Target of Rapamycin) dependent manner59. Given the many sequence dependent effects of the UTR on RNA stability and translation, the predicted biological relevance of an annotated UTR sequence versus the actual UTR expressed in platelets could be profoundly different. GFI1B controls megakaryocyte development60, and GFI1B mutants can cause diseases of abnormal platelet formation and function like Gray Platelet Syndrome (GPS)61. In Figure 4A, the annotated 5 UTR of GFI1B covers approximately 350 bp of ′ exons one and two. RNA-seq reads in platelets suggests that transcription begins an additional 300 base pairs upstream from the annotated transcripts (Figure 4A, upper panel). The preponderance of 5 terminally tagged reads (Figure 4A, lower panel) that ′ map nearly exclusively to the start of the extended region suggests a dedicated transcription start site rather than aberrant run-on transcription. The additional 5 ′ sequence contains 3 short upstream open reading frames. miRNAs regulate RNA stability and protein translation by binding nearly complementary target sequences in the 3 UTR of transcripts62. In platelets miRNAs are ′ 10 From www.bloodjournal.org by guest on June 18, 2017. For personal use only. abundant and stable63,64. They have been reported as biomarkers and as functional modifiers of platelets18,19,65. Interestingly, platelets can transfer miRNAs to regulate targets in other cells15–17. miRNA studies in platelets are increasingly popular and are the subject of several reviews18,19,66,67. miRNA studies incorporate target prediction algorithms based on 3 UTR ′ annotations to identify potential microRNA-mRNA pairs68. Differences in 3 -UTR ′ annotations may affect the outcome of such studies. The P2Y purogenic receptor 1 (P2RY1) accounts for approximately 1/3 of adenosine diphosphate (ADP) receptors on platelets69. Stimulation of P2RY1 by ADP induces intracellular calcium fluxes and platelet aggregation70,71. P2RY1 is a potential target of anti-thrombotics69,71 and differences in P2RY1 expression alter bleeding times and thrombosis in mice72. In Figure 4B the annotated 3 UTR of P2RY1 is nearly 1 kb, whereas the actual 3 UTR in ′ ′ platelets is considerably longer. According to miRNA target prediction algorithms (miRanda73), this additional 2.5 kb harbors additional miRNA binding sites (Figure 4B). This indicates that actual transcript data, not predicted annotation data, should be used for miRNA target site predictions in platelets. Sequences in view can be extracted directly in UCSC genome browser by selecting “DNA” in the “view” menu. VI. Changing landscapes: alternative splice variants SNP association studies74 and mechanistic studies of platelet endothelial aggregation receptor 1 (PEAR1) link it strongly to an aggregatory role in platelets75. As illustrated in Figure 5A, RNA-seq distinguishes between multiple annotations for PEAR1. Alternative splicing affects nearly every multi-exon eukaryotic transcript76. 11 From www.bloodjournal.org by guest on June 18, 2017. For personal use only. Exon skipping, intron retention, alternate exon usage, and addition or truncation of exons are examples of alternative splicing76,77. Many alternative splice variants lead to NMD36. Other splice variants regulate protein synthesis or code for alternate protein products. Splice variants of the same gene can effect opposing functions, and some variants may alter physiologic responses to drugs78. Proteomics detects many novel protein isoforms in platelets. In one proteomics study79, a number of proteins with novel exon skipping events were identified. Three of these, ITGA2 (integrin, α2), FH (fumarate hydratase), and NPEPPS (aminopeptidase puromycin sensitive) were further validated by PCR at the RNA level79. Interestingly, activation altered the number of exon skipping events79. We have seen how alternate initiation and termination can affect regulatory elements within UTRs. Alternative splicing is another way to alter the 5 and 3 UTR. As ′ ′ shown in Figure 5B, RNA-seq can distinguish tissue factor pathway inhibitor (TFPI) splice variants in platelets. TFPI inhibits tissue factor activity and thereby limits excessive coagulation80. Multiple splice variants have been characterized for TFPI. Megakaryocytes and platelets express TFPI alpha, one of 2 major well characterized isoforms80 produced by alternative splicing of the 5 and 3 end of the mRNA. Part of the ′ ′ 5 UTR in exon 2, which can be removed by alternative splicing80,81, represses TFPI ′ translation. On the other hand, the 3 UTR of the TFPI alpha isoform in platelets relieves ′ exon 2 mediated repression81. The proteins produced by TFPI splice variants are also different. The C-termini of the TFPI isoforms differ in tissue factor inhibitory activity82. Furthermore TPFI alpha is soluble and not linked to the cell surface like TFPI beta82. 12 From www.bloodjournal.org by guest on June 18, 2017. For personal use only. The production of a soluble versus membrane bound form of proteins is a recurring theme of alternative splicing in platelets. Soluble P-selectin is shed from platelets after activation, mediated in part by extra-cellular cleavage83. Alternative production of soluble P-selectin may also be involved. cDNA sequencing predicted the presence of a soluble splice variant of P-selectin84–86. Correspondingly, soluble P-selectin is found in the non-membrane fraction of platelets, totaling up to 10% of total P-selectin levels87. Splice variants of integrin beta388 code for a truncated protein, with the transmembrane and cytoplasmic domains missing. Protein assays have confirmed the expression of truncated ITGB3 (integrin, beta 3) protein in platelets88. Platelets contain multiple forms of the low affinity immunoglobulin Fc region receptor IIa (FCGRIIa) mRNA. Variants with and without the transmembrane exon are detected at approximately equal proportions in platelets89. Several other notable splice variants with relevance to platelet function have been reported. A variant of cyclooxygenase 2a (COX2a) is increased 200-fold after coronary artery bypass grafting90. Variants of PLC-beta (phospholipase C, beta 2) code for two different proteins in platelets91. An angiopoietin-1 splice variant that opposes its parent gene is found in platelet alpha granules92. These examples only scratch the surface of the extensive collection of splice variants found in platelets. Some platelet specific splice variants, including novel exons, have yet to be annotated within major databases. Such is the case with growth factor independent IB (GFI1B, Figure 5C). VII. Switchbacks: antisense and non-coding RNA 13 From www.bloodjournal.org by guest on June 18, 2017. For personal use only. Non-coding RNAs regulate the production of target mRNAs, the translation of target mRNAs into protein, or directly modulate the target proteins themselves. Platelets contain all major classes of non-coding RNAs including miRNAs (see above), lincRNAs pseudogenes, and anti-sense RNA. Of these, platelets are particularly enriched in miRNAs, pseudogenes, and anti-sense RNAs21. In eukaryotic cells, a major fraction of expressed genes are accompanied by an antisense93. Antisense transcripts regulate their complementary counterpart in numerous ways. Antisense RNAs mask miRNA binding sites, destabilize transcripts, repress translation, and promote or repress transcription93,94. They mediate both cytoplasmic and nuclear RNA-RNA and RNA-DNA interactions93. Figure 6 depicts an example antisense RNA found at the 5 end of cluster of differentiation (CD109). CD109 ′ is a highly expressed human platelet alloantigen95. In this example, the antisense to CD109 begins at the 5 end of CD109. ′ VIII. A sequence level view: SNPs, insertions, and deletions By zooming to sequence level, individual SNPs (single nucleotide polymorphisms), insertions, and deletions that are within the expressed transcript can be identified by individual RNA-seq reads. Searching for “platelet” within the “clinical synopsis” field within Online Mendelian Inheritance in Man (OMIM, http://omim.org/, April 4, 2014 update) retrieves 79 different inherited platelet disorders, corresponding to 59 different defective genes. The inherited platelet disorders include well characterized platelet gene/phenotypes such us ITGA2B/Glanzmann Thrombasthenia (GT)96 or NBEAL2/ 14 From www.bloodjournal.org by guest on June 18, 2017. For personal use only. GPS97–99. Clicking on the genome coordinates (GRCh37) listed within any of these disorders, while the RNA-seq platelet data UCSC genome browser window is open, will bring you directly to the location listed overlaid with read sequence information. Of note, visualization of mutations in RNA-seq read sequences from a patient with GPS helped characterize a splicing defect in their NBEAL2 transcripts97. Genome Wide Association Studies (GWAS) studies have identified SNPs near more than 40 different genes associated with platelet size, count, or function100–104. Meta-analyses have extended this number to around 80 (for a convenient list, see101). In GWAS, a trait associated SNP serves as a regional marker of allelic association without necessarily being the cause of the trait105,106. Figure 7 depicts a reported GWAS common variant associated with platelet counts, RS6065100,107, that is found within the RNA-seq read sequences of GP1BA compared to another individual without the variant. As we understand more about the meaning of GWAS SNPs, the ability to link RNA transcripts with marker SNPs and the causative mutations of trait variation may become important. Perspective After completing this tour, it should be obvious that platelets have a complex mRNA signature that reflects and affects their function. Indeed, mRNA expression profiling was recently used to identify the molecular basis for differences in platelet reactivity between blacks and whites7. The study found by microarray that PCTP mRNA is four-fold higher in blacks than in whites. This correlated with PCTP protein levels and 15 From www.bloodjournal.org by guest on June 18, 2017. For personal use only. platelet reactivity to PAR4. Another RNA, miR-376c, inversely correlated with platelet reactivity and regulated PCTP expression. This study is mentioned to underscore how platelet mRNA can be leveraged to identify and understand the molecular mechanisms that control human disease. Of note, the RNA-seq reads from 5 black and 5 white individuals platelets were subsequently published22. In this dataset (which is site 4 of Table I in this review), consistent with their microarray findings, PCTP mRNA is seven fold higher in blacks than in whites22. The intrinsic functional capacity and variability of platelets are determined by environmental signals and genetic factors seen by the megakaryocyte. As has been done for other cells108, integrating platelet and megakarycoyte RNA-seq with whole genome sequencing or genome wide microarray genotyping will provide valuable information regarding the relationship between the environment and genetics of altered gene expression within platelets. In addition to expression, RNA-seq analysis provides a tool for examining transcripts, and their plausible roles in platelet function and disease, at several other layers within the platelet terrain including the 5 cap and poly-A tail, the ′ 5 and 3 UTRs, intron retention, alternative splicing, strand, and single nucleotide ′ ′ sequences (summarized in Figure 8). In conjunction with published reports, we predict that forthcoming RNA-seq datasets will provide unprecedented opportunities to assess both “what’s in a platelet” and “what’s different about platelets in disease”. 16 From www.bloodjournal.org by guest on June 18, 2017. For personal use only. Authorship Contribution: S.S. and J.W.R. drafted and prepared the manuscript; A.S.W. reviewed and edited the manuscript. Conflict-of-interest disclosure: The authors declare no competing financial interests. Acknowledgements We are grateful for the contributions of Diana Lim in preparation of the figures. Work in this report was supported by NIH awards 1U54 HL112311-01, 1K01 GM103806-01, and 2R01 HL066277-11 as well as by the German Research Foundation SCHU 2561/1-1. 17 From www.bloodjournal.org by guest on June 18, 2017. For personal use only. Site Ref. Descripton: Web Link > Access instructions Coverage track: https://hci-bioapp.hci.utah.edu/gnomex/gnomexFlex.jsp?dataTrackNumber=DT1837 > select “UCSC Browser” ** Bam (individual reads and sequences) track: https://hci-bio2 Rowley app.hci.utah.edu/gnomex/gnomexFlex.jsp?dataTrackNumber=DT11785 > et. al.1 select “UCSC Browser” ** 3 Bray et. Coverage tracks and individual reads tracks: https://cm.jefferson.edu/data-tools-downloads/platelets-2012/ al.21 > select “A” to enter UCSC Browser** Coverage tracks and individual reads tracks: 4 Londin et. al.22 https://cm.jefferson.edu/data-tools-downloads/platelets-2014/ > select “A” to enter UCSC Browser** **Tips for UCSC Genome Browser: 1 • • • • • • • • Rowley et. al.1 After entering UCSC’s website, select “Genome Browser”. Search for a gene (“GP9”), location (“chr3:128,779,645-128,781,253”), or term of interest (“rs6065”). Scroll down to see the available tracks. The platelet tracks can be found under the header “Custom Tracks” for sites 1 and 2 or under the header “Mapping and Sequencing Tracks” for sites 3 and 4. Using the toggle boxes, set the visibility settings to “Full” for the platelet data track(s) of interest (i.e. “B2_Pltlts_...”). Set visibility to “Full” for one of the Gene Predictions tracks (i.e. RefSeq). “Hide” other tracks and “zoom” to view the data as presented in the figures. Select “Refresh” after making any changes to update the viewer. Other datasets, including mouse datasets are accessible from within Gnomex by searching for “platelet” in the search box at the top. Alternatively, the .Bam and .Bai index files can be downloaded from Gnomex and opened in a genome browser such as IGV109. If the links don’t initially work, try a different browser. For more information on using UCSC genome browser, see: https://genome.ucsc.edu/training.html. Table I: Links for viewing platelet RNA-seq datasets in UCSC Genome Browser 18 From www.bloodjournal.org by guest on June 18, 2017. For personal use only. Reference Rowley et. al.1 Bray e.t al.21 Londin et. al.22 Kissopoulou et. al.20 Platform Illumina ABI/LT SOLiD™ ABI/LT SOLiD™ Illumina # Samples Species Gender Race Leukocyte depletion Reads Sample prep 4 2 human/2 mouse 1 M/1 F of each species ND CD45 (Miltenyi) 36 bp, paired end, unstranded, Poly-A+ RNA 4 10 4 human human human ND 10 M 3 M/1 F ND 5 black, 5 white ND CD45 (Miltenyi) CD45 (Miltenyi) Pan-Leuko (Dynabeads) 50 bp, single end, stranded (30 bp stranded small RNA reads also available) Total RNA and ribosomal RNA depleted RNA 50 bp, single end, stranded Total RNA Table II: Published platelet RNA-seq (long-RNA) datasets 19 100 bp, single end, unstranded 100 bp, paired end, stranded Ribosomal RNA depleted and poly-A+/5’ end tagged RNA From www.bloodjournal.org by guest on June 18, 2017. For personal use only. References 1. Rowley JW, Oler A, Tolley ND, et al. Genome wide RNA-seq analysis of human and mouse platelet transcriptomes. Blood. 2011;118(14):e101–11. 2. Rowley JW, Schwertz H, Weyrich AS. Platelet mRNA: the meaning behind the message. Curr. Opin. Hematol. 2012;19(5):385–91. 3. Lood C, Amisten S, Gullstrand B, et al. Platelet transcriptional profile and protein expression in patients with systemic lupus erythematosus: up-regulation of the type I interferon system is strongly associated with vascular disease. Blood. 2010;116(11):1951–7. 4. Healy AM, Pickard MD, Pradhan AD, et al. Platelet expression profiling and clinical validation of myeloid-related protein-14 as a novel determinant of cardiovascular events. Circulation. 2006;113(19):2278–84. 5. Goodall AH, Burns P, Salles I, et al. Transcription profiling in human platelets reveals LRRFIP1 as a novel protein regulating platelet function. Blood. 2010;116(22):4646–4656. 6. Simon LM, Edelstein LC, Nagalla S, et al. Human platelet microRNA-mRNA networks associated with age and gender revealed by integrated plateletomics. Blood. 2014; 7. Edelstein LC, Simon LM, Montoya RT, et al. Racial differences in human platelet PAR4 reactivity reflect expression of PCTP and miR-376c. Nat. Med. 2013;19(12):1609–16. 8. Plé H, Maltais M, Corduan A, et al. Alteration of the platelet transcriptome in chronic kidney disease. Thromb. Haemost. 2012;108(4): 9. McManus DD, Beaulieu LM, Mick E, et al. Relationship among circulating inflammatory proteins, platelet gene expression, and cardiovascular risk. Arterioscler. Thromb. Vasc. Biol. 2013;33(11):2666–73. 10. Freedman JE, Larson MG, Tanriverdi K, et al. Relation of platelet and leukocyte inflammatory transcripts to body mass index in the Framingham heart study. Circulation. 2010;122(2):119–29. 11. Raghavachari N, Xu X, Harris A, et al. Amplified expression profiling of platelet transcriptome reveals changes in arginine metabolic pathways in patients with sickle cell disease. Circulation. 2007;115(12):1551–62. 12. Sun L, Gorospe JR, Hoffman EP, Rao AK. Decreased platelet expression of myosin regulatory light chain polypeptide (MYL9) and other genes with platelet 20 From www.bloodjournal.org by guest on June 18, 2017. For personal use only. dysfunction and CBFA2/RUNX1 mutation: insights from platelet expression profiling. J. Thromb. Haemost. 2007;5(1):146–54. 13. Gnatenko D V, Zhu W, Xu X, et al. Class prediction models of thrombocytosis using genetic biomarkers. Blood. 2010;115(1):7–14. 14. Risitano A, Beaulieu LM, Vitseva O, Freedman JE. Platelets and platelet-like particles mediate intercellular RNA transfer. Blood. 2012;119(26):6288–95. 15. Clancy L, Freedman JE. New paradigms in thrombosis: novel mediators and biomarkers platelet RNA transfer. J. Thromb. Thrombolysis. 2014;37(1):12–6. 16. Laffont B, Corduan A, Plé H, et al. Activated platelets can deliver mRNA regulatory Ago2•microRNA complexes to endothelial cells via microparticles. Blood. 2013;122(2):253–61. 17. Gidlöf O, van der Brug M, Ohman J, et al. Platelets activated during myocardial infarction release functional miRNA, which can be taken up by endothelial cells and regulate ICAM1 expression. Blood. 2013;121(19):3908–17, S1–26. 18. Edelstein LC, McKenzie SE, Shaw C, et al. MicroRNAs in platelet production and activation. J. Thromb. Haemost. 2013;11 Suppl 1:340–50. 19. Dangwal S, Thum T. MicroRNAs in platelet physiology and pathology. Hamostaseologie. 2013;33(1):17–20. 20. Kissopoulou A, Jonasson J, Lindahl TL, Osman A. Next generation sequencing analysis of human platelet PolyA+ mRNAs and rRNA-depleted total RNA. PLoS One. 2013;8(12):e81809. 21. Bray PF, McKenzie SE, Edelstein LC, et al. The complex transcriptional landscape of the anucleate human platelet. BMC Genomics. 2013;14(1):1. 22. Londin ER, Hatzimichael E, Loher P, et al. The human platelet: strong transcriptome correlations among individuals associate weakly with the platelet proteome. Biol. Direct. 2014;9(1):3. 23. Ozsolak F, Milos PM. RNA sequencing: advances, challenges and opportunities. Nat. Rev. Genet. 2011;12(2):87–98. 24. Wang Z, Gerstein M, Snyder M. RNA-Seq: a revolutionary tool for transcriptomics. Nat. Rev. Genet. 2009;10(1):57–63. 25. Costa V, Angelini C, De Feis I, Ciccodicola A. Uncovering the Complexity of Transcriptomes with RNA-Seq. J. Biomed. Biotechnol. 2010;2010:853916. 21 From www.bloodjournal.org by guest on June 18, 2017. For personal use only. 26. Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat. Methods. 2008;5(7):621–8. 27. Plé H, Landry P, Benham A, et al. The repertoire and features of human platelet microRNAs. PLoS One. 2012;7(12):e50746. 28. Kuhn RM, Haussler D, Kent WJ. The UCSC genome browser and associated tools. Brief. Bioinform. 2013;14(2):144–61. 29. Dillies M-A, Rau A, Aubert J, et al. A comprehensive evaluation of normalization methods for Illumina high-throughput RNA sequencing data analysis. Brief. Bioinform. 2012; 30. Garber M, Grabherr MG, Guttman M, Trapnell C. Computational methods for transcriptome annotation and quantification using RNA-seq. Nat. Methods. 2011;8(6):469–77. 31. Carmody SR, Wente SR. mRNA nuclear export at a glance. J. Cell Sci. 2009;122(Pt 12):1933–7. 32. Shatkin AJ, Manley JL. The ends of the affair: capping and polyadenylation. Nat. Struct. Biol. 2000;7(10):838–42. 33. Kong J, Lasko P. Translational control in cellular and developmental processes. Nat. Rev. Genet. 2012;13(6):383–94. 34. Norbury CJ. Cytoplasmic RNA: a case of the tail wagging the dog. Nat. Rev. Mol. Cell Biol. 2013;14(10):643–53. 35. Schoenberg DR, Maquat LE. Regulation of cytoplasmic mRNA decay. Nat. Rev. Genet. 2012;13(4):246–59. 36. Lewis BP, Green RE, Brenner SE. Evidence for the widespread coupling of alternative splicing and nonsense-mediated mRNA decay in humans. Proc. Natl. Acad. Sci. U. S. A. 2003;100(1):189–92. 37. Roth GJ, Hickey MJ, Chung DW, Hickstein DD. Circulating human blood platelets retain appreciable amounts of poly (A)+ RNA. Biochem. Biophys. Res. Commun. 1989;160(2):705–10. 38. Houng AK, Maggini L, Clement CY, Reed GL. Identification and structure of activated-platelet protein-1, a protein with RNA-binding domain motifs that is expressed by activated platelets. Eur. J. Biochem. 1997;243(1-2):209–18. 22 From www.bloodjournal.org by guest on June 18, 2017. For personal use only. 39. Schubert P, Devine D V. De novo protein synthesis in mature platelets: a consideration for transfusion medicine. Vox Sang. 2010;99(2):112–22. 40. Choi YH, Hagedorn CH. Purifying mRNAs with a high-affinity eIF4E mutant identifies the short 3’ poly(A) end phenotype. Proc. Natl. Acad. Sci. U. S. A. 2003;100(12):7033–8. 41. Marzluff WF, Wagner EJ, Duronio RJ. Metabolism and regulation of canonical histone mRNAs: life without a poly(A) tail. Nat. Rev. Genet. 2008;9(11):843–54. 42. Pickrell JK, Marioni JC, Pai AA, et al. Understanding mechanisms underlying human gene expression variation with RNA sequencing. Nature. 2010;464(7289):768–72. 43. Osei-Bimpong A. The effect of storage on the clinical utility of the immature platelet fraction. Hematology. 2009;14(2):118–21. 44. Nishiyama M, Hayashi S, Kabutomori O, et al. [Effects of anticoagulants and storage temperature on immature platelet fraction % (IPF%) values in stored samples measured by the automated hematology analyzer, XE-5000--utility of CTAD-anticoagulation and room temperature storage]. Rinsho Byori. 2011;59(5):452–8. 45. Stiegler G, Fischer G, Ramanathan G, et al. P-selectin mRNA is maintained in platelet concentrates stored at 4 degrees C. Transfusion. 2009;49(5):921–7. 46. Blache D, Becchi M, Davignon J. Occurrence and biological effects of cholesteryl sulfate on blood platelets. Biochim. Biophys. Acta. 1995;1259(3):291–6. 47. Yanai I, Benjamin H, Shmoish M, et al. Genome-wide midrange transcription profiles reveal expression level relationships in human tissue specification. Bioinformatics. 2005;21(5):650–9. 48. Denis MM, Tolley ND, Bunting M, et al. Escaping the nuclear confines: signaldependent pre-mRNA splicing in anucleate platelets. Cell. 2005;122(3):379–91. 49. Schwertz H, Tolley ND, Foulks JM, et al. Signal-dependent splicing of tissue factor pre-mRNA modulates the thrombogenicity of human platelets. J. Exp. Med. 2006;203(11):2433–40. 50. Münzer P, Tolios A, Pelzl L, et al. Thrombin-sensitive expression of the store operated Ca(2+) channel Orai1 in platelets. Biochem. Biophys. Res. Commun. 2013;436(1):25–30. 51. Milde-Langosch K. The Fos family of transcription factors and their role in tumourigenesis. Eur. J. Cancer. 2005;41(16):2449–61. 23 From www.bloodjournal.org by guest on June 18, 2017. For personal use only. 52. Dittrich M, Birschmann I, Pfrang J, et al. Analysis of SAGE data in human platelets: features of the transcriptome in an anucleate cell. Thromb. Haemost. 2006;95(4):643–51. 53. Jia J, Yao P, Arif A, Fox PL. Regulation and dysregulation of 3’UTR-mediated translational control. Curr. Opin. Genet. Dev. 2013;23(1):29–34. 54. Matoulkova E, Michalova E, Vojtesek B, Hrstka R. The role of the 3’ untranslated region in post-transcriptional regulation of protein expression in mammalian cells. RNA Biol. 2012;9(5):563–76. 55. Rosenberg BR, Hamilton CE, Mwangi MM, Dewell S, Papavasiliou FN. Transcriptome-wide sequencing reveals numerous APOBEC1 mRNA-editing targets in transcript 3’ UTRs. Nat. Struct. Mol. Biol. 2011;18(2):230–6. 56. Van der Velden AW, Thomas AA. The role of the 5’ untranslated region of an mRNA in translation regulation during development. Int. J. Biochem. Cell Biol. 1999;31(1):87–106. 57. Barbosa C, Peixeiro I, Romão L. Gene expression regulation by upstream open reading frames and human disease. PLoS Genet. 2013;9(8):e1003529. 58. Lai EC. Micro RNAs are complementary to 3’ UTR sequence motifs that mediate negative post-transcriptional regulation. Nat. Genet. 2002;30(4):363–4. 59. Weyrich AS, Denis MM, Schwertz H, et al. mTOR-dependent synthesis of Bcl-3 controls the retraction of fibrin clots by activated human platelets. Blood. 2007;109(5):1975–83. 60. Randrianarison-Huetz V, Laurent B, Bardet V, et al. Gfi-1B controls human erythroid and megakaryocytic differentiation by regulating TGF-beta signaling at the bipotent erythro-megakaryocytic progenitor stage. Blood. 2010;115(14):2784– 95. 61. Monteferrario D, Bolar NA, Marneth AE, et al. A dominant-negative GFI1B mutation in the gray platelet syndrome. N. Engl. J. Med. 2014;370(3):245–53. 62. Filipowicz W, Bhattacharyya SN, Sonenberg N. Mechanisms of posttranscriptional regulation by microRNAs: are the answers in sight? Nat. Rev. Genet. 2008;9(2):102–14. 63. Landry P, Plante I, Ouellet DL, et al. Existence of a microRNA pathway in anucleate platelets. Nat. Struct. Mol. Biol. 2009;16(9):961–6. 24 From www.bloodjournal.org by guest on June 18, 2017. For personal use only. 64. Stratz C, Nührenberg TG, Binder H, et al. Micro-array profiling exhibits remarkable intra-individual stability of human platelet micro-RNA. Thromb. Haemost. 2012;107(4):634–41. 65. Nagalla S, Shaw C, Kong X, et al. Platelet microRNA-mRNA coexpression profiles correlate with platelet reactivity. Blood. 2011;117(19):5189–97. 66. Edelstein LC, Bray PF. Small RNAs as potential platelet therapeutics. Handb. Exp. Pharmacol. 2012;(210):435–45. 67. Mayr M, Zampetaki A, Willeit P, Willeit J, Kiechl S. MicroRNAs within the continuum of postgenomics biomarker discovery. Arterioscler. Thromb. Vasc. Biol. 2013;33(2):206–14. 68. Peterson SM, Thompson JA, Ufkin ML, et al. Common features of microRNA target prediction tools. Front. Genet. 2014;5:23. 69. Smyth SS, Woulfe DS, Weitz JI, et al. G-protein-coupled receptors as signaling targets for antiplatelet therapy. Arterioscler. Thromb. Vasc. Biol. 2009;29(4):449– 57. 70. Jin J, Daniel JL, Kunapuli SP. Molecular basis for ADP-induced platelet activation. II. The P2Y1 receptor mediates ADP-induced intracellular calcium mobilization and shape change in platelets. J. Biol. Chem. 1998;273(4):2030–4. 71. Hechler B, Gachet C. P2 receptors and platelet function. Purinergic Signal. 2011;7(3):293–303. 72. Fabre JE, Nguyen M, Latour A, et al. Decreased platelet aggregation, increased bleeding time and resistance to thromboembolism in P2Y1-deficient mice. Nat. Med. 1999;5(10):1199–202. 73. John B, Enright AJ, Aravin A, et al. Human MicroRNA targets. PLoS Biol. 2004;2(11):e363. 74. Johnson AD, Yanek LR, Chen M-H, et al. Genome-wide meta-analyses identifies seven loci associated with platelet aggregation in response to agonists. Nat. Genet. 2010;42(7):608–13. 75. Kauskot A, Di Michele M, Loyen S, et al. A novel mechanism of sustained platelet IIb 3 activation via PEAR1. Blood. 2012;119(17):4056–65. α 76. β Wang ET, Sandberg R, Luo S, et al. Alternative isoform regulation in human tissue transcriptomes. Nature. 2008;456(7221):470–6. 25 From www.bloodjournal.org by guest on June 18, 2017. For personal use only. 77. Pan Q, Shai O, Lee LJ, Frey BJ, Blencowe BJ. Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing. Nat. Genet. 2008;40(12):1413–5. 78. Passetti F, Ferreira CG, Costa FF. The impact of microRNAs and alternative splicing in pharmacogenomics. Pharmacogenomics J. 2009;9(1):1–13. 79. Power KA, McRedmond JP, de Stefani A, Gallagher WM, Gaora PO. Highthroughput proteomics detection of novel splice isoforms in human platelets. PLoS One. 2009;4(3):e5001. 80. Wood JP, Ellery PER, Maroney SA, Mast AE. Biology of tissue factor pathway inhibitor. Blood. 2014; 81. Ellery PER, Maroney SA, Martinez ND, Wickens MP, Mast AE. Translation of human tissue factor pathway inhibitor- mRNA is controlled by alternative splicing within the 5’ untranslated region. Arterioscler. Thromb. Vasc. Biol. 2014;34(1):187–95. β 82. Maroney SA, Ellery PE, Wood JP, et al. Comparison of the inhibitory activities of human tissue factor pathway inhibitor (TFPI) and TFPI . J. Thromb. Haemost. 2013;11(5):911–8. α β 83. Fox JE. Shedding of adhesion receptors from the surface of activated platelets. Blood Coagul. Fibrinolysis. 1994;5(2):291–304. 84. McEver RP. Properties of GMP-140, an inducible granule membrane protein of platelets and endothelium. Blood Cells. 1990;16(1):73–80; discussion 80–3. 85. Johnston GI, Bliss GA, Newman PJ, McEver RP. Structure of the human gene encoding granule membrane protein-140, a member of the selectin family of adhesion receptors for leukocytes. J. Biol. Chem. 1990;265(34):21381–5. 86. Ishiwata N, Takio K, Katayama M, et al. Alternatively spliced isoform of P-selectin is present in vivo as a soluble molecule. J. Biol. Chem. 1994;269(38):23708–15. 87. Semenov A V, Romanov YA, Loktionova SA, et al. Production of soluble Pselectin by platelets and endothelial cells. Biochem. Biokhimiia. 1999;64(11):1326–35. 88. Djaffar I, Chen YP, Creminon C, et al. A new alternative transcript encodes a 60 kDa truncated form of integrin beta 3. Biochem. J. 1994;300 ( Pt 1:69–74. 89. Markovic B, Wu Z, Chesterman CN, Chong BH. Quantitation of soluble and membrane-bound Fc gamma RIIA (CD32A) mRNA in platelets and megakaryoblastic cell line (Meg-01). Br. J. Haematol. 1995;91(1):37–42. 26 From www.bloodjournal.org by guest on June 18, 2017. For personal use only. 90. Censarek P, Steger G, Paolini C, et al. Alternative splicing of platelet cyclooxygenase-2 mRNA in patients after coronary artery bypass grafting. Thromb. Haemost. 2007;98(6):1309–15. 91. Sun L, Mao G, Kunapuli SP, Dhanasekaran DN, Rao AK. Alternative splice variants of phospholipase C-beta2 are expressed in platelets: effect on Galphaqdependent activation and localization. Platelets. 2007;18(3):217–23. 92. Huang YQ, Li JJ, Karpatkin S. Identification of a family of alternatively spliced mRNA species of angiopoietin-1. Blood. 2000;95(6):1993–9. 93. Faghihi MA, Wahlestedt C. Regulatory roles of natural antisense transcripts. Nat. Rev. Mol. Cell Biol. 2009;10(9):637–43. 94. Carrieri C, Cimatti L, Biagioli M, et al. Long non-coding antisense RNA controls Uchl1 translation through an embedded SINEB2 repeat. Nature. 2012;491(7424):454–7. 95. Curtis BR, McFarland JG. Human platelet antigens - 2013. Vox Sang. 2014;106(2):93–102. 96. Nurden AT, Caen JP. An abnormal platelet glycoprotein pattern in three cases of Glanzmann’s thrombasthenia. Br. J. Haematol. 1974;28(2):253–60. 97. Kahr WH, Hinckley J, Li L, et al. Mutations in NBEAL2, encoding a BEACH protein, cause gray platelet syndrome. Nat. Genet. 2011;43(8):738–40. 98. Gunay-Aygun M, Falik-Zaccai TC, Vilboux T, et al. NBEAL2 is mutated in gray platelet syndrome and is required for biogenesis of platelet -granules. Nat. Genet. 2011;43(8):732–4. α 99. Albers CA, Cvejic A, Favier R, et al. Exome sequencing identifies NBEAL2 as the causative gene for gray platelet syndrome. Nat. Genet. 2011;43(8):735–7. 100. Gieger C, Radhakrishnan A, Cvejic A, et al. New gene functions in megakaryopoiesis and platelet formation. Nature. 2011;480(7376):201–8. 101. Bunimov N, Fuller N, Hayward CPM. Genetic loci associated with platelet traits and platelet disorders. Semin. Thromb. Hemost. 2013;39(3):291–305. 102. Geisler T, Schaeffeler E, Gawaz M, Schwab M. Genetic variation of platelet function and pharmacology: an update of current knowledge. Thromb. Haemost. 2013;110(5):876–87. 103. Kunicki TJ, Nugent DJ. The genetics of normal platelet reactivity. Blood. 2010;116(15):2627–34. 27 From www.bloodjournal.org by guest on June 18, 2017. For personal use only. 104. Kunicki TJ, Williams SA, Nugent DJ. Genetic variants that affect platelet function. Curr. Opin. Hematol. 2012;19(5):371–9. 105. Park J-H, Wacholder S, Gail MH, et al. Estimation of effect size distribution from genome-wide association studies and implications for future discoveries. Nat. Genet. 2010;42(7):570–5. 106. Brookfield JF. Q&A: Promise and pitfalls of genome-wide association studies. BMC Biol. 2010;8(1):41. 107. Kamatani Y, Matsuda K, Okada Y, et al. Genome-wide association study of hematological and biochemical traits in a Japanese population. Nat. Genet. 2010;42(3):210–5. 108. Lappalainen T, Sammeth M, Friedländer MR, et al. Transcriptome and genome sequencing uncovers functional variation in humans. Nature. 2013;501(7468):506–11. 109. Thorvaldsdóttir H, Robinson JT, Mesirov JP. Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration. Brief. Bioinform. 2013;14(2):178–92. 28 From www.bloodjournal.org by guest on June 18, 2017. For personal use only. Figure 1. A tour through the transcriptional landscape of a platelet. This tour through the transcriptional landscape in platelets is divided into 8 segments, which are listed in this figure. Figure 2. General features of an mRNA transcript as visualized by RNA-seq. This figure shows RNA-seq read coverage plots and individual reads for platelet Glycoprotein 9 (GP9) mRNA as it appears (i.e. with introns removed) in the platelet (A) or as viewed along genomic coordinates in a genome browser (B). The blue annotations along the bottom are the consensus RefSeq annotations of the gene structure. Individual exons are represented by the blue bars, and introns (only in B) by the connecting blue lines. The thicker regions of the bars distinguish the coding region from the 5 and 3 UTRs. Arrows indicate the strand orientation (forward [+] or reverse [′ ′ ]) of the annotation. For GP9, there are 3 exons and 2 introns. In GP9, the 5 UTR ends ′ just beyond the start of the 3rd exon (at the ATG start codon of the coding region), and the 3 UTR starts near the end of the 3rd exon (after the TAA stop codon). Above the ′ annotation are the individual reads that align to the transcript. For simplicity, not all reads are shown here. For visualization purposes, the individual reads are colored according to which exon they align to. Above the reads is the read coverage map. This is generated by plotting the sums of mapped reads that overlap (cover) each nucleotide position along the genomic coordinates (x-axis). The y-axis represents the number of reads. The examples are as illustrated in Integrated Genome Viewer (IGV)109. See it yourself: To see GP9 in platelets via UCSC genome browser, follow the Web Link > 29 From www.bloodjournal.org by guest on June 18, 2017. For personal use only. Access instructions for Site 1 (see Table I). For individual reads, follow the Web Link > Access instructions for Site 2 (see Table I). Search for GP9 in the browser Figure 3. Platelet transcripts are capped, tailed, and spliced. A. depicts the techniques we have used to capture and concentrate poly-A mRNA or 5 capped mRNA ′ from platelets. The captured RNAs are concentrated by immunoprecipitation techniques. For poly-A pull down, bead conjugated oligo-dT sequence binds to the poly A tail. For the 5 cap pull down, a bead conjugated hyper-affinity mutant enzyme, binds ′ to the 7-methyl guanosine cap (7mG) cap at the 5 end of the mRNA. B. Bar graph of ′ the RNA-seq expression estimates (RPKM) of a histone transcript HIST1H4B and the transcript coding for the platelet thrombin receptor PAR1 (F2R) following mRNA isolation by a 5 cap or poly-A tail pull-down. C. RNA-seq coverage graph of FOSB, ′ which appears mostly unspliced in platelets. Note that reads align to both exons and introns. Compare this to GP9 in Figure 2 which has very few reads mapping to introns. Figure 4. Platelets transcripts have variable 5′ or 3′ UTRs. A. Shown are coverage graphs of the 5 end of GFI1b in platelets. The top panel represents coverage of all ′ reads mapping to the 5 end of GFI1b. The red box is drawn around the region that ′ maps beyond the known transcript annotation (blue). The bottom panels show only those reads that contain the sequence tag ligated onto the 5 end at time of sample ′ preparation (for details see Osman et. al.). The 5 tagged reads are only the extreme 5 ′ ′ terminal reads, and therefore mark the transcription start site. B. Coverage graph of the 30 From www.bloodjournal.org by guest on June 18, 2017. For personal use only. 3 end of P2RY1 in platelets. The red box is drawn around the region that maps beyond ′ the known 3 UTR annotation. microRNA target analysis of this additional sequence ′ identified several microRNA binding sites. A representative subset of the target sites are marked underneath the graph. See it yourself: To see GFI1b and P2RY1 in platelets via UCSC genome browser, follow the Web Link > Access instructions for Site 1 (see Table I). For individual reads, follow the Web Link > Access instructions for Site 2 (see Table I). Search for GFI1b or P2RY1 in the browser. Note that the 5 terminal ′ reads in the bottom panels of A were processed from the dataset published by Osman et. al.20 and are therefore not yet available in a browser friendly format. Figure 5. Platelets express mRNA splice variants. Shown are coverage graphs of PEAR1 (A) TFPI (B) GFI1B (C) in platelets. A. Note that there are four different Ensembl annotations listed underneath PEAR1. The red box is drawn around the second exon which distinguishes the second annotation from the other three annotations. B. The annotations for two major isoforms of TFPI, alpha (iso A) and beta (iso B), are distinguished by their 3 ends. Note that platelets are rich in isoform alpha ′ but not isoform beta. Exon 2 (exon 2 is very small, so it is difficult to see the annotation here), which can also be removed by alternative splicing, is also present in the platelet. C. The arrow points to a putative novel exon of GFI1b. Annotations for an exon at this region are not present in any of the Ensembl, UCSC, or RefSeq databases. The red box outlines examples of paired reads that map from the unannotated novel exon to the known second exon suggesting it is part of the GFI1B transcript. See it yourself: To see PEAR1, TFPI, and GFI1B in platelets via UCSC genome browser, follow the Web 31 From www.bloodjournal.org by guest on June 18, 2017. For personal use only. Link > Access instructions for Site 1 (see Table I). For individual reads, follow the Web Link > Access instructions for Site 2 (see Table I). Search for PEAR1, TFPI, or GFI1B in the browser. Figure 6. Antisense transcripts in platelets. Shown are positive and negative strand coverage graphs of the 5 end of CD109 and of AK124950, an antisense transcript in ′ platelets. Positive strand reads are on the top panel in blue, and negative strand reads are in the bottom panel in red. Both the expression of CD109, and the expression of an antisense transcript (AK124950, surrounded by the red box) transcribed from the opposite strand of DNA at the 5 end of CD109 can be inferred when the read ′ orientation is taken into account. See it yourself: To visualize CD109 and its antisense transcript in platelets via UCSC genome browser, follow the Web Link > Access instructions for Site 3 or 4 (see Table I). Search for CD109 or AK124950 in the browser. Note that only stranded data (only sites 3 or 4 of Table I) can distinguish antisense from sense transcripts. Figure 7. Single nucleotide resolution of GP1BA in platelets. Individual reads from platelets from two different individuals are shown. The arrow points to the location of the SNP rs6065, which associates with platelet counts in GWAS studies. Looking at the individual nucleotide sequences compared to the reference sequence indicates that the reads from the bottom individual match the reference cyotosine (reference matches are grayed out), whereas approximately half of the reads from the top individual contain the 32 From www.bloodjournal.org by guest on June 18, 2017. For personal use only. C to T polymorphism. See it yourself: To visualize rs6065 in GP1BA expressed in platelets via UCSC genome browser, follow the Web Link > Access instructions for Site 2 (see Table I). Search for rs6065 and zoom into the center of the region, or alternatively type chr17:4,836,371-4,836,392 in the search box. Figure 8. RNA-seq facilitates discovery in platelets. The table summarizes the various transcript features, and their respective functions, that are identifiable through RNA-seq in platelets. Each feature in platelets can be analyzed to provide mechanistic insights into platelet function (1), as biomarkers or modulators of disease (2), or for their role in megakaryocyte development and function (3). As an example, the vessel in 1 depicts platelets that express two different splice variants: a predominant three exon variant (purple-green-red) and a minor two exon variant (purple-red). In vessel 2, which is atherosclerotic, the predominant isoform in platelets is shifted to the two exon isoform that may serve as a biomarker for atherosclerosis and may additionally contribute to disease pathogenesis. As depicted in (3), a combination of genetics and environmental signals reaching the bone marrow megakaryocyte dictate the expression of the RNAs, and the features of RNA, that are captured as the platelets are formed. 33 A Tour Through the Transcriptional Landscape of a Platelet I. How to navigate the landscape: an RNA-seq overview II. A panoramic view of a platelet transcript as depicted by RNA-seq III. Hidden structures: the 5΄ cap and poly-A tail IV. Peaks and canyons: pre-mRNAs, splicing, and intron retention V. Regulatory regions: 5΄ and 3΄ UTRs VI. Changing landscapes: alternative splice variants VII. Switchbacks: antisense and non-coding RNA VIII. A sequence level view: SNPs, insertions, and deletions Figure 1 A GP9 mRNA Per base coverage (visual level of expression) Individual reads Known transcript annotation Exon 1 Exon 2 Coding region 5´ UTR B Exon 3 3´ UTR Chromosomal coordinate Per base coverage (visual level of expression) Individual reads Known gene annotation GP9 mRNA viewed in genome browser Intron Intron Exon 2 Exon 1 5´ UTR ATG Exon 3 Coding region TAA 3´ UTR Figure 2 B Oligo-dT Pull-down 7mG TTTTTTTTTT AAAAAAAAAAAAA Poly(A) tail 5´ Cap Pull Down 80 Capped Tailed 60 RPKM A 40 20 7mG AAAAAAAAAAAAA 0 HIST1H4B elF4EK119A (High affinity mutant*) F2R (PAR1) C FOSB Figure 3 GFI1b A Extended 5´ UTR GFI1b P2RY1 B P2RY1 Extended 3´ UTR miRNA20a-5p miRNA93-5p miRNA105-3p miRNA150-3p miRNA412-3p miRNA552-3p Figure 4 PEAR1 A B Exon 2 TFPI iso A iso B GFI1B C Paired reads Unannotated exon Figure 5 Positive strand reads ►►► CD109 ◄◄◄ Negative strand reads Antisense transcript Figure 6 GP1BA T T T T T T PLT count Associated GWAS SNP (rs6065) Figure 7 1. Insights into platelet function and phenotype Feature identified by RNA-seq Molecular function 5´ cap and poly-A tail Regulation of translation and RNA stability Unspliced pre-mRNA and intron retention Regulation of translation Extended and alternate 5´ and 3´ UTR Alternative regulatory motifs: uORFs, RNA binding proteins, and miRNA binding sites Known and novel alternative splice variants Regulation of translation, alternate functional protein isoforms, truncated proteins (i.e. secreted vs. membrane bound) Non-coding RNA and antisense transcripts Regulation of transcription and translation Insertions, deletions, single nucleotide mutations Altered functional proteins, altered protein expression 2. Biomarkers and regulators of disease 3. Insights into megakaryocyte development and function Bone marrow signals Figure 8 From www.bloodjournal.org by guest on June 18, 2017. For personal use only. Prepublished online June 5, 2014; doi:10.1182/blood-2014-04-512756 A tour through the transcriptional landscape of platelets Sebastian Schubert, Andrew S. Weyrich and Jesse W. Rowley Information about reproducing this article in parts or in its entirety may be found online at: http://www.bloodjournal.org/site/misc/rights.xhtml#repub_requests Information about ordering reprints may be found online at: http://www.bloodjournal.org/site/misc/rights.xhtml#reprints Information about subscriptions and ASH membership may be found online at: http://www.bloodjournal.org/site/subscriptions/index.xhtml Advance online articles have been peer reviewed and accepted for publication but have not yet appeared in the paper journal (edited, typeset versions may be posted when available prior to final publication). Advance online articles are citable and establish publication priority; they are indexed by PubMed from initial publication. Citations to Advance online articles must include digital object identifier (DOIs) and date of initial publication. Blood (print ISSN 0006-4971, online ISSN 1528-0020), is published weekly by the American Society of Hematology, 2021 L St, NW, Suite 900, Washington DC 20036. Copyright 2011 by The American Society of Hematology; all rights reserved.
© Copyright 2026 Paperzz