Biochimica et Biophysica Acta 1579 (2002) 203 – 206 www.bba-direct.com Short sequence-paper Sequence of goat cyclin T1 cDNA, gene organisation and expression analysis Xavier Mata *, Sead Taourit, Jean-Luc Vilotte Laboratoire de Génétique Biochimique et de Cytogénétique, INRA, 78352 Jouy-en-Josas Cedex, France Received 26 July 2002; received in revised form 20 September 2002; accepted 25 September 2002 Abstract The cyclin T1 (Cyc T1) protein has been recently identified, associated with the cyclin-dependent kinase 9 (CDK 9), as to be involved in the transcriptional activation of the Human Immunodeficiency Virus type 1 (HIV-1) by the Tat protein. In this study, the sequence of the 7 kb goat Cyc T1 cDNA is reported as well as the exon/intron structure of the gene. Its observed ubiquitous expression is consistent with the promoter structure. D 2002 Elsevier Science B.V. All rights reserved. Keywords: Cyclin T1; HIV-1; cDNA; Gene structure; Promoter analysis; Goat The cyclin T1 (Cyc T1) protein was recently identified as being a co-factor for the Human Immunodeficiency Virus type 1 (HIV-1) Tat Protein, in association with the cyclindependent kinase 9 (CDK 9) [1,2]. This complex is involved in the RNA pol II transcription elongation. The interaction of the Tat protein with the Cyc T1/CDK 9 complex strongly increases the affinity and specificity of the Tat protein for the cis-acting transcription response element (TAR), making a stable RNA stem-loop structure at the 5Vend of nascent viral transcripts [1,2]. Transcription of the human Cyc T1-encoding gene results in the synthesis of an 8-kb-long transcript in most tissues analysed. The gene appears to be ubiquitously expressed and is located on the human chromosome region 12 pter-qter. Structural analysis of the mRNA is still incomplete. So far, only its 5V-UTR and open reading frame (ORF) sequences have been reported in human, mouse and horse species [1– 4] (AF109179, AF190905). In the three abovementioned species, it only covers around 2 kb of the cDNA including only a few nucleotides of the 3VUTR. Nevertheless, by comparison with the human genome sequence, the partial structure of the corresponding gene has been determined and is reported in GenBank NT 009526. The sequence of the promoter region * Corresponding author. Tel.: +33-1-34-65-25-76; fax: +33-1-34-6524-78. E-mail address: [email protected] (X. Mata). has also been determined for the human and murine genes [6] and characterised both by sequence similarity analyses and transfections in various cell types [5,6]. We have recently reported the site-independent expression of a goat a-lactalbumin BAC transgene in mice [7] and the subsequent discovery within this BAC of the Cyc T1 gene and of its ubiquitous expression in transgenic mice [8]. To further define this chromosomal region, we have investigated the genomic structure of the Cyc T1 gene. In the present paper, we report the full-length cloning and sequence analysis of the goat Cyc T1 cDNA, the deduced intronic/exonic organisation of the corresponding gene and the determination of its proximal promoter sequence. Expression of this gene was studied by Northern blot analysis and RT-PCR in various goat tissues and was confirmed to be ubiquitous also in ruminants. The cloning of the full-length goat Cyc T1 cDNA has been realised in several steps. From a precedent work (Vilotte, unpublished data), involving partial sequencing of the goat alactalbumin BAC insert, two exons of the Cyc T1-encoding gene were identified (exons 7 and 8, Fig. 1B). This sequence information allowed us to define two primers (A and B in Table 1) that were used to perform a 5VRACE RT-PCR reaction using 6 Ag of total RNA from kidney and lactating mammary gland, following the manufacturer’s instructions (SuperScript First-Strand Synthesis System for RT-PCR, Invitrogen). With both RNA samples, a fragment of 830 bp was amplified, cloned and sequenced. It contained 72 bp of 0167-4781/02/$ - see front matter D 2002 Elsevier Science B.V. All rights reserved. PII: S 0 1 6 7 - 4 7 8 1 ( 0 2 ) 0 0 5 4 2 - 0 204 X. Mata et al. / Biochimica et Biophysica Acta 1579 (2002) 203–206 Fig. 1. Schematic representation of the Cyc T1 genomic DNA. (A) Representation of the BAC insert. Grey arrow: Cyc T1 gene. Black arrow: a-lactalbumin gene. Both loci are in the same orientation, as indicated by the arrows. (B) Structural organisation of the Cyc T1 transcription unit. Black boxes: coding exons. Grey boxes: 5Vand 3VUTR (72 and 4554 bp, respectively). Both representations are not at scale. Sizes of introns and exons are reported in Table 2. 5VUTR and part of the ORF of the cDNA. To ensure that the cloned 5VUTR was complete, this same experiment was repeated using primers located further upstream (C and D in Table 1). All amplified fragments ended at the same nucleotide (nt), strongly suggesting that a unique transcription start site is present within the goat Cyc T1 gene (data not shown). This is in contrast with the human gene which has several potential transcription initiation sites, at least as determined in vitro [5]. The abovementioned 830 bp partial cDNA was used as probe to screen a goat ovarian cDNA library, a kind gift from Dr. Pailhoux (INRA, France). 1.75 106 pfu of the unamplified oligo-dT primer library were plated and screened. Two positive clones were identified. Sequence analysis of their inserts revealed that both contained the whole ORF and 460 bp of 3VUTR. Analysis of the sequence upstream of the 3Vend of both cDNAs did not reveal the presence of a polyadenylation signal sequence. Screening of three other cDNA libraries from ovaries or testicular origins did not allow us to isolate cDNA clones with longer 3VUTR sequences (data not shown). Furthermore, Northern analysis performed on several goat tissues revealed a Cyc T1 mRNA size similar to the Table 1 Name and position of the different primers used during this work Primer name Position on the cDNA Orientation A B C D E F G H 747 – 765 728 – 746 290 – 310 263 – 284 6142 – 6161 6150 – 6169 6153 – 6173 6401 – 6523 Reverse Reverse Reverse Reverse Forward Forward Reverse Forward Positions refer to GenBank AF506739 sequence. human mRNA [1] (see below). Both observations strongly suggested that the cloned cDNAs resulted from retro-transcription events initiated within the mRNA sequence rather than at its 3V-end. Attempts to clone the missing 3VUTR by 3VRACE RTPCR were unsuccessful. In order to obtain sequence information on it, we aligned 15 kb of the human sequence (NT 009526) located downstream of the Cyc T1 stop codon with the human EST database. It revealed that the 5 kb downstream of the stop codon were homologous to a set of overlapping EST (data not shown). The most 3V-located of these ESTs (GenBank AI052701) contained a polyadenylation signal, suggesting that it derived from the 3V-end of a mRNA. Altogether, these observations were consistent with the hypothesis that the 3V-UTR of the Cyc T1 cDNA is comprised within a single exon encompassing the end of the ORF. Analysis of Ruminant ESTs revealed that a bovine EST (GenBank AW345578) was homologous to the human sequence 4 kb downstream of the Cyc T1 stop codon, thus within the putative 3VUTR of the gene. We used this EST to design three primers (E, F and G in Table 1) to clone the 3VUTR region by RT-PCR and 3V-RACE RT-PCR, using the corresponding kits from Invitrogen, 6 Ag of goat kidney total RNA and following the manufacturer’s instructions. The RTPCR experiment allowed us to clone the cDNA region located between the stop codon and nt 6191 in GenBank AF506739. The 3V-RACE RT-PCR strategy resulted in the isolation of a 280-bp-long fragment whose 3V-end corresponds to nt 6430 in GenBank AF506739. It was located upstream to the hypothesised 3Vend of the human gene transcription unit. Furthermore, no polyadenylation signal was observed within its sequence. It strongly suggested again that this amplified cDNA resulted from a retro-transcription event initiated within the mRNA sequence. To confirm it, a X. Mata et al. / Biochimica et Biophysica Acta 1579 (2002) 203–206 Table 2 Size of the goat and human exons as well as estimation of the goat and human introns length Size 1 2 3 4 5 6 7 8 9 Goat exons (bp) Human exons (bp) Goat introns (kbp) Human introns (kbp) 232 82 127 60 62 45 163 70 5937 169 – 412 81 128 60 62 45 163 70 ? 3.8 2 14 8.5 2.5 1.3 1.3 1 0.165 10 4.7 1.2 1.6 2 0.168 1.3 new 3VRACE RT-PCR experiment was performed using oligonucleotide H in Table 1. It resulted in the amplification of a 406-bp-long cDNA whose 3V-end was this time identical to that suspected in the human sequence. A polyadenylation signal was present 15 nt 5Vof the polyadenylation site. Sequence analysis confirmed the occurrence of an A-rich region around nt 6430, indirectly reinforcing our argument for an artefactual initiation of the reverse-transcription in the previous 3V-RACE RT-PCR experiment. The 72 bp 5VUTR of the cDNA has been well conserved during evolution as judged by the high homology observed between the horse, human, mouse and goat sequences, which is at least of 88%. This suggests that this region has a regulatory function, acting either at a co- or posttranscriptional level. The ORF of the Cyc T1 is also well conserved between these species with 92% identity with the horse, 90% with the human and 86% with the mouse sequences. All amino acid motifs described as having important functions are conserved in the goat species. This includes the cyclin box (aa 1 – 250), the potential Tat/TAR recognition motif (aa 254 –272), a putative coiled coil domain (aa 379– 430), the poly-HIS (aa 517 –528) and the poly-SER (aa 560 –570) [1] as well as a carboxy-terminal PEST sequence that are commonly found in G1 cyclins and serve to regulate protein turnover by cellular ubiquitination and proteolysis pathways [9]. Mouse cells do not sustain HIV replication. This observation was attributed to one amino acid polymorphism that differentiate the human and the murine Cyc T1 protein [2]. Occurrence of a tyrosine at position 261 in the mouse protein in place of a cysteine confers the loss of HIV-1 Tat transactivation. In the goat sequence, a tryptophane residue is found at this location. To our knowledge, no HIV-like illness has ever been described in goats. Whether or not this data is related to the occurrence of a tryptophane at position 261 of the Cyc T1 protein is an attractive hypothesis that remains to be tested. Notably, a very long 3VUTR (4.5 kbp) has been found in the goat Cyc T1 mRNA. The estimated sizes of the human Cyc T1 mRNAs suggest that this long 3VUTR is a common feature [1]. Interestingly, this long sequence was found to be well conserved between the human and the goat genes, with 81% homology, reinforcing the hypothesis that it has an important function. It was proposed that this region might 205 be involved in the post-transcriptional regulation of the expression of this gene [5], through a translational control mechanism. Several A/T-rich regions are present, and conserved in this 3VUTR. These regions are known to be the target of endonucleases that regulate the stability of a variety of transcripts. Thus, beside playing a potential role in the translational control of the mRNA, the 3VUTR may also be involved in stability control mechanisms. Alignment of the partial BAC sequence with the Cyc T1 cDNA was performed to determine the structure of the goat Cyc T1 gene transcription unit (Fig. 1B, GenBank AF506740-506747). Nine exons, whose sizes vary between 45 bp and 6 kbp, were identified (Fig. 1B and Table 2). When necessary, estimation of the length of the introns was performed using the long-range PCR kit (Expand Long Template PCR System, Roche) (Table 2). The complete transcription unit of the gene spans almost 42 kb and is thus longer than that observed in human. This results mostly from the longer sizes of introns 2 and 8. Differences in the size of introns are often associated with insertion and/or deletion of repetitive elements (not determined). All sequenced intron boundaries follow the ‘‘exon n-GT-intron n-AG-exon n + 1’’ rule [10,11]. Within the goat BAC, the Cyc T1 gene was found to be located 80 kb apart from the a-lactalbumin locus (Fig. 1A) [12]. The orientation of the transcription unit of the two genes was the same (Fig. 1A, data not shown). Alongside the exon 1, 2005 bp of the 5V-flanking region were cloned and sequenced. Functional analysis of the homologous human promoter in cell cultures suggested that its activity resides in the 545 nucleotides upstream to the translation initiation site [5,6]. Furthermore, several potential binding sites for transcription factors were identified within this proximal promoter and found to be conserved in the murine sequence [6]. Alignment of the goat sequence revealed that some of these putative regulatory elements are conserved in the three species, including consensus binding Fig. 2. Northern analysis of the goat Cyc T1 gene expression. Northern blot was performed with 1.4 Ag of goat mammary gland (MG) and 3.5 Ag of goat skeletal muscle (Mu) polyA+ RNA. Hybridisation with an oligolabelled Cyc T1 cDNA was performed as already described [5]. Sizes of the hybridising mRNAs were estimated using 18S and 28S distance of migration as well as a co-migrated 1-kb ladder size marker and are indicated. 206 X. Mata et al. / Biochimica et Biophysica Acta 1579 (2002) 203–206 sites for Sp1 and members of the Ets family of transcription factors (data not shown). This conservation across evolution could suggest that these sequences are indeed involved in the transcriptional regulation of the Cyc T1 promoter, although direct evidence of it is lacking. The homology between the 5V flanking region of this gene, with that of the two different species mentioned above, remains significant only over a short length of about 0.5 kb, indirectly reinforcing the hypothesis that this sequence contains most if not all of the promoter activities [5]. The human and murine Cyc T1 promoters are known to have an ubiquitous expression [6]. RT-PCR and Northern analyses were performed using goat RNA from five different tissues: mammary gland, testis, ovaries, kidney and skeletal muscle. RT-PCR showed that the goat gene is also ubiquitously expressed (data not shown), confirming the expression pattern obtained with this gene in transgenic mice [8]. The low expression level of this gene complicated its analysis by Northern blotting. Nevertheless, it revealed that the transcript that was detected in all tissues was of approximately 7 kb in length, a result concordant with the size of the cDNA (Fig. 2). Surprisingly, a shorter transcript was also detected in the skeletal muscle (Fig. 2). This transcript was of 1.5 –2 kb in length. Origin and biological significance of this mRNA remain unknown. The estimated size of this transcript is such that it cannot encompass the entire Cyc T1 ORF and is too short to correspond to the ‘‘partial’’ cDNAs cloned from the abovementioned libraries. Southern blotting hybridisation of PstI or EcoRI digested goat genomic DNA suggested that the Cyc T1 gene is a single copy gene (data not shown), consistent with the single FISH signal observed with this gene [7]. Thus, this transcript must derive from unusual splicing and/or termination events of the Cyc T1 gene. A shorter Cyc T1 transcript has been observed in some human tissues, the origin of which was not described [1]. However, its estimated size of 3 kb is much longer that the one determined for the short transcript present in the goat skeletal muscle. The structure of the goat Cyc T1 promoter is consistent with its pattern of expression. This promoter lacks TATA and CAAT boxes and contains several potential ubiquitous transcription factor binding sites located close to the transcriptional start site (like Sp1, Ets family, AP1. . .). It is also characterised by a relatively high G/C content (55 – 60%). These characteristics of housekeeping gene promoters are also shared by its human and murine counterparts and by several cycle-regulated promoter genes [13]. Ubiquitous promoters are potentially interesting for building gene therapy vectors. The previously described ubiquitous expression of the goat Cyc T1 gene in transgenic mice [8] and the confirmation of the potentially compact organisation of its promoter, as suggested by the currently reported sequence analysis, make it a good candidate among other house keeping genes. Its ability to target expression of reporter genes in currently tested in mice. Acknowledgements We are most grateful to Dr. Pailhoux (INRA) for the kind gift of cDNA libraries and goat RNA samples, to Dr. Persuy (INRA) for providing mammary gland mRNA and to Dr. Le Provost (INRA) for critical reading of this manuscript and mRNA samples. References [1] P. Wei, M.E. Garber, S.-M. Fang, W.H. Fischer, K.A. Jones, A novel CDK 9-associated C-type interacts directly with HIV61 Tat and mediates its high-affinity, loop-specific binding to TAR RNA, Cell 92 (1998) 451 – 462. [2] P.D. Bieniasz, T.A. Grdina, H.P. Bogerd, B.R. Cullen, Recruitment of a protein complex containing Tat and cyclin T1 to TAR governs the species specificity of HIV-1 Tat, EMBO J. 17 (1998) 7056 – 7065. [3] M.E. Garber, P. Wei, V.N. KewalRamani, T.P. Mayall, C.H. Herrmann, A.P. Rice, D.R. Littman, K.A. Jones, The interaction between HIV-1 Tat and human cyclin T1 requires zinc and a critical cysteine residue that is not conserved in the murine CycT1 protein, Genes Dev. 12 (1998) 3512 – 3527. [4] J. Peng, Y. Zhu, J.T. Milton, D.H. Price, Identification of multiple cyclin subunits of human P-TEFb, Genes Dev. 12 (1998) 755 – 762. [5] H. Liu, A. Rice, Isolation and characterisation of the human cyclin T1 promoter, Gene 252 (2000) 39 – 49. [6] J. Martin-Serrano, K. Li, P.D. Bieniasz, Cyclin T1 expression is mediated by a complex and constitutively active promoter and does not limit human immunodeficiency virus type 1 Tat function in unstimilated primary lymphocytes, J. Virol. (2002 Jan) 208 – 219. [7] M.-G. Stinnakre, S. Solange, L. Schibler, L. Lepourry, J.-C. Mercier, J.-L. Vilotte, Position-independent and copy-number-related expression of a goat bacterial artificial chromosome a-lactalbumin gene in transgenic mice, Biochem. J. 339 (1999) 33 – 36. [8] X. Mata, J.-L. Vilotte, Ubiquitous expression of goat cyclin T1 in transgenic mice, Transgenic Res. 11 (2002) 65 – 68. [9] M. Rechsteiner, S. Rogers, PEST sequences and regulation by proteolysis, Trends Biochem. Sci. 21 (1996) 267 – 271. [10] M.R. Green, Pre-mRNA splicing, Annu. Rev. Genet. 20 (1986) 671 – 708. [11] R. Reed, The organisation of 3Vsplicing site sequences in mammalian introns, Genes Dev. 3 (1989) 2113 – 2123. [12] S. Soulier, M.-G. Stinnakre, J. Costa Da Silva, L. Lepourry, X. Mata, N. Besnard, J.-L. Vilotte, Distal element(s) is(are) required for position-independent expression of the goat a-lactalbumin gene in transgenic mice. Potential relationship with the location of the cyclin T1 locus, Genet. Sel. Evol. 32 (2000) 621 – 630. [13] K. Kröner, L.A. Wolfraim, F.C. Lucibello, R. Müller, Characterisation of the TATA-less core promoter of the cell cycle-regulated cdc25C gene, Nucleic Acids Res. 25 (1997) 4921 – 4925.
© Copyright 2026 Paperzz