3′-Box-dependent processing of human pre-U1

Published online June 1, 2004
Nucleic Acids Research, 2004, Vol. 32, No. 10 2987±2994
DOI: 10.1093/nar/gkh619
3¢-Box-dependent processing of human pre-U1
snRNA requires a combination of RNA and protein
co-factors
Patricia Uguen and Shona Murphy*
Sir William Dunn School of Pathology, University of Oxford, South Parks Road, Oxford OX1 3RE, UK
Received February 26, 2004; Revised and Accepted May 4, 2004
ABSTRACT
Using an in vitro system we have recently shown
that the 3¢ ends of human pre-snRNAs synthesized
by RNA polymerase II are produced by RNA
processing directed by the snRNA gene-speci®c 3¢
box. Towards a complete characterization of this
processing reaction we have further investigated
the in vitro requirements for proper 3¢ end formation
of pre-U1 snRNA. Here we show that the 5¢ cap
plays a stimulatory role and processing requires
creatine phosphate. Our results also indicate that
the pre-U1 processing activity is heat sensitive and
that an RNA component is required. In addition, the
exact sequence adjacent to the 3¢ box in¯uences the
position of the pre-U1 3¢ end produced in vitro.
Interestingly, the processing extract active for 3¢box-dependent processing also contains an activity
that converts the 3¢ end of RNA containing the U1
Sm protein binding site and the 3¢ terminal stem±
loop into the mature form.
INTRODUCTION
The majority of vertebrate U snRNA genes (e.g. U1±U5) are
transcribed by RNA polymerase II (pol II) to yield short nonpolyadenylated 3¢-elongated pre-snRNAs. The 3¢ ends of these
RNAs are produced by a processing reaction dependent on the
13±16 nucleotide (nt) cis-acting 3¢ box element, located 9±19
nt downstream of the 3¢ end of the RNA-encoding region (1).
Most of the 3¢ extension is removed by further processing in
the cytoplasm before nuclear re-import (2,3). A compatible
snRNA promoter is also required for proper 3¢ end formation
of pre-snRNAs (4,5), highlighting the tight link between
transcription and the function of the 3¢ box. Relevant to this we
have recently demonstrated that the C-terminal domain (CTD)
of pol II is required for 3¢-box-dependent RNA processing
in vivo and that processing in vitro is activated by phosphoCTD (1,6).
3¢ End formation of pre-snRNAs is likely to involve
divalent-cation-dependent endonucleolytic activity, in common with 3¢ end formation of vertebrate pre-mRNAs or yeast
pre-snRNAs (1,7). However, neither subunits of the cleavage±
polyadenylation complex factors cleavage/polyadenylationspeci®city factor (CPSF), cleavage-stimulation factor (CstF)
and cleavage factor I (CFI), nor the human homologues of
yeast pre-snRNA processing factors hRNase III and PMScl100 appear to co-purify with the human pre-snRNA 3¢
processing activity (1). This suggests that there is little overlap
between known RNA 3¢ end processing factors and those used
for 3¢ end formation of human pre-snRNAs. Two cis-acting
elements, the U2 3¢ box and the upstream sequence element
(USE) act together to ef®ciently direct accurate pre-U2
snRNA 3¢ end formation both in vitro and in vivo (1), whereas
3¢ end formation of pre-U1 snRNA appears to depend largely
on the 3¢ box. Thus, there may also be differences in the cisacting sequences and trans-acting factors involved in 3¢ end
formation of different pre-snRNAs.
Further investigation of the biochemical requirements of
pre-U1 snRNA 3¢ end formation indicates that, in addition to
the phospho-CTD of pol II, creatine phosphate and a 5¢ cap
activate processing. We also demonstrate that 3¢ processing
requires an RNA component and that the activity is heat labile.
Thus, 3¢-box-dependent processing has some characteristics in
common with both AAUAAA-dependent processing (8) and
3¢ end formation of the replication-activated histone mRNAs
(9) as well as signi®cant differences. Furthermore, we show
that the ef®ciency of in vitro pre-U1 snRNA 3¢ end formation
is not affected by additional cis elements although the exact
sequence around the cleavage site in¯uences the site of the
®nal 3¢ end formed in vitro. The S100 fraction active for 3¢
end formation of pre-U1 snRNA is also active for 3¢ end
maturation and we show that these processes can occur
independently in vitro.
MATERIALS AND METHODS
DNA constructs
In 10end U1 3¢ box, 17end U1 3¢ box and 59end U1 3¢ box
templates, the 10, 17 and 59 nt, respectively, located upstream
of the 3¢ box of U1 3¢ box template described by Uguen and
Murphy (1) were replaced by the 10, 17 and 59 nt of the
endogenous U1 snRNA gene (GenBank accession number
J00318). The U1 3¢ box sequence, GTTTCAAAAGTAGA, is
deleted from the 59end D3¢ box template. DNA constructs
*To whom correspondence should be addressed. Tel: +44 1865 275616; Fax: +44 1865 275556; Email: [email protected]
Correspondence may also be addressed to Patricia Uguen at present address: Signalisation, DeÂveloppement et Cancer, BaÃtiment 442 bis, Universite Paris XI,
91405 Orsay ceÂdex, France. Tel: +33 1 69 15 65 93; Fax: +33 1 69 15 68 02; Email: [email protected]
Nucleic Acids Research, Vol. 32 No. 10 ã Oxford University Press 2004; all rights reserved
2988
Nucleic Acids Research, 2004, Vol. 32, No. 10
used to produce synthetic RNA were obtained by deleting the
U1 promoter region by EcoRI/BglII digestion as described
previously (1).
Ef®cient U1 3¢-box-dependent processing requires
creatine phosphate as a co-factor and is ATP
independent
S100 fraction preparation
Since creatine phosphate (CP) (usually used to transfer Pi to
ADP, creating ATP) improves the ef®ciency of some mRNA
processing steps (11), we tested its effect on U1 3¢-boxdependent processing (Fig. 1). Little processing (1%) is
detected in the absence of added CP after incubation for 2.5 h
at 30°C (Fig. 1A, lanes 2 and 4), whereas 7% of the input RNA
is processed in the presence of 20 mM CP (Fig. 1A, lanes 3 and
5). Processing ef®ciency is only slightly improved by
increasing the concentration of CP further (8% of input
RNA is processed in the presence of 40 or 60 mM CP; Fig. 1A,
lanes 6 and 7). Furthermore, nucleoside triphosphates,
including ATP, do not support U1 3¢-box-dependent processing in the absence of CP (Fig. 1A, lane 4, and B, lanes 3±6),
indicating that CP does not act simply to regenerate the energy
source. Serine phosphate (SP) can also activate processing
although not as effectively as CP (Fig. 1C, lanes 3±5)
suggesting that the effect of CP is not entirely due to the
presence of phosphate groups.
It has been proposed that CP activates pre-mRNA 3¢
cleavage by mimicking the effect of the pol II CTD rather than
affecting the reservoir of energy (11,12) and may facilitate a
conformational change of some factors. In support of this
notion, the phosphorylated form of the CTD of pol II activates
3¢-box-dependent processing in vitro better than the unphosphorylated form (1). However, no processing is detected in the
absence of CP when phospho-CTD is added (Fig. 1D). This
indicates that CP does not simply substitute for phospho-CTD
in our system.
To eliminate the ATP present in the S100 fraction, we
fractionated the processing activity by binding to heparin±
sepharose at 100 mM KCl and eluting by 1 M KCl. Although
this fractionation procedure should markedly reduce the level
of ATP in the active fraction it does not result in a requirement
for added ATP (Fig. 1E, lanes 2 and 6). Furthermore, addition
of both glucose and hexokinase (Fig. 1E, lane 5), which
together deplete the available pool of ATP (11), has no greater
effect than the addition of glucose or hexokinase alone
(Fig. 1E, lanes 3 and 4). These results indicate that 3¢-boxdependent processing is ATP independent. Interestingly,
AAUAAA-dependent 3¢ cleavage is also CP dependent and
ATP independent in vitro (11).
HeLa cell S100 extract was prepared using a slight modi®cation of the procedure of Shapiro et al. (10). After cell lysis in
buffer A (10 mM HEPES pH 7.9, 0.75 mM spermidine,
0.15 mM spermine, 10 mM KCl, 0.5 mM DTT) sucrose was
added to a ®nal concentration of 10% before spinning.
In vitro RNA synthesis and in vitro processing
Synthetic, capped RNA was produced as described previously
(1). To produce uncapped RNA, m7G(5¢)ppp(5¢)G was
omitted from the reaction. Processing was carried out as
described in (1) for 2.5 h at 30°C in the presence of 3 mM
MnCl2 unless stated otherwise in the ®gure legends. To reduce
the level of endogenous ATP, the S100 fraction was bound to
heparin±sepharose at 100 mM KCl and eluted by 1 M KCl.
This fraction was then incubated for 15 min at 30°C with 3 mM
MgCl2, 2 mM glucose and 0.6 U of hexokinase (Boehringer
Mannheim) to deplete any residual ATP.
S1 mapping analysis
After incubation with S100, RNA was extracted, annealed to
the S1 probe, digested and analysed on a 6% polyacrylamide±
8 M urea gel as described previously (1). The S1 probes used
for each construct (U1 3¢ box, 10end U1 3¢ box, 17end U1 3¢
box, 59end U1 3¢ box and 59end D3¢ box) were prepared as
described previously (1).
Western blotting
Western blotting was carried out as described by Medlin et al.
(6). Anti-stem±loop-binding protein (SLBP) antibody was
kindly provided by Berndt Mueller (University of Aberdeen)
and used diluted at 1/100.
RESULTS
Improving U1 3¢-box-dependent in vitro processing
We recently described an in vitro system where 3¢-boxdependent processing of substrate RNAs was detected for the
®rst time (1). However, long incubation times (>5 h) are
required to detect any accurate 3¢ end processing directed by
the U1 3¢ box in unpuri®ed S100. We therefore undertook to
improve the ef®ciency of the processing activity by modifying
the extract preparation procedure (see Materials and Methods)
as a prelude to characterization of the requirements for 3¢
processing. Modi®cation of the extract preparation method
described by Shapiro et al. (10) led to the production of S100
that yields up to 8% properly processed RNA after 2.5 h
incubation rather than the 13 h needed with S100 produced by
our previously described protocol (1). The increased sucrose
concentration used in production of this extract (see Materials
and Methods) may reduce the release of nucleases from the
nuclei upon spinning and consequently reduce turnover of
substrate RNA, which proved to be a major problem for in vitro
processing (1). Relevant to this, the amount of pol II and
factors involved in mRNA 3¢ end formation is reduced in this
S100 fraction (data not shown and see Fig. 4B).
U1 3¢-box-dependent processing is activated by a
7-methyl-G cap
To examine whether 5¢ cap binding proteins activate 3¢-boxdependent RNA processing in addition to splicing, 3¢
processing, transport and translation of mRNA (13), we
compared the ef®ciency of 3¢ processing of capped and
uncapped U1 3¢ box RNA. In the absence of a 5¢ cap structure,
U1 3¢-box-dependent processing is slower (Fig. 2A), suggesting that either the 5¢ cap itself, or more likely, cap binding
proteins present in the S100, activate processing. The decrease
of proper 3¢ end formation of uncapped RNA is not due simply
to RNA instability since input RNA is still detected after 3 h
incubation (Fig. 2A). Addition of 7-methyl-G cap analogue
[m7G(5¢)ppp(5¢)G] to the in vitro reaction inhibits processing
and the highest concentration of cap analogue tested
Nucleic Acids Research, 2004, Vol. 32, No. 10
2989
Figure 1. Ef®cient U1 3¢-box-dependent processing requires CP as a co-factor and is ATP independent. (A) U1 3¢ box RNA was incubated with S100 in the
presence of ATP (1 mM) and different concentrations of CP as indicated. The percentage of input RNA processed (% proc.) is indicated below this and
subsequent gels. The scheme represents the position and size of the S1 probe, input (IP) and processed (proc.) RNA. The diagram on the left side of this and
subsequent ®gures indicates the position of the U1 3¢ box in the input RNA. (B) U1 3¢ box RNA was incubated with S100 in the presence of 20 mM CP or
1 mM of the nucleosides triphosphates indicated. (C) U1 3¢ box RNA was incubated with S100 in the presence of 20 mM CP or different concentrations of
SP as indicated. (D) U1 3¢ box RNA was incubated with S100 in the presence or absence of 20 mM CP and in the presence of phospho-CTD as indicated.
(E) U1 3¢ box RNA was incubated with S100 fractionated on heparin±sepharose in the presence of 20 mM CP, and with or without ATP, hexokinase and
glucose. The processing reaction was carried out in the presence of 2 mM MgCl2.
2990
Nucleic Acids Research, 2004, Vol. 32, No. 10
Figure 2. U1 3¢-box-dependent processing is activated by a 7-methyl-G cap. (A) Capped (empty square) or uncapped (®lled diamond) U1 3¢ box RNA was
incubated with S100 for 0±3 h. The graph below represents the percentage relative processing activity calculated by comparison with the amount of
processing obtained after incubating capped RNA for 3 h. (B) Capped U1 3¢ box RNA was incubated with S100 in the presence of different concentrations of
unmethylated cap analogue [G(5¢)ppp(5¢)G; Epicenter] (lanes 3±7) or methylated cap analogue [m7G(5¢)ppp(5¢)G; Amersham Pharmacia] (lanes 8±12) as
indicated. The graph below represents the average percentage relative processing activity obtained in the presence of increasing concentrations of
unmethylated (®lled triangle) or methylated (empty square) cap analogue in two independent experiments.
effectively abolished processing (Fig. 2B, lanes 8±12). This
suggests that the cap analogue can sequester cap binding
proteins and interacting 3¢ processing factors. Consistent with
this, the effect of the cap structure of the RNA on processing
ef®ciency is lost after partial puri®cation of the 3¢ processing
activity as described by Uguen and Murphy (1) (data not
shown), indicating that the 5¢ cap per se is not suf®cient.
Moreover, addition of unmethylated cap analogue
G(5¢)ppp(5¢)G to the reaction has little effect on processing
(Fig. 2B, lanes 3±7). Taken together, these results suggest that
the nuclear cap binding complex (14) may be involved in the
recruitment of pre-snRNA 3¢ processing factor(s) in vivo.
U1 3¢-box-dependent processing requires an RNA
co-factor
Small non-coding RNAs function as co-factors in both
splicing and 3¢ processing of pre-mRNAs (8,9). To determine
whether 3¢ processing of U1 3¢ box RNA substrate is
dependent on a trans-acting nucleic acid component, the
S100 was treated with micrococcal nuclease (MN) before
incubation with U1 3¢ box RNA (Fig. 3). U1 3¢-box-dependent
processing is completely abolished by this treatment (lane 3).
However, when EGTA, which chelates the Ca2+ ions essential
for this nuclease to function, is added to the S100 before MN,
no reduction in processing is observed (lane 4). This suggests
the involvement of an RNA or DNA co-factor in U1 3¢-boxdependent processing. To distinguish between these two
possibilities, we treated the S100 with pancreatic DNase I
before incubation with U1 3¢ box RNA (lane 8). This has no
effect on 3¢ processing, indicating that the nucleic acid cofactor is composed of RNA. In order to test whether this RNA
component is the sole catalytic component of the 3¢ processing
activity, we extracted ribonucleic acid from the S100 by acidic
phenol/chloroform treatment and incubated it with U1 3¢ box
RNA. This fraction does not support 3¢ processing on its own
(lane 5). In addition, 3¢ processing is not recovered when the
MN-treated S100 is complemented with this RNA fraction
(lane 6). These data suggest that U1 3¢-box-dependent
processing involves both RNA and protein factor(s) and
leads us to propose that processing is carried out by an RNA±
protein complex.
U7 snRNA is required for 3¢ end formation of the mRNAs
for replication-activated histones and has a region of
complementarity to the substrate RNA (9). Since mammalian
U7 snRNAs contain no conserved sequences complementary
to 3¢ box sequences it is unlikely that U7 snRNA participates
in 3¢ end formation of U1 snRNA. Unfortunately, experiments
to test the effect of depletion of U7 snRNA and other
U snRNAs from the processing extract using antibodies to
the trimethylguanosine cap present at the 5¢ end were
inconclusive.
U1 3¢-box-dependent processing activity is heat labile
To analyse the sensitivity of the 3¢-box-dependent processing
activity to heat, the S100 fraction was heated for 10 min at
temperatures from 30 to 55°C before incubation with U1 3¢
Nucleic Acids Research, 2004, Vol. 32, No. 10
Figure 3. U1 3¢-box-dependent processing requires an RNA co-factor. U1
3¢ box RNA was incubated with S100 in the presence of 2 mM of MgCl2
before or after pretreatment with MN or DNase I and/or complemented with
the extracted RNA. MN (lane 3), S100 pretreated for 15 min at 25°C with
MN (5 U/ml of S100; Roche) in the presence of 1 mM CaCl2. The reaction
was stopped by adding 2 mM EGTA. MN in. (lane 4), S100 pretreated for
15 min at 25°C with MN inactivated by the addition of 2 mM EGTA; RNA
extracted (lane 5), S100 extracted by acidic phenol/chloroform treatment
and ethanol precipitated before resuspension in buffer D (15); RNA
extracted + MN (lane 6), S100 pretreated for 15 min at 25°C with MN in
the presence of 1 mM CaCl2 and complemented by the extracted RNA.
RNA extracted + MN in. (lane 7), S100 pretreated for 15 min at 25°C with
MN inactivated by the presence of 2 mM EGTA and complemented by the
extracted RNA; DNase I (lane 8), S100 pretreated for 15 min at 30°C with
pancreatic DNase I (10 U/ml of S100).
box RNA (Fig. 4). The ef®ciency of 3¢ processing is slightly
reduced at 45°C (inhibition of 33%; Fig. 4A, lane 4).
However, 3¢ processing is inhibited by 83% after preheating
the S100 fraction at 50°C (lane 5) and is lost completely after
treatment at 55°C (lane 6), suggesting that U1 3¢-boxdependent processing involves heat-sensitive factor(s).
3¢ Processing of replication-activated histone mRNAs is
similarly heat labile (16), raising the possibility of shared
factors with U1 3¢-box-dependent processing. A wellcharacterized SLBP is required for histone 3¢ end formation
(9) and we have used an anti-SLBP antibody to analyse our
processing extract by western blot (Fig. 4B). SLBP is not
readily detected in the S100 used for the in vitro processing
reactions shown in Figure 4A (Fig. 4B, lane 1) although it is
relatively abundant in our previously described S100 preparation (1) (Fig. 4B, lane 2). In addition, no SLBP is detected in
the partially puri®ed 0.4 M KCl SP fraction active for presnRNA 3¢ end formation (1) (Fig. 4C, lane 6). These results
suggest that SLBP is not involved in 3¢-box-dependent
processing.
Sequences upstream from the U1 3¢ box can in¯uence 3¢
processing
We have shown that accurate pre-U2 snRNA 3¢ end formation
requires two cis-acting elements, USE and U2, acting
synergically. Although the U1 3¢ box is suf®cient to direct
accurate formation of RNA 3¢ ends in vitro and in vivo
(1,4,17), the last 5±6 nt of the mature 3¢ end of U1 snRNA can
affect the in vivo accuracy of pre-U1 snRNA 3¢ end formation
(17). We therefore analysed the in¯uence of the natural
2991
Figure 4. U1 3¢-box-dependent processing activity is heat labile. (A) U1 3¢
box RNA was incubated with S100 preheated for 10 min at the indicated
temperature (temp.). The graph represents the percentage processing compared with the level obtained when S100 is preheated at 30°C. (B) Western
blot with SLBP antibody on the S100 used for processing in Figures 1±5
(lane 1) and on the S100 described by Uguen and Murphy (1), which is less
active for 3¢ processing (lane 2). SLBP is 45 kDa. (C) Western blot with
anti-SLBP antibody on the different fractions resulting from puri®cation of
the processing activity on different sepharose resins as described by Uguen
and Murphy (1). B-heparin and B-Q represent proteins bound to heparin±
and Q-sepharose, respectively; UB-SP represents proteins that did not bind
to SP-sepharose.
sequences preceding the U1 3¢ box on the ef®ciency and
accuracy of 3¢ processing in vitro. When RNA containing the
10 nt just upstream of U1 3¢ box (10end U1 3¢ box RNA) is
incubated with S100, no properly processed RNA is detected
(Fig. 5A, lanes 6±10, and C), even after 3 h incubation. This
may be due to instability of processed RNA rather than to a
complete loss of processing. When the 17 nt upstream of the
U1 3¢ box are present (17end U1 3¢ box), processing is
detected (Fig. 5A, lanes 11±15, and C) although the ef®ciency
is reduced by 45% with respect to U1 3¢ box RNA (Fig. 5A,
lanes 1±5, and C). A small amount of the 3¢ end processed
RNA maps to the same nucleotides seen with the U1 3¢ box
RNA but most map a few nucleotides upstream, corresponding
to the most highly represented 3¢ ends detected in vivo, which
end 6±8 nt upstream of the 3¢ box (17,18) (Fig. 5A, compare
lanes 1±5 and 11±15, and C). These results may indicate that
the addition of 17 nt of the natural U1 sequence in¯uences the
position of 3¢ endonucleolytic cleavage. Alternatively, 3¢
exonucleolytic trimming of the RNA occurs after cleavage
3±5 nt upstream from the 3¢ box.
We have also tested the effect of placing a longer
endogenous sequence (59 nt) upstream of the U1 3¢ box.
This substrate RNA contains both the Sm protein binding site
and the 3¢ terminal stem±loop, which are not required in vivo
for pre-U1 snRNA 3¢ end formation but are required for
ef®cient nuclear export (19). When 59end U1 3¢ box RNA
(Fig. 5B, lanes 3 and 4) is incubated with S100, RNA with the
mature 3¢ end mapping 10±12 nt upstream of the 3¢ box (17) is
produced. This indicates that the activities required for
2992
Nucleic Acids Research, 2004, Vol. 32, No. 10
Figure 5. Sequences upstream from the U1 3¢ box can in¯uence 3¢ processing. (A) U1 3¢ box, 10end U1 3¢ box and 17end U1 3¢ box RNA substrates were
incubated with S100 for 0±3 h. The scheme below indicates the position and size of the S1 probes, input (IP) and processed (proc.) RNA. (B) U1 3¢ box,
59end U1 3¢ box and 59end D3¢ box RNA substrates were incubated with S100. The scheme below indicates the position and size of the S1 probes, input
(IP), processed (proc.) and mature RNA. (C) The sequences of the 3¢ ends of the substrate RNAs in (A) and (B) are shown with the position of in vivo and
in vitro processing sites and mature ends. The endogenous U1 snRNA sequence is highlighted in grey. The U1 3¢ box is boxed. The vertical line indicates the
boundary between the end of the mature snRNA and the ®rst nucleotide of the non-coding RNA. The bold nucleotides represent the processing sites observed
in vivo (17,18). The arrows above the sequence indicate the pre-snRNA processing sites observed in vitro. The mature ends observed in vitro are indicated by
un®lled arrows above the sequence.
production of mature U1 3¢ ends (20) are present in the S100,
in addition to factors required for pre-U1 3¢-box-dependent
processing. Some accurately processed pre-U1 RNA 3¢ ends
are still detected and these map to the same sites as are
observed when only 17 nt of natural U1 sequence are present
upstream of the 3¢ box (17end U1 3¢ box RNA; Fig. 5A, lanes
11±15, and C). This may suggest that processing proceeds in a
stepwise manner as it does in vivo and the precise pre-snRNA
3¢ ends are required for the next step, which matures the 3¢ end
of the RNA, as has been shown for 3¢ end formation of U2
snRNA (21). 3¢ Ends mapping 6±8 nt upstream from the 3¢ box
are no longer detected when the 3¢ box is removed (Fig. 5B,
Nucleic Acids Research, 2004, Vol. 32, No. 10
lanes 5 and 6, and C). However, production of the mature U1
3¢ ends is unaffected (Fig. 5B, lane 6), indicating that these
two processing events occur independently in our in vitro
system. This suggests that any RNAs containing the Sm
protein binding site and the 3¢ terminal stem±loop can be 3¢
end matured, presumably by exonucleolytic trimming. Since
the 59end D3¢box RNA has 27±29 nt downstream from the
sites of the mature 3¢ ends, these results also indicate that
RNAs longer than pre-U1 at the 3¢ end are readily 3¢ matured
in vitro. In contrast, only pre-U1 snRNAs with a short 3¢
extension (no more than 10 nt) are further processed after
injection into Xenopus oocytes (20). This may highlight
differences between species in the 3¢ end maturation of
snRNAs. Alternatively, the exact sequence of the 3¢ extension
may in¯uence trimming and/or some in vivo constraints may
be lost in our in vitro system.
DISCUSSION
The ®rst step of expression of the vertebrate U1 snRNA genes
involves the production of a 3¢ extended pre-U1 snRNA,
which occurs by a co-transcriptional 3¢-box-dependent mechanism (4,5). Proper 3¢ end formation of pre-U1 snRNA is
important for the stability of the snRNA in the nucleus, in
addition to nucleocytoplasmic export (19,20). The exact 3¢
ends of U1 snRNA precursors can also in¯uence the
cytoplasmic 3¢ trimming of the RNA required for the nuclear
translocation of U1 snRNPs and subsequent maturation steps
(20). Towards the dissection of the molecular mechanism of 3¢
end formation of human pre-U1 snRNA we have determined
several of the co-factor requirements of U1 3¢-box-dependent
in vitro processing. We have also investigated the in¯uence of
sequences upstream of the U1 3¢ box on 3¢ end formation. The
results of these studies highlight similarities and differences
between 3¢ end formation of pre-snRNAs and other vertebrate
pol II transcripts. In addition, we show that 3¢ extended RNAs
containing the U1 Sm protein binding site and the 3¢ terminal
stem±loop can be processed to the mature U1 3¢ end in vitro.
Comparing U1 3¢-box-dependent processing and
formation of mRNA 3¢ ends
Some co-factor requirements for 3¢-box-dependent processing
are shared with AAUAAA-dependent processing. Both processing events require divalent cations and CP, are ATP
independent and are activated by 5¢ cap binding proteins
(8,11,13) (Fig. 6). In addition, both require the CTD of pol II
in vivo (6,13) and are activated by phospho-CTD in vitro
(1,12) (Fig. 6). So far there is no indication that any subunits of
CPSF, CstF and CFI factors required for AAUAAA-dependent processing participate in pre-snRNA 3¢ end formation (1).
However, it has been shown that CFI participates in 3¢
processing of some yeast snoRNAs and snRNAs (22).
There are also some similarities between 3¢ end formation
of U1 snRNA and replication-activated histone mRNAs. In
both cases a stem±loop structure is present just upstream of the
mature 3¢ end, the processing activities are heat labile and a
trans-acting RNA component is required (9). Our results
therefore point to mechanistic similarities in 3¢ end formation
of non-polyadenylated histone mRNA and snRNA. However,
our data indicate that the SLBP is not involved in U1 snRNA
3¢ end formation. In addition, the lack of complementarity
2993
Figure 6. Requirements for U1 3¢-box-dependent RNA processing. 3¢-Boxdependent 3¢ end formation of pre-U1 snRNA requires an RNA±protein
complex, CP and divalent cations (div. ion). The reaction is also activated
by 5¢ cap binding proteins (diamond) and the CTD of pol II. In addition, the
processing activity is abolished by relatively mild heat treatment, which
suggests the involvement of a heat-labile factor (HLF).
between U7 snRNA and 3¢ box sequences makes it likely that
a distinct RNA is required.
Divalent cations are also necessary for ef®cient activity of
the yeast RNase III homologues, Pac1p and Rnt1p (23,24)
which are involved in yeast snRNA 3¢ end formation (25).
However, human RNase III does not co-purify with the 3¢box-dependent processing activity and the cis-acting elements
do not correspond to RNase III recognition sequences (1).
Divalent cations may be required for the function of a 3¢-boxspeci®c processing factor or play a more direct catalytic role.
The currently known requirements for 3¢ end processing of
pre-U1 snRNA are summarized in Figure 6. Taken together
our results indicate that there is little overlap in the factors
required for 3¢-box-dependent processing and mRNA 3¢ end
formation apart from the shared co-factors noted above. This is
consistent with the differences in cis-acting sequences and the
differential promoter dependence of these signals in vivo (4,5).
Sequences immediately upstream of the U1 3¢ box
in¯uence 3¢ end formation
The ef®cient production of accurately 3¢ processed pre-U2
snRNA requires two distinct sequences: the U2 3¢ box and the
USE, which contains a `minihelix' stem±loop structure (1).
The U2 3¢ box alone directs formation of RNA a few
nucleotides longer than cellular pre-U2 snRNA while RNA
containing the USE is processed to the authentic pre-U2
snRNA 3¢ end. In addition, the amount of processed product
detected in vitro is increased when both the U2 3¢ box and the
USE are present. The minihelix may therefore in¯uence
processing by interacting with a processing factor and/or
prevent RNA cleaved by a 3¢-box-dependent endonuclease
from being rapidly turned over by exonucleases (1). Addition
of 17 nt of the natural U1 gene-encoded sequence upstream of
the U1 3¢ box also causes an apparent shift in the position of
the ®nal 3¢ ends, some of which now map to the most abundant
pre-U1 3¢ ends detected in cellular RNA (17,18). The
sequences directly upstream from the U1 3¢ box may therefore
play a similar role to the U2 USE to determine the ®nal
position of the pre-U1 3¢ end. In both cases the site of
endonucleolytic cleavage may be in¯uenced by the exact
sequence of the target RNA or changes in processing±factor
RNA interaction. Alternatively, the natural sequences may be
preferential substrates for limited trimming by exonucleases
following 3¢ box-directed cleavage. However, unlike the U2
USE, the 17 nt upstream from the U1 3¢ box do not cause an
2994
Nucleic Acids Research, 2004, Vol. 32, No. 10
increase in the amount of processed RNA detected but rather a
decrease. In addition, unlike the U2 USE (1), the 59 nt of U1
sequence upstream of the 3¢ box do not direct formation of preU1 3¢ ends, suggesting functional differences. In agreement
with this, little authentic pre-U1 RNA is produced from
transfected U1 templates lacking the 3¢ box (17).
A precise pre-U1 3¢ end is not required for RNA
maturation in vitro
Although pre-U2 snRNA can be further 3¢ processed in vitro
(2,21), maturation of the 3¢ ends of pre-U1 snRNA in vitro has
not previously been reported. However, in our in vitro system
the inclusion of U1 snRNA sequences containing the Sm
protein binding site and the 3¢ terminal stem±loop (19) in the
RNA substrate results in the production of properly processed
mature U1 3¢ ends. Interaction of Sm protein with its binding
site is required for the cytoplasmic 5¢ cap modi®cation of U1
snRNA, which is necessary for nuclear re-import (26) and it
also plays a role in maturation of the 3¢ ends of yeast U1
snRNA (27, reviewed in 3,28). Surprisingly, in vitro 3¢ end
maturation occurs independently of 3¢-box-dependent processing, suggesting that no intermediate processing step is
required even though the substrate U1 RNAs have relatively
long 3¢ extensions (27±29 nt). Instead, 3¢ end extensions longer
than 10 nt are not processed to the mature form when human
pre-U1 RNAs are injected into Xenopus oocytes (20). In
addition, specialized sequences distinct from the Sm protein
binding site are required for cytoplasmic trimming of the 3¢
end of human U2 snRNA (29), and pre-U2 RNA with the
natural 3¢ ends are preferential substrates for the reaction
in vitro (2,21). There may therefore be differences in 3¢ end
maturation of Xenopus and human U1 snRNAs, and between
human U1 and U2 snRNAs. The availability of in vitro systems
for both 3¢-box-dependent processing and maturation of 3¢
extended pre-U1 snRNAs will facilitate the molecular analysis
of all steps leading to accurate formation of U1 3¢ ends.
ACKNOWLEDGEMENTS
We thank Alice Taylor for excellent technical assistance. We
are grateful to Berndt Mueller for the kind gift of the antiSLBP antibody. We also thank Zbigniew Dominski, Nick
Proudfoot and Andre Furger for helpful suggestions and
comments on the manuscript. P.U. was supported by MRC Cooperative Component grant No. G9900343 and the Edward
Penley Abraham Trust. S.M. was supported by MRC Senior
Fellowship No. G117/309.
REFERENCES
1. Uguen,P. and Murphy,S. (2003) The 3¢ ends of human pre-snRNAs are
produced by RNA polymerase II CTD-dependent RNA processing.
EMBO J., 22, 4544±4554.
2. Huang,Q., Jacobson,M.R. and Pederson,T. (1997) 3¢ Processing of
human pre-U2 small nuclear RNA: a base-pairing interaction between
the 3¢ extension of the precursor and an internal region. Mol. Cell. Biol.,
17, 7178±7185.
3. Will,C.L. and LuÈhrmann,R. (2001) Spliceosomal UsnRNP biogenesis,
structure and function. Curr. Opin. Cell Biol., 13, 290±301.
4. Hernandez,N. and Weiner,A.M. (1986) Formation of the 3¢ end of U1
snRNA requires compatible snRNA promoter elements. Cell, 47,
249±258.
5. Neuman de Vegvar,H.E., Lund,E. and Dahlberg,J.E. (1986) 3¢ End
formation of U1 snRNA precursors is coupled to transcription from
snRNA promoters. Cell, 47, 259±266.
6. Medlin,J.E., Uguen,P., Taylor,A., Bentley,D. and Murphy,S. (2003) The
C-terminal domain of pol II and a DRB-sensitive kinase are required for
3¢ processing of U2 snRNA. EMBO J., 22, 925±934.
7. Ryan,K., Calvo,O. and Manley,J.L. (2004) Evidence that
polyadenylation factor CPSF-73 is the mRNA 3¢ processing
endonuclease. RNA, 10, 565±573.
8. Wahle,E. and RuÈegsegger,U. (1999) 3¢-End processing of pre-mRNA in
eukaryotes. FEMS Microbiol. Rev., 23, 277±295.
9. Dominski,Z. and Marzluff,W.F. (1999) Formation of the 3¢ end of
histone mRNA. Gene, 239, 1±14.
10. Shapiro,D.J., Sharp,P.A., Wahli,W.W and Keller,M.J. (1988) A highef®ciency HeLa cell nuclear transcription extract. DNA, 7, 47±55.
11. Hirose,Y. and Manley,J.L. (1997) Creatine phosphate, not ATP, is
required for 3¢ end cleavage of mammalian pre-mRNA in vitro. J. Biol.
Chem., 272, 29636±29642.
12. Hirose,Y. and Manley,J.L. (1998) RNA polymerase II is an essential
mRNA polyadenylation factor. Nature, 395, 93±96.
13. McCracken,S., Fong,N., Rosonina,E., Yankulov,K., Brothers,G.,
Siderovski,D., Hessel,A., Foster,S., Shuman,S. and Bentley,D.L. (1997)
5¢-Capping enzymes are targeted to pre-mRNA by binding to the
phosphorylated carboxy-terminal domain of RNA polymerase II. Genes
Dev., 11, 3306±3318.
14. Flaherty,S.M., Fortes,P., Izaurralde,E., Mattaj,I.W. and Gilmartin,G.M.
(1997) Participation of the nuclear cap binding complex in pre-mRNA 3¢
processing. Proc. Natl Acad. Sci. USA, 94, 11893±11898.
15. Dignam,J.D., Lebovitz,R.M. and Roeder,R.G. (1983) Accurate
transcription initiation by RNA polymerase II in a soluble extract from
isolated mammalian nuclei. Nucleic Acids Res., 11, 1475±1489.
16. Gick,O., Kramer,A., Vasserot,A. and Birnstiel,M.L. (1987) Generation of
histone mRNA 3¢ ends by endonucleolytic cleavage of the pre-mRNA in
a snRNP-dependent in vitro reaction. Proc. Natl Acad. Sci. USA, 84,
8937±8940.
17. Hernandez,N. (1985) Formation of the 3¢ end of U1 snRNA is directed by
a conserved sequence located downstream of the coding region.
EMBO J., 4, 1827±1837.
18. Patton,J. and Wieben,E. (1987) U1 precursors: variant 3¢ ¯anking
sequences are transcribed in human cells. J. Cell Biol., 104, 175±182.
19. Terns,M.P., Dahlberg,J.E. and Lund,E. (1993) Multiple cis-acting signals
for export of pre-U1 snRNA from the nucleus. Genes Dev., 7,
1898±1908.
20. Neuman de Vegvar,H.E. and Dahlberg,J.E. (1990) Nucleocytoplasmic
transport and processing of small nuclear RNA precursors. Mol. Cell.
Biol., 10, 3365±3375.
21. Huang,Q. and Pederson,T. (1999) A human U2 RNA mutant stalled in 3¢
end processing is impaired in nuclear import. Nucleic Acids Res., 15,
1025±1031.
22. Morlando,M., Greco,P., Dichtl, B., Fatica,A., Keller,W. and Bozzoni,I.
(2002) Functional analysis of yeast snoRNA and snRNA 3¢-end
formation mediated by uncoupling of cleavage and polyadenylation. Mol.
Cell. Biol., 22, 1379±1389.
23. Rotondo,G. and Frendewey,D. (1996) Puri®cation and characterization
of the Pac1 ribonuclease of Schizosaccharomyces pombe. Nucleic Acids
Res., 24, 2377±2386.
24. Lamontagne,B. and Elela,S.A. (2001) Puri®cation and characterization of
Schizosaccharomyces pombe Rnt1p nuclease. Methods Enzymol., 342,
159±167.
25. Allmang,C., Kufel,J., Chanfreau,G., Mitchell,P., Petfalski,E. and
Tollervey,D. (1999) Functions of the exosome in rRNA, snoRNA and
snRNA synthesis. EMBO J., 18, 5399±5410.
26. Mattaj,I.W. (1986) Cap trimethylation of U snRNA is cytoplasmic and
dependent on U snRNP protein binding. Cell, 46, 905±911.
27. Seipelt,R.L., Zheng,B., Asuru,A. and Rymond,B.C. (1999) U1 snRNA is
cleaved by RNase III and processed through an Sm site-dependent
pathway. Nucleic Acids Res., 27, 587±595.
28. Yong,J., Pellizzoni,L. and Dreyfuss,G. (2002) Sequence-speci®c
interaction of U1 snRNA with the SMN complex. EMBO J., 21,
1188±1196.
29. Jacobson,M.R., Rhoadhouse,M. and Pederson,T. (1993) U2 small nuclear
RNA 3¢ end formation is directed by a critical internal structure distinct
from the processing site. Mol. Cell. Biol., 13, 1119±1129.