Evolution of RPS4Y.

Evolution of RPS4Y
Andrew W. Bergen,*† Marina Pratt,* Patrick T. Mehlman,‡ and David Goldman*
*Laboratory of Neurogenetics, National Institute on Alcohol Abuse and Alcoholism, Rockville, Maryland; †Genetic
Epidemiology Branch, Division of Cancer Genetics and Epidemiology, National Cancer Institute, Rockville, Maryland; and
‡LABS of Virginia, Inc., Yemassee, South Carolina
Sequence variation within RPS4Y, a ribosomal protein gene located in the nonpseudoautosomal region of the Y
chromosome, was used to elucidate the origin of this gene in primates. Complete coding and additional flanking
sequences (949 bp) of the RPS4Y locus were determined in four nonhuman primate species. Phylogenetic reconstruction of RPS4 sequence evolution supports the monophyly of mammalian RPS4 and RPS4Y. Molecular evolutionary rate estimation reveals significantly elevated rates of DNA and protein evolution in RPS4Y compared with
its X-chromosome homologs. These rates enable us to estimate the timing of the transposition of RPS4X to the Y
chromosome (95% confidence interval, 32 MYA–74 MYA), and this estimate was verified by Southern hybridization
analysis of prosimian and simian genomic DNA. These data support a transposition event of ancestral primate
RPS4X to the Y chromosome prior to the divergence of Prosimii.
Introduction
Evolution of genes on the Y chromosome and of
pairs of homologous loci found on the X and Y chromosomes have been intensively investigated for the insights they provide into sex-chromosome mutation rates,
chromosome evolution, and the basis of sex determination and infertility. Estimates of evolutionary substitution rates have generally revealed that loci on the Y
chromosome are evolving approximately twice as fast
as their X-chromosome homologs (e.g., Pamilo and
Bianchi 1993; Shimmin et al. 1993, 1994), supporting
the male-driven mutation model of Miyata et al. (1987).
However, synonymous substitution analysis of 238 autosomal, 33 X-linked and 6 Y-linked rat–mouse homologous loci support a significantly reduced rate of rodent
X-chromosomal sequence evolution compared with rodent autosomes and the rodent Y chromosome where
the rate of Y-chromosomal sequence evolution does not
differ significantly from that of autosomes (McVean and
Hurst 1997). X- and Y-linked homologous sequences
have been used to reconstruct sex chromosome evolution, e.g., the present short- and long-arm pseudoautosomal boundaries and their sequence evolution (Ellis et
al. 1990; Kvaloy et al. 1994). Y-chromosome–linked
loci have also been used to investigate the evolution of
the sex-determination locus, Sry (Whitfield et al. 1993),
where high KA/KS ratios and limited evidence for positive selection in great apes have previously been described (Pamilo and O’Neill 1997).
The ribosomal protein small subunit 4 (RPS4) is a
small (29 Kd, 263 amino acid), basic ribosomal protein.
RPS4 is involved in mRNA binding and is located at
the 40S/60S subunit interface of the small ribosomal
subunit (Nygard and Nika 1982). The sequence, function, and position of RPS4 and other ribosomal proteins
are highly conserved (Lake 1985). The small ribosome
Key words: RPS4Y, RPS4X, Hominidae, relative rates, phylogenetic reconstruction.
Address for correspondence and reprints: Andrew W. Bergen,
12420 Parklawn Drive, Park 5 Bldg., Room 451, Rockville, Maryland
20852.
E-mail: [email protected] or [email protected].
Mol. Biol. Evol. 15(11):1412–1419. 1998
q 1998 by the Society for Molecular Biology and Evolution. ISSN: 0737-4038
1412
subunit is responsible for accurate translation, and the
functional interactions among proteins within this subunit are conserved across prokaryotic and eukaryotic
lineages (Alksne et al. 1993). The highly conserved position of RPS4 within the ribosome suggests that protein–protein interactions in the local environment may
have constrained the structural evolution of RPS4.
In the human, two homologous loci, RPS4X (Xq13)
and RPS4Y (distal Yp, 70 kb, proximal to the pseudoautosomal boundary of the Y chromosome), encode proteins with 92.8% amino acid identity (Fisher et al.
1990). Both human RPS4 genes complement a temperature-sensitive mutant of hamster Rps4 (Watanabe et al.
1993), and each is expressed and incorporated into functional ribosomes (Zinn et al. 1994). Human males therefore contain two different species of ribosome, although
the relative importance of the less highly expressed
RPS4Y gene product, expressed at one-tenth the level of
RPS4X, is unknown (Zinn et al. 1994). No other human
ribosomal protein is known to have a homologous copy
(Fisher et al. 1990). Rodent RPS4 cDNA have been
cloned from mouse, rat, and Chinese and Syrian hamster
species (Devi et al. 1989; Zinn et al. 1991; Watanabe et
al. 1993; M. Ebihara, personal communication). The
chicken contains an S4 homolog that is nearly identical
to RPS4X and that is probably autosomal (Zinn et al.
1994). S4-protein homologs have also been identified
and cloned in yeast (Synetos et al. 1992), fruit fly (Yokokura et al. 1993), and tetrahymena (Palm et al. 1995).
Reverse-transcription polymerase chain reaction (RTPCR) products corresponding to RPS4X have been sequenced from six species representing four orders and
six families of mammals (Omoe and Endo 1996). Amino acid identity of X-linked RPS4 protein among all
vertebrates is $98.5%, suggesting severe functional
constraints on structural evolution (table 1).
The Y-chromosome homolog, RPS4Y, has only
been detected in primates. Subsequent to the cloning of
the RPS4Y cDNA in humans (Fisher et al. 1990), RTPCR sequencing and genomic hybridization provided
evidence for the presence of a Y-specific locus within
Japanese macaques (Macaca fuscata, Mfu) (Omoe and
Endo 1996), and a single exon-to-exon PCR product
Evolution of RPS4Y
1413
Table 1
Nucleotide Distances (31,000) and Nonidentical Amino Acids in the RPS4 Gene Homologs of Primates, Rodents,
Chicken and Fruit Fly
HsaY
HsaY . . . .
Ppn . . . . .
Ptr . . . . .
Ggo . . . . .
Ppy . . . . .
HsaX . . .
Rra . . . . .
Mmu . . . .
Cgrc . . . .
Cgrs . . . .
Gga . . . . .
Dme . . . .
Ppn
5.1
1
1
1
8
19
19
19
19
19
16
70
0
2
9
20
20
20
20
20
17
71
Ptr
7.7
2.5
2
9
20
20
20
20
20
17
71
Ggo
10.2
12.8
15.4
7
20
20
20
20
20
17
71
Ppy
21.9
24.5
27.2
21.9
18
18
18
18
18
17
72
HsaX
208
206
206
202
199
0
0
1
0
4
66
Rra
Mmu
Cgrc
Cgrs
Gga
Dme
234
236
238
236
228
95.7
223
225
221
225
218
97.1
47.4
215
217
213
217
209
88.4
53
48.8
211
213
209
213
202
97.1
68.2
65.3
31.2
215
220
220
220
212
185
186
185
182
177
433
438
441
430
437
414
427
419
434
435
360
0
1
0
4
66
1
0
4
66
1
4
66
4
66
63
NOTE.—Nucleotide distances appear above the diagonal; nonidentical amino acids appear below the diagonal. Taxa and RSP4 homolog abbreviations are as
follows: HsaY, human RPS4Y; Ppn, pygmy chimpanzee (Pan paniscus); Ptr, common chimpanzee (Pan troglodytes); Ggo, gorilla (Gorilla gorilla); Ppy, orangutan
(Pongo pygmaeus); HsaX, human RPS4X; Rra, rat (Rattus rattus); Mmu, mouse (Mus musculus); Cgrc, Chinese hamster (Cricetulus griseus); Cgrs, Syrian hamster
(Cricetulus griseus); Gga, chicken (Gallus gallus); and Dme, fruit fly (Drosophila melanogaster).
provided evidence for RPS4Y in the Hominidae and in
gibbons (Hylobates lar, Hla; Samollow et al. 1996). Genomic hybridization and RT-PCR have failed to demonstrate male-specific bands corresponding to a RPS4Y
locus in all other mammalian orders tested (Omoe and
Endo 1996).
Sequence, RT-PCR, hybridization, and phylogenetic analyses of RPS4 sequences have been interpreted as
indicating the following: (1) RPS4 genes homologous to
RPS4X and RPS4Y existed before the mammalian radiation, and RPS4Y sequences were subsequently lost in
mammalian orders, specifically, in Rodentia, Carnivora,
and Artiodactyla, but not in Primates (Omoe and Endo
1996); and (2) avian RPS4 arose as a result of gene
conversion between ancestral RPS4X and RPS4Y homologs (Zinn et al. 1994). A lack of available full-length
RPS4Y sequences has limited tests of phylogenetic hypotheses regarding the origin of RPS4Y and its subsequent evolution. We determined the sequence of the entire RPS4Y gene in four nonhuman primate species, hybridized RPS4Y to Primate DNA, and performed phylogenetic and molecular evolutionary rate analyses to
resolve the uncertain origin, evolution, and phylogeny
of RPS4Y. We address the following issues: (1) the phylogeny and relative evolutionary rates of RPS4Y and
RPS4X and the possible origin of avian RPS4 by conversion between ancestral RPS4Y and RPS4X homologs,
and (2) the origin and evolution of the RPS4Y protein.
Materials and Methods
For sequencing, the nonhuman primate sample consisted of genomic DNA from two common chimpanzees
(Pan troglodytes, Ptr), two pygmy chimpanzees (Pan
paniscus, Ppn), two gorillas (Gorilla gorilla, Ggo), and
one orangutan (Pongo pygmaeus, Ppy) (a gift from S. J.
O’Brien). Exon-specific PCR primers that contained intronic flanking sequences for the seven exons of the
RPS4Y gene were synthesized (GenBank accession
number L24370; Zinn et al. 1994; L. Brown, personal
communication). 59 universal-tagged (221M13 and
M13REV) primer sequences were used for PCR, and
universal primers were used for direct sequencing of
PCR products as previously described (Bergen et al.
1996). Those exon products refractory to direct sequencing (e.g., exon 2) were amplified with intronic flanking
primers, gel purified, cloned, and sequenced as previously described (Bergen et al. 1996). A total of 949 bp
of RPS4Y sequence, including 792 bases of coding sequence, 94 bases of 59 UTR sequence, and 63 bases of
39 UTR sequence, was determined on both strands via
direct sequencing of amplified RPS4Y exons for each
nonhuman primate (fig. 1). The coding sequence only
was used for analysis in this paper, and these sequences
are available as GenBank accession numbers
AF076991–AF076994, for Ppn, Ptr, Ggo, and Ppy, respectively.
Phylogenetic analysis of RPS4 evolution among
vertebrates was performed using these novel sequences;
human RPS4Y, HsaY (M58459); human RPS4X, HsaX
(M58458); Chinese hamster, D26473 (Cricetulus griseus, Cgrc); Syrian hamster, D11087 (Cricetulus griseus, Cgrs); mouse, M73436 (Mus musculus, Mmu); rat,
X14210 (Rattus rattus, Rra); and chicken, L24368 (Gallus gallus, Gga); with the fruit fly RPS4 sequence,
D16257 (Drosophila melanogaster, Dme) as designated
outgroup taxon (table 1 and fig. 1). As all of the additional sequences were derived from cDNA clones that
included variable amounts of nontranslated sequence,
phylogenetic analyses were performed with coding sequence only (792 bases of sequence). Phylogenetic reconstruction of protein evolution among human RPS4X
and Hominidae RPS4Y sequences was performed with
the translated Hominidae RPS4Y and HsaX coding sequence (fig. 2).
Alignment of the published sequences was performed by inspection of the translated nucleotide sequence. The vertebrate sequences code for a 263–amino
acid protein, while the Drosophila sequence codes for a
261–amino acid protein; thus, the nucleotide sequences
were aligned from translation initiation codon to ter-
1414
Bergen et al.
FIG. 2.—RPS4 amino acid evolution. Most parsimonious phylogram of length 89 steps and consistency index 0.97. Asterisks denote
nonconservative amino acid substitutions. Phylogeny within rodents
(one substitution at Chinese hamster) and between fly and vertebrates
is not shown. Abbreviations of taxa and RPS4 homologs are as described in Materials and Methods.
FIG. 1.—RPS4 phylogeny: (a) Maximum-parsimony cladogram of
length 544 steps and consistency index 0.818. The number of base
changes are depicted above the branches. Bootstrap resampling support
is indicated above the branch, where a single asterisk indicates .90%
support and double asterisks indicate .95% support. Jackknife support
was .98% at all branches, except at the ancestral Gcr node. (b) Maximum-likelihood phylogram. The minimum-evolution phylogram had
an identical topology. Scale 5 0.1 divergence. Abbreviations of taxa
and RPS4 homologs are as described in Materials and Methods.
mination codon, with the only gaps required inserted
into the Drosophila sequence at codon 260 and at codon
263 to make all sequences equal in length and to optimize the nucleotide alignment. Gaps were considered to
be missing data for parsimony reconstruction (gaps are
considered to be missing data by default for maximumlikelihood and minimum-evolution reconstruction in
PAUP). While the mammalian RPS4 sequences analyzed derive from a diverse group of species, among
mammalian RPS4 homologs, divergence is moderate
(table 1), base frequencies (mean frequency of A 5
0.276, mean frequency of C 5 0.219, mean frequency
of G 5 0.258, and mean frequency of T 5 0.247) are
homogeneous (x2 5 4.87, df 5 27, P 5 0.999), and
mean transition/transversion ratios are 2.23 6 0.76 (SD).
The Kimura 2-parameter method (Kimura 1980) was
used to construct distance matrices (table 1). All phylogenetic analyses were performed using the beta-test
version 64 of PAUP 4.0 (Swofford 1998) and MacClade
3.0 (Maddison and Maddison 1992). Resampling was
performed with 1,000 repetitions and the bootstrap (all
792 characters) and jackknife (20% deletion) options in
PAUP 4.0.
Numbers of synonymous and nonsynonymous
changes were estimated by Synonym.exe (Pamilo and
Bianchi 1993). Differences in relative rates of substitution in RPS4Y versus RPS4X and Rps4 lineages were
evaluated by the relative-rate test (Wu and Li 1985) with
parameter values calculated from Synonym.exe output
and with chicken RPS4 as reference outgroup. A paired
t-test with pooled variances was used to determine the
significance of differences between rates. To test for nucleotide substitution rate differences, transversions at
fourfold degenerate sites alone (B4) were used because
of moderate transition substitution saturation at fourfold
degenerate sites (transition/transversion ratio 5 0.67 6
0.058), while to test for protein substitution rate differences, nonsynonymous substitution rates (Ka) were used
(Wu and Li 1985; Pamilo and Bianchi 1993). The standard error of the mean rate ratio between RPS4 homolog
lineages was calculated by the delta technique as in
McVean and Hurst (1997). The transposition time of
RPS4X onto the Y chromosome was estimated by the
Poisson model of Kimura (1969) with the calculated
nonsynonymous rates and with the numbers of amino
acid substitutions observed in the protein phylogeny.
Evolution of RPS4Y
Standard errors are given throughout, either obtained
from Synonym.exe or calculated here.
To evaluate the presence of RPS4X and RPS4Y loci
in various primates, Southern blotting of genomic DNA
(10 mg) from female and male Hsa, Tonkean macaque
(Macaca tonkeana, Mto), owl monkey (Aotus trivirgatus, Atr), and ring-tailed lemur (Lemur catta, Lca) was
performed by overnight digestion with EcoRI (Boehringer Mannheim), electrophoresis for 17 h at 2.4 V/cm in
a 0.7% agarose (BRL) horizontal gel with l HindIII
DNA markers (Research Genetics), and overnight capillary transfer to a nylon membrane (Boehringer Mannheim), followed by baking at 808C for 1 h. The nylon
membrane was prehybridized for 1 h at 558C in 0.5 M
NaHPO4, 7% SDS, 1 mM EDTA, pH 7.2 (Church and
Gilbert 1984) with 10 mg/ml of denatured human Cot-1
DNA (Boehringer Mannheim) and hybridized overnight
at 558C with dCTP 32P random-primer labeled (Feinberg
and Vogelstein 1984) RPS4Y probe at ;6 3 108 CPM/
mg. The RPS4Y probe was a XhoI-HindIII fragment of
pcDEBS4Y (a gift of A.R. Zinn; Watanabe et al. 1993),
containing the entire RPS4Y cDNA from pDP1278
(Fisher et al. 1990) and small amounts (;15 bp) of pBS
MCS sequences (M. Watanabe, personal communication). Following hybridization, filters were washed twice
at high stringency (558C washes at 0.2 3 SSC and 0.1%
SDS) and exposed at 2808C to Kodak XAR film. Approximate molecular weights of hybridizing DNA fragments were calculated by comparison with a l HindIIIsize standard.
Results and Discussion
Reconstruction of RPS4 Phylogeny
Phylogenetic reconstruction of RPS4 vertebrate homolog evolution across 12 species representing primates, rodents, and birds with the criterion of maximum
parsimony yielded two trees of equal length and consistency index (fig. 1). The only topological difference was
the presence or absence of a hamster (Cricetulus griseus) ancestor. Resampling analyses provided marginal
support (51.8% with bootstrap, 50.8% with jackknife)
for the monophyly of a Cgr ancestor but significant support (.90% with bootstrap, .95% with jackknife) for
all other internodes (fig. 1a). Phylogenetic reconstruction with the criteria of minimum evolution and maximum likelihood produced single trees with topology
identical to the maximum-parsimony tree shown (fig.
1b). Resampling with these two latter methods provided
increased bootstrap support for the hamster ancestor and
decreased bootstrap support for the ancestor of gorilla,
chimpanzee, and human, while jackknife support remained above 95%. Phylogenetic reconstruction of vertebrate RPS4 homologs thus supports divergence of
monophyletic avian and mammalian RPS4 sequences
from a common vertebrate ancestor and subsequent divergence of mammalian RPS4 into X- and Y-chromosomal homologs.
It has been suggested that the Gga RPS4 gene arose
through conversion of ancestral RPS4Y and RPS4X homologous sequences. This suggestion has been based on
1415
Table 2
Evolution of RPS4 Exon 4 Amino Acid Residues
SPECIES OR
HOMOLOG
HsaYa . . . .
Ppy . . . . . .
HsaX . . . .
Gga . . . . .
Dme . . . . .
CODON/RESIDUE
93
GAG Glu
GAT Asp
GAC Asp
GAG Glu
GAG Glu
98
102
116
CAT His
CAT His
AAT Asn
CAC His
TTC Phe
GTC Val
GTC Val
ATC Ile
GTG Val
GTC Val
GTG Val
GTG Val
CCT Pro
GCT Ala
GCC Ala
NOTE.—Abbreviations of taxa and RPS4 homologs are as follows: HsaY,
human RPS4Y; Ppy, orangutan (Pongo Pygmaeus); HsaX, human RPS4X; Gga,
chicken (Gallus gallus); and Dme, fruit fly (Drosophila melanogaster).
the findings that the only amino acid differences between human RPS4X and chicken RPS4 (residues 93,
98, 102, and 116) are within a single exon and that three
of the four nonidentical amino acid residues are identical
in chicken RPS4 and human RPS4Y (Zinn et al. 1994).
Inspection of the codon sequences from chicken RPS4,
human RPS4X, and RPS4Y reveals that, at amino acid
residues 93, 98, 102, and 116, human RPS4X has undergone the same number of substitutions (six) as human RPS4Y, if one assumes a RPS4 to RPS4X to RPS4Y
phylogeny (table 2). Parsimony reconstruction of amino
acid evolution identifies two changes at each of these
residues, one substitution occurring between chicken
and ancestral mammalian RPS4 and one substitution occurring between RPS4X and primate RPS4Y; that is,
these four residues are homoplasic (table 2 and fig. 2).
Within the single exon (exon 4) proposed to have undergone conversion, the number of nucleotide differences among the three species does support a closer relationship between Gga and RPS4Y (8 differences) than
between Gga and RPS4X (13 differences). However, for
the entire coding region, each of three phylogenetic reconstruction methods supports mammalian RPS4X and
Hominidae RPS4Y as monophyletic sister taxa paraphyletic to Gga (fig. 1), where Gga is closer to HsaX, not
HsaY (table 1). Nor does phylogenetic reconstruction of
RPS4X and RPS4Y homologs support gene conversion
events after the divergence of mammalian RPS4. Thus,
the suggested conversion of exon 4 of ancestral RPS4X
by ancestral RPS4Y is not supported by evaluation of
sequence evolution and phylogenetic reconstruction.
This phylogenetic reconstruction significantly supports monophyly of mammalian RPS4X and primate
RPS4Y, but the divergence of RPS4X and RPS4Y homologs appears to have taken place prior to the divergence
of primate RPS4X and rodent Rps4 (fig. 1). This was
the interpretation of a neighbor-joining analysis performed on 496 bases of RPS4 homolog sequence from
12 species (including five of the primate and rodent species used here; Omoe and Endo 1996). This interpretation made an assumption that nucleotide substitution
rates within RPS4X and RPS4Y sequence lineages are
equal. This assumption was not tested by Omoe and
Endo (1996) but is tested here by the relative-rate test
with multiple species and transversions at fourfold degenerate sites (a measure of synonymous substitution
rates less affected by evolutionary distance than transi-
1416
Bergen et al.
Table 3
Mammalian RPS4 Synonymous and Nonsynonymous
Nucleotide Substitution Rates
RPS4Y
B4a
Var (B4)
HsaY . . .
Ppn . . . .
Ptr . . . . .
Ggo . . . .
Ppy . . . .
t5
0.5697
0.5655
0.5697
0.5868
0.6047
2.38
0.008766
0.008563
0.008766
0.009498
0.010313
RPS4Y
Kab
Var(Ka)
HsaY . . .
Ppn . . . .
Ptr . . . . .
Ggo . . . .
Ppy . . . .
t5
0.0476
0.0492
0.0492
0.0500
0.0462
7.33
0.000049
0.000049
0.000049
0.000049
0.000049
RPS4X/Rps4
B4
Var(B4)
0.38773
0.42095
0.46641
0.42639
0.48769
0.003794
0.004460
0.005596
0.004616
0.006184
RPS4X/Rps4
Ka
Var(Ka)
HsaX
Rra
Mmu
Cgrc
Cgrs
P 5 0.000041
0.00159
0.0192
0.0209
0.0192
0.0192
0.000016
0.000016
0.000016
0.000016
0.000016
HsaX
Rra
Mmu
Cgrc
Cgrs
P 5 0.022
NOTE.—Taxa and RSP4 homolog abbreviations are as follows: HsaY, human
RPS4Y; Ppn, pygmy chimpanzee (Pan paniscus); Ptr, common chimpanzee (Pan
troglodytes); Ggo, gorilla (Gorilla gorilla); Ppy, orangutan (Pongo pygmaeus);
HsaX, human RPS4Y; Rra, rat (Rattus rattus); Mmu, mouse (Mus musculus);
Cgrc, Chinese hamster (Cricetulus griseus); and Cgrs, Syrian hamster (Cricetulus griseus).
a B 5 transversion rate at fourfold degenerate sites (synonymous substitu4
tion rate).
b K 5 nonsynonymous substitution rate.
a
tions), with chicken RPS4 sequence as a reference outgroup (table 3). Relative-rate estimates indicate that
Hominidae RPS4Y is evolving significantly faster than
are primate RPS4X and rodent Rps4 sequences (P 5
0.022).
RPS4Y appears to be evolving significantly faster
than its X-chromosome homologs, but this difference in
rates may be because of multiple factors. The mean synonymous rate ratio between Y- and X-chromosome
RPS4 lineages (1.32 6 0.14) is less than that previously
described for either primate or rodent ratios of approximately 2 for Y- and X-chromosome homologous loci
(Pamilo and Bianchi 1993; Shimmin et al. 1993;
McVean and Hurst 1997), yet significantly greater than
1.0. The synonymous rate ratio of primate RPS4Y to
rodent Rps4 is 1.27 6 0.13 (P 5 0.041), and the synonymous rate ratio of human RPS4Y to RPS4X is 1.35,
again, both are greater than 1 but are not significantly
different from the combined rate. Thus, these results are
concordant with the observed lower rate of nucleotide
substitution on the X chromosome as compared with the
Y chromosome (McVean and Hurst 1997). These results
are also compatible with the hypothesis that the mutation rate of the Y chromosome in males is higher than
the mutation rate of the X chromosome in females because of the greater number of germline mutations in
the male order, although they are significantly less than
the Y/X ratio, 2.23, reported for the ZF genes (Shimmin
et al. 1993). These synonymous rate estimates suggest
that it is inappropriate to assume equal rates of RPS4
evolution across Y- and X-linked lineages.
Transposition and Evolution of RPS4Y
The RPS4Y sequence determined in Hominidae can
be used to calculate amino acid substitution rates in the
FIG. 3.—Hybridization of RPS4Y to primates. Autoradiogram of
Southern blot of EcoRI-digested female and male Hsa (human), Mto
(Tonkean macaque), Atr (owl monkey), and Lca (Lemur catta) genomic
DNA hybridized with a RPS4Y cDNA probe. Arrowheads indicate
bands referred to in Results and Discussion.
RPS4Y protein and permits estimation of the time of
transposition of the ancestral RPS4 locus onto the Y
chromosome from the X chromosome. Transposition of
sequences from the X chromosome to the Y chromosome, followed by subsequent divergence due to lack of
recombination-mediated gene conversion, is accepted as
a major mechanism of mammalian Y-chromosome evolution (Graves 1995). The identical sequence and conserved physical location of the X-linked homolog in
mouse and man (Fisher et al. 1990; Hamvas et al. 1992;
Zinn et al. 1994) suggests that the transposition time of
the RPS4X gene to the Y chromosome can be estimated,
assuming that the RPS4X protein sequence represents
the ancestral Hominidae RPS4Y sequence. Inspection of
RPS4X homologs and comparison of nonsynonymous
substitution rates reveal that there has been nearly no
protein evolution among mammalian RPS4X homologs
(table 1, Omoe and Endo 1996, data not shown).
Nonsynonymous substitution rates between RPS4X
and primate RPS4Y sequences (0.053 6 0.0025) are 3.8
times the RPS4Y nonsynonymous substitution rates
among orangutan and the other Hominidae (0.014 6
0.0014), corresponding to a time period of 48 MYA (3.8
residues 3 12.5 MYA). This rate ratio reflects, in part,
the conservation of RPS4X. Furthermore, this point estimate with nonsynonymous rates considers sequence
differences only and is not capable of quantitating
homoplasic substitutions.
Phylogeny reconstruction indicates that a mean of
8.125 residues has evolved between orangutan and the
other primate RPS4Y sequences in the 12.5 Myr since
Ppy and the Homininae lineages diverged (fig. 3; Andrews 1992). Thus, one amino acid is diverging in the
Evolution of RPS4Y
less conserved RPS4Y protein every 2.2 Myr–4.7 Myr
(mean 6 2 SE; Kimura 1969). Phylogenetic reconstruction of RPS4Y protein evolution via parsimony reveals
that 16 amino acids have evolved between RPS4X and
the Hominidae RPS4Y ancestor (fig. 3). Assuming constancy of primate substitution rates (Easteal 1991), the
interval from the time of transposition of RPS4 from the
X to the Y chromosome to the radiation of the Hominidae 12.5 MYA is then approximately 36 Myr–75 Myr
(16 residues 3 2.2 Myr–4.7 Myr). However, assuming
that the reported twofold to fourfold hominoid reduction
in mutation rates (Li et al. 1987; Bailey et al. 1991)
results in an overall primate slowdown of perhaps 30%,
the interval from transposition of RPS4X to the Hominidae radiation would correspond to 24 Myr–50 Myr
(16 residues 3 1.5 Myr–3.1 Myr). A smaller reduction
in primate mutation rates would give a larger range, but
some reduction in rates seems to be necessary to be
consistent with the differences in rates observed in primate lineages alone. Averaging the constant and minimally reduced ranges, which is a conservative assumption, corresponds to a transposition date of the ancestral
RPS4X gene from the X chromosome to the Y chromosome of approximately 42 MYA–74 MYA.
This estimate of the divergence date of RPS4Y and
RPS4X indicates that RPS4 transposed from the X to the
Y chromosome after the mammalian radiation event 85
MYA but does not indicate whether RPS4 transposed
before or after the Anthropoidea–Prosimii divergence 55
MYA (Gingerich 1985). In order to place the transposition event into the appropriate era of mammalian evolution, a physical method comparing extant species for
the presence or absence of RPS4Y was used to provide
critical information on the timing of the transposition.
As assessed by both cDNA and genomic hybridization
and by RT-PCR analysis, other mammalian orders do
not demonstrate a Y-linked RPS4 locus (Hamvas et al.
1992; Zinn et al. 1994; Omoe and Endo 1996). However, Southern hybridization analysis of members of
four primate families (Lemuridae, Cebidae, Cercopithecidae, and Hominidae) representing prosimians, New
World and Old World monkeys, and great apes, respectively, in all cases revealed male-associated bands hybridizing to RPS4Y sequence. Male-specific RPS4 restriction fragments identified were of different sizes: approximately 5.0 kb and 6.0 kb in the Hsa male, 2.8 kb
and 4.8 kb in the Mto and Atr males, and 1.2 kb, 2.4
kb, and 16 kb in the Lca male.
Restriction fragments appearing in the male that
also appear in the female should represent the RPS4X
locus, whereas restriction fragments appearing in the
male that do not appear in the female represent either
the RPS4Y locus or a RPS4X restriction fragment length
polymorphism (RFLP). While the possibility of RPS4X
RFLPs potentially complicates the interpretation of
male-associated restriction fragments, here the interpretation of the presence of male-specific restriction fragments in the lemur male is of interest. The striking increase in the number of restriction fragments observed
in the Anthropoidea compared with the Prosimii species
may represent RPS4 pseudogenes that have accumulated
1417
in simian genomes since divergence of the primate suborders. The reduced or absent pseudogenes in the lemur
simplify interpretation of male-specific restriction fragments in the lemur. The male lemur has each of the
restriction fragments present in the female lemur but
also has three additional restriction fragments that total
nearly 20 kb. The presence of these restriction fragments
in the lemur male indicates that the RPS4Y locus was
present in the ancestor of this Prosimii species. Therefore, it is most parsimonious to conclude from molecular
evolutionary rate analyses and from the available RTPCR and molecular hybridization data that the RPS4 locus transposed from the X chromosome to the Y chromosome prior to the divergence of the Prosimii lineage
but subsequent to divergence of the ancestral primate
from the placental mammalian ancestor. The DNA hybridization results reveal that phylogenetic reconstructions of mammalian RPS4 homologs (fig. 2) that are
based on the constant-rate assumption are thus incorrect
in placing the divergence of RPS4X and RPS4Y homologs before the mammalian radiation. The error apparently arises because of the significant differences in both
synonymous and nonsynonymous substitution rates between RPS4X and RPS4Y loci.
This interpretation predicts that all primates will be
found to have a RPS4Y locus, excepting those species
in which RPS4Y may have undergone secondary loss or
evolution into a pseudogene, as has been suggested for
the STS-P locus (Yen et al. 1988). The alternative conclusion, that RPS4 transposed from the X chromosome
to the Y chromosome in the placental mammalian ancestor and was subsequently lost in all other placental
mammalian orders (Omoe and Endo 1996), is less parsimonious, requiring at least four events (one transposition in the placental mammalian ancestor and three
deletions in the ancestors of Rodentia, Artiodactyla, and
Carnivora). Molecular evolutionary rate analysis indicates that this alternative is not likely because the estimated divergence time of Rodentia from other mammalian orders is approximately 80 MYA–100 MYA (Li
et al. 1990). If the sequence of a single transposition
event and single or multiple deletions of RPS4Y necessary to explain the observed distribution of X-chromosome– and Y-chromosome–linked loci is hypothesized
to have happened in the ancestor of non-Rodentia Mammalia, multiple events (minimum two, maximum three)
are required, again rendering this a less likely phylogeny
than the single event proposed here.
Sequence identity comparisons reveal that RPS4Y
has diverged more from its homolog, RPS4X, than has
Hsa RPS4X from its homolog, Rps4, in other species.
There is complete sequence identity between human
RPS4X and mouse, rodent, and Syrian hamster Rps4
proteins but only 92.8% identity between human RPS4Y
and vertebrate RPS4X homologs (table 1). This remarkable degree of mammalian RPS4X sequence conservation (99.92% mean identity, Cgrc varies at one residue,
table 1) is presumably due to severe structural constraints that have acted on this functional ribosomal protein. However, despite RPS4Y’s role as a functional ribosomal protein (Watanabe et al. 1993), it has evolved
1418
Bergen et al.
as if less structurally constrained than RPS4X. Under
the assumption that the primate and rodent RPS4X sequence is identical to the ancestral RPS4X sequence that
transposed to the Y chromosome, phylogenetic reconstruction of mammalian RPS4X and Hominidae RPS4Y
with maximum parsimony shows 16 residues to have
evolved between RPS4X and the Hominidae ancestral
RPS4Y and shows 9 to have evolved among Hominidae
RPS4Y (fig. 3). Both conservative and nonconservative
amino acid substitutions (using side-group classifications of neutral–hydrophobic, neutral–polar, basic, and
acidic) occurred after the origin of RPS4Y by transposition and after the divergence of the Hominidae (fig.
3). The wide disparity in the ratio of conservative to
nonconservative amino acid substitutions between mammalian RPS4X and the Hominidae RPS4Y ancestor (6/
10) and this same ratio among Hominidae RPS4Y (7/2)
suggests that functional constraints may have restricted
further nonconservative substitutions, supporting the
functional role for RPS4Y (Watanabe et al. 1993). However, this is a small sample of amino acid substitutions
upon which to make the case for constrained evolution
due to selection.
The chromosomal locations of RPS4X and RPS4Y
support the general model of Y-chromosome rearrangements proposed to have occurred in primate evolution.
These rearrangements were a pericentric inversion involving illegitimate recombination between two pseudoautosomal boundary-like sequences to create the present Xp–Yp pseudoautosomal boundary, followed by another pericentric rearrangement to reinvert a smaller portion of the Y chromosome (Ellis et al. 1990, 1994;
Fukegawa et al. 1996; Sargent et al. 1996). The chromosomal distribution of several homologous gene pairs
supports the first rearrangement, e.g., on Xp/Yq, STS
(Yen et al. 1988), KAL (Incerti et al. 1992), and PBDX/
XGPY (Weller et al. 1995), and, on Xq/Yp, SOX/SRY
(Graves 1995), while some homologous gene pairs support the later rearrangement, e.g., on Xp/Yp, AMG (Bailey et al. 1992) and PKR (Schiebel et al. 1997). We
propose here that the location and history of the homologous gene pair RPS4X and RPS4Y support and help
to date this general model, in which RPS4Y arose
through a transposition from Xq to Yq prior to the prosimian–simian divergence, then was inverted to distal
Yp in the first pericentric inversion proposed to have
created the short-arm pseudoautosomal boundary after
the prosimian divergence (Fukagawa et al. 1996). The
observed protein evolution at the RPS4Y locus represents the combined effects of differential selection on
RPS4X and RPS4Y proteins and the differences in synonymous mutation rate between the X and Y chromosomes. The rate of sequence evolution of RPS4Y, its
presence in Prosimii and Anthropoidea, and its absence
in every other mammalian order support transposition
of RPS4X to the Y chromosome in early primate evolution and suggest that RPS4Y should be evaluated as
an ancestral locus in future studies of the evolution of
the Y chromosome of Primates (e.g., Glaser et al. 1997).
Acknowledgments
This publication was in partial fulfillment of the
Ph.D. requirements of A.W.B. at the George Washington
University Program in Genetics. We thank Andrew R.
Zinn for the gift of cDNA, Laura Brown for intronic
flanking sequence, and Stephen J. O’Brien for primate
DNA.
LITERATURE CITED
ALKSNE, L. E., R. A. ANTHONY, S. W. LIEBMAN, and J. R.
WARNER. 1993. An accuracy center in the ribosome conserved over 2 billion years. Proc. Natl. Acad. Sci. USA 90:
9538–9541.
ANDREWS, P. 1992. Evolution and environment in the Hominoidea. Nature 360:641–646.
BAILEY, D. M., N. A. AFFARA, and M. A. FERGUSON-SMITH.
1992. The X-Y homologous gene Amelogenin maps to the
short arms of both the X-chromosome and Y-chromosome
and is highly conserved in Primates. Genomics 14:203–205.
BAILEY, W. J., D. H. A. FITCH, D. A. TAGLE, J. CZELUSNIAAK,
J. L. SLIGHTOM, and M. GOODMAN. 1991. Molecular evolution of the Ch-globin gene locus: gibbon phylogeny and
the hominid slowdown. Mol. Biol. Evol. 8:155–184.
BERGEN, A., C.-Y. WANG, B. NAKHAI, and D. GOLDMAN. 1996.
Mass Allele Detection (MAD) of rare 5-HT1A structural variants with matrix pooling, allele-specific amplification and
electrochemiluminescence detection. Hum. Mutat. 7:135–
143.
CHURCH, G. M., and W. GILBERT. 1984. Genomic sequencing.
Proc. Natl. Acad. Sci. USA 81:1991–1995.
DEVI, K. R. G., Y.-L. CHAN, and I. G. WOOL. 1989. The primary structure of rat ribosomal protein S4. Biochim. Biophys. Acta. 1008:258–262.
EASTEAL, S. 1991. The relative rate of DNA evolution in primates. Mol. Biol. Evol. 8:115–127.
ELLIS, N. A., P. YEN, K. NEISWANGER, L. J. SHAPIRO, and P.
N. GOODFELLOW. 1990. Evolution of the pseudoautosomal
boundary in Old World monkeys and great apes. Cell 63:
977–986.
ELLIS, N. A., T.-Z. YE, S. PATTON, J. GERMAN, P. N. GOODFELLOW, and P. WELLER. 1994. Cloning of PBDX, an MIC2related gene that spans the pseudoautosomal boundary on
chromosome Xp. Nat. Genet. 6:394–400.
FEINBERG, A. P., and B. VOGELSTEIN. 1984. Addendum: a technique for radiolabeling DNA restriction endonuclease fragments to high specific activity. Anal. Biochem. 139:266–
267.
FISHER, E. M. C., P. BEER-ROMERO, L. G. BROWN, A. RIDLEY,
J. A. MCNEIL, J. B. LAWRENCE, H. F. WILLARD, F. R. BIEBER, and D. C. PAGE. 1990. Homologous ribosomal protein
genes on the X and Y chromosomes: escape from X inactivation and possible implications for Turner syndrome. Cell
63:1205–1218.
FUKAGAWA, T., Y. NAKAMURA, K. OKUMURA, M. NOGAMI, A.
ANDO, H. INOKO, N. SAITOU, and T. IKEMURA. 1996. Human pseudoautosomal boundary-like sequences: expression
and involvement in evolutionary formation of the presentday pseudoautosomal boundary of human sex chromosomes. Hum. Mol. Genet. 5:23–32.
GINGERICH, P. D. 1985. Nonlinear molecular clocks and apehuman divergence times. Pp. 411–416 in P. V. TOBIAS, ed.
Hominid evolution: past, present and future. Alan Liss, New
York.
GLASER, B., F. GRUTZNER, K. TAYLOR et al. (15 co-authors).
1997. Comparative mapping of Xp22 genes in hominoids—
Evolution of RPS4Y
evolutionary linear instability of their Y homologues. Chromosome Res. 5:167–176.
GRAVES, J. A. M. 1995. The origin and function of the mammalian Y chromosome and Y-borne genes—an evolving understanding. Bioessays 17:311–320.
HAMVAS, R. M. J., A. ZINN, J. T. KEER, E. M. C. FISHER, P.
BEER-ROMERO, S. D. M. BROWN, and D. C. PAGE. 1992.
Rps4 maps near the inactivation center on the mouse X
chromosome. Genomics 12:363–367.
INCERTI, B., S. GUIOLI, A. PRAGLIOLA et al. (11 co-authors).
1992. Kallman syndrome gene on the X and Y chromosomes: implications for evolutionary divergence of the human sex chromosomes. Nat. Genet. 2:311–314.
KIMURA, M. 1969. The rate of molecular evolution considered
from the standpoint of population genetics. Proc. Natl.
Acad. Sci. USA 63:1181–1188.
. 1980. A simple method for estimating evolutionary
rates of base substitutions through comparative studies of
nucleotide sequences. J. Mol. Evol. 16:111–120.
KVALOY, A., F. GALVAGNI, and W. R. A. BROWN. 1994. The
sequence organization of the long arm pseudoautosomal region of the human sex chromosomes. Hum. Mol. Genet. 3:
771–778.
LAKE, J. A. 1985. Evolving ribosome structure domains in archaebacteria, eubacteria, eocytes and eukaryotes. Annu.
Rev. Biochem. 54:507–530.
LI, W.-H., G. MANOLO, P. M. SHARP, C. O’HUIGIN, and Y.-W.
YANG. 1990. Molecular phylogeny of Rodentia, Lagomorpha, Primates, Artiodactyla, and Carnivora and molecular
clocks. Proc. Natl. Acad. Sci. USA 87:6703–6707.
LI, W.-H., M. TANIMURA, and P. M. SHARP. 1987. An evaluation of the molecular clock hypothesis using mammalian
DNA sequences. J. Mol. Evol. 25:330–342.
MADDISON, W. P., and D. R. MADDISON. 1992. MacClade: analysis of phylogeny and character evolution. Sinauer Associates, Sunderland, Mass.
MCVEAN, G. T., and L. D. HURST. 1997. Evidence for a selectively favourable reduction in the mutation rate of the X
chromosome. Nature 386:388–392.
MIYATA, T., H. HAYASHIDA, K. KUMA, K. MITSUYASA, and T.
YASUNAGA. 1987. Male-driven molecular evolution: a model and nucleotide sequence analysis. Cold Spring Harbor
Symp. Quant. Biol. 52:863–867.
NYGARD, O., and H. NIKA. 1982. Identification by RNA-protein cross-linking of ribosomal proteins located at the interface between the small and large subunits of mammalian
ribosomes. EMBO J. 1:357–362.
OMOE, K., and A. ENDO. 1996. Relationship between the monosomy X phenotype and Y-linked ribosomal protein S4
(Rps4) in several species of mammals: a molecular evolutionary analysis of Rps4 homologs. Genomics 31:44–50.
PALM, L., J. ANDERSEN, H. RAHBEK-NIELSEN, T. S. HANSEN,
K. KRISTIANSEN, and P. HOJRUP. 1995. The phosphorylated
ribosomal protein S7 in Tetrahymena is homologous with
mammalian S4 and the phosphorylated residues are located
in the C-terminal region. J. Biol. Chem. 270:6000–6005.
PAMILO, P., and N. O. BIANCHI. 1993. Evolution of the Zfx and
Zfy genes: rates and interdependence between the genes.
Mol. Biol. Evol. 10:271–281.
PAMILO, P., and R. J. W. O’NEILL. 1997. Evolution of the Sry
genes. Mol. Biol. Evol. 14:49–55.
1419
SAMOLLOW, P. B., L. M. CHERRY, S. M. WITTE, and J. ROGERS.
1996. Interspecific variation at the Y-linked RPS4Y locus in
hominids: implications for phylogeny. Am. J. Phys. Anthropol. 101:333–343.
SARGENT, C. A., H. BRIGGS, I. J. CHALMERS, B. LAMBSON, E.
WALKER, and N. A. AFFARA. 1996. The sequence organization of Yp/proximal Xq homologous regions of the human sex chromosomes is highly conserved. Genomics 32:
200–209.
SCHIEBEL, K., M. WINKELMANN, M. ANNELYSE, X. XIAOLING,
D. C. PAGE, D. WEIL, C. PETIT, and G. A. RAPPOLD. 1997.
Abnormal XY interchange between a novel isolated protein
kinase gene, PRKY, and its homologue, PRKX, accounts for
one third of all (Y1)XX males and (Y2)XY females. Hum.
Mol. Genet. 6:1985–1989.
SHIMMIN, L. C., B. H.-J. CHANG, and W. H. LI. 1993. Male
driven evolution of DNA sequences. Nature 362:745–747.
. 1994. Contrasting rates of nucleotide substitution in
the X-linked and Y-linked zinc finger genes. J. Mol. Evol.
39:569–578.
SWOFFORD, D. 1998. PAUP: phylogenetic analysis using parsimony. Version 4.0.0d64. Sinauer Associates, Sunderland,
Mass.
SYNETOS, D., M. D. DABEVA, and J. R. WARNER. 1992. The
yeast ribosomal protein S7 and its genes. J. Biol. Chem.
267:3008–3013.
WATANABE, M., A. R. ZINN, D. C. PAGE, and T. NISHIMOTO.
1993. Functional equivalence of human X- and Y-encoded
isoforms of ribosomal protein S4 consistent with a role in
Turner syndrome. Nat. Genet. 4:268–271.
WELLER, P. A., R. CRITCHER, P. N. GOODFELLOW, J. GERMAN,
and N. A. ELLIS. 1995. The human Y chromosome homologue of XG: transcription of a naturally truncated gene.
Hum. Mol. Genet. 4:859–868.
WHITFIELD, L. S., R. LOVELL-BADGE, and P. N. GOODFELLOW.
1993. Rapid sequence evolution of the mammalian sex-determining gene SRY. Nature 364:713–715.
WU, C.-I., and W.-H. LI. 1985. Evidence for higher rates of
nucleotide substitution in rodents than in man. Proc. Natl.
Acad. Sci. USA 82:1741–1745.
YEN, P., B. MARSH, E. ALLEN, S. P. TSAI, J. ELLISON, L. CONNOLLY, K. NEISWANGER, and L. J. SHAPIRO. 1988. The human X-linked steroid sulfatase gene and a Y-encoded pseudogene: evidence for an inversion of the Y chromosome
during primate evolution. Cell 55:1123–1135.
YOKOKURA, T., H. TEI, and D. YAMAMOTO. 1993. Sequence
and expression of a gene encoding a ribosomal protein S4
homolog from Drosophila melanogaster. Gene 132:251–
254.
ZINN A. R., R. K. ALAGAPPAN, L. G. BROWN, I. WOOL, and
D. C. PAGE. 1994. Structure and function of ribosomal protein S4 genes on the human and mouse sex chromosomes.
Mol. Cell Biol. 14:2485–2492.
ZINN, A. R., S. L. BRESSLER, P. BEER-ROMERO, D. A. ADLER,
V. M. CHAPMAN, D. C. PAGE, and M. DISTECHE. 1991. Inactivation of the Rps4 gene on the mouse X chromosome.
Genomics 11:1097–1101.
PEKKA PAMILO, reviewing editor
Accepted July 10, 1998