A Limited Role for Gene Duplications in the Evolution of Platypus

A Limited Role for Gene Duplications in the Evolution of
Platypus Venom
Emily S. W. Wong,*,1 Anthony T. Papenfuss,2 Camilla M. Whittington,1 Wesley C. Warren,3 and
Katherine Belov*,1
1
Faculty of Veterinary Science, The University of Sydney, Sydney, New South Wales, Australia
Bioinformatics Division, The Walter and Eliza Hall Institute for Medical Research, Parkville, Victoria, Australia
3
The Genome Center, Washington University School of Medicine
*Corresponding author: E-mail: [email protected]; [email protected].
Associate editor: Willie Swanson
2
Abstract
Key words: gene duplications, venom, platypus, evolution.
Introduction
Platypuses are venomous mammals. Male platypuses possess spurs on their hind legs that deliver venom during the
breeding season. In humans, envenomation causes both excruciating immediate and long-lasting pain that does not
respond to traditional painkillers such as morphine. Other
symptoms of envenomation include edema, a drop in blood
pressure, and blood coagulation (Martin and Tidswell 1895;
Fenner et al. 1992). Proteomic studies in the 1990s revealed
that platypus venom contains at least 19 different peptide
components (de Plater et al. 1995) of which only three
fractions were characterized: C-type natriuretic peptides,
defensin-like peptides, and nerve growth factors. More recently, a molecular transcriptomic study of an in-season
venom gland led to the identification of proteases, spider
venom alpha-latrotoxin-like peptides, cysteine-rich secretory proteins, cytolytic toxin-like peptides, and venom proteins with homology to stonefish stonustoxin (Whittington
et al. 2010).
Similar venom toxin gene families are found in a diverse
range of venomous animals including shrews, snakes and
other reptiles, amphibians, fish, mollusks, insects, cnidarians,
and echinoderms. These toxins have evolved in a convergent manner in different lineages independently (Fry et al.
2009). Protein scaffolds that are recruited into venom peptides include chitinase, cystatin, defensin, hyaluronidase,
Kunitz, lectin, lipocalin, natriuretic peptide, peptidase S1,
phospholipase A2, sphingomyelinase D, SPRY, and cysteine-rich secretory proteins including the AVIT family
(Fry et al. 2009).
Gene duplications appear to play an important role in
the recruitment and diversification of toxin genes (Kordis
and Gubensek 2000; Reza et al. 2006; Lynch 2007; Juárez
et al. 2008). The evolutionary fate of duplicated genes
was first developed by Ohno (1970). He proposed that immediately following a duplication event, one gene copy is
expected to maintain the ‘‘ancestral’’ role, whereas the other
copy is free to evolve new function (neofunctionalization).
It is now accepted that duplicated genes can be also maintained through the sharing of ancestral gene function (subfunctionalization) and that selective pressures can play
a role in the preservation of a gene duplicate (Bergthorsson
et al. 2007). Across frog species, caeruleins, which are skin
toxin decapeptides, have arisen by independent duplications
(Roelants et al. 2010). Venom serine proteases in the North
© Crown copyright 2011.
Mol. Biol. Evol. 29(1):167–177. 2012 doi:10.1093/molbev/msr180
Advance Access publication August 3, 2011
167
Research article
Gene duplication followed by adaptive selection is believed to be the primary driver of venom evolution. However, to date,
no studies have evaluated the importance of gene duplications for venom evolution using a genomic approach. The
availability of a sequenced genome and a venom gland transcriptome for the enigmatic platypus provides a unique
opportunity to explore the role that gene duplication plays in venom evolution. Here, we identify gene duplication events
and correlate them with expressed transcripts in an in-season venom gland. Gene duplicates (1,508) were identified. These
duplicated pairs (421), including genes that have undergone multiple rounds of gene duplications, were expressed in the
venom gland. The majority of these genes are involved in metabolism and protein synthesis not toxin functions. Twelve
secretory genes including serine proteases, metalloproteinases, and protease inhibitors likely to produce symptoms of
envenomation such as vasodilation and pain were detected. Only 16 of 107 platypus genes with high similarity to known
toxins evolved through gene duplication. Platypus venom C-type natriuretic peptides and nerve growth factor do not
possess lineage-specific gene duplicates. Extensive duplications, believed to increase the potency of toxic content and
promote toxin diversification, were not found. This is the first study to take a genome-wide approach in order to examine
the impact of gene duplication on venom evolution. Our findings support the idea that adaptive selection acts on gene
duplicates to drive the independent evolution and functional diversification of similar venom genes in venomous species.
However, gene duplications alone do not explain the ‘‘venome’’ of the platypus. Other mechanisms, such as alternative
splicing and mutation, may be important in venom innovation.
MBE
Wong et al. · doi:10.1093/molbev/msr180
American shrew have evolved adaptively through amino
acid changes to regulatory loops surrounding the active site,
also following gene duplications (Aminetzach et al. 2009).
Further examples of gene duplications in venom include
contoxins in the cone snail (Duda and Palumbi 1999),
a multitude of gene families including phospholipase A2
(Gutiérrez and Lomonte 1995), serine proteases (Kini
2005), serine protease inhibitors (Zupunski et al. 2003),
C-type lectins (Ogawa et al. 2005), coagulation factor V
(Minh Le et al. 2005; Reza et al. 2006), and three finger toxins (Fry et al. 2003), across a range of snake species and
cysteine-enriched toxins in scorpions (Zhijian et al. 2006).
Venom toxins across animal taxa are members of some
of the fastest duplicating gene families in humans: serine
proteases, defensins, and natural killer cell (NKC) receptor
genes (lectins) (Bailey et al. 2002). In humans, these genes
are involved in pathogen defense. In platypus, antimicrobial
peptides have given rise through tandem gene duplications
to venom peptides (Whittington et al. 2008). The rapid duplication rate in immune genes, driven by intense competition between host and pathogen, is likely to be favorable
toward the initial creation of new traits, such as venom
function. This is then followed by subsequent duplications
that are likely to reinforce the venom system. Venom genes,
like immune genes, are also likely to face strong adaptive
pressures. In most venomous animals, such pressure is
brought about through predator–prey interactions where
venom is used to immobilize prey and defend against predators and/or circumvent prey resistance (Poran et al. 1987;
Daltry et al. 1996; Biardi et al. 2006; Calvete et al. 2009).
However, in platypus, this adaptive pressure is most likely
exerted by intraspecific competition, to assert dominance
over other males during the breeding season (Grant 2007).
Prior studies on the selective constraints of gene duplicates in venom have focused on the venomous snakes and
the cone snail. In snakes, venom gene families such as phospholipase A2, serine protease, C-type lectin-like proteins,
metalloproteinases, and serine proteinase inhibitors appear
to have evolved through neofunctionalization under strong
selective constraints (Kordis and Gubensek 2000; Lynch
2007; Juárez et al. 2008; Aminetzach et al. 2009; Casewell,
Wagstaff, Harrison, Renjifo, et al. 2011). In the venomous
gastropod genus Conus, a number of contoxin families
block sodium and calcium channels and neural receptors
in prey species and have diversified their gene function
through strong positive selection following extensive duplications (Duda and Palumbi 1999). In addition to toxin peptides that have evolved through gene duplications, genes
involved in the biosynthesis of toxins, such as the sxt gene
cluster involved in the production of cyanobacteria neurotoxins, have also undergone duplications followed by adaptive evolution (Murray et al. 2011).
Although previous studies suggest that duplicated genes
are the major driver of venom evolution, the direct impact
of gene duplications on the evolution of venom has not
been characterized. This is primarily due to difficulties in
assigning gene duplicates in the absence of a sequenced
genome. Given that the platypus is the only venomous
168
animal with a sequenced genome (Warren et al. 2008)
and venom gland transcriptome (Whittington et al.
2010), we have taken a novel genome-wide approach to
assess how duplications have contributed to venom function. To place the role of gene duplication in the context of
platypus venom innovation, here, we identify lineagespecific gene duplicates in the platypus genome that are
also expressed in an in-season venom gland. Our results
suggest that gene duplications do not appear to be the major driver of toxin evolution in the platypus.
Materials and Methods
Identification of Platypus-Specific Gene
Duplications
We used homology data from Ensembl Compara Release 61
to identify all platypus genes that are duplicated in the
monotreme lineage including genes that are derived from
multiple lineage-specific duplications (Vilella et al. 2009;
Flicek et al. 2010). All cDNA and protein sequences from
monotreme gene duplicates were extracted. We took into
consideration the fact that gene loss in other lineages will
lead to incorrect inferences of monotreme duplications. A
phylogenetic tree may appear to show a species-specific
duplication when, in fact, the topology is the result of
multiple gene losses in other species (Casewell, Wagstaff,
Harrison, and Wüster 2011). However, due to the large
number of species in Ensembl, we expect this type of error
to be negligible. A list of nonredundant duplicated gene pairs
containing monotreme-specific duplication events was
constructed (supplementary file 1, Supplementary Material
online). Each pair contained at least one gene expressed in
the venom gland. We also constructed expansion groups
comprising two or more genes where at least one gene
is expressed in the venom gland. To confirm Ensembl gene
predictions in the absence of expression in the venom
gland, we checked for the expression of these genes in other
tissue transcriptomes.
Identification of Gene Duplicates Expressed in the
Venom Gland
A normalized cDNA library from a breeding season platypus venom gland was sequenced using the Illumina platform, as described in Whittington et al. 2010. The raw data
set comprised 19,069,168 reads of 36 nucleotides in length.
These reads were mapped against the platypus genome assembly (5.0) using MAQ (Li), and the number of reads
matching each Ensembl platypus gene was tallied and used
as digital expression values (see Whittington et al. 2010).
Multimapped reads were treated in a uniform random
manner to increase transcriptomic coverage and decrease
false negatives in the detection of expressed duplicates. Only monotreme genes with read counts above 100 were used
for further analysis. Protein searches were conducted using
BLASTP (Altschul et al. 1997) against Tox-Prot, a UniProtKB/
Swiss-Prot database of animal toxins (Release 2010_10) (Jungo and Bairoch 2005) with an E value cutoff of 1 105.
MBE
Gene Duplications and Platypus Venom Evolution · doi:10.1093/molbev/msr180
Identification of Secreted Gene Duplicates
Table 1. Number of Monotreme-Specific Duplications.
To date, all known toxin peptides are secretory proteins
(Fry et al. 2009). We used SignalP 3.0 hidden Markov models
(Bendtsen et al. 2004) and Gene Ontology (GO) (Ashburner
et al. 2000) terms assigned by basic local alignment search
tool (BLAST) to predict the cellular localization of expressed
protein sequences. Disulfide cross-linking, common to
secreted proteins, enhances stable tertiary structures (Mamathambika and Bardwell 2008). To confirm that our predictions are secreted, we compared the ratio of cysteine
residues with other amino acid residues in secretory and
nonsecretory peptides. Functional annotations were inferred by sequence similarity to human proteins. However,
we note that as these genes are the result of gene duplication, it is possible that completely novel function has
evolved.
Genes
Gene pairs
Number of duplication events
Pairs of duplicates
PAML
Test for Adaptive Selection
We used PAML (Yang 2007) to obtain dn/ds values for each
pair of duplicates under the pairwise M0 model. Since an
averaged omega ratio (M0) over the entire length of a gene
is not a sensitive measure of positive selection, which may
only occur on a small subset of sites (Anisimova et al. 2001),
we tested branches of venom gland expressed genes for
adaptive selection using the branch-site test, ‘‘test 2.’’
We note that both these approaches give conservative estimations of positive selection. Branch lengths taken from
M0 model runs were used as initial branch lengths in the
branch-site test. Pairs where branch lengths were greater
than five nucleotide substitutions per codon were removed
from further analysis. Alignments were constructed using
PRANK and a codon substitution matrix (Löytynoja and
Goldman 2008). Two test 2 runs were conducted. Sequences showing convergence of likelihood values with a difference of ,0.001 were kept. Alignments were carefully
checked to ensure gene duplicates were not incorrectly assigned due to short contig lengths. The chi-square distribution with one degree of freedom was used to evaluate
the likelihood ratio test statistic for positive selection.
FDR was used to adjust P values for multiple testing (Hochberg and Benjamini 1990). We checked for significant (P ,
0.05) cases of gene conversions using GENECONV (Sawyer
1999).
Phylogenetic Trees
Evolutionary relationships of protease protein sequences
were constructed using MEGA5 (Tamura et al. 2007).
The bootstrapped (500 replicates) trees were inferred using
the neighbor-joining (NJ) method with evolutionary distances computed using the Poisson correction method. Ambiguous positions were removed for each sequence pair.
Results
Identification of Duplicated Gene Groups
Fifty-six percent (10,676,863 reads) of the raw reads were
aligned to the platypus genome. We identified 378 pairs of
monotreme-specific duplicates, where at least one gene
3,741
1,508
421
378
159 pairs; 46 groups
NOTE.—‘‘Genes’’ denote all genes that are duplicated in the monotreme lineage.
‘‘Gene pairs’’ represent the number of duplications that have resulted in two
Ensembl genes. The ‘‘number of gene duplication events’’ denotes genes where
one or both duplicates are expressed. ‘‘Pairs of duplicates’’ consist of expressed
genes pair only. ‘‘PAML’’ duplicates include only genes that meet strict criteria;
these groups were subsequently used for selection tests.
copy (.100 Illumina reads) was one of the 5,410 protein-coding genes expressed in the platypus venom gland.
Of these, 327 showed venom gland expression in only one
duplicated copy, which is indicative of venom-specific
function evolution following gene duplication. A summary
of the number of monotreme-specific duplicates is shown
in table 1.
Gene family expansions were also identified. Forty-two
gene duplication events leading to a monophyletic clade
containing three or more genes were identified. In each
case, a gene duplication occurred within an internal, or ancestral, node. In the majority of cases, venom gland expression was not detected for all members of a clade. After
removing genes on short contigs, we found that the maximum number of genes expressed in the venom gland in
any monotreme-specific clade was three. We found evidence of expression of 34% of gene duplicates that were
not found in in-season venom gland in other tissue transcriptomes.
Duplicated Venom Genes
The vast majority of genes expressed in the venom gland
were not significantly similar to any known toxin proteins.
Only 107 genes (from 5,410 genes) matched existing toxin
and venom proteins from the Tox-Prot database (E value ,
1 105). Of these, only 16 genes were monotreme lineage-specific duplicates (supplementary file 2, Supplementary
Material online). Only two genes of these possessed signal
peptides (ENSOANG00000001418, ENSOANG00000012443).
To explore the possibility that unique toxin genes exist
in the platypus, it is necessary to distinguish likely toxin
peptides from genes with other functional roles that are
also expressed in the venom gland. We focused on expressed genes that have duplicated in the monotreme lineage and had features of secretory peptides, as we would
expect that these are more likely to have toxin function
(Fry et al. 2009). Forty-two putative secretory genes were
identified using a predictive algorithm implemented in SignalP and GO term analysis based on similarity to human
secretory proteins (table 2). GO terms were used due to the
fragmentation of the platypus assembly, and we recognize
that some secreted genes may have truncated 5# termini,
resulting in missing signal motifs and nondetection using
SignalP.
We expected that the cysteine contents of putative secretory genes would be higher than average compared with
169
MBE
Wong et al. · doi:10.1093/molbev/msr180
Table 2. List of Duplicated Putative Platypus Venom Genes that Are Expressed in an In-Season Venom Gland and Are Likely to be Secreted.
Ensembl ID
Protein degradation
ENSOANG00000001418
ENSOANG00000001214
ENSOANG00000002285
ENSOANG00000013866
ENSOANG00000013077
ENSOANG00000008022
ENSOANG00000005148
Immunity
ENSOANG00000002927
ENSOANG00000002926
ENSOANG00000006911
ENSOANG00000001407
ENSOANG00000001539
ENSOANG00000021638
ENSOANG00000003125
ENSOANG00000007061
ENSOANG00000010729
ENSOANG00000003004
ENSOANG00000005314
ENSOANG00000014191
Cell regulation
ENSOANG00000015762
ENSOANG00000008500
Transport
ENSOANG00000012538
ENSOANG00000011884
ENSOANG00000007590
ENSOANG00000022698
ENSOANG00000008168
ENSOANG00000004183
Metabolism
ENSOANG00000006512
ENSOANG00000006519
ENSOANG00000010188
ENSOANG00000001438
ENSOANG00000000453
ENSOANG00000009571
Protein folding
ENSOANG00000001063
ENSOANG00000000957
RNA processing
ENSOANG00000005365
ENSOANG00000000815
Cell adhesion
ENSOANG00000015748
Other physiological function
ENSOANG00000012443
ENSOANG00000010405
ENSOANG00000011883
ENSOANG00000004710
Symbol
Description
Signal Peptide
CMA1
ERAP1
CUL9
TPSG1
TPSG1
ADAMDEC1
MBTPS1
Chymase 1, mast cell
ERAP1
Cullin 9
Tryptase gamma 1
Tryptase gamma 1
Disintegrin and metalloproteinase domain-like protein
Membrane-bound transcription factor peptidase, site 1
Yes
Yes
Yes
Yes
Trun
Yes
Yes
CCL3
CCL3L1
CXCL2
CFH
B2M
MR1
CD94
CLEC2L
TRDC
HV303
HV303
A2M
Chemokine (C-C motif) ligand 3
Chemokine (C-C motif) ligand 3-like 1
Chemokine (C-X-C motif) ligand 2
Complement factor H
B2M
Major histocompatibility complex, class I related
Killer cell lectin-like receptor subfamily
Killer cell lectin-like receptor subfamily
T-cell receptor delta chain C region
Immunoglobulin heavy chain V region
Immunoglobulin heavy chain V region
Alpha-2-macroglobulin
Yes
Yes
Yes
Trun
Yes
Yesa
Yes
Trun
Yes
Yes
Yes
Trun
AGGF1
NENF
Angiogenic factor with G patch and FHA domains 1
Neuron-derived neurotrophic factor
Trun
Yes
TRPV6
S28A2
S28A2
ACRV1
SLC2A5
YIF1A
TRP cation channel, subfamily V, member 6
Sodium/nucleoside cotransporter 2
Sodium/nucleoside cotransporter 2
Acrosomal vesicle protein 1
Solute carrier family 2
yip1 interacting factor homolog A (Saccharomyces cerevisiae)
Yes
Yes
Yes
Yes
Yes
Yes
BTN2A1
PIGG
MOXD1
HHIPL2
FUT3
CES1
Butyrophilin, subfamily 2, member A1
Phosphatidylinositol glycan anchor biosynthesis, class G
Monooxygenase, DBH-like 1
Hedgehog interacting protein-like 2
Galactoside 3(4)-L-fucosyltransferase
Carboxylesterase 1 (monocyte/macrophage serine esterase 1)
Yes
Yes
Yes
Yes
Yes
Yes
PPIL4
TOR3A
Peptidylprolyl isomerase (cyclophilin)-like 4
Torsin family 3, member A
Yes
Yes
INTS10
RBM28
Integrator complex subunit 10
RNA binding motif protein 28
Yes
Yes
ICAM3
Intercellular adhesion molecule 3
Yes
BTN1A1
EN DOD1
Butyrophilin, subfamily 1, member A1
Endonuclease domain containing 1
Serpin peptidase inhibitor, clade A
(alpha-1 antiproteinase, antitrypsin), member 3
Microfibril-associated glycoprotein 3
Yes
Yes
SERPINA3
MFAP3
Trun
Yes
NOTE.—Gene name and gene descriptions reflect the most similar human gene. ‘‘Trun’’ denotes truncated 5# end.
a
Curated gene prediction. Gene pairs in italics denote duplication pairs where both copies are expressed and possess signal peptides.
intracellular genes due to disulfide bonding which provide
structural stability (Clark 2005). The average ratios of cysteine versus noncysteine residues for secreted and nonsecreted genes were 1:52 (standard error [SE] 5 7) and 1:77
(SE 5 4), respectively.
Twelve of these genes share similarity with known
venom peptides in other species. These included five genes
involved in protein degradation (three serine proteases and
two metalloproteinase), two protease inhibitors, two C-type
170
lectin genes, an isomerase, and one SPRY domain-containing
gene similar to snake Ohanin protein.
Adaptive Selection
Twenty of 159 pairs of duplicates (;12%) show evidence of
positive selection (FDR , 0.05) as identified using the
branch-site model implemented in PAML (test 2) (table 3).
To eliminate errors in our results, we took a series of steps
to ensure accurate inference of gene duplication and
MBE
Gene Duplications and Platypus Venom Evolution · doi:10.1093/molbev/msr180
Table 3. Venom Gland-Expressed Monotreme-Specific Duplicate Pairs under Positive Selection.
Ensembl Gene ID
ENSOANG00000004421
Ensembl Protein ID
ENSOANP00000007003
ENSOANG00000011604
ENSOANG00000004530
ENSOANP00000018383
ENSOANP00000007182
ENSOANG00000003557
ENSOANG00000007061
ENSOANG00000006435
ENSOANG00000006437
ENSOANG00000021791
ENSOANG00000012329
ENSOANG00000019992
ENSOANG00000010184
ENSOANP00000005638
ENSOANP00000011236
ENSOANP00000010269
ENSOANP00000010273
ENSOANP00000027772
ENSOANP00000019509
ENSOANP00000025376
ENSOANP00000016146
ENSOANG00000012538
ENSOANG00000020293
ENSOANG00000009915
ENSOANG00000004696
ENSOANG00000008958
ENSOANG00000007394
ENSOANG00000010340
ENSOANG00000007030
ENSOANG00000019734
ENSOANP00000019830
ENSOANP00000025784
ENSOANP00000015714
ENSOANP00000007439
ENSOANP00000014266
ENSOANP00000011768
ENSOANP00000016404
ENSOANP00000011196
ENSOANP00000025024
Ensembl Family Description
ATP-binding cassette subfamily B
member P glycoprotein
Golgin subfamily A member 1
Disintegrin and metalloproteinase domain
containing precursor ADAM
Ring finger 112 zinc finger 179
C-type lectin domain familya
Interferon-induced GTP binding
Interferon-induced GTP binding
Coiled-coil domaina
Presplicing factor SLU7
Ig domain familya
Interferon induced with tetratricopeptide
repeats IFIT
TRP cation channel subfamily V
Unknown
Ig gamma chain C region
2-Hydroxyacyl coA lyase 1
Polyadenylation factor subunit 2
Methylthioribulose 1 phosphate dehydratase
Ig domain familya
ATP sythase subunit D
Aldo-keto reductase familya
P Value
FDR
2.04 3 1026
8.73 3 1026
3.67 3 1024
7.86 3 1024
1.83 3.25 3
7.92 1.19 3
1.19 3
1.38 3
1.92 3
2.26 3
105
1025
105
1024
1024
1024
1024
1024
1.10 1.46 3
2.85 3.07 3
3.07 3
3.09 3
3.46 3
3.69 3
103
1023
103
1023
1023
1023
1023
1023
3
3
3
3
3
3
3
3
3
3
1024
1024
1024
1023
1023
1023
1023
1023
1023
1023
8.13
8.13
8.96
1.35
2.39
2.39
2.49
2.69
4.11
4.76
3
3
3
3
3
3
3
3
3
3
1023
1023
1023
1022
1022
1022
1022
1022
1022
1022
5.69
5.87
6.97
1.12
2.19
2.26
2.49
2.84
4.58
5.56
NOTE.—Genes in bold are functionally related to known toxins.
a
A self-annotated gene family where Ensembl descriptions were unavailable.
accurate alignment. We filtered pairs based on gene lengths,
sequence divergence, and likelihood of past gene conversions in order to eliminate factors that can lead to incorrect
results. First, we removed gene duplicates that did not show
overlap in multiple alignments with other species. We then
aligned sequences using PRANK incorporating a codon substitution model. This produces alignments with a lower
number of false positive results compared with other multiple aligners (Fletcher and Yang 2010). Due to the fragmentation of the platypus genome assembly, many genes are
truncated. This could lead to incorrect assignment of gene
duplicate, where one gene, split over two or more contigs, is
considered to be two or more genes and treated as gene
duplicates. Truncated genes can also lead to gene prediction
errors, which may elevate the level of positive selection. To
overcome these issues, we only kept putative gene duplicates that overlapped by 60 bp or more when aligned. Short
uninformative alignments were removed from the selection
analysis. Highly divergent sequences can lead to spuriously
high levels of positive selection due to alignment errors, different codon usage, and different nucleotide composition
(Fletcher and Yang 2010). To address this, we filtered gene
pairs based on tree branch lengths. We further removed all
gene pairs that showed signs of gene conversion, as recombination events can also lead to elevated rates of adaptive
selection (Anisimova et al. 2003).
We also performed selection tests on duplication groups
(.2 genes) that are likely to have toxin roles. We identified
strong positive selection acting on the lineage of an expressed
protease gene (P ,, 0.001; ENSOANG00000002931) belonging to a group of four serine protease inhibitors that have
expanded in the monotreme lineage. All genes contained
the Kunitz-type domain, which has been identified in a wide
range of venomous species, including snake, wasp, scorpion,
sea anemones, and coral (Fry et al. 2009). The branch of an
expressed C-type lectin gene (ENSOANG00000019797), within
a monotreme group containing six genes, was also under
positive selection P , 0.05.
Discussion
Our results show that only a small proportion of putative
venom genes are duplicated with a smaller subset displaying
evidence of adaptive evolution. Here, we discuss the role of
gene duplications in platypus venom evolution in secreted
proteases, protease inhibitors, C-type lectins, ion channel
proteins, isomerases, and beta-2-microglobulin (B2M).
Proteases
Whittington et al. (2010) identified 14 serine proteases in
the platypus genome. Only three of the 14 appear to result
from lineage-specific gene duplications, including two
genes which derive from a lineage-specific clade-expansion
of five genes that share similarity with human trypase (fig. 1)
and one gene that is most similar to human chymase. A
platypus peptidase with high similarity to a metalloprotease, endoplasmic reticulum aminopeptidase 1 (ERAP1)
was also identified. Both ERAP1 and chymase are involved
in the regulation of blood pressure in humans. The physiological function of ERAP1 in humans suggests that it may
cause hypotension by cleaving angiotensin II (Hattori et al.
2000). Hence, it is possible that this gene duplicate may
contribute to the hypotensive effect of platypus envenomation (Martin and Tidswell 1895). On the other hand,
chymase, a member of the peptidase S1 family, cleaves inactive angiotensin I into angiotensin II, causing an increase
171
Wong et al. · doi:10.1093/molbev/msr180
MBE
FIG. 1. Evolutionary relationship of serine protease protein sequences. There were 366 positions in the final data set. Bootstrap values greater
than 70% are shown. Tree is drawn to scale based on the number of amino acid substitutions per site. Platypus sequences are in bold;
a diamond denotes genes expressed in the platypus venom gland. Snake venom genes are denoted by a triangle. Labels contain the gene
accession (if no Ensembl accession is available the UniProt one is used).
172
Gene Duplications and Platypus Venom Evolution · doi:10.1093/molbev/msr180
in blood pressure (Wu et al. 2005) which is not consistent
with the physiological effect of platypus envenomation. It is
possible that this hypertensive effect may be neutralized by
the expression of ten serine protease inhibitors, which are
found in venom (Whittington et al. 2010). The presence of
both hypertensive and hypotensive proteins may aid in the
regulation of blood pressure inside the venom gland. Additionally, proteases typically possess a wide range of functions, including smooth muscle contraction, inflammation,
blood clotting, degradation of extracellular matrix, and
pain (Wu et al. 2005). Therefore, it is possible that the gene
duplicate was not selected for its role in hypertension but
rather for other functional properties. In support of this,
a platypus chymase-like duplicate also shares high levels
of homology to a toxin from short-trailed shrew (BLAST;
Accession: Q76B45; E value 5 5 1031), which has kallikrein-like activity and dilatory effects on blood vessel walls
(Kita et al. 2004).
Homology of platypus peptidases to chymase and trypase suggests that platypus venom serine proteases arose
from different ancestral genes to snake peptidases, which
are derived from tissue kallikrein (fig. 1) (Deshimaru et al.
1996). Venom peptidases have also independently evolved
in shrew, reptiles, and insects (Fry et al. 2009).
Protease Inhibitors
Two pairs of serine protease inhibitors have resulted from
lineage-specific gene duplications: SERPINA3 and A2M.
Both genes are associated with acute phase response in humans. Serpins belong to a superfamily of protease inhibitors
that are found in gene clusters and evolve through a series
of tandem gene duplications and control coagulation pathways and immune processes (van Gent et al. 2003). SERPINA3 inhibits chymase, which has also experienced a gene
duplication in monotremes, suggestive of coevolution of
the receptor and ligand pair in venom function. Serpin inhibition of chymase leads to in a drop in blood pressure
(Rubin et al. 1990), consistent with observations in rabbits
that have been injected with platypus venom. The platypus
gene duplicate may also have a protective function: In wasp
venom, serpins are believed to protect host tissues from
toxin effects generated by serine proteases (Colinet et al.
2009). A2M is also a protease inhibitor that is capable of
eliminating proteolytic effects of several crotalid snake venoms (Kress and Catanese 1981). A2M is homologous to C3,
a thiolester-containing protein of the vertebrate complement and homolog of cobra venom factor (Sottrup-Jensen
et al. 1985), suggesting a convergent role of A2M-like domains in venom. This is the first time, to our knowledge,
that two A2M genes have been identified in the venom
component of any species. We hypothesize that it functions in a protective and regulatory capacity.
Ion Channel Protein
A duplication of an ion channel gene similar to TRPV6 was
identified. This member of the transient receptor potential
(TRP) ion channel family plays a role in neurotransmission,
muscle contraction, and exocytosis using calcium as
MBE
a secondary messenger in humans (Clapham et al. 2001).
Given its expression in the venom gland, this receptor
may function in the secretion of venom.
Isomerase
We identified a secreted gene duplicate belonging to the
peptidylprolyl cis-trans isomerase (PPIase) family. PPIases
catalyze the cis-trans isomerization of prolines. Alterations
in the folding of proteins can lead to novel structures and
toxin function. This is supported by the discovery of
a platypus venom L–D-isomerase, which may protect toxins from protease degradation and increase toxin potency
(Torres et al. 2006; Torres et al. 2007). In addition to cellular signaling (Luban et al. 1993) and transcriptional regulation (Rycyzyn and Clevenger 2002), PPIases in cone
snail have been documented to accelerate the folding
of disulfide-rich toxins containing prolines by acting as
chaperons in protein folding (Safavi-Hemami et al.
2010). The PPIase may possess a similar role in platypus
venom.
Beta-2-Microglobulin
An expressed B2M duplicate was indentified. It is the third
most highly expressed gene in the Illumina data set, higher
than any of the characterized venom peptides. Expression
of B2M in venom or venom tissues in other venomous species has not been described, but its high expression level
and lineage-specific duplication warrants further investigation. Due to the absence of reports of B2M in the venom of
other species, we checked whether its high level of expression in platypus was due to the presence of noncoding
RNA. We mapped reads back to the genome and confirmed that its expression is indeed due to the proteincoding gene itself. High levels of B2M in plasma cause
destructive osteoarthropathies in humans, including joint
pain and swelling (Drüeke 1999). These symptoms are a result of B2M forming nonselective voltage independent
ion channels or pores in phospholipid bilayer membranes
(Hirakura and Kagan 2001). Formation of such channels
could compromise membrane potential and allow toxic extracellular components, such as calcium, into the tissue of
an envenomated victim, while causing the loss of crucial
ions including potassium and magnesium (Hirakura and
Kagan 2001). It may also cause cell death through the destruction of intracellular structures and leakage of lysosomal enzymes (Hirakura and Kagan 2001).
C-Type Lectins
Two of 12 C-type lectin lineage-specific duplicates were expressed in the venom gland over our threshold of 100 reads.
In snakes, over 80 C-type lectin venom peptides affecting
hemostatsis and thrombosis have been identified (Ogawa
et al. 2005). Lectins have also been found in the venoms of
caterpillars, blood-sucking insects, and stonefish (Fry et al.
2009). Interestingly, the platypus has experienced a massive
expansion of C-type lectin domain genes (Wong et al.
2009). Due to their similarity to NKC receptors, it was
thought that these genes primarily have immunological
173
MBE
Wong et al. · doi:10.1093/molbev/msr180
function, but evidence of high expression in venom gland
warrants further exploration of a venom function.
Evidence of Adaptive Selection
We do not expect that all genes under positive selection will
have venom-related roles. Rather, the majority appear to be
associated with ubiquitous cellular processes such as transcription and cell metabolism. Among the 20 pairs of gene
duplicates that showed evidence of positive selection, two
pairs belonged to gene families with a known venom function (table 2): a disintegrin and metalloproteinase (ADAM)
family protein (ENSOANP00000007182) and a C-type lectin
protein (ENSOANP00000011236). Genes from the ADAM
gene family are closely related to snake venom metalloproteinases (SVMPs), which have been shown to evolve
through positive selection (Casewell, Wagstaff, Harrison,
Renjifo, et al. 2011). SVMPs can be cleaved into disintegrin
peptides with potent inhibition of platelet aggregation activity but can also have hemorrhagic activities (Jia et al.
1996; Silva et al. 2003; Modesto et al. 2005). Based on
the coagulatory properties of platypus venom, we suggest
that the platypus peptide likely acts in a procoagulatory
capacity. We did not identify a predicted signal peptide
in the platypus ADAM peptide using SignalP. This is not
unexpected, as it is thought that certain venom metalloproteinases possess an internal signal peptide in the form
of a hydrophobic segment which only acts as a signal peptide in the mature peptide following proteolysis of the precursor (Kini 1995). Platypus C-type lectin genes have
previously been shown to evolve under positive selection
(Wong et al. 2009). However, this is the first time that positively selected C-type lectins have been identified in the
venom gland. In snakes, C-type lectins have also evolved
adaptively and have acquired numerous hemostatic and
thrombotic functions (Ogawa et al. 2005). We expect these
platypus lectins to possess coagulatory and hypotensive
roles due to the observed symptoms of envenomation.
We also identified that the TRP channel receptor gene, ENSOANP00000019830, possibly involved in venom secretion,
is also under adaptive selection.
In total, we found that approximately 12% of all platypus
venom-expressed lineage-specific duplicates showed evidence of positive selection. This is likely to be an underestimate for several reasons. There is a limited time period
postduplication when positive selection can be detected,
and signatures of positive selection can be obscured by purifying selection in older gene duplicates (Han et al. 2009).
In addition, positive selection likely acts on a small number
of amino acids for a short period of time after duplication
and therefore is most detectable in younger duplicates
(Zhang 2003). Positive selection is also difficult to detect
in recently diverged genes. The chi-square test, used to test
the significance of the model of positive selection, also gives
conservative results for short and highly similar data sets
(Anisimova et al. 2001). Moreover, it is generally difficult
to detect positive selection between gene pairs due to
the small numbers of sites available for comparison, resulting in little discriminatory power. Despite this, our result is
174
comparable to the amount of positive selection in lineagespecific gene duplicates in four eutherian lineages: human,
macaque, rat, and mouse (Han et al. 2009). There, the authors only examined ‘‘young’’ duplicates, those with ds , 1,
enriching their data set for genes that show evidence of
adaptive selection (Lynch and Conery 2000; Kondrashov
et al. 2002). In comparison, approximately 8% of our duplicates possessed a ds value greater than 1.
Limited Role of Gene Duplications
We did not find instances where all gene members of a large
multigene family expansion were expressed in the venom
gland. We identified expanded lineage-specific clades for
ADAM peptides, serine proteases, Kunitz-type serine proteases inhibitors, and C-type lectin genes, but only some
and not all members of the expanded clade were expressed
in the venom gland. This suggests multiple recruitment
events from the same gene family without further duplications that increase toxin copies. Extensive gene duplications can lead to increase in toxin expression, thereby
increasing the efficiency and potency of the venom (Fry
et al. 2009). Multiple gene copies can increase toxin concentration and hasten venom replacement. Cone snail toxins use this pattern of evolution; where between 50 and 100
thousand individual peptides have been identified across
a number of species (Bingham et al. 2010). There may
be several reasons why we did not observe expansions
of multivenom copies in platypus. Rapid venom replenishment may not be required in the platypus, as only small
quantities of venom are released (70 ll) and the full venom
gland capacity is not fully delivered by the animal when it
spurs (Whittington et al. 2009). Additionally, the role of
platypus venom is believed to be for asserting dominance
over conspecifics in the context of mating rather than to
kill or immobilize prey, as in cone snails and snakes. These
functional differences may account for differences in
strength and direction of selective pressure on venom genes
as it is likely that there is an increased fitness cost in producing lethal venom as opposed to a less potent venom
that incapacitates their mating competitors. However,
we note that lowly expressed genes, incorrect phylogenetic assignments or inaccurate gene predictions may
have affected our ability to detect multiple toxin copies
in the platypus.
Most platypus venom genes do not appear to have
arisen via gene duplication in the monotreme lineage. Only
16 of 107 putative venom genes identified through similarity to known toxin genes are lineage-specific duplicates. If
we assume convergent evolution of venom function, only
15% of platypus toxins have evolved through gene duplications. Consistent with this, nine of 12 serine proteases
identified by Whittington et al. (2010) have one-to-one orthologs with other species; in addition, platypus venom
protein fractions, C-type natriuretic peptides (ENSOANG00000012322) (de Plater et al. 1998) and nerve growth
factor (most likely to be ENSOANG00000004523) (de Plater
et al. 1995) identified previously via proteomic studies do
not appear to have duplicated in the platypus lineage.
Gene Duplications and Platypus Venom Evolution · doi:10.1093/molbev/msr180
We suggest that changes other than gene duplications
play an important part in the recruitment of toxin genes.
There are 12 known splice variants of platypus venom Ctype natriuretic peptide (Kita et al. 2009). Snake venom
C-type natriuretic peptides also undergo extensive molecular processing forming a large number of isoforms from
a single precursor mRNA (Higuchi et al. 1999). This extensive cleavage pattern has also been identified in viper sarafotoxins (Ducancel et al. 1993). Alternatively, spliced
variants have been identified in other venom components
including snake disintegrins (Scarborough et al. 1993) and
Factor X (St. Pierre et al. 2005). Splice variants are considered to be a major contributor to protein diversity (Modrek
and Lee 2002; Keren et al. 2010). Human genes (92–94%)
undergo alternative splicing and are responsible for the 4fold increase in the number of proteins synthesized compared with protein-coding genes (Wang et al. 2008; Keren
et al. 2010). Protein isoforms derived from alternatively
spliced genes can have distinct or even antagonistic functions (Das et al. 2002; Zhang and Mount 2009) and may
possess different tissue specificities (Wang et al. 2008).
Gene mutations can also increase the functional diversity
of a gene, through changes in either the regulatory or the
protein-coding regions. In snakes, CRISP and kallikrein
venom genes are believed to have evolved through mutational modifications (Fry 2005).
It is also possible that we have underestimated the number of putative venom genes that have undergone duplication. This could be due to several factors. First, limited
gene divergence or short gene length could lead to difficulties identifying duplicates. For example, previously gene
trees were not constructed for defensin-like venom peptides (Whittington et al. 2008) due to their high degree of
divergence. Second, although a large number of proteincoding genes were predicted in the platypus assembly
(17,951), it is possible that some venom gene duplicates
were missed. Third, the fragmented assembly is likely to
have led to truncations in some gene predictions and
hence failure to call some duplications.
Gene duplications are only partially responsible for the
adaptation of platypus venom. The availability of whole genome sequence has allowed us to detect gene duplicates
that have led to venom function. Multispecies comparisons
have allowed us to distinguish between duplicates that
stem from speciation events, where venom orthologs exist
in nonvenomous species and true lineage-specific innovations. Comparative studies of other venomous animals
will provide insights on the likely variable role of gene duplication in venom evolution. We anticipate that genomic
strategies will play an increasingly useful role in the understanding of venom evolution as whole genome sequences
from other venomous animals become available.
Conclusion
Our genomic and transcriptomic data analysis shows that
duplications of proteases, their inhibitors, and C-type lectins
play a role in the evolution of platypus venom. However,
MBE
extensive venom gene duplications were not identified
and key venom genes do not appear to have arisen through
gene duplications in the platypus lineage. This points to
other evolutionary mechanisms being important in platypus venom evolution. Our study highlights the usefulness
of genomic data in understanding the evolution of novel
gene functions.
Supplementary Material
Supplementary files 1 and 2 are available at Molecular Biology
and Evolution online (http://www.mbe.oxfordjournals.org/).
Acknowledgments
This work was funded by the University of Sydney and the
Australian Research Council. KB is supported by an ARC
Future Fellowship.
References
Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W,
Lipman DJ. 1997. Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs. Nucleic Acids
Res. 25:3389–3402.
Aminetzach YT, Srouji JR, Kong CY, Hoekstra HE. 2009. Convergent
evolution of novel protein function in shrew and lizard venom.
Curr Biol. 19:1925–1931.
Anisimova M, Bielawski JP, Yang Z. 2001. Accuracy and power of the
likelihood ratio test in detecting adaptive molecular evolution.
Mol Biol Evol. 18:1585–1592.
Anisimova M, Nielsen R, Yang Z. 2003. Effect of recombination on
the accuracy of the likelihood method for detecting positive
selection at amino acid sites. Genetics 164:1229–1236.
Ashburner M, Ball CA, Blake JA, et al. (20 co-authors). 2000. Gene
ontology: tool for the unification of biology. The Gene Ontology
Consortium. Nat Genet. 25:25–29.
Bailey JA, Gu Z, Clark RA, Reinert K, Samonte RV, Schwartz S,
Adams MD, Myers EW, Li PW, Eichler EE. 2002. Recent segmental
duplications in the human genome. Science 297:1003–1007.
Bendtsen JD, Nielsen H, von Heijne G, Brunak S. 2004. Improved
prediction of signal peptides: SignalP 3.0. J Mol Biol. 340:783–795.
Bergthorsson U, Andersson DI, Roth JR. 2007. Ohno’s dilemma:
evolution of new genes under continuous selection. Proc Natl
Acad Sci U S A. 104:17004–17009.
Biardi JE, Chien DC, Coss RG. 2006. California ground squirrel
(Spermophilus beecheyi) defenses against rattlesnake venom
digestive and hemostatic toxins. J Chem Ecol. 32:137–154.
Bingham J-P, Mitsunaga E, Bergeron ZL. 2010. Drugs from
slugs—past, present and future perspectives of [omega]conotoxin research. Chem Biol Interact. 183:1–18.
Calvete JJ, Sanz L, Angulo Y, Lomonte B, Gutiérrez JM. 2009.
Venoms, venomics, antivenomics. FEBS Lett. 583:1736–1743.
Casewell NR, Wagstaff SC, Harrison RA, Renjifo C, Wüster W. 2011a.
Domain loss facilitates accelerated evolution and neofunctionalization of duplicate snake venom metalloproteinase toxin
genes. Mol Biol Evol. [cited 2011 April 4]. PMID 21498373.
Casewell NR, Wagstaff SC, Harrison RA, Wüster W. 2011b. Gene tree
parsimony of multilocus snake venom protein families reveals
species tree conflict as a result of multiple parallel gene loss. Mol
Biol Evol. 28:1157–1172.
Clapham DE, Runnels LW, Strübing C. 2001. The TRP ion channel
family. Nat Rev Neurosci. 2:387–396.
Clark DP. 2005. Molecular biology. Burlington (MA): Academic Press.
175
Wong et al. · doi:10.1093/molbev/msr180
Colinet D, Dubuffet A, Cazes D, Moreau S, Drezen J-M, Poirié M.
2009. A serpin from the parasitoid wasp Leptopilina boulardi
targets the Drosophila phenoloxidase cascade. Dev Comp
Immunol. 33:681–689.
Daltry JC, Wuster W, Thorpe RS. 1996. Diet and snake venom
evolution. Nature 379:537–540.
Das S, Lin J-H, Papamatheakis J, Sykulev Y, Tsichlis PN. 2002.
Differential splicing generates Tvl-1/RFXANK isoforms with
different functions. J Biol Chem. 277:45172–45180.
de Plater G, Martin RL, Milburn PJ. 1995. A pharmacological and
biochemical investigation of the venom from the platypus
(Ornithorhynchus anatinus). Toxicon 33:157–169.
de Plater GM, Martin RL, Milburn PJ. 1998. A C-type natriuretic
peptide from the venom of the platypus (Ornithorhynchus
anatinus): structure and pharmacology. Comp Biochem Physiol C
Pharmacol Toxicol Endocrinol. 120:99–110.
Deshimaru M, Ogawa T, Nakashima K, et al. (11 co-authors). 1996.
Accelerated evolution of crotalinae snake venom gland serine
proteases. FEBS Lett. 397:83–88.
Drüeke TB. 1999. Extraskeletal problems and amyloid. Kidney Int
Suppl. 73:S89–S93.
Ducancel F, Matre V, Dupont C, Lajeunesse E, Wollberg Z, Bdolah A,
Kochva E, Boulain JC, Ménez A. 1993. Cloning and sequence
analysis of cDNAs encoding precursors of sarafotoxins. Evidence
for an unusual ‘‘rosary-type’’ organization. J Biol Chem.
268:3052–3055.
Duda TF, Palumbi SR. 1999. Molecular genetics of ecological
diversification: duplication and rapid evolution of toxin genes of
the venomous gastropod Conus. Proc Natl Acad Sci U S A.
96:6820–6823.
Fenner PJ, Williamson JA, Myers D. 1992. Platypus envenomation—a
painful learning experience. Med J Aust. 157:829–832.
Fletcher W, Yang Z. 2010. The effect of insertions, deletions, and
alignment errors on the branch-site test of positive selection.
Mol Biol Evol. 27:2257–2267.
Flicek P, Amode MR, Barrell D, et al. (52 co-authors). 2010. Ensembl
2011. Nucleic Acids Res. 39:D800–D806.
Fry BG. 2005. From genome to ‘‘venome’’: molecular origin and
evolution of the snake venom proteome inferred from
phylogenetic analysis of toxin sequences and related body
proteins. Genome Res. 15:403–420.
Fry BG, Roelants K, Champagne DE, et al. (12 co-authors). 2009. The
toxicogenomic multiverse: convergent recruitment of proteins
into animal venoms. Annu Rev Genomics Hum Genet. 10:483–511.
Fry BG, Wuster W, Kini RM, Brusic V, Khan A, Venkataraman D,
Rooney AP. 2003. Molecular evolution and phylogeny of elapid
snake venom three-finger toxins. J Mol Evol. 57:110–129.
Grant T. 2007. Platypus, 4th ed. Collingwood: CSIRO Publishing.
Gutiérrez J, Lomonte B. 1995. Phospholipase A2 myotoxins from
Bothrops snake venoms. Toxicon 33:1405–1424.
Han MV, Demuth JP, McGrath CL, Casola C, Hahn MW. 2009.
Adaptive evolution of young gene duplicates in mammals.
Genome Res. 19:859–867.
Hattori A, Kitatani K, Matsumoto H, Miyazawa S, Rogi T, Tsuruoka N,
Mizutani S, Natori Y, Tsujimoto M. 2000. Characterization of
recombinant human adipocyte-derived leucine aminopeptidase
expressed in Chinese hamster ovary cells. J Biochem. 128:755–762.
Higuchi S, Murayama N, Saguchi K, Ohi H, Fujita Y, Camargo ACM,
Ogawa T, Deshimaru M, Ohno M. 1999. Bradykinin-potentiating
peptides and C-type natriuretic peptides from snake venom.
Immunopharmacology 44:129–135.
Hirakura Y, Kagan BL. 2001. Pore formation by beta-2-microglobulin:
a mechanism for the pathogenesis of dialysis associated
amyloidosis. Amyloid 8:94–100.
Hochberg Y, Benjamini Y. 1990. More powerful procedures for
multiple significance testing. Stat Med. 9:811–818.
176
MBE
Jia L-G, Shimokawa K-I, Bjarnason JB, Fox JW. 1996. Snake venom
metalloproteinases: structure, function and relationship to the
ADAMs family of proteins. Toxicon 34:1269–1276.
Juárez P, Comas I, González-Candelas F, Calvete JJ. 2008. Evolution of
snake venom disintegrins by positive Darwinian selection. Mol
Biol Evol. 25:2391–2407.
Jungo F, Bairoch A. 2005. Tox-Prot, the toxin protein annotation program
of the Swiss-Prot protein knowledgebase. Toxicon 45:293–301.
Keren H, Lev-Maor G, Ast G. 2010. Alternative splicing and
evolution: diversification, exon definition and function. Nat Rev
Genet. 11:345–355.
Kini RM. 1995. Do we know the complete sequence of metalloproteinase and nonenzymatic platelet aggregation inhibitor
(disintegrin) precursor proteins? Toxicon 33:1151–1160.
Kini RM. 2005. Serine proteases affecting blood coagulation and
fibrinolysis from snake venoms. Pathophysiol Haemost Thromb.
34:200–204.
Kita M, Black DS, Ohno O, Yamada K, Kigoshi H, Uemura D. 2009.
Duck-billed platypus venom peptides induce Ca2þ influx in
neuroblastoma cells. J Am Chem Soc. 131:18038–18039.
Kita M, Nakamura Y, Okumura Y, Ohdachi SD, Oba Y, Yoshikuni M,
Kido H, Uemura D. 2004. Blarina toxin, a mammalian lethal
venom from the short-tailed shrew Blarina brevicauda: isolation
and characterization. Proc Natl Acad Sci U S A. 101:7542–7547.
Kondrashov FA, Rogozin IB, Wolf YI, Koonin EV. 2002. Selection in the
evolution of gene duplications. Genome Biol. 3:RESEARCH0008.
Kordis D, Gubensek F. 2000. Adaptive evolution of animal toxin
multigene families. Gene 261:43–52.
Kress LF, Catanese JJ. 1981. Effects of human plasma [alpha]2
macroglobulin on the proteolytic activity of snake venoms.
Toxicon 19:501–507.
Li H. Maq: Mapping and Assembly with Qualities. Available from:
http://maq.sourceforge.net/faq.shtml.
Löytynoja A, Goldman N. 2008. Phylogeny-aware gap placement
prevents errors in sequence alignment and evolutionary analysis.
Science 320:1632–1635.
Luban J, Bossolt KL, Franke EK, Kalpana GV, Goff SP. 1993. Human
immunodeficiency virus type 1 Gag protein binds to cyclophilins
A and B. Cell 73:1067–1078.
Lynch M, Conery JS. 2000. The evolutionary fate and consequences
of duplicate genes. Science 290:1151–1155.
Lynch VJ. 2007. Inventing an arsenal: adaptive evolution and
neofunctionalization of snake venom phospholipase A2 genes.
BMC Evol Biol. 7:2.
Mamathambika BS, Bardwell JC. 2008. Disulfide-linked protein
folding pathways. Annu Rev Cell Dev Biol. 24:211–235.
Martin CJ, Tidswell F. 1895. Observations on the femoral gland of
Ornithorhynchus and its secretion; together with an experimental enquiry concerning its supposed toxic action. Proc Linn
Soc N S W. 9:471–500.
Minh Le TN, Reza MA, Swarup S, Kini RM. 2005. Gene duplication of
coagulation factor V and origin of venom prothrombin activator
in Pseudonaja textilis snake. Thromb Haemost. 93:420–429.
Modesto JC, Junqueira-de-Azevedo IL, Neves-Ferreira AG, Fritzen M,
Oliva ML, Ho PL, Perales J, Chudzinski-Tavassi AM. 2005.
Insularinase A, a prothrombin activator from Bothrops insularis
venom, is a metalloprotease derived from a gene encoding
protease and disintegrin domains. Biol Chem. 386:589–600.
Modrek B, Lee C. 2002. A genomic view of alternative splicing. Nat
Genet. 30:13–19.
Murray SA, Mihali TK, Neilan BA. 2011. Extraordinary conservation,
gene loss, and positive selection in the evolution of an ancient
neurotoxin. Mol Biol Evol. 28:1173–1182.
Ogawa T, Chijiwa T, Oda-Ueda N, Ohno M. 2005. Molecular
diversity and accelerated evolution of C-type lectin-like proteins
from snake venom. Toxicon 45:1–14.
Gene Duplications and Platypus Venom Evolution · doi:10.1093/molbev/msr180
Ohno S. 1970. Evolution by gene duplication. New York: Springer.
Poran NS, Coss RG, Benjamini E. 1987. Resistance of California
ground squirrels (Spermophilus beecheyi) to the venom of the
northern Pacific rattlesnake (Crotalus viridis oreganus): a study
of adaptive variation. Toxicon 25:767–777.
Reza MA, Minh Le TN, Swarup S, Manjunatha Kini R. 2006. Molecular
evolution caught in action: gene duplication and evolution of
molecular isoforms of prothrombin activators in Pseudonaja
textilis (brown snake). J Thromb Haemost. 4:1346–1353.
Roelants K, Fry BG, Norman JA, Clynen E, Schoofs L, Bossuyt F. 2010.
Identical skin toxins by convergent molecular adaptation in
frogs. Curr Biol. 20:125–130.
Rubin H, Wang ZM, Nickbarg EB, McLarney S, Naidoo N,
Schoenberger OL, Johnson JL, Cooperman BS. 1990. Cloning,
expression, purification, and biological activity of recombinant
native and variant human alpha 1-antichymotrypsins. J Biol
Chem. 265:1199–1207.
Rycyzyn MA, Clevenger CV. 2002. The intranuclear prolactin/
cyclophilin B complex as a transcriptional inducer. Proc Natl
Acad Sci U S A. 99:6790–6795.
Safavi-Hemami H, Bulaj G, Olivera BM, Williamson NA, Purcell AW.
2010. Identification of Conus peptidylprolyl cis-trans isomerases
(PPIases) and assessment of their role in the oxidative folding of
conotoxins. J Biol Chem. 285:12735–12746.
Sawyer SA. 1999. GENECONV: a computer package for the statistical
detection of gene conversion. Washington University in St.
Louis. Available from: http://www.math.wustl.edu/;sawyer.
Scarborough RM, Rose JW, Naughton MA, Phillips DR, Nannizzi L,
Arfsten A, Campbell AM, Charo IF. 1993. Characterization of the
integrin specificities of disintegrins isolated from American pit
viper venoms. J Biol Chem. 268:1058–1065.
Silva MB, Schattner M, Ramos CRR, et al. (11 co-authors). 2003. A
prothrombin activator from Bothrops erythromelas (jararacada-seca) snake venom: characterization and molecular cloning.
Biochem J. 369:129–139.
Sottrup-Jensen L, Stepanik TM, Kristensen T, et al. (12 co-authors). 1985.
Common evolutionary origin of a 2-macroglobulin and complement components C3 and C4. Proc Natl Acad Sci U S A. 82:9–13.
St. Pierre L, Masci PP, Filippovich I, Sorokina N, Marsh N, Miller DJ,
Lavin MF. 2005. Comparative analysis of prothrombin activators
from the venom of Australian elapids. Mol Biol Evol.
22:1853–1864.
Tamura K, Dudley J, Nei M, Kumar S. 2007. MEGA4: Molecular
Evolutionary Genetics Analysis (MEGA) software version 4.0.
Mol Biol Evol. 24:1596–1599.
Torres AM, Tsampazi M, Kennett EC, Belov K, Geraghty DP,
Bansal PS, Alewood PF, Kuchel PW. 2007. Characterization and
MBE
isolation of L-to-D-amino-acid-residue isomerase from platypus
venom. Amino Acids. 32:63–68.
Torres AM, Tsampazi M, Tsampazi C, Kennett EC, Belov K,
Geraghty DP, Bansal PS, Alewood PF, Kuchel PW. 2006.
Mammalian l-to-d-amino-acid-residue isomerase from platypus
venom. FEBS Lett. 580:1587–1591.
van Gent D, Sharp P, Morgan K, Kalsheker N. 2003. Serpins:
structure, function and molecular evolution. Int J Biochem Cell
Biol. 35:1536–1547.
Vilella AJ, Severin J, Ureta-Vidal A, Heng L, Durbin R, Birney E. 2009.
EnsemblCompara GeneTrees: complete, duplication-aware phylogenetic trees in vertebrates. Genome Res. 19:327–335.
Wang ET, Sandberg R, Luo S, Khrebtukova I, Zhang L, Mayr C,
Kingsmore SF, Schroth GP, Burge CB. 2008. Alternative isoform
regulation in human tissue transcriptomes. Nature 456:
470–476.
Warren WC, Hillier LW, Marshall Graves JA, et al. (104 co-authors).
2008. Genome analysis of the platypus reveals unique signatures
of evolution. Nature 453:175–183.
Whittington CM, Koh JMS, Warren WC, Papenfuss AT, Torres AM,
Kuchel PW, Belov K. 2009. Understanding and utilising
mammalian venom via a platypus venom transcriptome. J
Proteomics. 72:155–164.
Whittington CM, Papenfuss AT, Bansal P, et al. (15 co-authors).
2008. Defensins and the convergent evolution of platypus and
reptile venom genes. Genome Res. 18:986–994.
Whittington CM, Papenfuss AT, Locke DP, et al. (12 co-authors).
2010. Novel venom gene discovery in the platypus. Genome Biol.
11:R95.
Wong ESW, Sanderson CE, Deakin JE, Whittington CM,
Papenfuss AT, Belov K. 2009. Identification of natural killer cell
receptor clusters in the platypus genome reveals an expansion
of C-type lectin genes. Immunogenetics 61:565–579.
Wu Q, Kuo H-C, Deng GG. 2005. Serine proteases and cardiac
function. Biochim Biophys Acta. 1751:82–94.
Yang Z. 2007. PAML 4: Phylogenetic Analysis by Maximum
Likelihood. Mol Biol Evol. 24:1586–1591.
Zhang J. 2003. Evolution by gene duplication: an update. Trends Ecol
Evol. 18:292–298.
Zhang X-N, Mount SM. 2009. Two alternatively spliced isoforms of
the Arabidopsis SR45 protein have distinct roles during normal
plant development. Plant Physiol. 150:1450–1458.
Zhijian C, Feng L, Yingliang W, Xin M, Wenxin L. 2006. Genetic
mechanisms of scorpion venom peptide diversification. Toxicon
47:348–355.
Zupunski V, Kordis D, Gubensek F. 2003. Adaptive evolution in the
snake venom Kunitz/BPTI protein family. FEBS Lett. 547:131–136.
177