Evolution of the Ikaros Gene Family

The Journal of Immunology
Evolution of the Ikaros Gene Family: Implications for the
Origins of Adaptive Immunity1
Liza B. John,*† Simon Yoong,* and Alister C. Ward2†
Members of the Ikaros family of transcription factors are important for immune system development. Analysis of Ikaros-related
genes from a range of species suggests the Ikaros family derived from a primordial gene, possibly related to the present-day
protostome Hunchback genes. This duplicated before the divergence of urochordates to produce two distinct lineages: one that
generated the Ikaros factor-like (IFL) 2 genes of urochordates/lower vertebrates and the Pegasus genes of higher vertebrates, and
one that generated the IFL1 genes of urochordates/lower vertebrates, the IKFL1 and IKFL2 genes of agnathans and the remaining
four Ikaros members of higher vertebrates. Expansion of the IFL1 lineage most likely occurred via the two intervening rounds of
whole genome duplication. A proposed third whole genome duplication in teleost fish produced a further increase in complexity
of the gene family with additional Pegasus and Eos members. These findings question the use of IFL sequences as evidence for the
existence of adaptive immunity in early chordates and vertebrates. Instead, this study is consistent with a later emergence of
adaptive immunity coincident with the appearance of the definitive lymphoid markers Ikaros, Aiolos, and Helios. The Journal
of Immunology, 2009, 182: 4792– 4799.
T
ranscription factors are often called “master regulators” of
development as they function in mediating the complex
changes in gene expression required to produce the diverse array of cells needed in a multicellular organism (1). A common motif found in transcription factors is the zinc finger motif,
consisting of a zinc atom in specific coordination with four cysteine and/or histidine residues. Such motifs can be involved in
either protein-nucleic acid or protein-protein interactions, with
more than one zinc finger commonly found in the same protein,
often in the form of tandem arrays (2, 3). Zinc finger transcription
factors exhibit broad evolutionary conservation, including hematopoiesis. For example, GATA-related proteins regulate blood cell
development from Drosophila to man (4, 5).
The Ikaros family of zinc finger transcription factors are important regulators of immune system development (6). This family is
characterized by two sets of highly conserved zinc finger motifs: a
set of usually four zinc fingers located at the N terminus involved
in DNA binding, and a set of two zinc fingers at the C terminus,
which enable the Ikaros proteins to dimerize with themselves or
other family members (7). The mammalian Ikaros family consists
of Ikaros, Helios, Aiolos, Eos, and Pegasus. Ikaros and Aiolos are
highly conserved and expressed in lymphoid tissues, such as the
spleen and thymus, as well as PBL (6, 8, 9). Loss or mutation of
Ikaros results in dramatic decreases in T cells, B cells, NK, and
lymphoid-derived dendritic cells, as well as effects on myeloid cell
*Center for Cellular and Molecular Biology, School of Life and Environmental Sciences, Deakin University, Burwood, Victoria, Australia; and †School of Medicine,
Deakin University, Waurn Ponds, Victoria, Australia
Received for publication July 21, 2008. Accepted for publication February 4, 2009.
The costs of publication of this article were defrayed in part by the payment of page
charges. This article must therefore be hereby marked advertisement in accordance
with 18 U.S.C. Section 1734 solely to indicate this fact.
1
This project was supported by a Deakin Central Research Grant. L.B.J. received an
Australian Postgraduate Award.
2
Address correspondence and reprint requests to Prof. Alister C. Ward, School of
Medicine, Deakin University, Waurn Ponds, Victoria, Australia. E-mail address:
[email protected]
Copyright © 2009 by The American Association of Immunologists, Inc. 0022-1767/09/$2.00
www.jimmunol.org/cgi/doi/10.4049/jimmunol.0802372
lineages (9 –11). Similarly, loss of Aiolos results in severe depletion of peritoneal and recirculating B cells and, in contrast, the
spontaneous production of autoantibodies (12). Deregulation of
Ikaros and Aiolos, particularly their splice variants, can result in
cellular hyperproliferation and the development of lymphomas and
leukemias (7, 9, 10, 13). Helios is also involved in T cell differentiation, at least partly via heterodimerization with Ikaros (7).
Constitutive expression of mutant forms of Helios leads to aggressive T cell lymphoma (14, 15). Eos is more broadly expressed,
being found in skeletal muscle, heart, and liver in addition to hematopoietic tissues such as thymus and spleen (16, 17). However,
its function remains relatively poorly understood. Pegasus is distinct from the other Ikaros members, as it contains only three Nterminal zinc fingers, with divergent sequence and DNA binding
specificity. It is also widely expressed but its function remains to
be determined (17).
Ikaros family homologues and related genes have been identified in a diverse range of species (17–26), the presence of
which has been used as evidence of lymphocyte-like gene regulatory programs in these organisms (19, 20, 22, 23, 27). However, despite this, a comprehensive perspective on the origins of
this important family has remained elusive. To further our evolutionary understanding, we have explored the repertoire of
Ikaros family of transcription factors in amphioxus, sea squirt,
lamprey, teleost fish, and other species. Detailed bioinformatics
analysis has allowed the formulation of an evolutionary model
for the genesis of the Ikaros gene family. This suggests that
Ikaros-related genes were derived from a primordial gene, possibly related to the present-day Hunchback genes of protostomes, which underwent duplication before the divergence of
urochordates/lower vertebrates from higher vertebrates. This
yielded a distinct Pegasus-related lineage that has remained
largely intact since that time, and another lineage that ultimately produced Ikaros, Aiolos, Helios, and Eos genes by the
time of divergence of the cartilaginous fish, likely via the two
intervening rounds of whole genome duplication that occurred
during this time. A further increase in complexity of the family
has occurred in teleost fish following a proposed third whole
The Journal of Immunology
4793
FIGURE 1. Sequence alignment of zinc finger motifs of Ikaros-related proteins. N-terminal (upper panel) and C-terminal (lower panel) zinc finger (ZF)
motifs from human (Hs; Homo sapiens), medaka (Ol; Oryzias latipes), lamprey (Pm; Petromyzon marinus and Lf; Lampetra fluviatilis), sea squirt (Ci;
Ciona intestinalis) and amphioxus (Bf; Branchiostoma floridae and Bb; Branchiostoma belcheri tsingtaunese) Ikaros-related proteins (Ikaros, Aiolos,
Helios, Eos, Pegasus, and Ikaros factor-like (IFL/IKFL) 1 and 2) and fruit fly (Dm; Drosophila melanogaster) Hunchback (Hb) were aligned using the
ClustalX program, as indicated. Asterisks (ⴱ) represent identical residues, colons (:) represent highly related residues and dots (.) represent moderately
related residues. Conserved essential cysteine (C) and histidine (H) residues, presumably involved in zinc coordination in each finger, are indicated in gray
with an alternate cysteine in ZF4 and ZF5 bolded. Note the absence of ZF1 in medaka Helios and Ikaros, and ZF4 in Pegasus and IFL2 sequences. The
absence of ZF1 in Ciona is likely due to limits in sequence availability.
genome duplication in this lineage. These findings have
prompted a reassessment of the origins of adaptive immunity.
Materials and Methods
Identification of new Ikaros-related sequences
Whole genomic sequence, high throughput genomic, and expressed sequence tag (EST)3 databases of amphioxus, sea squirt, lamprey, and teleost
fish were searched using known human orthologues of the Ikaros family to
identify Ikaros-related sequences. In addition, cDNA clones for ESTs representing zebrafish genes encoding Pegasus (AY394931-full-length), Eos
(CK397774 and AL927666-partial), and Helios (DN896784-partial) were
obtained and sequenced in full. Raw sequence analysis, contig assembly,
and basic sequence manipulation and were performed using Sequencher
4.1.4 (GeneCodes) or the web-based basic local alignment search tool
(BLAST) 2Sequences (http://www.ncbi.nlm.nih.gov/blast/bl2seq/wblast2.
cgi). GenomeScan (http://genes.mit.edu/genomescan.html) was used to
predict coding exons from sequences derived solely from whole genomic
sequence or genomic scaffolds. The following new sequences were identified; Xenopus tropicalis (western clawed frog) Aiolos (ENSX
ETP00000033186), Danio rerio (zebrafish) Pegasus (BN001214), Eos
(BN001212) and Helios (BN001213); Tetraodon nigroviridis (spotted
green pufferfish) Ikaros (BN001216), Aiolos (BN001289), Pegasus1
(BN001218), Pegasus2 (BN001290), Eos1 (BN001215), Eos2
(BN001291), and Helios (BN001217); Takifugu rubripes (fugu pufferfish)
Eos1 (BN001285), Eos2 (BN001286), Helios (BN001287), and Aiolos
(BN001288); Oryzias latipes (medaka) Eos1 (BN001294), Eos2
(BN001282), Helios (BN001283), and Aiolos (BN001284) and Petromyzon marinus (sea lamprey) Pegasus (Ikaros factor-like (IFL) 2)
(GENSCAN00000129115 and GENSCAN00000133626). The latter was
identified using pre-ensembl contigs (http://pre.ensembl.org/Petromyzon_
marinus/index.html) and the UCSC lamprey genome browser database
3
Abbreviations used in this paper: EST, expressed sequence tag; IFL/IKFL, Ikaros
factor-like; ZF, zinc finger; ISH, in situ hybridization; BLAST, basic local alignment
search tool.
(http://genome.ucsc.edu/cgi-bin/hgGateway?clade ⫽ other&org ⫽
Lamprey&db ⫽). Also retrieved were Ciona intestinalis (sea squirt) IFL1
(BN001219) and IFL2 (BN001220), which have been assigned model identity numbers (144428 and 131901, respectively) in the Ciona intestinalis
database version 1.0 (20). In addition, Branchiostoma floridae (amphioxus)
IFL1 N terminus (BN001292) and C terminus (BN001293), and IFL2 C
terminus (BN001281) were identified from GenBank and JGI Branchiostoma floridae v1.0 genome browser database (http://genome.jgi-psf.org/
Brafl1/Brafl1.home.html) searches. Predicted sequences of zinc finger domains were also retrieved for Ikaros members from Callorhinchus milii
(elephant shark).
Database interrogation and assembly of known sequences
BLASTN (nucleotide query vs nucleotide database), BLASTX (translated
query-protein), and TBLASTN (nucleotide-protein) search algorithms
were used on the National Center for Biotechnology Information (NCBI)
(www.ncbi.nlm.nih.gov/BLAST/) and Ensembl (www.ensembl.org) websites to identify Ikaros-related sequences from representative organisms.
Sequences retrieved from GenBank (http://www.ncbi.nlm.nih.gov) or Ensembl were: Homo sapiens (human) Ikaros AAP88838, Aiolos AAF13493,
Helios AAF09441, Eos AAG39221, and Pegasus AAG39220, Mus musculus (mouse) Ikaros NP_033604, Aiolos XP_283022, Helios NP_035900,
Eos NP_035902 and Pegasus NP_780324, Gallus gallus (chicken) Ikaros
NP_990419, Aiolos NP_990151, Helios NP_989938 and Pegasus
NP_001026766, Xenopus tropicalis (western clawed frog) Ikaros
NP_001015698, Helios NP_001116930, Eos ENSXETP00000020847 and
Pegasus NP_001017355, Danio rerio (zebrafish) Ikaros AAC61763,
Takifugu rubripes (fugu pufferfish) Ikaros ENSTRUG00000017779, Pegasus1 ENSTRUG00000001038 and Pegasus2 ENSTRUG00000010762,
Oryzias latipes (medaka) Ikaros ENSORLT00000025472, Pegasus1
ENSORLT00000007007 and Pegasus2 ENSORLT00000011938, Raja eglanteria (skate) Ikaros AF163848, Aiolos AF163850, Helios AF163847,
Eos AF163849, Petromyzon marinus (sea lamprey) IKFL1 AAF82350,
Lampetra fluviatilis (river lamprey) IKFL2 AAL67304, Myxine glutinosa
(hagfish) composite IFL (renamed IKFL1) AAP84653, Oikopleura dioica
(urochodate) IFL AAP84655, Strongylocentrotus purpuratus (purple sea
4794
IKAROS FAMILY EVOLUTION
FIGURE 2. Phylogenetic analysis of Ikaros-related proteins. The sequences of a comprehensive set of Ikaros-related proteins were aligned using
ClustalX, and a phylogenetic tree generated using the Neighbor-Joining (N-J) algorithm and visualized with the TreeView program. Branch labels
indicate the robustness of the tree over 100 bootstrap replicates, while the scale bar indicates an evolutionary distance of 0.05 amino acid
substitutions per site. Shading indicates respective Ikaros, Aiolos, Helios, Eos, and Pegasus clusters, while red, blue, and yellow rectangles indicate
potential groupings derived from successive duplication events during evolution. Newly identified proteins are shown in red. Abbreviations: Bf;
Branchiostoma floridae, amphioxus, Bb; Branchiostoma belcheri tsingtaunese, lancelet, Ci; Ciona intestinalis, sea squirt, Dr; Danio rerio, zebrafish,
Gg; Gallus gallus, chicken, Hs; Homo sapiens, human, Lf; Lampetra fluviatilis, river lamprey, Mg; Myxine glutinosa, hagfish, Mm; Mus musculus,
mouse, Od; Oikopleura dioica, tunicate, Ol; Oryzias latipes, medaka, Pm; Petromyzon marinus, sea lamprey, Re; Raja eglanteria, skate, Sp;
Strongylocentrotus purpuratus, purple sea urchin, Tn; Tetraodon nigroviridis, spotted green pufferfish, Tr; Takifugu rubripes, fugu pufferfish, Xt;
Xenopus tropicalis, western clawed frog.
The Journal of Immunology
4795
FIGURE 3. Microsynteny analysis of Ikaros-related gene loci. Schematic representation of sea squirt (Ci; Ciona intestinalis), lamprey (Pm; Petromyzon
marinus and Lf; Lampetra fluviatilis), human (Hs; Homo sapiens), frog (Xt; Xenopus tropicalis), and medaka (Ol; Oryzias latipes) Ikaros-related gene loci.
Orthologous genes are grouped within shaded boxes with species and individual chromosome (C), scaffold (S), and contig derivation indicated. Arrows
represent the relative orientation and position of each gene, color-coded on the basis of gene relatedness with Ikaros-related genes in red. A line with two
dashes as indicated for sea squirt IFL1 represents a large distance in the chromosome, while the zig-zag lines represent different scaffolds. Note that no other
genes are found on the contigs that lamprey IKFL genes are located. Gene abbreviations: ABCA12: ATP-binding cassette, subfamily A, member 12;
ACAD-SB/L: Acyl-Coenzyme A dehydrogenase, short/long branched chain; BARD1: BRCA1 associated RING domain 1; CHAT: choline acetyltransferase; CUZD1: CUB and zona pellucida domain 1; ERBB2/3/4: v-erb-a erythroblastic leukemia viral oncogene homologue 2/3/4; FIGNL1: Fidgetin-like
1; GPR26: G protein-coupled receptor 26; GRB7/10: Growth receptor bound 7/10; HMX3: Homeo box (H6 family) 3; IF-A: Intermediate filament protein
A; IGFBP4: insulin-like growth factor binding protein 4; MLLT6: myeloid/lymphoid or mixed-lineage leukemia translocated to 6; PARG: Poly(ADPribose) glycohydrolase; PA2G4: proliferation-associated 2G4; PSMB3: proteasome subunit, ␤ type 3; PSMD3: proteasome 26S subunit, non-ATPase 3;
RGR: retinal G protein coupled receptor; RPS26: ribosomal protein S26; SPAG16: Sperm associated Ag 16; STAT2: Signal transducer and activator of
transcription protein 2; SUOX: Sulfite oxidase; ZPBP (2): zona pellucida binding protein (2).
urchin) IFL1 CX559654 and Branchiostoma belcheri tsingtaunese (amphioxus) IFL1 ABH05923, together with the Hunchback genes Caenorhabditis elegans (nematode) Hunchback-like AAD16170, Drosophila
melanogaster (fruit fly) Hunchback NP_731267, Helobdella robusta
(leech) Hunchback-like AAY43810, Helobdella triserialis (leech) hbl1
CAA62741 and hbl2 X91395, and Platynereis dumerilii (Dumeril’s clam
worm) Hunchback-like CAJ78430.
Sequence analysis and nomenclature
For phylogenetic analysis, all known and novel protein sequences were
aligned using the ClustalX version 1.83 algorithm and GONNET matrix
model (Freeware) (28). This matrix was used to calculate the alignment
with penalties applied for opening and extending gaps in the alignment. A
phylogenetic tree was created using the bootstrapped Neighbor-Joining algorithm of 100 replicates in PAUP (Freeware). Trees were formatted using
njplot (http://pbil.univ-lyon1.fr/software/njplot.html) and viewed in TreeView (Freeware: http://taxonomy.zoology.gla.ac.uk/rod/treeview.html)
(29). Additional analysis using maximum parsimony (30) and maximum
likelihood (31) algorithms was performed using the PHYLIP version 3.68
(http://evolution.genetics.washington.edu/phylip.html) software package to
confirm phylogenetic topologies. Synteny analysis was performed using
Ensembl (http://www.ensembl.org) and NCBI Map Viewer (http://www.
ncbi.nlm.nih.gov/mapview) using the Ciona intestinalis (JGI v2), Petromyzon marinus (PMAR3), Homo sapiens (NCBI 36/Build 36.2), Xenopus
tropicalis (JGI v4.1) and Oryzias latipes (NIG MEDAKA1) databases.
Analysis of splicing was performed by comparing the genomic scaffold to
the relevant cDNA sequences, applying the GT-AG rule (32).
Results
Identification of Ikaros-related sequences in teleost fish,
lamprey, sea squirt, and amphioxus
Exhaustive bioinformatic analysis of zebrafish genomic databases
revealed the presence of single homologues for Helios, Eos, and
Pegasus, as well as the previously described Ikaros (33), but not
the related Aiolos (Fig. 2 and data not shown). The presence of
corresponding ESTs verified that all were expressed. A similar
analysis of the medaka, fugu, and green spotted pufferfish genomes
revealed an identical complement of Ikaros and Helios genes (Figs.
1 and 2), but identified duplications of Pegasus and Eos genes
(Figs. 1 and 2 and supplemental Fig. 2).4 Divergent Aiolos genes
were also identified in these species (Fig. 1 and 2). Predicted
genomic sequences retrieved from elephant shark also showed
high level of identity to zinc finger domains of Ikaros members
(supplementary Fig. 1). Bioinformatic analysis of the lamprey genome identified, in addition to the previously discovered IKFL1
and IKFL2 genes, a Pegasus-like gene, which grouped with the
IFL2 subset (Fig. 1 and 2). In sea squirt, two Ikaros-related (IFL)
4
The online version of this article contains supplemental material.
4796
IKAROS FAMILY EVOLUTION
FIGURE 4. Comparison of splicing patterns for the Ikaros-related
genes. Schematic representation of
the splicing pattern for Ikaros-related
genes; Ikaros (I), Aiolos (A), Helios
(H), Eos (E), and Pegasus (P) of the
indicated organisms. Splice sites are
indicated by open rounded triangles
and exons indicated by black boxes.
Zinc fingers are numbered and denoted by gray boxes. The dotted line
indicates limited sequence data available for amphioxus IFL1 and IFL2,
and Ciona and lamprey IFL2. Note
that medaka/fugu Aiolos and zebrafish Eos has an extra splice site
following ZF4, and medaka and zebrafish Helios, together with
medaka Ikaros, lack ZF1.
proteins, designated IFL1 and IFL2, with homology with IFL sequences identified in other urochordates and other early vertebrates
were discovered (Fig. 1 and data not shown). In addition, partial
sequences for the N terminus and C terminus zinc finger motifs of
IFL1 and IFL2 genes in two amphioxus species were also identified (Figs. 1 and 2).
Zinc fingers of Ikaros-related sequences show high conservation
and an alteration to the common C2H2 motif
Alignment of the N-terminal and C-terminal arrays of zinc finger
(ZF) motifs revealed high level of identity and conservation of
each motif between species and especially among the human and
medaka orthologues (Fig. 1). In general Ikaros, Aiolos, Helios, and
Eos proteins aligned more closely with amphioxus and Ciona
IFL1, whereas Pegasus proteins aligned more closely with IFL2
(Fig. 1). Pegasus and IFL2 both lack one of the N-terminal zinc
fingers, which the alignment suggests is ZF4, rather than ZF1 as
suggested by others (17), an assertion supported by phylogenetic
analysis of individual ZF3 and ZF4 sequences (data not shown).
Medaka Ikaros and Helios, in contrast, were missing ZF1. Hunchback showed consistent homology to the Ikaros-related proteins,
particularly in ZF2, ZF3, ZF5, and ZF6 (Fig. 1). Finally, the C2H2
motif was observed in all zinc fingers, except for ZF4 of IFL1 and
Aiolos, as well as Eos1 (medaka), which all possessed a variant
C2HC motif. However, many of the Ikaros sequences have a cysteine residue immediately before the last histidine in the ZF4 and
ZF5 motif, suggesting the IFL1 and Aiolos variants do not necessarily change the motif dramatically.
Phylogenic analysis of Ikaros-related sequences
To investigate potential evolutionary relationships, Ikaros-related
sequences were assembled from a range of different species, along
with the most closely related zinc finger protein from invertebrate
species, Hunchback. Phylogenetic trees were constructed using the
Neighborhood-Joining algorithm from ClustalX alignments of
these sequences (Fig. 2 and supplementary Fig. 2). The robustness
of these trees was confirmed by the relatively high bootstrapping
values for each branch, and by using the parsimony maximum
The Journal of Immunology
4797
FIGURE 5. Evolutionary model for the Ikaros-related gene family. Schematic representation of the evolution of the Ikaros-related gene family. Upper
panel, “X” represents the primordial gene (possibly related to protostome Hunchback indicated by “?”), from which all the Ikaros family members were
derived. Likely duplication events are denoted by red boxes, with those corresponding to known whole genome duplication events, numbered as shown.
The duplication of X produced two precursor lineages before cephalochordate and urochordate divergence; one related to IFL1 and the other related to IFL2,
denoted by blue boxes. Duplication 1 occurred before agnathan divergence producing IKLF1 and IKLF2 precursors in the IFL1 lineage. Duplication 2 gave
rise to the individual Ikaros members from the IKFL precursors; IKFL1 produced Ikaros and Aiolos, and IKFL2 produced Helios and Eos, in cartilaginous
fish and tetrapods. In contrast, Pegasus genes have evolved from the IFL2-related precursor. Duplication 3 was a fish-specific event that produced duplicated
members in medaka and pufferfish, as indicated. Absent genes in zebrafish are denoted by “x”. The zinc finger motifs for each member are indicated as
follows: zinc finger (ZF) 1, blue; ZF2, brown; ZF3, green; ZF4, yellow; ZF5, purple; and ZF6, orange. Lower panel, An evolutionary tree is indicated, along
with proposed whole genome duplications (1/2/3R in red with gray shading). This is not drawn to scale.
likelihood PHYLIP 3.68 algorithm, which produced a similar
branching structure (data not shown).
The trees indicated two distinct clades within the Ikaros family
(red boxes). The first clade consisted of IFL2 sequences from amphioxus and Ciona along with Pegasus sequences from various
higher vertebrate species, with IFL2 being most divergent. The
second clade consisted of IFL/IFL1 sequences from sea urchin,
amphioxus, tunicates, and Ciona, IKFL sequences from agnathans,
as well as Helios, Eos, Ikaros, and Aiolos sequences from a variety
of higher vertebrates. The IFL/IFL1 sequences were clearly divergent from the other sequences, which subsequently split into two
branches, one consisting of Helios and Eos along with IKLF2 sequences and the other of all Ikaros and Aiolos along with IKLF1
sequences (blue boxes). Each of these branches split into individual Ikaros, Aiolos, Helios, and Eos clusters (yellow boxes). Finally, the Hunchback proteins grouped closest to the Ikaros-related
sequences, a consistent result regardless of the outgroup or algorithm used (supplementary Fig. 2 and data not shown).
Synteny analysis of Ikaros family genes in higher vertebrates
Synteny analysis was performed on Ikaros family members from
sea squirt, lamprey, and three representative higher vertebrates:
human, frog, and medaka. Genes with conserved synteny were
observed for each Ikaros member: HMX3, ACAD-SB and CUZD1related genes for Pegasus; ZPBP, FIGNL1-related, GRB10-related
and ERBB1-related genes for Ikaros; IGFBP4, PSMD3-related,
ZPBP2, GRB7-related, ERBB2-related, and MLLT6-related genes
for Aiolos; STAT2-related, SUOX, RSP26-related, ERBB3-related,
and PA2G4-related for Eos, as well as ABCA12-related, BARD1related, ERBB4-related, ACAD-L, and SPAG16-related for Helios
(Fig. 3), confirming the orthologous assignment of frog Aiolos and
Eos, and medaka Ikaros, Aiolos, Helios, and the duplicated Eos
member. However, conserved syntenic relationships were also observed between different Ikaros genes: Ikaros and Aiolos shared
synteny with GRB and ZPBP homologues (Ikaros: GRB10 and
ZPBP; Aiolos: GRB7 and ZPBP2), while Helios and Eos share
synteny with ERBB homologues (Eos: ERBB3; Helios: ERBB4),
confirming close evolutionary derivation of each gene pair. Ikaros
and Aiolos also shares synteny with an ERBB homologue (Ikaros:
ERBB1; Aiolos: ERBB2) highlighting a likely common evolutionary ancestry between the gene pairs and also to Ciona IFL1 which
contains an ERBB gene on its loci. Similarly, Pegasus and Helios
also shared conserved synteny with ACAD homologues, again suggesting a common evolutionary derivation. Synteny analysis of the
duplicated teleost eos and pegasus genes confirmed differential
gene loss between species (supplementary Fig. 3).
Preservation of splice sites within Ikaros-related genes
Comparison of splicing patterns reinforced the presumed evolutionary relationships (Fig. 4). Ikaros, Aiolos, Helios, and Eos have
an identical splicing pattern except for an extra exon in zebrafish
Eos and medaka Aiolos, together with the absence of zinc finger
4798
sequences in teleost Helios. IFL1 and lamprey IKFL genes showed
a similar pattern, except for three additional exons in Ciona. In
contrast, while Pegasus genes in lamprey and higher vertebrates
had their first ZF on a separate exon like the other Ikaros genes, the
remainder lie on a single exon like Hunchback. IFL2 showed similarities to both Pegasus and other Ikaros genes.
Discussion
The Ikaros family of zinc finger proteins represent an important
class of transcriptional regulators, with key roles identified for
some members in adaptive immunity (7). To further our understanding of the evolution of this family, we identified and characterized related genes in teleost fish, agnathans, cephalochordates,
and other species. Detailed bioinformatics analysis of these and
other Ikaros-related sequences has shed new light on the evolutionary origins of this family with implications for understanding
the emergence of adaptive immunity.
Phylogenetic analysis of full-length protein sequences (Fig. 2
and supplemental Fig. 2) highlighted the likely evolutionary relationships between specific IFL genes of urochordates and lower
vertebrates and Ikaros family members of higher verterbrates:
IFL1 genes clustering with Ikaros, Aiolos, Helios, and Eos, and
IFL2 with Pegasus. Within the former cluster, distinct Ikaros/Aiolos and Helios/Eos subclusters were present, typical of a gene family expanded through successive rounds of duplication, with the
agnathan IKFL1 and IKFL2 likely representing derivatives of the
respective intermediates. These relationships were confirmed and
further emphasized by similarities in the sequences of individual
zinc fingers (Fig. 1 and supplementary Fig. 1), arrangements of
respective gene loci (Fig. 3 and supplementary Fig. 3), and splicing
patterns (Fig. 4).
On the basis of these data, we propose a model describing the
evolution of Ikaros-related genes (Fig. 5). This suggests that Ikaros
members evolved from a primordial gene (“X”). This duplicated
before the divergence of cephalochordates and urochordates (34),
producing a lineage represented by present-day IFL2 in urochordates and Pegasus in higher vertebrates, and another lineage represented by present-day IFL/IFL1 in urochordates and lower vertebrates. The latter lineage underwent successive duplications to
produce intermediate precursors, related to present day agnathan
IKFL1 and IKFL2, and from these the Ikaros/Aiolos and Helios/
Eos gene pairs, respectively, by the time of the divergence of cartilaginous fish. Two rounds of whole genome duplications occurred during this time (35, 36), providing a convenient
mechanism for how this occurred. A third round of whole genome
duplications resulted in the further expansion of the Ikaros gene
family by duplications of Pegasus and Eos members in the teleost
lineage, although these appear to have been lost in zebrafish where
gene loss appears to have occurred. Collectively, these data suggest that Pegasus likely retains a more ancient function compared
with other members of the higher vertebrate Ikaros family
member.
The proposed model of Ikaros gene family evolution has implications for understanding the evolution of the immune system. For
example, the presence of IFL/IKFL sequences has been used as
evidence for the existence of a lymphocyte-like gene regulatory
apparatus in urochordates and early vertebrates (19, 20, 22), by
analogy with Ikaros (as well as Aiolos and Helios), which are
largely expressed in a lymphoid-specific manner with lymphoidspecific gene regulatory functions (8, 9, 11, 37). However, the
evolutionary relationships delineated in this study would suggest
that IFL/IKFL genes may not necessarily have immune functions.
Thus, IFL2 is likely to reflect a functional homologue of the more
divergent and broadly expressed Pegasus, for which expression has
IKAROS FAMILY EVOLUTION
been observed in the adult mammalian brain, heart, skeletal muscle, kidney, and liver by Northern blot analysis (17), as well as
specifically in the forebrain, hindbrain, eye, and gut during zebrafish embryogenesis as judged by in situ hybridization (ISH)
(L. B. John and A. C. Ward, unpublished data). As for IFL1, the
amphioxus equivalent was found to be expressed in ovary and gills
by ISH (22), while urochordate IFL1 was strongly expressed in the
oocyte and filter house as judged by ISH (20) and, significantly,
failed to bind the consensus Ikaros site (20). Similarly, lamprey
IKFL genes were found to be expressed in intestinal epithelium
and strongly in the ovary using ISH, with further expression in
liver and gills detected by RT-PCR (19). This collectively suggests
that ascribing an immune-related function to the IFL genes may be
a dangerous assumption, although general expression patterns do
not themselves rule out such a role. With this important caveat, our
model places the appearance of the definitively lymphoid Ikaros,
Aiolos, and Helios proteins coincidentally with the timing of the
“emergence” of adaptive immunity at around 450 million years
ago, as argued by others (35), although this remains controversial.
The protein Hunchback, found in various invertebrate species,
has been previously described as having sequence similarity with
Ikaros, particularly at the C terminus (38, 39). Despite this, a direct
evolutionary relationship has not been considered, in part due to
the divergent function of Hunchback, a “gap” gene playing a crucial role in controlling pattern formation and segmentation during
development (40 – 44), compared with the lymphopoietic function
of Ikaros, Helios, and Aiolos (7). However, because Hunchback
showed the highest homology of any protostome sequence to
Ikaros family members (data not shown), consistently grouping
with them in phylogenetic analysis (supplementary Fig. 2 and data
not shown) with the same topology of C2H2 type zinc finger motifs
(45, 46) that showed reasonable alignment with those of Ikaros
family members (Fig. 1), the possibility of shared evolutionary
origins from primordial gene X should be entertained.
Acknowledgments
We thank Clifford Liongue for very helpful discussions regarding phylogenetic analysis and sequence alignments.
Disclosures
The authors have no financial conflict of interest.
References
1. Latchman, D. S. 1997. Transcription factors: an overview. Int. J. Biochem.Cell.
Biol. 29: 1305–1312.
2. Leon, O., and M. Roth. 2000. Zinc fingers: DNA binding and protein-protein
interactions. Biol. Res. 33: 21–30.
3. Gamsjaeger, R., C. K. Liew, F. E. Loughlin, M. Crossley, and J. P. Mackay.
2007. Sticky fingers: zinc-fingers as protein-recognition motifs. Trends Biochem.
Sci. 32: 63–70.
4. Simpson, R. J. Y., E. D. Cram, R. Czolij, J. M. Matthews, M. Crossley, and
J. P. Mackay. 2003. CCHX zinc finger derivatives retain the ability to bind Zn(II)
and mediate protein-DNA interactions. J. Biol. Chem. 278: 28011–28018.
5. Shannon, M., A. T. Hamilton, L. Gordon, E. Branscomb, and L. Stubbs. 2003.
Differential expansion of zinc-finger transcription factor loci in homologous human and mouse gene clusters. Genome Res. 13: 1097–1110.
6. Cortes, M., E. Wong, J. Koipally, and K. Georgopoulos. 1999. Control of lymphocyte development by the Ikaros gene family. Curr. Opin. Immunol. 11:
167–171.
7. Rebollo, A., and C. Schmitt. 2003. Ikaros, Aiolos and Helios: transcription regulators and lymphoid malignancies. Immunol. Cell Biol. 81: 171–175.
8. Molnar, A., P. Wu, D. A. Largespada, A. Vortkamp, S. Scherer, N. G. Copeland,
N. A. Jenkins, G. Bruns, and K. Georgopoulos. 1996. The Ikaros gene encodes
a family of lymphocyte-restricted zinc finger DNA binding proteins, highly conserved in human and mouse. J. Immunol. 156: 585–592.
9. Schmitt, C., C. Tonnelle, A. Dalloul, C. Chabannon, P. Debre, and A. Rebollo.
2002. Aiolos and Ikaros: regulators of lymphocyte development, homeostasis and
lymphoproliferation. Apoptosis 7: 277–284.
10. Tonnelle, C., B. Calmels, C. Maroc, J. Gabert, and C. Chabannon. 2002. Ikaros
gene expression and leukemia. Leuk. Lymph. 43: 29 –35.
11. Schorpp, M., M. Bialecki, D. Diekhoff, B. Walderich, J. Odenthal,
H. M. Maischein, A. G. Zapata, and T. Boehm. 2006. Conserved functions of
The Journal of Immunology
12.
13.
14.
15.
16.
17.
18.
19.
20.
21.
22.
23.
24.
25.
26.
Ikaros in vertebrate lymphocyte development: genetic evidence for distinct larval
and adult phases of T cell development and two lineages of B cells in zebrafish.
J. Immunol. 177: 2463–2476.
Ghadiri, A., M. Duhamel, A. Fleischer, A. Reimann, F. Dessauge, and
A. Rebollo. 2007. Critical function of Ikaros in controlling Aiolos gene expression. FEBS Letts. 581: 1605–1616.
Winandy, S., P. Wu, and K. Georgopoulos. 1995. A dominant mutation in the
Ikaros gene leads to rapid development of leukemia and lymphoma. Cell 83:
289 –299.
Zhang, Z., C. S. Swindle, J. T. Bates, R. Ko, C. V. Cotta, and C. A. Klug. 2007.
Expression of a non-DNA-binding isoform of Helios induces T-cell lymphoma in
mice. Blood 109: 2190 –2197.
Tabayashi, T., F. Ishimaru, M. Takata, I. Kataoka, K. Nakase, T. Kozuka, and
M. Tanimoto. 2007. Characterization of the short isoform of Helios overexpressed in patients with T-cell malignancies. Cancer Sci. 98: 182–188.
Honma, Y., H. Kiyosawa, T. Mori, A. Oguri, T. Nikaido, K. Kanazawa, M. Tojo,
J. Takeda, Y. Tanno, S. Yokoya, et al. 1999. Eos: a novel member of the Ikaros
gene family expressed predominantly in the developing nervous system. FEBS
Lett. 447: 76 – 80.
Perdomo, J., M. Holmes, B. Chong, and M. Crossley. 2000. Eos and Pegasus, two
members of the Ikaros family of proteins with distinct DNA binding activities.
J. Biol. Chem. 275: 38347–38354.
Haire, R. N., A. L. Miracle, J. P. Rast, and G. W. Litman. 2000. Members of the
Ikaros gene family are present in early representative vertebrates. J. Immunol.
165: 306 –312.
Mayer, W. E., C. O’Huigin, H. Tichy, J. Terzic, and M. Saraga-Babic. 2002.
Identification of two Ikaros-like transcription factors in lamprey. Scand. J. Immunol. 55: 162–170.
Cupit, P. M., J. D. Hansen, A. S. McCarty, G. White, M. Chioda, F. Spada,
S. T. Smale, and C. Cunningham. 2003. Ikaros family members from the agnathan Myxine glutinosa and the urochordate Oikopleura dioica: emergence of an
essential transcription factor for adaptive immunity. J. Immunol. 171:
6006 – 6013.
Kohonen, P., K. P. Nera, and O. Lassila. 2004. Avian Helios and evolution of the
Ikaros family. Scand. J. Immunol. 60: 100 –107.
Huang, G., X. Xie, Y. Han, L. Fan, J. Chen, C. Mou, L. Guo, H. Liu, Q. Zhang,
S. Chen, et al. 2007. The identification of lymphocyte-like cells and lymphoidrelated genes in Amphioxus indicates the twilight for the emergence of adaptive
immune system. PLoS ONE 2: e206.
Hibino, T., M. Loza-Coll, C. Messier, A. J. Majeske, A. H. Cohen,
D. P. Terwilliger, K. M. Buckley, V. Brockton, S. V. Nair, K. Berney, et al. 2006.
The immune gene repertoire encoded in the purple sea urchin genome. Dev. Biol.
300: 349 –365.
Hansen, J., P. Strassburger, and L. Du Pasquier. 1997. Conservation of a master
hematopoietic switch gene during vertebrate evolution: isolation and characterization of Ikaros from teleost and amphibian species. Eur. J. Immunol. 27:
3049 –3058.
Liippo, J., and O. Lassila. 1997. Avian Ikaros gene is expressed early in embryogenesis. Eur. J. Immunol. 27: 1853–1857.
Willett, C. E., H. Kawasaki, C. T. Amemiya, S. Lin, and L. A. Steiner. 2001.
Ikaros expression as a marker for lymphoid progenitors during zebrafish development. Dev. Dyn. 222: 694 – 698.
4799
27. Yu, C., M. Dong, X. Wu, S. Li, S. Huang, J. Su, J. Wei, Y. Shen, C. Mou, X. Xie,
et al. 2005. Genes “waiting” for recruitment by the adaptive immune system: the
insights from Amphioxus. J. Immunol. 174: 3493–3500.
28. Heringa, J. 1999. Two strategies for sequence comparison: profile-preprocessed
and secondary structure-induced multiple alignment. Comput. Chem. 23:
341–364.
29. Page, R. D. 1996. TreeView: an application to display phylogenetic trees on
personal computers. Comput. Appl. Biosci. 12: 357–358.
30. Barnabas, J., M. Goodman, and G. W. Moore. 1972. Descent of mammalian ␣
globin chain sequences investigated by the maximum parsimony method. J. Mol.
Biol. 69: 249 –278.
31. Felsenstein, J. 1981. Evolutionary trees from DNA sequences: a maximum likelihood approach. J. Mol. Evol. 17: 368 –376.
32. Padgett, R. A., P. J. Grabowski, M. M. Konarska, S. Seiler, and P. A. Sharp.
1986. Splicing of messenger RNA precursors. Annu. Rev. Biochem. 55:
1119 –1150.
33. Willett, C. E., A. Cortes, A. Zuasti, and A. G. Zapata. 1999. Early hematopoiesis
and developing lymphoid organs in the zebrafish. Dev. Dyn. 214: 323–336.
34. Blair, J., and S. Hedges. 2005. Molecular phylogeny and divergence times of
deuterostome animals. Mol. Biol. Evol. 22: 2275–2284.
35. Rothenberg, E. V., and R. Pant. 2004. Origins of lymphocyte developmental
programs: transcription factor evidence. Sem. Immunol. 16: 227–238.
36. Steinke, D., S. Hoegg, H. Brinkmann, and A. Meyer. 2006. Three rounds (1R/
2R/3R) of genome duplications and the evolution of the glycolytic pathway in
vertebrates. BMC Biol. 4: 16.
37. Narvi, E., K. P. Nera, P. Terho, L. Mustonen, J. Granberg, and O. Lassila. 2007.
Aiolos controls gene conversion and cell death in DT40 B Cells. Scand. J. Immunol. 65: 503–543.
38. Kehle, J., D. Beuchle, S. Treuheit, B. Christen, J. A. Kennison, M. Bienz, and
J. Muller. 1998. dMi-2, a Hunchback-interacting protein that functions in polycomb repression. Science 282: 1897–1900.
39. Georgopoulos, K., D. D. Moore, and B. Derfler. 1992. Ikaros, an early lymphoidspecific transcription factor and a putative mediator for T cell commitment. Science 258: 808 – 812.
40. Simpson, P. 2002. Evolution of development in closely related species of flies
and worms. Nat. Rev. Genet. 3: 907–907.
41. Lynch, J., and C. Desplan. 2003. “De-evolution” of Drosophila: toward a more
generic mode of axis patterning. Int. J. Dev. Biol. 47: 497–503.
42. Pinnell, J., P. S. Lindeman, S. Colavito, C. Lowe, and R. M. Savage. 2006. The
divergent roles of the segmentation gene hunchback. Integr. Comp. Biol. 46:
519 –532.
43. Savage, R. M., and M. Shankland. 1996. Identification and characterisation of a
hunchback orthologue, Lzf2, and its expression during leech embryogenesis. Dev.
Biol. 175: 205–217.
44. Kerner, P., F. Zelada González, M. Le Gouar, V. Ledent, D. Arendt, and
M. Vervoort. 2006. The expression of a hunchback ortholog in the polychaete
annelid Platynereis dumerilii suggests an ancestral role in mesoderm development and neurogenesis. Dev. Genes Evol. 216: 821– 828.
45. Fay, D., H. Stanley, M. Han, and W. Wood. 1999. A Caenorhabditis elegans
homologue of hunchback is required for late stages of development but not early
embryonic patterning. Dev. Biol. 205: 240 –253.
46. Sommer, R. J., M. Retzlaff, K. Goerlich, K. Sander, and D. Tautz. 1992. Evolutionary conservation pattern of zinc-finger domains of Drosophila segmentation
genes. Proc. Natl. Acad. Sci. USA 89: 10782–10786.