The mouse salivary androgen-binding protein (ABP) gene cluster on

The mouse salivary androgen-binding protein (ABP) gene
cluster on Chromosomes 7: characterization and
evolutionary relationships
Christina M. Laukaitis,1,* Stephen R. Dlouhy,2 Robert C. Karn1
1
Department of Biological Sciences, Butler University, 4600 Sunset Avenue, Indianapolis, Indiana 46208, USA
Department of Medical and Molecular Genetics, Indiana University School of Medicine, 975 W. Walnut, Room 130,
Indianapolis, Indiana 46202, USA
2
Received: 18 April 2003 / Accepted: 18 June 2003
Abstract
Mouse salivary androgen-binding protein (ABP) is a
pair of dimers, composed of an alpha subunit disulfide bridged to either a beta or a gamma subunit. It
has been proposed that each subunit is encoded by a
distinct gene: Abpa, Abpb, and Abpg for the alpha,
beta, and gamma subunits, respectively. We report
here the structures and sequences of the genes that
encode these three subunits. Each gene has three
exons separated by two introns. Mouse salivary ABP
is a member of the secretoglobin family, and we
compare the structure of the three ABP subunit
genes to those of 18 other mammalian secretoglobins. We map the three genes as a gene cluster located 10 cM from the centromere of Chromosome
(Chr) 7 and show that Abpa is the closest of the three
to the gene for glucose phosphate isomerase (GPI)
and that Abpg is the closest to the centromere, with
Abpb mapping between them. Abpa is oriented in
the opposite direction to Abpb and Abpg, with its 5¢
end directed toward their 5¢ ends. We compare the
location of these genes with other secretoglobin
genes in the mouse genome and with the known
locations of secretoglobin genes in the human genome and present evidence that strong positive selection has driven the divergence of the coding
regions of Abpb and Abpg since the putative duplication event that created them.
The nucleotide sequence data reported in this paper have been
submitted to GenBank and have been assigned the accession
numbers Abpa: AF144714; Abpb: AY325897; Abpg: 325898.
*Current address: Department of Internal Medicine, St. Vincent
Hospital, 2100 W. 86th St., Indianapolis, IN 46260, USA
Correspondence to: R.C. Karn; E-mail: [email protected]
A gene’s location within the genome reflects its evolutionary history and affects its expression. With
the recent genome sequencing effort, it has become
possible to study the context surrounding individual
genes. In a simplistic model, gene duplications
would initially place genes within close proximity to
one another within the genome where they potentially could be coordinately up- or down-regulated.
Indeed, in the zebrafish, the cluster of evolutionarily
related odorant receptor genes are found in close
proximity, and individual chromosomal clusters are
expressed in the same olfactory epithelial zone
(Kratz et al. 2002), suggesting both gene duplication
and coordinate regulation. More direct evidence for
coordinate regulation comes from the occurrence of
chromosome remodeling of an entire cluster of genes
before expression of those genes. For example, the
genes for cytokines IL-4, IL-5, and IL-13, which are
required for TH2 differentiation, are located in a 180kb region of human Chr 5, and this region of chromatin undergoes remodeling before TH2 differentiation (Agarwal et al. 1999).
We are interested in the murine members of the
secretoglobin protein family (Klug et al., 2000) that
are expressed in salivary glands. This family, called
mouse salivary androgen-binding protein (ABP),
consists of two different dimers: an alpha subunit is
bound to either a beta or a gamma subunit by disulfide bridging before secretion into mouse saliva
(Dlouhy et al. 1987). From biochemical genetic evidence, Dlouhy et al. (1987) postulated that three
distinct genes, Abpa, Abpb, and Abpg, encoded the
three protein subunits, alpha, beta, and gamma, respectively. They obtained three products with distinctly different isoelectric points from cell-free
translation of a submaxillary gland poly(A)-tailed
RNA extract, suggesting that each of the three
DOI: 10.1007/s00335-003-2291-y • Volume 14, 679–691 (2003) • Springer-Verlag New York, Inc. 2003
679
680
C.M. L AUKAITIS
subunits was translated from a different mRNA and
lending additional support to the hypothesis of three
ABP genes. They could not exclude, however, the
possibility that exon shuffling was responsible for
the different mRNAs.
More recently, the cDNAs for these subunits
have been described (Karn and Russell 1993; Karn
and Laukaitis 2003), and the amino acid sequences
predicted from them suggest that these three proteins are evolutionarily related to each other and to
other secretoglobins. Construction of a phylogenetic
tree suggests that Abpa diverged from a common
ancestor of Abpb and Abpg, and thus Karn and
Laukaitis (2003) proposed that the three putative
genes originated from gene duplication events. Previous genetic studies have suggested that Abpa and
Abpg are closely linked and assigned Abpa to Chr 7,
at a location 10 cM from the centromere (Dlouhy
et al. 1987), but the lack of electrophoretic polymorphism of Abpb precluded mapping it with
standard recombination techniques. If the hypothesis of gene duplication is correct, then Abpa, Abpb,
and Abpg might still be found in close proximity to
one another within the genome.
It was our purpose in this study to resolve the
issue of the number of genes encoding mouse salivary ABP subunits and to produce a physical map of
the gene structures and the organization of the
cluster. We demonstrate the existence of three distinct genes encoding the alpha, beta, and gamma
subunit cDNAs and show that these are clustered on
Chr 7 in the mouse. We also characterize the exon
and intron structures and orientations of these three
genes. There is evidence that one or more of these
genes have evolved under strong Darwinian selection (Hwang et al. 1997; Karn and Nachman 1999;
Karn et al. 2002), and we examine the potential evolutionary relationships among these genes and
other secretoglobin genes.
(Sigma Biochemicals, St. Louis, Mo.) as previously
described (Hwang et al. 1997; Karn and Nachman
1999). Long-range PCR was performed with the LA
PCR kit from TaKaRa Biotechnical Supplies (Osaka,
Japan), using recipes recommended by the manufacturer. The reaction included: 1 min at 94C; 14
cycles of 20 s at 98C, 20 s at 50C, and 10 min at
68C; 6 cycles of 20 s at 98C, 20 s at 56C, and 11
min at 68C; 6 cycles of 20 s at 98C, 20 s at 56C,
and 12 min 30 s at 68C; 6 cycles of 20 s at 98C; 20 s
at 56C, and 14 min at 68C; and 5 min at 72C.
Materials and methods
Polymerase chain reaction. Templates for PCR reactions were genomic DNAs from the inbred DBA/2J
and C3H/HeJ mouse strains purchased from The
Jackson Laboratory (Bar Harbor, Maine). Production
of submaxillary gland cDNA libraries has been previously described (Hwang et al. 1997; Karn and
Laukaitis 2003). Primers for amplification of Abpa,
Abpb, and Abpg were designed from their cDNAs
(Karn and Laukaitis 2003) and purchased from Genosys (St. Louis, Mo.). Their positions are shown in
the gene figures, where they are underlined with the
direction of their 3¢ ends indicated with arrows. PCR
was performed by using Sigma Taq Polymerase
ET AL .:
M OUSE ABP G ENE C LUSTER
ON
C HR 7
DNA sequencing. Clean-up of PCR templates
was performed by using QiaQuick PCR clean-up
spin columns according to the manufacturer’s protocol (Qiagen Incorporated, Valencia, Calif.). Automated sequencing was performed by ACGT, Inc.
(Chicago, Ill), and manual sequencing was performed
by using Amersham end-termination sequencing
kits (Amersham-Pharmacia, Uppsala, Sweden; Karn
and Nachman 1999).
Data analysis. DNASIS/PROSIS v. 5.0 (Hitachi
Software, Yokohama, Japan) was used to search and
to manually align DNA sequences isolated from
Blast searches as well as in primer design. The program MEGA, version 2.1 (Kumar et al. 2001), was
used to calculate nucleotide divergences with the
Kimura correction for multiple hits and a transition:
transversion ratio of 2. The distances and their
standard errors were compared by a modification of a
one-tailed t-test with infinite degrees of freedom
(Graur and Li 2000).
Genome mining. High throughput genome sequencing data from mice were searched by using Blast
(Altschul et al. 1990) for sequences corresponding to
cDNA sequences of Abpa, Abpb, and Abpg (Karn and
Laukaitis 2003). BACs identified as having scores
>200 were downloaded for further analysis. The Ensembl program (Hubbard et al. 2002) was used with
the February 2002 mouse genome assembly to visualize the assigned locations of DNA contigs, unsequenced gaps, known genes, and gene predictions
on the proximal end of Chr 7. The synteny-view feature was used to assess homology between secretoglobin-containing regions of mouse Chrs 7, 18, and
19 and human Chrs 5, 11, and 19. BLAT (Kent 2002)
was used to search the February 2002 mouse genome
assembly to determine the chromosomal location of
Abpg. The UCSC genome browser (Karolchik et al.
2003) was used to search for CpG islands, GC content, gene predictions, and sequence repeats in proximal Chr 7 and to obtain DNA sequence of the 80 kb
region spanning Abpg through Abpa.
C.M. L AUKAITIS
ET AL .:
M OUSE ABP G ENE C LUSTER
ON
681
C HR 7
Table 1. Exon and intron structures for 21 secretoglobin chains
Exon 1
Group
A
B
C
D
E
Gene
Total Size
(kb)
Oc Utero
Hs CC10
Mm CC10
Rn CC10
Mm Abpa
Mm lachrymal
Fc Fel dI, chain 1
Hs MGB A
Hs MGB B
Rn PBP C3
Ma HGB B1
Ma HGB B2
Ma HGB A
Rn PBP C1
Rn PBP C2
Hs LPP A
Hs LPP B
Fc Fel dI, chain 2
Mm 11 kDa
Mm Abpb
Mm Abpg
5¢ UTR
Coding
Exon 3
Total Intron 1 Exon 2 Intron 2
3.08
47
55
102
2286
188
4.11
8
55
63
3070
188
4.33
72
45
127
2471
188
3.55
57
55
112
1680
188
1.33
20a
61
81
121
188
1.30
16
64
80
119
188
N/A
?
61
N/A
155
188
3.00
59
56
114
609
188
5.26
64
55
119
1612
188
2.76
54
58
112
1551
188
Only cDNA sequences are avalible at this time
Only cDNA sequences are avalible at this time
Only cDNA sequences are avalible at this time
3.01
36
61
97
1719
191
2.88
33
52
85
1634
203
3.25
47
55
102
1716
188
2.55
54
55
109
926
188
N/A
?
61
N/A
512
182
N/A
?
61
N/A
465
182
1.99
31a
61
92
451
182
1.99
31a
61
92
451
182
Coding
3¢ UTR
Total
30
161
45
45
27
27
27
35
42
39
143
254
127
118
128
123
(Term)
164
169
170
173
137
172
163
155
150
N/A
199
211
209
859
81
805
81
1105
27
1193
27
1251
78
Not determined
993
93
1009
93
61
68
112
111
(Term)
142
149
139
138
N/A
161
161
254
254
327
650
1376
1406
785
761
1101
1888
3128
695
a
Based on putative 5¢ end.
BLAT was used to obtain gene sequences for other
secretoglobins in mouse, rat, and human. cDNA sequences were searched against the November 2002
human, November 2002 rat, or February 2002 mouse
genome assemblies, as appropriate, to determine exon
and intron size and placement. 5¢ and 3¢ UTR assignments were performed according to ATG and stop codon locations in the gene and polyadenlyation signal
sites in the cDNA. GenBank cDNA accession numbers are: mouse ABP alpha (AF144714), mouse lachrymal (AF008595), rat PBP C1 (V01255), rat PBP C2
(V01256), rat PBP C3 (V01263), human lipophilin A
(AJ224171), human lipophilin B (AJ224172), human
mammoglobin A (U33147), human mammoglobin B
(AF071219), human Clara 10 (X13197), mouse Clara 10
(L04503), rat Clara 10 (NM_013051), hamster heteroglobin A (Z66540), hamster heteroblogin B1
(AJ252138), and hamster heteroglobin B2 (AJ252139).
No gene corresponding to an exact match for the mouse
11-kDa cDNA (AF272844) was found; gene structure
was predicted in Table 1 from a similar predicted gene
with 90% homology (ENMUSG00000036047). The
rabbit uteroglobin gene structure (X01423) was previously determined, as were partial gene structures for
feline Fel d I chain 1 (X62477) and chain 2 (X62478).
Results
The structures of the ABP genes (Abpa, Abpb and
Abpg). The three genes encoding the subunits of
mouse salivary ABP are known to be expressed in
mouse submaxillary glands since mRNA encoding
all three subunits can be isolated from the glands
(Dlouhy and Karn 1984; Hwang et al. 1997; Karn and
Laukaitis 2003), and protein composed of the alpha
subunit bound to either the beta or gamma subunits
can be isolated from submaxillary secretions (Karn
and Russell 1993; Karn and Clements 1999). Abpa,
the gene common to all forms of ABP described to
date, has been until now only partially sequenced.
Karn and Russell (1993) reported the sequence of a
partial cDNA that included the coding region of the
entire secreted alpha subunit and the 3¢ UTR. The
intron/exon structure of Abpa was delineated and
the sequence of the second intron produced by Karn
and Nachman (1999). We have completed the sequence of Abpa. Although the 5¢ UTR of the Abpa
cDNA was not known previously, we were able to
design a primer for a putative 5¢ UTR using sequence
in the mouse genome. Using this primer and a reverse primer in the second exon, we were able to get
amplification from a submaxillary cDNA template
(Fig. 1), and we confirmed that this represented expressed Abpa by sequence analysis of the coding
region representing the secreted protein (not shown).
We were also able to determine the signal peptide.
We subsequently utilized these primers for amplification from genomic DNA (Fig. 1) and determined
the sequence of the first intron of Abpa. Thus, our
data complete the structure of Abpa by adding the
682
C.M. L AUKAITIS
ET AL .:
M OUSE ABP G ENE C LUSTER
ON
C HR 7
Fig. 1. Abpa and Abpb amplification products. Panel A: Amplification of the gene corresponding to the complete cDNA of
Abpa. Lane 1, molecular weight standards; lane 2, a 1.0-kb portion of Abpa amplified from genomic DNA with a forward
primer in Exon 2 and a reverse primer in the 3¢ UTR (Karn and Nachman 1999); lane 3, the partial cDNA amplified using
the same primers as in lane 2; lane 4, the full-length (1.33 kb) Abpa amplifed from genomic DNA with a forward primer in
the 5¢ UTR, a reverse primer in the 3’ UTR; and lane 5, the full-length cDNA amplified with the same primers as in lane 4.
Panel B: Sequencing strategy for the Abpb gene (all PCR performed on genomic DNA). Lane 1, molecular weight
standards; lane 2, the 0.6-kb 5¢ end of Abpb amplified with a forward primer in the 5’ UTR and a reverse primer in Exon 2;
lane 3, the 1.3-kb 3¢ end of Abpb amplified with a forward primer in Exon 2 and a reverse primer in the 3¢ UTR; and lane 4,
the ca. 2-kb doublet obtained when PCR is performed on genomic DNA by using a forward primer in the 5¢ UTR and a
reverse primer in the 3’ UTR. Panel C: Amplification of the fragment separating Abpa and Abpb. Lane 1, molecular weight
standards; lane 2, the ca. 8-kb fragment separating Abpa and Abpb, amplified from genomic DNA by using primers at the
5¢ ends of both genes. These were pointed toward the 5¢ ends of the two genes and are diagrammed on Fig. 5.
sequences of the first exon and the first intron
(Fig. 2). The first exon includes a small 5¢ UTR and
the coding sequence for most of the signal peptide,
and this is followed by a very small intron (121 bp).
The cDNA result shown in Fig. 1A, lane 5, confirms
that the additional exon has been correctly assigned.
We also report here the complete structures of the
beta subunit gene, Abpb (Fig. 3) and the gamma
subunit gene, Abpg (Fig. 4). We began with a strategy
similar to that of Karn and Russell (1993). Briefly,
mixed oligomer probes were designed by using a
partial beta subunit protein sequence (Karn and
Russell 1993) and were used to probe a submaxillary
cDNA library for complementary clones which were
sequenced. A cDNA, putatively identified as encoding the beta subunit of ABP including part of its
transcription initiation sequence, was obtained and
sequenced (Karn and Laukaitis 2003). Confirmation
of its identity required comparison with that encoding the alternative subunit, gamma, since the protein
sequences of these were reported to be quite similar
(Karn and Clements 1999). Primers designed from the
5¢ and 3¢ UTR sequences of the cDNA yielded a
doublet of bands of approximately 2 kb when used to
amplify genomic DNA (Fig. 1). To ensure that we
sequenced only a single gene corresponding to the
cDNA, we designed a set of antiparallel but partially
overlapping primers near the center of the coding
region (eventually found to be in Exon 2; Fig. 3) and
used each in conjunction with the appropriate 5¢ or 3¢
UTR primer to amplify fragments of the gene (Fig. 1).
These fragments were sequenced and assembled to
yield a gene approximately 2 kb in length (Fig. 3). The
subsequent release of the mouse genome allowed us
to locate this gene in the database.
Using a similar strategy to that outlined for
Abpb, we designed a mixed oligermeric DNA probe
to re-probe the cDNA library for clones encoding the
gamma subunit. We subsequently obtained a putative gamma cDNA sequence and showed that the 5¢
UTRs, signal peptide coding regions, and the 3¢
UTRs of the beta and gamma cDNAs are almost
identical (Karn and Laukaitis 2003). This similarity
explains why we observed two bands when amplifying genomic DNA with primers located in the 5¢
and 3¢ UTRs. Identifying the gene shown in Fig. 3 as
Abpb was possible when the ‘‘a’’ and ‘‘b’’ alleles of
the beta and gamma cDNAs were sequenced from
C3H/HeJ and DBA/2J mouse submaxillary glands,
respectively (Karn and Laukaitis 2003). Identification required comparing a protein sequence encoded
by both subunits in both strains to predict the electrophoretic variations upon which the original subunit assignment was made (Dlouhy et al. 1987). We
deduced the gamma gene (Abpg; Fig. 4) from the
mouse genome, using its cDNA sequence (Karn and
Laukaitis 2003). The exons were assigned from the
cDNA coding sequences and their 5¢ and 3¢ UTRs,
and the intron boundaries were verified by the consensus GT and AG doublets flanking the ends of the
exons.
The sequences of Abpa, Abpb, and Abpg, which
are given in Figs. 2–4, respectively, show that each is
composed of three exons and two introns, which are
C.M. L AUKAITIS
ET AL .:
M OUSE ABP G ENE C LUSTER
ON
C HR 7
683
Fig. 2. Abpa, the mouse androgen-binding protein alpha subunit gene. The gene consists of three exons and two introns
shown in the diagram above the sequence. The amino acid sequence encoded by the exons are written below them. The
putative primary promoter region, a potential secondary promoter region much farther upstream, and the polyadenylation
signal are boxed. Transcription initiation sequences are in bold. GT/AG splice sequences in the introns are in italics.
Primer sequences are underlined with arrows indicating their 3¢ ends.
very similar in size between Abpb and Abpg. By
contrast, these segments in Abpa have much different sizes and sequences:
1. The coding regions of Exon 1 are the same size
for all three subunits, but the sequence of Abpa is
different from the other two, which are nearly
identical to each other (92% with one insertion/
deletion [indel]). The predicted 5¢ UTR size varies by
11 bp between Abpa (20 bp) and Abpb and Abpg
(31 bp).
2. Intron 1 is much smaller for Abpa (121 bp)
than for the other two (451 bp), and it has essentially
no sequence in common with them. By contrast,
there is significant identity (90%) between Abpb and
Abpg in this intron.
3. Exon 2 is similar in size (188 bp in Abpa; 182
bp in Abpb and Abpg) among the three subunit
genes, but the sequence of Abpa has essentially no
sequence in common with Abpb and Abpg (which
are 72% similar to each other), as reported for their
cDNAs (Karn and Laukaitis 2003).
4. Intron 2 is smaller in Abpa (785 bp; [Karn and
Nachman 1999]) than in Abpb (993 bp) and Abpg
(1009 bp) and shows no significant sequence identity
with them. Intron 2 of Abpb and Abpg, on the other
hand, shows 90% identity; the size differences ac-
684
C.M. L AUKAITIS
ET AL .:
M OUSE ABP G ENE C LUSTER
ON
C HR 7
Fig. 3. Abpb the mouse androgen-binding protein beta subunit gene. The gene consists of three exons and two introns
shown in the diagram above the sequence. Control regions are boxed, and other labels are as described for Fig. 2.
counted for by 5 indels, the largest of which is 14 bp.
The net effect of the 5 indels distributed between the
Abpb and Abpg second introns is a size difference of
only 16 bases.
5. Exon 3 is smaller in Abpa (155 bp) than Exon 3
in Abpb and Abpg (254 bp in each). The region
contains the end of the coding sequence (27 bp in
Abpa and 93 bp in both Abpb and Abpg) and the 3¢
UTR (as determined from cDNA sequences; 146 bp
in Abpa, 170 bp in Abpb, and 171 bp in Abpg). Exon
3 of Abpa shows no significant identity with the
other two. Abpb and Abpg share 61% identity in the
Exon 3 portion of their coding regions and 89% in
their 3¢ UTRs.
Putative control regions of Abpa, Abpb and
Abpg. Figures 2–4 also show putative control regions for the three ABP subunit genes, which were
surmised 1) from the distances from the putative
promoter to the putative transcription initiation site
and 2) from consensus sequences derived from other
1 studies (White 2001; Klug and Cummings 2003).
C.M. L AUKAITIS
ET AL .:
M OUSE ABP G ENE C LUSTER
ON
685
C HR 7
Fig. 4. Abpg, the mouse androgen-binding protein gamma subunit gene. The gene consists of three exons and two introns
shown in the diagram above the sequence. Control regions are boxed and other labels are as described for Fig. 2.
Abpa has a sequence (TATAAAA) consistent with
the primary promoter consensus TATAAT 25 bp
proximal to the transcription initiation site (GAGCACTTGCC). It does not have a CAAT consensus
80 bp proximal to that site, the expected position of
the secondary promoter sequence, but it does have a
CAAT sequence 205 bp proximal to the transcription initiation site. Abpb and Abpg have identical
sequences (TGATAAT) consistent with the primary
promoter consensus 25 bp proximal to the transcription initiation site (GCTCTGAGAAGAGC in
Abpb and GCTTTGAGAAGAGC in Abpg). They
also both have a putative secondary promoter sequence (GCTCAAAGCT in Abpb and GCCCAA
AGGT in Abpg) 80 bp proximal to that site. Using a
UCSC genome browser search tool, we found no
CpG islands or GC-rich regions that are commonly
associated with housekeeping genes in the area immediately proximal to the transcription initiation
sites. These findings are consistent with the fact that
ABP genes are primarily expressed in salivary glands
(Dlouhy et al. 1986).
686
C.M. L AUKAITIS
ET AL .:
M OUSE ABP G ENE C LUSTER
ON
C HR 7
Fig. 5. The structure of the mouse salivary androgen-binding protein (ABP) gene cluster on Chr 7 with the contigs used to
construct the map shown below the chromosome bar. The centromere is to the left (24.5 Mb or ca. 10 cM) of the center of
the ABP cluster. The structures of Abpb and Abpa are shown above the chromosome bar, with their 5¢ ends oriented
towards each other. The primers used in long-range amplification are shown above the Abpb and Abpa gene structures.
The structure of the Abpg gene is shown below the chromosome bar, mapping in the same 5¢-to-3¢ orientation as Abpb and
to the left of Abpb and Abpa. The exons of each gene are shown with shaded coding regions and unshaded noncoding
regions (5¢ and 3¢ UTRs). The sizes of each gene are shown below its diagram. Also shown are the Mus musculus Major
allergen 1 (Mjal 1) gene on the left flank of the ABP cluster, the glucose phosphate isomerase (GPI) gene on the right flank
of the cluster, and five LINE 1 (L1) genes interspersed among the ABP genes.
The physical map of the ABP gene cluster. A
BLAST search of the Mus musculus high-throughput
genome sequencing database for Abpa identified
three BACs containing full-length matches with
greater than 99% homology: RP23-69B1, RP23291N20, and RP23-477F12. A similar BLAST search
for Abpb identified the same three BACs. A search
for Abpg identified only RP23-477F12. All of these
BACs have been mapped to the proximal end of
mouse Chr 7 by Ensembl. The individual Abpa and
Abpb gene sequences are located on separate subclones within RP23-291N20 and RP23-69B1 and on
the same clone within RP23-477F12. This places
Abpa and Abpb approximately 8 kb apart with their
5¢ regions oriented toward each other and their 3¢
ends pointing away from each other (Fig. 5). The
sequence predicted to fall between Abpa and Abpb
corresponds to the ends of the two subclones con-
taining Abpa and Abpb in RP23-291N20 and RP2369B1. We confirmed the orientation of Abpa and
Abpb relative to each other by amplifying an overlapping segment using long-range PCR with genomic
DNA as the template (Fig. 1). We obtained a PCR
product of the expected 8-kb size when we amplified
with primers pointed toward the 5¢ direction of both
Abpa and Abpb (Fig. 5), but with none of the other
possible primer combinations.
Abpg is found on a single clone within RP23477F12. This subclone contains neither Abpa
nor Abpb. The ends of this piece showed no
homology to any of the other subclones identified as
containing Abpa or Abpb. The Ensembl database
maps RP23-477F12 immediately proximal to RP2329N120. The appearance of Abpg in this region is
consistent with the earlier linkage of Abpa and Abpg
with recombination data (Dlouhy et al. 1987) and
Fig. 6. Comparison of nucleotide divergences for different segments of Abpb and Abpg. The diagram at the top of the figure
shows the general structure of the two genes. The nucleotide divergences with standard errors for the exons and introns
are shown below the diagram. Nucleotide divergences for regions within the three exons are shown at the bottom.
C HR 7
687
supports the location of this contig in this segment
of Chr 7.
Comparing the map we derived with the map of
this region assembled by Ensembl shows that Abpb
is closest to the centromere, while Abpa is closest to
GPI (Fig. 5). BLAT was used to search the UCSC
mouse database for Abpa, Abpb, and Abpg, and
confirmed the assignment of Abpa to nucleotides
24,507,099 through 24,508,667 of Chr 7, running in a
5¢ to 3¢ orientation. The Abpa sequence described in
the genome databases differs in the sequence of its
second intron by two bases from our sequence. These
are at positions 912 and 1126, and are emboldened
and underlined on Fig. 2. Current and previous DNA
sequencing (Karn and Nachman 1999) support our
version of Intron 2. It must be noted, however, that
we are working with the C3H/HeJ strain of mice
while genome sequencing used the C57BL/6 strain.
Both strains are Abpaa/Abpaa (Dlouhy and Karn
1983), but minor differences in their genes could exist. A BLAT search of the UCSC database retrieved a
sequence essentially identical to Abpb found on Chr
7 with its 3¢ end at 24,497,716 and its 5¢ end at
24,499,896. This distance confirms our placements,
including relative orientations, of Abpa and Abpb.
One difference exists between the genome sequences
at this location and the sequence we report here for
Abpb. This change is at bp 280 and is emboldened
and underlined. Here, we report a T for the genome
project’s C. The C at this position would change the
amino acid of the signal peptide, substituting a Thr
for an Ile, which is inconsistent with both our current
DNA sequencing data (Karn and Laukaitis 2003) and
with our previous protein sequence (Karn and Russell
1993; Karn and Clements 1999); however, all that
work with the ‘‘a’’ variant of ABP a has been done
with the C3H strain. A BLAT search of the UCSC
database retrieved a sequence 100% identical to the
cDNA of Abpg with its 3¢ end at 24,429,469 and its 5¢
2 end at 24,431,662 on Chr 7. This places the 5¢ end of
Abpg 66.054 kb from the 3¢ end of Abpb in the same
orientation (Fig. 5).
The gene prediction program Genescan embedded within the UCSC genome browser predicts the
existence of Abpb (labeled Chr7_9.8) and the first
two exons of Abpg, although the third exon is
incorrectly predicted (Chr7_9.6). Ensembl gene prediction software does not predict either Abpb or
Abpg. Thus, our current and previous work with
protein and cDNA sequences has enabled us to identify all three genes (Abpa, Abpb, and Abpg) that are
expressed in adult submaxillary glands. A number of
other features are found in close proximity to the
ABP genes on mouse Chr 7. At least four repeats of
4 kb in length flank the ABP gene cluster. These
repeats are 92–99% homologous to a retrotransposon
L1 (AF081104) and are plotted on Fig. 5. A number of
genes with 90% homology to Abpa have also been
found in this area (Waterston et al. 2002) and we have
noted numerous Abpb and Abpg paralogs.
C.M. L AUKAITIS
ET AL .:
M OUSE ABP G ENE C LUSTER
ON
Comparison of Abpa, Abpb and Abpg with
other secretoglobin genes. Karn and Laukaitis
(2003) compared the amino acid sequences of 21 secretoglobin chains from various mammalian species
and derived a molecular phylogeny consistent with
the five groups (A–E; see Table 1) of Ni et al. (2000),
adding the M. musculus lachrymal proteins and expanding group E to include the Mus musculus salivary ABP beta and gamma subunits. We identified
gene sequences encoding the 21 proteins clustered in
groups A–E by Karn and Laukaitis (2003) and compared their intron and exon sizes (Table 1). Overall,
the structures of the 21 secretoglobins we studied are
similar: three exons separated by two introns. The
sizes of these genes differ widely, however, from
about 2 kb to over 5 kb with a mean of 2.96 (S.D.
1.08). Most of the breadth of range is owed to the
highly variable introns. The region least variable in
size is in the middle exon (mean 188 bp, S.D. 4 bp).
Exons 1 and 3 show more size variation (Exon 1:
mean 99 bp, S.D. 17; Exon 3: mean 176 bp, S.D. 45),
owing mostly to variability in the sizes of the 5¢ and
3¢ UTRs, although we realize that variability in 5¢
and 3¢ UTRs may be due at least in part to how
complete these regions were in the reported cDNA
sequences. The introns vary much more widely in
size than the exons (Intron 1: mean 1197 bp, S.D.
892; Intron 2: mean 1137 bp, S.D. 1066).
Comparing the phylogeny developed on the basis
of amino acid sequence similarity of their products
to the exon/intron structures of the genes reveals
interesting relationships. Group A, the uteroglobin/
claras, has the largest average gene size (mean 3767
bp, S.D. 567). Group C comes close to this average
(mean 3670 bp, S.D. 1380) only because of the very
large second intron in Hs MGB B, making it the
largest gene of the 21. Otherwise, group C gene sizes
are closer to the group D average size (mean 2922 bp,
S.D. 290). Groups B (mean 1314 bp, S.D. 23) and E
(mean 1982 bp, S.D. 8), the ones containing the ABP
genes, are much smaller than the other three groups,
although 5¢ and 3¢ UTR sizes cannot presently be
determined for one member of group B and two of
group E. Perhaps the most interesting observation
about group E is that all of the genes have their
middle exon reduced by six nucleotides, three on the
5’ end and three on the 3¢ end. This coincides with a
shift in the cleavage point of the signal peptide such
that the first Cys in the amino acid sequence has
688
C.M. L AUKAITIS
become residue 1 in the secreted forms of group E,
while it is residue 3 in all the other sequences.
Laukaitis 2003). The divergences of the 5¢ and 3¢
UTRs are comparable to the intron values, as is that
for Helix 3. The other three helices have significantly higher values (P < 0.01; one-tailed t-test with
infinite degrees of freedom), suggesting that they
represent the regions of the secreted proteins that
have been under strong positive selection.
Nucleotide divergence of Abpb and Abpg. The
nearly identical structures of Abpb and Abpg suggest
that they are the products of gene duplication. Surprisingly, their first exons, which contain the coding
regions for nearly all of the signal peptides, and their
introns are much more similar than the coding regions (Exons 2 and 3) for their secreted protein products as shown by their nucleotide divergences (Fig. 6).
In our previous work, we showed that positive selection operated on the coding sequences of the secreted proteins by demonstrating relatively high
nonsynonymous/synonymous substitution rate ratios (Ka/Ks) in pairwise comparisons of ABPs from
different taxa (Hwang et al. 1997; Karn and Nachman
1999; Karn and Laukaitis 2003). We had to take a
different approach here because we wanted to compare potential selection on coding regions (exons) and
noncoding regions (introns), and Ka/Ks have no
meaning for noncoding regions. Instead, we calculated distances, in terms of nucleotide divergences. In
most proteins, the divergence in the coding regions is
expected to be less than that in the noncoding regions
because of purifying (negative) selection acting on the
amino acid sequence of the gene product (thus, there
is less constraint expected on noncoding regions, i.e.,
introns [Hughes and Yeager 1997]). Positive selection
for changes in protein sequence, on the other hand,
could create a situation in which the divergence in
the coding region is actually greater than in the
noncoding regions.
We observed no divergence at all in the coding
regions for the signal peptides. The divergences for
the coding regions of Exons 2 and 3, on the other
hand, are significantly higher (P < 0.01; one-tailed
t-test with infinite degrees of freedom) than those for
either intron. This suggests that positive selection
has acted strongly on the coding regions for the secreted proteins, whereas perhaps the coding regions
for the signal peptides have been under purifying selection. An alternative explanation is that gene conversion was responsible for the conservation of Exon
1 (5¢ UTR and signal peptide). We feel that this is a
less likely explanation because it would have had to
produce the same signal peptide for both the Abpb
and Abpg in both the domesticus and musculus
subspecies of M. musculus (Karn and Laukaitis 2003).
Figure 6 also shows a comparison of the nucleotide divergences for the 5¢ and 3¢ UTRs in Exons 1
and 3, respectively, and for the four helical regions
which are found primarily in Exon 2. Exon 3 contains the rearward portion of Helix 4 and a long tail
not found on most other secretoglobins (Karn and
ET AL .:
M OUSE ABP G ENE C LUSTER
ON
C HR 7
Discussion
We have described three distinct genes, Abpa, Abpb,
and Abpg, encoding the three mouse salivary ABP
subunits, alpha, beta, and gamma, respectively, and
we have produced a physical map for them on the
proximal end of Chr 7. The orientation and proximity of Abpb and Abpg, as well as their nearly
identical intron and exon structures, reinforce the
notion that they are the products of relatively recent
gene duplication (Karn and Laukaitis 2003). Abpa is
probably a more distant relative of these two genes,
in light of its reverse orientation and the different
intron and exon structures. The existence of numerous repeats surrounding Abpa and Abpb and
between Abpb and Abpg suggests that the cluster of
ABP genes could have resulted from one or more
gene duplication events. There is preliminary evidence that additional paralogues of Abpa exist
within the mouse genome (Waterston et al. 2002),
and we have noted additional Abpb/Abpg paralogues. The presence of flanking repeats with
homology to LINE elements suggests that retrotransposons have been actively shaping this area of
Chr 7; work addressing this issue is ongoing.
Abpa and Abpb are orientated with their 5¢ ends
nearest each other and their 3¢ ends running in opposite directions. Thus, these genes share a common
8 kb of upstream DNA. This suggests that they
could be coordinately regulated by a common upstream element. While Abpb and Abpg are 66 kb
apart, they are oriented in the same direction. It
seems possible, therefore, that the expression of
Abpg and Abpb could also have elements in common, e.g., both activated by an enhancer between
Abpa and Abpb.
The secretoglobin family of proteins includes
more than a dozen members found in numerous species of mammals (for a review, see Mukherjee and
Chilton 2000). No function is certain for any of the
secretoglobin family members, but the frequency
with which these secretory proteins are expressed in
tissues which connect the animal’s external environment to its internal physiology suggests that they may
play a role in protecting surfaces (Dominguez 1995).
Gene knockout studies of the best-known protein in
this family, uteroglobin, have had mixed results with
C HR 7
689
one strain developing an IgA nephropathy (Zhang et al.
2000) and the other strain exhibiting increased susceptibility to oxidizing damage to the lung (Stripp et al.
2000). Both of these possible functions are supported
by studies of naturally occurring human uteroglobin
polymorphisms (Choi et al. 2000; Menegatti et al.
2002; Narita et al. 2002; Niimi et al. 2002).
Karn and Laukaitis (2003) constructed a phylogenetic tree with five groups of secretoglobins similar to those described by Ni et al. (2000) but adding
the mouse salivary ABP beta and gamma sequences
and mouse lachrymal and 11 kDa proteins (see Table
1 for the members of groups A–E). Many of the human secretoglobin genes, those representing groups
A, C, and D in the phylogenetic tree, have been
mapped to a region on Chr 11q12.2 (Ni et al. 2000).
Previous work, however, had mapped the class C and
D proteins to Chrs 10q23 and 15q12-q13 (Lehrer et al.
2000) and a uteroglobin-related protein (UGRP1) to
5q31-32 (Niimi et al. 2001).
Murine secretoglobin homologues do not appear
to be concentrated into a single gene cluster, and we
propose the existence of at least three clusters of secretoglobin homologues within the mouse genome.
The uteroglobin cluster of human genes was previously identified on human Chr 11 (Ni et al. 2000) in a
region corresponding to mouse Chr 19 where the
mouse uteroglobin and a potential mouse lipophilin
and mammoglobin have been assigned. A homologue
of the human UGRP1 gene on Chr 5 is found on
mouse Chr 18. The region of mouse Chr 7 where
Abpa and Abpg have mapped genetically (Dlouhy et
al. 1987) and where we have now physically located
them is syntenic to the region of human Chr 19
where the glucose phosphate isomerase (GPI) gene
maps. Also within this region, approximately 1 Mb
closer to the centromere, is a gene homologous to the
major allergen-like 1 (Mjal-1) gene. No human salivary protein equivalent to mouse ABP has yet been
described (Dlouhy et al. 1986).
Interestingly, the secretoglobins described as
classes B (Abpa and Mjal-1) and E (Abpb, Abpg, and
the lachrymal 11-kDa protein; Karn and Laukaitis
2003), which have no identified human homologues,
are located together in the mouse Chr 7 cluster. This
is not entirely surprising since other regions of the
murine genome have duplicated since divergence of
mouse and human from a common ancestor 65–75
million years ago (Waterston et al. 2002). The intriguing question this raises is whether the ABP gene
cluster was duplicated in the mouse and remained
because it serves a unique function, or whether it
was present in the common ancestor and eliminated
from the human line because it served no function
there (Smith et al. 1999).
Our comparison of secretoglobin gene structures
(Table 1) reveals interesting relationships in the sizes
of the three exons and two introns of the 18 secretoglobins for which sequence data are available.
Secretoglobin groups A–E were derived by molecular
evolutionary comparisons of amino acid sequences,
and yet the sizes of the exons and introns of their
genes that we present in Table 1 reinforce that evolutionary grouping. Evolutionary considerations lead
one to predict that the exon size would be more
strongly conserved than intron size, since the exons
contain the coding regions, and this prediction is
realized. Variation in gene size among the five
groups of secretoglobins is primarily influenced by
variability in intron size, as shown by the relatively
large standard deviations from the mean intron sizes
in each of the five groups.
Further, one would predict that the first and third
exons would be more variable in size than the middle
exon because they contain the 5¢ and 3¢ UTRs which
are also noncoding sequences, and this prediction too
is realized. In fact, the size of Exon 2 is maintained at
188 bp in all but a few cases, most notably cluster E.
That group consistently lacks the first three nucleotides and the last three nucleotides of Exon 2. The effect on the ABP beta and gamma subunits is to extend
their signal peptides by one amino acid on the C-terminus and to reduce the length of each secreted subunit by one amino acid. Group E also has the cleavage
point of the signal peptide shifted to make the first Cys
in the amino acid sequence residue 1 in the secreted
protein, whereas it is residue 3 in all the other secretoglobins. Whether the change in the size of the
middle exon in the group E genes is related to the shift
in the cleavage point for the signal peptide is unclear.
One of the most interesting features of the ABP
system is the strong Darwinian selection that has been
shown to be operating on one or more of these genes.
Several studies of Abpa have suggested that its evolution has deviated significantly from the predictions
of the Neutral Theory of Molecular Evolution (Kimura
1983), probably as the result of strong directional selection (Hwang et al. 1997; Karn and Nachman 1999;
Karn et al. 2002). It has been proposed that selection on
Abpa has proceeded by selective sweeps in at least two
subspecies of Mus musculus: M. m. domesticus (Karn
and Nachman 1999) and M. m. musculus (Karn et al.
2002). More recently, Karn and Laukaitis (2003) have
suggested that directional selection is probably also
driving the evolution of Abpb and Abpg. This is
probably owing to selection directly on these genes,
rather than hitchhiking, since the strongest evidence
is provided by a relatively high nonsynonymous/synonymous substitution rate ratio (Ka/Ks) as shown for
Abpa (Hwang et al. 1997; Karn and Nachman 1999)
C.M. L AUKAITIS
ET AL .:
M OUSE ABP G ENE C LUSTER
ON
690
C.M. L AUKAITIS
and for Abpb and Abpg (Karn and Laukaitis 2003). In
the case of Abpa, the gene encoding the alpha subunit,
the one common to both dimer forms (A:B and A:G),
there is evidence that the directional selection may
take the form of sexual selection (Talley et al. 2001;
Laukaitis et al. 1997). It remains to be seen whether
sexual selection might also explain the relatively high
Ka/Ks ratios observed for Abpb and Abpg.
The two different ABP dimers, A:B and A:G, have
been shown to bind testosterone and dihydrotestosterone (DHT) with different affinities (Karn and Clements 1999); this has been attributed to differences in
key residues in the hydrophobic pockets of the two
dimers (Karn and Laukaitis 2003). The role of ligand
binding in the function of ABP and, for that matter, the
other secretoglobins, is unknown. It is likely, however, that the evolution of Abpb and Abpg since their
divergence from a common ancestral sequence has
proceeded under the influence of a need to bind different ligands (Karn and Laukaitis 2003). This idea is
supported by the significantly higher nucleotide divergences in the coding region (exons 2 and 3) for the
secreted proteins than in their introns or the coding
region for their signal peptides. We suggest that the
evolutionary pressure that likely fixed the duplication
of the gene ancestral to Abpb and Abpg and resulted in
the expression of the duplicated products is a selection
pressure that is not likely to be influencing the evolution of Abpa because its product is common to both
dimers. An alternative explanation to our view that
higher nucleotide divergences reflect positive selection on exons 2 and 3 of Abpb and Abpg is that mutation rates could vary between exons and introns. It
seems unlikely (though not impossible) that mutation
rates vary in such a nonrandom way, but it is quite
likely that the amount of constraint influencing nucleotide divergence on gene regions is different (Hughes and Yeager 1997). In other words, the high
divergence at exons compared with introns might be
driven by positive selection on exons, unusually
strong negative (purifying) selection on introns, or
some combination of both. Since there is no evidence
that the introns of Abpb and Abpg are under any
constraint different from the introns of other genes
that have been studied thus far, we favor the view that
our data support positive selection on the exons.
Closer analysis, in which the nucleotide divergences of the coding regions of Abpb and Abpg (signal
peptide, the four helices, and the tail) are compared
with each other and with the noncoding regions (5¢
and 3¢ UTRs and the introns), creates an even more
intriguing picture. The lower level of divergence in
the noncoding regions and in two parts of the coding
region, the signal peptide and Helix 3, suggests that
the duplication that produced Abpb and Abpg may
not be very old. This is because the nucleotide divergence of Helix 3 is not significantly different from
the noncoding regions, which are not very high. By
contrast, the values for the other three helices are
significantly higher. It appears that positive selection
drove the divergence of Helices 1, 2, and 4. Helix 3, by
contrast, was not under the same pressure, and the
signal peptide may even have been conserved by purifying selection. Karn and Laukaitis (2003) suggested
that the conservation of Helix 3 between the beta and
gamma subunits may be due to a critical role in the
dimer interface where the ligand is bound (Callebaut
et al. 2000), which may explain why it has been
conserved relative to the other helices which contain
residues that may control the specificity for the ligand bound (Karn and Laukaitis 2003).
The possibility of selection occurring directly on
one or more of the genes in the ABP cluster raises the
interesting question of the effect this may have had
on the evolution of other DNA sequences within and
around the cluster. This begs the production of sequences of a large region of this chromosome segment in other species of Mus as well as in the three
subspecies of Mus musculus. Such a comparison
would shed light on a number of very interesting
issues, such as the effects of selection on the ABP
genes and their surrounding sequences and the
question of how recently duplications producing
ABP gene paralogues occurred.
ET AL .:
M OUSE ABP G ENE C LUSTER
ON
C HR 7
Acknowledgments
The authors thank M. Nachman and R. Emes for
helpful discussions and advice. This work was supported in part by the Holcomb Research Institute,
Butler University, and that support is gratefully acknowledged.
References
3 1. Agarwal S, Viola JP, Rao A (1999) Chromatin-based
regulatory mechanisms governing cytokine gene transcription. J Allergy Clin Immunol 103, 990–999
2. Altschul SF, Gish W, Miller W, Myers DE, Lipman DJ
(1990) Basic local alignment search tool. J Mol Biol 215,
403–410
3. Callebaut I, Poupon A, Bally R, Demaret JP, Housset D
et al. (2000) The uteroglobin fold. Ann N Y Acad Sci
923, 90–112
4. Choi M, Zhang Z, Ten Kate LP, Collee JM, Gerritsen J
et al. (2000) Human uteroglobin gene polymorphisms
and genetic susceptibility to asthma. Ann N Y Acad
Sci 923, 303–306
5. Dlouhy S, Karn R (1983) The tissue source and cellular
control of the apparent size of androgen binding protein
(Abp), a mouse salivary protein whose electrophoretic
C HR 7
691
mobility is under the control of Sex-limited saliva
pattern (Ssp). Biochem Genet 21, 1057–1070
Dlouhy S, Karn R (1984) Multiple gene action determining a mouse salivary protein phenotype: identification of the structural gene for Androgen binding
protein (Abp). Biochem Genet 22, 657–667
Dlouhy R, Nichols W, Karn R (1986) Production of an
antibody to mouse salivary androgen binding protein
(ABP) and its use in identifying a prostate protein produced by a gene distinct from Abp. Biochem Genet 24,
673–743
Dlouhy S, Taylor B, Karn R (1987) The genes for mouse
salivary Androgen-binding protein (ABP) subunits alpha and gamma are located on chromosome 7. Genetics 115, 535–543
Dominguez P (1995) Cloning of a Syrian hamster
cDNA related to sexual dimorphism: establishment of
a new family of proteins. FEBS Lett 376, 257–261
Graur D, Li W-H (2000) Fundamentals of Molecular
Evolution. (Sunderland, Mass.: Sinauer Associates, Inc.)
Hubbard T, Barker D, Birney E, Cameron G, Chen Y et
al. (2002) The Ensembl genome database project. Nucleic Acids Res 30, 38–41
Hughes AL, Yeager M (1997) Comparative evolutionary rates of introns and exons in murine rodents. J Mol
Evol 45, 125–130
Hwang J, Hoffstetter J, Bonhomme F, Karn R (1997) The
microevolution of mouse salivary androgen-binding
protein (ABP) paralleled subspeciation of Mus musculus.
J Hered 88, 93–97
Karn R, Clements M (1999) A comparison of the structures of the alpha:beta and alpha:gamma dimers of mouse
salivary androgen-binding protein (ABP) and their differential steroid binding. Biochem Genet 37, 187–199
Karn R, Laukaitis C (2003) Characterization of two
forms of mouse salivary androgen-binding protein
(ABP): Implications for evolutionary relationships and
ligand-binding function. Biochemistry 42, 7162–7170
Karn R, Nachman M (1999) Reduced nucleotide variability at an androgen-binding protein locus (Abpa) in
house mice: evidence for positive natural selection.
Mol Biol Evol 16, 1192–1197
Karn R, Russell R (1993) The amino acid sequence of
the alpha subunit of mouse salivary androgen-binding
protein (ABP), with a comparison to the partial sequence of the beta subunit and to other ligand-binding
proteins. Biochem Genet 31, 307–319
Karn R, Orth A, Bonhomme F, Boarsot P (2002) The
complex history of a gene proposed to participate in a
sexual isolation mechanism in house mice. Mol Biol
Evol 19, 462–471
Karolchik D, Baertsch R, Diekhans M, Furey TS, Hinrichs A et al. (2003) The UCSC genome browser database. Nucleic Acids Res 31, 51–54
Kent WJ (2002) BLAT–the BLAST-like alignment tool.
Genome Res 12, 656–664
Kimura M (1983) The Neutral Theory of Molecular Evolution. (Cambridge, U.K.: Cambridge University Press)
Klug J, Beier HM, Bernard A, Chilton BS, Fleming TP
et al. (2000) Uteroglobin/Clara cell 10-kDa family of
proteins: nomenclature committee report. Ann N Y
Acad Sci 923, 348–354
Klug WS, Cummings MR (2003) Concepts of Genetics.
Upper Saddle River, NJ; Prentice Hall
Kratz E, Dugas JC, Ngai J (2002) Odorant receptor gene
regulation: implications from genomic organization.
Trends Genet 18, 29–34
Kumar S, Tamura K, Jakobsen I, Nei M (2001) Mega2:
Molecular Evolutionary Genetics Analysis Software.
Bioinformatics 17, 1244–1245
Laukaitis C, Critser E, Karn R (1997) Salivary androgen-binding protein (ABP) mediates sexual isolation in
Mus musculus. Evolution 51, 2000–2005
Lehrer RI, Nguyen T, Zhao C, Ha CX, Glasgow B (2000)
Secretory lipophilins: a tale of two species. Ann N Y
Acad Sci 923, 59–67
Menegatti E, Nardacchione A, Alpa M, Agnes C, Rossi
D et al. (2002) Polymorphism of the uteroglobin gene
in systemic lupus erythematosus and IgA nephropathy.
Lab Invest 82, 543–546
Mukherjee A, Chilton B eds. (2000) The uteroglobin/
Clara cell protein family. Ann N Y Acad Sci 923
Narita I, Saito N, Goto S, Jin S, Omori K et al. (2002)
Role of uteroglobin G38A polymorphism in the progression of IgA nephropathy in Japanese patients. Kidney Int 61, 1853–1858
Ni J, Kalff-Suske M, Gentz R, Schageman J, Beato M
et al. (2000) All human genes of the uteroglobin family
are localized on chromosome 11q12.2 and form a dense
cluster. Ann N Y Acad Sci 923, 25–42
Niimi T, Keck-Waggoner CL, Popescu N, Zhou Y, Levitt
B et al. (2001) UGRP1, a uteroglobin/Clara cell secretory
protein-related protein, is a novel lung-enriched downstream target gene for the T/EBP/NKX2.1 homeodomain
transcription factor. Mol Endocrinol 15, 2021–2036
Niimi T, Munakata M, Keck-Waggoner C, Popescu N,
Levitt B et al. (2002) A polymorphism in the human
UGRP1 gene promoter that regulates transcription is
associated with an increased risk of asthma. Am J Hum
Genet 70, 718–725
Smith NG, Knight R, Hurst L (1999) Vertebrate genome evolution: a slow shuffle or a big bang? Bioessays
21, 697–703
Stripp B, Reynolds S, Plopper C, Boe I, Lund J et al.
(2000) Pulmonary phenotype of CCSP/UG deficient
mice: a consequence of CCSP deficiency or altered
Clara cell function? Ann N Y Acad Sci 923, 202–209
Talley H, Laukaitis C, Karn R (2001) Female preference for male saliva: implications for sexual isolation of Mus musculus subspecies. Evolution 55,
631–634
Waterston R, Lindblad-Toh K, Birney E, Rogers J, Abril J
et al. (2002) Initial sequencing and comparative analysis of the mouse genome. Nature 420, 520–562
White R (2001) Gene Transcription: Mechanisms and
Control. (London, U.K.: Blackwell Science Ltd.)
Zhang Z, Kundu G, Zheng F, Yuan C, Lee E et al. (2000)
Insight into the physiological function(s) of uteroglobin
by gene-knockout and antisense-transgenic approaches. Ann N Y Acad Sci 923, 210–233
C.M. L AUKAITIS
4
6.
7.
8.
9.
10.
11.
12.
13.
14.
15.
16.
17.
18.
19.
20.
5
21.
22.
6
ET AL .:
M OUSE ABP G ENE C LUSTER
ON
23.
24.
25.
26.
27.
28.
29.
7
30.
31.
32.
33.
8
34.
35.
36.
37.
38.
39.