Diversity is demonstrated in class I HLA-A and HLA

J.M. Ellis
S.J. Mack
R.F.G. Leke
I. Quakyi
A.H. Johnson
C.K. Hurley
Key words:
Africa; Cameroon; class I; sequence-based typing
(SBT); human leukocyte antigen (HLA); sequencespecific oligonucleotide probe (SSOP) typing
Acknowledgments:
The authors would like to express their thanks to
Henry Erlich for advice regarding the population
analysis, Noriko Steiner for designing the HLA-A
locus AIN-A, T, and G primers, and Christina
Ginsberg for extracting DNA from the CS series
samples. Noriko Steiner and Carol Kosman
sequenced the alleles from unrelated patient and
registry volunteers possessing B*1403, *4703,
and *4016. This research is supported by
funding from the Office of Naval Research
N00014–94–1–0049 to the C.W. Bill Young
Marrow Donor Recruitment and Research
Program (C.K.H.) and NIAID NIH UO1-AI-35839
(I.A.Q., R.F.G.L., A.H.J.). The views expressed in
this article are those of the authors and do not
reflect the official policy of the Department of
the Navy, the Department of Defense, or the
U.S. Government.
Diversity is demonstrated in class I HLA-A
and HLA-B alleles in Cameroon, Africa:
description of HLA-A*03012, *2612,
*3006 and HLA-B*1403, *4016, *4703
Abstract: To examine the genetic diversity in west Africa, class I HLA-A
and HLA-B alleles of 92 unrelated individuals from two areas in the Cameroon, the capital Yaoundé and the village of Etoa, were identified by
direct automated DNA sequencing of exons 2 and 3 of the HLA-B locus
alleles and sequence-specific oligonucleotide probe (SSOP) and/or sequencing of the HLA-A locus alleles. HLA-A*2301 (18.7%), A*2902 (10.4%),
B*5301 (10.9%), and B*5802 (10.9%) were the most frequently detected
alleles, present in at least 10% of the population. A total of 30 HLA-A locus
and 33 HLA-B locus alleles, including six novel alleles, were detected. The
novel alleles were HLA-A*03012, A*2612, A*3006 and HLA-B*1403, B*4016,
and B*4703. HLA-B*4703 contains a novel amino acid sequence that is a
combination of the first 5 amino acids of the Bw6 epitope and the last 2
residues of the Bw4 epitope. The addition of 6 alleles to the ever-expanding
number of known class I HLA alleles supports our hypothesis that extensive
genetic diversity, including previously undescribed alleles, would be observed in this African population. In the Yaoundé population, the allele frequency distribution at the HLA-A locus is consistent with distributions
indicative of balancing selection. Extensive HLA-A-B haplotypes were observed in this population suggesting that only a fraction of the Cameroon
HLA-A-B haplotype diversity has been observed.
Modern-day Homo sapiens likely arose from a population of individuals who carried the ancestral forms of the major human leukocyte
antigen (HLA) allele families. The polymorphism of the HLA system could have further developed through the evolutionary mechanisms of intraallelic and interallelic conversion, reciprocal recombination, and point mutation, under the influence of positive selective
pressure for alleles that efficiently bind and present pathogenic peptides to the immune system (1). Because it has been postulated that
human life arose in Africa (2), the HLA diversity of modern African
Received 16 July 1999, revised, resubmitted,
accepted for publication 31 July 2000
Copyright c Munksgaard 2000
Tissue Antigens . ISSN 0001-2815
Tissue Antigens 2000: 56: 291–302
Printed in Denmark . All rights reserved
Authors’ affiliations:
J.M. Ellis1,
S.J. Mack2,3,
R.F.G. Leke4,5,
I. Quakyi4,
A.H. Johnson6,
C.K. Hurley1
1
Department of Microbiology
and Immunology,
Georgetown University
Medical Center,
Washington, DC, USA,
2
Department of Human
Genetics, Roche Molecular
Systems, Alameda,
California, USA,
3
Children’s Hospital Oakland
Research Institute,
Oakland, California, USA,
4
Department of Biology,
Georgetown University,
Washington, DC, USA,
5
Faculty of Medicine and
Biomedical Sciences,
University of Yaoundé,
Yaoundé, Cameroon,
6
Department of Pediatrics,
Georgetown University
Medical Center, Washington,
DC, USA
Correspondence to:
Carolyn Katovich Hurley,
Ph.D.
E404 Research Building
Georgetown University
Medical Center
3970 Reservoir Rd NW
Washington DC 20007
USA
Tel: π1 202 687 2157
Fax: π1 202 687 6440
populations may best reflect the HLA diversity of the human population prior to the diversification of our species into different racial
and ethnic groups. Significant genetic diversity has been observed
among different African ethnic groups (3, 4), and Africans may dis-
291
Ellis et al : Class I alleles in Cameroon, Africa
play a genetic diversity which is overall greater than that observed
speaks 24 major languages and countless dialects (5). Due to its
for other racial and ethnic groups.
geography and history, Cameroon provides a prime location to
Although Cameroon is located in west Africa, its vivid and di-
study the diversity of class I HLA alleles in Africa.
verse history has been shaped by numerous waves of human migration, making Cameroon a cross-section of all Africa. Situated on
the southern border of Lake Chad, Cameroon lies midway along the
Sahel Corridor (Fig. 1), a major route along the southern border of
Material and methods
the Sahara Desert. Since approximately 2750 B.C., different ethnic
groups – in particular, the nomadic Fulani of modern Senegal and
Whole blood was obtained from 92 unrelated individuals native to
the east African Shuwa Arabs – have used the Sahel Corridor for
Cameroon, primarily living in the capital city of Yaoundé (DT series
travel between east and west coasts of Africa. In approximately 1
samples, n⫽41) and the village of Etoa (CS series samples, n⫽51).
A.D., the so-called outpouring of the Negro tribes was initiated, in
The 41 samples from Yaoundé were derived from unrelated women
which the Zande and the Bantu tribes migrated across Cameroon.
who gave birth at Central Maternity Hospital. The remaining 51
This Bantu expansion resulted in the southern movement of the
samples were selected as unrelated from 200 samples collected dur-
San and the Pygmy peoples, and continued to shape African history
ing a census of the families living in the village of Etoa. Serologic
for nearly a millenium. In the late 16th century, Portuguese ex-
typing was performed on some of the CS series samples (data not
plorers ‘‘discovered’’ Africa, initiating the arrival of European mer-
shown).
chants and missionaries. Over the course of the 18th and 19th cen-
In the case of the HLA-A locus, exons 2 and 3 and intron 2
turies, so-called Islamic holy wars were mounted by the northern
were amplified with the primers 5AIN1–36 and 3AIN3–62 and high-
Fulani, resulting in migration, intermingling and adsorption of dif-
resolution sequence-specific oligonucleotide probe (SSOP) typing
ferent groups in northern Cameroon. In the 19th and 20th centuries,
was performed as previously described (6, 7) using the probes listed
Cameroon was colonized by Germany and France. Cameroon be-
in Table 1. In instances where the HLA types could not be resolved
came independent from France in 1960 (4), and the modern Ca-
to allele level, direct automated DNA sequencing was performed.
meroonian population represents more than 200 ethnic groups, and
By designing three forward primers that had either an A, T, or G
at the 3ø end (known as AIN1-A, AIN1-T, and AIN1-G, Tables 2
and 3), in conjunction with a conserved reverse primer (3AIN3–62),
samples which contained alleles that differ at nucleotide 75 in the
first intron were separately amplified and sequenced with the dye
terminator method using either an ABI 373 or 377 automated DNA
sequencer (Perkin Elmer/Applied Biosystems, Foster City, CA,
USA). DNA from samples in which both alleles co-amplified with
the same primer set was sequenced using the dye primer method.
Amplified DNA was purified using Microcon-100 columns (Amicon,
Beverly MA, USA) and diluted in 30–100 ml water, according to the
intensity of the amplified band present on an agarose gel stained
with ethidium bromide, prior to the sequencing reaction. Sequence
analysis was performed using the MatchTools software package
(Applied Biosystems).
The HLA-B locus alleles from all samples were characterized by
direct automated DNA sequencing as previously described (9). Alleles were separated into two groups based on their amplification
with two primer sets (Bin1TA-M13F and Bin1CG-M13F), utilizing a
TA/CG dimorphism in intron 1 with a conserved intron 3 reverse
Fig. 1. Map of Africa. The Sahel Corridor is outlined with arrows showing the migration of the Shuwa Arabs and Fulani. The large arrow across
western Africa traces the migration of the Negro tribes. Countries mentioned for allele comparisons are labeled.
292
Tissue Antigens 2000: 56: 291–302
primer (Bin3-M13R). Alleles that were amplified separately were
sequenced with the dye terminator method; alleles co-amplifying
with the same primer set were sequenced concurrently using the
dye primer method.
Ellis et al : Class I alleles in Cameroon, Africa
HLA-A locus SSOP used and their wash temperatures
Probe name
Primer sequences
Sequence (5ø-3ø)
Wash temperature (æC)
AIN1-A(M13F)1
5ø TGT AAA ACG ACG GCC AGT GGG GCG CA[G/A] GAC CCG GGA
1
131R
CGC TCT TGG ACC GCG
50
AIN1-G(M13F)
TGT AAA ACG ACG GCC AGT GGG CGC AGG ACC GGG GG
114R1
GGG TAC CGG CAG GAC
50
AIN1-T(M13F)
TGT AAA ACG ACG GCC AGT GGG [G/A]CG CAG GAC CCG GGT
1
GAC CGG AAC ACA CGG
50
5P2
GGG CGT CGA CGG ACT CAG AAT CTC CCC AGA CGC CGA G
114Q1
TAC CAG CAG GAC GCC
50
ex2R
GCT CTG GTT GTA GTA GCC GC
77S1
GAG AGC CTG CGG ATC
50
ex3F
CCT GGA GAA CGG GAA GGA CAA
76AN1
GAC CGA GCG AAC CTG
50
6N2
GCA GCC TGA GAG TAG CTC CCT
1
1
62RN
62LQ
TGG GAC CTG CAG ACA
50
114EH1
TAT GAA CAG CAC GCC
46
152W1
GCC CGT TGG GCG GAG
50
GAG AGG CCT GAG TAT
50
GCG GCC CGT GTG GCG
56
2
2
56R
1
151R1
73I
1
TCA CAG ATT GAC CGA G
48
142TK1
ACC ACC AAG CAC AAG
50
163R1
GAG GGC CGG TGC GTG
50
156W1
152E
1
GAG CAG TGG AGA GCC
50
GCC CAT GAG GCG GAG
50
149T1
TGG GAG ACG GCC CAT
50
3E207S
GAG GGC ACG TGC GT
48
3E207
GAG GGC GAG TGC GT
48
166DG1
GTG GAC GGG CTC CG
50
M13 sequence is underlined
(8)
Table 2
Alleles amplifying in HLA-A locus primer groups1
AIN-A
AIN-T
AIN-G
A*02
A*29
A*01
A*23
A*31
A*03
A*24
A*32
A*11
A*25
A*33
A*30
A*26
A*36
A*34
A*3204
A*43
161D1
CTG GAT GGC ACG TGC
50
1
GAG CAG CAG AGA GCC
50
150V1
AGG CGG TCC ATG CG
50
142IK1
CAG ATC ACC AAG CGC A
50
62EG1
CGA GGA GAC AGG GAA A
50
62G1
GAC GGG GAG ACA CGG
50
2E162
ACG GAT GTG AAG AAA TAC
50
2E2202
CCC AGG TCC ACT CGG
50
70H
AAG GCC CAC TCA CAG A
50
70Q2
TCT GTG ACT GGG CCT T
50
156L
GGA GCA GTT GAG AGC C
50
74H
ACT CGG TGA GTC TGT G
50
43R
ATC CTC CGG CTC GCG
50
D23
CGG ATC GCG CTC CGC TAC
56
144Q1
AGA TCA CCC AGC GCA
50
A*66
156Q
A*68
A*69
A*80
1
2
Oh et al. (6)
Sequence from non-coding strand.
Table 1
1
Not all alleles tested within family
Table 3
was performed on the amplified template with the same forward
primer (5P2) and a specific reverse primer (ex2R) located in exon 2
that would amplify B*4016, but not B*4701 and B*5802 (the second
HLA-B locus alleles present in individuals CS25 and CS48). A specific forward primer (ex3F) located in exon 3 was used with the reverse primer 6N located in exon 6, to amplify exons 4 and 5 of
B*4016, but not the other HLA-B locus alleles present in individuals
CS25 and CS48. Dye terminator sequencing was performed using
the primers 5P2, ex2R, ex3F, and 6N.
In the case of novel alleles or ambiguous dye primer results,
In addition, exons 1, 4, and 5 of the novel allele HLA-B*4016
amplified DNA was cloned to separate the alleles using the pCRII
were sequenced. To sequence exon 1, a forward primer (5P2) (primer
TA cloning kit (Invitrogen, San Diego, CA, USA). DNA inserts from
sequences are listed in Table 2) located in the 5ø untranslated region
the cloned allele were directly amplified from a lysate containing
was used in conjunction with the reverse primer Bin3-M13R to co-
the bacterial clone heated to 99æC in 50 ml water and sequenced by
amplify both alleles of the HLA-B locus. A second amplification
the dye terminator method. Both strands of DNA preparations from
Tissue Antigens 2000: 56: 291–302
293
Ellis et al : Class I alleles in Cameroon, Africa
multiple polymerase chain reactions (PCR) were sequenced for each
HLA-A alleles and their frequencies (n⫽91)
novel allele.
Number of times observed
The names listed for these sequences have been officially as-
HLA-A allele
Total
DT series
CS series
Total
frequency (%)1
signed by the WHO Nomenclature Committee (10). This follows the
A*0101
2
0
2
1.1
agreed policy that, subject to the conditions stated in the most re-
A*0201/*02092
13
7
6
7.1
cent Nomenclature Report, names will be assigned to new se-
A*0202
15
8
7
8.2
quences as they are identified. Lists of such new names will be
A*0204
1
0
1
0.6
published in the following WHO Nomenclature Report.
A*0205
4
4
0
2.2
Among the novel alleles detected in this study, two of the three
A*0211
1
1
0
0.6
HLA-A locus alleles would not have been detected by common
A*0214
1
0
1
0.6
SSOP methods because the novel sequence substitutions occur at
A*03011
14
8
6
7.7
conserved positions not commonly interrogated by SSOPs. There-
A*030123
fore, there is a possibility that other novel alleles can elude detection
A*2301
by SSOP because their sequence substitutions occur at positions
A*2402/*2409N2
1
0
1
0.5
previously thought to be relatively conserved. An automated se-
A*2601
3
1
2
1.7
quencing approach minimized the number of ‘‘hidden’’ alleles that
A*2612
1
0
1
0.6
would otherwise remain uncharacterized, especially in a population
A*2902
19
6
13
10.4
expected to be diverse.
1
1
0
0.6
34
20
14
18.7
A*3001
10
3
7
5.5
The Ewens-Watterson homozygosity statistic (F) (11, 12) was
A*3002
11
5
6
6.0
used to examine the selective forces influencing allelic diversity of
A*3003
2
0
2
1.1
the Cameroon HLA-A and -B loci. The observed alleleic frequency
A*3004
2
1
1
1.1
distributions were compared to those expected under the neutral
A*3006
1
0
1
0.6
model, in which mutation and random genetic drift are the only
A*3101
2
1
1
1.1
forces altering allele frequencies. The MONTE CARLO program (13,
A*3201
2
2
0
1.1
14) was used to perform the Ewens exact tests and homozygosity
A*3303
5
1
4
2.8
tests on these allele distributions.
A*3402
4
3
1
2.2
2
The exact test of Guo and Thomson (15) was used to evaluate
A*3601
4
0
4
2.2
deviations from expected Hardy-Weinberg genotypic proportions. A
A*6601
11
1
10
6.0
minor deviation from expected Hardy-Weinberg ratios was observed
A*6602
2
1
1
1.1
for the Etoa population at the HLA-B locus. Since these individuals
A*6801
1
1
0
0.6
are from a small population group, it is possible that individuals
A*6802
5
3
2
2.8
are more related than previously thought resulting in this deviation.
A*6901
1
1
0
0.6
A*7401/*74022
9
3
6
5.0
182
82
100
HLA-A-B haplotypes were estimated using standard methods (16,
17).
Total
1
2
Results and discussion
3
HLA-A locus
100
Frequency is defined as the percentage of the number of times the allele was observed in 182
alleles total. Alleles from presumed homozygous samples were counted twice.
Alleles differing outside of exons 2 and 3 were not resolved. A*0209 exists only as a protein
sequence that differs from the protein sequence of A*0201 at one residue in exon 4. The allele
has never been observed a second time.
Novel alleles described in this report are in bold.
Table 4
A total of 30 alleles were present in a sample size of 91 unrelated
individuals. [There was not sufficient DNA to type one of the 92
HLA-A samples.] The most frequently detected alleles were A*2301
(18.7%) and A*2902 (10.4%) (Table 4). HLA-A*0202 (8.2%),
A*2612 and A*3006, which are described in detail below. The allele
A*03011 (7.7%), A*0201 (7.1%), A*6601 (6.0%), A*3002 (6.0%), and
names and Genbank accession numbers for the novel alleles de-
A*3001 (5.5%) were also observed at frequencies greater than 5%.
scribed in this study are listed in Table 5 and their sequences in
Three novel HLA-A alleles were identified, HLA-A*03012,
294
Tissue Antigens 2000: 56: 291–302
Fig. 2.
Ellis et al : Class I alleles in Cameroon, Africa
Table 5
Samples containing novel alleles
Sample
HLA-A
2
HLA-B
Genbank Accession no.1
DT18
A*03012 , A*2301
B*0801, B*3501
AF053128, AF053129
CS3
A*2612, A*2902
B*1503, B*4407
AF065486, AF065487
CS48
A*3006, A*0202
B*4016, B*5802
AF028713, AF028714 (A*3006)
AF027296, AF027297 (B*4016)
CS25
A*0202, A*3004
B*4016, B*4701
AF017022, AF017023
DT16
A*0201, A*2301
B*1403, B*4501
U91330, U91331
DT3
A*2301
B*1403, B*5801
AF015271, AF015272
DT23
A*2301, *3004
B*1403, B*3501
–
DT32
A*0201, A*0202
B*4703, B*44031
AF016842, AF016843
1
2
Novel allele only, corresponds to two exons.
New alleles described in this report are bolded.
Fig. 2. Exon 2 and 3 nucleotide sequences of the three novel
HLA-A alleles are compared to the allele with closest homology.
Numbering begins at the first nucleotide of exon 1 and disregards introns
(18).
Tissue Antigens 2000: 56: 291–302
295
Ellis et al : Class I alleles in Cameroon, Africa
B*5802 (both 10.9%). HLA-B*5301 is the most frequently detected
HLA-B alleles and their frequencies (n⫽92)
allele in other West African populations and has been hypothesized
Number of times observed
HLA-B allele
Total
DT series
CS series
Total frequency (%)1
to be associated with resistance to severe malaria (19). Malaria is
B*07021
11
6
5
6.0
endemic to the region of Cameroon where these samples originated.
B*0801
10
7
3
5.4
HLA-B*3501 (7.1%), *44031 (6.0%), *07021 (6.0%), *4901 (5.4%),
B*1302
5
1
4
2.7
*5801 (5.4%) and *0801 (5.4%) were also frequent in this popula-
B*1401
2
1
1
1.1
tion. Three novel HLA-B alleles were characterized, HLA-B*1403,
B*1402
1
0
1
0.5
*4016, and *4703, which are described in detail below and in Table
B*14032
3
3
0
1.6
5 and Fig. 3.
B*1501
2
0
2
1.1
B*1503
9
5
4
4.9
B*1510
2
1
1
1.1
B*1516
2
2
0
1.1
B*1517
2
1
1
1.1
The Ewens-Watterson homozygosity statistic (F) has been used to
B*1801
5
1
4
2.7
infer the operation of evolutionary forces on allele frequency distri-
B*2703
1
1
0
0.5
butions (20–23). In general, homozygosity values lower than the
B*3501
13
4
9
7.1
neutral value predicted by the Ewens-Watterson model reflect the
B*3701
1
0
1
0.5
operation of balancing selection, while values higher than the neu-
B*3910
1
0
1
0.5
tral value reflect the influence of directional selection. By comparing
B*4016
2
0
2
1.1
observed homozygosity values with those expected under this
B*4101
1
0
1
0.5
model, Klitz et al. (20) have shown that balancing selection (hetero-
B*4201
9
5
4
4.9
zygote advantage) is in operation at the HLA-DRB, -DQA1 and -
B*44031
11
7
4
6.0
DQB1 loci. While this phenomenon has been observed in a large
B*44032
1
0
1
0.5
number of populations at these class II loci, it has not been demon-
B*4407
4
0
4
2.2
strated as clearly for the class I loci (22).
B*4501
6
4
2
3.3
In particular, the homozygosity value for the HLA-A locus in
B*4701
2
1
1
1.1
the Yaoundé population is significantly lower than expected for a
B*4703
1
1
0
0.5
population of the same size with the same number of alleles (Table
B*4901
10
5
5
5.4
7), and is consistent with the operation of balancing selection. In
B*5001
1
0
1
0.5
addition, an overall trend toward lower observed homozygosity
B*51011
3
2
1
1.6
values is evident for the Cameroon HLA-A and -B loci, consistent
B*5301
20
6
14
10.9
with the observations of Klitz et al. (20) for the class II loci, sug-
B*5703
5
3
2
2.7
gesting that balancing selection is operating on these populations,
B*5801
10
5
5
5.4
perhaps in concert with other evolutionary forces.
B*5802
20
6
14
10.9
8
4
4
4.4
184
82
102
B*8101
Total
1
2
Allele diversity and selective forces
100
Frequency is defined as the percentage of the number of times allele present in 184 alleles
total. Alleles for presumed homozygous samples are counted twice.
Novel alleles described in this report are in bold.
Table 6
Novel allele – HLA-A*03012
DNA from samples typed by SSOP as A*0301 or A*0303N was
sequenced to differentiate between these alleles. HLA-A*0303N contains a deletion in exon 3. Although the null allele was not detected,
a single nucleotide variant of A*0301 was identified. The codon
CTG at 156 was present in A*03012, resulting in a silent substi-
HLA-B locus
tution and no predicted amino acid change between A*03011 and
Thirty-three alleles were identified in 92 unrelated individuals
A*03012. Only A*8001 and A*03012 have CTG at codon 156, al-
(Table 6). The most frequently observed alleles were B*5301 and
though many HLA-B locus alleles contain this sequence.
296
Tissue Antigens 2000: 56: 291–302
Ellis et al : Class I alleles in Cameroon, Africa
Fig. 3. Exon 2 and 3 nucleotide sequences of the novel alleles
HLA-B*1403 and B*4703 are compared to the allele with closest
homology. Exons 1–6 of HLA-B*4016 are compared to both HLA-B*40012
and B*0706. Numbering begins at the first nucleotide of exon 1 (18).
Tissue Antigens 2000: 56: 291–302
297
Ellis et al : Class I alleles in Cameroon, Africa
Table 7
Homozygosity in Cameroon populations
Locus
Sample series
A
DT (Yaoundé)
A
CS (Etoa)
B
DT (Yaoundé)
B
CS (Etoa)
1
2n
k
Observed F
Expected F
p-value
82
24
0.0577
0.0901
⬍0.005
102
29
0.0668
0.0768
n.s.1
82
22
0.1047
0.1013
n.s.
100
24
0.0764
0.0993
n.s.
n.s., not significant
later identified in 8 unrelated individuals from another study, 7 of
Novel allele – HLA-A*2612
whom were African American and one of Middle Eastern descent
A single nucleotide variant of A*2601 was characterized from one
(data not shown). Based on this, HLA-B*1403 might be frequent in
individual in the CS series with a variant SSOP pattern. Probe 152E
African populations.
(covering nucleotides 520–534 in exon 3) did not hybridize, resulting
in an assigned typing of A*26v, A*2901/*2902. SBT demonstrated
HLA-A*2612 contains a GTG codon at 156, rather than GAG pres-
Novel allele – HLA-B*4703
ent in HLA-A*2601. A single amino acid change is predicted al-
HLA-B*4703 was identified from one individual and, later, 3 unre-
tering glutamic acid in A*2601 to valine in A*2612. Other HLA-A
lated African American individuals (data not shown). A two nucleo-
alleles, such as A*0201, contain this A*2612 sequence motif.
tide variant of B*4702, two amino acid substitutions at residues 82
(arginine to lysine) and 83 (glycine to arginine) are predicted in the
polypeptide encoded by B*4703. Codons 77–83 encode the Bw4 and
Novel allele – HLA-A*3006
Bw6 epitopes. The predicted polypeptide sequence is a novel combi-
A single nucleotide variant of HLA-A*3004, A*3006, was identified
nation containing residues 77 through 81 of the Bw6 epitope
from one individual. The GCG at codon 31 of the mature protein is
(SLRNL) joined to the last two residues of the Bw4 epitope (LR)
unique; all HLA-A alleles have the codon ACG, with the exception
(Fig. 4). No other alleles contain this hybrid sequence. The Class Ib
of A*8001 which has TCG at this position (24). This codon has not
gene, HLA-F, and the chimpanzee homologue of HLA-F contain the
been observed in any other HLA class I alleles, pseudogenes, primate alleles, or primate pseudogene sequences examined (25–29).
Using SSOP, this new allele would not have been identified as a
variant due to an absence of probes in the generally conserved region around codon 31. Due to the presence of a novel HLA-B allele
(B*4016) in this sample, the alleles of the HLA-A locus were also
sequenced rather than typed by SSOP. DNA from the other two
individuals typed as HLA-A*3004 by SSOP was also sequenced
confirming that they were indeed A*3004, not A*3006.
Novel allele – HLA-B*1403
A single nucleotide variant of HLA-B*1402 was identified from
DNA from individuals in the DT Series. HLA-B*1403 contains the
sequence CGG (rather than CTG in *1402) at codon 156, resulting
in a predicted amino acid change from leucine (B*1402) to arginine
(B*1403). This sequence is present at codon 156 in other reported
alleles, such as B*0705. A total of three individuals possessed the
novel B*1403 allele; all B*1403 positive individuals also carried
A*2301, suggesting an association with B*1403. HLA-B*1403 was
298
Tissue Antigens 2000: 56: 291–302
Fig. 4. Protein sequences observed at the Bw4/Bw6 epitope (codons 77–83) (26, 30).
Ellis et al : Class I alleles in Cameroon, Africa
same predicted sequence as HLA-B*4703 spanning codons 78–83,
but differ at codon 77 (alanine rather than the serine present in
B*4703) (26, 31). The chimpanzee allele, Patr-B*04, also contains
the same Bw4/Bw6 hybrid, with the exception of a glycine (G) at
codon 77. Studies of the Bw4 and Bw6 epitopes reveal that residues
77 and 80–83 are necessary components of these epitopes with residues 82 and 83 being the most crucial. We have predicted that this
novel HLA-B locus molecule bears a Bw4 serologic epitope since
R82 and G83 are essential for the Bw6 epitope and studies with a
B7/B27 hybrid generated by mutagenesis containing the SLRNLLR
sequence seen in HLA-B*4703 was shown to be Bw4 reactive (8,
32). Studies by Darke et al. have shown that HLA-B*4703 is reactive
with most or all Bw4 and some Bw6 antisera (33).
Novel allele – HLA-B*4016
HLA-B*4016 differs from B*40012 by five nucleotides, with each
substitution yielding a predicted amino acid change. The exons 1,
2, 4, and 5 of this allele are identical to B*40012, while exons 3, 4,
and 5 are identical to B*0705 and B*0706. Exon 4 and 5 sequences
of B*40012, *4016, *0705, and *0706 are identical. B*4016 possibly
arose following a reciprocal recombination event or an intraallelic
conversion event involving B*0705 (or *0706) and B*40012. B*4016
was present in two unrelated Cameroonian individuals, both from
Estimated haplotypes1 in Cameroon
Haplotype (A-B)
Frequency
Haplotype (A-B)
Frequency
0201–07021
0.0110
2902–4407
0.0110
0201–1801
0.0165
2902–4901
0.0220
0210–5301
0.0110
2902–5301
0.0220
0202–4016
0.0110
2902–5802
0.0110
0202–44031
0.0165
2902–8101
0.0110
0301–0801
0.0110
3001–4201
0.0165
0301–4201
0.0110
3001–5301
0.0110
0301–5301
0.0110
3001–5802
0.0110
0301–5801
0.0110
3002–1302
0.0165
0301–5802
0.0110
3002–5301
0.0165
2301–07021
0.0165
3002–5801
0.0110
2301–1403
0.0110
3303–5802
0.0110
2301–3501
0.0330
3402–51011
0.0110
2301–44031
0.0110
3601–5301
0.0165
2301–4501
0.0110
6601–3501
0.0110
2301–5301
0.0110
6601–5802
0.0110
2301–5801
0.0110
6802–0801
0.0110
2301–5802
0.0110
7401/2–1503
0.0110
2301–8101
0.0165
7401/2–5802
0.0110
2601–07021
0.0110
1
Only haplotypes appearing more than once are listed
Table 8
the village of Etoa and later identified in six other unrelated African
American individuals (data not shown). The sequence of this allele
was erroneously assigned both names HLA-B*4016 and HLAB*4017 by the WHO Nomenclature Committee and the name HLA-
Comparison of populations
B*4017 was dropped.
Few class I allele level population studies are available, making it difHLA-A and HLA-B associations
ficult to compare class I frequencies among populations. For this
study of the HLA-A locus, alleles have been grouped into predicted
A large number of HLA-A-B haplotypes were observed in both Ca-
serologic groups to compare to existing serologic level population
meroon population samples. Most haplotypes were observed once
studies. Fig. 5 compares the frequencies of the HLA-A antigens in Ca-
in each population sample, as indicated by the high value of diver-
meroon to the 1991 International Histocompatibility Workshop sero-
sity (h) for this population (h⫽0.98). Although not significantly as-
logic results for ‘‘West Africans’’ (34). A74 and A66 were not tested
sociated at the antigen level, the A*2301-B*3501 haplotype was the
for in the Workshop but these types would have been identified as
most common Cameroon haplotype, ocurring at a frequency of ap-
A19 and A10. A66 is also often typed A26 (35). A74 and A66 are listed
proximately 3–4% overall. Predicted haplotypes occurring more
in Fig. 5 for purposes of showing the frequency in Cameroon.
than once in the population are listed in Table 8.
Absent in both the Cameroon and Workshop results are HLA-
It seems likely that the pool of HLA-A-B haplotypes in the Ca-
A11, A25, and A43. HLA-A36 was not detected in the Workshop,
meroon population is very large, due both to the extensive allelic
but is present in Cameroon. The frequency of HLA-A29 is dramati-
diversity at these loci, and the observation that these allele levels
cally higher in Cameroon (10.4% vs. 2.3%), the frequency of HLA-
are maintained at relatively even levels by balancing selection. Sub-
A23 is nearly double (18.7% vs. 9.7%) and the frequency of HLA-
sequent studies of this and other west African populations should
A3 is increased (8.2% vs. 1.5%). Conversely, there is a much lower
endeavor to obtain very large sample sizes so as to more accurately
frequency in Cameroon of HLA-A1 (1.1% vs. 5.3%), HLA-A24 (0.6%
assess the extent and distribution of HLA-A-B haplotype diversity.
vs. 3.8%), HLA-A28 (3.9% vs. 12.6%), and HLA-A33 (2.7% vs.
Tissue Antigens 2000: 56: 291–302
299
Ellis et al : Class I alleles in Cameroon, Africa
Fig. 5. HLA-A antigen frequency comparison between the International Histocompatibility Workshop West Africans (34) and
Cameroon. HLA-A74 and -A66 were not tested for by workshop serology.
BL is HLA-A blank. Frequencies in Cameroon are represented by the
light bars (left), while the West African frequencies are the solid bars
(right).
15.2%). The most frequent serologic type, HLA-A2, has a similar
likely positively selected in an evolutionary timeframe. Whether
frequency in both populations (19.2% vs. 20.6%). Within the HLA-
other alleles frequent in this population have been positively se-
A2 family, 6 of the 30 known A2 alleles were detected in Cameroon,
lected for their ability to efficiently present peptides from patho-
including A*0201, A*0202 , A*0204, A*0205, A*0211, and A*0214.
genic organisms is unknown.
The diversity at the HLA-B locus makes these comparisons dif-
The inference of balancing selection in the Etoa population and
ficult because so many recently described HLA-B alleles have been
the generally low values of F for the Cameroon HLA-A and -B loci
defined solely by sequencing and not by serology. Of interest, and
suggests that the class I loci in this population have been subject
probably the most relevant, is the variation in HLA-B53 frequency
to evolutionary forces similar to those observed at the class II loci.
in western Africa. In Cameroon, HLA-B*5301 was present at a fre-
Similarly low homozygosity values have been observed at the HLA-
quency of 10.9%. In other African populations, HLA-B53 was sero-
A and B loci in Pacific/Asian populations, and a significantly low
logically detected in 22.2% of Nigerians, and 13.9% of the Serere
value has been observed in Chinese populations (22), suggesting
and Mandinka in Senegal, and 14–17% of two tribes in The Gambia
that class I and class II allele frequency distributions in populations
(Wolof and Mandinka), 8.8% of Zaireans, 8.3% of Zimbabweans,
around the globe have been shaped by balancing selection.
1.0% of South African Xhosa (36, 37). The variation in HLA-B53
frequency between different African populations could reflect the
Allelic diversity
diverse genetic backgrounds of these groups. The many historical
migrations of different tribes and peoples through Cameroon due to
A diverse group of alleles was detected in Cameroon. In a population
war, famine, expansion, colonization, and emigration, is reflected in
of slightly less than a hundred individuals, 30 HLA-A locus alleles
the allelic diversity at the HLA-A and HLA-B loci. This diversity
and 33 HLA-B locus alleles were detected. Diversity values (h) for the
may have also arisen under selective pressure as Cameroon is en-
HLA-A loci of the Etoa and Yaoundé population samples are 0.95 and
demic for P. falciparum malaria, among other parasites. The higher
0.97, respectively, while the values for the HLA-B loci are 0.94 and
frequency of HLA-B*5301, which has been hypothesized to provide
0.92. In comparison, the HLA-A locus diversity in Pacific/Asian popu-
resistance to severe malaria, is an example of an allele that was
lations is uniformly lower (h⫽0.37–0.88)(22). Comparing Camerooni-
300
Tissue Antigens 2000: 56: 291–302
Ellis et al : Class I alleles in Cameroon, Africa
ans, descending from a large founder population, to South American
to 50,000 per year during the peak of the slave trade in the 18th
Indians, who are believed to have originated from a much smaller
century. Most of the exported slaves arrived in the Americas and,
founder population, there was a drastic difference in the number of
thus, alleles present in West African populations are present in
alleles detected in the populations. As few as 3 to 6 HLA-A and 6 to 20
African Americans. All novel HLA-B alleles described in this study
different HLA-B alleles have been detected in South American Indian
of Cameroon were later identified in several unrelated African
populations with similar to larger sample sizes than our current study
Americans (and, in the case of HLA-B*1403, one individual of
(38). This observation supports the impact of founder population size
Middle Eastern origin) in routine typing of patients and volunteer
on the allelic diversity present in different populations.
donors for an American bone marrow registry. Allele level population studies of the HLA class I loci in Africans provide helpful
frequency data to be used in the development of stem cell registries
Alleles in populations of African origin
and the development of subunit or peptide-based vaccines, in adDuring the sixteenth century, between one and two thousand slaves
dition to aiding the understanding of human evolution and the im-
of West African origin were exported annually. This number rose
pact of disease on HLA polymorphism.
References
1. Parham P, Ohta T. Population biology of
antigen presentation by MHC class I
molecules. Science 1996: 272: 67–74.
2. Nei M, Livshits G. Genetic relationships of
Euroasians, Asians, and Africans and the
origin of modern Homo sapiens. Hum
Heredity 1989: 39: 276–81.
3. Stine OC, Dover GJ, Zhu D, Smith KD. The
evolution of two West African
populations. J Mol Evol 1992: 34: 336–44.
4. McEvedy, C. The Penguin Atlas of African
History. 2nd edition. New York: Penguin
Books, 1995.
5. Nelson, HD. Area Handbook for the United
Republic of Cameroon. 1st edition.
Washington, DC: U.S. Government Printing
Office, 1974.
6. Oh S, Fleischhauer K, Yang SY. Isoelectric
focusing subtypes of HLA-A can be defined
by oligonucleotide typing. Tissue Antigens
1993: 41: 135–42.
7. Cereb N, Maye P, Lee S, Kong Y, Yang SY.
Locus-specific amplification of HLA class
I genes from genomic DNA: locus-specific
sequences in the first and third introns of
HLA-A, -B, and -C alleles. Tissue Antigens
1995: 45: 1–11.
8. Lutz CT, Smith KD, Greazel NS et al. Bw4reactive and Bw6-reactive antibodies
recognize multiple distinct HLA structures
that partially overlap in the alpha-1 helix.
J Immunol 1994: 153: 4099–110.
9. Steiner N, Ng J, Bush J, Hartzman RJ,
Johnston-Dow L, Hurley CK. HLA-B alleles
associated with the B15 serologically
defined antigens. Hum Immunol 1997: 56:
84–93.
10. Bodmer JG, Marsh SGE, Albert ED et al.
Nomenclature for factors of the HLA
system. Tissue Antigens 1999: 53: 407–46.
11. Ewens WJ. The sampling theory of
selectively neutral alleles. Theor Popul Biol
1972: 3: 87–112.
12. Watterson GA. The homozygosity test of
neutrality. Genetics 1978: 88: 405–17.
13. Slatkin M. An exact test for neutrality
based on the Ewens sampling distribution.
Genet Res 1994: 64: 71–4.
14. Slatkin M. A correction to the exact test
based on the Ewens sampling distribution.
Genet Res 1996: 68: 259–60.
15. Guo SW, Thompson EA. Performing the
exact test of Hardy-Weinberg proportion for
multiple alleles. Biometrics 1992: 48: 361–
72.
16. Baur MP, Danilovs JA. Population analysis
of HLA-A, B, C, DR and other genetic
markers. In: Terasaki PI, ed.
Histocompatibility Testing 1980. Los
Angeles: UCLA Press, 1980.
17. Long JC, Williams RC, Urbanek M. An EM algorithm and testing strategy for
multiple locus haplotypes. Am J Hum
Genet 1995: 56: 799–810.
18. Mason PM, Parham P. HLA class I and II
sequence alignments.
www.anthonynolan.com/HIG/data.html
1999.
19. Hill AVS, Allsopp CEM, Kwiatkowski D et
al. Common West African HLA antigens
are associated with protection from severe
malaria. Nature 1991: 352: 595–600.
20. Klitz W, Thomson G, Baur MP. Contrasting
evolutionary histories among tightly linked
HLA loci. Am J Hum Genet 1986: 39: 340–
9.
21. Titus-Trachtenberg EA, Rickards O, De
Stefano GF, Erlich HA. Analysis of HLA
class II haplotypes in the Cayapa Indians
of Ecuador: a novel DRB1 allele reveals
evidence for convergent evolution and
balancing selection at position 86. Am J
Hum Genet 1994: 55: 160–7.
22. Bugawan TL, Mack SJ, Stoneking M et al.
HLA class I allele distributions at six
Pacific/Asian populations: evidence of
selection at the HLA-A locus. Tissue
Antigens 1999: 53: 311–9.
23. Mack SJ, Bugawan TL, Moonsamy PV et
al. Evolution of Pacific/Asian populations
inferred from HLA class II allele frequency
distributions. Tissue Antigens 2000: 55:
383–400.
24. Wagner AG, Hughes AL, Iandoli ML et al.
HLA-A*8001 is a member of a newly
discovered ancient family of HLA-A alleles.
Tissue Antigens 1993: 42: 522–9.
Tissue Antigens 2000: 56: 291–302
301
Ellis et al : Class I alleles in Cameroon, Africa
25. Cadavid LF, Hughes AL, Watkins DI. MHC
class I-processed pseudogenes in New World
primates provide evidence for rapid
turnover of MHC class I genes. J Immunol
1996: 157: 2403–9.
26. Lawlor DA, Warren E, Ward FE, Parham P.
Comparison of class I MCH alleles in
humans and apes. Immunol Rev 1990: 113:
147–85.
27. Watkins DI, Zemmour J, Parham P. Nonhuman primate MHC class I sequences.
Immunogenetics 1993: 37: 330.
28. Geraghty DE, Koller BH, Pei JI, Hansen JA.
Examination of four HLA class I
pseudogenes. J Immunol 1992: 149: 1947–
56.
29. Suarez B, Morales P, Castro MJ et al. MHCE polymorphism in Pongidae primates: the
same allele is found in two different
species. Tissue Antigens 1997: 50: 695–8.
30. Parham P, Lawlor DA. Evolution of class I
major histocompatibility complex genes
and molecules in humans and apes. Hum
Immunol 1991: 30: 119–28.
302
31. Allsopp CEM, Hill AVS, Kwiatkowski D et
al. Sequence analysis of HLA-Bw53, a
common West African allele, suggests an
origin by gene conversion of HLA-B35. Hum
Immunol 1991: 30: 105–9.
32. Toubert A, Raffoux C, Boretto J et al.
Epitope mapping of HLA-B27 and HLA-B7
antigens by using intradomain
recombinants. J Immunol 1988: 141:
2503–9.
33. Darke C, Guttridge MG, Street J, Thompson
J, Thomas M. HLA-B*4703: sequence
confirmation, serology and distribution.
Tissue Antigens 1999: 53: 586–90.
34. Imanishi T, Akaza T, Kimura A, Tokunaga
K, Gojobori T. Allele and haplotype
frequencies for HLA and complement loci
in various ethnic groups. In: Tsuji K,
Aizawa M, Sasazuki T, eds. HLA 1991
Proceedings of the Eleventh International
Histocompatibility Workshop and
Conference. Oxford: Oxford University
Press, 1992: 1065–220.
Tissue Antigens 2000: 56: 291–302
35. Hurley CK, Schreuder GMT, Marsh SGE,
Lau M, Middleton D, Noreen H. The search
for HLA-matched donors: a summary of
the HLA-A*, -B*, -DRB1*1/3/4/5* alleles
and their association with serologically
defined HLA-A, -B, and -DR antigens. Tissue
Antigens 1997: 50: 401–18.
36. Halle L, Mbayo K, Lurhuma Z et al. HLAA, B, C, DR and DQ polymorphism in
Zarieans. Tissue Antigens 1994: 44: 196–9.
37. Dieye A, Diaw ML, Rogier C, Trape JF,
Sarthou JL. HLA-A, -B, -C, -DR, -DQ typing
in a population group of Senegal:
distribution of HLA antigens and HLADRB1*13 and DRB1*11 subtyping by PCR
using sequence-specific primers (PCR-SSP).
Tissue Antigens 1996: 47: 194–9.
38. Cadavid LF, Watkins DI. Heirs of the jaguar
and the anaconda: HLA, conquest and
disease in the indigenous populations of
the Americas. Tissue Antigens 1997: 50:
702–11.