Genetic Diversity and Population Structure of 20

Genetic Diversity and Population Structure of
20 North European Cattle Breeds
J. Kantanen, I. Olsaker, L.-E. Holm, S. Lien, J. Vilkki, K. Brusgaard,
E. Eythorsdottir, B. Danell, and S. Adalsteinsson
Blood samples were collected from 743 animals from 15 indigenous, 2 old imported, and 3 commercial North European cattle breeds. The samples were analyzed
for 11 erythrocyte antigen systems, 8 proteins, and 10 microsatellites, and used to
assess inter- and intrabreed genetic variation and genetic population structures.
The microsatellites BoLA-DRBP1 and CSSM66 were nonneutral markers according
to the Ewens–Watterson test, suggesting some kind of selection imposed on these
loci. North European cattle breeds displayed generally similar levels of multilocus
heterozygosity and allelic diversity. However, allelic diversity has been reduced in
several breeds, which was explained by limited effective population sizes over the
course of man-directed breed development and demographic bottlenecks of indigenous breeds. A tree showing genetic relationships between breeds was constructed from a matrix of random drift-based genetic distance estimates. The breeds were
classified on the basis of the tree topology into four major breed groups, defined
as Northern indigenous breeds, Southern breeds, Ayrshire and Friesian breeds, and
Jersey. Grouping of Nordic breeds was supported by documented breed history
and geographical divisions of native breeding regions of indigenous cattle. Divergence estimates between Icelandic cattle and indigenous breeds suggested a separation time of more than 1,000 years between Icelandic cattle and Norwegian native breeds, a finding consistent with historical evidence.
From Animal Production Research, Agricultural Research Centre (MTT ), Jokioinen, Finland ( Kantanen
and Vilkki), Department of Morphology, Genetics and
Aquatic Biology, Norwegian College of Veterinary Science, Oslo, Norway (Olsaker), Department of Animal
Breeding and Genetics, Danish Institute of Agricultural
Science, Tjele, Denmark ( Holm and Brusgaard), Department of Animal Science, Agricultural University of
Norway, Ås, Norway ( Lien), Agricultural Research Institute, Keldnaholt, Reykjavik, Iceland ( Eythorsdottir),
Department of Animal Breeding and Genetics, Swedish
University of Agricultural Sciences, Uppsala, Sweden
( Danell), and the Nordic Gene Bank for Farm Animals,
Department of Animal Science, Agricultural University
of Norway, Ås, Norway (Adalsteinsson). J. Kantanen is
currently at the Department of Genetics, Trinity College, Dublin 2, Ireland. S. Adalsteinsson is currently at
Sudurgata 24, IS-101, Reykjavik, Iceland. The authors
would like to thank Nina Hovden-Sæther, Jon V. Jonmundsson, Elisabeth Koren, and Bine Melby for organizing animal sampling, Claes-Gøran Fristedt, Mogens
Kjaer, Tuula-Marjatta Nieminen, and Thorsteinn Olafsson for collection of blood samples, Valgerdur Andresdottir and Soeren Svendsen for isolation of DNA, Ole
Albert Guttersrud for technical assistance on DNA extraction and microsatellite genotyping, and Kaj Sandberg and Leif Anderson for genotyping Danish, Icelandic, Norwegian, and Swedish breeds for plasma
proteins and BoLA-DRBP1. We are indebted to Kari Elo
and Jaakko Lumme for valuable comments on an earlier draft of this manuscript. We acknowledge the financial support provided by the Nordic Council of Ministers and the Norwegian Ministry of Agriculture.
2000 The American Genetic Association 91:446–457
446
Genetic diversity of domestic cattle is typically distinguished into two components:
genetic differences between breeds and
genetic differences between individuals
within a breed. Previous studies on genetic diversity in cattle are based on data derived from typing of red cell antigens and
plasma proteins. Recently, increased preference has been given to microsatellites
that are ubiquitous throughout the cattle
genome and highly polymorphic (e.g., Machugh et al. 1997). Studies of genetic relationships between cattle breeds provide
useful information on the evolution of
breeds, gene pool development, and the
magnitude of genetic differentiation ( Baker and Manwell 1980; Blott et al. 1998;
MacHugh et al. 1997; Medjugorac et al.
1994). These studies are important for decisions concerning breed conservation.
Breeds with a unique evolutionary history
could potentially have a value in the maintenance of genetic diversity at the species
level ( Hall and Bradley 1995), but also for
breeding strategies aimed at improving
traits through heterosis (Graml and Pirchner 1984).
From the patterns of within-population
genetic variation at marker loci, it is possible to deduce demographic factors important to the conservation of domestic
cattle diversity (e.g., Cornuet and Luikart
1996; Kantanen et al. 1999). Breeds can be
observed to have lost within-population
genetic variation as a result of low effective population sizes ( Kantanen et al.
1999). At the marker loci, the level of allelic diversity is typically reduced more by
decreases in effective population sizes
(Ne ) than the multilocus heterozygosity,
establishing a situation known as allelic
deficiency within a population (Maruyama
and Fuerst 1985). Luikart et al. (1998) and
Luikart and Cornuet (1998) have presented a statistical test and a qualitative
graphical method for identifying this allele
deficiency from allele frequency data, and
have successfully applied these methods
to demonstrate an influence of demographic bottlenecks on genetic variation
in wildlife species.
A large phenotypic variation between
cattle populations from different geographical regions has been characteristic
for animal husbandry in northern Europe
( Denmark, Finland, Iceland, Norway, and
Materials and Methods
Breeds
The Danish, Finnish, Icelandic, Norwegian,
and Swedish breeds and the numbers of
animals (ranging from 11 to 49 animals per
breed, with a mean of 37 individuals per
breed, a total of 743 animals) studied
were:
Figure 1. The original breeding areas of indigenous and old imported North European cattle breeds are presented. Danish Jersey and Danish Shorthorn originated from British populations. Abbreviations of breed names
are as follows: JER, Danish Jersey; DKH, Danish Shorthorn; IS, Icelandic cattle; RDM, Red Danish 1970; SDM, Danish
Black-Pied 1965; SJM, Jutland breed; ORA, Eastern Red Polled; VRA, Western Red Polled; TM, Telemark cattle; DOL,
Doela cattle; VFJ, Western Fjord cattle; STN, Blacksided Troender and Nordland cattle; ROK, Swedish Red Polled;
SFR, Swedish Mountain cattle; WFC, Western Finncattle; EFC, Eastern Finncattle; NFC, Northern Finncattle.
Sweden). As a consequence of demands
for high-input farming systems, local cattle
breeds have become almost totally displaced by commercial red-and-white Ayrshire or black-and-white Holstein-Friesianbased breeds during the last decades.
Knowledge of the exact genetic relationships between native North European cattle breeds and their divergence from commercial cattle breeds is still lacking,
despite the risk of extinction for some of
the breeds. Icelandic cattle are assumed
to have descended from animals introduced from Norway when Iceland was colonized during the years 874–930 (Adalsteinsson 1981) and to have remained
isolated from other cattle populations.
Studies on bovine red cell antigen systems
( Brænd et al. 1962; Kidd and Cavalli-Sforza
1974) have indicated that Icelandic cattle
are closely related to the Norwegian native breeds Doela, Telemark, and Blacksided Troender and Nordland cattle. Local
Norwegian cattle populations are thought
to originate from Denmark, Sweden, and
possibly Finland ( Tuff 1951).
Within the present study, genetic diversity was estimated in 15 indigenous, 2 old
imported, and 3 commercial modern
North European cattle breeds using multilocus heterozygosity, the mean number
of alleles per locus, and the proportion of
polymorphic loci, and their genetic structure was examined. The present-day pattern of within-population genetic variation
of the North European cattle breeds was
also evaluated by adopting the methods of
Luikart et al. (1998) and Luikart and Cornuet (1998). The use of genetic markers
provides the only tool to assess population genetic structures of the most endangered indigenous cattle breeds because
they lack detailed pedigree recording data.
Furthermore, we studied genetic relationships between existing North European
cattle breeds, based on calculated genetic
distances and dendrograms. The magnitude of genetic differentiation between the
breeds was tested and divergence times
between Icelandic cattle and indigenous
and old imported cattle breeds were estimated. Icelandic cattle were selected as a
reference population because its history is
well documented. The results are based
on 29 loci, including microsatellites, red
cell antigen systems, plasma proteins, and
milk proteins.
(1) 15 indigenous breeds: Danish [Jutland Breed (49 animals), Danish Black-Pied
1965 (27), and Red Danish 1970 (39)],
Finnish [Eastern Finncattle (31), Northern
Finncattle (26), and Western Finncattle
(40)], Icelandic [Icelandic cattle (44)], Norwegian [Blacksided Troender and Nordland cattle (34), Doela cattle (35), Eastern
Red Polled (11), Telemark cattle (46),
Western Fjord cattle (41), and Western
Red Polled (36)], and Swedish [Swedish
Mountain cattle (41), and Swedish Red
Polled (34)]. All native Nordic breeds, except Icelandic cattle, consist of fewer than
5000 reproducing females (for census sizes, see Kantanen et al. 1999).
(2) Two old imported breeds: Danish
[Danish Jersey (41) and Danish Shorthorn
(41)]. These breeds were imported in the
19th century from Great Britain. The present census size of Danish Shorthorn is
approximately 200 breeding cows, and
that of the original Danish Jersey is 10,000.
(3) Three modern commercial breeds:
Finnish [Finnish Ayrshire (46) and Finnish
Holstein-Friesian (43)], and Norwegian
[Norwegian cattle-NRF (38)]. These are
large cattle breeds with population sizes
in excess of 300,000, 100,000, and 300,000,
respectively.
Finnish Ayrshire and NRF are classified
as Ayrshire-based breeds, and Finnish Holstein-Friesian, Danish Black-Pied 1965, and
Jutland Breed are black-and-white Friesian
breeds. Danish Black-Pied 1965 and Red
Danish 1970 represent original relics of the
old Danish Black-Pied and Red Danish
breeds, before being crossed with commercial black-and-white or red breeds.
The analyzed Danish Jersey animals were
original and have not been crossbred with
North American Jersey. The map ( Figure
1) shows the original geographical breeding areas of the indigenous and old imported breeds.
Collection of blood samples and extraction of DNA has been described previously
by Lien et al. (1999).
Loci
Estimates of genetic relationships between breeds, within-population diversi-
Kantanen et al • Genetic Diversity in North European Cattle 447
ty, and population structures were based
on allelic variation at 10 microsatellite
loci, 11 red cell antigen systems, 4 milk
protein loci, and 4 plasma protein loci.
The following microsatellites were genotyped: BoLA-DRBP1, CSSM66, INRA005,
INRA035, INRA063, ILSTS005, HEL1, HEL5,
HEL9, and HEL10 ( Barendse et al. 1997;
Kappes et al. 1997). Polymerase chain reactions (PCRs) of microsatellite loci were
carried out using either fluorescent or radioactively labeled primers. The amplified
products were separated on 6% denaturing polyacrylamide gels. Genetic variants
were visualized using either the A.L.F.
DNA Sequencer (Pharmacia, Uppsala, Sweden) or by autoradiography. Milk protein
genes (␣s1-casein, ␤-casein, ␬-casein, and
␤-lactoglobulin), antigens within each red
cell antigen system (A, B, C, F, J, L, M, S,
Z, R⬘, and T ), and one plasma protein
( Transferrin) were previously analyzed
( Kantanen et al. 1999; Lien et al. 1999). Detection of polymorphisms at three other
plasma protein loci (amylase-I, posttransferrin II, and postalbumin) were performed by electrophoretic methods according to Mazumder and Spooner (1970)
and Gahne et al. (1977).
Allele frequencies of the codominant
loci (microsatellites, milk and plasma proteins, and red cell antigen systems F and
R⬘) were estimated by direct allele counting. In the case of dominant loci (the red
cell antigen systems A, B, C, J, L, M, S, Z,
and T⬘), a maximum likelihood method
was used ( Hartl and Clark 1989). Alleles of
the A system were A, AH, H, and the recessive a, and those of the S system SH⬘,
H⬘, U⬘, H⬘U1, and the recessive s. Three alleles were detected in the F system (F, V1,
and V2). Polymorphic B and C systems
were both reduced to four alleles. Alleles
of the B system were B (antigen B), BGK
(antigen complex BGK; refer to Grosclaude
et al. 1979), non-B (those other than the B
or BGK antigen complex), and b (no antigens of system B detected). Accordingly,
alleles of the C system were C1, C2, C⬙, and
non-C. This last allele indicating an absence of C1, C2, or C⬙ antigens in the phenotype of an individual is extremely rare.
Information on alleles of other red cell antigen systems are documented by Kantanen et al. (1999). Allele frequencies of the
milk proteins (for all breeds) and transferrin (for Danish Jersey, Western Finncattle,
Finnish Ayrshire, and NRF) have been reported previously ( Kantanen et al. 1999;
Lien et al. 1999).
448 The Journal of Heredity 2000:91(6)
Statistical Analysis
Within-breed genetic variation and population structures. Intrabreed genetic variation
of the microsatellites, proteins, and erythrocyte antigen systems were quantified using observed heterozygosities and mean
expected unbiased heterozygosities ( Nei
1978) (in the case of dominant loci, only
mean expected unbiased heterozygosity
was possible), the average number of alleles per locus, and the percentage of
polymorphic loci. Loci were defined as
polymorphic when the frequency of the
most common allele was less than 0.95. Estimates of intrabreed genetic variation
were derived using the BIOSYS computer
program (Swofford and Selander 1989).
An exact test was used to determine
possible deviations from Hardy–Weinberg
proportions and the existence of nonrandom associations of genotypes across
polymorphic codominant loci (Guo and
Thompson 1992; Weir 1990). Exact tests
were performed using the program GENEPOP version 3.1 (Raymond and Rousset
1995b). A Markov chain Monte Carlo method was applied to compute unbiased estimates of the exact probabilities (P values). The length of the chain was set to be
100,000 iterations. Multiple test significance was assayed using Bonferroni correction (Weir 1990).
Population subdivision was examined
with Wright’s fixation indexes, using the
variance-based method of Weir and Cockerham (1984). F statistics were computed
using the FSTAT computer program (Goudet 1995). Means and standard deviations
of the F-statistics parameters, that is, the
correlation of genes in structured populations, F, ⌰, and f were obtained for each
locus across breeds by the jackknife procedure (Weir 1990). The significance of F,
⌰, and f estimates was determined by permuting alleles within the total population
(including all 20 breeds), genotypes within
the total population, and alleles within
samples, respectively, with 5000 permutations. The null hypothesis was that the estimates were not significantly different
from zero. The level of significance (P ⬍
.05) was adjusted with the Bonferroni correction.
The pattern of present-day within-population genetic variation was further assessed on the basis of frequency data of
codominantly inherited alleles (Cornuet
and Luikart 1996). Expected unbiased heterozygosities (Hexp) estimated assuming
Hardy–Weinberg equilibrium were compared with the theoretical values (Heq) calculated on the basis of the number of sam-
pled chromosomes and alleles observed
in the sample of a breed (Cornuet and Luikart 1996; Luikart and Cornuet 1998). This
statistical test is known to be a sign test
for heterozygosity excess. The theoretical
heterozygosity at a locus under the supposition of mutation-drift equilibrium was
obtained by 1000 iterations, assuming that
the allelic states of microsatellites change
according to the stepwise mutation model
(SMM) and proteins and erythrocyte antigen systems according to the infinite allele model ( IAM) (Cornuet and Luikart
1996). In addition, the microsatellite, protein and erythrocyte antigen alleles were
classified into 10 frequency classes. The
proportion of different classes was demonstrated using the qualitative graphical
method of Luikart et al. (1998), with the
aim of determining whether the distribution of allele frequencies follow the normal
L-shaped form, where alleles with the lowest frequencies (0.001–0.1) are the most
abundant. The computer program BOTTLENECK (Piry et al. 1999) was used for
calculations.
Genetic differentiation. The DA distance
was computed separately for microsatellite data (10 loci), protein-red cell antigen
system data (19 loci), and finally for a
combined dataset including all 29 loci.
Takezaki and Nei (1996) demonstrated
that genetic distances based on a model
of pure genetic drift, such as DA distance,
are suitable for constructions of phylogeny irrespective of mutation model of genetic markers (i.e., SMM, IAM, or twophase mutations). Trees were constructed
from DA distances using the neighbor-joining method of Saitou and Nei (1987). The
robustness of the phylogenies was evaluated by bootstrapping (1000 replicates)
over loci. For estimating evolutionary
time, the (␦␮)2 distance of Goldstein et al.
(1995) and the genetic distance measure
DTL of Tomiuk and Loeschcke (1991, 1995)
are typically recommended ( Takezaki and
Nei 1996; Tomiuk et al. 1998). From microsatellite data, we derived (␦␮)2 and DTL
distances between the breeds. Values for
(␦␮)2 distances were found to be less reliable compared to the documented history of Icelandic cattle, and standard errors of (␦␮)2 distance values estimated by
1000 bootstraps by resampling loci were
high (SE varied from 0.055 to 6.205). In
contrast, DTL appeared to give more reliable evolutionary relationships. Standard
deviations for DTL estimates obtained by
jackknifing based on 10 microsatellites
ranged from 0.005 to 0.040. DTL was chosen
for calculations of divergence times be-
Table 1. Mean observed (Hobs) and expected (Hexp) unbiased heterozygosities, mean number of alleles (A), and percentage of polymorphic loci (P0.95) in 20
North European breeds
Codominant protein-erythrocyte
antigen data (10 loci)
Microsatellite data (10 loci)
Whole protein-erythrocyte
antigen data (19 loci)
Breed
Hobs
Hexp
A
P0.95
Hobs
Hexp
A
P0.95
Hexp
A
Danish breeds
Danish Shorthorn
Danish Jersey
Red Danish 1970
Danish Black-Pied 1965
Jutland Breed
0.49 (0.09)
0.50 (0.07)
0.52 (0.05)
0.68 (0.05)
0.57 (0.07)
0.45 (0.08)
0.53 (0.05)
0.57 (0.03)
0.64 (0.04)
0.64 (0.05)
3.4 (0.5)
3.9 (0.4)
3.7 (0.4)
4.5 (0.5)
5.5 (0.7)
90
100
100
100
100
0.25 (0.08)
0.33 (0.06)
0.27 (0.07)
0.38 (0.06)
0.35 (0.06)
0.25 (0.07)
0.34 (0.07)
0.28 (0.07)
0.36 (0.05)
0.39 (0.06)
2.0 (0.2)
2.0 (0.2)
2.1 (0.2)
1.9 (0.1)
2.3 (0.3)
Finnish breeds
Eastern Finncattle
Northern Finncattle
Western Finncattle
Finnish Ayrshire
Finnish Holstein-Friesian
0.56 (0.06)
0.62 (0.07)
0.66 (0.05)
0.64 (0.06)
0.66 (0.05)
0.69 (0.05)
0.67 (0.06)
0.67 (0.04)
0.64 (0.05)
0.64 (0.04)
6.0 (0.8)
5.1 (0.7)
5.7 (0.7)
5.3 (0.9)
5.5 (0.8)
100
100
100
100
100
0.24 (0.06)
0.34 (0.05)
0.29 (0.06)
0.35 (0.07)
0.32 (0.05)
0.29 (0.06)
0.35 (0.05)
0.29 (0.06)
0.36 (0.07)
0.35 (0.05)
Icelandic breed
Icelandic Cattle
0.56 (0.08)
0.55 (0.07)
4.1 (0.7)
90
0.31 (0.05)
Norwegian breeds
Blacksided Troender
and Nordland cattle
Doela cattle
Eastern Red Polled
Telemark cattle
Western Fjord cattle
Western Red Polled
Norwegian cattle-NRF
0.63 (0.08)
0.64 (0.06)
0.64 (0.07)
0.54 (0.07)
0.68 (0.06)
0.57 (0.06)
0.63 (0.06)
0.61 (0.07)
0.64 (0.05)
0.65 (0.04)
0.59 (0.07)
0.67 (0.05)
0.62 (0.05)
0.62 (0.06)
5.2 (0.8)
4.7 (0.4)
4.0 (0.4)
5.5 (0.7)
6.4 (1.0)
5.1 (0.7)
5.0 (0.7)
100
100
100
90
100
100
100
Swedish breeds
Swedish Mountain cattle
Swedish Red Polled
0.64 (0.06)
0.55 (0.06)
0.65 (0.05)
0.60 (0.06)
5.2 (0.7)
5.2 (0.8)
100
100
P0.95
60
80
70
90
90
0.28 (0.06)
0.37 (0.05)
0.27 (0.05)
0.34 (0.04)
0.37 (0.04)
2.2 (0.2)
2.1 (0.2)
2.2 (0.2)
2.1 (0.1)
2.4 (0.2)
63
79
68
95
95
2.3 (0.2)
2.1 (0.1)
2.3 (0.2)
2.0 (0.2)
2.3 (0.2)
70
100
80
80
90
0.31 (0.04)
0.36 (0.04)
0.34 (0.04)
0.37 (0.04)
0.38 (0.04)
2.4 (0.2)
2.4 (0.2)
2.4 (0.1)
2.2 (0.1)
2.6 (0.2)
74
95
84
90
95
0.33 (0.05)
1.9 (0.1)
90
0.33 (0.04)
2.2 (0.2)
90
0.35 (0.05)
0.32 (0.08)
0.37 (0.07)
0.26 (0.06)
0.42 (0.05)
0.38 (0.08)
0.37 (0.05)
0.34 (0.05)
0.30 (0.07)
0.35 (0.06)
0.26 (0.06)
0.41 (0.05)
0.34 (0.06)
0.38 (0.04)
2.2 (0.1)
1.9 (0.2)
2.0 (0.1)
1.9 (0.1)
2.2 (0.1)
2.2 (0.1)
2.2 (0.1)
100
70
90
90
100
80
100
0.33 (0.05)
0.32 (0.05)
0.35 (0.04)
0.31 (0.05)
0.40 (0.04)
0.34 (0.04)
0.40 (0.03)
2.3 (0.2)
2.2 (0.2)
2.1 (0.1)
2.2 (0.1)
2.4 (0.2)
2.3 (0.2)
2.4 (0.1)
90
74
84
84
100
84
100
0.42 (0.06)
0.35 (0.07)
0.37 (0.05)
0.33 (0.07)
2.2 (0.1)
2.3 (0.2)
90
90
0.31 (0.04)
0.36 (0.05)
2.3 (0.2)
2.4 (0.1)
79
95
Standard errors are presented in parenthesis.
tween Icelandic cattle and the other Nordic indigenous and old breeds. The formula DTL ⫽ 2 ␮t ( Tomiuk et al. 1998),
where ␮ is the assumed mutation rate of
microsatellites and t indicates the time of
divergence as generations, was used. The
mutation rate was assumed to be 1.4 ⫻
10⫺4/locus/gamete according to averaged
estimates over large sets of loci in sheep
populations (Crawford and Cuthbertson
1996). Computer programs DISPAN ( T.
Ota, Pennsylvania State University), MICROSAT1.5b (Minch 1997), and POPDIST1.1.1 (Guldbrandtsen et al. 2000) were
applied for the respective calculations.
The null hypothesis, H0 ⫽ ‘‘the allelic
distribution at 20 codominant loci is identical across the breeds,’’ was tested using
an exact test for population differentiation
(Raymond and Rousset 1995a), available
within GENEPOP version 3.1 (Raymond
and Rousset 1995b). A Markov chain Monte Carlo method was applied to obtain an
unbiased estimate of exact probability as
in determination of Hardy–Weinberg equilibrium and genotypic associations. Probability estimates subsequently derived
from each locus were pooled for every
pairwise population comparison ( Fisher
1970).
Genetic markers for the phylogeny
study should be neutral. Therefore the Ew-
ens–Watterson test for neutrality for all
loci ( Ewens 1972; Watterson 1978) was
calculated using the computer algorithm
given by Manly (1985). The observed sum
of the squared allele frequencies (observed F), that is, homozygosity, was compared with the 95% confidence intervals
for the expected sum of the squared allele
frequencies (expected F). The sums of
squared alleles were adjusted for sample
sizes and the number of alleles. The 95%
confidence intervals and standard errors
for observed F were calculated using 1000
simulated samples. The Ewens–Watterson
test for neutrality was performed for each
locus separately across all breeds.
Results
Tables of allele frequencies for 10 microsatellites, 11 red cell antigen systems, and
8 protein loci in the 20 breeds, and tables
of DA, (␦␮)2, and DTL distances are available from the Nordic Gene Bank for Farm
Animals, Agricultural University of Norway. Allele frequencies for the four milk
proteins and transferrin ( Danish Jersey,
Western Finncattle, Finnish Ayrshire, and
NRF) have been reported by Lien et al.
(1999) and Kantanen et al. (1999), respectively.
Within-Breed Genetic Variation
A total of 92 microsatellite alleles were
identified. None of the microsatellite alleles detected were predominant across
all 20 cattle breeds. At INRA035, 19 breeds
had the same predominant allele of 104
bp. A total of 14 private microsatellite alleles (unique to one breed) were detected
in samples of Finnish Ayrshire, Finnish
Holstein-Friesian, NRF, Eastern Finncattle,
Western Finncattle, Blacksided Troender
and Nordland cattle, Doela cattle, Western
Fjord cattle, Western Red Polled, and
Swedish Red Polled. Frequencies of private alleles were lower than 0.1, except
that of the ILSTS005 allele (187 bp) identified in Doela cattle with a frequency of
0.157. The high frequency of a private allele could reflect that some individuals of
the 35 Doela cattle sampled may be related.
Mean expected unbiased heterozygosities based on microsatellite data varied
from 0.45 ( Danish Shorthorn) to 0.69
( Eastern Finncattle) ( Table 1). Values
based on codominant proteins and erythrocyte antigen systems ranged from 0.25
( Danish Shorthorn) to 0.41 (Western Fjord
Cattle), and data of all proteins and erythrocyte antigen systems including both codominant and dominant loci resulted in
similar heterozygosity values, ranging
Kantanen et al • Genetic Diversity in North European Cattle 449
from 0.27 (Red Danish 1970) to 0.40 (Western Fjord Cattle and NRF). Eastern Finncattle were exceptional since considerable
differences between observed and expected values for multilocus heterozygosity
existed at the microsatellite loci.
The average number of alleles varied
from 3.7 (Red Danish 1970) to 6.4 (Western Fjord Cattle) at the 10 microsatellite
loci studied, and from 2.1 ( Danish BlackPied 1965) to 2.6 ( Finnish Holstein-Friesian) at 19 protein loci and erythrocyte antigen systems ( Table 1). All microsatellite
loci were polymorphic in the studied
breeds with the exception of Danish Shorthorn, Icelandic cattle, and Telemark cattle.
All protein loci and erythrocyte antigen
systems were polymorphic in Western
Fjord cattle and NRF.
Hardy–Weinberg Expectations and
Random Genotypic Associations
In total, 33 of 400 locus-population comparisons revealed significant (P ⬍ .05) departures from Hardy–Weinberg proportions, which was slightly more than would
be expected by chance alone. The microsatellite locus INRA035 showed deviations
from Hardy–Weinberg proportions in Eastern Finncattle, Telemark cattle, Red Danish 1970, and Danish Jersey (P ⬍ .001, adjusted by the Bonferroni correction), and
in Swedish Mountain cattle and Swedish
Red Polled (P ⬍ .05). Significant deviations
from Hardy–Weinberg proportions were
also found at ␣s1-casein in Eastern Finncattle (P ⬍ .01), and at INRA005 in Jutland
Breed and Red Danish 1970 (P ⬍ .05).
The exact test for nonrandom association of genotypes across loci gave 20 significant values depicted in nine breeds.
Significant genotypic disequilibrium between markers on chromosome 6 was
found in Jutland breed ( between ␬-casein
and postalbumin, P ⬍ .001, adjusted by
the Bonferroni correction), in Danish Jersey ( between ␬-casein and ␣s1-casein, P ⬍
.01), in Finnish Ayrshire ( between ␬-casein
and ␤-casein, P ⬍ .001), in Icelandic cattle
( between ␤-casein and postalbumin, P ⬍
.01), and in NRF ( between ␤-casein and
postalbumin, P ⬍ .05). Further nonrandom
associations were found in Jutland breed
between INRA063 and CSSM66 (chromosomes 18 and 14, respectively; P ⬍ .001),
ILSTS005 and ␬-casein (chromosomes 10
and 6; P ⬍ .05), and HEL9 and transferrin
(chromosomes 8 and 1, respectively; P ⬍
.05), in Danish Jersey between BoLADRBP1 and CSSM66 (chromosomes 23 and
14, respectively; P ⬍ .001), and in Finnish
Ayrshire between transferrin and amylase
450 The Journal of Heredity 2000:91(6)
Table 2. Results of F statistics for each codominant locus estimated by jackknifing across the breeds
Locus
F (FITa)
⌰ (FSTa)
f(FISa)
Microsatellites
BoLA-DRBP1
CSSM66
ILSTS005
INRA005
INRA035
INRA063
HEL1
HEL5
HEL9
HEL10
Mean
0.105 (0.029)*b
0.159 (0.038)*
0.089 (0.048)
0.112 (0.030)*
0.481 (0.081)*
0.135 (0.033)*
0.080 (0.032)*
0.105 (0.022)*
0.117 (0.024)*
0.075 (0.033)*
0.133 (0.026)*
0.105 (0.021)*
0.145 (0.035)*
0.094 (0.035)*
0.078 (0.016)*
0.137 (0.049)*
0.103 (0.029)*
0.096 (0.023)*
0.133 (0.020)*
0.095 (0.017)*
0.085 (0.024)*
0.107 (0.007)*
0.000 (0.019)
0.017 (0.020)
⫺0.006 (0.047)
0.037 (0.033)
0.397 (0.073)*
0.035 (0.023)
⫺0.017 (0.027)
⫺0.032 (0.024)
0.024 (0.020)
⫺0.012 (0.031)
0.029 (0.027)*
0.143 (0.064)*
0.152 (0.056)*
0.139 (0.046)*
0.089 (0.020)*
0.193 (0.092)*
0.091 (0.036)*
0.120 (0.027)*
0.044 (0.021)*
0.156 (0.040)*
0.139 (0.035)*
0.123 (0.016)*
⫺0.034 (0.052)
⫺0.049 (0.032)
0.070 (0.039)
⫺0.011 (0.067)
⫺0.014 (0.041)
0.022 (0.034)
0.041 (0.073)
⫺0.033 (0.037)
⫺0.018 (0.034)
0.024 (0.028)
0.003 (0.013)
Proteins and erythrocyte antigen systems
0.117 (0.109)*
EAF
0.112 (0.078)
EAR
0.119 (0.056)*
Amylase I
0.088 (0.064)*
Posttransferrin II
0.181 (0.095)*
Postalbumin
0.111 (0.040)*
Transferrin
␣S1-casein
0.155 (0.059)*
␤-casein
0.011 (0.036)
␬-casein
0.141 (0.053)*
␤-lactoglobulin
0.160 (0.036)*
Mean
0.125 (0.022)*
a
b
Correspondence in Wright’s terminology, refer to Weir and Cockerham (1984).
Statistical significance P ⬍ .05 from permutation tests.
Standard deviations are presented in parentheses.
I (chromosome 1 and unknown location,
respectively; P ⬍ .05). Disequilibrium was
observed between HEL9 and HEL10 (chromosomes 8 and 19, respectively; P ⬍ .001)
and between BoLA-DRBP1 and CSSM66 (P
⬍ .001) in Western Fjord cattle, and between CSSM66 and HEL9 (P ⬍ .05) and between ␤-lactoglobulin and amylase I (chromosome 11 and unknown location,
respectively; P ⬍ .05) in Western Red
Polled. Furthermore, five nonrandom associations (P ⬍ .05) between markers in
Danish Shorthorn and one (P ⬍ .05) in
Swedish Mountain cattle were detected.
Analysis of F Statistics
Results of F statistics across all breeds using the method of Weir and Cockerham
(1984) are given in Table 2. Mean estimates for microsatellites for F, ⌰, and f
were 0.133, 0.107, and 0.029, respectively.
Estimates across proteins and erythrocyte
antigen markers were 0.125, 0.123, and
0.003, respectively. Estimates of ⌰ for each
locus were significantly (P ⬍ .05, adjusted
by the Bonferroni correction) different
from zero. Values for F were found to be
significantly different from zero, except
those for ILSTS005, EAR, and ␤-casein.
INRA035 had an exceptionally high f estimate resulting in an overall f estimate for
the microsatellite data being significantly
different from zero (P ⬍ .05). By excluding
INRA035, the microsatellite data gave
mean estimates and standard deviations
[f(P ⬎ .05) ⫽ 0.004 (standard deviation
0.008), ⌰ ⫽ 0.105 (0.008), and F ⫽ 0.109
(0.009)] corresponding to the proteinerythrocyte antigen data. Although the
present breed samples do not allow an examination of the inheritance of microsatellite alleles, it is assumed that the results
obtained for INRA035 in the analyses of
Hardy–Weinberg equilibrium and the overall f estimate are due to the presence of
undetected null alleles at the INRA035 locus.
The Sign Test and Distribution of Allele
Frequencies
Assuming SMM, the sign test showed that
Red Danish 1970 had nine microsatellite
loci with heterozygosity excess, that is,
there were too few microsatellite alleles
compared to the level of expected heterozygosities ( Table 3). This proportion of
microsatellite loci showing heterozygosity
excess in Red Danish 1970 was significantly larger (P ⫽ .03) than that supposed by
stochastic effects alone.
Using the codominant proteins and
erythrocyte antigen systems in the sign
test, a significant proportion of loci indicated a heterozygosity excess (P ⬍ .05) in
samples of Jutland breed, Danish Jersey,
Danish Black-Pied 1965, Northern Finncattle, Finnish Ayrshire, Icelandic cattle,
Western Fjord cattle, and Swedish Moun-
Table 3. Number of loci showing heterozygosity excess or deficiency, and probabilities obtained from
the sign test, assuming that microsatellites and protein-erythrocyte antigen markers evolve according to
the stepwise mutation model and the model of infinite number of mutations, respectively
Protein-erythrocyte antigen data
Microsatellite data
Breed
Exc. Hexp
Exc.
Hobs
Def.
Hobs
P
Exc.
Exc. Hexp Hobs
Def.
Hobs
P
Danish Shorthorn
Danish Jersey
Red Danish 1970
Danish Black-Pied 1965
Jutland Breed
Eastern Finncattle
Northern Finncattle
Western Finncattle
Finnish Ayrshire
Finnish Holstein-Friesian
Icelandic Cattle
Blacksided Troender and
Nordland Cattle
Doela Cattle
Eastern Red Polled
Telemark Cattle
Western Fjord Cattle
Western Red Polled
Norwegian Cattle-NRF
Swedish Mountain Cattle
Swedish Red Polled
5.04
5.60
5.69
5.76
5.75
5.77
5.74
5.78
5.76
5.75
5.09
5
3
9
8
8
5
7
4
7
5
6
4
7
1
2
2
5
3
6
3
5
3
.617
.089
.030
.131
.129
.425
.319
.204
.324
.431
.399
3.63
3.55
3.59
3.87
4.02
4.47
4.32
4.49
3.50
4.39
3.67
5
8
5
9
8
5
8
6
8
7
8
3
0
3
0
1
5
2
4
0
3
1
.266
.001
.257
.000
.008
.487
.020
.259
.001
.088
.044
5.66
5.74
5.95
5.78
5.61
5.72
5.83
5.81
5.73
6
7
7
3
4
5
6
8
5
4
3
3
7
6
5
4
2
5
.546
.318
.370
.073
.238
.438
.590
.138
.436
4.46
3.44
4.01
3.72
4.32
4.32
4.35
4.37
4.47
7
6
6
6
9
7
7
9
6
3
2
3
3
1
3
3
1
4
.096
.071
.158
.114
.003
.081
.084
.003
.253
Exc. Hexp ⫽ expected number of loci with heterozygosity excess.
Exc. Hobs ⫽ observed number of loci with heterozygosity excess.
Def. Hobs ⫽ observed number of loci with heterozygosity deficiency.
P ⫽ probability derived from the sign test.
tain cattle, when the IAM model was assumed to explain the pattern of change in
allelic states ( Table 3).
The mode of allele frequency distribution was evaluated by plotting frequency
histograms ( Figure 2). Frequency histograms of Northern Finncattle and Doela
cattle showed that the proportion of microsatellite alleles ( black bars in Figure 2D
and F) in the class between frequencies of
0.001 and 0.1 was lower than that in the
class between 0.101 and 0.2, indicating a
mode-shift distortion in the distribution of
microsatellite alleles. Distribution of codominant protein and erythrocyte antigen
alleles (open bars) into the 10 different frequency classes deviated from the normal
L-shaped mode in Icelandic cattle ( Figure
2A), Danish Black-Pied 1965 ( Figure 2B),
Danish Jersey ( Figure 2C), Northern Finncattle ( Figure 2D), Finnish Ayrshire ( Figure 2E), Doela cattle ( Figure 2F), Eastern
Red Polled ( Figure 2G), Western Fjord cattle ( Figure 2H), NRF ( Figure 2I), and Swedish Mountain cattle ( Figure 2J).
The sign test and graphical method assumes Hardy–Weinberg equilibrium and
linkage equilibrium between loci (Cornuet
and Luikart 1996; Luikart et al. 1998).
When INRA035 was omitted from the microsatellite data, the sign test under the
SMM showed a deficiency of alleles in Red
Danish 1970 (P ⫽ .048), and in Jutland
breed (P ⫽ .050), and an excess of alleles
(Hexp ⬍ Heq) in Telemark cattle (P ⫽ .040).
The distorted mode of allele frequency
distribution due to a lack of rare microsatellite alleles was observed for Doela
cattle.
The casein loci are tightly linked ( Feretti et al. 1990). The sign test and the qualitative graphical method was performed
using 8 instead of 10 markers by omitting
␣s1-casein (showed monomorphisms in
seven breeds), and either ␤-casein or ␬casein from the protein-erythrocyte antigen marker data. When ␣s1-casein and ␤casein were excluded, the data for Doela
cattle (only seven polymorphic loci) indicated a normal L mode for allele frequency
distribution instead of a distorted mode,
and the reverse for Blacksided Troender
and Nordland cattle. When ␣s1-casein and
␬-casein were excluded, the sign test and
the graphical method gave corresponding
results as the whole dataset, except for
Western Red Polled which showed a shifted frequency distribution of protein and
erythrocyte antigen alleles.
Excluding markers deviating from the
assumptions required for bottleneck testing (Cornuet and Luikart 1996; Luikart et
al. 1998) did not drastically alter the test
results. This indicates that the tests for
bottlenecks are robust.
Genetic Distances and Trees
For the microsatellite dataset, values of
the DA distance ( Nei et al. 1983) ranged
between 0.0716 ( between Eastern and
Western Finncattle) and 0.3091 ( between
Icelandic cattle and Danish Jersey). Distance for the dataset of 11 red cell antigen
systems and eight protein loci varied between 0.0204 ( between Western Finncattle
and Western Fjord cattle) and 0.1761 ( between Red Danish 1970 and Danish Jersey). Use of the combined dataset, including all 29 loci, showed that DA distances
ranged between 0.0408 ( between Eastern
and Western Finncattle) and 0.2153 ( between Red Danish 1970 and Danish Jersey).
DTL distance estimates calculated from
microsatellite data varied from 0.040 ( between Blacksided Troender and Nordland
cattle, and Western Fjord cattle) to 0.236
( between Danish Jersey and Red Danish
1970) ( Table 4). DA ( based on 29 loci) and
DTL ( based on microsatellites; Table 4) distance estimates showed that Icelandic cattle were genetically closest to Blacksided
Troender and Nordland cattle, Swedish
Mountain cattle, Northern Finncattle, Eastern Finncattle, and Western Finncattle.
Unrooted neighbor-joining trees (Saitou
and Nei 1987) were constructed from DA
distances based on (1) 10 microsatellites,
(2) 19 proteins and red cell antigen systems, and (3) the combined dataset of 29
loci. The last is presented in Figure 3. Only
seven nodes were supported by bootstrap
values higher than 50%. In this dendrogram, Northern Finncattle, Swedish Mountain cattle, Icelandic cattle, Blacksided
Troender and Nordland cattle, Eastern
Finncattle, and Western Finncattle tended
to form one group. The nodes grouping
the three Friesian breeds were among the
most strongly supported. The phylogeny
(tree not presented) was reconstructed
using 29 loci without the three commercial
breeds of Finnish Ayrshire, Finnish Holstein-Friesian, and NRF, which have been
partially generated by crossbreeding ( Kantanen et al. 1999). The branching pattern of
the indigenous and old imported breeds
was unchanged from that in Figure 3.
Lower bootstrap values were observed
when the microsatellite tree and proteinred cell antigen tree were constructed separately (not presented) than for the total
data. The microsatellite tree showed a
similar topology to that of the dendrogram
based on all 29 loci. Nodes grouping Eastern Red Polled/Danish Jersey, Western
Fjord cattle/Western Red Polled, Red Danish 1970/Danish Shorthorn, and Northern
Kantanen et al • Genetic Diversity in North European Cattle 451
Figure 2. Distribution of allele frequencies in 20 North European cattle breeds. Black bars represent the proportion of microsatellite alleles; white bars the proportion of
protein and erythrocyte antigen alleles in different frequency classes. The x axes represents the 10 frequency classes, and the y axes the proportion of alleles. Allele frequency
distributions suggest bottlenecks (distorted shape) in breeds A–J, whereas K–T tend to indicate normal L-shaped form.
Table 4. Divergence time estimates of Icelandic cattle from other indigenous and old breeds based on
DTL genetic distance estimates (Tomiuk and Loeschcke 1991, 1995) for 10 microsatellite loci
Breeds
DTL
Divergence
times
(generations)
Icelandic cattle versus
Blacksided Troender and Nordland cattle
Swedish Mountain cattle
Northern Finncattle
Western Finncattle
Eastern Finncattle
Western Fjord cattle
Swedish Red Polled
Doela cattle
Jylland Breed
Telemark cattle
Western Red Polled
Red Danish 1970
Danish Shorthorn
Eastern Red Polled
Danish Black-Pied 1965
Danish Jersey
0.062
0.084
0.094
0.105
0.105
0.116
0.135
0.137
0.165
0.172
0.182
0.206
0.220
0.236
0.237
0.240
221
300
336
375
375
414
482
489
589
614
650
736
786
843
846
857
95% CI indicate 95% confidence intervals for divergence times.
452 The Journal of Heredity 2000:91(6)
95% CI
[206, 237]
[280, 320]
[311, 360]
[355, 395]
[357, 393]
[388, 441]
[453, 511]
[465, 514]
[554, 625]
[572, 656]
[603, 696]
[687, 784]
[730, 841]
[794, 891]
[787, 906]
[793, 921]
Finncattle/Swedish Mountain cattle were
supported by bootstrap values of 86%,
73%, 63%, and 71%, respectively. The occurrence of the Danish Black-Pied 1965/
Jutland breed grouping was 80%, and the
node grouping Finnish Holstein-Friesian
and two other Friesian breeds had an occurrence of 75%.
In the protein-red cell antigen tree (not
presented), only three nodes ( Telemark
cattle/Danish Shorthorn, Danish BlackPied 1965/Jutland breed, and the node
grouping Swedish Red Polled and Eastern
Red Polled/Red Danish 1970) were supported by bootstrap values exceeding
50%. Some differences in topology compared to the two other trees were observed. For example, Finnish Ayrshire and
Norwegian cattle were in a different cluster, and Finnish Holstein-Friesian were
grouped independently of other Friesian
breeds.
erations (1105–1326 years), but for an appreciably longer time period from other
Norwegian indigenous breeds.
Population Differentiation
The exact test for population differentiation (Raymond and Rousset 1995a) showed
that combined probabilities in every pairwise breed comparison were less than
0.01 (in the case of four comparisons P ⬍
.01, other comparisons gave P ⬍ .001), indicating that allelic distributions are significantly different between breeds.
Figure 3. Unrooted neighbor-joining tree constructed from DA distances showing the relationships between 20
North European cattle breeds. Genetic distances are based on pooled data of 29 loci. Numbers at the nodes
represent the percentage of group occurrence in 1000 bootstrap replicates.
hand with Danish Jersey occurred 857
generations ago (95% confidence intervals
were 793–921 generations). Assuming a
generation interval of 5 or 6 years ( Kantanen et al. 1999), a divergence time of
some 4200–5100 years was estimated (95%
confidence intervals give 4000–5500 years).
The results indicate that Icelandic cattle
has been separated from Blacksided
Troender and Nordland cattle for 221 gen-
Divergence Times
Divergence times between Icelandic cattle
and other North European indigenous and
old breeds based on the microsatellite
data and estimates of DTL distances are
given in Table 4. The magnitude of the DTL
genetic distance between Icelandic cattle
and Danish Jersey suggests that separation of the lineages ending on one hand
with Icelandic cattle, and on the other
Table 5. The Ewens–Watterson test for neutrality
Locus
Obs F
SE
95% CI for Exp F
BoLA-DRBP1
CSSM66
Amylase I
0.1317
0.1671
0.5069
0.0911
0.0205
0.0260
[0.1749, 0.7227]
[0.1771, 0.7297]
[0.5066, 0.9986]
Locus
Breed
Obs F
SE
95% CI for Exp F
BoLA-DRBP1
Blacksided Troender
Danish Shorthorn
Doela Cattle
Eastern Finncattle
Northern Finncattle
Blacksided Troender
Eastern Red Polled
Finnish Ayrshire
Finnish Holstein-Friesian
NRF
0.1946
0.2195
0.1804
0.1738
0.1649
0.1375
0.2686
0.1609
0.1696
0.1745
0.0144
0.0175
0.0138
0.0105
0.0087
0.0066
0.0144
0.0093
0.0124
0.0191
[0.1977, 0.6631]
[0.2296, 0.7365]
[0.1967, 0.6522]
[0.1738, 0.5869]
[0.1746, 0.5473]
[0.1440, 0.4567]
[0.2851, 0.7521]
[0.1656, 0.5458]
[0.1858, 0.6171]
[0.2199, 0.7379]
CSSM66
Observed F (Obs F) values for BoLA-DRBP1, CSSM66, and amylase I are given. Standard errors (SE) for Obs F and
95% confidence intervals (95% CI for Exp F) were calculated using 1000 simulated samples. Values for BoLA-DRBP1
and CSSM66 within individual breeds are presented.
Test for Neutrality
The overall test for neutrality (pooled
across breeds) showed that observed F
values of two microsatellite loci, BoLADRBP1 and CSSM66, outlied the lower and
upper boundaries of the 95% confidence
region, and the observed F value for amylase I approached the minimum expected
F value ( Table 5). The Ewens–Watterson
test within each breed indicated that the
observed F value for BoLA-DRBP1 was outside the 95% confidence region of expected F in the individual samples of Eastern
Finncattle, Northern Finncattle, Doela cattle, Blacksided Troender and Nordland
cattle, and Danish Shorthorn. Similar findings were attained for CSSM66 in samples
of Finnish Ayrshire, Finnish Holstein-Friesian, Eastern Red Polled, Blacksided
Troender and Nordland cattle, and Norwegian cattle ( Table 5).
Discussion
The results support the usefulness of microsatellites as markers to examine the
evolutionary relationships between cattle
breeds. The protein-red cell antigen tree
appears to correspond less strongly with
known recent breed histories than the microsatellite and all the 29 loci-based phylogenies. As the average mutation rate of
microsatellites exceeds that of proteins,
and different genotyping techniques are
applied, the average number of observed
alleles at microsatellite loci is higher than
for protein loci (Crawford and Cuthbertson 1996; Kimura and Ohta 1975; Maruyama and Fuerst 1985; Ohta and Kimura
1973). Estimates of population genetic diversity, based on markers with a large
number of alleles, have been suggested to
be less biased than those based on lowpolymorphic markers ( Bowcock et al.
1994).
The current data ( Figure 3) suggest a division of the North European breeds into
four major genetic groups as follows:
Kantanen et al • Genetic Diversity in North European Cattle 453
Northern indigenous breeds, Southern
breeds, Ayrshire and Friesian breeds, and
the Jersey breed. The robustness of breed
classification based solely on genetic analysis appeared to be weakened by previous
or recent admixtures and insufficient numbers of analyzed microsatellite markers.
The branching pattern of the tree, DTL genetic distances between Icelandic cattle
and other indigenous and old breeds, together with historical evidence, suggest
that the Northern indigenous breed group
includes Icelandic cattle, Blacksided
Troender and Nordland cattle, Swedish
Mountain cattle, Northern Finncattle, Eastern Finncattle, and Western Finncattle.
The original breeding area of these animals has been the northernmost part of
Northern Europe ( Figure 1). Icelandic cattle are probably descendants from animals
of Norwegian origin brought into Iceland
between the years 874–930 (Adalsteinsson
1981). Previous studies on blood groups
have tended to confirm this ancestry
( Brænd et al. 1962; Kantanen et al. 1999;
Kidd and Cavalli-Sforza 1974). Current results ( Figure 3; Table 4) indicate that animals transported to Iceland may have
originated from breeding areas of the present-day Blacksided Troender and Nordland cattle rather than areas of Doela cattle and Telemark cattle as suggested by
Brænd et al. (1962) and Kidd and CavalliSforza (1974). It is possible to estimate
that the separation of Icelandic cattle from
Troender cattle occurred 1100–1300 years
ago, a finding consistent with the documented history of Icelandic cattle. Another explanation could be provided if it is
assumed that the cattle populations further south in Norway have been influenced to a greater extent by gene flow
from other geographic regions after separation from Icelandic cattle than the population resident in the north. The results
do not suggest clear allocation of Western
Fjord cattle, Doela cattle, and Western Red
Polled into any of the present breed
groups ( Figure 3), suggesting that these
breeds are admixtures of different groups
(MacHugh et al. 1997). Divergence between Icelandic cattle and Eastern and
Western Finncattle, the easternmost
breeds of the Northern breed group, is
proposed to have occurred approximately
2000 years ago.
Danish, Norwegian, and Swedish indigenous breeds were divided between more
than one cluster in the tree ( Figure 3),
whereas Finnish indigenous breeds were
clustered together in the Northern indigenous breed group. This implies that dif-
454 The Journal of Heredity 2000:91(6)
ferent native cattle breeds in Denmark,
Norway, and Sweden have their origins in
different evolutionary lineages, whereas
indigenous cattle in Finland appear to descend from one lineage. A common historic origin of the breeds classified into the
Northern indigenous group is obvious.
Medjugorac et al. (1994) studied genetic
relationships between 14 Balkan and
crossbred Balkan-Alpine cattle breeds, estimating genetic distance from the typings
of erythrocyte antigens, milk, and plasma
proteins. The patterns of genetic distances observed between these breeds reflected the historical spread of cattle populations in Europe. Similar regular patterns
can also be found in North Europe between Northern indigenous breeds. Genetic distance estimates of DA ( based on the
whole data) and DTL ( based on the microsatellite data) between Icelandic cattle
and other breeds in the Northern group
increased regularly in accordance with
geographical distribution from northwest
to southeast. As a result, similar regular
increases in divergence time estimates of
Icelandic cattle from the other Northern
indigenous breeds were observed ( Table
4). The proposed common origin of the
Northern indigenous breeds is based on
nuclear genetic marker analysis and it
would be useful to examine if mitochondrial and Y-chromosome markers would
be suitable tools to test this result (see
Bradley et al. 1994, 1996).
The original breeding area for Eastern
Red Polled, Swedish Red Polled, Telemark
cattle, and Red Danish 1970 has been confined to southern Scandinavia, but these
breeds do not form a homogeneous group.
The dendrogram ( Figure 3) indicates
grouping together of Eastern Red Polled
and Swedish Red Polled, but the grouping
of Telemark cattle with the two Red Polled
breeds is less strongly supported than
that with Red Danish 1970 and Danish
Shorthorn. The branching pattern ( Figure
3) implies clustering of Red Danish 1970
and the old imported breed, Danish Shorthorn, but the topology is not strongly supported. To conclude, we suggest that the
Southern breed group could be divided
into two subgroups: indigenous Red Polled
breeds ( Eastern Red Polled and Swedish
Red Polled), and indigenous and old imported Southern Scandinavian breeds
( Telemark cattle, Red Danish 1970, and
Danish Shorthorn).
Based on the present results on genetic
variation at nuclear genetic markers ( Figure 3) and the known history of the Friesian breeds, the three commercial
breeds—Finnish Ayrshire, Finnish Holstein-Friesian, and NRF—together with the
native Jutland breed and Danish BlackPied 1965 are suggested to form an Ayrshire and Friesian group. A similar observation concerning the grouping of Ayrshire
and Friesian breeds has recently been reported ( Blott et al. 1998). Grouping of Ayrshire-based breeds in the same cluster as
Friesian breeds indicates that these
breeds have a common ancestry.
The Danish Jersey appears to have no
close genetic relationships with the other
breeds in the study. This finding is consistent with other studies (e.g., Baker and
Manwell 1980; Medjugorac et al. 1994;
Blott et al. 1998), and it is suggested that
the Jersey form a fourth breed group ( Figure 3).
Genetic relationships between the Nordic breeds can partly explain the level of
present-day intrabreed genetic variation in
some of the breeds. The values of the
mean expected heterozygosities calculated from microsatellite data, and the mean
numbers of microsatellite and codominant
protein and erythrocyte antigen alleles decreased regularly in accordance with the
geographical distribution of the Northern
indigenous breeds from the southeast
( Eastern Finncattle, Western Finncattle,
Northern Finncattle) to the northwest
(Swedish Mountain cattle, Blacksided
Troender and Nordland cattle, Icelandic
cattle), suggesting the existence of a cline
with microsatellite-based heterozygosity
and allele numbers from the southeast toward the northwest of Northern Europe.
However, differences between estimates of
intrabreed genetic variation do not approach statistical significance as indicated
by the Student–Newman–Keuls test (results not presented). The relatively high
genetic diversity within the Western Fjord
cattle breed supports the assumption that
Western Fjord cattle have evolved through
an admixture of breeds possibly descending from different breed groups. In general,
both microsatellite data and red cell antigen and protein data indicate that the
breeds show similar levels of within-population genetic variation, estimated by
multilocus heterozygosity and allelic diversity ( Table 1), although recent demographic and evolutionary histories of Nordic cattle breeds differ considerably.
Eastern Finncattle had an unusual population structure since the breed had extraordinarily large deviations between observed and expected heterozygosities.
This indicates a subdivided population
structure for Eastern Finncattle (Wah-
lund’s effect), which is in accordance with
the fact that most of the Eastern Finncattle
animals saved for conservation originate
from genetically isolated inbred herds.
Several North European cattle breeds
showed the pattern of allelic deficiency at
the protein and erythrocyte antigen marker loci ( Table 3; Figure 2). When a population undergoes a reduction in effective
population size (Ne) it usually loses rare
alleles, but multilocus heterozygosity can
remain higher than expected, when expressed on the basis of the number of remaining alleles ( Luikart and Cornuet 1998;
Maruyama and Fuerst 1985). Thus both
the excess of heterozygosity ( Table 3) and
a distorted allele frequency distribution
( Figure 2) suggest a loss of allelic diversity
due to lowered Ne in several North European cattle breeds.
The probability of detecting an allelic
deficiency in the present-day pattern of
within-population genetic variation depends on the duration of time since the
reduction of Ne was initiated and the
length of this reduction (Cornuet and Luikart 1996; Luikart et al. 1998; Maruyama
and Fuerst 1985). The Danish Black-Pied
1965, Red Danish 1970, Northern Finncattle, Eastern Red Polled, and Western Fjord
cattle have experienced a decrease in population size corresponding to the bottlenecks of wild animal populations in the examples of Luikart et al. (1998) and Luikart
and Cornuet (1998). This reduction of census size to less than 200 individuals may
have induced allele losses. In contrast, Icelandic cattle, Danish Jersey, Finnish Ayrshire, NRF, and Swedish Mountain cattle
have not experienced similar severe demographic bottlenecks, although their
census sizes are also in decline. Island
breeds and breeds originating from island
breeds—such as Icelandic cattle, Danish
Jersey, Danish Shorthorn and Red Danish
1970, and breeds on the British Isles
(MacHugh et al. 1997)—appear typically
to display a lower level of intrabreed genetic variation than continental breeds.
This reflects limitations in founder populations and a more restricted gene flow for
island breeds. If the original number of alleles was small due to a small founder
population, the allelic deficiency may not
be detectable after the reduction of Ne
(Maruyama and Fuerst 1985). This could
explain the equilibrium condition presently observed in the endangered Danish
Shorthorn. The detected deficiency of alleles in Icelandic cattle and Danish Jersey
implies a more pronounced reduction of
effective size during the demographic his-
tory of these breeds. In a recent report,
the variance Ne of Danish Jersey was observed to have decreased over a 40-year
period ( Kantanen et al. 1999). The assumption of a limited founder size may
not exclusively explain the lack of alleles
in Nordic cattle breeds. Therefore it appears reasonable that the deficiency in the
number of protein and erythrocyte antigen alleles can be explained by restricted
Ne induced by breeding strategies applied
over the course of several generations as
a consequence of a low number of males
needed for breeding. Furthermore, in
breeding strategies there is also a need for
groups of related individuals within a
breed in order to obtain a reliable evaluation of animal breeding values. The highest ranking animals will contribute proportionally more to genetic variation of
the next generation than their contemporaries. Despite continuous importation of
genes via semen or breeding animals, allelic deficiency appears to be present in
the Finnish Ayrshire and NRF breeds. Only
Northern Finncattle and Doela cattle
showed a lack of both microsatellite and
protein/erythrocyte antigen alleles ( Figure
2D and F; Table 3, protein and erythrocyte
antigen data), while the Red Danish 1970
only showed the deficiency in microsatellite alleles ( Table 3). Both Northern Finncattle and Doela cattle were close to extinction during the 1970s, suggesting that
these breeds have experienced a more severe reduction of Ne than other Nordic
breeds.
In this study, 20 significant nonrandom
genotypic associations were observed in
nine breeds, the Jutland breed, Danish Jersey, Danish Shorthorn, Finnish Ayrshire,
Icelandic cattle, Western Fjord cattle,
Western Red Polled, NRF, and Swedish
Mountain cattle. In cattle, albumin and the
tightly linked three casein loci locate in
the same linkage group on chromosome 6
( Barendse et al. 1997). Genomic scans for
locating production-associated quantitative trait loci (QTL) in cattle have yielded
evidence for the existence of one (Georges
et al. 1995) or two QTL ( Velmala et al.
1999) on chromosome 6 affecting milk production. The linkage disequilibrium observed on this chromosome could be explained by the physical linkage of casein
loci, as well as selection and hitchhiking
effects connected to the QTL on this linkage group. The nonrandom genotypic association of markers in different linkage
groups found in Nordic breeds could be
explained by limited and nonexpanding Ne
during their demographic history (Slatkin
1994). Especially in Norwegian breeds,
amalgamation between different lineages
may also explain the deviation from equilibrium (see Slatkin 1994).
The Ewens–Watterson test for neutrality
suggested that the BoLA-DRBP1 and
CSSM66 microsatellites are not neutral
markers ( Table 5). The BoLA microsatellite
is located within the highly polymorphic
bovine major histocompatibility complex
(MHC) on chromosome 23. Extraordinary
diversity of the MHC region in vertebrates
may be explained by some form of balancing selection ( Edwards and Potts 1996).
CSSM66 is located on chromosome 14, and
recently a close genetic linkage between
CSSM66 and a QTL affecting milk yield,
milk fat, and protein composition has
been detected in Holstein-Friesian populations (Coppieters et al. 1998). It is possible that selection for these milk production traits has affected the CSSM66 allele
frequencies due to association with production trait alleles in the commercial
dairy cattle breeds Finnish Holstein-Friesian, Finnish Ayrshire, and Norwegian cattle, as well as in the native breeds Blacksided Troender and Nordland cattle and
Eastern Red Polled. Omitting BoLA-DRBP1
and CSSM66 from the phylogeny study,
and reconstructing the tree using 27 rather than 29 loci, resulted in a similar
branching pattern but with lower bootstrap values.
In summary, genetic divergence of the
Nordic cattle breeds in addition to withinpopulation genetic diversity is a result of
the combined effects of breed origin, the
extent of admixture occurring during
breed foundation and development, and
random genetic drift ( limited Ne during
breed foundation or recently). A few microsatellite alleles observed in only one
breed may have arisen from recent mutations after breed divergence. Some form of
selection was observed to impose on two
markers. North European breeds are significantly differentiated, indicated by the
exact test for population differentiation,
and thus their gene pools are developed
through a breed-specific evolution. The ⌰
estimates obtained by the permutation
procedure asserted the statistical significance of breed differentiation ( Table 2).
Depending on the set of markers used, 11–
12% of the total genetic variation can be
explained by differences between cattle
breeds ( Table 2). Thus the present-day
subdivision of the North European cattle
into discrete breeds is moderate, and
comparable to the extent of genetic differentiation between European cattle breeds
Kantanen et al • Genetic Diversity in North European Cattle 455
presented in the microsatellite study of
MacHugh et al. (1998).
Presently all indigenous and old North
European cattle breeds are rare or endangered, and only Icelandic cattle have a
larger census size of 30,000 breeding females. The old original Danish Jersey is
also declining due to continuous crossbreeding with American Jersey. Only the
modern Ayrshire and Holstein-Friesian
breeds have populations of relatively large
size, and therefore in Northern Europe the
Ayrshire and Friesian breed group is the
only one not endangered. It appears that
genetic variation in domestic cattle is not
only threatened by extinction of individual
indigenous breeds, but whole breed
groups are also in danger. Our study
showed that the level of effective population size of Nordic commercial cattle
breeds appears to be too low to prevent
loss of genetic variation within the species.
Based on phylogenetic analyses, the
Danish Jersey, Red Danish 1970, Danish
Shorthorn, and the group comprised of
Icelandic cattle, Swedish Mountain cattle,
and Northern Finncattle were identified as
genetically differentiated breeds. However,
the Danish breeds and Icelandic cattle
showed a reduced level of heterozygosity
and allelic diversity ( Table 1). This indicates that the effects of genetic drift occurring over the course of breed foundation and/or development have lead to an
increased divergence from other related
breeds. Thus in addition to phylogenetic
distinctness and between-population genetic variation, within-population variation and gene losses should be considered
when breeds are selected for conservation
purposes. Not only are single breeds important, but representative populations
from whole breed groups could also be
conserved in order to maintain the maximum amount of diversity of domestic cattle. Breed history may also provide a foundation for conservation. Icelandic cattle
are known to have experienced isolation
for more than 1000 years and are scientifically, historically, and culturally unique.
References
Adalsteinsson S, 1981. Origin and conservation of farm
animal populations in Iceland. J Anim Breed Genet 98:
258–264.
Baker CAA and Manwell C, 1980. Chemical classification of cattle. 1. Breed groups. Anim Blood Groups
Biochem Genet 11:127–150.
Barendse W, Vaiman D, Kemp SJ, Sugimoto Y, Armitage
SM, et al., 1997. A medium-density genetic linkage map
of the bovine genome. Mamm Genome 8:21–28.
456 The Journal of Heredity 2000:91(6)
Blott SC, Williams JL, and Haley CS, 1998. Genetic relationships among European cattle breeds. Anim Genet
29:273–282.
Bowcock AM, Ruiz-Linares A, Tomfohrde J, Minch E,
Kidd JR, and Cavalli-Sforza LL, 1994. High resolution of
human evolutionary trees with polymorphic microsatellites. Nature 368:455–457.
Bradley DG, MacHugh DE, Cunningham EP, and Loftus
RT, 1996. Mitochondrial diversity and the origins of African and European cattle. Proc Natl Acad Sci USA 93:
5131–5135.
Bradley DG, MacHugh DE, Loftus RT, Sow RS, Hoste CH,
and Cunningham EP, 1994. Zebu-taurine variation in Y
chromosomal DNA: a sensitive assay for genetic introgression in West African trypanotolerant cattle populations. Anim Genet 25:7–12.
Kantanen J, Olsaker I, Adalsteinsson S, Sandberg K,
Eythorsdottir E, Pirhonen K, and Holm L-E, 1999. Temporal changes in genetic variation of North European
cattle breeds. Anim Genet 30:16–28.
Kappes SM, Keele JW, Stone RT, McGraw RA, Sonstegard TS, Smith TPL, Lopez-Corrales NL, and Beattie CW,
1997. A second-generation linkage map of the bovine
genome. Genome Res 7:235–249.
Kidd KK and Cavalli-Sforza LL, 1974. The role of genetic
drift in the differentiation of Icelandic cattle and Norwegian cattle. Evolution 28:381–395.
Kimura M and Ohta T, 1975. Distribution of allelic frequencies in a finite population under stepwise production of neutral alleles. Proc Natl Acad Sci USA 72:2761–
2764.
Brænd M, Rendel J, Gahne B, and Adalsteinsson S,
1962. Genetic studies on blood groups, transferrin and
haemoglobins in Icelandic cattle. Hereditas 48:264–283.
Lien S, Kantanen J, Olsaker I, Holm L-E, Eythorsdottir
E, Sandberg K, Dalsgard B, and Adalsteinsson S, 1999.
Comparison of milk allele frequencies in Nordic cattle
breeds. Anim Genet 30:85–91.
Coppieters W, Riquet J, Arranz J-J, Berzi P, Cambisano
N, et al., 1998. A QTL with major effect on milk yield
and composition maps to bovine chromosome 14.
Mamm Genome 9:540–544.
Luikart G, Allendorf FW, Cornuet J-M, and Sherwin WB,
1998. Distortion of allele frequency distributions provides a test for recent population bottleneck. J Hered
89:238–247.
Cornuet JM and Luikart G, 1996. Description and power
of two tests for detecting recent population bottlenecks from allele frequency data. Genetics 144:2001–
2014.
Luikart G and Cornuet J-M, 1998. Empirical evaluation
of a test for identifying recently bottlenecked populations from allele frequency data. Conserv Biol 12:228–
237.
Crawford AM and Cuthbertson RP, 1996. Mutations in
sheep microsatellites. Genome Res 6:876–879.
MacHugh DE, Loftus RT, Cunningham P, and Bradley
DG, 1998. Genetic structure of seven European cattle
breeds assessed using 20 microsatellite markers. Anim
Genet 29:333–340.
Edwards SV and Potts WK, 1996. Polymorphism of
genes in the major histocompatibility complex (MHC):
implications for conservation genetics of vertebrates.
In: Molecular genetic approaches in conservation
(Smith TB and Wayne RK, eds). New York: Oxford University Press; 214–237.
Ewens WJ, 1972. The sampling theory of selectively
neutral alleles. Theor Popul Biol 3:87–112.
Feretti L, Leone P, and Sgaramella V, 1990. Long range
restriction analysis of the bovine casein genes. Nucleic
Acids Res 18:6829–6833.
Fisher RA, 1970. Statistical methods for research workers, 14th ed. Edinburgh: Olivier and Boyd.
Gahne B, Juneja RK, and Grolmus J, 1977. Horizontal
polyacrylamide gradient gel electrophoresis for the simultaneous phenotyping of transferrin, post-transferrin, albumin and post-albumin in the blood plasma of
cattle. Anim Blood Groups Biochem Genet 8:123–137.
Georges M, Nielsen D, Mackinnon M, Mishra A, Okimoto R, et al., 1995. Mapping quantitative trait loci controlling milk production in dairy cattle by exploiting
progeny testing. Genetics 139:907–920.
Goldstein DB, Linares AR, Cavalli-Sforza LL, and Feldman MW, 1995. Genetic absolute dating based on microsatellites and the origin of modern humans. Proc
Natl Acad Sci USA 92:6723–6727.
Goudet J, 1995. FSTAT (version 1.2): a computer program to calculate F-statistics. J Hered 86:485–486.
Graml R and Pirchner F, 1984. Relation of genetic distance between cattle breeds and heterosis of resulting
crosses. Anim Genet 15:173–180.
Grosclaude F, Guérin G, and Houlier G, 1979. The genetic map of the B system of cattle blood groups as
observed in French breeds. Anim Blood Groups Biochem Genet 10:199–218.
Guldbrandtsen B, Tomiuk J, and Loeschcke V, 2000.
POPDIST, version 1.1.1: a program to calculate population genetic distance and identity measures. J Hered
91:178–179.
Guo SW and Thompson EA, 1992. Performing the exact
test of Hardy-Weinberg proportions for multiple alleles.
Biometrics 48:361–372.
MacHugh DE, Shriver MD, Loftus RT, Cunningham P,
and Bradley DG, 1997. Microsatellite DNA variation and
the evolution, domestication and phylogeography of
Taurine and Zebu cattle (Bos taurus and Bos indicus).
Genetics 146:1071–1086.
Manly BFJ, 1985. The statistics of natural selection.
London: Chapman & Hall.
Maruyama T and Fuerst PA, 1985. Population bottlenecks and nonequilibrium models in population genetics. II. Number of alleles in a small population that was
formed by a recent bottleneck. Genetics 111:675–689.
Mazumder NK and Spooner RL, 1970. Studies on bovine
serum amylase; evidence for two loci. Anim Blood
Groups Biochem Genet 1:145–146.
Medjugorac I, Kustermann W, Lazar P, Russ I, and Pirchner F, 1994. Marker-derived phylogeny of European
cattle supports demic expansion of agriculture. Anim
Genet 25(suppl 1):19–27.
Minch E, 1997. MICROSAT 1.5b. Stanford, CA: Stanford
University.
Nei M, 1978. Estimation of average heterozygosity and
genetic distance from a small number of individuals.
Genetics 89:583–590.
Nei M, Tajima F, and Tateno Y, 1983. Accuracy of estimated phylogenetic trees from molecular data. J Mol
Evol 19:153–170.
Ohta T and Kimura M, 1973. A model of mutation appropriate to estimate the number of electrophoretically
detectable alleles in a finite population. Genet Res 22:
201–204.
Piry S, Luikart G, and Cornuet J-M, 1999. BOTTLENECK:
a computer program for detecting reductions in the effective size using allele frequencies. J Hered 90:502–
503.
Raymond M and Rousset F, 1995a. An exact test for
population differentiation. Evolution 49:1280–1283.
Raymond M and Rousset F, 1995b. GENEPOP (version
1.2): population genetics software for exact tests and
ecumenicism. J Hered 86:248–249.
Hall SJG and Bradley DG, 1995. Conserving livestock
breed biodiversity. Trends Ecol Evol 10:267–270.
Saitou N and Nei M, 1987. The neighbor-joining method: a new method for reconstructing phylogenetic
trees. Mol Biol Evol 4:406–425.
Hartl DL and Clark AG, 1989. Principles of population
genetics, 2nd ed. Sunderland, MA: Sinauer.
Slatkin M, 1994. Linkage disequilibrium in growing and
stable populations. Genetics 137:331–336.
Swofford DL and Selander RB, 1989. BIOSYS-1. A computer program for the analysis of allelic variation in
population genetics and biochemical systematics. Release 1.7. Champaign, IL: Illinois Natural History Survey.
Takezaki N and Nei M, 1996. Genetic distances and reconstruction of phylogenetic trees from microsatellite
DNA. Genetics 144:389–399.
Tomiuk J and Loeschcke V, 1991. A new measure of
genetic identity between populations of sexual and
asexual species. Evolution 45:1685–1694.
Tomiuk J and Loeschcke V, 1995. Genetic identity combining mutation and drift. Heredity 74:607–615.
Watterson GA, 1978. The homozygosity test of neutrality. Genetics 88:405–417.
Tomiuk J, Guldbrandtsen B, and Loeschcke V, 1998.
Population differentiation through mutation and drift—
a comparison of genetic identity measures. Genetica
102/103:545–558.
Weir BC, 1990. Genetic data analysis. Sunderland, MA:
Sinauer.
Tuff P, 1951. Opprinnelsen og utbredelsen av norske
feraser. Nordisk Vet Med 3:780–804.
Velmala RJ, Vilkki HJ, Elo KT, Koning DJ de, and MäkiTanila AV, 1999. A search for quantitative trait loci for
milk production traits on chromosome 6 in Finnish Ayrshire cattle. Anim Genet 30:136–143.
Weir BC and Cockerham CC, 1984. Estimating F-statistics for the analysis of population structure. Evolution
38:1358–1370.
Received April 22, 1999
Accepted July 17, 2000
Corresponding Editor: James Womack
Kantanen et al • Genetic Diversity in North European Cattle 457