Genealogical Relationship among Members of Selection and

Journal of Heredity 2010:101(2):154–163
doi:10.1093/jhered/esp102
Advance Access publication November 27, 2009
Ó The American Genetic Association. 2009. All rights reserved.
For permissions, please email: [email protected].
Genealogical Relationship among
Members of Selection and Production
Populations of Yellow Cedar (Callitropsis
nootkatensis [D. Don] Oerst.) in the
Absence of Parental Information
NASIM MASSAH, JINLIANG WANG, JOHN H. RUSSELL, ANNETTE VAN NIEJENHUIS,
YOUSRY A. EL-KASSABY
AND
From the Department of Forest Sciences, Faculty of Forestry, The University of British Columbia, 2424 Main Mall,
Vancouver, BC V6T 1Z4, Canada (Massah and El-Kassaby); Institute of Zoology, Zoological Society of London, London,
UK (Wang); British Columbia Ministry of Forests and Range, Research Branch, Cowichan Lake Research Station, Box 335,
Mesachie Lake, BC, Canada (Russell); and Saanich Forestry Centre, Western Forest Products Limited, Saanichton, BC,
Canada (Van Niejenhuis).
Address correspondence to Y. A. El-Kassaby at the address above, or e-mail: [email protected].
Abstract
We used DNA fingerprinting and pedigree reconstruction to determine the genetic relationship among members of
3 yellow-cedar (Callitropsis nootkatensis [D. Don] Oerst.) selection populations in the absence of their parental genotypes.
Selection population members consisted of the tallest individuals within seedling crops originated from natural stand seed
collected from multiple seed donors covering wide areas within 3 distinct locations (phenotypic mass selection). Pairwise
relative kinship estimates indicated the presence of extensive coancestry among the selected seedlings, and pedigree
reconstruction grouped each selection members into multiple full-sib families of different sizes (1–10) nested within several
half-sib families (19–21). The ‘‘STRUCTURE’’ program (Pritchard JK, Stephens M, Donnelly P. 2000. ‘‘Inference of
population structure using multilocus genotype data.’’ Genetics. 155:945–959.) provided a pictorial classification of the
3 selection populations and grouped their individuals into multiple cohorts (9–10). The STRUCTURE program’s results
corresponded with that of the pedigree reconstruction, indicating that members of the selection populations originated from
a subset of the seed donors forming the natural stand seed collections. The species’ silvics, reproductive biology, methods of
natural stand seed collection and seedling production, and the high selection intensity applied to form the selection
populations contributed to limiting the selection to a subset of the original donor trees. The associated buildup of
coancestry in selection and production populations is expected to result in inaccurate estimation of genetic parameters and
an unintentional reduction in genetic diversity in reforestation stocks.
Key words: Callitropsis nootkatensis, clonal testing, DNA fingerprinting, pairwise relative kinship, pedigree reconstruction, selection
The recurrent selection scheme of tree improvement
programs follows 3 main steps: phenotypic selection of
candidate trees from natural stands or plantations; breeding,
wherein mating designs are commonly used to create
structured pedigreed material for testing, evaluation, and
ranking of selected parents based on their offspring’s
performance; and the identification of elite genotypes for
production population(s) establishment and/or starting
a new round of selection (Namkoong et al. 1988). The
154
robustness of this approach has been proven, and substantial genetic gain has been captured for many species for
a multitude of attributes (e.g., Namkoong 1979; Xiang et al.
2003). This classical approach with its predictable mechanics
(i.e., breeding, testing, and selection) remained unchanged;
however, species biology such as predisposition to cloning
through vegetative propagation or somatic embryogenesis
provided opportunities to increasing testing fidelity (Russell
and Cartwright 1991; Heilman 1999; Sutton et al. 2004) and
Massah et al. Genealogy of Yellow Cedar
the application of clonal multiplication for deployment
(Karlsson and Russell 1990; Russell et al. 1990; Heilman
1999; Cyr et al. 2001).
Yellow cedar (Callitropsis nootkatensis [D. Don] Oerst.),
formerly known as Chamaecyparis nootkatensis [D. Don]
Spach), is an important timber species in Western North
America covering wide geographical and ecological range
spanning up to 20° in latitude and elevation from sea level
up to tree line (Harris 1990) (Figure 1). The species
taxonomy has gained recent attention since the discovery of
new species (Xanthocyparis vietnamensis [Farjon et Hiep]) in
Northern Vietnam (Farjon et al. 2002; Little et al. 2004), and
its contemporary taxonomy is the subject of intense debate
(Gadek et al. 2000; Ritland et al. 2001; Wang et al. 2003).
The species has a 3-year reproductive cycle as adaptation to
its high elevation habitat (Owens and Molder 1974, 1975,
1977); however, this cycle is plastic and is reduced to 2 years
in its southern and at low elevation in its northern range
(El-Kassaby et al. 1991).
An innovative approach to reducing efforts as well as
shortening the length of the breeding cycle was attempted
for yellow cedar. This scheme involved substituting
structured mating designs with wind-pollination matings
among trees in natural stands and the application of 2
phases of selection. The first was a high intensity
phenotypic mass selection (Allard 1960, p. 252) among
natural crosses’ bulked offspring conducted at very early
age (1-year-old seedlings), whereas the second took place
after intense field testing over multiple sites and years using
clonal material produced by vegetative propagation of the
Figure 1.
first selection phase members. The intent of the second
selection phase is to identify elite genotypes for their
inclusion into production populations. This approach was
successful in capturing appreciable amount of genetic gain;
however, the extent of this gain cannot be ascertained
specifically when the genetic relationship among members
of the first and second selection phases is unknown.
Genetic gain calculation for clonal selection differs from
that of half-sib families. The former is similar to that for
monozygotic twins and assumes that the covariance among
clones equals 100% of the additive genetic variance,
whereas the latter assumes the covariance among half-sibs
equals 25% of the additive genetic variance (Falconer and
Mackay 1996). Thus, if genetic gain calculations are based
on clonal material, as in the case when the genealogy
among selections is unknown, the estimated gain will be
higher than that based on structured half- and full-sib
families.
The objectives of this study are to determine 1) the
genealogical relationships among the first and second phase
selection members and 2) effective population size (extent
of genetic diversity) for the testing and deployment
populations. If members of the first or second phase
selections are related, then the original genetic parameters
derived before considering the presence of coancestry
require adjustment, and care must be directed toward the
vegetative multiplication of the production population
members to avoid potential reduction in genetic diversity
in reforestation material through the amplification of the
related material.
The range of yellow cedar in the Pacific North West (Harris 1990) and the location of the 3 studied populations.
155
Journal of Heredity 2010:101(2)
Materials and Methods
Seedlings representing 3 first-phase selection populations
originated from natural stand seed collections made in
Vancouver Island, British Columbia (Table 1; Figure 1),
were used in this study. At each location, seed was collected
using a helicopter equipped with cone rakes to strip seed
cone–bearing branches from tree tops. Collections were
made from 29, 30, and 36 nonneighboring trees to form
seedlots #1, 2, and 5, respectively; thus, it was assumed that
seed donors acted as pollen recipients from their restricted
neighborhood. For each location, the collected seed cones
were bulked prior to seed extraction; therefore, the genetic
identity of the seedlings produced from these seedlots was
unknown. The seedlots were sown in a commercial nursery
to produce seedlings (several hundred thousands) for
reforestation purposes. At the end of the nursery growing
season, the healthiest and tallest 1-year-old seedlings within
each seedlot were selected to form the testing populations
(phase 1 selection). A total of 170, 271, and 133 individual
seedlings were selected from seedlots #1, 2, and 5,
respectively. The selection populations (seedlings) were
planted in an arboretum and were repeatedly topped to form
‘‘stool beds’’ for the mass production of small shoots that
were used to produce clonal material for field testing
through rooting. The resulting seedlings (ramets) were
planted in multiple replicated clonal trials over Vancouver
Island. Nine years after the trial establishment, several
growth and yield attributes were assessed for each individual
within all test sites, and quantitative genetics analyses were
conducted to identify the top performing individuals
(clones) within each selection population. The top 5, 17,
and 8 clones representing seedlots #1, 2, and 5, respectively,
were selected as members of the production population.
Fresh foliage samples from each member of the selection
populations were collected from 5 test sites near Port
McNeill (3 sites) and Jordan River (2 sites), Vancouver
Island, British Columbia. Collected samples were placed in
marked plastic bags, stored on ice, and shipped to UBC for
further processing. The tips of each branchlet sample were
placed in a vial tube (2 ml) and stored at 80 °C until DNA
extraction.
DNA extraction followed the Doyle and Doyle (1987)
protocol after optimizations to remove organic compounds
such as terpenes, proteins, and phenols (Barton 1976).
Extracted DNA was qualified and quantified, and each
sample was diluted to 20 ng/ll and stored at 20 °C for
polymerase chain reaction (PCR) amplification. Four yellow
cedar–specific microsatellite primers (Y1F10, Y1G09,
Y2C12, and Y2H01) (Bérubé et al. 2003), one (Cos2590)
of 19 primers tested on Japanese cypress (Chamaecyparis
obtusa), a closely related species (Nakao et al. 2001;
Matsumoto et al. 2006), and 1 of 6 tested primers obtained
from the C. obtusa available expressed sequence tag (EST)
data, accessed via genebank, were selected for genotyping.
We used the SSR finder program (Robinson et al. 2004) to
search through the C. obtusa available EST clones to detect
microsatellite sequences with the following search criteria:
1) primer minimum, optimum, and maximum temperature
were 50, 55, and 65 °C, respectively, and maximum
temperature difference was set at default, 2) primers# G-G
content % (GC%) was set at minimum, optimum, and
maximum of 40%, 50%, and 60%, respectively, and 3) the
default settings were used for the rest. Because GC has
3 binding sites, we set the percentage of their existence on
high in order to reach a better binding and amplification.
PCR reaction conditions were set as recommended by the
primer developers, and PCR reaction products were
electrophoresed on polyacrylamide gels using a LiCor
4200 automated sequencer (LiCor Inc., Lincoln, NE).
For more accurate genotyping and minimizing scoring
errors, a search for allelic variation was done using 2 subsets
of 30 individuals from seedlots #1 and 2, and then the
selected 6 loci were amplified and genotyped with the
SAGA GT (LiCor Biosciences, Lincoln, NE, United States)
software. In addition to the standard marker/DNA ladder,
samples that captured allelic variation within each locus
were selected, mixed, and used as a DNA reference of
known allele size. Each individual PCR plate contained
samples from 2 seedlots at a time to aid in correct allelic
detection, and all gels were scored twice.
Data analyses were conducted using 3 independent
approaches: the first to explore if genetic relationship
existed among the selected seedlings within each seedlot
using Ritland’s (1996) ‘‘MARK’’ program, the second to
construct the pedigree within each seedlot and classify the
selected seedlings into multiple full-sib families nested
within half-sib families using Wang’s (2004) ‘‘COLONY’’
program (it was considered as the primary analysis), and
finally, Pritchard et al.’s (2000, 2007) STRUCTURE program to provide a pictorial representation of cohorts within
each seedlot. The first 2 analyses were used to assist in
hypothesis development (MARK) and testing (COLONY),
whereas the third analysis (STRUCTURE) was used to
verify the results obtained from the COLONY program.
The ‘‘MA R K’’ program (Ritland 1996) was used to
calculate the genetic relationship among each possible dyad
within each seedlot by pairwise kinship analysis. The kinship
Table 1. Location of the 3 yellow-cedar seed collections, the number of seed donors, number of seedlings selected during the firstphase selection, and their seedlot registration numbers
Seedlot
Location
Latitude
Longitude
Elevation
# Of seed donors
# Of selected seedlings
1
2
5
Port McNeill
Shawnigan
Adams River
50°19#
48°45#
50°25#
127°08#
124°00#
126°05#
550
800
570
30
29
36
170
271
133
156
Massah et al. Genealogy of Yellow Cedar
coefficients were classified into 5 biologically meaningful
distinct groups (0.0, 0.125, 0.250, 0.375, and 0.5, representing unrelated, half-sib, full-sib, full-sib with inbred parents,
and inbred, respectively). The COLONY program ver. 1.3
(Wang 2004) was used to reconstruct all sample relationships as a cohort by applying their genotype data after
setting genotyping error rate at 5%. The COLONY program
uses group-likelihood methods, and individuals are studied
as a group after integrating the genotypic information of all
samples. However, it is restricted to distinguish first-degree
relationships such as full-sibs and half-sibs in a cohort. The
STRUCTURE program is a model-based method, which
utilizes the Bayesian algorithm to cluster individuals within
a population into genetically similar cohorts based on their
multilocus genotypes (Pritchard et al. 2000). The program
calculates every locus allele frequency separately for each
population and clusters individuals to cohorts based on their
posterior probabilities (Q) calculated from their genotype
data (Pritchard et al. 2000). This analysis is commonly used
in population structure–related studies to allocate each
entity to its genetically related cohort even in cases where
there is no prior parental or population structure information or in cases where entities are originated from
more than one population. STRUCTURE program ver. 2.2
(Pritchard et al. 2000) was used with a length of burn-in
period parameter set at 50 000 (burn-in length between
10 000 and 100 000 were recommended by the author), and
the same number was taken for Markov chain Monte Carlo
replications after burn-in. We set K (cluster) to range
between 1 and 20 with number of independent runs set at
40 (i.e., each K was simulated 40 times). The number of
most likely cohorts was determined using the ad hoc statistic
known as DK, which is based on the log probability change
across successive K values (Evanno et al. 2005).
Genetic diversity of a population (selection or production) can be expressed as the effective population size,
which describes the proportion of individuals from various
parents and their genetic relationships. Considering related
and/or inbred individuals, the concept of status number was
used (Lindgren et al. 1996) and denoted as Ne in this study
(Ne 5 ½H). Group coancestry, H, represents the average
coancestry of all pairs of population members including selfcoancestry and reciprocals (Cockerham 1967).
Results
The kinship analysis (Ritland 1996) estimated all possible
pairwise relative kinship coefficients among individuals
within each selection population (n(n 1)/2) (Figure 2).
The mean kinships over all individual pairs within the
studied 3 populations were 0.061(standard error 5 0.00067),
0.052 (0.00041), and 0.098 (0.00140), respectively. This
analysis provided insight of the genetic relationship among
individuals within each of the 3 selection populations. In
cases where no genetic relationship exists among the
selected seedlings (relative kinship (r) 0.0), it is expected
that all of the pairwise combinations will produce a value of
Figure 2. Distribution of pairwise relative kinship estimates
among members of 3 yellow-cedar selection populations (top:
Port McNeill [seedlot #1]; middle: Shawnigan [seedlot #2]; and
bottom: Adams River [seedlot #5]). Pairwise values were
classified under 5 biologically meaningful classes with
midpoints of 0.0, 0.125, 0.25, 0.375, and 0.5.
zero. However, the pattern obtained from the 3 selection
populations indicated that the selected seedlings belonged to
groups of half-sib families (r 5 0.125) with several smaller
full-sib families (r 5 0.250) nested within these half-sib
families (Figure 2). It should be pointed out that the
pairwise kinship estimate representation of Figure 2 is done
to portray biologically meaningful scenarios as the obtained
kinship estimates of every selection population covered
a continuum between 0 and 1 (for instance, 0.125 class
represents a range of relative kinship estimates between 0.064
and 0.188). Additionally, the 3 seedlots produced 2 very small
peaks of higher coancestry, indicating that some individuals
resulted from mating between inbred parents and/or were
a product of selfing (Figure 2). This pattern is consistent
with the grouping of each population’s individuals into sets of
157
Journal of Heredity 2010:101(2)
full-sib families with different sizes nested within their
respective half-sib families that were obtained from the
pedigree reconstruction analyses (see below).
The pedigree reconstruction analysis (Wang 2004) was
conducted based on the following assumptions: 1) seed
donors received pollen from multiple male parents present
in their vicinity (i.e., mother trees were polygamous) and 2)
self-fertilization would fail to produce viable seed due to the
high genetic load, or selfed seedlings would have a selective
disadvantage and thus would not be included in the
selection populations (see Discussion). The resulting
analyses grouped the selected seedlings into multiple fullsib families nested within 19, 21, and 17 half-sib families, for
seedlots # 1, 2, and 5, respectively (Figure 3). The full-sib
family size within these half-sib families varied and ranged
between 2 and 5, 1 and 10, and 1 and 4, for seedlots # 1, 2,
and, 5, respectively (Figure 3). The production population
members’ genetic status was determined from the pedigree
reconstruction, and half-sib membership was detected for
selection populations # 2 (6 HS families accounting for 14
of the 17 selections) and 5 (2 HS families accounting for 5
of the 8 selections) (Table 2).
Following the recommendations of Bradbury et al.
(2008), the number of iterations for running the STRUCTURE program was set at 40 to allow reaching stable
results. The pictorial classification of cohorts within each
selection population was presented graphically, where every
cohort is defined with a specific color, and individuals are
represented by vertical bars demonstrating their cohort
membership posterior probability. The STRUCTURE program grouped the selected population members into 9
(seedlot # 2) and 10 (seedlots # 1 and 5) cohorts (Figure 4).
For each selection population, some correspondence was
observed between the COLONY and STRUCTURE results
as presented by the size of the largest single half-sib family
within a specific genetic cohort (Table 3). Although half-sib
Figure 3. Pictorial presentations of pedigree reconstruction results for 3 yellow-cedar selection populations (top: Port McNeill
[seedlot #1]; middle: Shawnigan [seedlot #2]; and bottom: Adams River [seedlot #5]). Individuals with each half-sib family are
grouped (nested) within their respective full-sib families.
158
Massah et al. Genealogy of Yellow Cedar
Table 2. Number of genetic classes within the production
population (elite genotypes) and the production population
effective population size (Ne) estimates under genetic
independence (no genetic relationship) and the COLONY family
classification assuming equal contribution to the reforestation
planting stock
Elite genotypes
Ne
Seedlot
Half-sib Half-sib
#
Unrelated (size 5 2) (size 5 3) Independent COLONYa
1
2
5
a
5
3
3
—
4
1
—
2
1
5
17
8
5.0
13.1
6.4
Wang 2005.
family membership was not restricted to a single genetic
cohort, some family representation was high and ranged
between 18% and 61% of the cohort’s size, indicating that
seed donors contributed unequally to the resultant seedlots
and selection reflected their seedlings phenotypic superiority.
Discussion
The attempt to circumvent the arduous long-term classical
breeding program by substituting parent-tree selection and
structured mating designs with extensive early selection
from naturally produced population of seedlings is novel. It
has good potential to shorten the length and substantially
reduce both costs and efforts of breeding programs. This
approach is based on several assumptions that include 1) the
nursery seedling population genetic structure is not affected
by the species silvics and its peculiar seed germination and
dormancy, 2) members within each selection population are
genetically independent, 3) nursery truncation selection does
not limit the selected seedlings to few seed donors (families),
and 4) the ageage correlation between nursery and field
performance is reasonably high.
Several factors must be considered during the interpretation of results obtained from pairwise, pedigree
reconstruction, and STRUCTURE analyses. These are
related to the genetic structure within the 3 natural stand
seedlots as well as the expected level of coancestry present
among their respective seedlings. Although the number of
seed-cone donors forming each seedlot is known, their
proportionate contribution to the bulk seedlot and subsequently their successful input to the seedling crops are
unknown. The reproductive biology of yellow cedar is
characterized by a 3-year cycle, an adaptation to its high
elevation habitat (Owens and Molder 1974). When seed
cones are collected, trees often carry a mixture of 1- and
2-year cones that are difficult to differentiate. The collected
cone crops contain both developmentally mature and
immature seed, and therefore, the volume of cones collected
from a particular seed donor is not indicative of its seed
contribution. Even when a donor contributes substantially
Figure 4. Results from the STRUCTURE program for 3 yellow-cedar selection populations (top: Port McNeill [seedlot #1];
middle: Shawnigan [seedlot #2]; and bottom: Adams River [seedlot #5]). Scale represents individuals’ group membership
posterior probability.
159
Journal of Heredity 2010:101(2)
Table 3. Correspondence between the COLONY and
STRUCTURE programs represented by the size of the largest
half-sib family within a single genetic cohort (%)
Genetic cohorts K (cluster)
Seedlot #
1
2
3
4
5
6
7
8
9
10
1
2
5
20
22
38a
23
61
33
31
44
33
42
44
22a
55
38
43
23
50
18b
23
35
19b
47
29
22
33
27
21
29
—
36
a
Two half-sib families with the same size.
b
Three half-sib families with the same sizes.
to the seed crop, this does not guarantee increased
representation in the seedling crop, due to genetic variation
in seed germination and the deep dormancy of fully
developed seed (Pawuk 1993; El-Kassaby 1995).
The species is commonly present in small patches
(clusters) with many scattered individuals (Harris 1990);
thus, nonneighboring seed-cone donors are expected to
receive pollen from their adjacent locations (i.e., they are
spatially and may be temporally isolated). Seed-cone
collections originating from multiple nonneighboring trees
are expected to represent multiple ‘‘local’’ gene pools, thus
increasing the chance for greater differentiation among the
wind-pollinated maternal half-sib families originated from
these seedlots. Owens and Molder (1977) estimated that
mature cones averaged 7.2 seeds per cone with only 29%
being filled, thus yielding only about 2 filled seeds per cone.
This low yield may result from sparse pollination, increased
abortion of selfed seed (Bishir and Pepper 1977), and the
rarity of 2 consecutive years of good environmental
conditions, needed for pollination, fertilization, and embryo
and seed development (El-Kassaby 1995).
The collective genetic structure of seedling populations
originating from natural stand seed is expected to harbor
high level of variability, reflecting the variation among the
maternal parents (seed donors) as well as their multiple local
pollen pools. British Columbia’s 2003–2008 average number
of yellow-cedar seedlings produced from natural stand seed
sources is 900 K (SPAR: BC Ministry of Forests and Range
2009). Natural stand seedlots are used to produce seedling
crops of size varying between 50 and 100 K seedlings.
Under mass selection for outcrossing species (Allard 1960,
p. 252), factoring in the total number of seedlings selected
from the 3 studied seedlots (average: 191; range: 133–271), it
is safe to assume that the selection intensity applied was
close to 1:500–1:1000. Thus, the high selection intensity
applied in the nursery to form the testing populations is
expected to limit the selection to a subset of the original
number of parents forming these seedlots. Considering
these factors collectively, it is not surprising that the results
from either the COLONY or STRUCTURE programs
produced a smaller number of parents/cohorts as compared
with the original seed donors forming the seedlots.
The pattern obtained from the pairwise kinship analyses
(Ritland 1996) confirmed the presence of half- and full-sib
families (Figure 2) and allowed the formulation of the main
160
hypotheses that no genetic relationship exists among the
nursery stage selected seedlings (selection populations) or
among the selected elite genotypes forming the production
populations. Naturally, the presence of sib-ship relationship
among these individuals formed our alternate hypothesis.
We therefore employed the COLONY program (Wang
2004) to determine genetic membership among the selected
seedlings within each population. We assumed that the seed
donors receive pollen from multiple pollen donors
(polygamous) and that self-fertilization does not result in
viable seed, or self-seedlings are selected against at the
nursery stage during the removal (thinning) of excess
germinants, or seedling development during the growing
season resulted in inferior seedlings (El-Kassaby and
Thomson 1995; El-Kassaby 2000). Thus, self-seedlings
would not be included in the selection populations. This
assumption is supported by reports of inbreeding depression
causing embryo development failure or selective disadvantage of selfed germinants, seedlings, and saplings has been
reported for multiple conifers including yellow cedar
(Sorensen 1969, 1982; Bramlett and Popham 1971; Koski
1971, 1973; Bishir and Pepper 1977; Bishir and Namkoong
1987; Namkoong and Bishir 1987; Savolainen et al. 1992;
Williams and Savolainen 1996; Williams et al. 1999).
Forest trees commonly practice a mixed-mating system
with reported high levels of outcrossing rate (O’Connell
2003) and often show low levels of correlated matings
(O’Connell et al. 2001; Liewlaksaneeyanawin 2006). However, low selfing might be contradictory to the application of
Wang’s (2004) COLONY pedigree reconstruction program.
Like other members of the Cuperssaceae family, yellow
cedar can self-fertilize, and high inbreeding coefficients were
reported by Ritland et al. (2001) (mean rangewide F 5 0.18,
also see Thompson et al. 2008). Using Wright’s (1965)
approach (t 5 (1 FST)/(1 þ FST)), we estimated an
inbreeding coefficient of F 5 0.08 for the 3 studied
populations. The difference between Ritland et al. (2001) and
the present study’s inbreeding coefficient estimates may
reflect the high selection intensity applied in the greenhouse
to form the 3 test populations. The observed pattern from
the pairwise analyses (Figure 2) as well as reported inbreeding
coefficient estimates for the species and that of the present
study could support the notion that some members of the
classified full-sib families may be the result of selfing.
The pedigree reconstruction analyses (Wang 2004)
grouped the test population members into multiple full-sib
families nested within half-sib families (Figure 3). A total
of 19, 21, and 17 half-sib families were observed within
seedlots #1, 2, and 5, respectively, and multiple full-sib
families with variable sizes (2–5, 1–10, and 1–4, for seedlots #
1, 2, and 5, respectively) were nested within them (Figure 3).
The observed number of half-sib families with their
variable sizes confirmed that the strict truncation selection
applied in the nursery during the test population formation
successfully eliminated some as well as restricted the
selection to a subset of seed donors. The observed family
structure reduced the testing populations’ effective population size to 39.9, 37.6, and 28.0 for seedlots # 1, 2, and 5,
Massah et al. Genealogy of Yellow Cedar
respectively, as compared with their census numbers (average
191; range: 133–271).
The STRUCTURE analyses provided pictorial classification of each seedlot presented by the program’s bar plots
(Figure 4). Cohorts are defined by different colors where
individuals are represented by vertical bars demonstrating
the posterior probability (q) of each individual belonging to
a specific cohort (Figure 4). It should be noted that the
posterior probability of some individuals indicates that they
belong to more than one cohort (i.e., individuals are
represented by 2 or more colors), or it could be the result of
incomplete information caused by missing data (genotype)
(Bergel and Vigilant 2007). In general, the number of
cohorts within a test population was smaller than the halfsib family number obtained from the pedigree reconstruction and ranged between 9 and 10 (Figure 4; Table 2). This
reflects the difference between COLONY and STRUCTURE, where the former is based on the exact nature of
jointly assigning parentage and sib-ships based on their
multilocus genotypes and the latter on providing a group
membership posterior probability of individuals.
Combining pedigree reconstruction and STRUCTURE
program in analyzing natural populations has been attempted by Vähä et al. (2007) in studying the natal homing of
Atlantic salmon populations. They repeatedly used the
STRUCTURE program to stratify the populations in
sequential manner, and finally, they used pedigree reconstruction (the COLONY program) to identify the
genealogical relationship among the spawning individuals
within each stratum. In the present study, the test
populations were already stratified as they originated from
seedlots collected from different locations (Figure 1). Our
approach was similar and the use of the 2 analytical
approaches (i.e., pedigree reconstruction and STRUCTURE) assisted in unraveling the genetic similarity/differences among each seedlot’s selected seedlings. However, it
should be noted that the STRUCTURE and COLONY
program results did not completely match in spite of using
the same multilocus genotypic data. These discrepancies
could be caused by violating the STRUCTURE program’s
assumptions, specifically those pertaining to the presence of
family structure within the seedling populations (Pritchard
and Wen 2004; Camus-Kulandaivelu et al. 2007). Under
increasing number of groups, the STRUCTURE program
analysis starts by identifying the most genetically distinct
groups followed by the identification of the less distinct
groups. The presence of related individuals creates sufficient
correlation among these individuals, forcing them to be
identified as a genetic group. This situation was unintentionally encountered by Vähä et al. (2007) when they
repeatedly applied the STRUCTURE analysis to stratifying
their Atlantic salmon populations into subgroups and finally
realized that the final ‘‘Structure’’ represented full- and halfsib families. In essence, we followed Yu et al.’s (2006)
approach to identify possible family structure in the
analyzed populations.
Results from the COLONY and STRUCTURE programs were compared to investigate the presence of any
correspondence between the 2 approaches (Table 2).
Although the COLONY and STRUCTURE programs
produced different number of groups (half-sib families
and cohorts, respectively), some correspondence between
their results is apparent by the size of the largest single halfsib family within a specific genetic cohort (Table 2).
Although half-sib family membership was not restricted to
a single genetic cohort, some family representation was high
and ranged between 18% and 61% of the cohort’s size,
indicating that seed donors may have contributed differently
to the seedlots and that some members of different cohorts
shared common alleles. Additionally, some cohorts included
members of more than one half-sib family (cohorts #1 and
4 encompassed 2 half-sib families, each representing 38% of
the cohort size, whereas cohorts # 6 and 7 included 3 halfsib families representing 18% and 19% of the cohort size,
respectively; Table 2).
The ultimate goal of the implemented selection program
was to identify elite genotypes for their inclusion in
production populations. After field testing, the second
selection phase resulted in identifying 5, 17, and 8
individuals, representing seedlots #1, 2, and 5, respectively,
as determined by the COLONY analyses (Table 3). These
selected genotypes will be used to produce large numbers of
rooted cuttings for reforestation. If these genotypes are
related, then it is expected that the genetic diversity of the
deployed material will be lower than that assumed under no
relationship; in fact, the effective population sizes of the
selection populations were found to be lower than their
perceived census numbers (see Table 3). With the exception
of seedlot # 1 where no genetic relationship was detected,
seedlots # 2 and 5 contained 5 and 2 half-sib families with
different sizes, respectively, resulting in effective population
sizes of 13.1 and 6.4 as opposed to census numbers of 17
and 8 (Table 3).
The utilization of matings among parents in their natural
settings coupled with the implementation of 2 rounds of
selection present a novel approach to breeding and selection
of forest trees and provides an opportunity to reducing
breeding time, efforts, and resources. The present study
demonstrates the necessity of applying molecular biology
techniques such as DNA fingerprinting and pedigree
reconstruction as essential tools to advance these new
breeding methods. The pioneering work of Lambeth et al.
(2001) and Grattapaglia et al. (2004) opened new avenues to
molecular tree breeding, and this work has been followed by
the development of even bolder approaches such as
‘‘Breeding Without Breeding’’ (El-Kassaby and Lstibůrek
2009). Tree breeding must capitalize on ongoing scientific
developments in molecular biology, spatial statistics,
parentage assignment, and association genetics to advance
population improvement to new frontiers.
Conclusion
The implementation of an unconventional approach to tree
improvement that deviates from classical breeding schemes
161
Journal of Heredity 2010:101(2)
was attempted. This approach forfeits the phenotypic
selection of ‘‘superior genotypes’’ phase as well as the use
of conventional mating designs for creating structured
pedigree for testing. The method involves high selection
intensity at very early age (1 year) from offspring originated
from natural crosses among parent trees in their native
habitats, followed by exploitation of the species’ biology
(easy vegetative propagation) for the production of clonal
material for long-term extensive field testing. Using this
approach, the pedigree of the offspring selected for testing
is unknown, which could cause inappropriate estimation of
quantitative genetics parameters as well as the deployment
of genetically related material, thus reducing genetic
diversity. We applied DNA fingerprinting and pedigree
reconstruction to unravel the presence of extensive family
structure among the selected offspring. In light of the
findings, we recommend revision of 1) genetic parameter
calculations based on assumed independence among tested
individuals (i.e., clones) and 2) genetic diversity estimates of
deployment material, as member propagules of half-sib
families are expected to produce lower effective population
size (a clone’s effective population size is equal to census
number, whereas a half-sib’s effective population size is
lower than the census number).
Funding
Johnson’s Family Forest Biotechnology Endowment, British
Columbia Forest Investment Account Forest Genetic
Conservation and Management program, Natural Sciences
and Engineering Research Council of Canada—Discovery
and Industrial Research Chairs Program grants to Y.A.K.
and Natural Resources Canada—Science and Technology
Internship Program to N.M.
Camus-Kulandaivelu L, Veyrieras J-B, Gouesnard B, Charcosset A,
Manicacci D. 2007. Evaluating the reliability of STRUCTURE outputs in
case of relatedness between individuals. Crop Sci. 47:887–892.
Cockerham C. 1967. Group inbreeding and coancestry. Genetics.
56:89–104.
Cyr DR, Attree SM, El-Kassaby YA, Ellis DD, Polonenko DR, Sutton BCS.
2001. Application of somatic embryogenesis to tree improvement in
conifers. In: Morohoshi N, Komamine A, editors. Molecular breeding of
woody plants. Elsevier Science B.V. p. 305–312.
Doyle JJ, Doyle JL. 1987. A rapid DNA isolation procedure from small
quantities of fresh leaf tissues. Phytochem Bull. 19:11–15.
El-Kassaby Y, Lstibůrek M. 2009. Forthcoming. 2009. Breeding without
breeding. Genet Res. 91:111–120.
El-Kassaby YA. 1995. The fitness of reproductive plasticity in yellow-cedar
(Chaemacyparis nootkatensis). Silvae Genet. 44:217–218.
El-Kassaby YA. 2000. Representation of Douglas-fir and western hemlock
families in seedling crops as affected by seed biology and nursery crop
management practices. For Genet. 7:305–315.
El-Kassaby YA, Maze J, Macleod DA, Banerjee S. 1991. Reproductive cycle
plasticity in yellow-cedar (Chamaecyparis nootkatensis). Can J For Res. 21:
1360–1364.
El-Kassaby YA, Thomson AJ. 1995. Effect of seed biology and common
seedling nursery production practices on parental contribution rank and
relative performance. For Sci. 42:228–235.
Evanno G, Regnaut G, Goudet J. 2005. Detecting the number of clusters of
individuals using the software STRUCTUTRE: a simulation study. Mol
Ecol. 14:2611–2620.
Falconer DS, Mackay TFC. 1996. Introduction to quantitative genetics. 4th
ed. New York: Longman.
Farjon A, Nguyen TH, Harder DK, Phan KL, Averyanov L. 2002. A new
genus and species in Cupressaceae (Coniferales) from Northern Vietnam,
Xanthocyparis vietnamensis. Novon. 12:179–189.
Gadek PA, Alpers DL, Heslewood MM, Quinn CJ. 2000. Relationships
within Cupressaceae sensu lato: a combined morphological and molecular
approach. Am J Bot. 87:1044–1057.
References
Grattapaglia D, Ribeiro VJ, Rezende GDSP. 2004. Retrospective selection
of elite parent trees using paternity testing with microsatellite markers: an
alternative short term breeding tactic for Eucalyptus. Theor Appl Genet.
109:192–199.
Allard RW. 1960. Principles of plant breeding. New York: John Wiley and
Sons Inc.
Harris AS. 1990. Alaska-cedar. Silvics of North America: 1. Conifers.
Agriculture Handbook 654. Washington (DC): USDA Forest Service.
Barton GM. 1976. A review of yellow cedar (Chamaecyparis nootkatensis
[D. DON] SPACH) extractives and their importance to utilization. Wood
Fiber. 8:172–176.
Heilman PE. 1999. Planted forests: poplars. New For. 17:89–93.
Bergel RA, Vigilant L. 2007. Genetic analysis reveals population structure
and recent migration within the highly fragmented range of the cross river
gorilla (Gorilla gorilla). Mol Ecol. 16:501–516.
Bérubé Y, Ritland C, Ritland K. 2003. Isolation, characterization, and crossspecies utility of microsatellites in yellow cedar (Chamaecyparis nootkatensis).
Genome. 46:353–361.
Bishir J, Namkoong G. 1987. Unsound seed in conifers: estimation of
numbers of lethal allele and magnitude of effect associated with maternal
parent. Silvae Genet. 36:180–185.
Bishir J, Pepper WD. 1977. Estimation of the number of embryonic lethal
alleles in conifers: I. Self-pollinated seed. Silvae Genet. 26:50–54.
Bradbury IR, Campana SE, Bentzen P. 2008. Estimating contemporary
early life-history dispersal in an estuarine fish: integrating molecular and
otolith elemental approaches. Mol Ecol. 17:1438–1450.
Bramlett DL, Popham TW. 1971. Model relating unsound seed and
embryonic lethals in self-pollinated pines. Silvae Genet. 20:192–193.
162
Karlsson IK, Russell JH. 1990. Comparison of yellow cypress trees of
seedling and rooted cutting origins after 9 and 11 years in the field. Can J
For Res. 29:37–42.
Koski V. 1971. Embryonic lethals of Picea abies and Pinus sylvestris. Commun
Inst For Fenn. 75:1–30.
Koski V. 1973. On self-pollination, genetic load and subsequent inbreeding
in some conifers. Commun Inst For Fenn. 78:1–42.
Lambeth C, Lee BC, O’Malley D, Wheeler N. 2001. Polymix breeding with
parental analysis of progeny: an alternative to full-sib breeding and testing.
Theor Appl Genet. 103:930–943.
Liewlaksaneeyanawin C. 2006. Genetic evaluation of natural and domesticated lodgepole pine populations using molecular markers. PhD thesis.
Vancouver (BC): The University of British Columbia.
Lindgren D, Gea LD, Jefferson PA. 1996. Loss of genetic diversity
monitored by status number. Silvae Genet. 45:52–59.
Little DP, Schwarzbach AE, Adams RP, Hsieh C. 2004. The circumscription and phylogenetic relationships of Callitropsis and the newly described
genus Xanthocyparis (Cupressaceae). Am J Bot. 91:1872–1881.
Massah et al. Genealogy of Yellow Cedar
Matsumoto A, Tani N, Li X, Nakao Y, Tomaru N, Tsumura Y. 2006.
Development and polymorphisms of microsatellite markers for hinoki
(Chamaecyparis obtusa). Mol Ecol Notes. 6:310–312.
Nakao Y, Iwata H, Matsumoto A, Tsmura Y, Tomaru N. 2001. Highly
polymorphic microsatellite markers in Chamaecyparis obtusa. Can J For Res.
31:2248–2251.
Namkoong G. 1979. Introduction to quantitative genetics in forestry.
Washington (DC): US Department of Agriculture, Forest Service. Tech
Bulletin No 1588.
Namkoong G, Bishir J. 1987. Frequency of lethal alleles in forest tree
populations. Evolution. 41:1123–1127.
Namkoong G, Kang HC, Brouard JS. 1988. Tree breeding: principles and
strategies. New York: Springer-Verlag Monograph, Theo Appl Genet 11.
O’Connell LM. 2003. The evolution of inbreeding in western redcedar
(Thuja plicata: Cupressaceae). PhD thesis. Vancouver (BC): The University
of British Columbia.
O’Connell LM, Viard F, Russell J, Ritland K. 2001. The mating system in
natural populations of western redcedar (Thuja plicata). Can J Bot. 79:
753–756.
Forestry Canada and British Columbia Ministry of Forests. FRDA
Report 171. p. 34–35.
Russell JH, Grossnickle SC, Ferguson C, Carson DW. 1990. Yellow cedar
stecklings: nursery production and field performance. Victoria (BC):
Forestry Canada and British Columbia Ministry of Forests. FRDA Report
148.
Savolainen O, Karkkainen K, Kuttinen H. 1992. Estimating numbers of
embryonic lethals in conifers. Heredity. 69:308–314.
Sorensen F. 1969. Embryonic lethals in Douglas-fir, Pseudotsuga menziesii.
Am Nat. 103:389–398.
Sorensen FC. 1982. The roles of polyembryony and embryo viability in the
genetic system of conifers. Evolution. 36:725–733.
SPAR: BC Ministry of Forests and Range. 2009. Seed planning and registry
application. http://www.for.gov.bc.ca/hti/spar/index.htm
Sutton BCS, Attree SM, El-Kassaby YA, Grossnickle SC, Polonenko DR.
2004. Commercialisation of somatic embryogenesis for plantation forestry.
In: Walter C, Carson M, editors. Plantation forest biotechnology for the
21st century. Kerala (India): Research Signpost. p. 275–301.
Owens JN, Molder M. 1974. Cone initiation and development before
dormancy in yellow cedar (Chamaecyparis nootkatensis). Can J Bot.
52:2075–2084.
Thompson SL, Bérubé Y, Bruneau A, Ritland K. 2008. Three-gene identity
coefficients demonstrate that clonal reproduction promotes inbreeding and
spatial relatedness in yellow-cedar (Callitropsis nootkatensis). Evolution.
62:2570–2579.
Owens JN, Molder M. 1975. Pollination, female gametophyte, and embryo
and seed development in yellow cedar (Chamaecyparis nootkatensis). Can J Bot.
53:186–199.
Vähä J-P, Erkinaro J, Niemelä E, Primmer CR. 2007. Life-history and
habitat features influence the within-river genetic structure of Atlantic
salmon. Mol Ecol. 16:2638–2654.
Owens JN, Molder M. 1977. Cone induction in yellow cypress
(Chamaecyparis nootkatensis) by gibberellin A3 and the subsequent development of seeds within the induced cones. Can J For Res. 7:605–613.
Wang J. 2004. Sibship reconstruction from genetic data with typing errors.
Genetics. 166:1963–1979.
Pawuk WH. 1993. Germination of Alaska-cedar seed. Tree Planter’s Notes.
44:21–24.
Pritchard JK, Stephens M, Donnelly P. 2000. Inference of population
structure using multilocus genotype data. Genetics. 155:945–959.
Pritchard JK, Wen W. 2004. Documentation for structure software.
Chicago (IL): The University of Chicago Press.
Pritchard JK, Wen X, Falush D. 2007. Documentation for structure software Version 2.2. http://pritch.bsd.uchicago.edu/software/structure2_2.
html.
Ritland C, Pape T, Ritland K. 2001. Genetic structure of yellow cedar
(Chamaecyparis nootkatensis). Can J Bot. 79:822–828.
Ritland K. 1996. Estimators for pairwise relatedness and individual
inbreeding coefficients. Genet Res. 67:175–185.
Wang WP, Hwang CY, Lin TP, Hwang SY. 2003. Historical biogeography
and phylogenetic relationships of the genus chamaecyparis (Cupressaceae)
inferred from chloroplast DNA polymorphism. Plant Syst Evol. 241:13–28.
Williams CG, Barnes RD, Nyoka I. 1999. Embryonic genetic load for
a neotropical conifer, Pinus patula Schiede et Deppe. J Hered. 90:394–398.
Williams CG, Savolainen O. 1996. Inbreeding depression in conifers:
implications for breeding strategy. For Sci. 42:102–117.
Wright S. 1965. The interpretation of population structure by F-statistics
with special regard to systems of mating. Evolution. 19:395–420.
Xiang B, Li B, McKeand S. 2003. Genetic gain and selection efficiency of
loblolly pine in three geographic regions. For Sci. 49:196–208.
Yu J, Pressoir G, Briggs H, William HB, Bi IV, Yamasaki M, Doebley JF,
McMullen MD, Gaut BS, Nielsen DM, et al. 2006. A unified mixed-model
method for association mapping that accounts for multiple levels of
relatedness. Nat Genet. 38:203–208.
Robinson AJ, Love CG, Batley J, Barker G, Edwards D. 2004. Simple
sequence repeat marker loci discovery using SSR primer. Bioinformatics.
20:1475–1476.
Received April 2, 2009; Revised October 3, 2009;
Accepted October 22, 2009
Russell JH, Cartwright C. 1991. The genetics of yellow cypress. In: Lousier JD,
editor. Yellow cypress: Can we grow it? Can we sell it? Victoria (BC):
Corresponding Editor: James L. Hamrick
163