Models for Genome ´ Environment Interaction: Examples in Livestock

Published May 6, 2016
Research
Models for Genome ´ Environment
Interaction: Examples in Livestock
Ben J. Hayes,* Hans D. Daetwyler, and Mike E. Goddard
Abstract
In livestock, genotype ´ environment interaction
(G ´ E) has been widely investigated, with genotype defined at the level of subspecies, breeds,
individual animals within a breed (for example
performance of offspring of elite sires across
environments), and genotypes at single-nucleotide polymorphisms (SNPs). Environments can
be described by category (e.g., tropical vs. temperate, high vs. low farm input levels, countries)
and by continuous variables such as temperature. To predict breeding values of genotypes in
environments described by categories, multitrait
models with each category a different trait are
used. The models are now being used to predict genomic estimated breeding values (GEBV)
for different environments such as the value of
a bull’s genetics for his daughter’s milk production in different countries. The multitrait genomic
model has also been used to enable reference
populations to be merged across environments
and across countries, leading to more accurate
GEBV. When the environment can be described
by a continuous variable, random regression
models have been used to predict response
of genotypes to the environment. For example,
these models have been used to determine if
there are SNP genotypes associated with less
sensitivity of milk production to increasing temperature. In both livestock and plant breeding,
methods that use genomic information can better cope with a reduced degree of replication of
individuals across environments, as it is actually the alleles that must be replicated across
environments. More accurate estimates of G ´
E with the genomic approach may therefore be
achievable than was possible in the past.
B.J. Hayes, and H.D. Daetwyler, AgriBio, Centre for AgriBioscience,
Biosciences Research, DEDJTR, Victoria, Australia and Biosciences
Research Centre, La Trobe Univ., Victoria, Australia; M.E. Goddard,
Melbourne School of Land and Environment, Univ. of Melbourne,
Victoria, Australia. Received 28 July 2015. Accepted 1 Dec. 2015.
*Corresponding author ([email protected]).
Abbreviations: BMSCC, bulk milk somatic cell count; G ´ E, genotype ´ environment interaction; GEBV, genomic estimated breeding
value; SNP, single-nucleotide polymorphism.
G
enotype ´ environment interactions in livestock have
been widely investigated. The genotypes compared have
included subspecies (Bos taurus vs. Bos indicus), breeds, individual
animals within a breed and genotypes at SNPs. The environments
have been described by category (e.g., tropical vs. temperate) and
by continuous variables such as temperature. A G ´ E is defined
to exist if the difference between genotypes depends on the
environment in which it is measured. As in plant breeding, this
includes two different situations: the genotypes can change ranking between environments, or they can retain the same ranking
but the differences can be larger in one environment than in the
other. These two different types of G ´ E have different implications for the breeding program with the former being more
important. Therefore, the G ´ E is commonly analyzed as a multiple-trait situation in which the trait measured in the different
environments are treated as different but correlated traits. Then,
the genetic correlation between the traits measures the degree of
reranking between the environments. The genetic correlations
(rg) reported are reviewed below but, in general, rg < 0.8 usually
only occur if the environments are very different, for example,
tropical vs. temperate. Genotype ´ environment interaction can
be incorporated into traditional calculation of estimated breeding
Published in Crop Sci. 56:1–9 (2016).
doi: 10.2135/cropsci2015.07.0451
© Crop Science Society of America | 5585 Guilford Rd., Madison, WI 53711 USA
All rights reserved.
crop science, vol. 56, september– october 2016 www.crops.org1
values based on records of phenotype and pedigree. The
global analysis of dairy bulls, which is routinely performed
by Interbull (http://www.interbull.org), is an excellent
example, where phenotypes, such as milk yield in different countries, are treated as different traits. This is possible
because, for dairy cattle breeding, a small group of elite
sires are very widely used across the globe, and in contrast
to plant breeding programs, there is very little introgression to generate new genetic diversity, that is, essentially
the same genotypes are evaluated again and again.
In most other livestock species, however, G ´ E are
not routinely included in estimated breeding value calculations, that is, most analyses are based on phenotypic data
from a limited range of environments and few animals
have offspring in very different environments. The use
of SNP genotypes in the calculation of GEBV has the
potential to overcome this problem because GEBV could
be calculated for many different environments based only
on SNP genotype data. These analyses require that the
SNP alleles have been observed in different environments,
which is much more likely than individuals having offspring in multiple environments. Using GEBV for different environments could substantially increase selection
intensities in breeding programs; for example, dairy bulls
from anywhere in the world could be screened for the
performance of their daughters in Australia. Note that if
nonadditive effects are important, it is not just SNP allele
effects that must be observed in the different environments but the SNP genotype effects and SNP ´ SNP
genotypes in the case of epistasis. Genomic selection that
incorporates G ´ E could also accelerate genetic gains in
predicted future climates.
Properly accounting for G ´ E is important when reference populations (used to derive SNP prediction equations) are merged with the aim of increasing the accuracy
of the GEBV. This is of increasing interest, as very large
reference populations are required to calculate accurate G
´ E, and it may be difficult to assemble such large populations in multiple environments. If these reference populations have phenotypes measured in different environments, then not accounting for G ´ E can reduce the
accuracy of GEBV from the combined reference populations and cause bias (Haile-Mariam et al., 2015).
In this review, we discuss models of G ´ E including models used to derive GEBV for different environments. We then give some examples of G ´ E in livestock
including SNP by environment interactions. Finally,
inclusion of G ´ E to maximize progress from breeding
programs is discussed as well as similarities and contrasts
with accommodating G ´ E in plant breeding programs.
2
Modeling Genotype ´
Environment Interactions
To take account of G ´ E and to obtain GEBVs in different
environments, two models have been used: multitrait models
and reaction norm (also called random regression) models.
Multitrait Models
Multitrait models treat performance of a genotype for a
trait in different environments as different but potentially
correlated traits. The multitrait approach can handle a
wide variety of definitions of environment. Example trait
definitions include the following:
Countries: performance of a genotype in different
countries
Farming systems: performance in high- and low-input
systems
Environment descriptors such as heat stress measured by
temperature, humidity, and altitude (e.g., performance at
high and low levels of descriptor)
The multitrait approach is also relatively straightforward to extend to genomic predictions that capture G ´ E,
as described below.
For two environments (two traits), the multitrait model
is as follows (e.g., Hayes et al., 2003; Mulder et al., 2004):
é y1 ù é I1
ê ú =ê
ê y2 ú ê 0
ë û ë
0 ù é m1 ù éZ1 0 ù é g1 ù é e1 ù
úê ú+ê
úê ú+ê ú
I 2 úû êëm 2 úû êë 0 Z2 úû êë g2 úû êëe 2 ûú
where y1 and y2 are trait records for genotypes in Environment 1 and Environment 2, respectively, I1 and I 2 are
identity matrices, µ1 and µ 2 are the means for Environment
1 and Environment 2, Z1 and Z2 are the design matrices
that relate breeding values with the response variables, g1
and g 2 are the breeding values for genotypes for Environment 1 and Environment 2, and e1 and e2 are vectors
of random residuals for Environment 1 and Environment
ée ù
2. The random residuals are assumed êê 1 úú ~ N (0, I Ä R ) ,
ëe 2 û
é s2 s ù
e1
e12
ú , the residual variance–covariance
where R = êê
2 ú
ëêse12
se2 ûú
matrix for Environment 1 and Environment 2, I is a genotype ´ genotype identity matrix, and ⊗ is the Kronecker
product. Extension to more than two environments is
straightforward.
When the multitrait approach to modeling G ´ E
is implemented in livestock, genotype is usually defined
as the individual animal. It is rare that animals have performance recorded in two or more different environments; they may remain on the one farm throughout
their life. The performance of an animal’s genetics can
still be evaluated in multiple environments by modeling
www.crops.org
crop science, vol. 56, september– october 2016
the genetic relationships between animals. For example, a
dairy bull may have daughters in two or more environments, and these daughters inherit half of the bull’s genes.
These relationships can be modeled though the A matrix
(Henderson 1984), which describes the expected proportion of the genome that each pair of individuals share so
that the distribution of breeding values is assumed to be
é s2
é g1 ù
ê ú  N (0, A Ä T) , where T = ê g1
ês
ê g2 ú
êë g12
ë û
sg12 ùú
, the genetic
s2g 2 úûú
variance–covariance matrix for Environment 1 and Environment 2. The estimate of breeding values for individuals for Environment 1 and Environment 2 are then
and , respectively. The estimate of the genetic correlation between performance in the two environments
is
s¢g12

2 
2
s
g1 s g 2
• Farm input levels. Herd average production level
is often used as a surrogate for the level of feeding. This approach has been used in beef cattle,
sheep, and dairy cattle (Calus et al., 2002; Fikse et
al., 2003; McLaren et al., 2015; Hayes et al., 2003;
Pegolo et al., 2011)
• Farm disease level, for example, herd average levels
of somatic cell count, an indicator of mastitis
(Calus et al., 2006; Streit et al., 2013a).
The response of each genotype to change in the environmental descriptor is modelled as a unique curve. For
instance, the breeding value of a single genotype (with
genotype either an individual SNP or individual animal
or a variety) in an environment with environmental
descriptor variable w, can be modelled as a polynomial:
(variance components can be estimated
by software such as ASReml [Gilmour et al., 2006]).
This model can be extended to include data on SNP
genotypes. The SNP information can be used to model
genomic relationships among animals to obtain GEBV. If the
genomic relationship approach is taken, all that is required is
to replace A (the pedigree derived relationship matrix) with
G, which are the relationships derived from the markers as
described by VanRaden (2008) or Yang et al. (2010):
é g1 ù
ê ú  N (0, G Ä T )
ê g2 ú
ë û
The assumptions underlying this formulation are
described in Appendix 1. Then ĝ1 and ĝ2 from fitting the
model are the GEBV for animals for Environment 1 and
Environment 2.
One interesting feature of the genomic implementation of the multitrait model is that it is not necessary to
have close relatives (for example daughters of the same
bull) in different environments, as even the small coefficients of genomic relationship contribute to the estimate
of an individual’s breeding value in each environment. For
example, in human genetics, this multiple-trait approach
has been used with individuals that only share small proportions of their genome (Maier et al., 2015).
Reaction Norm Models
If the environment is better described by a continuous
variable than by a series of categories, reaction norm
models are an alternative to multitrait models. Examples
of continuous environmental descriptors are as follows:
• Temperature humidity indices, as a measure of
heat stress (Ravagnolo and Misztal 2000; Hayes et
al., 2003; Haile-Mariam et al., 2008; Hammami et
al., 2015)
crop science, vol. 56, september– october 2016 g = S 0 + wS1 + w 2S 2 + …
where S¢ = (S 0 S1 S 2) are regression coefficients, specific
to this genotype, which are treated as random variables
with var(S) = C (for a model with intercept and slope,
the elements of the C matrix would be the variance of
the intercepts, the variance of the slopes, and the covariance between them). This can be done with random
regression, where each genotype has its own intercept,
slope, and potentially higher-order terms that describe the
trait response to increases in the environmental descriptor (for more details on random regression see Jamrozik,
and Schaeffer, 1997). In practice, only the intercept and
slope are usually considered (variance components associated with higher order terms such as quadratic and cubic
coefficients can be difficult to estimate). Reaction norm
models have been applied with animals as genotypes (e.g.,
Fikse et al., 2003; Ravagnolo and Misztal 2000) and also
to estimate SNP ´ environment interaction (e.g., Streit
et al., 2013a; Hayes et al., 2009). The random regression
implementation of the reaction norm model (with intercept and slope) is as follows.
For n individuals whose breeding value are stored in
(n1) a vector g, the model using pedigree becomes g = Ws,
where W = (W0, W1), a n ´ 2n matrix, where {wi} jk = w 2j
if j = k, and 0 otherwise, S ¢ = (S0¢S1¢ ) (a 1 ´ 2n) vector,
where Si is a n ´ 1 vector of breeding values for trait Si (I
= 0 is the intercept and i = 1 is the slope). The variance of
é s2
S is (S) = A Ä C , where C = êê
S0
êësS10
sS01 ùú
and A is the rela2 ú
sS1
úû
tionship matrix derived from pedigree as described above.
Thus the reaction norm model is a multitrait model
in which the traits are random regression coefficients (for
example for the intercept, and linear slope).
www.crops.org3
Given some assumptions (Appendix 2), GEBV for the
slope and intercept can be predicted by replacing A with
.
G,
Examples of GENOTYPE ´
ENVIRONMENT INTERACTIONS
in Livestock
Multiple-Trait Models
A good example of implementation of the multiple-trait
approach to estimate breeding values in different environments was growth of Angus cattle at high and low
altitudes (Williams et al., 2012). At high altitudes, cattle
can suffer high mountain disease, also called brisket disease, which is heart failure as a result of hypoxic pulmonary hypertension. This can severely compromise growth.
Williams et al. (2012) assessed growth rate (weaning
weights) of more than 77,000 cattle on farms in Colorado
at a range of altitudes. Two traits were defined: growth at
high altitude and growth at low altitude. Relationships
between animals were derived from pedigree record. The
genetic correlation for growth at high and low altitudes
was 0.74. This indicates significant reranking between
sires will occur between high- and low-altitude farms,
and the genetic evaluation for growth should include the
genotype ´ altitude interaction.
The multitrait genomic approach to accommodate G
´ E was exemplified by Haile-Mariam et al. (2015). In this
study performance of dairy cattle in Australia were treated
as one trait and performance in the Netherlands and New
Zealand another trait. Milk yields, protein yields, fertility,
and longevity were investigated with this approach. The
aim of the study was actually to improve the accuracy of
the Australian GEBVs by increasing the size of the reference population by including information on genotype
performance from other countries. There were 5720 bulls
with daughter records in one, two, or all three countries,
and these bulls were genotyped for 36,000 SNP markers.
The genomic relationship matrix was constructed among
the bull from the SNP genotypes as described by Yang
et al. (2010). As a result of implementing the multitrait
model described above, GEBVs were produced for Australian and the other country environments. Including
information from the other countries improved the accuracy of genomic breeding values in Australia (as demonstrated in a validation population) by up to 10% for milk
yield. The genetic correlation between performance in
the different countries was as low as 0.72 for longevity
and 0.8 for protein yield. The fact that these correlations
are significantly <1 indicates that significant reranking
of sires occurs between the countries, and breeding programs specific to each country are justified.
4
Reaction Norm Models
A good example of the reaction norm approach to model
genotype ´ farm disease level interactions was presented by Calus et al. (2006). Those authors used bulk
milk somatic cell count (BMSCC) as the environmental
descriptor. Bulk milk somatic cell count is the number of
somatic cells that are present in milk samples pooled across
the cows in a herd: high levels of BMSCC indicate mastitis
is prevalent in the herd, and low levels of BMSCC indicate
low incidence of mastitis in the herd. The practical question is whether sires can be identified that have daughters with low somatic cell counts even when mastitis is
prevalent in the rest of the herd (high BMSCC). The data
set included 344,029 test-day records (somatic cell count
records) of 24,125 cows sired by 182 bulls in 461 herds.
The model included random regressions for each sire on
herd test-day BMSCC. The genetic correlation was 0.72
between somatic cell counts at low and at high BMSCC.
This is considerably <1, suggesting sires rerank considerably for their performance (in this case somatic cell counts
of their daughters) in low and high disease incidence herds.
The reaction norm approach has also been used to
derive GEBV for heat tolerance. Nguyen et al. (2015)
defined heat tolerance of a cow as the drop in milk production with increasing temperature and humidity (combined in a temperature and humidity index, which predicts
heat stress). Milk production data was recorded at least five
times during each cow’s lactation for 343,016 cows, and this
data was combined with daily temperature and humidity
measurements from weather stations closest to the tested
herds for 10 yr of data. Tolerance to heat stress was then
estimated for each cow using random regression (intercept
and slope) to model the rate of decline in production with
increasing temperature humidity index accumulated over
the 4 d before and the day of milking for milk yield, fat
yield, and protein yield. The slopes from this model were
used to define daughter averages (daughter trait deviations
[DTD] for their sires, of which, 2735 Holsteins and 710
Jersey had genotypes [either real or imputed]) for 632,003
SNP. Genomic best linear unbiased prediction was used to
calculate GEBV for heat tolerance. The reference population consisted of either genotyped sires only (2300 Holstein
and 575 Jersey sires) or genotyped sires and cows where the
cows had genotypes (2191 Holstein and 1190 Jersey). The
reminder of the sires (435 Holsteins and 135 Jerseys) were
used as a validation set, and accuracy of GEBV for heat
tolerance was calculated as the correlation of GEBV and
DTD divided by the accuracy of the DTD for these sires.
The accuracy of GEBV for heat tolerance was 0.46 for the
Holstein validation sires and 0.49 for the Jersey validation
sires. These accuracies are moderate to high, suggesting
genomic selection for heat tolerance could be included in
dairy cattle breeding programs to improve production in
environments where heat stress occurs.
www.crops.org
crop science, vol. 56, september– october 2016
Fig. 1. Example of reaction norms for single-nucleotide polymorphism alleles. Allele A is associated with the highest milk production at
low levels of temperature and humidity, while at very high levels of temperature and humidity, allele C performs better.
Hayes et al. (2009) used a similar approach to investigate individual SNP marker by temperature and humidity index interaction. In this case random regression was
used to model the response of each SNP allele to increasing temperature humidity index (Fig. 1). The model was
fitted for 39,048 SNPs one a time. The SNPs associated
with response of milk production to the temperature
humidity index were identified on chromosome 9 and 29,
and these were validated in two independent populations,
one a different breed of cattle.
Another interesting example of SNP by environment
interaction in livestock is the effect of myostatin genotype
on body temperature during heat and cold stress (Howard
et al., 2013). Mutations in the myostatin gene can result
in the double-muscling phenotype in Belgian Blue, Piedmontese, and other breeds of cattle. In this study, animals
that were homozygous wild-type, heterozygous, or homozygous for the Piedmontese-derived myostatin mutation
had rectal temperatures collected during periods of heat
and cold stress. The results indicated a G ´ E did exist for
the myostatin mutation; the additive effect was +0.10°C
during heat stress and the dominance estimate was −0.12°C
in rectal temperature. During winter stress events, the
additive estimate was 0.10°C and dominance estimate was
0.054°C. All these effects were significant (P < 0.05). The
study of Howard et al. (2013) illustrates a SNP G ´ E
exists for the myostatin mutation, and that heterozygous
animals were more robust to environmental extremes in
comparison with either homozygous genotype.
crop science, vol. 56, september– october 2016 Overall Extent of Genotype ´ Environment
Interaction in Livestock
The extent of G ´ E in livestock depends very much on
the genotypes involved, the classification of environment,
the trait, and the statistical method used to estimate G ´
E (Table 1). As might be expected, G ´ E is largest when
performance for very different environments is compared
(e.g., tropical vs. temperate performance) and very different genotypes are compared (Bos taurus vs. Bos indicus).
Within breeds and within countries, the magnitude of G ´
E is usually much smaller. These conclusions are similar to
those of Burrow (2012) in a review of the importance of G
´ E in tropical beef breeding systems. The extent of G ´ E
also appears to be larger for traits more closely related to fitness, for example, for fertility (Haile-Mariam et al., 2008).
Accounting for Genotype ´ Environment
Interactions in Livestock Breeding Programs
An obvious question that stems from Table 1 is what level
of G ´ E justifies different breeding programs or different genomic evaluations? Robertson (1959) proposed that
a correlation of performance between environments of
above 0.8 would indicate that there would be minimum
reranking of selection candidates in the two environments; correlations below 0.8 would indicate considerable
reranking and would justify separate breeding programs.
Mulder and Bijma (2005) reached a similar conclusion;
those authors considered two environments: a selection
environment and a commercial production environment,
with a genetic correlation between performance in the two
environments. They concluded that when this correlation
www.crops.org5
Table 1. Some examples of genotype ´ environment (G ´ E) and genome ´ environment interaction in livestock.
Species
Environment
Trait
Dairy cattle
Farming system
(grazing vs. confinement)
Milk production
Beef cattle
Pasture or feedlot
Final weight,
average daily
gain and scrotal
circumference
Yearling
weight
Beef cattle
Dairy cattle
Dairy cattle
Pigs
Beef cattle
Dairy cattle
Dairy cattle
Dairy cattle
Dairy Cattle
Dairy cattle
Dairy cattle
Dairy cattle
Dairy cattle
Sheep
Sheep
Sheep
Sheep
Genotype
Body weight
Milk yield
Milk production
Milk yield, udder
health, and fatty
acid profile in milk
Fertility
Reference
0.89
Kearney et al. 2004
0.75, 0.49, 0.89
Raidan et al. 2015
Saavedra-Jiménez
et al., 2013
Multitrait
0.23 (wet tropic,
temperate) to 0.99
(dry tropic, temperate)
0.93, 0.79
Mulder et al., 2004
Multitrait
0.78 to 0.90
Fikse et al., 2003
Statistical
model
Animal, Holstein
Correlation of
breed
estimated breeding
value from two
environments
Animal, Nellore
Multitrait
breed
Climatic zone: dry tropic,
Animal, Braunvieh
wet tropic, temperate
cattle
climates in Mexico
Robotic milking vs.
Milk yield, somatic Animal, Holstein
conventional milking
cell score
breed
Country (Australia, Canada,
Milk yield
Animal, Guernsey
United States, South Africa)
Conventional vs.
Growth,
Breed
organic farming system
carcass quality
Farm input (herd
weight gain)
Temperature and
humidity
Country (Luxembourg,
Tunisia)
Temperature and
humidity
Extent of G ´ E and
genetic correlation
between extreme
environments†
Animal, Nellore
breed
Animal, Holstein
breed
Animal, Holstein
breed
Animal, Holstein
breed
Multitrait
Multitrait
Random
regression
Random
regression
Random
regression
Random
regression
Temperature and
Animal, Holstein
Random
humidity
regression
Farm input level (herd
Milk yield
SNP‡ genotype
Random
production level as a proxy
regression
for level of feeding)
Farm disease level (bulk
Milk yield
SNP genotype
Random
milk somatic cell count
regression
Farm input level (herd
Milk yield
SNP genotype
Random
production level as a proxy
regression
for level of feeding)
Temperature and
Milk yield
SNP genotype
Random
humidity
regression
Farm environment,
Weight, ultrasound
Animal,
Random
expressed as principal
back-fat, muscle
Texel breed
regression
component loadings
depths
after clustering on farm
characteristics
Farm environment
Lamb weaning Animal, Norwegian
Multitrait
weight
white and Spel
breeds
Farm environment
Fecal egg count and Animal, Merino Multitrait, random
production
breed
regression
Farm environment
Growth
Animal, Santa
Multitrait, random
Ines breed
regression
Except weight gain,
Brandt et al., 2010
no major shift of the ranking
order within environment
between genotypes.
0.09–0.74
Pegolo et al., 2011
>0.90
0.50
0.80, 0.64, 0.67
(depending on fatty acid)
0.79
28 SNP validated
for slope
11 SNP validated
for slope
27 significant SNP
for slope
SNP on chromosome 29
validated for slope
Brügemann et al.,
2011
Hammami et al.,
2009
Hammami et al.,
2015
Haile-Mariam et al.,
2008
Streit et al., 2013b
Streit et al., 2013a
Lillehammer et al.,
2009
Hayes et al., 2009
McLaren et al.,
2015
0.82
Steinheim et al.,
2008
Significant but
small G ´ E
>0.70
Pollott and Greeff,
2004
Santana et al., 2013
† For random regression models, these correlations are typically between performance at the fifth and 95th percentile of the environment descriptor.
‡ SNP, single-nucleotide polymorphism.
was lower than 0.8, selection based on progeny tested in
the commercial production environment resulted in more
gain than selection based on sibs of selection candidates
measured in the selection environment. Within a country, particularly countries with temperate environments,
the genetic correlation between environmental extremes
rarely falls below 0.8 (Table 1). This is in contrast to
6
between countries and between tropical and temperate
zones within a country where correlations of performance
can be considerably below 0.8. For example, milk production in Luxemburg and Tunisia has a genetic correlation of 0.5 (Hammami et al., 2009). For a tropical dairy
system in Kenya, Okeno et al. (2010) compared genetic
progress from a local progeny testing with importation of
www.crops.org
crop science, vol. 56, september– october 2016
semen from temperate countries and concluded importation was the superior strategy only if the genetic correlation between milk production in Kenya and the temperate
counties was greater than 0.7 despite the fact that progeny
test schemes were much larger in the temperate countries.
Between Australia and North America, genetic correlations for performance of Holstein cattle are 0.8 or below
(http://www.interbull.org). The opportunity cost of not
accounting for G ´ E can be quite large. It has been estimated that if breeding programs for Australian dairy cattle
were based purely on breeding values calculated using North
American information, $12 million yr−1 in genetic progress
could be lost (J.E. Pryce, personal communication, 2015).
One option for using genotypes that are genetically
superior in one environment but are not well adapted to
the target environment, would be to introgress alleles for
adaptation in the target environment. This already occurs
for tropical dairy production in Brazil, where high-performance Holstein cattle are crossed to adapted Bos indicus
cattle to form a composite called Girolando. Most of the
milk production in Brazil is now from Girolando cows
(da Costa et al., 2015). More targeted introgression has
also been demonstrated; Dikmen et al. (2014) introgressed
an allele at SLICK hair locus (a mutation that changes
the hair follicle and thermotolerance) from Senepol cattle
into Holsteins. There are relatively few examples of this
in livestock; however, as long generation time means that
introgression is very slow. Genome editing for adaption
alleles would be a potentially more rapid alternative.
There is increasing interest in livestock in breeding for
“robustness”. While this is a vague term, one interpretation is breeding for animals that produce well across a range
of environments. Reaction norm models directly identify
“robust” genotypes– these are the genotypes with low slope
values in response to the environmental indicator, whereas
sensitive genotypes have steep slopes. Note that in plant
breeding the equivalent term for robustness would be stability across environments. Lillehammer et al. (2009) identified SNP genotypes associated with improved milk production in dairy cows that were robust to the herd average
level of production, that is, animals with these genotypes
had improved production at low and high levels of feeding (high intercept and low slope). Streit et al. (2013a) used
a similar approach to identify SNPs that could be used to
select for robustness of milk production to the average level
of disease (mastitis) for the farm (e.g., animals with these
SNP genotypes produced well regardless of whether the
farm had a high disease load or a low disease load).
As another example of using genotype ´ environment models to enable breeding for robustness was Rose
et al. (2013), where the aim was to identify sheep that lost
less body weight in harsh nutritional conditions. Rose et
al. (2013) used both a multitrait approach (body weight
loss defined as a phenotype) and the random regression
crop science, vol. 56, september– october 2016 approach. The heritability of body weight loss in harsh
nutritional conditions was 0.05 to 0.16, and the genetic
correlation between body weight gain in good conditions
and body weight loss in poor conditions was negative but
low. This led the authors to conclude that sheep can be
bred to be more tolerant to variation in feed supply.
Parallels and Differences Between
Genotype ´ Environment Interactions
in Livestock and Plants
One reason why G ´ E is potentially more crucial for
plants is that livestock can, to some extent, move to avoid
or mitigate stressors (and in some cases are kept in controlled environments, such as barns, where temperatures
can be to some extent increased or increased to avoid
stress), whereas plants are obviously less able to do so. A
further point of differentiation is that in livestock, the
focus in estimation of G ´ E has been largely on additive
effects to generate selection gains, whereas in plants G ´
E for dominance and espistatic effects can be exploited
more easily through hybrids or clonal propagation. In
terms of modeling G ´ E , the approaches outlined above
can and have also been applied in plants (e.g., Burgueño et
al., 2011; Jarquín et al., 2014). One key point is that in the
past, quantification of G ´ E without pedigrees required
each genotype to be grown in all environments. Provided
genomic markers (or pedigree) are available, the relatedness among lines can be modelled using matrices G or A,
and as in animal studies, G ´ E methods that use genomic
information can better cope with a reduced degree of
replication of individuals across environments (in the
genomic approach it is actually the alleles that must be
replicated across environments). More accurate estimates
of G ´ E with the genomic approach may therefore be
achievable than was possible in the past.
Conclusion
In livestock, G ´ E is considerable when genotype is
defined at the level of subspecies (Bos taurus vs. Bos indicus) and environments are described by different climatic
zones (tropical vs. temperate) but less for individuals within
a breed and within (most) countries. Models that include G
´ E, particularly the multitrait model, are now being used
to enable the reference populations used for genomic selection to be merged across environments and across countries,
leading to higher accuracy of GEBV. In either multitrait or
reaction norm models, using genomic information can lead
to more accurate estimates of G ´ E, as it is more likely
that SNP genotypes are well replicated across environments
than individual animals or their close relatives.
www.crops.org7
Appendix 1. Prediction of
MultiTrait Genomic Estimated
Breeding Values
References
Define breeding values as g = Uq, where U is a matrix containing the genotypes of animals at all the SNPs (n animals
´ m, where m is the number of SNP coded as the number
of copies of the second allele), and q is the effect of each
SNP on the trait, is a linear model for the breeding value g.
The SNP effects (q) are treated as random effects sampled
from a distribution; for instance, a normal distribution q
~N(0,D). The D matrix is usually diagonal, implying that
the effect of different SNPs are independent of each other,
,
for example, for the ith SNP in Environment 1,
is the variance associated with the ith SNP in
where
Environment 1. The covariance of the SNP effects in Environment 1 and 2 is Dij = sij,12. Then for the multitrait model
described in the text, the variance of breeding values are as
follows:
é g ù é UD11 U ' UD12 U ' ù
ú.
var ê 1 ú = ê
ê g2 ú ê UD12 U ' UD22 U 'ú
ë û ë
û
If we assume that all SNP effects for each trait are
drawn from the same normal distribution (q ~ N(0, Is12 ),
ég ù
then var êê 1 úú can be written as
. That is, the
ë g2 û
expected relationship matrix A is replaced by G, the realized relationship among individuals calculated from the
markers (G = UU¢/m) is calculated for example.
é g1 ù
ê ú  N (0, G Ä T ) .
ê g2 ú
ë û
Appendix 2. Prediction of Genomic
Estimated Breeding Values for
Reaction Norms
If there are SNP genotypes U (an n ´ m matrix, where
m is the number of SNP, and genotypes are coded as the
number of copies of the second allele), then the breeding values for animals for trait 0 (the intercept) are S0 =
Uq 0, where q 0 is the effect of the SNP on the intercept,
and var(q 0) = D0, a m ´ m diagonal matrix, and likewise
for the other traits. For the case with intercept and slope,
cov(q 0,q1) = D01, a diagonal matrix (m ´ m) of covariances
among the SNP effects on the intercept and slope, and
é UD00 U ' UD01U 'ù
ú.
var (S) = ê
ê UD10 U ' UD11U ' ú
ë
û
2
If Dii = Isi and Dij = Isi2 then var (S) = G Ä C , where
é s2
C = êê S 0
êësS10
sS 01 ùú
, as above, and G = UU¢/m, or versions of
sS21 úúû
the genomic relationship matrix described for example by
VanRaden (2008) or Yang et al. (2010).
8
Brandt, H., D.N. Werner, U. Baulain, W. Brade, and F. Weissmann.
2010. Genotype–environment interactions for growth and carcass traits in different pig breeds kept under conventional and
organic production systems. Animal 4:535–544. doi:10.1017/
S1751731109991509
Brügemann, K., E. Gernand, U.U. von Borstel, and S. König. 2011.
Genetic analyses of protein yield in dairy cows applying random regression models with time-dependent and temperature
´ humidity-dependent covariates. J. Dairy Sci. 94:4129–4139.
doi:10.3168/jds.2010-4063
Burgueño, J., G. de los Campos, K. Weigel, and J. Crossa. 2011.
Genomic prediction of breeding values when modeling genotype ´ environment interaction using pedigree and dense
molecular markers. Crop Sci. 52:707–719. doi:10.2135/cropsci2011.06.0299
Burrow, H.M. 2012. Importance of adaptation and genotype ´
environment interactions in tropical beef breeding systems.
Animal 6:729–740. doi:10.1017/S175173111200002X
Calus, M.P., A.F. Groen and G. de Jong. 2002. Genotype ´ environment interaction for protein yield in Dutch dairy cattle as
quantified by different models. J. Dairy Sci. 85:3115–3123.
doi:10.3168/jds.S0022-0302(02)74399-3
Calus, M.P., L.L. Janss, and R.F. Veerkamp. 2006. Genotype by
environment interaction for somatic cell score across bulk milk
somatic cell count and days in milk. J. Dairy Sci. 89:4846–
4857. doi:10.3168/jds.S0022-0302(06)72533-4
da Costa, A.N., J.V. Feitosa, P.A. Montezuma, Jr., P.T. de Souza,
and A.A. de Araújo. 2015. Rectal temperatures, respiratory
rates, production, and reproduction performances of crossbred
Girolando cows under heat stress in northeastern Brazil. Int.
J. Biometeorol. 59:1647–1653. doi:10.1007/s00484-015-0971-4
Dikmen, S., F.A. Khan, H.J. Huson, T.S. Sonstegard, J.I. Moss, G.E.
Dahl, and P.J. Hansen. 2014. The SLICK hair locus derived
from Senepol cattle confers thermotolerance to intensively
managed lactating Holstein cows. J. Dairy Sci. 97:5508–5520.
doi:10.3168/jds.2014-8087
Fikse, W.F., R. Rekaya, and K.A. Weigel. 2003. Genotype ´
environment interaction for milk production in Guernsey
cattle. J. Dairy Sci. 86:1821–1827. doi:10.3168/jds.S00220302(03)73768-0
Gilmour, A.R., B.J. Gogel, B.R. Cullis, and R. Thompson. 2006.
ASReml User Guide 2.0. VSN International Ltd., Hemel
Hempsted, UK.
Haile-Mariam, M., M.J. Carrick, and M.E. Goddard. 2008. Genotype by environment interaction for fertility, survival, and
milk production traits in Australian dairy cattle. J. Dairy Sci.
91:4840–4853. doi:10.3168/jds.2008-1084
Haile-Mariam, M., J.E. Pryce, C. Schrooten, and B.J. Hayes. 2015.
Including overseas performance information in genomic evaluations of Australian dairy cattle. J. Dairy Sci. 98:3443–3459.
doi:10.3168/jds.2014-8785
Hammami, H., B. Rekik, H. Soyeurt, C. Bastin, E. Bay, J. Stoll,
and N. Gengler. 2009. Accessing genotype by environment
interaction using within- and across-country test-day random
regression sire models. J. Anim. Breed. Genet. 126:366–377.
doi:10.1111/j.1439-0388.2008.00794.x
Hammami, H.B., J. Vandenplas, M.L. Vanrobays, B. Rekik, C. Bastin, and N. Gengler. 2015. Genetic analysis of heat stress effects
on yield traits, udder health, and fatty acids of Walloon Holstein
cows. J. Dairy Sci. 98:4956–4968. doi:10.3168/jds.2014-9148
www.crops.org
crop science, vol. 56, september– october 2016
Hayes, B.J., Bowman, P.J., Chamberlain, A.J., Savin, K., van Tassell, C.P., Sonstegard, T.S. and M.E. Goddard. 2009. A validated genome wide association study to breed cattle adapted to
an environment altered by climate change. PLoS One. 2009
4:e6676. doi:10.1371/journal.pone.0006676
Hayes, B.J., M. Carrick, P. Bowman, and M.E. Goddard. 2003.
Genotype ´ environment interaction for milk production of
daughters of Australian dairy sires from test-day records. J. Dairy
Sci. 86:3736–3744. doi:10.3168/jds.S0022-0302(03)73980-0
Henderson, C.R. 1984. Applications of linear models in animal
breeding. University of Guelph, Guelph, ON.
Howard, J.T., S.D. Kachman, M.K. Nielsen, T.L. Mader, and M.I.
Spangler. 2013. The effect of myostatin genotype on body
temperature during extreme temperature events. J. Anim. Sci.
91:3051–3058. doi:10.2527/jas.2012-6180
Jamrozik, J., and L.R. Schaeffer. 1997. Estimates of genetic parameters for a test day model with random regressions for yield
traits of first lactation Holsteins. J. Dairy Sci. 80:762–770.
doi:10.3168/jds.S0022-0302(97)75996-4
Jarquín, D., K. Kocak, L. Posadas, K. Hyma, J. Jedlicka, G. Graef,
and A. Lorenz. 2014. Genotyping by sequencing for genomic
prediction in a soybean breeding population. BMC Genomics
15:740. doi:10.1186/1471-2164-15-740
Kearney, J.F., M.M. Schutz, P.J. Boettcher, and K.A. Weigel.
2004. Genotype ´ environment interaction for grazing versus confinement. I. Production traits. J. Dairy Sci. 87:501–509.
doi:10.3168/jds.S0022-0302(04)73189-6
Lillehammer, M., B.J. Hayes, T.H. Meuwissen, and M.E. Goddard. 2009. Gene by environment interactions for production
traits in Australian dairy cattle. J. Dairy Sci. 92:4008–4017.
doi:10.3168/jds.2008-1484
Maier, R., G. Moser, G.B. Chen, S. Ripke, Cross-Disorder Working Group of the Psychiatric Genomics Consortium, W.
Coryell, J.B. Potash, W.A. Scheftner, J. Shi, M.M. Weissman,
C.M. Hultman, M. Landén, D.F. Levinson, K.S. Kendler,
J.W. Smoller, N.R. Wray, and S.H. Lee. 2015. Joint analysis of psychiatric disorders increases accuracy of risk prediction for schizophrenia, bipolar disorder, and major depressive
disorder. Am. J. Hum. Genet. 96:283–294. doi:10.1016/j.
ajhg.2014.12.006
McLaren, A., S. Brotherstone, N.R. Lambe, J. Conington, R.
Mrode, and L. Bunger. 2015. The effects of different farm
environments on the performance of Texel sheep. Animal
9:1624–1634. doi:10.1017/S1751731115001123
Mulder, H.A., and P. Bijma. 2005. Effects of genotype ´ environment interaction on genetic gain in breeding programs. J.
Anim. Sci. 83:49–61.
Mulder, H.A., A.F. Groen, G. De Jong, and P. Bijma. 2004. Genotype ´ environment interaction for yield and somatic cell score
with automatic and conventional milking systems. J. Dairy Sci.
87:1487–1495. doi:10.3168/jds.S0022-0302(04)73300-7
Nguyen, T.T.T., P.J. Bowman, M. Haile-Mariam, J.E. Pryce, and
B.J. Hayes. 2015. Genomic selection for tolerance to heat
stress in Australian dairy cattle. Proc. of the 21st Conf. Assoc.
Advmt. Breed. Genet. 21:378–381.
Okeno, T.O., I.S. Kosgey, and A.K. Kahi. 2010. Genetic evaluation
of breeding strategies for improvement of dairy cattle in Kenya.
crop science, vol. 56, september– october 2016 Trop. Anim. Health Prod. 42:1073–1079. doi:10.1007/s11250010-9528-z
Pegolo, N.T., L.G. Albuquerque, R.B. Lôbo, and H.N. de Oliveira.
2011. Effects of sex and age on genotype ´ environment interaction for beef cattle body weight studied using reaction norm
models. J. Anim. Sci. 89:3410–3425. doi:10.2527/jas.2010-3520
Pollott, G.E., and J.C. Greeff. 2004. Genotype ´ environment
interactions and genetic parameters for fecal egg count and
production traits of Merino sheep. J. Anim. Sci. 82:2840–2851.
Raidan, F.S., T.L. Passafaro, B.O. Fragomeni, L.A. Josahkian, I.G.
Pereira, and F.I. Toral. 2015. Genotype ´ environment interaction in individual performance and progeny tests in beef cattle. J. Anim. Sci. 93:920–933. doi:10.2527/jas.2014-7983
Ravagnolo, O., and I. Misztal. 2000. Genetic component of heat
stress in dairy cattle, parameter estimation. J. Dairy Sci.
83:2126–2130. doi:10.3168/jds.S0022-0302(00)75095-8
Robertson, A. 1959. The sampling variance of the genetic correlation coefficient. Biometrics 15:469–485. doi:10.2307/2527750
Rose, G., A. Kause, H.A. Mulder, J.H. van der Werf, A.N. Thompson, M.B. Ferguson, and J.A. van Arendonk. 2013. Merino
ewes can be bred for body weight change to be more tolerant to uncertain feed supply. J. Anim. Sci. 91:2555–2565.
doi:10.2527/jas.2012-5539
Saavedra-Jiménez, L.A., R. Ramírez-Valverde, R. Núñez-Domínguez, J.G. García-Muñiz, N. Lopez-Villalobos, and A. RuízFlores. 2013. Genotype by climate interaction in the genetic
evaluation for growing traits of Braunvieh cattle in Mexico.
Trop. Anim. Health Prod. 45:1489–1494. doi:10.1007/s11250013-0387-2
Santana, M.L., A.B. Bignardi, et al. 2013. Genotype by environment interaction and model comparison for growth traits of
Santa Ines sheep. J. Anim. Breed. Genet. 130:394–403.
Streit, M., F. Reinhardt, G. Thaller, and J. Bennewitz. 2013a.
Genome-wide association analysis to identify genotype ´ environment interaction for milk protein yield and level of somatic
cell score as environmental descriptors in German Holsteins. J.
Dairy Sci. 96:7318–7324. doi:10.3168/jds.2013-7133
Streit, M., R. Wellmann, F. Reinhardt, G. Thaller, H.P. Piepho,
and J. Bennewitz. 2013b. Using genome-wide association analysis to characterize environmental sensitivity of milk traits in
dairy cattle. G3. 3:1085–93.
Steinheim, G., J. Odegård, T. Adnøy, and G. Klemetsdal. 2008.
Genotype by environment interaction for lamb weaning
weight in two Norwegian sheep breeds. J. Anim. Sci. 86:33–
39. doi:10.2527/jas.2007-0031
VanRaden, P.M. 2008. Efficient methods to compute genomic predictions. J. Dairy Sci. 91:4414–4423. doi:10.3168/jds.2007-0980
Williams, J.L., J.K. Bertrand, I. Misztal, and M. Łukaszewicz. 2012.
Genotype by environment interaction for growth due to altitude in United States Angus cattle. J. Anim. Sci. 90:2152–2158.
doi:10.2527/jas.2011-4365
Yang, J., B. Benyamin, B.P. McEvoy, S. Gordon, A.K. Henders,
D.R. Nyholt, P.A. Madden, A.C. Heath, N.G. Martin, G.W.
Montgomery, M.E. Goddard, and P.M. Visscher. 2010. Common SNPs explain a large proportion of the heritability for
human height. Nat. Genet. 42:565–569. doi:10.1038/ng.608
www.crops.org9