Crop Modeling, QTL Mapping, and Their

Crop Modeling, QTL Mapping, and Their Complementary Role in Plant Breeding
Xinyou Yin,* Piet Stam, Martin J. Kropff, and Ad H. C. M. Schapendonk
ABSTRACT
massive amounts of information for plant breeding
(Stuber et al., 1999; Miflin, 2000), an option of improving
breeding efficiency is to develop and utilize a thorough
understanding of morphophysiological factors that determine yield (Bindraban, 1997). However, only in a
limited number of instances has plant physiology led to
crop improvement; rather, its role in breeding so far
has been to provide possible explanations for the improvements that have been achieved. Miflin (2000) suggested that this situation may change in the future if
the links between physiology and genetics are established.
This suggestion agrees with a main conclusion from an
extensive survey (Jackson et al., 1996) that there is a
general agreement among plant breeders and physiologists that physiological knowledge can be applied to
improve breeding efficiency in the future.
As early as in the 1960s, Donald (1968) proposed an
approach based much more explicitly on the design of
plants or ideotypes for target environment, using known
principles of physiology and agronomy. Rasmusson
(1987) suggested improvements to Donald’s approach,
considering correlations between traits, and designed a
barley (Hordeum vulgare L.) ideotype for the Midwest
USA with desired changes in culm, leaf, and head characteristics. Rasmusson (1991) reported that while some
characteristics appeared to afford little opportunities
for obtaining gains, others did show more promise. Based
on the concept of ideotype approach, International Rice
Research Institute (IRRI) launched a program in 1989
to develop a “new plant type” rice that combines multiple innovations (Peng et al., 1999). While so far these
new lines have not broken the yield barrier as hoped for,
progress is being made at IRRI with refined ideotype
designs (S. Peng, personal communication, 2002). A
convincing example of ideotype approach is the breeding for the superhybrid rice by Professor Yuan Longping’s group in China. Rather than count on heterosis
alone to raise yields, he also incorporated morphophysiological characters such as long, narrow and erect top
leaves and large panicles that hang close to ground, the
characteristics that physiologists have expected to enhance efficiency of crop light capture (Setter et al., 1995).
Field trials at four separate locations showed potential
yields of the superhybrid were 15 to 20% higher than
10.5 t ha⫺1 for existing hybrids (Normile, 1999).
Using crop physiology in ideotype breeding can be
more feasible than ever because of the development of
dynamic process-based crop growth simulation models
(crop models hereafter). These models quantify causal-
Crop modelers and geneticists have developed a vision of their roles
in plant breeding from their own perspective. However, to improve
breeding efficiency, interdisciplinary collaboration becomes increasingly important. The objective of this paper is to explore opportunities
for collaboration between modelers and geneticists in ideotype breeding for high crop yield. The advent of molecular markers enables variation of a complex trait to be dissected into the effects of quantitative
trait loci (QTL) and assists the transfer of these QTL into desired
cultivars or lines. A recent study in which QTL information was linked
to crop modeling has shown that QTL analysis removes part of random
errors of measured model input parameters and that QTL information
can successfully be coupled with crop models to replace measured
parameters. The QTL-based modeling overcomes the limitations in designing ideotypes by using models that ignore the inheritance of model
input traits. On the other hand, crop modeling can potentially be a
powerful tool to resolve genotype ⫻ environment interactions and to
dissect yield into characters that might be under simpler genetic control. Based on the complementary aspects of crop modeling and QTL
mapping, we propose an approach that integrates marker-assisted
selection into model-based ideotype framework to support breeding
for high crop yield. For this approach to be effective, there is a need
to develop crop models that are capable of predicting yield differences
among genotypes in a population under various environmental conditions.
B
reeding for high-yielding crop cultivars for specific environments is a major challenge to feed growing world populations. Through extensive selection, based
largely on empirical field observations, breeders have
been successful in creating high-yielding cultivars. In
many instances, progress has been attained from changes
in a relatively few genes, e.g., those involved in plant
height and photoperiodism (Miflin, 2000). However,
further improvement has been increasingly complex and
difficult (Bindraban, 1997), and in some crops such as
rice (Oryza sativa L.), no progress in increasing yield
potential has been achieved in the past decades (Peng
et al., 1999). Difficulties in manipulating yield are related to its genetic complexity: polygenic nature, interactions between genes (epistasis), and environmentdependent expression of genes (Ribaut and Hoisington,
1998). For more efficient crop improvement, joint interdisciplinary ventures to develop new knowledge and tools
are increasingly becoming important (Shorter et al., 1991).
Besides recent developments in genomics (such as
genome sequencing) that will provide useful tools and
X. Yin and M.J. Kropff, Crop and Weed Ecol. Group, Wageningen
Univ., P.O. Box 430, 6700 AK Wageningen, the Netherlands; P. Stam,
Lab. of Plant Breeding, Wageningen Univ., P.O. Box 386, 6700 AJ
Wageningen, the Netherlands; and A.H.C.M. Schapendonk, Plant
Dynamics, Englaan 8, 6703 EW Wageningen, the Netherlands. Joint
contrib. from Plant Res. Int. and the C.T. de Wit Graduate School for
Prod. Ecol. and Resour. Conserv. Received 1 May 2001. *Corresponding author ([email protected]).
Abbreviations: AFLP, amplification fragment length polymorphism;
G ⫻ E, genotype ⫻ environment interaction; h2, heritability; MAB,
marker-assisted breeding; QTL, quantitative trait locus or loci; QTL ⫻
E, quantitative trait loci ⫻ environment interaction; RFLP, restriction
fragment length polymorphism; RIL, recombinant inbred line; SLA,
specific leaf area.
Published in Agron. J. 95:90–98 (2003).
90
YIN ET AL.: THE COMPLEMENTARITY OF CROP MODELING AND QTL MAPPING
ity between relevant physiological processes and responses of these processes to environmental variables
and therefore allow yield predictions not restricted to
the environments where the model parameters are derived. As model parameters can represent certain genetic characteristics, crop modeling has been considered
a useful tool to assist breeding (Loomis et al., 1979; Whisler et al., 1986; Boote et al., 1996). Shorter et al. (1991)
proposed collaborative efforts among breeders, physiologists, and modelers, using models as a framework to
integrate physiology with breeding. For such, understanding the inheritance of the model parameters is required (Stam, 1998).
An important development during the last decade in
quantitative genetics was the ability to identify genome
regions responsible for variation of a trait due to the
advent of molecular markers (Paterson et al., 1988). The
term QTL has come to refer to polygenes underlying a
quantitative trait. Numerous studies have been reported
on identifying QTL for various traits in humans, animals, and plants. Similar to other quantitative traits,
individual input parameters of a crop model are amenable to QTL analysis (Yin et al., 1999b).
In this paper, we discuss potentials and limitations
of crop modeling and QTL analysis in assisting plant
breeding. The complementary aspects of crop modeling
and QTL analysis are explored to develop an integrated
approach for ideotype breeding.
CROP MODELING AS A TOOL
TO ASSIST PLANT BREEDING
Generally, crop models require two types of inputs:
environmental inputs (i.e., weather variables and management options) and physiological inputs. The latter
inputs are used as model parameters for characterizing
genotypic differences. These parameters are also referred
to as genetic coefficients (Hunt et al., 1993; White and
Hoogenboom, 1996; Mavromatis et al., 2001) or model
input traits (Yin et al., 2000a), reflecting the awareness
that model input parameters may be under genetic control.
Given the expectation that crop models based on
physiologically sound mechanisms have the potential to
quantify and integrate crop yield responses to genetic
and environmental factors, physiologists and modelers
have explored potential uses of crop models in various
aspects of breeding:
• To identify main yield-determining traits (Bindraban, 1997; Yin et al., 2000b)
• To define optimum selection environments (Aggarwal et al., 1997a)
• To optimize single-trait values (Boote and Tollenaar, 1994; Setter et al., 1995; Yin et al., 1997)
• To design ideotypes consisting of multiple traits
(Penning de Vries, 1991; Dingkuhn et al., 1993;
Kropff et al., 1995; Haverkort and Kooman, 1997)
• To assist multilocation testing (Dua et al., 1990)
and explain genotype ⫻ environment interactions
(G ⫻ E) (Mavromatis et al., 2001)
A common endpoint of these studies, based on model
91
simulations, is suggestions that breeders may use. Given
that direct experimental confirmation and objective
comparisons of modeled suggestions with those already
used in breeding programs are rare, Stam (1998), from
a geneticist’s and breeder’s point of view, expressed
his concerns about this model-based approach. First, a
practical problem of breeding is that the majority of
model input traits to be assessed are difficult to accurately measure. Second, the inheritance of the model
input traits is largely unknown. For example, in designing an ideotype by modeling, it is assumed, either tacitly
or explicitly, that these traits can be combined at will
in a single genotype. Such an assumption ignores the
possible existence of constraints and correlations among
the traits. Constraints might be imposed simply by the
fact that little genetic variation exists in the genetic
material available for selection. Thus, models may not
identify those traits for which gain via breeding may be
easiest (Jackson et al., 1996). Correlations between the
traits, due either to a tight linkage between QTL or to
a single QTL that affects multiple traits (pleiotropy),
may seriously hamper the realization of an ideotype.
After all, plant breeding is genetic improvement; knowledge of the genetic basis of phenotypic variation,
whether described in terms of conventional agronomic
traits or model input traits, is crucial for a successful
breeding program (Stam, 1998). To assist the development of efficient breeding strategies, crop modeling requires understanding of the inheritance of the factors
that determine crop growth (Shorter et al., 1991).
White and Hoogenboom (1996) presented a model
for bean (Phaseolus vulgaris L.) in which the genetic
control of model parameters was considered. They applied linear regression to estimate values of more than
20 model input traits from information about alleles
(variants at a gene locus) of seven known genes in the
cultivars studied. This approach, however, assumes that
all the traits were controlled by pleiotropic effects of the
seven genes, ignoring possible additional trait-specific
genes. Advances in quantitative genetics, by mapping
trait-specific QTL, can help to broaden insight in the
genetic basis of crop traits.
QTL MAPPING AND ITS APPLICATIONS
In genetics, the distance between genes on the genome is assessed on the basis of the frequency of recombination of the genes, estimated from scoring genotypes
of progeny of a cross (Kearsey and Pooni, 1996). Mapping quantitative traits is difficult because the genotype
is never unambiguously inferred from the phenotype.
Classical quantitative genetics pursues a different approach, using statistical concepts such as means, variances, correlations, heritabilities (h2), built on assumptions, e.g., that effects of individual genes on a trait are
small and additive. This assumption sheds little light,
if any, on the individual genes themselves (Prioul et
al., 1997).
To map quantitative traits, supplementary information from recognizable single-gene loci is required. First
attempts to link a quantitative trait to a major gene
92
AGRONOMY JOURNAL, VOL. 95, JANUARY–FEBRUARY 2003
Fig. 1. Seed weight of three F2 seed-color genotypes of a cross between
two lines of bean (data of Sax, 1923).
locus in plants date back to Sax (1923), who studied
seed weight and color in an F2 of a cross in bean. Seed
color involved the segregation of a single gene, P/p.
Seed weight differed among the three color genotypes
(Fig. 1), indicating that either the P/p locus had a pleiotropic effect on seed weight or there was a QTL for
weight closely linked to the P/p locus.
Major gene mutants, however, are scarce and may
not exist in a population under study. Because QTL may
occur throughout the genome, a large number of gene
markers are required to locate them. Early studies of
quantitative traits suffered from the lack of major-gene
markers that could make a complete map. This problem
was overcome with the realization that maps could be
constructed using pieces of chromosomal DNA as markers (Botstein et al., 1980). The first DNA-based molecular markers were fragments produced by restriction enzyme digestion, designated as restriction fragment length
polymorphism (RFLP). The RFLP markers are naturally occurring, abundant in most species, and simply
inherited Mendelian characters. Unlike major-gene mutant (or morphological) markers, those of RFLP are not
true genes because a gene codes for a gene product,
whereas an RFLP does not because it is possibly the
result of a single base change in a noncoding genome
region (Kearsey and Pooni, 1996). There is a growing
number of alternative markers, e.g., amplification fragment length polymorphism (AFLP), based on small differences in base sequence; information about them is
now widely available (e.g., Staub and Serquen, 1996;
Jones et al., 1997).
Until the mid-1980s, most of the mapped genes were
mutant genes with clear phenotypes; mapping gradually
progressed by looking at progeny from crosses between
the carrier of the new gene and genotypes carrying already mapped gene(s) (e.g., Woodward, 1957). With the
advent of DNA markers, we are in the position of analyzing a large number of recognizable loci segregating
simultaneously in the same cross. To handle the large
number of loci, a variety of software (e.g., Stam, 1993)
has been developed to establish the overall map that
gives the best fit to the combined data. An example of
such a map, using AFLP produced from a cross between
two barley cultivars, is illustrated in Fig. 2 for chromosome 3. Needless to say, information about the position
Fig. 2. Amplification fragment length polymorphism (AFLP) marker
map for chromosome 3, established by the use of software JoinMap
(Stam, 1993), for Apex ⫻ Prisma recombinant inbred line population of barley (redrawn from Yin et al., 1999b). The AFLP markers
are labeled E45M55-408, E44M58-196, etc., and their map positions
are counted in genetic map units, cM (1 cM corresponds to 1%
recombination per meiosis), from the top terminal marker. The
position of the denso dwarfing gene is highlighted.
of mutant loci is important for building a DNA-marker
map because previously mapped mutant loci (e.g., the
denso locus in Fig. 2) can be used as anchor markers that
assign new markers to different chromosomal groups. A
growing number of map databases in plants now become
YIN ET AL.: THE COMPLEMENTARITY OF CROP MODELING AND QTL MAPPING
accessible through the web sites (e.g., http://www.nal.
usda.gov/pgdic; verified 11 Sept. 2002).
A marker linkage map can be used to localize QTL
for a quantitative trait, as first demonstrated by Paterson
et al. (1988). The basis of all QTL detection is the statistical analysis of associations between markers and trait
values (Fig. 3). Statistical techniques for using a marker
map to detect QTL have reached a fairly high level of
sophistication, but improvements are still being made
(Kearsey and Farquhar, 1998). A widely used method was
interval mapping (Lander and Botstein, 1989). Other
approaches, e.g., the multiple QTL method (Jansen,
1995), were developed to detect multiple linked QTL.
However, a QTL detected by any technique is not a
true gene, only the indicated genome region that most
likely contains gene(s) for the trait under study.
The number of research reports on QTL analysis of
specific crop traits, using the methodology outlined
above, is growing rapidly. Almost all studies, regardless
of the crop or trait to which they are applied, have come
to support a main result of the first study (Paterson et
al., 1988) that, in a given cross, a small number of QTL
explained a large part of the genetic variation, even for
highly complex traits. This result differs from the assumption of classical quantitative genetics of the effects
of many genes with small and similar actions.
Two complementary uses of the QTL approach have
emerged: the fundamental and the applied (Prioul et al.,
1997). The first use, which is of interest to physiologists,
targets QTL by determining their contribution to physiological components of macroscopic traits. Not only does
the QTL approach provide unequivocal answers to a
range of physiological questions, it also generates new
insight into the causality between components that would
have been difficult to obtain by conventional physiological approaches (e.g., Simko et al., 1997). The importance
of the QTL approach is shown in a special issue of New
Phytologist [1997, 137(1)], which was entirely devoted
to proselytizing physiologists to take a genetic approach.
The second use of the QTL studies, which is of interest
to breeders, is marker-assisted breeding (MAB). This
approach uses markers for tagging QTL of interest so
as to pyramid favorable QTL alleles and break their
linkage with undesirable alleles (Lee, 1995; Ordon et al.,
1998; Ribaut and Hoisington, 1998). An apparent use of
MAB is the marker-steered introgression with valuable
single genes from exotic donors to enhance elite breeding material (Stam, 1998), which allows faster recovery
of the recipient-parent genome than the conventional
recurrent backcrossing (Ribaut and Hoisington, 1998).
As alien species or landraces are rich in resistance genes
and resistances are simply inherited relative to yield
traits, the application of markers for tagging of resistance genes in major crops has progressed rapidly (Ordon et al., 1998).
A major challenge for MAB is to deal with traits
controlled by multiple interactive and environmentdependent QTL, such as yield and yield-relating traits
that often have a low h2. Genetic simulation studies (e.g.,
Van Berloo and Stam, 1998) have shown that MAB
can be superior to the conventional phenotype-based
approach for traits of low h2, and there is some evidence
93
Fig. 3. The QTL mapping of six traits in barley (data of the 1997 field
experiment reported by Yin et al., 1999b). The interval mapping
method (Lander and Botstein, 1989) was used. By moving the
position of the putative QTL along the genetic map (horizontal
axis), a profile of the test statistics, QTL likelihood (LOD), was
produced for each chromosome, and results are given here for
chromosome 3. The peak of the LOD profile indicates the most
likely position of a QTL affecting the trait under study. Results
indicate a major QTL at 126.4 cM on chromosome 3 (note that
this is also the position of the denso gene as shown in Fig. 2). A
in each figure refers to the additive effect of this QTL on the
trait, defined as (mean of dwarf genotypes ⫺ mean of nondwarf
genotypes)/2. DS, developmental stage.
94
AGRONOMY JOURNAL, VOL. 95, JANUARY–FEBRUARY 2003
that marker-facilitated backcrossing can be employed
to manipulate and improve grain yield in maize (Zea
mays L.) (Stuber et al., 1999). However, in most cases,
the superiority of MAB has not been convincingly demonstrated experimentally (Ribaut and Hoisington, 1998).
Manipulating these traits is difficult because of their
intrinsic complexities: polygenic control, epistasis, and
G ⫻ E. Existing QTL detection methods do not seem
to have the required precision to deal with these complexities. With traits like yield that have a low h2, many
QTL may be segregating. The QTL with major effects
are easily manipulated by empirical breeding practices
and may already be fixed in many breeding lines. It would
be more productive to use marker technology as a means
for placing greater emphasis on those QTL that show
only relative minor effects (Stuber et al., 1999). The
location of minor QTL identified by existing mapping
methods may have wide confidence intervals. The most
likely location of a useful QTL may appear to be between a pair of markers, but it could actually be as far
as 20 cM away (Kearsey and Farquhar, 1998). While
recent multiple-QTL methods (e.g., Jansen, 1995) can
reduce confidence intervals of QTL locations (Fig. 4A)
and resolve two or more linked QTL, the efficacy of
Fig. 4. Plot of QTL likelihood (LOD) over chromosome 3 for specific
leaf area (SLA) in barley measured at 27 d after emergence (DAE)
and for SLA corrected at the same developmental stage (DS) of
0.35, roughly equivalent to the time of 27 DAE (redrawn from
Yin et al., 1999a). The horizontal line at a height of 2.5 indicates
the threshold for the presence of a QTL. The dotted curve is from
the use of the interval mapping method (Lander and Botstein,
1989), and the thin continuous one is from the use of the multipleQTL mapping method (Jansen, 1995).
these methods depends on whether markers are evenly
distributed in the map. In principle, epistasis of QTL
can be included within the frame of these multiple-QTL
methods. However, the rapid increase in the number of
parameters, difficulties to decide which interactions to
include, and the computational burden force us to assume the absence of epistasis. Methods have been developed to evaluate QTL ⫻ environment interactions
(QTL ⫻ E) using multiple-environment data (e.g., Jiang
and Zeng, 1995; Van Eeuwijk et al., 2000), but the information obtained cannot be applied to predict phenotypes in independent environments (Stratton, 1998).
COMBINING CROP MODELING
AND QTL MAPPING:
AN EXPLORATIVE STUDY
The first study to explore opportunities of linking crop
modeling with QTL mapping was recently conducted for
barley (Yin et al., 1999a, 1999b, 2000a), using a SUCROS
(Goudriaan and Van Laar, 1994)-based yield prediction
model, SYP-BL (Yin et al., 2000b). Main model input
traits included preflowering duration, postflowering duration, specific leaf area (SLA), leaf N concentration,
and fraction of biomass partitioned to leaves and to
spikes. The QTL approach was applied to these traits,
using a population consisting of 94 recombinant inbred
lines (RILs) from a cross of two-row spring barley cultivars, Apex and Prisma (Yin et al., 1999b). An AFLP
marker linkage map was established for this population
(see Fig. 2 for the case of chromosome 3). By analyzing
the association between trait phenotypes and marker
genotypes of the 94 RILs, QTL were found for each of
the above model input traits (Yin et al., 1999b). Most
traits were associated, though to different extents, with
the major dwarfing gene (with the mutant dwarf allele
from Prisma), denso (also designated as sdw1), which
was mapped at 126.4 cM on chromosome 3 (Fig. 2) by
segregation analysis of the distinctive prostrate juvenile
growth habit. The importance of this gene on a number
of traits, including some model input traits, based on
QTL analysis, is highlighted in Fig. 3. The result with the
RIL population that the major QTL for so many different traits mapped at the same position as the denso
locus is in support of the pleiotropy of this gene (Yin et
al., 1999b). The additive effect of the locus (Fig. 3) indicates the direction of the gene effect on each of these
traits; for example, the dwarf allele is associated with a
prolonged flowering time. This analysis provides direct
evidence for the genetic background and the interdependence of various model input parameters, which has
received little consideration from modelers in designing
ideotypes (Aggarwal et al., 1997b; Stam, 1998).
Physiological aspects of a trait, which have so far
received little attention from geneticists in QTL analysis, were considered, using SLA as the example (Yin et
al., 1999a). The SLA was measured six times: one conducted at the same developmental stage for all RILs
(at flowering), four at specific days before flowering,
and the last one at 14 d after flowering. When the SLA of
each measurement time was directly subjected to analysis,
YIN ET AL.: THE COMPLEMENTARITY OF CROP MODELING AND QTL MAPPING
one to three QTL were detected. The denso gene was
found to affect SLA strongly at all measurement times,
e.g., 27 d after emergence (Fig. 4A), except at flowering.
If the SLA of the different RILs was corrected for differences in physiological age at the time of measurement,
using the phenology submodel in SYP-BL, QTL were
detected for SLA at only three stages. Moreover, the effect of the denso gene was no longer significant during the
preflowering stages, e.g., at developmental stage 0.35
(Fig. 4B). The effect of the denso gene on the SLA
detected in the first instance was therefore the artifact
of its direct effect on the preflowering duration that can
be seen in Fig. 3. This result suggests potential use of
physiology and modeling in QTL analysis. Any further
roles of physiology or modeling should be explored,
especially given that any great change in the reliability
of QTL detection methods can hardly be achieved in
future (Kearsey and Farquhar, 1998).
Next, the identified QTL were coupled to the SYPBL model by replacing the original measured input trait
values with those predicted from the QTL effects (Yin
et al., 2000a). This replacement generated a QTL-based
model for barley, QTL-BL. Yields predicted by both
models correlated with the observed values, despite substantial unexplained variation (Fig. 5). The QTL-BL
model predicted yield differences slightly better than the
SYP-BL model. Similar results were obtained when the
models were applied to a season independent from the
one in which the original input traits used for QTL
analysis were measured (Yin et al., 2000a). The slightly
better performance of QTL-BL could be due to less
random noise in the QTL-based values because the random error in measured model input traits was partly
removed by QTL analysis statistics. However, this advantage of the QTL-BL model is obtained at the cost
of ignoring some genetic effects because all of the QTL
detected for a trait often do not fully explain its genetic
variation. Nonetheless, the correlation between SYPBL- and QTL-BL-predicted yields was high (r ⬎ 0.88),
indicating that QTL information can successfully replace measured parameters (Yin et al., 2000a).
EXPECTATIONS AND
FUTURE PERSPECTIVE
Potential Uses of Crop Models in Plant
Breeding and QTL Mapping
For a model to be an effective tool in breeding, it
must accurately simulate the difference in performance
among relatively similar lines in a population (McLaren,
1995). Obviously, however, there remains substantial
yield variation that was not explained by the existing
model (Fig. 5). Current crop models have to be improved in this context, in considerations of both input
traits and feedback structure. The random errors in input traits of current models are largely caused by field
samplings. The need for destructive samplings to determine input parameters is a major limitation in using
them in breeding, not only because of the required
amount of work but also because of the limited amount
95
Fig. 5. Comparisons between observed grain yields in barley and
those predicted by the two models, QTL-BL and SYP-BL (redrawn
from Yin et al., 2000a). Because the denso gene affected most
model input traits, QTL-BL predicted two clusters in yields that
match the segregation of the gene (the dwarf recombinant inbred
lines had higher yields than the tall lines).
of available material (Aggarwal et al., 1997b; Stam,
1998). Input parameters of current crop models may
vary with environment (Yin et al., 2000b). Model parameters have to be environment independent to enable the
models to extrapolate G ⫻ E, the expected advantage of
process-based crop models over any data-based genetic
G ⫻ E models (Shorter et al., 1991; Hunt et al., 1993).
While there is evidence that current crop models can
partially reproduce the observed G ⫻ E in cultivar performance trials (Mavromatis et al., 2001), developing
models that can accurately predict G ⫻ E on yields in
a population is a major challenge for modelers.
If models are capable of predicting G ⫻ E in a population, they can assist QTL analysis to resolve QTL ⫻ E, a
major problem that hampers the use of MAB in practical
breeding (Lee, 1995; Ribaut and Hoisington, 1998). The
QTL ⫻ E is commonly seen when growing a mapping
population under a range of environments. An example
of this is flowering time in Arabidopsis spp., examined
under various daylength and vernalization regimes (Jansen, 1995). It turned out that daylength and/or vernaliza-
96
AGRONOMY JOURNAL, VOL. 95, JANUARY–FEBRUARY 2003
than to resolve G ⫻ E. It could be demonstrated first
for relatively simple traits (such as time to flowering)
or in species with simple genetic makeup (such as Arabidopsis spp.) through simulating relevant biochemical
pathways.
Integration of Crop Modeling and QTL
Mapping into a Breeding Strategy
Fig. 6. Proposed framework of combining crop modeling and QTL
mapping for an integrated approach to select crop ideotype for a
specific environment. The dotted part in the figure is optional
for this framework because development of crop models that are
capable of resolving epistasis may take a long-term effort. G ⫻ E,
genotype ⫻ environment interaction; AFLP, amplification fragment length polymorphism.
tion influence the effect of some QTL, indicating QTL ⫻
E in a statistical sense. However, this information on
interaction cannot be applied to new environments (Stratton, 1998). From a physiologists’ or modelers’ point of
view, the impact of environments has to be minimized
to identify the true genetic effect. Phenology models
separate different aspects of flowering responses to photothermal environments (Atkinson and Porter, 1996).
Parameters in a physiologically robust phenology model
are genetically determined and are not altered by environment but predict flowering date of genotypes in a
wide range of environments (Roberts et al., 1996). It
is therefore expected that the QTL and their effects,
detected for model parameters, will not be environment dependent.
When crop models enter a high-precision stage at
which critical processes are quantified and integrated
at the biochemical level, they could be used to resolve
epistasis, a classical difficulty in genetics. Epistasis is
often found for phenotypes that are achieved through
interactive and interrelated metabolic and ontogenetic
pathways (Lee, 1995). It might be reduced or even disappear if input traits of a model that accounts for interrelations among relevant processes are subjected to analysis.
Such possibility agrees with the awareness of geneticists
that epistasis can often be removed by a physiologically
based scaling of trait values (Kearsey and Pooni, 1996).
It should be acknowledged, however, that use of crop
models to resolve epistasis may be a more difficult task
When crop models advance to the level of reliably
predicting genotype difference, crop modeling could be
integrated into the framework of MAB for an improved
breeding approach (Fig. 6). Within this integrated approach, the crop model is evaluated if it predicts yield
differences among genotypes in a genetic population
under diverse environments; thus, G ⫻ E is interpreted
in terms of a biological, as opposite to statistical, model.
Mapping is performed on input traits of the model to
dissect their variation into individual QTL, which in
turn, will be coupled to the model. Once the physiological and genetic bases of yield responses to environments
are adequately quantified, ideotypes can be proposed
for a specific environment (Atkinson and Porter, 1996)
in terms of the allelic constitution of the QTL for model
input traits that determine yield. This approach overcomes the limitations in designing ideotypes by using
models that ignore genetic constraints and correlations
among the traits. Information obtained can be applied
to any environment because of the high ability of extrapolation of crop modeling. With this integrated approach,
epistasis may also be considered (Fig. 6), but resolving
epistasis needs a long-term strategy.
While the proposed integrated approach could potentially deal with G ⫻ E and epistasis, it cannot solve all
limiting factors in using MAB, especially nontransferability of information obtained from one cross to another. The nontransferability can be largely due to the
possibility that a QTL detected in one cross simply does
not segregate in a second cross because the parents of
the second cross carry identical alleles at that QTL. A
gene important for physiologists or modelers may be
useless for geneticists or breeders because if the gene
is physiologically crucial, its variation will have been
strongly reduced over generations of breeding (Prioul
et al., 1997); so, QTL will hardly be detected at such a
gene locus.
Traditionally, physiologists have worked with only a
few genotypes but measured many characteristics or
processes to understand crop responses to environments.
In contrast, geneticists and breeders usually score a few
traits on many genotypes (often ⬎100) of a segregating
population and rely on selection and statistics to move
the population mean in the desired direction. This fundamental difference has often led geneticists and breeders to be skeptical of using physiological knowledge.
On one hand, our proposed integrated approach does
provide an excellent opportunity for collaboration among
physiologists, modelers, geneticists, and breeders. On the
other hand, implementation of such integrated approach
needs large experiments, assessing many traits in many
genotypes. To reduce this difficulty, the crop model
YIN ET AL.: THE COMPLEMENTARITY OF CROP MODELING AND QTL MAPPING
should be developed such that its input parameters can
be quickly assessed or through the way by which tissue
can be harvested and frozen for later analysis. Options
from genetic studies such as selective mapping (Xu and
Vogl, 2000) should also be considered. Reducing the size
of a mapping experiment with little sacrifice of the power
of QTL detection, as the common interest of geneticists
and physiologists, may represent a specific research area
for their collaborations.
Use of Marker Technology in
Modeling Cultivar Difference
Using regression analysis, Virk et al. (1996) has shown
that variation of many agronomic traits in rice germplasm is associated with allelic variation of markers,
indicating that marker-trait association is present not
only in segregating populations but also across a crop
germplasm or cultivar collection. If this result turns out
to be generally true, QTL-based modeling may be applicable to a germplasm collection, for which important
markers identified by, for example, multiple regression,
are used as the surrogate of QTL. Because the chance
that a specific marker maps to different genome positions in different populations within a species is small
(e.g., Waugh et al., 1997), we could use markers identified from a germplasm collection to infer the position of
QTL controlling the trait. This opportunity is especially
true when integrated marker maps based on acrosspopulation data are becoming available (e.g., Haanstra
et al., 1999). The applicability of marker information
across germplasm or cultivar collection would allow the
genetically based crop modeling to be performed without recourse to the use of a mapping population.
CONCLUSIONS
Crop modeling and quantitative genetics are independently evolving disciplines. Crop models are now increasingly being used to assist plant breeding, in particular, to define crop ideotypes for different environments.
The advent of DNA-based molecular markers and the
development in quantitative genetics of using marker
linkage maps to identify QTL provide a new perspective
for determining crop model input parameters, allowing
the biological meaning of model input parameters to
be more explicit. On the other hand, crop models can
potentially assist QTL mapping, especially in extrapolating information to a new environment and in dissecting
yield into physiological components that are more likely
related directly to gene expression. Based on the complementary aspects of modeling and mapping, an approach was proposed, from the future perspective, to
integrate marker-assisted technology into model-based
ideotype framework as a breeding strategy to increase
crop yield. Developing crop models, which can be sufficiently accurate to model G ⫻ E of a genetic population,
is the major challenge to enable this integrated approach
to be fruitfully used in a breeding program.
97
REFERENCES
Aggarwal, P.K., M.J. Kropff, K.G. Cassman, and H.F.M. ten Berge.
1997a. Simulating genotypic strategies for increasing rice yield potential in irrigated, tropical environments. Field Crops Res. 51:5–17.
Aggarwal, P.K., M.J. Kropff, P.S. Teng, and G.S. Khush. 1997b. The
challenge of integrating systems approaches in plant breeding: Opportunities, accomplishments and limitations. p. 1–23. In M.J.
Kropff et al. (ed.) Applications of systems approaches at the field
level. Kluwer Academic Publ., Dordrecht, the Netherlands.
Atkinson, D., and J.R. Porter. 1996. Temperature, plant development
and crop yields. Trends Plant Sci. 1:119–124.
Bindraban, P.S. 1997. Bridging the gap between plant physiology
and breeding: Identifying traits to increase wheat yield potential
using systems approaches. Ph.D. diss. Wageningen Agric. Univ.,
the Netherlands.
Boote, K.J., J.W. Jones, and N.B. Pickering. 1996. Potential uses and
limitations of crop models. Agron. J. 88:704–716.
Boote, K.J., and M. Tollenaar. 1994. Modeling genetic yield potential.
p. 533–565. In K.J. Boote et al. (ed.) Physiology and determination
of crop yield. ASA, CSSA, and SSSA, Madison, WI.
Botstein, D., R.L. White, M. Skolnick, and R.W. Davis. 1980. Construction of a genetic linkage map in man using restriction fragment
length polymorphisms. Am. J. Hum. Genet. 32:314–331.
Dingkuhn, M., F.W.T. Penning de Vries, and K.M. Miezan. 1993.
Improvement of rice plant type concepts: Systems research enables
interaction of physiology and breeding. p. 19–35. In F.W.T. Penning
de Vries et al. (ed.) Systems approaches for agricultural development. Kluwer Academic Publ., Dordrecht, the Netherlands.
Donald, C.M. 1968. The breeding of crop ideotypes. Euphytica 17:
385–403.
Dua, A.B., F.W.T. Penning de Vries, and D.V. Seshu. 1990. Simulation
to support evaluation of the production potential of rice varieties
in tropical climates. Trans. ASAE 33:1185–1194.
Goudriaan, J., and H.H. Van Laar. 1994. Modelling potential crop
growth processes. Kluwer Academic Publ., Dordrecht, the Netherlands.
Haanstra, J.P.W., C. Wye, H. Verbakel, F. Meijer-Dekens, P. Van
den Berg, P. Odinot, A.W. Van Heusden, S. Tanksley, P. Lindhout,
and J. Peleman. 1999. An integrated high density RFLP-AFLP map
of tomato based on two Lycopersicon esculentum ⫻ L. pennellii F2
populations. Theor. Appl. Genet. 99:254–271.
Haverkort, A.J., and P.L. Kooman. 1997. The use of systems analysis
and modeling of growth and development in potato ideotyping
under conditions affecting yields. Euphytica 94:191–200.
Hunt, L.A., S. Pararajasingham, J.W. Jones, G. Hoogenboom, D.T.
Imamura, and R.M. Ogoshi. 1993. GENCALC: Software to facilitate the use of crop models for analyzing field experiments. Agron.
J. 85:1090–1094.
Jackson, P., M. Robertson, M. Cooper, and G. Hammer. 1996. The role
of physiological understanding in plant breeding, from a breeding
perspective. Field Crops Res. 49:11–37.
Jansen, R.C. 1995. Genetic mapping quantitative trait loci in plants—a
novel statistical approach. Ph.D. diss. Wageningen Agric. Univ.,
the Netherlands.
Jiang, C.J., and Z.B. Zeng. 1995. Multiple trait analysis of genetic
mapping for quantitative trait loci. Genetics 140:1111–1127.
Jones, N., H. Ougham, and H. Thomas. 1997. Markers and mapping:
We are all geneticists now. New Phytol. 137:165–177.
Kearsey, M.J., and A.G.L. Farquhar. 1998. QTL analysis in plants:
Where are we now? Heredity 80:137–142.
Kearsey, M.J., and H.S. Pooni. 1996. The genetical analysis of quantitative traits. Chapman and Hall, London.
Kropff, M.J., A.J. Haverkort, P.K. Aggarwal, and P.L. Kooman. 1995.
Using systems approaches to design and evaluate ideotypes for
specific environments. p. 417–435. In J. Bouma et al. (ed.) Ecoregional approaches for sustainable land use and food production.
Kluwer Academic Publ., Dordrecht, the Netherlands.
Lander, E.S., and D. Botstein. 1989. Mapping Mendelian factors underlying quantitative traits using RFLP linkage maps. Genetics
121:185–199.
Lee, M. 1995. DNA markers and plant breeding programs. Adv.
Agron. 55:265–344.
98
AGRONOMY JOURNAL, VOL. 95, JANUARY–FEBRUARY 2003
Loomis, R.S., R. Rabbinge, and E. Ng. 1979. Explanatory models in
crop physiology. Annu. Rev. Plant Physiol. 30:339–367.
Mavromatis, T., K.J. Boote, J.W. Jones, A. Irmak, D. Shinde, and
G. Hoogenboom. 2001. Developing genetic coefficients for crop
simulation models with data from crop performance trials. Crop
Sci. 41:40–51.
McLaren, C.G. 1995. Combining statistics and crop models for improved plant breeding strategies. p. 41–47. In P.K. Aggarwal et al.
(ed.) Applications of systems approaches in plant breeding. IRRI,
Los Baños, Philippines.
Miflin, B. 2000. Crop improvement in the 21st century. J. Exp. Bot.
51:1–8.
Normile, D. 1999. Crossing rice strains to keep Asia’s rice bowls
brimming. Science 283:313.
Ordon, F., W. Wenzel, and W. Friedt. 1998. Recombination: Molecular
markers for resistance genes in major grain crops. Prog. Bot. 59:
49–79.
Paterson, A.H., E.S. Lander, J.D. Hewitt, S. Peterson, S.E. Lincoln,
and S.D. Tanksley. 1988. Resolution of quantitative factors by using
a complete linkage map of restriction fragment length polymorphisms. Nature (London) 335:721–726.
Peng, S., K.G. Cassman, S.S. Virmani, J. Sheehy, and G.S. Khush.
1999. Yield potential trends of tropical rice since the release of
IR8 and the challenge of increasing rice yield potential. Crop
Sci. 39:1552–1559.
Penning de Vries, F.W.T. 1991. Improving yields: Designing and testing VHYVs. p. 13–19. In F.W.T. Penning de Vries et al. (ed.)
Systems simulations at IRRI. IRRI Res. Paper 151. IRRI, Los
Baños, Philippines.
Prioul, J.L., S. Quarrie, M. Causse, and D. de Vienne. 1997. Dissecting
complex physiological functions through the use of molecular quantitative genetics. J. Exp. Bot. 48:1151–1163.
Rasmusson, D.C. 1987. An evaluation of ideotype breeding. Crop
Sci. 27:1140–1146.
Rasmusson, D.C. 1991. A plant breeder’s experience with ideotype
breeding. Field Crops Res. 26:191–200.
Ribaut, J.-M., and D. Hoisington. 1998. Marker-assisted selection:
New tools and strategies. Trends Plant Sci. 3:236–238.
Roberts, E.H., A. Qi, R.H. Ellis, R.J. Summerfield, R.J. Lawn, and S.
Shanmugasundaram. 1996. Use of field observations to characterise
genotypic flowering responses to photoperiod and temperature: A
soyabean exemplar. Theor. Appl. Genet. 93:519–533.
Sax, K. 1923. The association of size differences with seed-coat pattern
and pigmentation in Phaseolus vulgaris. Genetics 8:552–560.
Setter, T.L., E.A. Conocono, J.A. Egdane, and M.J. Kropff. 1995.
Possibility of increasing yield potential of rice by reducing panicle
height in the canopy: I. Effects of panicles on light interception
and canopy photosynthesis. Aust. J. Plant Physiol. 22:441–451.
Shorter, R., R.J. Lawn, and G.L. Hammer. 1991. Improving genotypic
adaptation in crops—a role for breeders, physiologists and modelers. Exp. Agric. 27:155–175.
Simko, I., S. McMurry, H.M. Yang, A. Manschot, P.J. Davies, and
E.E. Ewing. 1997. Evidence from polygene mapping for a causal
relationship between potato tuber dormancy and abscisic acid content. Plant Physiol. 115:1453–1459.
Stam, P. 1993. Construction of integrated genetic linkage maps by
means of a new computer package: JoinMap. Plant J. 3:739–744.
Stam, P. 1998. Crop physiology, QTL analysis and plant breeding. p.
429–440. In H. Lambers et al. (ed.) Inherent variation in plant
growth: Physiological mechanisms and ecological consequences.
Backhuys Publ., Leiden, the Netherlands.
Staub, J.E., and F.C. Serquen. 1996. Genetic markers, map construction, and their application in plant breeding. HortScience 31:729–741.
Stratton, D.A. 1998. Reaction norm functions and QTL-environment
interactions for flowering time in Arabidopsis thaliana. Heredity
81:144–155.
Stuber, C.W., M. Polacco, and M.L. Senior. 1999. Synergy of empirical
breeding, marker-assisted selection, and genomics to increase crop
yield potential. Crop Sci. 39:1571–1583.
Van Berloo, R., and P. Stam. 1998. Marker-assisted selection in autogamous RIL population: A simulation study. Theor. Appl. Genet.
96:147–154.
Van Eeuwijk, F.A., J. Crossa, M. Vargas, and J.-M. Ribaut. 2000.
Variants of factorial regression for analysing QTL by environment
interaction. p. 107–116. In A. Gallais et al. (ed.) Quantitative genetics and breeding methods. Proc. Meet. of the EUCARPIA Section
Biometr. in Plant Breeding, 11th, Paris. 30 Aug.–1 Sept. 2000.
INRA Editions, Versailles Cedex, France.
Virk, P.S., B.V. Ford-lloyd, M.T. Jackson, H.S. Pooni, T.P. Clemeno,
and H.J. Newbury. 1996. Predicting quantitative variation within
rice germplasm using molecular markers. Heredity 76:296–304.
Waugh, R., N. Bonar, E. Baird, B. Thomas, A. Graner, P. Hayes, and
W. Powell. 1997. Homology of AFLP products in three mapping
populations of barley. Mol. Gen. Genet. 255:311–321.
Whisler, F.D., B. Acock, D.N. Baker, R.E. Fye, H.F. Hodges, J.R.
Lambert, H.E. Lemmon, J.M. McKinion, and V.R. Reddy. 1986.
Crop simulation models in agronomic systems. Adv. Agron. 40:
141–208.
White, J.W., and G. Hoogenboom. 1996. Simulating effects of genes
for physiological traits in a process-oriented crop model. Agron.
J. 88:416–422.
Woodward, R.W. 1957. Linkages in barley. Agron. J. 49:28–32.
Xu, S.Z., and C. Vogl. 2000. Maximum likelihood analysis of quantitative trait loci under selective genotyping. Heredity 84:525–537.
Yin, X., S.C. Chasalow, C.J. Dourleijn, P. Stam, and M.J. Kropff.
2000a. Coupling estimated effects of QTLs for physiological traits
to a crop growth model: Predicting yield variation among recombinant inbred lines in barley. Heredity 85:539–549.
Yin, X., M.J. Kropff, P.K. Aggarwal, S. Peng, and T. Horie. 1997.
Optimal preflowering phenology of irrigated rice for high yield
potential in three Asian environments: A simulation study. Field
Crops Res. 51:19–27.
Yin, X., M.J. Kropff, J. Goudriaan, and P. Stam. 2000b. A model
analysis of yield differences among recombinant inbred lines in
barley. Agron. J. 92:114–120.
Yin, X., M.J. Kropff, and P. Stam. 1999a. The role of ecophysiological
models in QTL analysis: The example of specific leaf area in barley.
Heredity 82:415–421.
Yin, X., P. Stam, C.J. Dourleijn, and M.J. Kropff. 1999b. AFLP mapping of quantitative trait loci for yield-determining physiological
characters in spring barley. Theor. Appl. Genet. 99:244–253.