Annals of Botany 88 (Special Issue): 703-712, 2001 doi:10.1006/anbo.2001.1481, available online at http://www.idealibrary.com on ® IDEA- Developing Multisite Dynamic Models of Mixed Species Plant Communities J. CONNOLLY* and M. WACHENDORFt Department of Statistics, University College Dublin, Belfield, Dublin 4, Ireland Received: 31 October 2000 Returned for revision: 4 January 2001 Accepted: 2 May 2001 Data on the development of two white clover cultivars (AberHerald and Huia) in mixed clover/ryegrass swards were available at 12 sites in Europe from experiments conducted for several years under a common protocol. Swards were measured up to seven times over winter and up to seven times over the growing season. In the overwintering period, detailed morphological measurements were taken for clover at each sampling time and, during the growing season, the clover contribution to total available biomass was recorded. Detailed meteorological data were available at all sites. The development of these clover/ryegrass communities over time and sites was modelled. The modelling strategy had three main elements: (a) division of the annual growth cycle of the clover/ryegrass community into a number of functional periods; (b) development of relationships within each functional period using models incorporating plotlevel biotic variables characterizing each community at the start of the period and site-level climatic and management variables measured during the period; and (c) introduction of a dynamic element by linking the models across functional periods. The response variable(s) for a functional period was the biotic independent variable(s) of the succeeding period. The object was to produce a dynamic series of models in which community development within and across sites was described as a resultant of the initial state of the community and climatic and other forces acting on it. The analysis used a mixed models technique in recognition of the complex error structure of the data. Various statistical aspects of the modelling are discussed including the models and fitting strategy used, the complexity of the error structure in an experiment that includes sites and years, and the desirability of transforming certain variables before modelling. The issues in presenting the results of a series of complex models are discussed and a graphical/ tabular approach is outlined. © 2001 Annals of Botany Company Key words: Meta-analysis, multisite, community dynamics, competition, mixed model, white clover, Trifolium repens, perennial ryegrass, Lolium perenne, AberHerald, Huia, dynamic model. INTRODUCTION In plant breeding where cultivars have to prove themselves across a range of different environments multisite experiments with plants are quite standard (e.g. Talbot and Verdooren, 1996), but they are otherwise relatively rare (e.g. Reader et al., 1994), especially for mixed species commu- nities. Although meta-analysis, the combination of information from many experiments, is of growing importance in plant studies (e.g. Curtis and Wang, 1998; Xiong and Nilsson, 1999; Poorter and Nagel, 2000) the experiments being combined rarely share a common experimental structure and philosophy, or a common measurement framework. This inevitably leads to a somewhat simple combined analysis. Few experiments with mixed plant communities track the system with detailed measurements through a number of annual cycles. Hence, the data from a common experiment on clover/ryegrass mixtures performed at 12 European sites over several years (Wachendorf et al., 2001a, b) provide a rare opportunity for modelling dynamic community relationships from a broad and complex body of data. The results of the analysis of these data are reported in Wachendorf et al. (2001a, b) and this paper discusses the background to their analysis. * For correspondence. E-mail john.connolly(a,uccl.ie t Current address: Institute of Crop Science and Plant Breeding, Department of Grass and Forage Science, University of Kiel, Olshausenstr.40, 24098 Kiel, Germany. 0305-7364/01/100703 + 10 $35.00/00 Successive sections describe the experiments, consider the principles underlying the modelling strategy, and discuss various statistical modelling issues including methods for presenting the results from a series of linked dynamic models. The approach deals with difficulties in combining data over many sites and also develops perspectives relevant to evaluating data when information is available on several phases of the annual cycle of mixed species plant communities. DESCRIPTION OF EXPERIMENTS A detailed description of the experiments is given in Wachendorf et al. (2001a) and is summarized here. A common experiment was repeated at 12 sites in Europe ranging from Iceland to Italy, and Ireland to Finland. Generally, a common experimental protocol covering design and measurement was followed. Experiments in different sites began in different years and generally lasted for 3-4 years. Usually measurement commenced after an establishment phase of 1 year. At each site, two clover cultivars were compared, each grown with a companion ryegrass in a randomized complete block design with three to four replicates of each clover/ryegrass mixture. The clover/ryegrass mixtures were sown by broadcasting using 3 kg ha - l of one of two white clover cultivars (Trifolium repens cultivars AberHerald and Grasslands Huia, both medium-sized leaf) mixed with perennial ryegrass (in most t 2001 Annals of Botany Company 704 Connolly and Wachendorf- Multisite Dynamic Models of Mixed Species Plant Communities cases Lolium perenne Preference). Nitrogen fertilizer was applied in equal amounts at the start of the growing season and after each harvest. The total quantity of nitrogen used varied among sites and, at some sites, among years. Morphological measurements were made on four to seven occasions during the winter. The first sampling took place approx. 35 d after the last harvest (mean over sites) before the onset of severe frost. The winter sampling period ended before a considerable accumulation of biomass had taken place. Up to seven harvests were taken during the growing season. The first harvest was taken when the sward height reached 20 cm and the last was taken when no more biomass accumulation was detectable, but no later than the end of October. For each morphological sampling, three to four cores per plot, 12 cm in diameter and 10 cm deep, were taken at a distance of at least 30 cm from each other, avoiding previously sampled locations. Samples were pooled and a representative subsample was taken for processing. A range of morphological characteristics was measured for each plot. These included clover mass variables [clover dry mass (DM), DM of stolons, petioles, buds and nodal roots], clover morphological variables (number of terminal buds, total buds, nodal roots and tap roots, leaf area index, stolon length), clover biochemical variables [total non-structural carbohydrates (TNC) and water soluble carbohydrates (WSC) of stolons], grass variables (grass DM, tiller density) and combined variables (clover DM as a percentage of total DM). Swards were harvested two to seven times during the growing season, cut to a stubble height of 5-7 cm. The fresh weight of the harvested material was estimated from sub-plots of 0.25 to 1 m2 within each plot generally cut to a stubble height of 5-7 cm. Dry matter yield of the total sward and of the grass, clover and unsown species components was recorded. For all sites, climatic variables (air temperature, radiation, precipitation) were typically recorded daily. PRINCIPLES OF THE MODELLING STRATEGY The object was to produce a series of models in which community development within and across sites was described as a resultant of the initial state of the community, applied treatments and forces common to all plots at a site (climate, management) acting on it. The modelling strategy had three main elements: (a) the annual growth cycle of the clover/ryegrass community was divided into a number of functional periods; (b) relationships were developed for each functional period, modelling some measure(s) of community performance, typically at the end of each period. Variables in the models included biotic variables measured on each plot at each site and characterizing community status at the start of the period, applied treatments (clover cultivar) and climatic and other variables measured at the site level during the period; and (c) a dynamic element was introduced by linking the models across functional periods. This was done by taking as response variable(s) for a functional period the biotic variable(s) that characterized initial community status in the model for the succeeding period. Some elements of this approach are similar to methods for models of single crops described by France and Thornley (1984) and of cloverjryegrass dynamics under mixed and mono-grazing (Nolan et al., 2001). (a) Division of annual cycle into functional periods At each site, community development followed a particular rhythm through its annual cycle, from overwintering through initiation of growth to growing season growth. The timing of these phases varied widely across sites and was reflected in the timing of morphological and production measurements at each site, leading to a data set uncoordinated in calendar time across sites. In consequence, a calendar-based approach to modelling (which might suffice at any one site) was regarded as inappropriate. Instead, the annual life history cycle of the community was broken into four functional periods (Fig. 1): Summer-from the first to the last growing season cut; Spring- from the final morphological measurement to the first growing season cut; Winter- from the first to the last morphological measurement; and Autumn-from the last growing season cut to the first morphological measurement. The mean, maximum and minimum durations of each of these periods are shown in Table I. Modelling plant behaviour and community development within each of these periods reduced the statistical difficulty and facilitated interpretation by comparing a common phase of the life cycle of the community across all sites. (b) Modelling using plot and site independent variables Each plot is a community whose development can be affected by initial conditions that differ from other plots at the same site, even those treated identically. Plant growth and interaction in a community during a functional period is largely a result of sward state at the start of the period, clover cultivar and forces (e.g. climate, soil, management) common to all plots at a site in a particular year during the functional period. System responses (e.g. clover content, clover leaf area index or grass tiller density at the end of, or averaged over, a period) were modelled for each functional period. Some independent variables for each model were measured at the plot and some at the site level. Plot variables are biotic descriptors characterizing the status of the community at the start of the functional period (e.g. clover content, grass tiller density, clover leaf area index). Site variables (e.g. temperature, radiation, precipitation, management, soil type) characterize growing conditions at different sites and have a common value for all plots within a site for a given functional period and year. Models also included cultivar effects. In this approach the prior history of swards is taken to be largely integrated into the observed sward state at the start of the period of interest, and other facets of the plots that are not measured are taken to be largely allowed for by the Connolly and Wachendorf-Multisite Dynamic Models of Mixed Species Plant Communities 705 FIG. 1. Organization of modelling of system over four functional periods. TABLE 1. Response variables, plot and site independent variables in addition to clover cultivar, and mean, minimum and maximum duration (d) for each functional period Independent variables Duration (d) Functional period Model Response variables Plot Site* Mean Minimum Maximum Summer Spring A B Clover content Clover content T TS, PP, RRS 129 17 52 9 161 30 Winter Winter Autumn Autumn Summer C1 C2 D1 D2 E Clover leaf area index Grass tiller density Clover leaf area index Grass tiller density Clover content Clover content Clover leaf area index Grass tiller density Clover leaf area index Grass tiller density Clover content Clover content Clover content T, RR T, PP, D PP, RR T, PP, D TS 172 172 37 34 130 81 81 13 15 48 239 239 119 55 204 *C, Clover cultivar; T, mean daily temperature; PP, mean daily precipitation; RR, mean daily radiation; TS, temperature sum; PPS, precipitation sum; RRS, radiation sum; D, duration of period. plot independent variables. Site variables and their interaction with plot variables and clover cultivars are likely to be major determinants of variation among sites and over years. If variation in community behaviour across plots, sites and years can be described by such variables it could lead to a wider understanding of the role of the three types of independent variable in the development of clover/ ryegrass communities and perhaps the degree to which variation in community behaviour is beyond prediction, being linked to unpredictable elements of the environment. It might also provide a useful framework for hypothesis generation and allow an understanding of the likely response at a particular site to variation in the site variables over a number of years. These insights would be difficult to gain at a single site, even for an experiment repeated over 2 or 3 years, since climate would be unlikely to vary sufficiently over years to allow good estimates of their effects and some site variables (e.g. soil) would hardly change at a site. (c) Linking models across.functionalperiods The models in different functional periods were linked to produce a set of dynamic models describing sward development through the annual cycle. The independent plot variable(s) for one functional period becomes the response 706 Connolly and Wachendorf-Multisite Dynamic Models of Mixed Species Plant Communities variable(s) to be modelled in the preceding period (Fig. 1, Table 1). For example, leaf area index at the end of winter was an important determinant of clover content at the end of spring, and, in turn, was modelled using sward variables at the beginning of winter. A number of plot variables were available at the start of each functional period as possible candidates for inclusion as independent variables, particularly when the detailed morphological measurements were available. The selection of the independent variables actually used is described below. Also available for inclusion were climatic variables for each modelling period, mean and cumulative temperature, radiation and precipitation. Since the duration of a functional period differed across sites, mean and cumulative climatic variables are not simply related and may describe different facets of the response. Modelling commenced by examining the determinants of average clover content in swards during summer using clover content at the end of spring as the plot variable. Clover content at the end of spring was then modelled using morphological variables at the end of winter as plot variables, and so on. Finally, to complete the cycle, clover content at the end of summer was modelled using end of spring biotic variables. In all, seven models (Table 1) were required for the four functional periods. A potential difficulty with this approach is that several plot biological variables could emerge as important independent variables for some functional period(s), which could make the modelling of the preceding functional period(s) quite complex, with an expanding number of variables to be modelled at each phase. This did not occur and the chain of models was based on only a few biological variables (Table 1; Wachendorf et al., 2001h). STATISTICAL ISSUES Statistical issues arising in modelling these data include (a) the models and fitting strategy used, (b) the complexity of the error structure in an experiment that includes plots, blocks, sites and years, (c) the desirability of transforming certain variables before modelling and (d) procedures for presenting the results of the models. The modelling of mean clover content during summer (Model A) is used to illustrate various issues discussed below. (a) Models and fitting strategy Preliminary modelling was carried out to identify the relative importance of site, year, clover cultivar and their interactions, all as fixed effects, in determining morphological, chemical and productivity responses (Wachendorf et al., 2001a). For the main modelling (Wachendorf et al., 2001b) linear mixed models (Verbeke and Molenberghs, 1997) were used in each functional period to allow for the estimation of the effects of cultivar and of plot and site variables on responses and for the complex error structure. The general form of the models was the standard mixed model, with fixed and random effects: y = Xfp+Zu+: (1) where y is the response, /i is the vector of fixed effects parameters, X is the matrix of fixed independant variable and factor level indicator values, u is the vector of random effects for each response, other than the residual error, Z (a matrix of zeros and ones) details which random effects are associated with each plot and is a vector of residual error terms which are assumed to have the same variance and to be uncorrelated across plots and years. The form of the random components is discussed below. Modelling was carried out using the REML (restricted maximum likelihood) option of the mixed procedure in SAS. Determination of the significance of fixed terms was through Wald tests with denominator degrees of freedom estimated by Satterthwaite's approximation (SAS Institute, 1992). Selection among different transformations of independent variables was aided by comparison of full maximum likelihood, with the random effects as defined below. The selection of terms for inclusion in the fixed part of the model depended on preliminary analysis, standard statistical model selection methods (Draper and Smith, 1998), the rules of hierarchy and marginality (Nelder. 1994; Nelder and Lane, 1995) and biological insight. Fixed effect terms were included if their significance exceeded the 5 , level. Models containing many fixed terms were initially fitted and then reduced by a combination of statistical testing and biological assessment. The rules of hierarchy ensured that if an interaction was included in the fixed part of the model then the variables (factors) involved in the interaction were also included separately. The modelling obeyed the marginality principle (Nelder and Lane, 1995). which implies that if a term appears as part of a more complex element in the model then, in general, the term is not tested for significance. Thus, if there is an A x B interaction in the model the estimates of A and B are not tested. This is because the meaning of such terms is open to misinterpretation as they depend on the particular way in which SAS sets up the model with interaction and could differ were another package used for analysis. Furthermore, polynomial terms lower in order than the highest term fitted (e.g. a linear term in a quadratic model) are also particularly open to misinterpretation unless the independent variables are centred about their means prior to analysis. As centring carries its own difficulties, uncentred data were used in the analyses and the methods of presentation of results were as described below. Thus, t- and P-values for model coefficients where the term is involved in a higher order term in the model were generally omitted. There is an exception. Where a factor with only two levels [as with the factor clover cultivar (C), where levels 1 and 2 correspond to AberHerald and Huia] interacts only with a variate x, and the model contains no quadratic or higher order term in x, there is a useful interpretation of the coefficient of x. This occurs for models B, Cl and D2 (Wachendorf et al., 2001b). Let the relevant terms in the model be 13lx + Bi3x C. Using the protocol in SAS, the coefficient of x by itself (,) is the slope of the response to .v for Huia and the slope for AberHerald is (fl +3). When this occurs, the recommendation is to present the t- and P-values for the linear coefficient fl and Connolly and Wachendorf-Multisite Dynamic Models of Mixed Species Plant Communities 707 to interpret cultivars. 3 as the change in slope between the two (b) Error structure of models The multisite and multiyear nature of the study suggests a complex error structure and hence the necessity to use a mixed model approach. Random components considered were: site; year within site; block within site; and plot within block within site, which leaves the (repeated measures) yearto-year variation associated with each plot as the residual term. As data for 2 consecutive years only were available for most plots, compound symmetry was assumed for the repeated measures. The meaning of the variance components is as follows. Site, although not chosen randomly, is considered random, conditional on having removed the effects of the site variates included in the model, i.e. the remainder of site differences, having removed the effects of the site variables, are considered as random. The terms involving blocks represent the structure imposed on the experiment within each plot, recognizing that blocks have no relationships across sites. It might be argued that blocks are not random within a site, rather the contrary if an effective blocking strategy has been employed. In the current experiment, the use of biotic covariates seems to have minimized block variation and so it generally appears to contribute little or nothing to the variation (see Table 3, Wachendorf et al., 2001b). The issue of its randomness is, therefore, somewhat academic here. Plot within block within site represents the variation among plots within a block at a site averaged over years, and year within site represents the way in which the site-to-site response (having fitted the model) changes from year to year. Estimates of these components for model A are: 0 for site, 0.274 for year within site, 0.00253 for block within site and 0.00189 for plot within block within site and 0111 for residual. Estimates of these terms are shown for all models in Table 3 of Wachendorf et al. (2001b). The year within site component was large for all models. The inclusion of non-random terms such as site and block as random in mixed models needs to be treated with caution, but sometimes this approach provides the only way of assessing the influence of variables like climate in nonrandomized data sets. It is not always appropriate to assume that such terms are random and normally distributed. The issue is affected by considerations such as there being a sufficient number of levels of the factor(s) and the pattern of the effects for different levels e.g. would site effects, having removed the influence of climate, look like a sample from a normal distribution or are block effects normally distributed, having removed the effects of plot biotic variables? The decision as to whether effects are random or otherwise is discussed in Searle et al. (1992) and the non-normality of random effects in Verbeke and Molenberghs (1997). (c) Transformation of variables Three variables were transformed before inclusion in the models as response variables: clover content, leaf area index and tiller density. Some of the independent variables were also transformed. For more information on the rationale behind transformations see e.g. Sokal and Rohlf (1995). Mean summer clover content (g 100 g-l DM) ranged from 08 to 95-2, and clover content at the end of spring from 0.05 to 99-6. These values are close to the minimum and maximum (0 and 100) achievable and suggest that models in which the untransformed clover content was the response could lead to predicted clover contents outside this range, and might also include interactions induced by scale rather than biology (Cox and Snell, 1989). Under conditions in which levels of clover content are low or close to 100 %, the absolute effects of treatments or other factors/ variables tend to be smaller than at intermediate values, simply because there is less scope for response. For example, a factor that doubles low clover content will change 2 to 4 % at very low levels but will increase 10 to 20 %, a seeming interaction with clover level which obscures the simplicity of the doubling effect of the factor. (Of course such a factor could not work in this way at high clover contents.) To avoid these difficulties, the logit of clover content (CL), defined as log[CL/(100-CL)], was used. The logit transformation tends to give a scale that reduces the effect of interaction simply due to scale and also ensures that the model will lead to predicted clover contents lying within the range 0 to 100. Clover leaf area index is a response variable in models C 1 and D1 (Table 1). In both cases the logarithm of leaf area index was used, as residual plots indicated severe heteroscedasticity for models on the original scale. Residuals from models on the logarithmic scale showed some evidence of greater variance for smaller predicted values for model C1 but not for model Dl. Grass tiller density is a response variable in models C2 and D2. Heteroscedasticity was a problem for models on the natural scale and various transformations (logarithm, inverse and square root) were tested. By far the best of these in terms of eliminating heteroscedasticity was the square root transformation, and this was used in both models. Clover leaf area index (LAI) was important as an independent variable in two models (B and C1, Table 1). It was tested untransformed and on the logarithmic scale. The more appropriate form was checked by comparing the full maximum likelihood under the two alternatives and the maximum was considerably greater for the logarithmic form. This suggests that a fixed proportional increase in LAI, rather than a fixed absolute increase, produced a constant effect on the response. This implies that additional increments of LAI of equal size have less effect on the response the greater the level of LAI. This agrees qualitatively with the well-known phenomenon that the ability of a canopy to capture additional light becomes limited and eventually reaches a plateau as LAI increases, setting a limit to many processes dependent on the products of photosynthesis. Whether the response variable was clover content at the end of spring or LAI at the end of winter (Figs 3 and 4 of Wachendorf et al., 2001b), the response to LAI was positive, but at a decreasing rate in both cases. Other more complex response relationships linked to the physiology of the growth relationships (e.g. 708 Connolly and Wachendorf-Multisite Dynamic Models of Mixed Species Plant Communities France and Thornley, 1984) were not considered for several reasons. The shape of the estimated response relationships agrees with what is expected, and additional terms, testing for a more subtle relationship, were not significant. In addition, more complex relationships (e.g. a curve involving an asymptote) could require inclusion of non-linear elements in the model and this extra complexity was not considered necessary at the level of sophistication attached to these models. Finally, the physiological relationships in a mixed canopy are considerably more complex than in a monoculture, involving interactions between two competing canopies and two LAIs (e.g. Kropf and Spitters, 1991) and it was considered that these data were not an appropriate vehicle for exploring those relationships. Tiller density was a useful independent variable in two models (B and C2, Table 1). Four forms of this variable were compared, untransformed, logarithm, square root and inverse. The full maximum likelihood approach outlined above was also used here. The inverse of tiller density was a better independent variable than the three alternatives, perhaps reflecting the common experience in competition models that plant responses are inversely related to the density of competitors (Suehiro and Ogawa, 1980; Wright, 1981; Spitters, 1983; Firbank and Watkinson, 1985; Connolly et al., 1990; Menchaca and Connolly, 1990). Underlying the analysis of all responses is the assumption that they are normally, and hence symmetrically, distributed on the scale of the analysis and this is checked by examination of residuals. When responses are transformed prior to analysis, predictions from the model on the transformed scale are taken to be symmetrically distributed and the predicted value on the transformed scale is the estimated mean of the population for the set of values of the independent variables used. This has a number of consequences for the reporting of results. All inference, including tests of significance and computation of confidence intervals for predictions or predicted differences, should be carried out on the transformed scale. It may be desirable to back-transform to the scale of the original data for reporting and interpretation. Direct back-transformation of predicted values gives values that are predicted medians on the original scale and are generally not the means of responses on that scale. If the mean is to be predicted on the original scale a correction is required. In the current analysis, back-transformation to the original scale is used without correction and so the values presented graphically for model A (Fig. 2) should be interpreted as medians rather than means. (d) Presentation of results The final model(s) for a functional period may contain many terms, sometimes interacting with each other. The terms in the model may be variables (e.g. leaf area index, average precipitation) or factors (e.g. clover cultivar). The model, expressed as estimates of multiple regression coefficients (e.g. Table 2), may be difficult to appreciate and interpret, even for one familiar with multiple regression, particularly if transformations of response and/or independent variables have been used. Methods of displaying the meaning of the models vary. The broad principle underlying the presentation of many model terms is that predictions are initially made from the model for the term in question on the scale of the analysis [e.g. logit of clover content predicted from log(clover leaf area index)]. These predictions, perhaps back-transformed to the original scale in which the biologist most readily thinks (e.g. clover content), are displayed in graphical or tabular form. Where an independent variable appears in the model in transformed mode [e.g. log(clover leaf area index)], its representation in graphs may again be on the original scale (e.g. clover leaf area index). There are several types of model terms: variables or factors not involved in interactions, variables in higher polynomial (quadratic) forms, two factor interactions of three types (factor by factor, factor by variable and variable by variable) and perhaps more complex terms such as three factor interactions. Presenting the salient features of models may be done through displaying these terms in graphical or tabular form. There are two sides to the choice of scales for presenting results. Graphs on the back-transformd scale may be preferred as unit effects of independent variables on a transformed scale may be difficult to understand, particularly if the independent variable is also transformed. On the other hand, presenting graphical results on the transformed scale(s) but with axes labelled on the original scale(s) is sometimes acceptable to biologists and reduces the difficulty of including inferential information. In some complex models three-dimensional diagrams may display facets of the model better. The model for average summer clover content (Table 2 and Wachendorf et al., 2001b, Table 4) is used as an example to illustrate some of the types of presentation available. As it stands, interpretation of this model is very difficult, since the response variable is transformed to the logit scale and the model contains two interactions and a quadratic term. The interaction between the clover content at the end of spring (SCC) and clover cultivar (SCC x C) is an interaction between a variable and a factor, where C is a factor representing the two clover cultivars, with levels I and 2 being AberHerald and Huia respectively. SCC x T is an interaction between a variable and a variable, where T is mean daily temperature(°C). The model also contains a quadratic effect for clover content at the end of spring (SCC x SCC). Since AberHerald is the first and Huia the second level of C, Huia is used by SAS as the reference cultivar. The coefficients for clover cultivar (C), clover content at the end of spring (SCC) and temperature (T) are not of great interest in themselves. The coefficient for C in Table 2 is the predicted difference between AberHerald and Huia at zero SCC and T; the coefficient for SCC is the slope of the response to spring clover content for AberHerald at zero SCC and the coefficient for T is the slope of the response to temperature for AberHerald at zero SCC. Quantitative interpretation of the interaction terms is also not obvious. A graphical presentation was chosen for model A and for the models in Wachendorf et al. (2001b). Prediction of values to set up the display of model terms and associated tests of significance and measures of variability are derived Connolly and Wachendorf-Multisite Dynamic Models of Mixed Species Plant Communities 709 80 . 60 rr W AberHerald _ 5) U 0 Huia To .)O _ 0 cUI ao E EJ- _ 40 P-values for slopes AberHerald <0.001 Huia <0.001 20 _ 0 Jv 0 20 40 60 80 100 0 20 Clover content at the end of Spring (g 100 g-1 DM) 40 60 80 100 Clover content at the end of Spring (g 100 g- 1 DM) FIG. 2. Model A Summer: Predictions of mean clover content during summer (g 100 g I DM). A, The interaction of clover content at the end of spring (g 100 g- DM) with clover cultivar, and B, the interaction of clover content at the end of spring with mean daily temperature during summer (T) (C). Open horizontal bars indicate the range of values of clover content at the end of spring for which differences between the curves are significant at the 5 % level. The significance of the response to clover content at the end of spring is indicated for each cultivar and each level of temperature. TABLE 2. Details of the modelfor mean clover content during summer (g 100 g-' DM) Effect INTERCEPT Clover cultivar (C)t Clover content at the end of spring (SCC) (g 100 g-' DM) Mean daily temperature (T) (C) SCC x C SCC x T SCC x SCC Estimate s.e. -5.588 -0.225 0.128 0.242 0.0118 -0.00507 -0.00026 1.024 0.103 0.0266 0-0683 0-00268 0.00168 0.000057 d.f. 37 130 173 38 166 169 190 t Pr > t * 4.39 -302 -445 <0.001 0.003 <0.001 The response variable was transformed to the logit scale. The independent variables were the clover cultivar, clover content at the end of spring and mean daily temperature over summer. *t-values and probabilities of main effects were omitted when the effect was included in a significant interaction. tCultivar is coded AberHerald and Huia as levels 1 and 2. for the model in Table 2 and the other models in Wachendorf et al. (2001b) using the LSMEANS/AT, CONTRAST and ESTIMATE options of the SAS MIXED procedure. In making predictions for a variable or factor it is the usual practice to set all the other variables in the model to their mean value and to attach equal weight to each level of other factors in the model. Presentation of the SCC x C interaction (variable x factor). The model is used to predict mean summer clover content (ACC) values for a range of clover contents at the end of spring (SCC) for both AberHerald and Huia. Predictions are on the logit scale and are back-transformed to the clover content scale (ACC as a percentage of total dry matter) using Predicted ACC 100 x exp[predicted logit(ACC)] 1 + exp[predicted logit(ACC)] These values are plotted for AberHerald and Huia against clover content at the end of spring (Fig. 2A). Measures of the statistical importance of features observed in Fig. 2A are presented in two ways. The cultivar effect is tested for significance for several values of SCC. The range of values of SCC for which the P-value for the cultivar comparison is less than 0-05 is indicated by an open horizontal bar. Thus, 710 Connolly and Wachendorf-Multisite Dynamic Models of Mixed Species Plant Communities there is a significant cultivar effect (Fig. 2A) for SCC between about 30 and 80 %. Another feature of interest in this diagram is the significance of the increase in ACC with increasing SCC for both AberHerald and Huia. As there is a quadratic effect of SCC in the model the slopes of these lines are not constant, even on the logit scale. The increase is tested for significance by testing the difference between predictions made for a cultivar at a low (20 %) and high (70 %) value of SCC. The results are shown in Fig. 2A as 'P-values for slopes' and for both cultivars the P-values are very small. The significant SCC x C interaction in the model shows that the lines do not have the same slope; they are neither coincident nor parallel. The range of SCC values chosen in Fig. 2A lies well within the range observed in the data, as does the range of predicted values, and these two criteria have been used in all graphical displays of models in Wachendorf et al. (2001b). Presentationof the SCC x T interaction (variable x variable). A low and high value of one of the variables is chosen (e.g. mean T + s.d.) and the model is used to predict ACC values for a range of SCC values at both levels of T. These series are then plotted against the levels of SCC to show the interaction (Fig. 2B), which can then be interpreted directly. If either the response or independent variables are transformed prior to analysis the predictions are made on the transformed scales but the predicted response and the independent variable may be backtransformed as above before plotting the graph. The salient features of the graph are displayed as in the case of the SCC x C interaction. The temperature effect was significant only for low SCC levels and the response to SCC was significant for both low and high temperatures. The significant interaction in the model means that the two response lines in Fig. 2B do not have the same slope. Other model terms and issues. In Model 2, there are no independent variables or factors that are not involved in an interaction, but in several of the other models this does occur (e.g. mean daily radiation in Model C 1 and clover cultivar in Model D of Wachendorf et al., 2001b). For such a variable, predictions of the response and a 95 % confidence interval (allowing for the random effects in the model) are made on the transformed scale for a range of values of the independent variable. These are backtransformed to the scale desired for presentation and form the basis of a graph of predicted response and confidence limits against the independent variable. To explore the full interpretation of the effect of such a variable in the model, predictions of the effect of the variable may be made at values of the remaining independent variables other than their means. When the factor clover cultivar (C) is not involved in an interaction, predicted means are derived for AberHerald and Huia together with a standard error of difference between means (or other inferential information such a P-value for the test of significance or a confidence interval) and the means are presented on the preferred scale. For factor x factor interactions [not in the models of Wachendorf et al. (2001b), but included here for completeness] a table of predicted means is produced for all possible combinations of the two factors accompanied by appropriate measures of variability or significance of differences. In Model A (Table 2), both interactions involve the same variate SCC. In this situation the slopes of the responses to SCC in Fig. 2A and B are average values. In Fig. 2A they are the means of the slopes at 13.7 and 16.7 C respectively for each cultivar. In Fig. 2B they are the means of the slopes for the two cultivars at 13.7 and 16.7 C, respectively. It would be desirable to examine the four separate slopes were the patterns of the average slopes very different for the two panels in Fig. 2A and B, but this is not necessary here. DISCUSSION The modelling strategy was successful in developing a series of biologically meaningful linked models, which included the effects of plot and site independent variables and the effect of clover cultivar. The models give insight into the annual development of the clover/ryegrass community across a wide range of environmental, in this case climatic, conditions. The use of functional periods greatly reduced the complexity of the modelling and fitted well with a biological view of the annual life history of the mixed plant species community. The use of plot variables representing initial conditions in the community at the start of each functional period was very successful and led to models that provided biological insights. The inclusion of site level climatic variables to account for differences in community behaviour across sites and years broadened the basis of inference of the models and helped to quantify the impact of forces that, though important, are uncontrolled. Other variables describing site management or other site characteristics could readily be incorporated within the same modelling framework. The role of plot independent variables characterizing the initial state of the system is taken largely to integrate the effects of all plot history including past climate at the site. The site variables help to describe the effects of changes in the environment on the system from that initial state forward. This is, of course, too simplistic, but at the level of definition achievable in this study it is felt to be a useful approximation. For functional periods of very short duration, processes such as tiller production, which take a certain minimum time to complete, could well be partly influenced by environmental conditions at the end of the previous period. Autumn is a period when this is likely to occur. Although the minimum duration of autumn is 13 d, its mean duration in this study is about 35 d (Table 1), perhaps sufficient for the effect of the previous period to be relatively small. The interpretation of effects in this series of models should take potential confounding into account. Some plot independent variables may contain effects of treatment (clover cultivar) and this may result in some confounding of treatment effect with effects of independent variables. In model A, for example, the coefficient(s) for initial clover content shows the effect of variation in initial clover content across plots with the same clover cultivar, i.e. the within treatment Connolly and Wachendorf-Multisite Dynamic Models of Mixed Species Plant Communities 711 effect. The effect of treatment (clover cultivar) is the difference between species in their performance over the period assuming that they have a common level of initial clover content. Thus, any effect of treatment on the initial clover content is not taken into account in the treatment effect for the period, it only represents that portion of treatment difference that arose during the period. The use of initial state variables as covariates and then modelling these in turn may also be subject to the charge that the relationship between the performance of a species over successive periods arises partly from intrinsic differences among the plots that affect both variables systematically rather than reflecting a causal association between them. With these caveats in mind it is emphasized that the models derived from this process are not claimed to be explanatory. However, since many of the relationships are very strong and biologically plausible, causality should be entertained as accounting for at least part of the observed relationships, particularly since the climatic variables represent forces that would be universally agreed to be important determinants of biological responses. Normally, these variables are either excluded from use as field experimental factors through reasons of cost or impracticality, or generally ineffectually dealt with through field experiments at a single site lasting even 2 or 3 years. Establishing insight into their importance was a real strength of the modelling. Verification of the extent of causality in the relationships requires controlled experimentation, some of which will be under laboratory rather than field conditions. Although the dynamic models cover the complete annual cycle they should not be used to predict long-term effects in the system as they do not include any regulatory mechanism to prevent elements of the population (e.g. leaf area index and tiller density) from unbounded growth. The use of initial biotic conditions in modelling growth in mixed species communities is not widespread, with much of the literature using the initial densities of species (Suehiro and Ogawa, 1980; Spitters, 1983; Firbank and Watkinson, 1985; Connolly et al., 1990; Menchaca and Connolly, 1990; Gibson et al., 1999). Connolly and Wayne (1996) used initial seedling biomass of two species as independent variables in a model of growth of two competing weed species immediately following establishment. The ratio of species leaf area indices early in the growth of a crop-weed community was used in Kropf and Spitters (1991) in modelling crop yield loss from weed competition. Connolly et al. (2001) and Gibson et al. (1999) stressed the need to allow for initial biotic conditions rather than species densities in modelling competitive effects in community development to avoid the possibility of bias due to differences in the initial size of the competing species. Indeed, for perennial species such as white clover and perennial ryegrass, density may not be a meaningful measure once they have spread by clonal reproduction, whereas their contribution to swards can be determined by sampling at any stage. Although the design of this experiment does not fall within the usual range of design for the study of interspecific interaction (Gibson et al., 1999; Connolly et al., 2001), the variability among plots in the development of the clover and ryegrass allowed regression analysis on initial plot conditions, from which some information on interspecific effects was estimable, particularly during the winter period when more detailed information was available. However, for a full competition experiment it would be desirable to establish plots differing widely in composition. When an independent variable is measured imprecisely the estimator of the regression coefficient will be biased (Carroll et al., 1995), tending, in models like these, to be somewhat reduced in absolute size. Morphological variables such as leaf area index, which are based on subsamples within plots, may not measure the true plot leaf area index precisely. If this is of concern, preliminary detailed analysis of the variation among subsamples compared with variation among plots will provide a basis for deciding whether the variable is measured sufficiently well to lead to negligible bias or to correct for it. As data from subsamples were not available in the current study this issue was not examined. Presenting complex models is a necessary part of modelling. This paper has outlined several aspects of a graphical/tabular approach to this which allows the user to engage directly with the biological issues raised by the model and bypasses the difficulties of attempting to understand a complex equation. These methods have been extremely successful in communicating the results of seven rather complex models to biologists. CONCLUSIONS The modelling strategy was successful in developing a series of multisite, biologically meaningful, linked dynamic models of white clover/ryegrass communities as affected by clover cultivar and climatic independent variables. The models gave insight into the annual development of the clover/ryegrass community across a wide range of environmental conditions. The statistical issues were handled within a linear mixed models framework. The methods of presentation allowed a relatively simple appreciation of the behaviour of community dynamics across four linked functional periods. ACKNOWLEDGEMENTS Thanks are due to Eric Macklin and Michael O'Kelly for helpful discussions on some of the issues in this paper. The helpful comments of Adrian Dunne, David Elston, David Kemp and David Williams have greatly improved this work. We acknowledge the financial contribution of the Commission of the European Community for support for the COST (Cooperative Organisation of Science and Technology) 814 Action. LITERATURE CITED Carroll RJ, Ruppert D, Stefanski LA. 1995. Measurement error in nonlinear models. London: Chapman & Hall. Connolly J, Wayne P. 1996. Asymmetric competition between plant species. Oecologia 108: 311-320. 712 Connolly and Wachendorf-Multisite Dynamic Models of Mixed Species Plant Communities Connolly J, Wayne P, Bazzaz FA. 2001. Interspecific competition in plants: how well do current methods answer fundamental questions? The American Naturalist 157: 107 125. Connolly J, Wayne P, Murray R. 1990. Dynamic interactions of Stellaria media (L.)Vill. and Poa annua. Oecologia 82: 513-526. Cox DR, Snell EJ. 1989. The analysis of binaryl data, 2nd edn. London: Chapman &Hall. Curtis PS, Wang XZ. 1998. A meta-analysis of elevated CO2 effects on woody plant mass, form, and physiology. Oecologia 113: 299 313. Draper NR, Smith H. 1998. Applied regression analysis, 3rd edn. New York: John Wiley & Sons. Firbank LG, Watkinson AR. 1985. On the analysis of competition within two-species mixtures of plants. JournaloJ Applied Ecology 22: 503-517. France H, Thornley JHM. 1984. Mathematical models in agriculture. London: Butterworths. Gibson DJ, Connolly J, Hartnett DC, Weidenhamer JD. 1999. Essay review: designs for greenhouse studies of interactions between plants. Journal of' Ecology 87: 1-16. Kropf MJ, Spitters CJT. 1991. A simple model of crop loss by weed competition from early observations on relative leaf area of the weeds. Weed Research 31: 97-105. Menchaca LB, Connolly J. 1990. Species interference in white cloverryegrass mixtures. Journal of Ecology 78: 223-232. Nelder JA. 1994. The statistics of linear models: back to the basics. Statistics and Computing 4: 221-234. Nelder JA, Lane PW. 1995. The computer analysis of factorial experiments. In memoriam Frank Yates. The American Statistician 49: 382-385. Nolan T, Connolly J, Wachendorf M. 2001. Mixed grazing and climatic determinants of white clover (Trifolium repens L.) content in a permanent pasture. Annals of Botany 88. Poorter H, Nagel 0. 2000. The role of biomass allocation in the growth response of plants to different levels of light, CO, nutrients and water: a quantitative review. Australian Journal of Plant Physiology 27: 595-607. Reader RJ, Wilson SD, Belcher JW, Wisheu I, Keddy PA, Tilman D, Morris EC, Grace JB, Mcgraw JB, Olff H, Turkington R, Klein E, Leung Y, Shipley B, Vanhulst R, Johansson ME, Nilsson C, Gurevitch J, Grigulis K, Beisner BE. 1994. Plant competition in relation to neighbor biomass an intercontinental study with Poa pratensis. Ecology 75: 1753 1760. SAS Institute Inc. 1992. SAS* Technical report P-229, SAS/STAT. Software: Changes and enhancements, Release 6.07. Cary, NC, USA: SAS Institute Inc. Searle SR, Casella G, McCulloch CE. 1992. Variance components. New York: John Wiley & Sons, Inc. Spitters CJT. 1983. An alternative approach to the analysis of mixed cropping experiments. Netherlands Journalof Agricultural Science 31: 1-11. Sokal RR, Rohlf FJ. 1995. Biometry. 3rd edn. New York: W.H. Freeman and Co. Suehiro K, Ogawa H. 1980. Competition between two annual herbs, Atriplex gmelini C. A. Mey and Chenopodium album L., in mixed cultures irrigated with seawater of various concentrations. Oecologia 45: 167 177. Talbot M, Verdooren LR. 1996. Meta-analysis and crop variety trials. In: Connolly J, Williams D, eds. Proceedings of'EU HARMA VII Concerted Action Workshop. Dublin: University College Dublin, 1-6. Verbeke G, Molenberghs G. 1997. Linear mixed models in practice: A SAS-oriented approach. New York: Springer-Verlag. Wachendorf M, Collins RP, Connolly J, Elgersma A, Fothergill M, Frankow-Lindberg BE, Ghesquiere A, Guckert A, Guinchard MP, Helgadottir A, Liischer A, Nolan T, Nykanen-Kurki P, N6sberger J, Parente G, Puzio S, Rhodes I, Robin C, Ryan A, Staheli B, Stoffel S, Taube F. 2001a. Overwintering of Trifolium repens L. and succeeding growth: Results from a common protocol carried out at twelve European sites. Annals of Botany 88: 669-682. Wachendorf M, Collins RP, Connolly J, Elgersma A, Fothergill M, Frankow-Lindberg BE, Ghesquiere A, Guckert A, Guinchard MP, Helgadottir A, Luischer A, Nolan T, Nykiinen-Kurki P, N6sberger J, Parente G, Puzio S, Rhodes I, Robin C, Ryan A, Stiheli B, Stoffel S, Taube F. 2001b. Overwintering and growing season dynamics of Trifolium repens L. in mixture with Lolium perenne L. A model approach to plant-environment interactions. Annals of Botany 88: 683-702. Wright AJ. 1981. The analysis of yield-density relationships in binary mixtures using inverse polynomials. Journal of Agricultural Science 96: 561 567. Xiong SJ, Nilsson C. 1999. The effects of plant litter on vegetation: a meta-analysis. Journal of Ecology 87: 984--994.
© Copyright 2026 Paperzz