Applications of Individual Growth Curve Modeling for Pediatric Psychology Research Christian DeLucia,1PHD, and Steven C. Pitts,2 PHD 1 University of Illinois at Chicago and 2University of Maryland, Baltimore County Objective To provide a brief, nontechnical introduction to individual growth curve modeling for the analysis of longitudinal data. Several applications of individual growth curve modeling for pediatric psychology research are discussed. Methods To illustrate these applications, we analyze data from an ongoing pediatric psychology study of the possible impact of spina bifida on child and family development (N = 135). Three repeated observations, spaced by approximately 2 years, contributed to the analyses (M age at baseline = 8.84). Results Results indicated that individual linear growth curves of emotional autonomy varied as a function of the youth gender by spina bifida group membership interaction. Conclusions Strengths of individual growth curve modeling relative to more traditional methods of analysis are highlighted (e.g., completely flexible specification of the time variable, explicit modeling of both aggregate-level and individual-level growth curves). Key words Growth curves; trajectories; longitudinal modeling; statistical methods. The purpose of this article is to provide a brief, nontechnical introduction to individual growth curve modeling for the analysis of longitudinal pediatric psychology data. As will be discussed, individual growth curve models are extremely flexible and offer pediatric psychology researchers several advantages over traditional methods for analyzing longitudinal data, such as the repeated measures analysis of variance model (ANOVA). Individual growth curve modeling (e.g., Rogosa, Brandt, & Zimowski, 1982; Willett, 1997) is one of several names used in referring to this general class of statistical techniques. Other names include random regression models (Gibbons et al., 1993; Laird & Ware, 1982), hierarchical linear models (Bryk & Raudenbush, 1987), multilevel models (Snijders & Bosker, 1999), and mixed linear models (Littell, Milliken, Stroup, & Wolfinger, 1996). Although these various models are functionally equivalent for longitudinal data analysis, nomenclature varies mainly as a function of which model characteristic is being highlighted. (We discuss these issues in more detail below.) What are Individual Growth Curve Models? Generally speaking, individual growth curve models allow researchers to measure change over time in a phenomenon of interest (e.g., response to treatment) at both the aggregate (i.e., population) and individual (i.e., study participant) levels. Historically, researchers have been more likely to model change as an “incremental” process (e.g., through the use of change score analysis from two-wave designs, Willett, 1988). This may have arisen due to misconceptions regarding how to adequately model change as much as due to software limitations (Gibbons et al., 1993; Rogosa, 1988; Willett, 1988). In the past 20 years, statistical procedures that model change in a completely flexible manner have been developed (Bryk & Raudenbush, 1987; Gibbons et al., 1993; Hedeker & Gibbons, 1996a; Laird & Ware, 1982; Rogosa et al., 1982; Willett, 1988). The individual growth curve model is an outgrowth of this tradition and, relative to traditional analytical techniques, represents a monumental leap in the analysis of change for several All correspondence concerning this article should be addressed to Christian DeLucia, PhD, Center for Treatment Research on Adolescent Drug Abuse, Department of Epidemiology and Public Health, University of Miami Miller School of Medicine, 1400 NW 10th Avenve Suite 1107B, Miami, Florida 33136. E-mail: [email protected]. Journal of Pediatric Psychology 31(10) pp. 1002–1023, 2006 doi:10.1093/jpepsy/jsj074 Advance Access publication September 21, 2005 Journal of Pediatric Psychology vol. 31 no. 10 © The Author 2005. Published by Oxford University Press on behalf of the Society of Pediatric Psychology. All rights reserved. For permissions, please e-mail: [email protected] Individual Growth Curve Modeling 1003 reasons: (a) change can be modeled in an extremely flexible and realistic manner; (b) in addition to aggregate-level growth curves, individual-level growth curves are explicitly modeled as well; and (c) the approach allows researchers to explore rich hypotheses, thereby helping to close the gap between conceptual and theoretical and statistical models. A Hypothetical Basic Research Application: Examining Treatment Adherence To assist with understanding individual growth curve modeling at a conceptual level, we consider a hypothetical example in which a researcher is interested in studying the natural history of treatment adherence in a sample of youth with cystic fibrosis. By repeatedly interviewing a sample of such youth over a 5-year period, researchers could estimate developmental growth curves of adherence for each youth. Figure 1 presents hypothetical treatment adherence data for four individuals with cystic fibrosis. On the y or vertical axis is an adherence scale ranging from 0 to 10 (with higher scores indicating higher levels of adherence). On the x or horizontal axis is a “time” variable representing participant age, ranging from 10 to 14. There are five estimated growth curves displayed in the figure. The solid lines represent estimated growth curves for each individual; each curve has its own intercept (defined as score on adherence at age 10, ranging from 0 to 4) and slope (defined as rate of change in adherence per one year of age, ranging from .7 to 1.75). The dashed line is the estimated population, 10 9 8 Adherence 7 6 5 β10 = 1.19 4 3 β00 =2.13 b11 = 1.75 2 or average, growth curve. Parameter estimates are presented for the population (i.e., the dashed line with β00 and β10 for intercept and slope, respectively) and for one participant (i.e., individual 1 with b01 and b11 for intercept and slope, respectively). Our hypothetical data allow us to illustrate several important features of individual growth curve models. First, the population, or average, growth curve carries aggregate-level information. As such, we know that the average 10-year-old with cystic fibrosis scored a 2.13 on treatment adherence (i.e., the estimate of the population intercept) and gained 1.19 units of treatment adherence per year through age 15 (i.e., the estimate of the population slope). Second, individuals are allowed to deviate or vary from this population growth curve. In Fig. 1, all four individuals vary from the population intercept (i.e., have adherence scores other than 2.13 at age 10) and slope (i.e., have slopes other than 1.19). As we illustrate below, if individual variability in the growth curve parameters is present (as depicted in Fig. 1), this variability might be predicted from theoretically meaningful variables (e.g., parental monitoring). While considering Fig. 1, we turn to a presentation of the unconditional linear growth model. The Unconditional Linear Growth Model A straightforward way to conceptualize growth curve models is as two levels of analysis (Bryk & Raudenbush, 1987; Singer & Willett, 2003). The level 1 model is commonly referred to as the within-person or intra-individual change model. As we discuss in detail below, these names highlight the central feature of the level 1 model—it captures person-specific (i.e., individual) growth rates. Time-varying predictor variables (e.g., age, time elapsed since treatment, level of symptomatology) can be included in the level 1 model. The level 2 model is commonly referred to as the between-person or interindividual change model. These names highlight the central feature of the level 2 model—it captures between-person variability in the growth rates. Timeinvariant predictors (e.g., gender, ethnicity, level of symptomatology at a given point in time—say at the baseline assessment) can be included in the level 2 model. A linear growth model with no additional time varying or time invariant predictors appears as: 1 b01 =0.00 Level 1 model: Yij = b0i + b1i (time ij ) + e ij . 0 10 11 12 13 14 Age Figure 1. Hypothetical growth curves of treatment adherence based on individual growth curve model. Level 2 model: b0i = b00 + n 0i . b1i = b10 + n1i . 1004 DeLucia and Pitts Although it is useful to conceptualize the model in this “multilevel” framework, in practice a single integrated model is estimated. This integrated model can be formed by substituting the level 2 model into the level 1 model, resulting in Yij = (β00 +ν0i) + [(β10 +ν1i) (timeij)] + eij. Singer and Willett (2003) provide extensive discussion of this multilevel conceptualization and the resulting integrated model in chapters three and four. The Level 1 Model The level 1 model resembles an ordinary least squares (OLS) regression model (OLS regression is hereafter referred to as basic regression). An outcome (i.e., Yij, where the subscripts “i” and “j” denote person and measurement occasion, respectively) is written as a function of an intercept (i.e., b0i), plus the multiplication of a slope parameter (i.e., b1i) by a predictor variable (i.e., timeij) and a residual (i.e., eij). In this model, the two regression parameters representing the intercept and slope (i.e., b0i and b1i) carry the person-level subscript “i.” As such, these parameters are allowed to vary across individuals (i.e., can take on different values for different individuals). This feature of the model represents a marked departure from the basic regression model, in which parameters are assumed to be fixed for all individuals in the sample (or for all individuals belonging to a particular group, in the event that the regression lines are nonparallel across groups).1 Although we have included only a single predictor in our level 1 model (i.e., timeij), additional time-varying predictors can be included in the level 1 model (e.g., level of symptomatology), thus further reflecting the flexibility of the procedure. For a discussion of the inclusion and interpretation of time-varying predictors, see Singer and Willett (2003, chap. 5). The level 1 model can be expanded to include curvilinear growth forms as well (e.g., quadratic, cubic). For example, to examine a quadratic growth form (i.e., a curve characterized by one bend), the level 1 model could be rewritten as follows: Yij = b0i + b1i (timeij) + b2i (timeij)2 + eij. In this equation, b2i carries information about the quadratic effect. The Level 2 Model In the level 2 model, it is conceptually helpful to consider the individual parameter estimates from the level 1 model (i.e., b0i and b1i) being treated as outcomes. For example, intercept of person “i” (i.e., b0i) is written as a 1 The level 1 residuals (i.e., eij) are assumed to be normally distributed with a mean of zero and common variance σe and are conditionally independent, given the random effects (Hedeker & Gibbons, 1996b). function of a population intercept (i.e., β00) plus his or her deviation from the population intercept (i.e., ν0i). Similarly, slope of person “i” (i.e., b1i) is written as a function of a population slope (i.e., β10) plus his or her deviation from the population slope (i.e., ν1i). Fixed Effects In the level 2 model, the population-level estimates (i.e., β00 and β01) are referred to as the “fixed” effects. Similar to the basic regression model, these effects are assumed fixed (i.e., constant) for all individuals in the sample. Random Effects The individual deviations (i.e., ν0i and ν1i), which can be thought of as the level 2 residuals, are referred to as the random effects.2 Indeed, it is the estimation of these deviations (of individuals from the population curve) that puts the “individual” in individual growth curve modeling. The term “random” here gives rise to another name for the procedure-random regression models. Moreover, the mixture of fixed and random effects in a single model gives rise to other names for the procedure, for example, mixed regression models, mixed linear models. Including Predictors in the Level 2 Model To the extent that individual growth curves vary from the population estimates, the variability associated with the random effects (i.e., ν0i and ν01) will be deemed statistically significant (we discuss this in more detail below). Indeed, an early task in individual growth curve modeling is establishing whether variability in the growth curves is present. Upon establishing this significant variability, the level 2 model can be expanded to include time-invariant predictors of variability in the individual growth curves. For example, in expanding our earlier hypothetical example, we might hypothesize that baseline level of parental monitoring of adherence behaviors is associated with both intercept and slope variability, such that youth exposed to higher levels of baseline parental monitoring will exhibit higher levels of adherence at age 10 (i.e., higher intercepts) and, to a lesser degree, differing growth rates of adherence (i.e., differing slopes). We could test this hypothesis empirically by expanding the level 2 model to include baseline parental monitoring as a predictor of both intercept and slope variability. The new conceptual equations appear as: Level 1: Yij = b0i + b1i (time ij ) + e ij . 2 In this model, the random effects are assumed to be bivariate normally distributed with means of zero and population variance σν02 for the intercept, population variance σν12 for the slope, and population covariance σν01. Individual Growth Curve Modeling 1005 Level 2: b0i = b00 + b01 ( baseline parent 8 7 b1i = b10 + b11 ( baseline parent 6 monitoring i ) + n1i . 5 In the above level 2 model, we are stating that each individual’s intercept (b0i) and slope (b1i) estimates are a function of three components: (a) the population estimate (β00 for intercept and β10 for slope), (b) his or her score on baseline parental monitoring, and (c) an individual deviation (ν0i for intercept and v1i for slope). An interesting observation that follows from the level 2 model for slope variability is that the predictors added in this model form interactions with the time variable from the level 1 model (this is more apparent when the level 2 model is substituted into the level 1 model). As such, in our present example, the parental monitoring effect on slope variability, captured by β11, is actually a test of the (baseline parental monitoringi × timeij) interaction or the extent to which the linear effect of time varies as a function of baseline parental monitoring. (As shown in the Appendix, SAS syntax to request similar effects involves explicit specification of the predictor by time interaction.) A Hypothetical Applied Research Application: Participant Response to an Intervention In Figs 2 and 3, we depict two possible applications of individual growth curve models for intervention researchers. These data are based on a hypothetical 4-week inter8 Treatment = solid lines Control = dashed lines 7 6 Adherence 5 4 3 2 1 0 0 1 Measurement occasion Figure 2. Testing main effects of treatment using individual growth curve models. 2 Adherence monitoring i ) + n 0i . 4 3 2 1 0 0 1 2 Measurement occasion Figure 3. Testing moderated treatment effects using individual growth curve models. vention to increase treatment adherence among youth with cystic fibrosis. For illustrative purposes, suppose some affected youth were randomized to receive a familybased intervention to cultivate and maintain adherence behaviors, while the remaining youth were randomized to a no-treatment assessment (i.e., control) condition. Testing Main Effects of Treatment In Fig. 2, we present data that can help answer the question: “Does the intervention improve adherence behaviors?” The y axis contains a treatment adherence scale (with higher scores indicating higher levels of adherence), whereas the x axis contains the time variable (i.e., a measurement occasion scale: 0, “baseline”; 1, “four weeks postbaseline at intervention completion”; and 2, “eight weeks post-baseline at one-month follow-up”). All of the lines in the figure represent estimates of individual growth curves. The treated individuals, represented by solid lines, show greater rates of improvement than do the control participants. These data are conceptually consistent with a main effect of treatment on adherence behaviors. In the individual growth curve-modeling framework, the intervention effect could be tested by including a dichotomous treatment predictor variable (0, “control”; 1, “treatment”) in the level 2 model of slope variability. Given Fig. 2, we would expect the coefficient for this predictor to be positive, suggesting that on average, treated participants are growing more rapidly in adherence than are control participants. Testing Moderated Treatment Effects In Fig. 3, we present data that can help answer the question: ‘For whom does the intervention work best?’ Fig. 3 1006 DeLucia and Pitts Testing Mediated Treatment Effects Individual growth curve modeling can also be used to help pediatric psychology researchers examine treatment-process questions. The question here becomes ‘How does the intervention work?’ Directly on the outcome? Indirectly through an intermediate variable? In psychology, such process questions are typically discussed in the context of mediation analysis (for examples, see Baron & Kenny, 1986; Holmbeck, 1997, 2002; MacKinnon, 1994; Mackinnon, Lockwood, Hoffman, West & Sheets, 2002; West & Aiken, 1997). For example, an intervention designed to improve treatment adherence among a sample of youth with cystic fibrosis, might attempt to improve youth adherence, at least in part, by improving parental monitoring of adherence. If youth were randomly assigned to a family-based intervention condition or to a no-intervention control condition and multiple waves of data were collected on both parental monitoring (i.e., the mediator) and youth adherence (i.e., the outcome), individual growth curve modeling could be used to explore mediational hypotheses. Although the specification of such models is more advanced and beyond the scope of this article, readers are referred to relevant illustrations by MacKinnon and colleagues (e.g., Choeng, MacKinnon, & Khoo, 2003; Krull & MacKinnon, 2001), and by McArdle and colleagues (Ferrer & McArdle, 2004; McArdle et al., 2004). Advantages of Individual Growth Curve Modeling Relative to traditional techniques for the analysis of longitudinal data (e.g., ANOVA), individual growth curve modeling offers many advantages including analysis at the individual level, complete flexibility in the treatment of the time variable, responses to missing data, ability to handle clustered data, and generalizations to nonnormally distributed outcomes. Data are Truly Modeled at the Individual Level First, as depicted in the above illustrations, data are truly modeled at the individual level, allowing for the examination of individual variability in intercepts and rates of change in the phenomenon under investigation (Bryk & Raudenbush, 1992; Gibbons et al., 1993; Willett, 1997). Generally speaking, the ANOVA approach focuses on growth curves at the aggregate level. Although the ANOVA model allows for individual variability in intercepts, individual variability in rates of change is not explicitly modeled (Gibbons et al., 1993). Figure 4 shows the same hypothetical data presented in Fig. 1 based on the ANOVA. The primary difference between Figs 1 and 4 is that in the latter figure all of the lines have the same slope (i.e., the rate of linear change in treatment adherence is assumed to be equal for all individuals). Given that a basic tenet of developmental theory is that individuals vary in their rates of development over time, eliminating this variability will often fail to capture the richness of the data. Intercept and slope 10 9 8 7 Adherence takes on the same general form as Fig. 2 with one primary exception. At baseline, we depict two general “clusters” of individuals—youth who are reporting fairly low levels of adherence behaviors and youth who are reporting fairly high levels of adherence behaviors (irrespective of treatment condition). For individuals who report fairly low levels of adherence behaviors at baseline, there appears to be an intervention effect on slope variability; treated youth are growing more rapidly in adherence behaviors than are control youth. There appears to be no intervention effect, however, for youth who report higher levels of adherence at baseline (i.e., treated and control participants are growing minimally and at about the same rate in their adherence behaviors). In the context of individual growth curve modeling, we would model this intervention by baseline interaction by including three predictors in our level 2 model of slope variability: (a) treatment, (b) baseline level of adherence, and (c) the treatment by baseline-adherence interaction. Given Fig. 3, we would expect a significant treatment by baseline-adherence interaction. We would then probe the interaction following guidelines for basic (fixed effects) regression suggested by Aiken and West (1991). 6 5 β10 = 1.19 4 β00 = 2.13 3 b11 = 1.19 2 b01 = 0 1 0 10 11 12 13 14 Age Figure 4. Hypothetical growth curves of treatment adherence based on analysis of variance (ANOVA) model. Individual Growth Curve Modeling 1007 estimates are presented again for participant 1. The estimated intercept remains the same (0.00), whereas the slope estimate has decreased from Fig. 1 by 0.56 units (a 32% reduction) to 1.19 (the population estimate). Flexible Treatment of the Time Variable A second strength of individual growth curve modeling relative to the ANOVA approach is the complete flexibility with which the time variable can be treated (Bryk & Raudenbush, 1992; Gibbons et al., 1993). For example, time can be treated continuously (e.g., representing chronological age or time since entry into the study) rather than as a factor with a relatively finite number of levels. Also, individuals can be observed at different time points (as opposed to assuming that all individuals were measured on the same occasions). (We discuss some very important implications of this added flexibility of the growth curve modeling approach below.) Easy Handling of Missing Data Third, in individual growth curve models, participants with missing data at one or more time points can be retained in the analysis (if the data are missing by design, completely at random, or are ignorable—i.e., can be modeled as a function of observed covariates; see Gibbons et al., 1993). (Missing data might also be able to be handled through a “pattern-mixture” approach, even if missingness is nonignorable, see Hedeker & Gibbons, 1997.) Although ANOVA models can handle cases with partial missing data, researchers often enforce listwise deletion (i.e., a participant is dropped from the analysis if she/he is missing any of the repeated observations) (Bryk & Raudenbush, 1992; Gibbons et al., 1993). Models can Easily Incorporate Three Levels of Data Nesting or Clustering Fourth, because the individual growth curve model is a specific form of a multilevel model—in which the repeated observations are “nested” or “clustered” within individuals—individual growth curve models can be easily generalized to include additional levels of nesting (e.g., individuals nested within broader structures, such as classrooms, clinics, hospitals, or group homes) (Bryk & Raudenbush, 1992; Singer, 1998). As mentioned above, an additional commonly used name for individual growth curve modeling is hierarchical linear modeling or HLM. This naming convention emphasizes the multilevel or hierarchical structure of the model (Bryk & Raudenbush, 1992). Bryk and colleagues have also developed software bearing the same name, which is commonly used to analyze these models (Bryk, Raudenbush, & Congdon, 1996). Generalizations to Nonnormal Data Exist Fifth, individual growth curve models can be easily generalized for use with nonnormal data (e.g., Hedeker & Gibbons, 1996a; Hedeker, 1999). As such, researchers could model growth in count, binary, ordered categorical, or nominal categorical data. This might be particularly useful for researchers interested in modeling growth over time in repeated observations of outcomes such as counts of pediatric admissions to hospitals over the past several years, remission status in pediatric cancer patients in the past 24 months, and monthly changes in symptom severity—coded as mild, moderate, and severe—among pediatric anxiety patients.3 Real-World Example: Modeling the Development of Emotional Autonomy in Youth with Spina Bifida and Able-Bodied Comparison Youth Next, we work through a series of relatively basic individual growth curve models to tie some of the concepts we have introduced above to actual longitudinal data in the area of pediatric psychology. These data are from an ongoing study—conducted by Holmbeck and associates—of the possible impact of spina bifida on child and family development (see Coakley, Holmbeck, Friedman, Greenley, & Thill, 2002; Holmbeck et al., 2003; Holmbeck, Coakley, Hommeyer, Shapera, & Westhoven, 2002; Hommeyer, Holmbeck, Wills, & Coers, 1999). Spina bifida is a relatively common birth defect, which occurs in approximately 1 in 1000 live births in the United States (Holmbeck et al., 2003). In children with spina bifida, the spinal cord fails to fully develop, resulting in exposure of a portion of the cord at birth (Holmbeck et al., 2003). In addition to common physical problems (e.g., sensory loss, bladder control problems), children with spina bifida are at increased risk (relative to able-bodied peers) for psychosocial problems as well (e.g., social immaturity, attention, and concentration problems) (Holmbeck et al., 2003). In this 3 On a slightly more technical note, another strength of individual growth-curve modeling is that it allows the variance/covariance matrix of the repeated observations to follow a general form. The ANOVA model assumes this matrix is compound symmetric— that is, all variances are equal and all covariances are equal— which is typically unrealistic, given that variances often change over time, and covariances for more proximally spaced repeated observations are often larger in magnitude than are covariances for more distally spaced repeated observations (Gibbons et al., 1993; Rogosa, 1988). 1008 DeLucia and Pitts study, we examined growth over time in emotional autonomy from mothers (Steinberg & Silverberg, 1987) as a function of both spina bifida group membership and child gender. Methods Sample This sample is comprised of 67 children with spina bifida (n = 31 girls, 46.3% of spina bifida group) and 68 able-bodied comparison children (n = 31 girls, 45.6% of group). Child age at baseline (i.e., T1) was similar for children with spina bifida (M = 8.98, SD = .61) and ablebodied comparison children (M = 8.72, SD = .50). Child ethnicity was also similar for children with spina bifida, n = 12 non-Caucasian children (18%), and comparison children, n = 6 non-Caucasian children (9%). Participant Recruitment Children with spina bifida were recruited from four sources—a children’s hospital, a children’s hospital for youth with physical disabilities, a university-based medical center, and a statewide spina bifida association. Out of a possible participant pool of 310 children, 70 families were successfully recruited into the study. Other families were excluded for various reasons (e.g., distance from research base, failure to reach, refusal). Able-bodied comparison children were recruited from the schools attended by the children with spina bifida. Out of roughly 1,700 mailed recruitment letters, 72 families agreed to participate in the study. The intensive nature of the longitudinal study—which was explained in detail in the recruitment letters—accounts for some of this low response rate. Sample sizes were reduced to 68 youth in the spina bifida and comparison groups to facilitate matching on key demographic factors (e.g., age, SES, ethnicity). (For additional information on sample recruitment procedures, see Holmbeck et al., 2003.) Procedure For the purposes of this study, data collected on three measurement occasions were utilized. On average, the measurement occasions were spaced by approximately 2 years. On all occasions, interviews were conducted in participants’ homes by trained undergraduate and graduate psychology students. Interviews lasted approximately 3 h, and families were paid $50 at T1, $75 at T2, and $100 at T3 for their time and effort. Although interviews were conducted with parents and children, only child data are discussed in this aeticle. Measures Age A continuous measure of child age in years was obtained on each measurement occasion using the child’s birth date and relevant interview date. This variable was used as the time variable in the growth curve models. Gender Child gender (0, “female”; 1, “male”) was the sole demographic variable used in these analyses. This variable was used as a predictor in the growth curve models. Group Spina bifida diagnostic status (0, “able-bodied comparison youth”; 1, “youth with spina bifida”) was ascertained at baseline. This grouping variable was used as a predictor in the growth curve models. Emotional Autonomy from Mothers (Steinberg & Silverberg, 1987) This measure captured the degree to which “childish” dependencies on mothers were relinquished by youth. Children were asked to rate how much they agreed with each of 14 statements (e.g., “My mother and I agree on everything,” “I go to my mother for help before trying to solve a problem myself,” “I try to have the same opinions as my mother”). Response options included 1 (strongly agree), 2 (agree somewhat), 3 (disagree somewhat), and 4 (strongly disagree). Relevant items were reverse coded such that higher scores indicate higher levels of emotional autonomy. The measure was administered at all three interviews. Reliability coefficients (based on Cronbach’s alpha) across the three time points were .63, .75, and .80, respectively. We estimated growth in emotional autonomy, as discussed below. Results In these analyses, we were interested in describing growth over time in emotional autonomy from mothers during a 4-year period of preadolescent and adolescent development (i.e., on average, ages 9–13). Three waves of longitudinal data contributed to the models described below. Descriptive Data Descriptive data for the three repeated observations of emotional autonomy and participant age are summarized in Table I. These data are presented by measurement occasion to give the reader a general feel for the data. It is helpful to keep in mind, however, that in the growth models described below, growth in emotional Individual Growth Curve Modeling 1009 Table I. Descriptive Statistics for Emotional Autonomy and Age Measurement occasion Time 1 (N = 133) Statistics Time 2 (N = 131) Time 3 (N = 129) Emotional autonomy M 2.24 2.38 SD 0.45 0.51 2.58 0.53 Skew –0.01 0.14 0.06 Kurtosis –0.52 0.06 0.47 Correlations Time 1 1 Time 2 0.33 1 Time 3 0.24 0.46 M 8.85 10.95 12.79 SD 0.57 0.60 0.74 Skew 0.33 0.28 0.11 –1.03 –0.82 –0.97 1 Participant age Kurtosis autonomy is modeled as a function of participant age, not measurement occasion. These descriptive data suggest that scores on the emotional autonomy measure—at least at the sample level—increased over the three measurement occasions. Skew and kurtosis values, which help assess the degree of univariate normality of the individual measures, suggest that the measures are normally distributed. Moreover, the correlations among these measures show a classic pattern of diminishing magnitude over time (i.e., a violation of compound symmetry). Figure 5 shows raw data trajectories for 20 individuals selected at random from the data set. The x axis contains the age variable, which ranges from 8 to 15; the Emotional autonomy 4 3 2 1 8 9 10 11 12 13 14 Age Figure 5. Plot of raw data of emotional autonomy for 20 youth selected at random. 15 y axis contains the emotional autonomy scale, which ranges from 1 to 4. Two important pieces of information can be gleaned from these data. First, there appears to be variability in individuals’ initial scores on emotional autonomy (typically measured between the ages of 8 and 9). Second, there appears to be variability in how individuals are changing over time in emotional autonomy. Some individuals are clearly increasing in their levels of emotional autonomy over time, some individuals are decreasing, and still others are showing a more stable pattern (i.e., not changing much at all). It is this variability (in starting points and rates of change) that will be explicitly modeled using individual growth curve models. Some of these individual trajectories appear to be quadratic in nature, suggesting perhaps that a curved line with a single bend might be a better representation of the data than would a straight line. As such, we will test for both linear and quadratic forms of growth—at least at the aggregate- or population-level. Fitting Individual Growth Curves of Emotional Autonomy from Mothers To assess change over time in the development of emotional autonomy from mothers, we estimated individual growth curve models using the Mixed Procedure in SAS statistical software. For a basic primer on the Mixed Procedure, see Singer (1998); a more comprehensive discussion of the procedure is presented by Littell et al. (1996). When estimating change in a phenomenon, it is of interpretational benefit to identify a meaningful metric of the time variable (i.e., participant age) (Biesanz, Deeb-Sossa, Papadakis, Bollen, & Curran, 2004). For example, in this study, if age was left in its original metric, the intercept would be interpreted as the level of emotional autonomy when the participant age was zero; not a substantively meaningful interpretation. We elected to scale age (the time variable) such that the zero point corresponded to a value of age 9 (i.e., subtracted 9 from all participants’ ages at each measurement). Thus, intercept estimates are interpreted as emotional autonomy at age 9; a value that corresponds to the average age of individuals at entry into the study. Before evaluating potential predictors of change in a phenomenon, it is important to ensure that the individual growth curve model is correctly specified. This includes identification of the correct form of growth (e.g., linear, quadratic), as well as correct specification of the variance of the individual growth estimates (e.g., intercept and slope). As we discuss in detail below, it is possible to statistically test these model estimates. If significant variability in the growth curves is present 1010 DeLucia and Pitts Table II. Model Summary Information Model name Level 1 models Level 2 models χ2 Δχ2(df) p Fixed intercept Yij = b0i + eij b0I = β00 595.8 — — Unconditional means Yij = b0i + eij b0I = β00 + ν0i 566.2 29.6(1) <.0001a Compound symmetry Yij = b0i + b1i (ageij – 9) + eij b0I = β00 + ν0i 509.7 56.5(1) <.0001 500.9 8.8(2)b .0061a 500.7 .2(1) .6547 b1I = β10 Unconditional linear Yij = b0i + b1i (ageij – 9) + eij Fixed quadratic Yij = b0i + b1i (ageij – 9) + b2i (ageij – 9)2 + eij b0I = β00 + ν0i b1I = β10 + ν1i b0i = β00 + ν0i b1I = β10 + ν1i b2I = β20 Full maximum likelihood estimation was used for all models. a Following the methods of Snijders and Bosker (1999, p. 90), halved probability values were used to determine the significance of the variance components. b Two dfs distinguish the unconditional linear and compound symmetry model (one for the random linear effect and one for the covariance between the random intercept and the random linear effect). Model 4 was retained as the final model based on relative model fit. (e.g., individuals vary in their intercepts and slopes), it might be possible to predict this variability from theoretically meaningful variables (e.g., spina bifida status). Correct model specification typically includes evaluating a series of models to determine which model results in the best relative fit to the data. Similar to other analytical techniques (e.g., logistic regression, path analysis), it is possible to directly compare two nested individual growth curve models. Two models are nested when one model (the more parsimonious) can be “created” from another model (the more complex) by not estimating one or more parameters. The relative fit of two nested models can be compared by evaluating the difference in −2 log likelihood (−2LL) between the models (similar to logistic regression). The df for the test will reflect the number of restrictions (e.g., nonestimated parameters between the models). As these change statistics follow a χ2 (chi-square) distribution, we hereafter refer to these tests as χ2 tests.4 Many increasingly complex hypotheses can be directly tested with individual growth curve models, thus some nested models might be estimated. Hypotheses to be tested in this example include Do individuals significantly differ from each other in their average levels of emotional autonomy? Do individuals display growth in emotional autonomy? Do all individuals grow at the same rates? and Is a linear progression an adequate representation of the average form of growth? To test these hypotheses, we estimated five models (described below). In practice, the number of hypotheses (and thus, nested 4 When full maximum likelihood estimation is used-as was the case in this study—these tests can be used to evaluate the elimination of either fixed or random effects. When restricted maximum likelihood estimation is used, these tests can be used to evaluate the elimination of random effects only (see Snijders & Bosker, 1999, p. 89). models) will vary as a function of the complexity of the researcher’s questions and study design. The approach we present is to estimate increasingly complex models (e.g., not estimating vs. estimating the variance of the slope estimates) and test the associated change in χ2 value resulting from the change in model specification. This procedure is analogous to including additional predictors in hierarchical multiple regression; the distinction is that we are focused on additional model estimates, rather than on additional predictors. The five models to be estimated are (a) fixed intercept, (b) unconditional means, (c) compound symmetry, (d) unconditional linear, and (e) fixed quadratic. The associated equations and summary information for each model are summarized in Table II.5 Model 1: Fixed Intercept This model serves as the baseline model. In this model, no growth parameters nor variance estimates in growth are specified. This model would fit the data only to the extent that all individuals reported similar (i.e., nonsignificantly different) levels of emotional autonomy and that these scores were stable over time. Model 2: Unconditional Means This model modifies Model 1 by allowing (estimating) individual level variance in the intercept scores. Model 1 is nested within, and thus can be compared with, this model. Model 2 assumes that each individual has similar scores at each of the time points, but allows these values 5 For all of the growth models described below, 391 of a possible maximum of 405 observations contributed to the analyses (96.54%). As such, missing data was quite minimal. The possible maximum number of observations is computed by multiplying the number of participants (135) by the maximum number of repeated observations per participant (3). Individual Growth Curve Modeling to vary across individuals (i.e., each person’s growth “curve” is a flat line at the height of their estimated average value of emotional autonomy over time). From our observations of Fig. 5 (i.e., individuals growing over time), it is not likely that this model adequately captures true growth in this sample. However, this model does provide a significant improvement in fit relative to Model 1, χ2(1) = 29.6, p < .0001. This supports the observation that individuals differ in their average level of emotional autonomy. On a more technical note, this model also allows the computation of the population autocorrelation (intraclass correlation) (Singer & Willett, 2003); a population based estimate of the average correlation of the repeated observations of emotional autonomy, which is .29 in this study.6 Model 3: Compound Symmetry This model modifies Model 2 through the inclusion of a linear slope parameter (i.e., age is included as a level 1 predictor). Relative to Model 2, this model will test for the presence of linear growth in the sample, though it is a fixed effect as variability in the slopes is not estimated (i.e., all participants are forced to have the same rate of growth). This model is analogous to the ANOVA model with two notable differences: (a) time is allowed to be a continuous (rather than discrete) measure and (b) only the linear component of growth is estimated. (Results of the ANOVA model are presented below.) Model 3 provides a significant improvement in fit relative to Model 2, χ2(1) = 56.5, p < .0001, suggesting that on average individuals are changing on emotional autonomy over time. Model 4: Unconditional Linear This model modifies Model 3 through estimation of two additional parameters; variance in the individual slopes and covariance between intercept and slope estimates. This model allows for the possibility that individuals grow at different rates (the slope variance estimate) and acknowledges that individuals’ initial standing may be related to their own amount of change (e.g., individuals initially high on emotional autonomy at age 9 might be expected to show less absolute growth than individuals initially low on autonomy). This model was an improvement over Model 3, χ2(2) = 8.8, p = 006, suggesting that individuals vary in their rates of linear change in emotional autonomy. Model 5: Fixed Quadratic This model modified Model 4 through inclusion of a fixed quadratic component of growth (variance of this parameter was not estimated). The purpose of this model was to test whether linear growth was an adequate representation of the form of change in emotional autonomy in this sample. This model did not provide a significant improvement in fit relative to Model 4 however, χ2(1) = 0.2, p = .66, suggesting that the form of growth is linear in nature at the average level. Based on the decisions made above, we elected to retain our fourth model, in which growth in emotional autonomy is specified as linear, and individuals are allowed to vary in their intercepts (level at age 9), slopes (rates of change over time), and the relationship between these two values (intercept-slope covariance). Considering the population-level estimates of the unconditional linear growth model, the intercept estimate is 2.23, SE = .03, t(134) = 66.01, p < .0001, suggesting that on average, 9-year-olds score 2.23 on emotional autonomy from mothers. The fact that this quantity is significantly different from zero is not of practical value, given that the scale ranges from 1 to 4. The population slope estimate is .085, SE = .01, t(255) = 6.51, p < .0001, suggesting that on average, youth gain .085 units of emotional autonomy each year during the pre- and early-adolescent years.7 As discussed earlier, a primary strength of individual growth curve modeling is the ability to move beyond general information and focus on the individual variability in the growth curves. As stated above, significant individual variability in both intercept and slope estimates is present, suggesting that growth curves of emotional autonomy vary across individuals. Figure 6 shows the model-implied individual growth curves for the same 20 individuals whose raw data trajectories were presented in Fig. 5 (the figure also includes the population growth curve, denoted by the thicker line). A Note on Model Specification An important issue to consider when fitting individual growth curve models has to do with the “complexity” of the model specification. In this set of analyses, we chose to present the results of five increasingly complex model specifications. In general, model complexity is determined by adding and/or subtracting various fixed and random 6 In this case, the autocorrelation can be computed by forming a ratio of the population estimates of the between-person variance (i.e., level 2 variance) to the total variance (i.e., level 1 within— person variance plus level 2 variance): σν02/(σν02 + σe2) = .07840/ (.07840 + .1885) = .29. 7 df for the tests of individual-level effects are derived through a somewhat complicated algorithm. We used the ddfm = bw option on the model statement in SAS PROC Mixed. Interested readers are encouraged to see relevant SAS Institute documentation. 1011 1012 DeLucia and Pitts Examining Individual Growth Curves of Emotional Autonomy as a Function of Spina Bifida Group Membership and Child Gender Emotional autonomy 4 Given the presence of significant variability in the individual estimates (intercept and/or slope), many hypotheses may present themselves. Most generally, can any of the variability in these estimates be attributed to individual-level variables? In our next model, we examine whether individual variability in intercept and slope estimates can be accounted for by time-invariant predictor variables—spina bifida group membership, participant gender, and their interaction. 3 2 1 8 9 10 11 12 13 14 15 Age Figure 6. Plot of fitted data of emotional autonomy for the same 20 youth. effects. For example, the unconditional linear model is more complex than is the compound symmetric model, because the former allows for individual variability in the linear rates of change, whereas the latter does not. As such, the unconditional linear model attempts to capture a richness in the data that is not captured by the compound symmetry model. When the repeated observations arise from fixed measurement occasions (e.g., all participants are assessed on the same days), there are relatively straightforward “rules” that allow one to determine the upper bound of model complexity (e.g., it is possible to estimate “j–1” random effects, when j is the number of measurement occasions). When the spacing of measurement occasions varies by individual, as was the case in this study, determining the upper bound of model complexity is not as straightforward. Typically, in substantive research, the complexity of the model is determined in part by extant theory and in part by possible statistical constraints. In this article, we estimated many models to give the reader a feel for many possible model specifications, although we knew a priori that some models would not fit the data well (e.g., the fixed intercept model). Although a comprehensive discussion of model specification is beyond the scope of this article, relevant discussions can be found in Snijders and Bosker (1999, chap. 12) and Longford (1993, p. 111).8 8 We also examined several models (data not presented) to test for the possibility of a random quadratic effect, which could not be reliably estimated in this set of analyses. Model 7: Predicting Intercept and Slope Variability from Spina Bifida Group Membership, Participant Gender, and Their Interaction In our final model, the level 2 model is expanded to include gender, group, and their cross product as predictors of both intercept and slope variability.9 Level 1 : Yij = b0i + b1i ( age ij - 9) + e ij . (1) Level 2 : b0i = b00 + b01 ( genderi ) + b02 ( groupi ) + b 03 ( genderi × groupi ) + n 0i . (2a) b1i = b10 + b11 ( genderi ) + b12 ( groupi ) + b13 ( genderi × groupi ) + n1i . (2b) None of the three predictors of intercept variability (tested by examining the t tests for the three parameter estimates β01, β02, and β03) were statistically significant (ps > .10). In contrast, however, the gender by group interaction (β13) was a marginally statistically significant predictor of slope variability, estimate = −.09, SE = .05, t(252) = −1.84, p = .067. Given the presence of this marginally significant interaction effect, all other effects, which are subsumed by the interaction, were retained in the model. (SAS syntax used to estimate this final model and selected SAS output are presented in the Appendix. The equation numbers provided above, e.g. Equation 1, can be used to link coefficients in the equations with relevant estimates in the output.) 9 Interaction terms in individual growth-curve modeling are estimated in analogous fashion to basic regression. Similar issues with respect to interpretability of the simple effects (effects of either of the first-order variables) arise; the effect is correctly interpreted as the “simple” effect of the variable when the other variable is equal to numerical zero. Individual Growth Curve Modeling 1013 Probing the Marginally Significant Gender by Group Interaction in Predicting Slope Variability Conceptually, there are two “sets” of results to consider— effects based on predictors of intercept variability and effects based on predictors of slope variability. Because all intercept effects were nonsignificant, however, they are not discussed further. As in basic regression analysis, in the presence of an interaction effect, it is helpful to consider estimating simple effects of the lower-order variables (e.g., the effect of gender on slope variability for able-bodied youth and the effect of gender on slope variability for youth with spina bifida). [See Aiken and West (1991) for a detailed discussion on testing and probing interaction effects and Holmbeck (2002) for an example of post-hoc probing of interactions in pediatric psychology research.) In Table III, we present relevant simple effects on slope variability, which will aid in the interpretation of the gender by group interaction. Importantly, significance tests for this full set of effects are not produced as part of the output of a single individual growth curve model analysis. Instead, multiple models (each with the same specification) were estimated in which the coding of gender and/or group was altered to allow determination of the full set of relevant effects, appropriate standard errors, and t tests of statistical significance. In Table III, we elected to present the results in a format that highlights the simple effects analysis of gender (i.e., the effect of gender on growth in emotional autonomy for both able-bodied comparison children and for children with spina bifida). Table III contains the population slope estimates for four subgroups of youth: (a) able-bodied girls, (b) able-bodied boys, (c) girls with spina bifida, and (d) boys with spina bifida. Figure 7 contains a plot of the growth curves for these four groups. From the examination of Fig. 7 and the estimates in Table III, it is apparent that very little difference exists in the rates of growth between able-bodied girls and ablebodied boys; slope estimates are .126 and .128 for these Table III. Simple Fixed Effect Estimates of Slope Variability, by Gender and Group Effects Estimate SE df t p Able-boded girls 0.126 0.026 252 4.92 <.0001 Able-bodied boys 0.128 0.024 252 5.41 <.0001 –0.002 0.035 252 –0.07 .95 <.001 Effect of gender for comparison youth Spina bifida girls 0.092 0.026 252 3.50 Spina bifida boys 0.003 0.023 252 0.12 .91 0.089 0.035 252 2.53 .01 Effect of gender for spina bifida youth two groups, respectively. Both groups display statistically significant positive growth in emotional autonomy. Moreover, the simple effects analysis allows a direct test of the difference in these slopes which, not surprisingly, was not statistically significant. However, the plot presented in Fig. 7 and corresponding estimates from Table III clearly suggest that girls and boys with spina bifida are growing at different rates in emotional autonomy; slope estimates of .092 and .003, respectively, for which the difference is statistically significant. Although growth is positive and significant for girls with spina bifida, there is no evidence that boys with spina bifida are growing in emotional autonomy. From the information summarized in Table III, it is also possible to compute the simple effects of spina bifida on slope variability for each gender. These simple effects are more pronounced for boys than for girls. For example, the simple effect of spina bifida for boys (the difference in slope estimates between able-bodied boys and boys with spina bifida) is .128 – .003 = .125. Although the test of this effect is not reported in Table III, it is statistically significant, t(252) = 3.77, p < .001. The simple effect of spina bifida for girls (the difference in slope estimates between able-bodied and spina bifida girls) is .126 – .092 = .034, which is nonsignificant, t(252) = .94, p = .35. Inclusion of child gender, spina bifida group, and their interaction resulted in a 51% reduction of slope variability from the unconditional linear growth model. Although the joint effects of these predictors account for over half of the explainable slope variability, significant variability in slope estimates remain.10 ANOVA Comparison We also analyzed the data as a 2 (gender) × 2 (spina bifida group) × 3 (time—i.e., measurement wave) using a mixed design ANOVA (Keppel, 1991). In contrast to the results from our individual growth curve analysis, the gender by group by time interaction (similar to the effect of the gender by group interaction in predicting slope variability) was nonsignificant (p > .20).11 There were significant (or marginally significant) gender by time and group by time effects. Examination of the polynomial contrast coefficients (i.e., linear and quadratic) 10 The significance of the residualized slope variance can be tested using nested models in an analogous fashion as done above. 11 This was true for both the multivariate F statistic as well as the F statistic based on results applying the Huynh-Feldt correction. As well, the p level for the test of gender by group by linear change (most consistent with the individual growth-curve model) was p = .177. 1014 DeLucia and Pitts 4 Emotional autonomy 3 2 Solid lines = comparison group Dashed lines = spina bifida group Thick lines = girls Thin lines = boys 1 8 9 10 11 12 Age and their interaction with each gender and group suggested no presence of quadratic effects. Examination of the linear by gender and linear by group effects further suggested that (a) females were increasing faster than were males and (b) able-bodied youth were increasing faster than were youth with spina bifida. When juxtaposed with results from our individual growth curve analysis, these results provide a different picture of growth over time in emotional autonomy. That is, this analysis fails to account for the finding that males with spina bifida were growing at slower rates than were both males without spina bifida and females with spina bifida. This has implications for the interpretation of both the group by linear trend and the gender by linear trend effects. With respect to the group by linear trend interaction, collapsing males and females with spina bifida into a single group (even though they were growing at different rates) and comparing them with male and female able-bodied youth result in a confounding of the spina bifida by time effect. Similarly, with respect to the gender by linear trend effects, this analysis collapses males with and without spina bifida, comparing them with females with and without spina bifida. 13 14 15 Figure 7. Plot of the simple gender by spina bifida interactions in predicting slope variability. From our simple effects analysis of the results of the individual growth curve models (Table III), we observed that males with and without spina bifida were growing at different rates; this is what is meant by confounding the gender by time effect. As suggested earlier, there are some advantages of the individual growth curve modeling approach relative to ANOVA. One of the more salient advantages in this example regards the heterogeneity in age at each measurement occasion. Although the individual growth curve model can treat age as continuous and variable across individuals, the ANOVA model assumes all individuals are measured at the same time point for any given wave. To explore whether this treatment of time was a contributing factor to the differences in inference between the two approaches, we coded age as discrete values (9, 11, and 13) and coded gender and group as effect codes before estimating the individual growth curve model. Similar to the results from ANOVA, the gender by spina bifida interaction was not a statistically significant predictor of slope variability (p = .13). There were, however, significant effects of both gender and spina bifida groups, with the same interpretation of effects as seen with Individual Growth Curve Modeling 1015 ANOVA. Thus, it appears that at least one reason for the difference in findings is the inability of the ANOVA model to treat time (age) as a truly continuous variable. Coming Full Circle: Putting the Individual Back into the Individual Growth Curve Model As mentioned above, even though we were able to significantly predict both intercept and slope variability, our final model still allowed for an individual component to be present in both the level 2 intercept and slope variability models. As such, individuals still varied from their group-level intercept and slope estimates. In Fig. 8, we display the model-implied individual growth curves for all study participants (by spina bifida group membership and gender) as a function of the parameters included in the final model. We present this final figure to remind the reader that, even though spina bifida group status and participant gender carry meaningful information about aggregate level growth trajectories, individuals are still allowed to deviate from their respective group trajectories, resulting in the individual growth curves displayed in the figure. Discussion The central goal of this article was to provide a brief, nontechnical introduction to individual growth curve modeling for the analysis of longitudinal data. We introduced several possible model applications in the areas of basic (developmental) and more applied (intervention) pediatric psychology research. Our discussion progressed from several hypothetical examples of model applications, which were more conceptual in nature, to an actual analytical example in which we analyzed data from an ongoing study of the possible impact of spina bifida on youth and family development (Holmbeck et al., 2003). Able-bodied girls Able-bodied boys 4 Emotional autonomy Emotional autonomy 4 3 2 1 3 2 1 8 9 10 11 12 13 14 15 8 9 10 Age 12 13 14 15 13 14 15 Age Spina bifida girls Spina bifida boys 4 4 Emotional autonomy Emotional autonomy 11 3 2 1 3 2 1 8 9 10 11 12 13 14 Age Figure 8. Plot of fitted data from final model, by group and gender. 15 8 9 10 11 12 Age 1016 DeLucia and Pitts Growth in Emotional Autonomy as a Function of Spina Bifida Status and Participant Gender From our analytical work, we were able to draw several conclusions about growth in emotional autonomy for male and female youth with spina bifida and able-bodied comparison youth. First, we established that on average, growth in emotional autonomy was linear in nature. Second, we established that growth curves of emotional autonomy varied across individuals. Third, we established that spina bifida status—captured by the group variable—and participant gender interacted in predicting rates of linear change in emotional autonomy. Further probing of this complex interaction revealed that for youth with spina bifida, girls exhibited significant positive growth in emotional autonomy, whereas boys exhibited nonsignificant growth in emotional autonomy. In contrast, for able-bodied comparison youth, girls and boys both exhibited significant positive growth in emotional autonomy. Analyses of these data using an ANOVA model failed to capture this complex interaction, resulting in different conclusions regarding growth over time in emotional autonomy for these four subgroups of youth. Practical Implications of Strengths of Individual Growth Curve Modeling Throughout our discussion, we juxtaposed individual growth curve models with the more traditional ANOVA model. Although we highlighted several strengths of individual growth curve modeling relative to ANOVA, it is worth discussing further some important statistical and practical implications of two particular strengths of individual growth curve modeling. Flexible Treatment of the Time Variable The fact that the time variable can be treated continuously in individual growth curve modeling, rather than as a set of fixed points (Bryk & Raudenbush, 1992), can have some very serious implications for the analysis of longitudinal data. Losing this flexibility can introduce error into the growth curve estimates. Consider a very simple example in which one individual was measured at three time points, each separated by 1 week, and that his/her scores on the outcome were five, six, and seven, respectively. Assume also that a second individual was measured at three time points, each separated by 2 weeks, and his/her scores on the outcome were five, seven, and nine, respectively. In reality, both of these individuals have the same growth curve, indicated by an intercept of five and a linear weekly growth rate of one. If it were assumed that these individuals were measured at weekly intervals (which is true for the first individual but false for the second), the first individual’s trajectory would be accurately estimated, but the second individual’s trajectory would be misestimated. In addition to this misestimation, spurious variability in growth curves estimates results. Although not true in this example, the actual form of growth (e.g., linear, quadratic) can be misestimated as well. Problems might arise even when the design and statistical analysis is sound. For example, studies may recruit participants of differing ages at baseline, though subsequent measurement intervals are the same for all participants (e.g., 1 year). Common practice is to include baseline age as a covariate in the analysis. However, to the extent that growth is nonlinear (e.g., physical stature), this analytical solution will not result in unbiased estimates. Individual growth curve models, by allowing time (age) to be continuous, do not suffer from this problem. For a general discussion of the possible impact of age heterogeneity on the variance estimates of the growth factors, see Mehta and West (2000). Data are Truly Modeled at the Individual Level As noted above, the focus on the individual achieved in individual growth curve modeling is an important advantage and has many possible practical applications. For example, intervention researchers can use individual growth curves of treatment response to plan the timing of follow-up or booster sessions (e.g., individuals whose treatment response is decaying more rapidly can be given booster sessions earlier than individuals whose treatment response is decaying less rapidly). Similarly, researchers interested in studying the development of risk behaviors (e.g., externalizing, substance use) can use the individual growth curves derived in an earlier period of development (e.g., preadolescence) to make decisions about which youth to follow-up in a subsequent developmental period (e.g., researchers might want to randomly sample among subclasses of individuals— say those with average growth, those with very steep growth rates, and those with negative growth rates) (Nagin, 1999). In these examples, information concerning individual-level growth, in addition to information about aggregate-level growth, is extremely useful. Some Suggestions for Beginning Users We hope this article will encourage researchers to learn more about possible applications of individual growth curve models in their own areas of research. Although the early stages of such an endeavor can seem quite overwhelming, there are many excellent articles, chapters, Individual Growth Curve Modeling 1017 and book sections dedicated to this general class of statistical procedures. Although no single work will completely unpack the “mystery” of individual growth curve models for beginning users, we have found the following sources particularly helpful in increasing our knowledge (Bryk & Raudenbush, 1987, 1992; Gibbons et al., 1993; Littell et al., 1996; Singer & Willett, 2003; Singer, 1998; Snijders & Bosker, 1999; Willett, 1988, 1997; Willett, Singer, & Martin, 1998). Interested readers might begin their study by reading the first eight chapters of Singer and Willett’s (2003) book, which includes a wonderful mix of conceptual and technical explanations of individual growth curve models.12 Readers can also visit the book’s companion website (Singer & Willett, 2005), which contains links to downloadable data sets and syntax files that can be used to estimate individual growth curve models in several commonly used software programs (e.g., SAS, SPSS). We have also found D. Hedeker’s website (2005) an excellent resource for learning more about longitudinal modeling in general and individual growth curve models more specifically. For example, the website has links to lecture notes and sample programs for a course he teaches on longitudinal data analysis in the Division of Epidemiology and Biostatistics at the University of Illinois at Chicago. We also encourage visiting D. Hedeker’s website because it contains links to many freely downloadable software programs (and user manuals), all of which can be used to estimate individual growth curve models (or more generally, mixed-effects regression models) for continuous as well as other data types (e.g., Hedeker & Gibbons, 1996a, 1996b; Hedeker, 1999). Although software to estimate these kinds of models is readily available today (Snijders & Bosker, 1999, chap. 15), free software remains the exception. generalizations exist for nonnormal data). Throughout the article, we have discussed many possible applications of individual growth curve models for basic and applied pediatric psychology research. In addition to discussion of the model’s conceptual framework, we also estimated several models examining growth in emotional autonomy during the preadolescent and adolescent years in a sample of youth with spina bifida and able-bodied comparison youth. Our analytical work provided examples of the kinds of interesting hypotheses; these models can be used to examine: (a) Is there growth in the phenomenon? (b) What is the shape of growth? (c) Are individuals growing at different rates? and (d) Are there person-level characteristics that can predict variability in growth parameters (both intercepts and slopes). In addition, our discussion also touched on several important statistical issues worthy of consideration, some of which were specific to individual growth curve modeling (e.g., giving the time variable a meaningful metric), and others of which were more general in nature (e.g., the probing of interaction effects). We hope this article will encourage researchers to learn more about individual growth curve models and consider possible applications of these models for their work. Acknowledgments We thank Grayson N. Holmbeck for providing the data analyzed in this study and Donald Hedeker for statistical consultation. Received February 28, 2005; revisions received May 18, 2005 and July 22, 2005; accepted August 1, 2005 References Conclusion Individual growth curve modeling is an extremely powerful and flexible procedure for the analysis of longitudinal data. It offers researchers several advantages over more traditional methods (e.g., data are modeled at the individual level, flexible treatment of the time variable, 12 It is worth noting that the latter seven chapters of this text are dedicated to describing a general class of statistical procedures typically referred to as survival modeling, in which the modeling of “event” timing/occurrence is of primary interest. Although discussion of survival modeling was beyond the scope of this article, these kinds of models might also be of interest to pediatric psychologists interested in questions that center more on the timing of events (e.g., time to school integration following a traumatic injury). Aiken, L. S., & West, S. G. (1991). Multiple regression: Testing and interpreting interactions. Newbury Park, CA: Sage. Baron, R. M., & Kenny, D. A. (1986). The moderatormediator variable distinction in social psychological research: Conceptual, strategic, and statistical considerations. Journal of Personality and Social Psychology, 51, 1173–1182. Biesanz, J. C., Deeb-Sossa, N., Papadakis, A. A., Bollen, K. A., & Curran, P. J. (2004). The role of coding time in estimating and interpreting growth curve models. Psychological Methods, 9, 30–52. Bryk, A. S., & Raudenbush, S. W. (1987). Application of hierarchical linear models to assessing change. Psychological Bulletin, 101, 147–158. 1018 DeLucia and Pitts Bryk, A. S., & Raudenbush, S. W. (1992). Hierarchical linear models: Applications and data analysis methods. Newbury Park, CA: Sage. Bryk, A. S., Raudenbush, S. W., & Congdon, R. T. (1996). HLM: Hierarchical linear and nonlinear modeling with the HLM/2L and HLM/3L programs. Chicago: Scientific Software International. Choeng, J., MacKinnon, D. P., & Khoo, S. T. (2003). Investigation of mediational processes using parallel process latent growth curve modeling. Structural Equation Modeling, 10, 238–262. Coakley, R. M., Holmbeck, G. N., Friedman, D., Greenley, R. N., & Thill, A. W. (2002). A longitudinal study of pubertal timing, parent-child conflict, and cohesion in families of young adolescents with spina bifida. Journal of Pediatric Psychology, 27, 461–473. Ferrer, E., & McArdle, J. J. (2004). An experimental analysis of dynamic hypotheses about cognitive abilities and achievement from childhood to early adulthood. Developmental Psychology, 40, 935–952. Gibbons, R. D., Hedeker, D., Elkin, I., Watermaux, C., Kraemer, H. C., Greenhouse, J. B., et al. (1993). Some conceptual and statistical issues in analysis of longitudinal psychiatric data. Archives of General Psychiatry, 50, 739–750. Hedeker, D. (1999). MIXNO: A computer program for mixed-effects nominal logistic regression. Journal of Statistical Software, 4, 1–92. Hedeker, D. (2005). Don Hedecker’s homepage. Retrieved August 30, 2005, from http://tigger.uic. edu/∼hedeker/ Hedeker, D., & Gibbons, R. D. (1996a). MIXOR: A computer program for mixed-effects ordinal probit and logistic regression analysis. Computer Methods and Program in Biomedicine, 49, 157–176. Hedeker, D., & Gibbons, R. D. (1996b). MIXREG: A computer program for mixed-effects regression models with autocorrelated errors. Computer Methods and Program in Biomedicine, 49, 229–252. Hedeker, D., & Gibbons, R. D. (1997). Application of random-effects pattern-mixture models for missing data in longitudinal studies. Psychological Methods, 2, 64–78. Holmbeck, G. N. (1997). Toward terminological, conceptual, and statistical clarity in the study of mediators and moderators: Examples from the child clinical and pediatric psychology literatures. Journal of Consulting and Clinical Psychology, 65, 599–610. Holmbeck, G. N. (2002). Post-hoc probing of significant moderational and mediational effects in studies of pediatric populations. Journal of Pediatric Psychology, 27, 87–96. Holmbeck, G. N., Coakley, R. M., Hommeyer, J. S., Shapera, W. E., & Westhoven, V. C. (2002). Observed and perceived dyadic and systemic functioning in families of preadolescents with spina bifida. Journal of Pediatric Psychology, 27, 177–189. Holmbeck, G. N., Westhoven, V., Shapera, W. E., Bowers, R., Gruse, C., Nikolopoulos, T., et al. (2003). A multimethod, multi-informant, and multidimensional perspective on psychosocial adjustment in preadolescents with spina bifida. Journal of Consulting and Clinical Psychology, 71, 782–796. Hommeyer, J. S., Holmbeck, G. N., Wills, K. E., & Coers, S. (1999). Condition severity and psychosocial functioning in pre-adolescents with spina bifida: Disentangling proximal functional status and distal adjustment outcomes. Journal of Pediatric Psychology, 24, 499–509. Keppel, G. (1991). Design and analysis: A researcher’s handbook (3rd ed.). Englewood Cliffs, NJ: Prentice Hall. Krull, J. L., & MacKinnon, D. P. (2001). Multilevel modeling of individual and group level mediated effects. Multivariate Behavioral Research, 36, 249–277. Laird, N. M., & Ware, J. H. (1982). Random-effects models for longitudinal data. Biometrics, 38, 963–974. Littell, R. C., Milliken, G. A., Stroup, W. W., & Wolfinger, R. D. (1996). SAS system for mixed models. Cary, NC: SAS Institute. Longford, N. T. (1993). Random coefficient models. New York: Oxford University Press. MacKinnon, D. P. (1994). Analysis of mediating variables in prevention and intervention studies. In A. Cazares & L. Beatty (Eds.), Scientific methods for prevention intervention research (NIDA Monograph No. 139, pp. 127–153). Rockville, MD: National Institute on Drug Abuse. MacKinnon, D. P., Lockwood, C. M., Hoffman, J. M., West, S. G., & Sheets, V. (2002). A comparison of methods to test mediation and other intervening variable effects. Psychological Methods, 7, 83–104. McArdle, J. J., Hamagami, F., Jones, K., Jolesz, F., Kikinis, R., Spiro, A., et al. (2004). Structural modeling of dynamic changes in memory and brain structure using longitudinal data from the normative aging study. Journals of Gerontology: Series B: Psychological Sciences and Social Sciences, 59B, 294–304. Mehta, P. D., & West, S. G. (2000). Putting the individual back into individual growth curves. Psychological Methods, 5, 23–43. Individual Growth Curve Modeling 1019 Nagin, D. S. (1999). Analyzing developmental trajectories: A semiparametric, group-based approach. Psychological Methods, 4, 139–157. Rogosa, D. (1988). Myths about longitudinal research. In K. W. Schaie, R. T. Campbell, W. Meredith, & S. C. Rawlings (Eds.), Methodological issues in aging research (pp. 171–210). New York: Springer. Rogosa, D., Brandt, D., & Zimowski, M. (1982). A growth curve approach to the measurement of change. Psychological Bulletin, 92, 726–748. Singer, J. D. (1998). Using SAS PROC MIXED to fit multilevel models, hierarchical models, and individual growth models. Journal of Educational and Behavioral Statistics, 24, 323–355. Singer, J. D., & Willett, J. B. (2003). Applied longitudinal data analysis: Modeling change and event occurrence. New York: Oxford University Press. Singer, J. D., & Willett, J. B. (2005). Applied longitudinal data analysis: Modeling change and event occurrence. New York: Oxford University Press. Retrieved August 30, 2005, from http://gseacademic.harvard. edu/alda/ Snijders, T. A. B., & Bosker, R. J. (1999). Multilevel analysis: An introduction to basic and advanced multilevel modeling. London: Sage. Steinberg, L., & Silverberg, S. B. (1987). The vicissitudes of autonomy in early adolescence. Child Development, 57, 841–851. West, S. G., & Aiken, L. S. (1997). Toward understanding individual effects in multicomponent prevention programs: Design and analysis strategies. In K. J. Bryant, M. Windle, & S. G. West (Eds.), The science of prevention: Methodological advances from alcohol and substance abuse research (pp. 167–209). Washington, DC: American Psychological Association. Willett, J. B. (1988). Questions and answers in the measurement of change. In E. Rothkopf (Ed.), Review of research in education 1988–1989 (pp. 345–422). Washington, DC: American Educational Research Association. Willett, J. B. (1997). Measuring change: What individual growth modeling buys you. In E. Arnsel & K. A. Reninger (Eds.), Change and development (pp. 213–243). Maywah, NJ: Erlbaum. Willett, J. B., Singer, J. D., & Martin, N. C. (1998). The design and analysis of longitudinal studies of development and psychopathology in context: Statistical models and methodological recommendations. Development and Psychopathology, 10, 395–426. 1020 DeLucia and Pitts Appendix SAS Syntax and Selected Output From Final model Described in Paper /* This syntax and accompanying data file can be downloaded from the following website: http://userpages.umbc.edu/~spitts SAS syntax is presented in bold-face type and comments are presented in regular type enclosed within the following comment indicators /* */ Options nocenter; /*This option results in output printed flush left.*/ /* Below code reads in space-delimited raw data file, assigns variable names for existing variables, creates three additional variables (i.e., age91, age92, age93), and creates a new working data file (i.e., eaut). Variables are as follow: id = participant id group = group membership (0 = comparison, 1 = spina bifida) gender = gender (0 = female, 1 = male) age1 = age at time 1 (age2 and age3 follow same form) age91 = age - 9 at time 1 (which centers age variable at age 9) this variable will be used to examine linear effect of age in growth models (age92 and age93 follow same form) eaut1 = emotional autonomy from mothers at time 1 (eaut2 and eaut3 follow same form) */ data eaut; infile 'C:\eaut.txt'; input id group gender age1 age2 age3 eaut1 eaut2 eaut3; age91 = age1 - 9; age92 = age2 - 9; age93 = age3 - 9; run; /* The working data file 'eaut' is a "wide" data file in that the repeated observations are designated as multiple variables for each individual (e.g., age1, age2, age3). As such, each individual takes up 1 row in the data file. To execute the mixed procedure, the data file must be converted to a "long" file, in which the repeated observations are designated as multiple linked records for the same individual. As such, each individual has multiple rows in the data file. For example, if we consider a hypothetical participant id# 1000 who is 8-years-old at T1, 10 at T2, and 12 at T3, his/her id and age variables would be represented in a wide data file as follows 1000 8 10 12 and in a long data file as follows 1000 8 1000 10 1000 12 The below array statement converts the wide file 'eaut' to the long file 'eautlong', renames the "linear" age variables (i.e., age91, age92, age93) age9, renames the emotional autonomy variables (i.e., eaut1, eaut2, eaut3) eaut, and retains only the variables that will be used in the mixed procedures. The resulting long data file has five variables in the following order: id group gender age9 eaut. */ Individual Growth Curve Modeling 1021 1022 DeLucia and Pitts Individual Growth Curve Modeling 1023
© Copyright 2026 Paperzz