Chapter |2| Aging, Cohorts, and Methods Yang Yang Department of Sociology & Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, North Carolina CHAPTER CONTENTS Introduction Early Literature Why Cohort Analysis? Why is there an APC Identification Problem? Conventional Solutions New Developments: Models, Methods, and Substantive Research Research design I: Age-by-Time Period Tables of Rates/Proportions INTRODUCTION 17 18 18 18 18 19 19 Guidelines for Estimating APC Models of Rates 19 Temporal Changes of US Adult Lung Cancer Mortality 20 Conclusion 21 Research Design II: Repeated Cross-Sectional Surveys 21 Hierarchical Age-Period-Cohort (HAPC) Models 22 Social Inequality of Happiness 22 Conclusion and Extensions 23 Research Design III: Accelerated Longitudinal Panels 24 Growth Curve Models of Individual and Cohort Change 24 Sex and Race Health Disparities Across the Life Course 25 Conclusion Directions for Future Research Continuously Evolving Cohort Effects Model Longitudinal Cohort Analysis of Balanced Age-by-Cohort Data Structure References © 2011 Elsevier Inc. All rights reserved. DOI: 10.1016/B978-0-12-380880-6.00002-2 27 27 27 28 29 Theoretical developments combined with new methodological tools and data sources have contributed to considerable growth in aging research in the social sciences over the past few decades (Elder, 1985). To address key questions of aging and the life course paradigm – how individual lives unfold with age and are shaped by historical time and social context – researchers often need to compare age-specific data recorded at different points in time and from different birth cohorts. A systematic study of such data, termed the age-period-cohort (APC), or simply “cohort analysis,” is one of the most useful means to gain a greater understanding of the process of aging within and across human populations. The challenges posed by cohort analysis are well known. Whether observed time-related differences are related to aging or cohort variation is a question usually deemed conceptually important but empirically intractable for two reasons. The first is data limitations. Using cross-sectional data at one point in time, for example, aging and cohort effects are confounded. Similarly, comparing data from multiple time periods for the same age group precludes distinguishing between period and cohort effects. The second reason is the use of linear regression models that suffer from either specification errors or an identification problem and consequently are incapable of distinguishing age, period, and cohort effects. What should one do to model these effects in empirical research to understand the true social and biological mechanisms generating the data? This chapter aims to provide some useful guidelines on how to conduct cohort analysis. The expository strategy is to provide those guidelines in the context of illustrative empirical analyses. The remainder of this chapter is organized as follows. The next section summarizes the state of knowledge on cohort analysis in aging research in 17 Part |1| THEORY AND METHODS sociology, demography, and epidemiology. Following on further, the next section synthesizes new developments in cohort analysis, including models, methods, and substantive applications. Finally, the last section outlines new directions for which methods and data are promising, but not yet completely developed. EARLY LITERATURE Why Cohort Analysis? In a classic article, Norman Ryder (1965) made an extended argument for the conceptual relevance of cohorts to an extraordinary range of substantive issues in social research. He argued that cohort membership could be as important in determining behavior as other social structural features such as socioeconomic status (SES). APC analysis is, in this sense, synonymous with cohort analysis (Smith, 2008). Among various cohorts defined by different initial events (such as marriage and college/university entrance), birth cohorts are the most commonly examined unit of analysis in demographic and aging research. A birth cohort shares the same birth year and ages together. Birth cohorts born in different time periods that encounter different historical and social conditions as they age would conceivably have diverse developmental paths. Cohort analysis concerns the precise depiction of such time-related phenomena, is central to the study of aging and the life course, and is crucial for inference. Age effects represent aging-related physiological or developmental changes and offer clues to etiology in epidemiologic studies. Period effects reflect changes in contemporaneous social, historical, and epidemiologic conditions that affect all living cohorts. Cohort effects reflect different formative experiences resulting from the intersection of individual biographies and macrosocial influences. In the absence of period and cohort changes, age effects are broadly applicable across historical eras and different cohorts. Presence of either or both of these changes, however, indicates the existence of exogenous forces or exposures that are period- and/or cohort-specific. Why is there an APC Identification Problem? The essence of cohort analysis is the identification and quantification of different sources of variation that are associated with age, period, and/or cohort effects in an outcome of interest. Early investigators developed models for situations in which all three factors account for a substantive phenomenon and in which simpler two-factor models (such as an ageperiod model) are subject to model specification errors and spurious results (Mason et al., 1973). The 18 “conventional linear regression model,” also known as the APC multiple classification/accounting model (Mason et al., 1973), has been widely adopted as the general methodology for cohort analysis, but it also suffers from a well-known model identification problem. Specifically, any two of the constituent factors perfectly predict the third. For example, if date of birth (i.e. birth cohort) and time of measurement are known, age also is known. This produces multiple instead of unique estimates of the three effects. The Mason et al. (1973) article spawned a large methodological literature in the social sciences beginning with Glenn’s critique (1976) and Mason et al.’s reply (1976), followed by Fienberg and Mason’s work (1978). Similar debates occurred between Rodgers (1982) and Smith et al. (1982). These early investigations culminated in an edited volume on cohort analysis by Mason and Fienberg in 1985. Limitations of those analytic strategies propelled the search for new approaches. For example, a Bayesian approach was proposed by Saski and Suzuki (1987), which was again critiqued by Glenn, who cautioned against “mechanical solutions” to the problem (1987). APC analysis also took root in biostatistics and epidemiology (e.g. Clayton & Schifflers, 1987; Holford, 1991; Kupper et al., 1985). Several recent reviews provide useful additional material on these and related contributions to cohort analysis (Mason & Wolfinger, 2002; Yang, 2007a, 2009). Conventional Solutions Within the framework of linear regression models, three conventional solutions to the identification problem have been used. The purpose of each is to break the linear dependency between the three variables and, thus, to identify the model, but each has limitations. The constrained coefficients approach imposes one or more equality constraints on the coefficients of one parameter vector to just-identify (one constraint) or over-identify (two or more constraints) the model (Mason et al., 1973; Yang et al., 2004). For example, in an APC analysis of US tuberculosis mortality, Mason and Smith (1985) placed the constraint by equating the contrasts for ages 0–9 and 10–19. This yielded unique estimates for age, period, and cohort effects. This is the most widely used approach, but is not immune to problems. First, the analyst must rely on prior or external information that rarely exists or cannot be verified to select constraints. Second, different identifying constraints (e.g. two age groups vs two cohorts) can produce widely different estimates of age, period, and cohort effects. And third, all justidentified constrained coefficients models produce the same levels of goodness-of-fit to the data. Thus, model fit cannot be used as the criterion for selecting the best model (Yang et al., 2008). Chapter |2| AGING, COHORTS, AND METHODS The nonlinear parametric (algebraic) transformation approach defines a nonlinear parametric function for at least one of the age, period, and cohort variables so that its relationship to others is nonlinear (Mason & Fienberg, 1985). For example, one can specify a quadratic or cubic function of age to model the nonlinear rate of change in happiness with age (Yang, 2008a). The drawbacks of this approach are that: (1) specifying a structure of the parameters makes the model less flexible and (2) it may not be clear what nonlinear function should be defined. The proxy variables approach uses one or more proxy variables to replace the age, period, or cohort variable. This is a popular approach because of its substantive appeal. After all, the indicator variables of age, period, and cohort serve as surrogates for different sets of unmeasured structural correlates (Hobcraft et al., 1982). Examples of proxy variables for cohort effects include relative cohort size (O’Brien et al., 1999) and cohort mean years of smoking before age 40 (Preston & Wang, 2006). Unemployment rate, labor force size, and gender role attitudes have been used as proxy variables for period effects (Pavalko et al., 2007). Smith and colleagues (1982, p. 792), however, cautioned that replacing cohort or period with a measured variable “leaves open the question of whether all of the right measured variables have been included in an appropriately wrought specification. Although replacing an accounting dimension with measured variables solves an identification problem, it makes room for others.” Thus, assuming that all of the variation associated with the A, P, or C dimension is fully accounted for by the chosen proxy, variable(s) may be unwarranted. Winship and Harding (2008) proposed a mechanism-based approach that accommodates a more general set of models. Using the framework of causal modeling described by Pearl (2000) to achieve identification, the approach allows any given measured variable to be associated with more than one of the age, period, and cohort dimensions and provides statistical tests for the plausibility of alternative restrictions. If a rich set of mechanism variables are available and the original age, period, and cohort categories can be conceived as the exogenous elements of a causal chain (Smith, 2008), this is an enriched and sophisticated alternative to the proxy variable approach. Where does the early literature on cohort analysis leave us today? If a researcher has a temporally ordered data set and wants to tease out its age, period, and cohort components, how should he/ she proceed? Can any methodological guidelines be recommended? A problem with much of the extant literature is that there is a lack of useful guidelines on how to conduct an APC analysis. Instead, one is left with the impression that either it is impossible to obtain meaningful estimates of the distinct contributions of age, cohort, and time period, or that the conduct of an APC analysis is an esoteric art that is best left to a few skilled methodologists (Yang, 2009). I now seek to redress this situation by focusing on recent developments in APC analysis for three research designs commonly used in research on aging. NEW DEVELOPMENTS: MODELS, METHODS, AND SUBSTANTIVE RESEARCH In this section, I introduce a new estimation method to solve the identification problem within the context of conventional linear models. Note that the identification problem is not inevitable in all settings – only in the case of linear models that assume additive effects of age, period, and cohort. The text then describes other families of models that are not subject to this problem and are suitable for many research questions. Empirical studies of aging, health, and well-being for each of the three research designs will be use as illustrative examples. Sample codes using standard statistical software packages are available at http://home.uchicago.edu/~yangy/apc. Research Design I: Age-by-Time Period Tables of Rates/Proportions In demographic and epidemiologic investigations, researchers are typically interested in aggregate population-level data such as rates of morbidity, disability, or mortality. Using the conventional data structure, rates or proportions are arranged in rectangular arrays with age intervals defining the rows and time periods defining the columns. Age and period are usually of equal interval length (e.g. 5 or 10 years), so the diagonal elements of the matrix correspond to birth cohorts. An example of this data structure is shown in Figure 2.1, where age-specific lung cancer death rates for 14 five-year age groups (30 to 95) of women are plotted by eight time periods between 1960 and 1999 and for 21 cohorts born between 1870 and 1920 (mid-year). Guidelines for Estimating APC Models of Rates Many previous applications of cohort analysis using linear regression models are based on the ageby-period matrix. Among the limitations of conventional solutions to the identification problem is the lack of guidelines for estimating APC models of rates or proportions. A three-step procedure is helpful for this purpose. Step 1 is to conduct descriptive data analyses using graphics. The objective is to provide qualitative understanding of patterns of age, 19 Part |1| THEORY AND METHODS 300 250 1920 birth cohort 1915 Rate/100 000 200 1910 150 1905 1900 100 1895 50 1995–99 1990–94 1985–89 1890 1980–84 1885 1975–79 1880 1970–74 1965–69 1875 1870 1960–64 0 30 35 40 45 50 55 60 65 70 75 80 85 90 95+ Age Figure 2.1 US female lung cancer mortality rates per 100 000 by age at death, period of death, and birth cohort. Source: Yang (2008b). period, cohort, and two-way age by period and age by cohort variations. Step 2 is model fitting. The objective is to ascertain whether the data are sufficiently well-described by any single factor or two-way combination of age, period, and cohort. If these analyses suggest that only one or two of the three effects are operative, the analysis can proceed with a reduced model that omits one or two groups of variables and there is no identification problem. If, however, these analyses suggest that all three dimensions are at work, Yang and colleagues (2004, 2008) recommend Step 3: applying a new method of estimation of the linear model, the intrinsic estimator (IE). We will now look at these steps using examples from recent analyses of US adult chronic disease mortality change (Yang, 2008b). The example below focuses on lung cancer mortality (see Yang, 2008b for the complete set of analyses of other causes of death). Temporal Changes of US Adult Lung Cancer Mortality Figure 2.1 is a graphical presentation of the female lung cancer rate data for the Step 1 descriptive analyses. The age pattern shows much higher rates of deaths due to lung cancers in older ages than young adulthood. Rates of lung cancer deaths in women 20 aged 40 and older also increased dramatically. Large increases in successive cohorts are also evident: more recent cohorts of women had higher death rates than their predecessors at the same ages. Because cohort effects can be interpreted as a special form of interaction effect between age and period, cohort effects can be detected by non-parallel age-specific curves by time period – a pattern evident in Figure 2.1. Peak ages shifted to the left over time, indicating higher death rates at younger ages for more recent female cohorts. The graphical analyses present all available rates, but provide no summary or quantitative assessment of the source of mortality change (Kupper et al., 1985). The curve of age-specific death rates for women in any given time period cuts across multiple birth cohort curves. Therefore, the period curve is affected by both varying age effects is cohort effects. The question of which of these effects is more important in explaining the mortality change can only be addressed using regression modeling. In Step 2, the relative importance of the A, P, and C dimensions can be determined by comparing overall measures of model fit (e.g. R-squared for linear regression models or deviance statistics/penalized likelihood functions such as Bayes the Information Criterion for generalized linear models). For this analysis, goodness-of-fit statistics for four log-linear Chapter |2| AGING, COHORTS, AND METHODS Age effect 2.0 Male Female 1.0 –2.0 0.0 –4.0 –1.0 9 –9 4 –9 95 19 9 –8 90 19 4 85 –8 80 19 -7 9 19 4 –7 75 19 9 70 –6 19 65 19 –6 4 –2.0 19 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90 95+ Age 60 Log coefficient 0.0 –6.0 Period effect 2.0 Male Female Year Cohort effect 2.0 Male Female Log coefficient 1.0 0.0 –1.0 –2.0 75 19 65 19 55 45 19 19 35 25 15 19 19 05 19 19 95 18 85 18 75 18 18 65 –3.0 Cohort Figure 2.2 Intrinsic estimates of age, period, and cohort effects of lung cancer mortality by sex. Source: Yang (2008b). models of female lung cancer mortality were produced: age effects only (A), age and period effects (AP), age and cohort effects (AC), and age, period, and cohort effects (APC). The three-factor model best fit the data, regardless of the statistics used. Thus, all three sources of variation contribute to the mortality change and need to be included in the model. In Step 3, one estimates the APC model coefficients to reveal the age, period, and cohort effects. The IE is a new method of estimation that yields a unique solution to the APC model and is the unique estimable function of both the linear and nonlinear components of the APC model. It has minimal assumptions and is estimated using principal component regression. The reason why the IE is useful lies in the special constraint it imposes – removing the influence of the number of age and period groups (which is not related to the outcome) on coefficient estimates. This constraint allows the IE to avoid the bias that the conventional equality constraints produce. For the mathematical foundations of the IE see Yang et al. (2004) and for additional information about its statistical properties see Yang et al. (2008). The log coefficient estimates using the IE on the lung cancer mortality data by sex are shown in Figure 2.2. Age effects are concave: the mortality risk increased rapidly from early adulthood, peaked around ages 80 to 85, and then leveled off. Cohort mortality curves increased first and decreased for more recent cohorts, following an inverse-U shape. Period effects are modest compared to age and cohort effects. Conclusion Is the IE a “final” or “universal” solution to the APC “conundrum”? No. There will never be a solution within the confine of linear models. Researchers should never naively apply any estimator to APC data and expect to get meaningful results. In all cases, APC analysis should be approached with great caution and awareness of its many pitfalls. The IE appears to be the most useful approach to the identification and estimation of these models, but shares some of the limitations of conventional linear models. Research Design II: Repeated Cross-Sectional Surveys Repeated cross-sectional surveys, such as the Current Population Survey (CPS) and General Social Survey (GSS), provide unique opportunities for cohort analysis. Pooling data of all survey years yields the 21 Part |1| THEORY AND METHODS age-by-year data structure needed for cohort analysis. Many researchers previously assumed that the APC identification problem for repeated cross-sectional surveys is the same as that for population rates. But sample surveys are different from aggregate population-level data in two important ways. First, repeated cross-sectional surveys allow age intervals to differ from period intervals. More importantly, the sample size for each age-by-period cell exceeds rather than equals one. This data structure suggests a multi-level data structure. In this design, respondents are nested in and cross-classified simultaneously by the two higher-level social contexts defined by time period and birth cohort. That is, individual members of any birth cohort are interviewed in multiple replications of the survey, and individual respondents in any particular wave of the survey can be drawn from multiple birth cohorts. Hierarchical Age-Period-Cohort (HAPC) Models Given these data characteristics, there are two potential solutions to the identification problem in addition to those described in the first section. First, within the framework of the conventional linear regression model, different temporal groupings for the A, P, and C dimensions can be used to break the linear dependency. For example, one can use single year of age, time periods corresponding to years in which surveys are conducted (which may be several years apart), and cohorts defined either by five- or ten-year intervals, which are conventional in demography, or by application of substantive classification (e.g. War babies, baby boomers, Baby Busters, etc.). Despite the appeal of this approach, it has several problems. First and foremost, linear models assuming additivity of age, period, and cohort effects not only produce the identification problem, but are poor approximations of the processes generating social changes. Second, simple linear models do not account for multilevel heterogeneity in the data and may underestimate standard errors. Third, the APC accounting model cannot include explanatory variables and test substantive hypotheses. A useful alternative is a different family of models that (1) do not assume fixed age, period, and cohort effects that are additive and therefore avoid the identification problem; (2) can statistically characterize contextual effects of historical time and cohort membership; and (3) can accommodate covariates to aid better conceptualization of specific social processes generating observed patterns in the data. The Hierarchical APC (HAPC) models approach (Yang, 2006; Yang & Land, 2006) is appropriate for these purposes and can address questions unique to repeated cross-sectional surveys. If there is leveltwo (period and/or cohort) heterogeneity, assuming fixed period and cohort effects ignores this hetero- 22 geneity and may not be adequate. In this case, crossclassified random effects models (CCREMs), mixed (fixed and random) effects models, can be used to quantify the level-2 heterogeneity. The objectives of CCREM are to (1) assess the possibility that individuals within the same periods and cohorts share unobserved random variance and (2) test explanatory hypotheses of A, P, and C effects by including additional variables at all levels. Social Inequality of Happiness To illustrate the application of HAPC models, I will use the example of subjective well-being from a recent study (Yang, 2008a). This study is based on time series data on happiness in the US as reported by people 18 years of age and older from the GSS 1972–2004 (N 28 869). This study addressed the following questions: Do people get happier with age and over time? How do social inequalities in happiness vary over the life course and by time? Are some people born to be happy? Are there any birth cohort differences in happiness? The HAPC-CCREM method was applied to estimate fixed effects of age and other individual-level covariates and random effects of period and cohort. Hypotheses about differential sex, race, and SES effects over the life course were tested by entering their interaction terms with age. The level-2 (period- and cohortlevel) model was further specified to take into account error terms associated with period and cohort. The simplest specification of a HAPC model, as introduced by Yang and Land (2006), is a linear mixed effects model for normally distributed outcomes. Generalized linear mixed model specifications need to be used for dichotomous and multiple categorical outcome variables. In this case, the ordinal logit HAPC models of happiness were estimated. These estimates were obtained using the SAS PROC GLIMMIX. Next came a calculation of the predicted probabilities of being very happy by age, sex, and race, adjusting for all other factors and net of period and cohort effects. The age curves of happiness are quadratic and convex. The J-shaped net age effects suggest that average happiness levels bottom out in early adulthood and then increase at an increasing rate across age. Thus, with age comes happiness. Everything else being equal, women are slightly happier than men at all ages, and the gap converges at older ages. And when other factors are held constant, the race gap shows stronger trends of convergence with reversals at around the age of 70, with blacks becoming slightly happier than whites afterwards. Thus, life course variations in social inequalities in happiness are more consistent with the age-as-leveler hypothesis than the cumulative advantage hypothesis. Longitudinal panel data are needed to sort out whether selective survival of the happier or other Chapter |2| AGING, COHORTS, AND METHODS processes are more plausible explanations of the stratification of aging and subjective well-being. Estimates of variance components at level-2 indicated significant period and cohort effects controlling for age and other individual-level variables. The results on period and cohort effects as estimated by the random effect coefficients showed variations in levels of happiness over time for all groups and that sex and race disparities decreased during the last 30 years. The most interesting cohort effect was a dip in happiness levels for cohorts born between 1945 and 1960, i.e. the baby boomers. What accounts for these period and cohort patterns? One hypothesis is that period effects may reflect changes in macroeconomic variables such as gross domestic product (GDP) and levels of joblessness between the 1970s and 1990s (Di Tella et al., 2003). Exploratory analysis suggests that GDP is concurrent with the period changes in happiness and that the unemployment rate has a lagged effect. The two period-level covariates (GDP per capita and changes in unemployment rate) can be added to the models for the overall mean, sex, and race effects to determine whether the economic environment explains period variations in happiness and sex and race gaps therein. In addition, the cohort analysis proposition emphasizes the exogenous social demographic environments into which cohorts are born and age. The baby boomer pattern is consistent with the Easterlin hypothesis that large cohort sizes have negative consequences on cohort members’ socioeconomic and other life course outcomes, such as sense of wellbeing and mental stress (Easterlin, 1987). Relative cohort size (adopted from O’Brien et al., 1999) can be added to the model for the overall mean to account for cohort-level variance. Results of the Yang (2008a,b) study indicated that GDP per capita was significantly and positively associated with happiness and larger cohort size reduces happiness significantly, although the residual period effect remains significant. Economic conditions also affect sex and race differences over time. Higher GDP per capita reduces the sex and race differences by 83% and 38%, respectively. Higher unemployment rates, interestingly, decrease the race gap. Relative cohort size not only reduces the cohort variance by 73%, but also diminishes its statistical significance. So the lower levels of happiness in baby boomers are largely due to their large size compared to their predecessors and successors. More competition to go to school and find jobs meant more pressure and strains as the boomers came of age – something that can decrease their positive affect and assessment of quality of life (QOL). Conclusion and Extensions The empirical example above suggests that HAPC analysis moves far beyond simply identifying age and cohort trends. It allows us to see how social stratification operates over the life course and historical time and to test the extent to which aging and cohort theories apply to subjective well-being in ways that cannot be achieved by a conventional linear regression analysis. The HAPC model does not incur the identification problem because the age, period, and cohort effects are not assumed to be linear and additive. This analytic technique opens new doors to research on aging by offering a systematic and flexible modeling strategy that takes advantage of the multilevel data structure and rich covariates in repeated crosssectional surveys. HAPC models can be extended in various ways to address other research questions. We focus on three extensions here: fixed vs random effects models, an F-test for the presence of random effects, and full Bayesian HAPC models. First, the HAPC-CCREM approach illustrated above uses a mixed (fixed and random) effects model with a random effects specification for the level-2 variables (period and cohort). A frequently used alternative is a fixed effects specification for the level-2 variables in which dummy variables are used to represent cohorts and survey years. This approach may be appropriate when the number of survey replications is relatively small (three to five). In general, however, the random effects specification is preferable to the fixed effects specification for the level-2 variables because (1) it avoids an assumption of fixed effects models that the dummy variables representing the fixed cohort and periods effects fully account for the group effects; (2) it allows group-level covariates to be incorporated into the model and explicitly models cohort characteristics and period events to test explanatory hypotheses; and (3) it is generally more statistically efficient for unbalanced research designs (designs in which there are unequal numbers of respondents in the period-by-cohort cells), which is typical in repeated cross-section surveys (Yang & Land, 2008). Second, the random effects of level-2 variables in the HAPC models, particularly the variance components of cohort and period, are obtained by using the restricted maximum likelihood (REML) method, which assumes that the level-2 residual or error terms are asymptotic normally distributed, and yields variance estimators with good large sample properties. When the number of level-2 units, in this case cohorts and periods, is not large, this assumption may not be appropriate. Further, the z-scores for the REML estimates of the variance components are only proximate. To test more exactly whether the birth cohort and period effects make statistically significant contributions to explained variance in an outcome variable, a general linear hypothesis may be applied. Specifically, one can use an F test to test the hypothesis of the presence of random effects. The F statistic is preferred over the z-score when the sample sizes 23 Part |1| THEORY AND METHODS for random effects are small (Littell et al., 2006) and can be constructed using sum of squares for the linear mixed effects model and log-likelihood functions for some generalized linear mixed effects models. Interested readers should see Yang et al. (2009) for detail on the formulation of the F-test. Third, the standard REML-EB (empirical Bayes) estimation algorithm for mixed effects models has other limitations when applied to APC analyses of finite time period survey data. As noted above, the numbers of periods and birth cohorts in surveys usually are too small to satisfy the large sample criterion required by the maximum likelihood estimation of variance components. In addition, the sample sizes within each cohort are highly unbalanced. These result in inaccurate REML estimates of variance-covariance components that further result in inaccurate EB estimates of fixed effects. This is because errors in the REML estimates may produce increased uncertainty in the EB estimates of fixed effects coefficients that will not be reflected in the standard errors. A remedy is to use the full Bayesian approach that, by definition, ensures that inferences about every parameter fully account for the uncertainty associated with all others. To this end, one specifies the level-1 model as the likelihood function, assigns prior probability distributions for all fixed effect parameters and variance components, and uses the Bayes rule to yield the joint model or posterior distribution. Gibbs sampling has been widely used to derive the marginal posterior distributions for each model parameter and can be implemented using the software WinBUGS (freely available at http://www.mrc-bsu.cam.ac.uk/ bugs/). For an example of Bayesian HAPC analysis, see Yang (2006). Research Design III: Accelerated Longitudinal Panels The former two designs rely on information from synthetic cohorts that contain different cohort members at each point in time. Inferences drawn from such designs, therefore, assume that synthetic cohorts mimic true cohorts, and changes over time across synthetic cohort members mimic the age trajectories of change within true cohorts. If the composition of cohorts does not change over time due to migration or other factors and sample sizes are large, these assumptions are generally met. However, longitudinal data obtained from the same persons followed over time are increasingly available. The accelerated longitudinal panel design – where multiple birth cohorts are followed over multiple points in time – is an important advance in aging and cohort research. The primary advantage of this design is that it provides cross-time linkages within 24 individuals and hence information pertaining to true birth cohorts. Growth Curve Models of Individual and Cohort Change Hierarchical models are a useful tool for modeling longitudinal data. Because repeated observations over time (level-1 units) can be viewed as nested within individuals (level-2 units), one can specify growth curve models to assess simultaneously the intracohort age changes and intercohort differences. Although the application of two-level hierarchical regression models to standard longitudinal data is relatively straightforward, growth curve analysis of multi-cohort multi-wave data is complicated by two issues. First, because the observable age trajectories of different cohorts initiate and end at different ages, cohort comparisons are based on different segments of cohort members’ life courses. This raises the question of the potential confounding of the age and cohort effects. Two analytic strategies help to resolve this problem. The first is the use of centered age variables (e.g. centered around cohort median). Age centering eases the interpretation of the intercept, stabilizes estimation, and prevents the bias in the estimate that arises from systematic variation in mean age across the cohorts, hence eliminating the confounding of age and cohort variables. Second, the models yield tests of significance of overlapping segments of the life course of adjacent cohorts. As waves of data accumulate, the number of overlapping ages of adjacent cohorts increases, which increases statistical power. In this case, age and cohort will become less and less confounded, making it increasingly possible to compare cohort differences in age trajectories. The second issue concerns period effects. This model does not explicitly incorporate period effects for both substantive and methodological reasons. First, in contrast to synthetic cohort designs that usually cover several decades, longitudinal designs typically span much shorter time periods (e.g. a decade or so). So, the effects of period can be assumed to be trivial and omitted from the models, especially if the theoretical focus is on aging. Second, it is challenging to estimate a separate period effect in accelerated longitudinal designs. In the framework of the growth curve models, the level-1 analysis reflects within-individual change by modeling the outcome as a function of the time indicator (Singer & Willett, 2003). This means that one can include age or wave (period), depending on substantive focus, but not both because within individuals age and period are the same. The simultaneous estimation of period effects creates the model identification problem, which requires different data designs and mixed model specifications to resolve (Yang & Land, 2006, 2008). Third, one does not need Chapter |2| AGING, COHORTS, AND METHODS to estimate period effect per se and can focus instead on the age by cohort interaction effects. Sex and Race Health Disparities Across the Life Course This illustrative example comes from a recent analysis of life course patterns and cohort variations in sex and race health gaps (Yang & Lee, 2009). An important question in recent aging research is whether, and how, social status affects health dynamics in later life. Evidence about whether social disparities in health grow or diminish over the life course has been inconsistent (e.g. House et al., 1994; Lynch, 2003; Wilson et al., 2007). Another key unresolved issue concerns the distinct roles of aging and cohort succession. Whether recent birth cohorts show more or less sex and race inequalities in health due to cohort changes in social attainment, health capital, and the roles of women and blacks merits further study. This study systematically examines intercohort variations and intracohort heterogeneity in health trajectories to shed light on the social mechanisms generating sex and race inequalities in health over the life course. It addresses three questions. First, are there cohort differences in age trajectories of health? It tests the intercohort change hypothesis by modeling main cohort and age by cohort interaction effects. Second, how do sex and race health disparities change over the life course within cohorts? The prevailing explanations for decreasing health gaps with age are the ageas-leveler and selective survival processes. The primary explanation for increasing health gaps with age is the cumulative advantage process. The double jeopardy hypothesis has also been proposed as a secondary explanation. Because these theoretical perspectives all emphasize intracohort differentiation in aging, a proper test should control for cohort differences that might otherwise be confounded with aging effects, but this was not previously tested. The current analysis tests the intracohort inequality hypothesis by modeling age by sex and age by race interaction effects, net of cohort effects. Third, are there intercohort variations in the intracohort health differentials by sex and race? The study further tests the intercohort difference in intracohort inequality hypothesis by incorporating three-way interactions between age, cohort, and social status. The data are from the Americans’ Changing Lives (ACL) study, which uses an accelerated longitudinal design. An initial sample of 3617 adults aged 25 and older from seven 10-year birth cohorts (before 1905 to 1964) were interviewed in 1986, 1989, 1994, and 2001/2002. Three health outcomes were of interest: depressive symptoms as measured by an 11-item CES-D (Center for Epidemiologic Studies – Depression) scale, physical disability as measured by a summary index indicating levels of functional disability (range 1–4), and self-rated health as measured by a scale indicating levels of perceived general health (range 1–5). The analytic sample consisted of 10 174 person-year observations from black and white respondents at all waves for which data on all variables were available. Growth curve models are the method of analysis. The level-1 model characterizes within-individual change with age. In this model, the response variable is modeled as a function of linear and quadratic terms of age, where age is centered around the cohort median. The level-2 model assesses individual differences in change with age and determines the associations between the change with age and person-level characteristics including birth cohort and its quadratic function, sex, race, and the interaction effects of sex and cohort and race and cohort. A similar model was tested for the quadratic growth rate and was not significant. Control variables are entered at level-1 for time-varying covariates (family income, marital status, chronic illnesses, body mass index (BMI), and smoking). The hierarchical linear modeling (HLM) growth curve methodology has the advantage of allowing data that are unbalanced in time (Raudenbush & Bryk, 2002). That is, it incorporates all individuals with data for the estimation of trajectories, regardless of the number of waves s/he contributes to the personyear data set. Compared to alternative modeling techniques, this substantially reduces the number of cases lost to follow-up due to mortality or nonresponse. However, this method does not completely resolve the sample selection problem unless it distinguishes those lost to follow up from those with complete data for all waves. If mortality and nonresponse are significantly correlated with worse health and key covariates of health, such as age and SES, parameter estimates of the health trajectories may be biased if they are not controlled. To account for this, the analysis controls for the effects of attrition by including dummy variables indicating the deceased and nonrespondents in the level-2 models. All statistical analyses are performed using SAS PROC MIXED. The analysis tested the cohort-based models (full model) against the models combining all cohorts (reduced model with no cohort effect) to determine whether cohort-specific trajectories can actually be represented by a single mean-age trajectory. The hypothesis that there are no cohort differences in age trajectories was rejected. Results from the growth models yield several significant findings. First, the intercohort change hypothesis is supported for all three health outcomes. There are significant cohort differences in health trajectories, with more recent cohorts faring better in physical functioning and self-rated health, but worse in mental health. Second, the intracohort inequality hypothesis is supported only for depression. Specifically, there is evidence of a converging sex gap in depression trajectories, but 25 Part |1| THEORY AND METHODS shows that the black excess in depression has become more pronounced for more recent cohorts. We further find significant intercohort differences in age changes of intracohort sex gaps in depression. Figure 2.4 shows the predicted age trajectories of depressive symptoms by sex for selected cohorts. It suggests that the sex gaps in growth rates of depression are strongly contingent upon cohort membership. We see less convergence in the male–female gap in depression with age in more recent cohorts, suggesting a weaker age-as-leveler process in these cohorts. constant sex and race gaps in physical and self-rated health trajectories within cohorts. Third, there is support for the hypothesis of intercohort variations in intracohort sex and race differences in mean levels of all health outcomes, but not in the growth rates other than depression. Sex and race gaps in mean levels of health narrowed across cohorts for disability, but widened for depression and perceived health. The predicted mean levels of CES-D scores are plotted in Figure 2.3 by cohort for blacks and whites based on the model estimates, adjusting for age and all other factors. It 1 White Black CES-D 0.8 0.6 0.4 0.2 0 –1905 1905–1914 1915–1924 1925–1934 1935–1944 1945–1954 1955–1964 Birth cohort Figure 2.3 Predicted mean levels of CES-D score by birth cohort: race gap. Source: Yang & Lee (2009). 1.2 1955–64 cohort 1 0.8 CES-D 1935–44 cohort 0.6 1915–24 cohort 0.4 0.2 –1905 cohort Men Women 0 –0.2 25 Figure 2.4 35 40 45 50 55 60 65 Age 70 75 80 Predicted age growth trajectories of CES-D score by birth cohort: sex gap. Source: Yang & Lee (2009). 26 30 85 90 95+ Chapter |2| AGING, COHORTS, AND METHODS Conclusion This analysis provides an example of how to address some long-standing questions in the stratification of aging from the perspective of cohort analysis. It provides substantial evidence that the process of cohort change is important for the theory, measurement, and analysis of social inequalities in health over the life course. Controlling for cohort effects substantially alters existing explanations of sex and race gaps in health trajectories over the life course. For example, because we do not find any significant changes in the intracohort sex or race gaps in disability or self-rated health with age, but find substantial intercohort variations in sex and race gaps in mean levels of these outcomes, we conclude that any changes with age observed in previous studies are actually due to intercohort differences in disability or self-rated health levels. Furthermore, there is no meaningful cumulative advantage process or age-as-leveler effect at work in later or earlier cohorts’ physical and self-rated health when the confounding effects of cohort and other social factors are controlled. The constant intracohort disparities in disability and self-rated health trajectories indicate the persistent disadvantages of women and African Americans. The analytic framework exemplified above facilitates studies of patterns of inequality within and between birth cohorts that can provide a particularly useful line of inquiry into how individual lives evolve with social change. For example, analyzing five waves of data from the China Health and Nutrition Survey (CHNS) from 1991 to 2004 with growth curve methods, a recent analysis reveals patterns of social stratification in health trajectories for multiple cohorts in the context of an ever-changing macro-social environment in China (Chen et al., 2010). The study found strong cohort variations in SES disparities on health trajectories: the effect of education on mean level of health is smaller for more recent cohorts; the income gap in age trajectories of health increases for earlier cohorts but decreases for most recent cohorts. These cohort effects are in an opposite direction to recent US studies (e.g. Lynch, 2003; Wilson et al., 2007). Thus, this study highlights the uniqueness of China’s social, economic, and political structures; China’s stage of epidemiologic transition; and the changing impact of the government on health care. In sum, cohort membership contextualizes aging-related outcomes such as mental and physical well-being, QOL, and longevity, and also conditions social inequalities therein. The significance of cohort change in shaping the individual life suggests that new patterns may emerge for future cohorts as a result of their new life circumstances. The methods and findings illustrated here should prompt future examinations of aging-related hypotheses in a cohort-specific context. DIRECTIONS FOR FUTURE RESEARCH Demographic rates and social indicators arrayed over time by age or cohort are canonical organizations of data in research on aging. A vast literature in the social sciences uses linear models methodology for APC analysis. Different solutions to the model identification problem often produce ambiguous and inconsistent results and have ignited continuous debates on whether any solutions exist or which solutions are better. Researchers do not agree on methodological solutions to these problems and have concluded that APC analysis is still in its infancy (Mason & Wolfinger, 2002). The developments introduced in this chapter highlight new solutions to this problem – or simply new ways of thinking about the problem. They provide an important insight that the identification problem is model-specific rather than data-specific (Fu, 2008). Because the use of linear models (and hence the identification problem) characterizes the majority of previous studies, investigators erroneously concluded that the problem is inherent in APC analysis. In fact, it is the linear models, not the APC data structure, that incur the identification problem. Thus, the most effective way of accounting for aging-related phenomena and social and demographic change is to develop alternative approaches that do not treat age, period, and cohort as independent covariates in an additive fixed effects model. In addition to the new estimator for the conventional models and hierarchical models already discussed above, two approaches – one methodological and the other data design – hold promise for moving cohort analysis to a new era of methodological development and substantive research. Continuously Evolving Cohort Effects Model It is well known that the linear model suffers from identification problems. Less appreciated is that the model also suffers from a conceptual problem. The model rests on key assumptions that do not always accurately describe most APC-related phenomena. It assumes that the effect of age is the same for all periods and cohorts (Hobcraft et al., 1982). But the influence of age may change over time and across cohorts. Consider, for instance, the dramatic declines in infant mortality over the past century. It also assumes that the effect of period is the same for people of all ages. But period effects are often age-specific. For example, the influenza epidemic of 1918 caused especially high mortality among people in their teens and twenties. Similarly, it assumes that a cohort effect remains the same as long as the 27 Part |1| THEORY AND METHODS cohort lives. But cohorts must change (Ryder, 1965). That is, cohorts are continuously exposed to events whose influences accumulate over the life course. Wars and epidemics are examples of events that may occur in the middle of a cohort’s life and leave an imprint on all of its subsequent behaviors and outcomes. New events constantly occur. A model with unchanging cohort effects is appropriate only if all relevant events occur before the initial observation and only if these events’ impacts stay fixed as the cohort ages (Hobcraft et al., 1982). To capture the process of changing cohort effects over time, however, one needs a more general model – a framework that Hobcraft et al. (1982) labeled “continuously accumulating cohort effects.” Schulhofer-Wohl and Yang (2009) addressed this gap in a recent paper. They developed a general model that relaxes the assumption of the conventional additive model. The new model allows age profiles to change over time and period effects to differ for people of different ages. The model also defines cohort effects as an accumulation of age-by-period interactions over all events across the life course. Although a longstanding literature on theories of social change conceptualizes cohort effects in exactly this way, this is the first time that a method of statistically modeling for this more complex form of cohort effects has been presented. This new model was applied to analyze changes in age-specific mortality rates in Sweden over the past 150 years and found that the model fits the data dramatically better than the additive model. The analyses also yield interesting results that show the utility of this model in testing competing theories about the evolution of human mortality. The flexibility of the model, however, comes at a high computational cost because it involves the estimation of a large number of parameters. The inclusion of additional covariates also has not been analytically attempted. So, this new approach presents both opportunities and challenges for future analysts. When further improved, it may find applications in many areas of research. Longitudinal Cohort Analysis of Balanced Age-by-Cohort Data Structure The multi-cohort multi-wave data design is especially important for aging and cohort analysis. Although more longitudinal surveys using this design are available, such as the Health and Retirement Survey (HRS) and the National Long Term Care Survey, cohort studies would benefit from further developments in data collection. The usual accelerated longitudinal design has two limitations: (1) some ages cannot be observed for all cohorts and (2) coverage of the individual life course and historical time is extremely restricted. 28 The imbalance of the age-by-cohort structure arises from the fact that the baseline survey consists of cohorts of different ages and follow-up surveys occur at the same times. As a result, cohorts age for exactly the same number of years but will remain ageheterogeneous at the end of data collection. The growth curve models applied to these data therefore yield estimates of cohort differences in age trajectories based only on the overlapping age groups of adjacent cohorts rather than the entire possible range of ages. As mentioned above, this could affect the accuracy of the estimates when the number of waves is small or the overlapping age intervals are few. Increasing the number of follow-ups alleviates problems for inference, but is less than perfect for the purposes of disentangling aging effects from birth cohort differences and observing period effects. A better design is one in which the age-by-cohort data structure is balanced and extends for a long time. Extant secondary data that meet these criteria are exceedingly rare, however. The Liaoning Multigenerational Panel (LMGP) data (Campbell & Lee, 2009) may be an exceptional resource for cohort studies. The LMGP is a database of at least one million observations of 200 000 individuals from eighteenth to early-twentieth Chinese population registers. It provides entire life histories for men and nearly complete life histories for women. The length of the historical period spanned and the sheer number of observations included make it a great candidate for future longitudinal research on aging. It holds the potential for vastly enhancing our ability to estimate various age and cohort models. Data like these are difficult to collect and compile, but can be highly useful for a variety of substantive investigations. For example, the mechanisms underlying persistent cohort differences in mortality need to be better understood. The “cohort morbidity phenotype” hypothesis has been proposed to link large cohort improvement in survival to reductions in exposures to infections, inflammation, and increased nutrition in early life (Finch & Crimmins, 2004). But evidence of an association between early life conditions and late-life mortality is solely based on aggregate population data from developed countries and needs further testing using individual life histories across multiple birth cohorts in other national populations such as those in the LMGP. The APC problem has intrigued and frustrated social scientists for decades. Recent developments in techniques for modeling APC data – techniques that avoid the identification problem that has long compromised previous APC analyses – are available and have already generated important substantive findings. These techniques, coupled with new and superior data sources, set the stage for enhancing our understanding of the complex interplay of human aging, cohort characteristics, and historical events and processes. Chapter |2| AGING, COHORTS, AND METHODS REFERENCES Campbell, C., & Lee, J. (2009). Longterm mortality consequences of childhood family context in Liaoning, China, 1749–1909. Social Science & Medicine, 68, 1641–1648. Chen, F., Yang, Y., & Liu, G. (2010). Social change and socioeconomic disparity in health over the life course in China: A cohort analysis. American Sociological Review, 75, 126–150. Clayton, D., & Schifflers, E. (1987). Models for temporal variation in cancer rates. I-II: Age-period and age-cohort models. Statistics in Medicine, 6, 449–481. Di Tella, R., MacCulloch, R. J., & Oswald, A. J. (2003). The macroeconomics of happiness. The Review of Economics and Statistics, 85, 809–827. Easterlin, R. A. (1987). Birth and fortune: The impact of numbers on personal welfare. Chicago: University of Chicago. Elder, G. H. Jr. (Ed.), (1985). Life course dynamics: Trajectories and transitions, 1968–1980. Ithaca, NY: Cornell University Press. Fienberg, S. E., & Mason, W. M. (1978). Identification and estimation of age-period-cohort models in the analysis of discrete archival data. Sociological Methodology, 8, 1–67. Finch, C. E., & Crimmins, E. M. (2004). Inflammatory exposure and historical changes in human life spans. Science, 305, 1736–1739. Fu, W. J. (2008). A smoothing cohort model in age-period-cohort analysis with applications to homicide arrest rates and lung cancer mortality rates. Sociological Methods and Research, 36, 327–361. Glenn, N. D. (1976). Cohort analysts’ futile quest: Statistical attempts to separate age, period, and cohort effects. American Sociological Review, 41, 900–905. Glenn, N. D. (1987). A caution about mechanical solutions to the identification problem in cohort analysis: A comment on Sasaki and Suzuki. American Journal of Sociology, 95, 754–761. Hobcraft, J., Menken, J., & Preston, S. H. (1982). Age, period, and cohort effects in demography: A review. Population Index, 48, 4–43. Holford, T. R. (1991). Understanding the effects of age, period, and cohort on incidence and mortality rates. Annual Review of Public Health, 12, 425–457. House, J. S., Lepkowski, J. M., Kinney, A. M., Mero, R. P., Kessler, R. C., & Herzog, A. R. (1994). The social stratification of aging and health. Journal of Health and Social Behavior, 35, 213–234. Kupper, L. L., Janis, J. M., Karmous, A., & Greenberg, B. G. (1985). Statistical age-period-cohort analysis: A review and critique. Journal of Chronic Disease, 38, 811–830. Littell, R. C., Milliken, G. A., Stroup, W. W., Wolfinger, R. D., & Schabenberger, O. (2006). SAS for mixed models (2nd edn.). Cary, NC: SAS Institutes, Inc. Lynch, S. M. (2003). Cohort and life course patterns in the relationship between education and health: A hierarchical approach. Demography, 40, 309–331. Mason, W. M., & Fienberg, S. E. (1985). Cohort analysis in social research: Beyond the identification problem. New York: Springer-Verlag. Mason, W. M., & Smith, H. L. (1985). Age-period-cohort analysis and the study of deaths from pulmonary tuberculosis. In W. M. Mason & S. E. Fienberg (Eds.), Cohort analysis in social research (pp. 151–228). New York: Springer-Verlag. Mason, W. M., & Wolfinger, N. H. (2002). Cohort analysis. In Smelser N. J. & Baltes P. B. (Eds.), International encyclopedia of the social and behavioral sciences (pp. 151–228). New York: Elsevier. Mason, K. O., Mason, W. M., Winsborough, H. H., & Poole, W. K. (1973). Some methodological issues in cohort analysis of archival data. American Sociological Review, 38, 242–258. Mason, K. O., Mason, W. M., & Winsborough, H. H. (1976). Reply to Glenn. American Sociological Review, 41, 904–905. O’Brien, R. M., Stockard, J., & Isaacson, L. (1999). The enduring effects of cohort characteristics on age-specific homicide rates, 1960–1995. American Journal of Sociology, 104, 1061–1095. Pavalko, E. K., Gong, F., & Long, S. (2007). Women’s work, cohort change, and health. Journal of Health and Social Behavior, 48, 352–368. Pearl, J. (2000). Causality: Models, reasoning, and inference. Cambridge, UK: Cambridge University Press. Preston, S. H., & Wang, H. (2006). Smoking and sex mortality differences in the United States. Demography, 43, 631–646. Raudenbush, S. W., & Bryk, A. S. (2002). Hierarchical linear models: Applications and data analysis methods. Thousand Oaks: Sage. Rodgers, W. L. (1982). Estimable functions of age, period, and cohort effects. American Sociological Review, 47, 774–787. Ryder, N. B. (1965). The cohort as a concept in the study of social change. American Sociological Review, 30, 843–861. Saski, M., & Suzuki, T. (1987). Changes in religious commitment in the United States, Holland, and Japan. American Journal of Sociology, 92, 1055–1076. Schulhofer-Wohl, S., & Yang, Y. (2009). Modeling the evolution of age and cohort effects in social research. Paper presented at the annual meetings of the Population Association of America, Detroit, MI, April. Singer, J. D., & Willett, J. B. (2003). Applied longitudinal data analysis: Modeling change and event occurrence. New York: Oxford University Press. Smith, H. L. (2008). Advances in ageperiod-cohort analysis. Sociological Methods & Research, 36, 287–296. Smith, H. L., Mason, W. M., & Fienberg, S. E. (1982). More chimeras of the age-period-cohort accounting framework: Comment on Rodgers. American Sociological Review, 47, 787–793. Wilson, A. E., Shuey, K. M., & Elder, G. H., Jr. (2007). Cumulative 29 Part |1| THEORY AND METHODS advantage processes as mechanisms of inequality in life course health. American Journal of Sociology, 112, 1886–1924. Winship, C., & Harding, D. J. (2008). A mechanism-based approach to the identification of ageperiod-cohort models. Sociological Methods & Research, 36, 362–401. Yang, Y. (2006). Bayesian inference for hierarchical age-periodcohort models of repeated crosssection survey data. Sociological Methodology, 36, 39–74. Yang, Y. (2007a). Age/Period/Cohort distinctions. In K. S. Markides (Ed.), Encyclopedia of health and aging (pp. 20–22). Los Angeles, CA: Sage Publications. Yang, Y. (2007b). Is old age depressing? Growth trajectories and cohort variations in late life depression. Journal of Health and Social Behavior, 48, 16–32. Yang, Y. (2008a). Social inequalities in happiness in the U.S. 30 1972–2004: An age-period-cohort analysis. American Sociological Review, 73, 204–226. Yang, Y. (2008b). Trends in U.S. adult chronic disease mortality: Age, period, and cohort variations. Demography, 45, 387–416. Yang, Y. (2009). Age, period, cohort effects. In D. Carr (Ed.), Encyclopedia of the life course and human development (pp. 6–10). New York: Gale Publishing. Yang, Y., & Land, K. C. (2006). A mixed models approach to ageperiod-cohort analysis of repeated cross-section surveys: Trends in verbal test scores. Sociological Methodology, 36, 75–97. Yang, Y., & Land, K. C. (2008). Age-period-cohort analysis of repeated cross-section surveys: Fixed or random effects? Sociological Methods & Research, 36, 297–326. Yang, Y., & Lee, L. C. (2009). Sex and race disparities in health: Cohort variations in life course patterns. Social Forces, 87, 2093–2124. Yang, Y., Fu, W. J., & Land, K. C. (2004). A methodological comparison of age-period-cohort models: Intrinsic estimator and conventional generalized linear models. Sociological Methodology, 34, 75–110. Yang, Y., Schulhofer-Wohl, S., Fu, W. J., & Land, K. C. (2008). The intrinsic estimator for age-period-cohort analysis: What it is and how to use it. American Journal of Sociology, 113, 1697–1736. Yang, Y., Frenk, S., & Land, K. C. (2009). Assessing the significance of cohort and period effects in hierarchical age-period-cohort models. Paper presented at the annual meetings of the American Sociological Association of America, San Francisco, CA, August.
© Copyright 2026 Paperzz