Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Experimental Stochastic Population Projections for New Zealand: 2009(base)–2111 Statistics New Zealand Working Paper No 11–01 Kim Dunstan 1 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Disclaimer The Statistics New Zealand Working Paper series is a collection of occasional papers on a variety of statistical topics written by researchers working for Statistics NZ. Papers produced in this series represent the views of the authors, and do not imply commitment by Statistics NZ to adopt any findings, methodologies, or recommendations. Any data analysis was carried out under the security and confidentiality provisions of the Statistics Act 1975. Liability statement Statistics NZ gives no warranty that the information or data supplied in this paper is error free. All care and diligence has been used, however, in processing, analysing and extracting information. Statistics NZ will not be liable for any loss or damage suffered by customers consequent upon the use directly, or indirectly, of the information in this paper. © Crown copyright This work is licenced under the Creative Commons Attribution-Noncommercial 3.0 New Zealand license. You are free to copy, distribute, and adapt the work for non-commercial purposes, as long as you attribute the work to Statistics NZ and abide by the other licence terms. Please note you may not use any departmental or governmental emblem, logo, or coat of arms in any way that infringes any provision of the Flags, Emblems, and Names Protection Act 1981. Use the wording 'Statistics New Zealand' in your attribution, not the Statistics NZ logo. Citation Dunstan K (2011). Experimental Stochastic Population Projections for New Zealand: 2009(base)– 2111 (Statistics New Zealand Working Paper No 11–01). Wellington: Statistics New Zealand Published in April 2011 by Statistics New Zealand Tatauranga Aotearoa P O Box 2922 Wellington, New Zealand [email protected] [email protected] www.stats.govt.nz ISSN 1179-934X ISBN 978-0-478-35397-6 (online) Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Contents List of figures .................................................................................................................................... iv Abstract .............................................................................................................................................. vi 1 Introduction............................................................................................................................. 1 Conventional population projections .................................................................................................. 1 Stochastic population projections ........................................................................................................ 1 Feedback sought ......................................................................................................................................... 2 Method ..................................................................................................................................... 3 2 General model applied............................................................................................................................. 3 Simulations .............................................................................................................................................. 4 Base population .......................................................................................................................................... 6 Modelling uncertainty in base population ................................................................................... 7 Births ............................................................................................................................................................. 10 Modelling uncertainty in fertility ................................................................................................... 10 Modelling uncertainty in sex ratio at birth ................................................................................ 13 Deaths .......................................................................................................................................................... 13 Modelling uncertainty in mortality ............................................................................................... 14 Migration...................................................................................................................................................... 19 Modelling uncertainty in net migration ...................................................................................... 20 3 Results..................................................................................................................................... 23 Population size and growth ................................................................................................................. 23 Births and deaths ..................................................................................................................................... 25 Population age structure ....................................................................................................................... 27 Dependency ratios .................................................................................................................................. 31 Discussion .............................................................................................................................. 34 4 Summary of method .............................................................................................................................. 34 Nature of projections .............................................................................................................................. 34 Summary of results ................................................................................................................................. 35 Implications for national population projections ......................................................................... 36 Practical issues..................................................................................................................................... 37 Implications for other projections ...................................................................................................... 39 National labour force projections ................................................................................................. 39 National ethnic population projections...................................................................................... 40 National family and household projections ............................................................................. 40 Subnational demographic projections ....................................................................................... 41 Conclusion .................................................................................................................................................. 43 References ...................................................................................................................................... 44 Appendix 1: List of abbreviations ............................................................................................ 48 Appendix 2: Examples of simulations .................................................................................... 49 Appendix 3: Additional projection results ............................................................................ 51 iii Experimental Stochastic Population Projections for New Zealand by Kim Dunstan List of figures 1 Assumed standard error in base population, by single year of age and sex, at 30 June 2009 ............................................................................................................................................ 8 2 Assumed relative standard error in base population, by single year of age and sex, at 30 June 2009 .............................................................................................................................. 8 3 Base population probability distribution, by single year of age, at 30 June 2009 ............................................................................................................................................................ 9 4 Period total fertility rate, 1962–2009 ............................................................................................ 11 5 Assumed total fertility rate probability distribution, 2010–2111 ........................................ 12 6 Assumed age-specific fertility rate probability distribution, 2061 ....................................... 12 7 Estimated sex ratio at birth, 1900–2008 ..................................................................................... 13 8 Cohort life expectancy at birth, by sex, 1876–1934 ............................................................... 15 9 Assumed male life expectancy at birth probability distribution, 2010–2111 ............... 16 10 Assumed female–male difference in life expectancy at birth probability distribution, 2010–2111 .................................................................................................................... 17 11 Assumed female life expectancy at birth probability distribution, 2010–2111 ........... 18 12 Assumed male age-specific survival rate probability distribution, 2061 .......................... 19 13 Net migration by class, 1900–2009 .............................................................................................. 20 14 Assumed net migration probability distribution, 2010–2111 ............................................. 21 15 Assumed net migration by age probability distribution, from 2013.................................. 22 16 Projected population probability distribution, 2009–2111 .................................................. 24 17 Projected annual growth rate probability distribution, 2010–2111 .................................. 24 18 Projected births probability distribution, 2010–2111 ............................................................. 25 19 Projected deaths probability distribution, 2010–2111........................................................... 26 20 Projected natural increase probability distribution, 2010–2111 ........................................ 27 21 Projected single-year of age probability distribution, 2031 ................................................... 27 22 Projected single-year of age probability distribution, 2061 ................................................... 28 23 Projected age-sex pyramid probability distribution, 2061 ..................................................... 29 24 Projected baby boomer population (born 1946–65) probability distribution, 2009–61 ................................................................................................................................................... 29 25 Projected population aged 65+ probability distribution, 2009–2111 ............................. 30 iv Experimental Stochastic Population Projections for New Zealand by Kim Dunstan 26 Projected percentage of population aged 65+ probability distribution, 2009– 2111 ......................................................................................................................................................... 30 27 Projected 0–14 dependency ratio probability distribution, 2009–2111 ........................ 32 28 Projected 65+ dependency ratio probability distribution, 2009–2111 .......................... 33 29 Projected total dependency ratio probability distribution, 2009–2111........................... 33 30 Example #1 of simulated total fertility rate, male and female life expectancy at birth, net migration, total population, births and deaths, and 65+ population, 2010–2111 ............................................................................................................................................. 49 31 Example #2 of simulated total fertility rate, male and female life expectancy at birth, net migration, total population, births and deaths, and 65+ population, 2010–2111 ............................................................................................................................................. 50 32 Projected population aged 0–14 probability distribution, 2009–2111 .......................... 51 33 Projected percentage of population aged 0–14 probability distribution, 2009– 2111 ......................................................................................................................................................... 51 34 Projected population aged 15–39 probability distribution, 2009–2111 ....................... 52 35 Projected percentage of population aged 15–39 probability distribution, 2009–2111 ............................................................................................................................................. 52 36 Projected population aged 40–64 probability distribution, 2009–2111 ....................... 53 37 Projected percentage of population aged 40–64 probability distribution, 2009–2111 ............................................................................................................................................. 53 38 Projected population aged 85+ probability distribution, 2009–2111 ............................. 54 39 Projected percentage of population aged 85+ probability distribution, 2009– 2111 ......................................................................................................................................................... 54 40 Projected median age probability distribution, 2009–2111 ................................................ 55 v Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Abstract The demographic future is uncertain. Conventionally, this uncertainty is conveyed by different scenarios with specific, stated assumptions about the components of population change (fertility, mortality, and migration). Alternative projection scenarios give an indication of possible uncertainty, although the uncertainty is not quantified. A stochastic or probabilistic approach to projections can potentially help their interpretation by quantifying the inherent uncertainty. This paper outlines a stochastic method and summarises the results for projections of the New Zealand population from a 2009 base. Uncertainty is modelled from historical data for fertility (total fertility rate), mortality (life expectancy at birth), and net migration, as well as for the sex ratio at birth. Uncertainty in the base population is modelled using expert judgement. Simulations of these parameters give probability distributions around Statistics New Zealand’s deterministic mid-range projection. The results illustrate that the deterministic scenarios give a poor indication of uncertainty for some key demographic characteristics. Even for other characteristics, the uncertainty indicated by the scenarios is neither consistent across the projection period, nor consistent between characteristics. This largely reflects that the low and high deterministic assumptions are not equivalent to a given probability interval that is consistent among the fertility, mortality, and migration components. Moreover, the uncertainty is rarely symmetrical. There are few practical obstacles to producing stochastic population projections, other than the additional resource required to formulate measures of uncertainty and produce multiple simulations. A stochastic approach could also be applied to other demographic projections produced by Statistics NZ, aiding their interpretation where uncertainty is even greater (eg ethnic population and subnational projections). However, the method outlined in this paper can be applied to the New Zealand population projections without compromising the current methods or results of those other demographic projections. Key words Stochastic, probabilistic, projection, population, uncertainty. Acknowledgements The author thanks John Bryant who assisted with data visualisation, Richard Speirs and the New Zealand Treasury who assisted with a 2004-base prototype, and anonymous reviewers for their helpful comments. vi Experimental Stochastic Population Projections for New Zealand by Kim Dunstan 1 Introduction The demographic future is uncertain. Conventionally, this uncertainty is conveyed by different scenarios with specific, stated assumptions about the components of population change (fertility, mortality, and migration). Alternative projection scenarios give an indication of possible uncertainty, although the uncertainty is not quantified. A stochastic or probabilistic approach to projections can potentially help their interpretation by quantifying the inherent uncertainty. However, it is important to note that the measures of uncertainty are themselves uncertain because of subjective decisions that inevitably have to be made about how to estimate such uncertainty. Conventional population projections Statistics New Zealand has traditionally published a set of New Zealand or national population projections every two to three years. The latest national population projections are a 2009-base set released in October 2009. These have a projection horizon of 2061, although projections to 2111 have also been derived and are available. All these projections were derived 'deterministically'. That is, they are scenario-based projections produced using specific assumptions. These assumptions are not just an extrapolation of historical trends, but are formulated after analysis of short-term and long-term demographic trends, patterns and trends observed in other countries, government policy, and other relevant information. Different combinations of these assumptions result in different projection scenarios (or series), although the likelihood of each scenario (or the range covered by the different scenarios) is never quantified. Stochastic population projections The main advantage of stochastic population projections is that they provide a means of quantifying the demographic uncertainty, although it is important to note that the estimates of uncertainty will themselves be uncertain. While it is possible to estimate uncertainty based on the historical variability of the demographic parameters, it is more difficult to estimate the uncertainty that arises from the choice of models, or from the choice of time period(s) that affect the model parameters. Dowd et al (2010) refer to these three different types of uncertainty as: 1. model uncertainty (eg we do not know the true fertility model) 2. parameter uncertainty (eg whatever mortality model we use, we do not know the true values of its parameters) 3. forecast uncertainty (eg the uncertainty of future migration rates given any particular model and its calibration). Stochastic methods also produce projection trajectories that are more realistic, in that they are more variable than a deterministic projection. In fact, deterministic projections are really indicating average trajectories given long-run assumptions. Bryant (2003, 2005) and Booth (2006) give good summaries of the advantages of a stochastic approach. 1 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Stochastic population projections have not hitherto been widely adopted by national statistical offices. Examples exist for Denmark, Sweden, the Netherlands, and China (eg Qiang Li et al, 2009). These have generally been produced by academic researchers or research institutes rather than national statistical offices. However, several countries have been exploring stochastic methods (eg US Census Bureau: see Long and Hollmann, 2004; Statistics Sweden: see Hartmann and Strandell, 2006; UK Office of National Statistics: see Rowan and Wright, 2010). Statistics NZ produces a range of different demographic projections, but this paper presents an experimental set of stochastic projections for the total New Zealand population only. The focus of the paper is to describe a method to quantify the uncertainty inherent in projections of New Zealand’s future population (section 2: Method) and to summarise the stochastic projection results (section 3: Results). A summary of the key points, practical considerations, and implications for other demographic projections produced by Statistics NZ are discussed (section 4: Discussion). The paper is therefore of interest to both producers and users of projections. The stochastic population projections presented in this paper are tagged ‘experimental’ to differentiate them from any official projections produced by Statistics NZ. The methodology and associated uncertainty parameters are similarly experimental. They are subject to revision if and when a stochastic approach becomes integrated within Statistics NZ’s projection methodology. Feedback sought The aim of this paper is twofold. First, it documents Statistics NZ's recent work in the area of stochastic (or probabilistic) population projections. Second, it provides a basis for users of projections to comment on the stochastic approach. Statistics NZ welcomes comments on the specific stochastic methodology outlined in this paper, but is also interested in hearing the views of users around broader questions such as: · Is there demand for a different projection methodology? · What is the value/advantage to users of stochastic population projections? · Do the benefits to users outweigh the added complexity and production costs? · What are the implications for the suite of demographic projections produced by Statistics NZ – should a stochastic approach be applied to all national and subnational projections? · What do users want from the projections? · Are these needs currently being met? · Do users of projections want a prediction of the future population, or is it sufficient to have an indication based on simplified but sensible assumptions? Feedback can be provided to the author (email [email protected], phone 03 964 8330) or Statistics NZ's Population Statistics Unit (email [email protected]). 2 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan 2 Method This section describes the modelling approach used to derive stochastic population projections of the New Zealand population from 2009 (base) to 2111. General model applied The approach adopted here is to apply a stochastic framework to the mid-range deterministic projection series produced by Statistics NZ, which combines the medium fertility, medium mortality, and medium migration assumptions (ie series 5 of the 2009base national population projections released in October 2009). This is similar to the approach adopted by Wilson (2004, 2005), and is preferable to stochastically deriving a median trajectory. There are several advantages of this approach. First, it allows the incorporation of assumptions that have been deterministically formulated from assessing a wide range of information. This includes the latest New Zealand demographic trends, cohort fertility rates, immigration policy, immigration applications and approvals, and international trends in fertility and mortality. Assumptions about demographic trends, especially in the short term (ie in the initial years of the projection period), are perhaps best formulated deterministically rather than stochastically. For example, expert judgement can be applied to interpreting available information such as immigration applications and approvals as a precursor to actual migration trends. In contrast, a pure stochastic approach is driven by historical data and does not incorporate knowledge about real world events. Second, this approach maintains compatibility with other demographic projections produced by Statistics NZ. Projections of ethnic populations, labour force, and families and households, at both national and subnational levels, are designed to be consistent with national population projections (specifically the mid-range series). This reflects the top-down approach adopted by Statistics NZ and the importance of additivity, which is valued by users of the projections. Third, this approach allows direct comparison of the stochastic population projections with the official national population projections released in October 2009. For example, the lowest and highest growth scenarios (series 1 and 9 respectively) can be compared with the range of stochastic projection outcomes. Fourth, this approach can utilise the work that already exists in a New Zealand context. Tom Wilson, while at the Queensland Centre for Population Research, applied a stochastic framework to Statistics NZ’s 2004-base national population projections (Wilson, 2005). Statistics NZ itself developed an experimental set of 2004-base stochastic population projections, including SAS programs to produce these, in collaboration with the New Zealand Treasury in 2005. This working paper builds on both of these previous developments. 3 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan The alternative is to stochastically derive all assumptions, including the median trajectory, by time series modelling. Such an approach does require subjective selection of historical data or time series models. For example, time series modelling of fertility conducted within Statistics NZ concluded that the forecasts were sensitive to the length of the historical data and the specific model chosen, and often produced implausible fertility results. Nevertheless, time series models have been successfully applied in other cases, notably for mortality (eg Lee & Carter, 1992; Lee & Miller, 2001). For a fuller discussion, see Hyndman and Booth (2006). The approach taken here is to develop probability distributions for the three components of population change – fertility, mortality, and migration – and overlay these on the medium fertility, mortality, and migration assumptions of the 2009-base national population projections (Statistics New Zealand, 2009a). Fixed age profiles are effectively used for all assumptions, although these are scaled according to the simulated parameters. In comparison with Wilson's work, two additional components are modelled with probability distributions: 1. uncertainty in the base population 2. uncertainty in the sex ratio at birth. This approach assumes that all the parameters of uncertainty can be estimated with certainty and remain constant over the projection period. The parameters may of course be overestimated or underestimated, but they also ignore the uncertainty that can arise from model or parameter uncertainty (Dowd et al, 2010). This underscores the importance of acknowledging that any estimates of uncertainty are themselves uncertain. In terms of modelling, autoregressive integrated moving average (ARIMA) models provide satisfactory approximations of the fertility, mortality, and migration time series. Other types of models were not explored but ARIMA models have also been used in other stochastic projections (eg Wilson, 2004, 2005; Keilman, 2005; Lee & Tuljapurkar, 1994). Different ARIMA models were assessed for each time series, using the BoxJenkins approach, with diagnostics such as autocorrelation plots and checks supporting the specific models selected in this paper. For further background and discussion of ARIMA models, see for example Chatfield (2009). Simulations Simulations (or iterations or sample paths) are created for the base population, births, deaths, and net migration, and combined as per the fundamental population equation: P(T) = P(T–1) + Births – Deaths + Arrivals – Departures where P(T) is the population at the end of the time period, P(T–1) is the population at the beginning of the time period (base population), and Births – Deaths (natural increase) and Arrivals – Departures (net migration) relate to events occurring during the time period. No simulation is more likely, or more unlikely, than any other. However, if a random variable is measured many times, a distribution of the values it can take can be constructed. Collectively, therefore, the simulations provide a probability distribution which can be summarised via percentiles. Two examples of simulated assumptions and projection results are given in appendix 2. 4 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Calculating the simulations involves several assumptions (de Beer, 2000): 1. Type of probability distribution. For calculating the simulations, a complete probability distribution needs to be specified for each component for each projection year. If a normal distribution is assumed, only two parameters have to be specified: the mean (corresponding with the medium variant) and the variance. If a uniform distribution is assumed, the minimum and maximum values need to be specified. If an asymmetrical distribution is assumed, at least one additional parameter indicating the skewness has to be specified. One disadvantage of an asymmetrical distribution is that the mean of the distribution does not correspond with the most probable value. This may be confusing for the users of the forecast. 2. Correlation between age-specific rates. The age-specific fertility and mortality rates in a given forecast year can be expected to be positively correlated. If the economic and social situation is favourable for having children, it can be expected that all agespecific fertility rates will be relatively high. However, for cohort data, a negative correlation may be plausible. If the economic situation drives people to postpone having children, age-specific fertility rates at young and old ages may be negatively correlated. Similarly, a selection mechanism may cause a negative relationship between mortality rates at young and old ages for the same cohort. 3. Serial correlation. The probability distributions of fertility, mortality, and migration in successive forecast years are correlated. If fertility is very high in one forecast year, it is not very probable that fertility will be very low in the next year. Thus if a high value of fertility is drawn in one forecast year, the probability of drawing a high value in the next year should be higher than that of drawing a low value. In the short run a negative correlation may also be possible. For example, if the number of deaths is relatively high in one year due to a severe winter, the number of deaths may be relatively low in the next year, due to a selection mechanism, as many frail people died in the previous year. 4. Correlation between components. The values of fertility, mortality, and migration can be correlated. For example, if immigrants have more children than the native population, an increase in the number of young immigrants may lead to an increase in the fertility rates in later years. Note that even if independence between fertility and mortality rates and migration numbers is assumed, there is no independence of numbers of births and deaths, and numbers of migrants. For example, if immigration is high in a certain year, this will result in larger numbers of births and deaths in later years for given values of fertility and mortality rates. In the case of the stochastic projections derived here: 1. Normal probability distributions are assumed for all parameters, although the base population has different variances above and below the median. 2. Age-specific fertility rates, survival rates, and base populations are assumed to be perfectly correlated across age. 3. The probability distributions of fertility, mortality, and migration in successive projection years are assumed to be serially correlated, as evident from the fitted models. 5 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan 4. All components of the population equation are assumed to be independent. For developed countries, there is no reason for, or evidence of, correlation between these components (Lee & Tuljapurkar,1994; Keilman, 1997), at least not at a national level. An alternative to the simulation approach is to analytically derive the projections. However, applying a stochastic cohort-component model is described as “very complicated” and requires a large number of simplifying assumptions (de Beer, 2000) or approximations (Keilman, 2005). Base population The projections have as a base, or starting point, an estimate of the population that usually lives in New Zealand: the estimated resident population (ERP). The ERP is largely derived from the latest census count, but includes allowances for people not included in the census. This includes people who were temporarily overseas at the time of the census (residents temporarily overseas), as well as people missed by the census (net census undercount). The ERP also includes allowances for changes in the population since the census due to births, deaths, and migration. The base for the 2009-base national population projections was the ERP (provisional) of New Zealand at 30 June 2009, 4.316 million people. The same base was used for all projection series. This provisional estimate was subsequently ‘finalised’ in November 2009, after the projections were released in October 2009, following further birth and death registrations (which permitted a more refined estimate of the number of births and deaths occurring up to 30 June 2009). The differences between the final and provisional estimate were negligible, although this ‘final’ ERP will be further revised following results from the 2011 Census of Population and Dwellings (as will all postJune 2006 population estimates). In recognition that the base ERP inevitably has some uncertainty, a probabilistic distribution was applied to the base ERP (by age and sex). Although the ERP is a historical dataset and largely derived from census counts, uncertainty in the ERP can arise from two broad sources: 1. Census enumeration and processing. Coverage errors may arise from nonenumeration and mis-enumeration (eg residents counted as visitors from overseas, and vice versa), either because of deliberate or inadvertent respondent or collector error. Errors may also arise during census processing (eg scanning, numeric and character recognition, imputation, coding, editing). 2. Adjustments in deriving population estimates. This includes the adjustments applied in deriving the ERP at 30 June of the census year: net census undercount (NCU), residents temporarily overseas (RTO), and demographic reconciliation (Statistics New Zealand, 2010a). It also includes uncertainty associated with the post-censal components of population change (eg estimates of births occurring in each time period based on birth registrations; changes in classification of external migrants between ‘permanent and long-term’ and 'short-term'). 6 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan The ERP is largely based on census enumeration and recorded events (births, deaths, arrivals, departures). There is potential for both undercount and overcount of people in the respective data collections. There are many reasons for people being missed or double-counted, including deliberate avoidance, people shifting residence around the time of data collection, and people with multiple residences including students at tertiary institutions and children in shared custody (Statistics New Zealand, 2007a). Modelling uncertainty in base population Population estimates already make an allowance for coverage errors based on the census post-enumeration survey (PES). Although the PES provides one measure of uncertainty, namely estimates of sampling error associated with the estimates of NCU, the overall probability distribution associated with the base ERP was subjectively derived, based on the author’s knowledge of the strengths and weaknesses of the different data sources. A subjective approach is necessary because it is very difficult to quantify all nonsampling errors in census enumeration and the various adjustments applied in deriving population estimates. The uncertainty in the base ERP is assumed to have equal probability that the population is lower or higher than the official ERP. Significantly, however, it is also assumed that for most ages any underestimate is likely to be larger than any overestimate. This reflects the nature of the ERP and how it is derived. First, the adjustments for NCU, RTOs, and unregistered births are based on empirical evidence but tend to be conservative to avoid over-adjusting. Second, external migration trends between 2006 and 2009 suggest permanent and long-term migration data are probably an underestimate of the contribution of migration to New Zealand’s population change over that period. Similarly, between 30 June 2001 and 30 June 2006, New Zealand recorded net permanent and long-term migration of 116,600, compared with an estimated net migration of 160,800 (Statistics New Zealand, 2008, p13 and 28). Consequently, if the ERP is still an underestimate after the collective adjustments and updating for post-censal births, deaths, and migration, then this is potentially larger than any overestimate. Owing to the nature of the uncertainties, the estimated uncertainty in the population estimates is designed to indicate broad approximate potential error. In absolute terms (figure 1), uncertainty is assumed to be highest among young adult ages (16–45 years). The adjustments for NCU and RTO are also highest at these ages. In relative terms (figure 2), uncertainty is assumed to be highest at the oldest ages (90+ years) where the small populations are sensitive to census miscount. At the oldest ages there is more potential for an overestimate, based on demographic analysis conducted within Statistics NZ involving retrospective comparisons between death registrations and census counts. The uncertainty in the base population is referred to in terms of ‘standard errors’ (SE) to reflect that it is based on an assumed underlying distribution. In contrast, the uncertainties in the other parameters (ie fertility, mortality, and migration) are referred to in terms of ‘standard deviations’ to reflect that they are estimated from observed historical data. 7 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Figure 1 Assumed standard error in base population By single year of age and sex At 30 June 2009 Population 800 700 Male +1 SE 600 Female +1 SE 500 Male -1 SE 400 Female -1 SE 300 200 100 0 -100 -200 0 10 20 30 40 50 Age (years) 60 70 80 90 100+ 90 100+ Figure 2 Assumed relative standard error in base population By single year of age and sex At 30 June 2009 5 Percent 0 -5 -10 Male +1 SE Female +1 SE -15 Male -1 SE Female -1 SE -20 -25 0 10 20 30 40 50 Age (years) 8 60 70 80 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Given the asymmetrical pattern of uncertainty assumed around the base ERP, the approach adopted here is to assume perfect correlation across age-sex. That is, for any given simulation, all ages for males and females are above or below the official ERP. This asymmetrical approach results in the median of the probability distribution equating to the base ERP, but the mean of the probability distribution does not. An alternative approach of assuming adjacent ages are imperfectly correlated would be appropriate if the uncertainty was assumed to be symmetrical around the base ERP. For each simulation, the magnitude of the difference from the base ERP was determined by drawing a random number from a normal distribution with a mean of zero. The absolute value of this random number was multiplied by the assumed standard error for each age-sex to give the difference between the base ERP and the alternative base population. A set of 1,000 alternative populations were derived by adding the differences to the base ERP (by age and sex) at 30 June 2009 (figure 3). The percentiles are impossible to distinguish in figure 3 which underscores that there is relatively little uncertainty in the base population in absolute terms. The total ERP has a 90 percent probability interval of 4.28–4.43 million, compared with the official ERP of 4.32 million. Figure 3 Base population probability distribution By single year of age At 30 June 2009 Note: Percentiles shown are 5, 25, 50, 75, and 95. Age 100 is 100 years and over. 9 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan The probability distributions and intervals illustrated in this section are based on theoretical modelled results, not simulated results. The 50th percentile (50% line) should be interpreted as signifying that there is a 50 percent chance that the given parameter for a given year will be below this line, and a 50 percent chance that the given parameter for a given year will be above this line. Similarly, the 75th percentile (75% line) signifies that there is a 75 percent chance that the given parameter for a given year will be below this line, and a 25 percent chance that the given parameter for a given year will be above this line. It follows that there is a 50 percent chance that the given parameter for a given year will be between the 25% line and the 75% line (the interquartile range), and a 90 percent chance that the given parameter for a given year will be between the 5% line and the 95% line. Births Projected (live) births are derived by applying age-specific fertility rates (ASFRs) to the mean female population at ages 12 to 49 years. The ASFRs for each year represent the average number of births to females of each age in that year. The set of ASFRs for each year is summarised by the total fertility rate (TFR). The mean female population for each age is derived by averaging the population at the beginning and end of each year. The sum of the number of births derived for each age of mother gives the projected number of births for each year. The relative number of male births and female births is derived by the sex ratio at birth. The medium fertility variant of the 2009-base national population projections assumes that the TFR drops gradually from 2.14 births per woman in the year ended June 2009 to 1.90 in 2026, and then remains constant. This assumption was based on an analysis of New Zealand period and cohort fertility rates, rates of childlessness, and ethnic fertility patterns, as well as international comparisons. These factors suggest a general decline in overall New Zealand fertility rates from current levels is most likely (for further discussion see Statistics New Zealand, 2009b). The medium fertility variant also assumed ASFRs of women aged under 32 years will decline between 2009 and 2026, with ASFRs increasing for women aged 32 years and over. Modelling uncertainty in fertility Initial investigations centred on modelling uncertainty in disaggregated (age-specific) fertility rates. ASFRs for the total New Zealand population are available from 1962 (Statistics New Zealand, 2010b). However, uncertainty modelled on the ASFRs gave an implausibly wide range of future fertility in the short-term and long-term, albeit treating each age independently. The range was only partly improved by limiting the historical time series to more recent data. Future work could include modelling ASFRs to account for correlation across age. Nevertheless, the TFR is a useful summary measure of the ASFRs prevailing in a given year, even though its cross-sectional nature can conceal important patterns occurring across ages and/or birth cohorts. Over the 47-year period, 1962–2009, New Zealand’s TFR varied between 1.9 and 4.2 births per woman (figure 4). However, the range has been much narrower since 1977 (1.9–2.2). 10 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Figure 4 4.5 Period total fertility rate 1962–2009 Births per woman 4.0 3.5 3.0 2.5 2.0 1.5 1.0 0.5 0.0 1962 1967 1972 1977 1982 1987 1992 December year 1997 2002 2007 Standard deviations were calculated for the entire 1962–2009 TFR time series. However, this implied an overly wide probability distribution for the TFR (eg 90 percent probability distribution of 1.1–2.7 births per woman in 2021, 0.5–3.6 in 2061, and 0.2–4.2 in 2111). In the final model, standard deviations were calculated for 1977–2009 (December years). Fertility patterns over this period, following the post-war baby boom, could be considered a better basis for formulating a probability distribution for future fertility. By comparison, Wilson (2005) modelled the uncertainty of future fertility on the TFR for the 30-year period 1975–2004. After considering different data periods, a simple random walk with drift model was deemed appropriate for producing future fertility simulations, as with Wilson. As far as the historical data was concerned, this is the equivalent of fitting an ARIMA(0,1,0) model to annual TFR. The mathematical formula for deriving future TFRs was: TFR(T) = TFR(T–1) + e{TFR}(T) + drift{TFR}(T) where TFR(T) > 0, T denotes a one year interval, e{TFR} are random errors sampled from a normal distribution with the calculated standard deviation (0.06229) and a mean of zero, and drift{TFR} shifts the median of the future simulations of TFRs to follow the medium fertility variant of the 2009-base national population projections. ASFRs were subsequently derived by scaling the ASFRs from the medium fertility variant of the 2009-base national population projections, to match each TFR simulation for each projection year. The final distributions for TFR (by year) and ASFRs (in 2061) are illustrated in figure 5 and figure 6. These figures also illustrate the low and high fertility variants from the 2009-base national population projections. These variants encompass the interquartile range in the short term, but beyond 2031 they encompass an increasingly smaller probability interval. 11 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Figure 5 Assumed total fertility rate probability distribution 2010–2111 Figure 6 Assumed age-specific fertility rate probability distribution 2061 Note: Percentiles shown are 5, 25, 50, 75, and 95. 12 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Modelling uncertainty in sex ratio at birth The sex ratio at birth has only a small impact on the projected population, but it is an uncertain component that affects future birth numbers and uncertainty because of its impact on the female population. Based on estimated births by date of occurrence (Statistics New Zealand, 2009c), the mean sex ratio at birth for births occurring in the 109-year period 1900–2008 (December years) was 1.055 males per female, with a standard deviation of 0.010 (figure 7). For each simulation and for each year of the projection period, the assumed sex ratio at birth was determined by drawing a random number sampled from a normal distribution with the calculated standard deviation (0.010) and a mean of 1.055 (the constant assumption adopted in the 2009-base national population projections). Figure 7 Estimated sex ratio at birth 1900–2008 1.08 Males per female 1.07 1.06 1.05 1.04 1.03 1.02 1900 1910 1920 1930 1940 1950 1960 Year of birth 1970 1980 1990 2000 2010 Source: From cohort mortality data updated in September 2009, which includes births registered to June 2009. Deaths The projected number of deaths is calculated indirectly. The detailed mortality assumptions are formulated in terms of age-specific survival rates (ASSRs) for males and females separately. This is because in the projection model the base population is survived forward each year. The male and female ASSRs for each year represent the proportion of people at each age-sex who will survive for another year. In general, survival rates are highest at ages 5–11 years and then decrease with increasing age. The set of ASSRs for each year is summarised by male and female life expectancy at birth. Annual survival rates are applied separately to births, migrants, and the population at the beginning of each year. For further explanation, refer to Statistics New Zealand (2010c). 13 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan The medium mortality variant of the 2009-base national population projections assumes that ASSRs will increase at different rates at different ages. The period life expectancy at birth (e 0 ) for males increases from 78.0 years in 2005–07 to 85.6 years in 2061. The corresponding e 0 increase for females is from 82.2 years in 2005–07 to 88.7 years in 2061. These assumptions were driven by the observed age-specific death rates from complete cohort life tables for the 1876–2007 birth cohorts. Exponential curves were fitted to historical cohort mortality data for each age-sex. Complete cohort life tables were derived for selected birth cohorts (eg 1956, 2006, 2061) and intermediate cohorts were interpolated. Cohort ASSRs were then transformed to period ASSRs, with some adjustment to give plausible death numbers by age-sex in the initial years of the projection period. International comparisons of mortality and longevity served as a useful check on plausibility. Despite differences in methods, the New Zealand life expectancy assumptions are broadly consistent with the latest available projection assumptions from national statistical agencies in Australia, Canada, Japan, United Kingdom, and the United States. Modelling uncertainty in mortality As with fertility, initial investigations centred on modelling uncertainty in disaggregated (age-specific) death or survival rates. ASSRs for the total New Zealand population are available from the complete period life tables that Statistics NZ derives every five years (Statistics New Zealand, 2009d). The complete cohort life tables, updated and extended annually, provide an even more comprehensive mortality time series from 1876 (Statistics New Zealand, 2009c). This same cohort mortality data was used to formulate the mortality assumptions of the 2009-base national population projections (Statistics New Zealand, 2009a). A third potential data source is the Human Mortality Database (HMD), which in early 2010 contained annual life table data for New Zealand for the period 1948–2003 and birth cohorts 1867–1973. Uncertainty modelled on the cohort age-specific death/survival rates gave an implausibly wide range of mortality/survival results in both the short term and long term. The range was only partly improved by limiting the historical time series to more recent data. However, ages were treated independently, and future work could include modelling with different levels of correlation across age. Uncertainty was eventually modelled on e 0 , specifically cohort e 0 for 1876–1917 (December years of birth) (figure 8). The usefulness of e 0 is partly as a summary measure of the ASSRs prevailing in a given year. A limitation of e 0 is its cross-sectional nature, which can conceal important patterns occurring across ages and/or birth cohorts. 14 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Figure 8 Cohort life expectancy at birth By sex 1876–1934 80 Years of life 75 70 Female 65 60 Male (excluding war deaths) 55 Male (including war deaths) 50 1875 1880 1885 1890 1895 1900 1905 1910 Year of birth 1915 1920 1925 1930 1935 Note: Dashed lines indicate that life expectancy is partly based on projected mortality experiences at ages above 74 years. Source: Based on cohort life tables derived in September 2009, which include deaths registered to June 2009. To model variation in e 0 , it is preferable that annual e 0 data are available on a consistent basis. Such data are available from cohort life tables but not from period life tables. Definitive period e 0 measures are only available every five years, and interpolation of annual period e 0 from five-yearly e 0 would not give a robust measure of annual uncertainty. The alternative HMD source is neither up-to-date nor entirely reliable for New Zealand data. The appropriateness of modelling uncertainty in future period e 0 using past uncertainty in cohort e 0, is based on two premises. First, that uncertainty in cohort e 0 reflects annual variations in people’s actual life expectancy at birth that results from changes in age-specific death rates. Second, that cohort and period e 0 will experience similar, if not identical, trends – given that both are a function of the same underlying age-specific death rates. However, two refinements were applied. First, birth cohorts after 1917 were excluded where remaining mortality experience needed to be projected at ages above 90 years. As more mortality experience is projected, the derived e 0 is liable to become increasingly smooth from cohort to cohort. Second, war deaths were excluded as this adds to the year-to-year uncertainty in male e 0 . The projections are not designed to account for extreme events such as major wars (see ‘Nature of projections’ in section 4: Discussion). 15 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan The refined cohort time series may still overestimate uncertainty, because of the inclusion of birth cohorts with relatively high infant and child mortality due to infectious epidemics. Such epidemics were a feature of mortality in New Zealand before World War II (Dunstan et al, 2006). Although it could be argued that increasing globalisation and ease of international travel are resuming the risk of infectious epidemics. After considering different data periods, the experience with the fertility models, and Wilson's work with New Zealand population projections, it was decided that a simple random walk with drift process would be used to model male e 0 . As far as the historical data was concerned, this is the equivalent of fitting an ARIMA(0,1,0) model to annual male e 0 . The mathematical formula for deriving future male e 0 was: e 0 (males, T) = e 0 (males, T–1) + e{e 0 }(males, T) + drift{e 0 }(males, T) where T denotes a one year interval, e{e 0 } are random errors sampled from a normal distribution with the calculated standard deviation (0.57459) and a mean of zero, and drift{e 0 } shifts the median of the future simulations of male e 0 to follow the medium mortality variant of the 2009-base national population projections (figure 8). Also illustrated in that figure are the low and high mortality variants from the 2009-base national population projections. These variants approximate the interquartile range over the projection period. Figure 9 Assumed male life expectancy at birth probability distribution 2010–2111 Given the strong correlation between male and female e 0 , future female e 0 was derived by adding together the simulations of male e 0 and female–male differences in e 0 (de 0 ). Again, a simple random walk with drift process was used to model female–male differences in e 0 . And again as far as the historical data was concerned, this is the equivalent of fitting an ARIMA(0,1,0) model to annual e 0 differences: de 0 (T) = de 0 (T–1) + e{de 0 }(T) + drift{de 0 }(T) 16 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan where T denotes a one-year interval, e{de 0 } are random errors sampled from a normal distribution with the calculated standard deviation (0.37400) and a mean of zero, and drift{de 0 } shifts the median of the future simulations of female–male differences in e 0 to follow the medium mortality variant of the 2009-base national population projections (figure 10 and figure 11). Figure 10 Assumed female–male difference in life expectancy at birth probability distribution 2010–2111 Also illustrated in figure 11 are the low and high mortality variants from the 2009-base national population projections. These variants give a slightly narrower range than the interquartile range over the projection period. The contrast with males (figure 9) is interesting and reflects: · In the deterministic projections, the difference between the low and high variants is higher for males than females (eg 6 years compared with 5 years in 2061, and 10 years compared with 8 years in 2111). · In the stochastic projections, the uncertainty is effectively higher for females than males (eg 90 percent probability interval of 16 years compared with 14 years in 2061, and 23 years compared with 19 years in 2111). This is the result of adding the female–male difference in e 0 variable to the male e 0 variable. However, explicitly modelling female e 0 from the same historical data used to model male e 0 gives a similar result of higher female uncertainty in e 0 . This supports deriving female e 0 from male e 0 (by adding female–male difference in e 0 ) rather than deriving male e 0 from female e 0 (by subtracting female–male difference in e 0 ). In terms of female–male differences in life expectancy at birth (figure 10), there are no explicit low or medium variants in the 2009-base national population projections. 17 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Figure 11 Assumed female life expectancy at birth probability distribution 2010–2111 ASSRs were subsequently derived by scaling the ASSRs from the medium mortality variant of the 2009-base national population projections to match each e 0 simulation for each projection year. The probability distribution for male ASSRs in 2061 is illustrated in figure 11. A feature of the ASSR distribution is the very high rates and narrow probability interval under 60 years of age, where there is currently less than 1 death per 100 people (Statistics New Zealand, 2009d). In contrast, there is significant uncertainty for ASSRs above 80 years of age, indicating that projections of the population at the oldest ages are sensitive to mortality assumptions. Also illustrated in figure 12 are the low and high mortality variants from the 2009-base national population projections. These variants approximate the interquartile range in 2061. 18 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Figure 12 Assumed male age-specific survival rate probability distribution 2061 Note: Age 100 is 100 years and over. Migration Projected net migration is calculated directly in terms of a specific age-sex distribution for each specific net migration level. New Zealand has a rich source of external migration data in the passenger cards completed by people travelling into and out of the country. In 2009, there were 4.5 million arrivals and 4.5 million departures. However, only a small proportion of these movements translate to changes in the resident population of New Zealand. For population estimates purposes, net migration is based on the ‘permanent and longterm’ classing of passengers, mainly on the basis of self-reported travel intentions. Therefore, for projections of the resident population, there is the issue as to whether ‘permanent and long-term’ or ‘all movements’ data should inform measures of uncertainty in future migration (figure 13). For further description of this issue and data availability, see Dunstan et al (2006, pp5, 25–29) and Statistics New Zealand (2008, p28). 19 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Figure 13 80 Net migration by class 1900–2009 Thousand All movements 60 40 20 0 -20 -40 Permanent and long-term -60 1920 1930 1940 1950 1960 1970 June year 1980 1990 2000 2010 Note: ‘All movements’ includes permanent and long-term as well as short-term (less than 12 months) movements. Despite the long historical time series, there is also the issue of what period should be used to model migration uncertainty. New Zealand’s migration flows continue to evolve in response to changes in immigration policy (both in New Zealand and abroad), the advent of relatively cheap air travel, and the increasing globalisation of education and labour markets. The 2009-base national population projections medium migration variant assumed a long-run annual net migration gain of 10,000, with higher net gains in the short run (2010–12). This assumption was based on an analysis of immigration permits, residence applications and approvals, overseas student numbers, and arrivals and departures analysed by characteristics such as citizenship, country of last/next permanent residence, and age. The medium migration variant also assumed the main net outflow at ages 22–25 years, mainly due to young New Zealanders embarking on overseas travel and the departure of students from overseas after studying in New Zealand. Net inflows were assumed for most other ages, with the highest net inflows at 15–20 and 27–37 years. The age-sex distribution of net migration remains constant in the long term. Modelling uncertainty in net migration Standard deviations were calculated using permanent and long-term data for 1980– 2009 (June years). Migration patterns over this period, which included significant changes in immigration policy (notably in 1987), were considered to be a better basis for formulating a probability distribution for future migration than a longer historical time series, or a time series using all passenger movements. An ARIMA(1,0,1) model was fitted to this data, which was also the type of ARIMA model that Wilson (2005) used. The mathematical formula for deriving the future net migration levels was: 20 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan N(T) = F{N}N(T–1) + Q{N}e{N}(T-1) + e{N}( T) + drift{N}(T) where T denotes a one-year interval, F{N} is an autoregressive parameter (0.49112), Q{N} is a moving average parameter (-0.61052), e{N} are random errors sampled from a normal distribution with the calculated standard deviation (12,204) and a mean of zero, and drift{N} shifts the median of the future simulations of net migration levels to follow the medium migration variant of the 2009-base national population projections (figure 14). The probability distribution is much wider than the alternative 5,000 (low) and 15,000 (high) long-run net migration levels used in the 2009-base national population projections. Figure 14 Assumed net migration probability distribution 2010–2111 The associated age-sex net migration distributions were derived by using the patterns adopted with the 2009-base national population projections. These patterns are constant over the projection period. The projected net migration distribution by age-sex was derived by interpolating or extrapolating between the low and high net migration patterns for any given net migration level (figure 15). As with the net migration levels, the probability distribution is much wider than the alternative 5,000 (low) and 15,000 (high) long-run net migration levels used in the 2009-base national population projections. 21 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Figure 15 Assumed net migration by age probability distribution From 2013 Note: Percentiles shown are 5, 25, 50, 75, and 95. Age 100 is 100 years and over. The described approach to migration differs from Wilson, who modelled immigration and emigration (as immigration minus net migration) separately. While it is intuitively appealing to model arrivals and departures separately, Wilson did have to reject some implausible simulations and apply floor/ceiling limits. Earlier work within Statistics NZ found no suitable ARIMA models for historical arrivals data that could produce satisfactory future simulations. Moreover, the disaggregation of net migration into arrivals and departures is beyond the current scope of Statistics NZ’s national population projections. 22 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan 3 Results This section summarises the results of the stochastic population projections of the New Zealand population from 2009 (base) to 2111. The results presented here are not exhaustive, but illustrate the probability distributions relating to different characteristics of the population, and how these compare with the published deterministic scenarios (eg low growth series 1 and high growth series 9 of the 2009-base national population projections). The stochastic results presented in this section are representative of the projection results that would be available to users, if and when stochastic projections were produced. Additional projection results are illustrated in appendix 3. The projection results are based on running 1,000 simulations of the assumptions described and discussed in section 2. All simulations produced plausible results (eg there were no negative populations). Note that the median (the 50th percentile as indicated by the 50% label in each figure) is taken directly from the 2009-base national population projections, while all projection results relate to June years. The probability distributions and intervals illustrated in this section are based on the specified assumptions, with associated probability distributions and intervals from the simulated results. The 50th percentile (50% line) should be interpreted as signifying that there is a 50 percent chance that the given result for a given year will be below this line, and a 50 percent chance that the given result for a given year will be above this line. Similarly, the 75th percentile (75% line) signifies that there is a 75 percent chance that the given result for a given year will be below this line, and a 25 percent chance that the given result for a given year will be above this line. Population size and growth These experimental stochastic population projections indicate considerable uncertainty in the future total population of New Zealand (figure 16, overleaf). The 90 percent probability interval for New Zealand’s population is 4.89–5.41 million in 2031, 4.82– 6.69 million in 2061, and 3.58–10.35 million in 2111. Interestingly, the low growth series 1 and high growth series 9 of the 2009-base national population projections equate roughly to the 5 percent and 95 percent probability distributions until the 2060s, but a narrower range thereafter. Stochastic projections can also enhance the information not otherwise available from deterministic projections. For example, the stochastic projections indicate a 50 percent probability that the New Zealand population will reach 5 million during 2024–29; an 80 percent probability that the population will reach 5 million before 2030; and a 1 in 3 probability that the population will reach 6 million before 2060. 23 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Figure 16 Projected population probability distribution 2009–2111 In terms of growth rates, the projections indicate a general decline until the 2050s (figure 17), which is driven by the trends in births, deaths, and natural increase (figure 18–figure 20). The low growth series 1 and high growth series 9 approximate the interquartile range over the projection period. Figure 17 Projected annual growth rate probability distribution 2010–2111 24 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Births and deaths These experimental stochastic population projections indicate considerable uncertainty in the annual number of (live) births (figure 18). Even in the first year of the projection period, 2010, there is a 50 percent probability that births will be outside the range 61.6–64.2 thousand. The interquartile range expands to 55–67 thousand by 2031, to 49–79 thousand by 2061, and to 39–105 thousand by 2111. The divergence of uncertainty from the late 2030s reflects that the uncertainty in future fertility rates is compounded by the uncertainty in the future number of women of childbearing age. By comparison, the low growth series 1 and high growth series 9 encompass the interquartile range for most of the projection period. Figure 18 Projected births probability distribution 2010–2111 In comparison with births, the future number of deaths is more certain, especially before 2030 (figure 19, overleaf). In the first year of the projection period, 2010, there is a 50 percent probability that deaths will be outside the range 28.4–31.0 thousand. The interquartile range expands to 37–46 thousand by 2031, to 52–64 thousand by 2061, and to 58–73 thousand by 2111. There is a strong indication of increased deaths until the 2050s, despite the continued increases in life expectancy assumed at all ages. This is due to more people reaching the older ages. 25 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Of the nine official series published in the 2009-base national population projections, series 3 and 7 give the highest and lowest number of deaths, respectively, for most of the projection period. Compared with the stochastic projections, the range given by these series is very narrow. The alternative series “allow users to assess the impact on population size and structure resulting from changes in the assumptions for each of the components of population change” (Statistics New Zealand, 2009a, emphasis added). However, this nuance may not be fully understood by users of the projections, who might expect the range of deterministic scenarios to give a good indication of uncertainty in all characteristics of the projected population, including deaths. Furthermore, users might expect the lowest growth and highest growth series (series 1 and 9) to give the widest indication of uncertainty, but this is not true for all population characteristics. Figure 19 Projected deaths probability distribution 2010–2111 The uncertainty in natural increase is a function of the uncertainty in both births and deaths (figure 20). The general trend is for shrinking natural increase until the 2050s, driven by more deaths. The projections indicate a 2 in 5 probability of natural decrease from 2060. Again, the low growth series 1 and high growth series 9 encompass the interquartile range for most of the projection period. 26 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Figure 20 Projected natural increase probability distribution 2010–2111 Population age structure Population projections are particularly important for indicating changes in the age distribution of the population. In 2031, 22 years after the base year, demographic uncertainty varies significantly by age (figure 21). Ages under 20 years have relatively large uncertainty bounds, mainly reflecting the uncertainty of future fertility rates. Above age 80, the uncertainty of future mortality/survival rates causes uncertainty in proportional terms, but the uncertainty is small in absolute numbers. Figure 21 Projected single-year of age probability distribution 2031 Note: Percentiles shown are 5, 25, 50, 75, and 95. Age 100 is 100 years and over. 27 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan By 2061, 52 years after the base year, demographic uncertainty is relatively large for ages under 50 years, reflecting the uncertainty of future fertility rates, and to a lesser extent the uncertainty of future migration (figure 22). The impact of fertility on uncertainty is cumulative in that the number of births in 2061 is affected by the uncertain number of women of childbearing age alive in 2061, which in turn is driven by uncertain births (and fertility patterns) before 2050. Those aged 52 years and over in 2061 are already alive in the base year, so only deaths and migration can alter their numbers. The uncertainty at these ages is largely driven by the uncertainty of future mortality/survival rates. Figure 22 Projected single-year of age probability distribution 2061 Note: Age 100 is 100 years and over. Other measures can also be used to indicate changes in the age distribution of the population. For example, the median age indicates the age at which half the population is younger, and half is older. From a median age of 36.5 years in 2009, the stochastic projections indicate that the interquartile probability interval is 39.4–40.9 years in 2031, 40.8–46.1 years in 2061, and 40.6–51.6 years in 2111 (figure 40, appendix 3). The largest increases in the median age are therefore likely to occur between 2021 and 2041. Population age pyramids are commonly used to illustrate changes in the age distribution of the population, and can also be used with stochastic projections. Figure 23 reiterates the results of the previous two figures, namely the relatively large uncertainty at the youngest ages (driven by uncertainty in future fertility rates). To a lesser extent, there is also relatively high uncertainty at the oldest ages (driven by uncertainty in future mortality/survival rates). 28 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Figure 23 100+ Projected age-sex pyramid probability distribution 2061 Age (years) 90 Males Females 80 70 60 50 40 30 20 10 95% 0 60 75% 50% 25% 40 5% 20 5% 0 Thousand 20 25% 50% 75% 95% 40 60 Ages can also be transformed into birth cohorts (people with a common year of birth) and stochastic projections used to convey uncertainty in cohort populations (Figure 24). Figure 24 Projected baby boomer population (born 1946–65) probability distribution 2009–61 Note: Percentiles shown are 5, 25, 50, 75, and 95. 29 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Figure 25 Projected population aged 65+ probability distribution 2009–2111 Figure 26 Projected percentage of population aged 65+ probability distribution 2009–2111 30 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan The stochastic projections indicate a significant increase in the population aged 65+ years (figure 25). From 0.55 million in 2009, the 90 percent probability interval indicates a population of 0.95–1.18 million in 2031, 1.10–1.75 million in 2061, and 1.23–2.56 million in 2111. The uncertainty is driven by mortality/survival rates, as future fertility rates do not impact on the projected numbers aged 65+ until the 2070s. The low growth series 1 and high growth series 9 encompass the interquartile range throughout the projection period, but by 2111 these series actually equate closer to the 90 percent probability interval. The stochastic projections do help negate the myth that the increase in the 65+ population is a temporary phenomenon driven by the ageing of the large birth cohorts of the 1950s and 1960s (the ‘baby boomers’). Instead, the projections indicate a sustained structural shift in both the number and proportion of the population aged 65+ years (figure 26). This shift is far from being some artefact of the projection assumptions. From 13 percent in 2009, the 90 percent probability interval indicates a proportion of 19–23 percent in 2031, 19–31 percent in 2061, and 18–44 percent in 2111. However, it is noteworthy that the alternative low growth series 1 and high growth series 9 give no indication of demographic uncertainty for the proportion aged 65+ years. The above results are underscored by results for other age groups (appendix 3). Uncertainty is greatest for the youngest age groups because of the uncertainty in births (fertility rates), and the oldest age groups which are most sensitive to mortality assumptions. Deterministic projections, such as the low growth series 1 and high growth series 9, are poor at indicating the uncertainty in population proportions. Dependency ratios Dependency ratios are simple measures that relate the number of people in broad ‘dependent’ age groups (such as 0–14 and 65+ years) to the broad ‘working-age’ population (such as 15–64 years). ‘Dependency’ has a variety of connotations and need not imply financial or economic dependency. Furthermore, the dependency status of the older age group could be expected to change over time, reflecting changes in life expectancy, physical and mental well-being, and labour force status. Nevertheless, dependency ratios illustrate the changing age structure by simply relating the numbers of people in the youngest and oldest age groups to the numbers in the middle age groups (most of whom are in the workforce). The trend in the 0–14 (or ‘youth’) dependency ratio largely reflects the trend in birth numbers. The interquartile range of 29.4–31.6 in 2021 is relatively narrow, but uncertainty increases quickly (figure 27, overleaf). The interquartile range expands to 27.3–31.5 in 2031, to 24.4–32.4 in 2061, and further to 22.2–34.2 in 2111. Of the alternative deterministic series, series 2 and 8 (which use alternative fertility assumptions) give the widest range in the long-term 0–14 dependency ratio. These series encompass the interquartile range in the short term, but are well within this range in the long term. 31 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Figure 27 Projected 0–14 dependency ratio probability distribution 2009–2111 Compared with the 0–14 dependency ratio, the probability interval of the 65+ (or ‘elderly’) dependency ratio is narrower in the short term, but wider in the long term. The interquartile range of 32.5–35.2 in 2031 increases to 37.6–47.2 in 2061, and further to 41.7–64.5 in 2111 (figure 28). However, the projections suggest an increase in the 65+ dependency ratio, from about 19 in 2009 to over 30 beyond 2030, is almost inevitable. Of the alternative deterministic series, series 3 and 7 (which use alternative mortality assumptions) give the widest range in the long-term 65+ dependency ratio. These series reflect the interquartile range in the short term, but are also well within this range in the long term. 32 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Figure 28 Projected 65+ dependency ratio probability distribution 2009–2111 The probability interval of the total dependency ratio expands from an interquartile range of 61.0–65.6 in 2031, to 66.9–75.4 in 2061, and further to 74.0–90.0 in 2111 (figure 29). An increase in this ratio from its 2009 level of 50 therefore seems inevitable. Series 3 and series 7 reflect the interquartile range over the entire projection period. Figure 29 Projected total dependency ratio probability distribution 2009–2111 33 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan 4 Discussion This section discusses the issues, both conceptual and practical, of incorporating a stochastic approach to future demographic projections produced by Statistics NZ. Summary of method The previous sections describe a method of quantifying the uncertainty for projections of the New Zealand population. Probability distributions around a deterministic mid-range projection are produced by combining simulations of the base population, births (via fertility rates), deaths (via survival rates), and net migration. All simulations of all components were retained, with none discarded because of outright implausibility. Simulations of population projections are produced by combining the simulated components. All simulated projections were also retained. The results are plausible, but the estimates of uncertainty are uncertain for several reasons. First, where historical data are used to drive the estimates of uncertainty, choices must be made as to which data series are used (eg period or cohort data, aggregated or age-specific data). Second, choices must be made as to what time periods are used. Third, there is a choice of models to apply to time series to estimate parameters such as standard errors. Fourth, any historical data series will have issues of coverage and accuracy relating to its collection and coding, which will affect the estimation of parameters. Inevitably, the process of formulating measures of uncertainty for stochastic projections is similar to that of formulating assumptions for deterministic projections. They both require a balance of empirical data analysis and judgement. Nature of projections A stochastic projection extrapolates observed variability in demographic data to the future (Keilman, 2005). For a proper assessment of the variability, Keilman suggests that one needs long series with annual data of good quality; the minimum is about 50 years, but a longer series is preferable. New Zealand’s demographic time series are high quality but imperfect. Keilman notes that when time series analysis cannot be used to compute predictive distributions, one has to rely strongly on expert opinion. This expert opinion can be drawn from a mix of experts within an organisation and/or external to an organisation such as Statistics NZ. The challenge of eliciting experts’ opinions, particularly in avoiding too narrow prediction intervals, is discussed in Lutz et al (1996, 2001), O’Hagan (2005), Kynn (2008), and Lutz (2009). Regardless of how projection assumptions are formulated and demographic projections derived, they are neither predictions nor forecasts. They represent the statistical outcomes of various combinations of selected assumptions about future changes in the dynamics of population change (eg future fertility, mortality, and migration patterns). These assumptions are formed from the latest demographic trends and patterns, as well as international experiences, to represent some possible scenarios. Each projection scenario gives a picture of the changing population, but is not designed to be an exact forecast or to project specific annual variations. 34 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan The difference between projections and predictions is provided in an analogy made by the Australian Productivity Commission (2005). They give the example of someone seeing a large boulder on a train track. The projection is that there will be a rail disaster and many deaths if the boulder is not moved or the train is not stopped. The prediction is that someone will move the boulder, averting the accident. It is likely that the projection is much more useful for policy formulation and planning. This analogy is relevant to demographic projections. A number of organisations, including central and local governments, may initiate strategies to avert the population trends implied by the projections. Examples include changes to immigration policy and local economic development strategies (eg to retain residents or attract new residents). One of the roles of projections is to enable future demographic changes to be understood and managed, if not averted. In such instances, it is illogical to criticise the projections if they do not match actuality, especially when projections have been used to inform those strategies. These nuances are important to understanding both conventional deterministic projections and stochastic projections. Any projections and associated measures of uncertainty are inherently uncertain and subject to update. Indeed, they should be regularly updated to maintain their relevance. In interpreting the measures of uncertainty, it is also important to note that extreme events such as major wars, catastrophes, and pandemics, as well as major government and business decisions, are not realistically accounted for in either deterministic scenarios or stochastic projections. That is, the measures of uncertainty do not encompass all possibilities. Summary of results Demographic uncertainty is generally much greater than can be discerned from an analysis of a limited range of deterministic scenarios. This is not necessarily because the alternative scenarios have assumptions that are too narrow, but simply that there is no quantification of likelihood attached to the alternative scenarios. Moreover, a limited number of scenarios cannot convey how the different assumptions interact and fluctuate over time. The deterministic projections, including alternative low growth and high growth scenarios, do illustrate plausible projection outcomes. However, they only partly convey an indication of uncertainty. The stochastic projections derived here are based on a multitude of simulations. Each of these simulations is plausible – some more so than others – but collectively they give an indication of demographic uncertainty in both the short term and long term. Results from 1,000 simulations are surprisingly stable for most distributions, although more simulations may be needed if stochastic projections were to be implemented officially. 35 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan The results in this section, although not exhaustive, illustrate that the deterministic scenarios give a poor indication of uncertainty for some key demographic characteristics. Even for other characteristics, the uncertainty indicated by the scenarios is neither consistent across the projection period, nor consistent between characteristics. For example, at times the alternative scenarios are analogous to a 50 percent probability interval, while at other times they are equivalent to a 90 percent (or higher) probability interval. This largely reflects that the low and high deterministic assumptions are not equivalent to a given probability interval that is consistent among the fertility, mortality, and migration components. Moreover, the uncertainty is rarely symmetrical, unlike that typically conveyed by deterministic projections. Although the probability of outcomes below and above the median may be equal, the uncertainty is usually skewed. For population growth outcomes, for example, there is a greater range of outcomes above the median than below. Stochastic projections are much better at conveying the uncertainty in some characteristics of the population, such as deaths, percentage age distribution, and dependency ratios. They also seem much better at conveying the large uncertainty that exists in long-range projections beyond 50 years, and the extent to which this uncertainty expands over time. Given the widening probability distribution, the stochastic projections can indicate how far into the future the projections are useful. However, ‘usefulness’ will vary from user to user depending on which type of demographic projection is being used (eg national population, subnational ethnic population), which demographic characteristics are being used (eg total population, 65+ population, births), and how the projection is being used (eg the specific application and the respective risks of an over-projection or underprojection). Implications for national population projections This paper describes a method for producing plausible stochastic population projections for New Zealand. The question remains as to whether such an approach should be embedded within the official demographic projections regularly produced by Statistics NZ. Methodological enhancements and developments continue to shape the statistical environment. However, it is impractical for any producer of statistics to adopt every enhancement as it unfolds. As Lutz et al (1998) discuss, "the change of a longestablished tradition" generally requires the following: 1. The new practice must have clear advantages when compared with the current one. 2. It should be consistent with other work done by the producing institution, and present an evolution along established lines rather than a discontinuity. 3. The proposed approach should be internally consistent and based on accepted scientific work. 4. It should be practical for both the users and producers, and not cost too much. 36 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan These points deserve some elaboration. In the context of the independence and integrity of Statistics NZ as a producer of (population) statistics, a high priority is put on the statistical rigour of data, concepts, and methods, including incorporating international standard practices. However, there is also an onus on Statistics NZ to provide statistical leadership (eg in statistical methodologies), which can run counter to more conservative statistical practices. Furthermore, the usefulness of any alternative or new method needs to be evaluated against the costs, complexity, and transparency for users and producers. Even if stochastic projections become a formal aspect of Statistics NZ’s demographic projections, deterministic scenarios will continue to remain useful. ‘What if?’ scenarios have been published in recent releases (eg Statistics New Zealand, 2009a). They illustrate specific fertility, mortality, and migration scenarios relative to the mid-range scenario, such as: · a very high fertility scenario (long-run total fertility rate of 2.5) · a very low mortality scenario (continued fast increases in life expectancy at birth, to reach 95 years of life for males and females in 2061) · no migration scenario (a ‘closed’ population) · very high migration scenario (25,000 annual net migration). The ‘What if?’ scenarios have received positive feedback. Stochastic and deterministic approaches could therefore be seen as complementary. Practical issues Practical issues affecting implementation include: 1. Impact on quality. Most national statistical organisations provide guidelines and discussion of the dimensions of quality as they relate to statistics (eg Statistics New Zealand, 2007b; Australian Bureau of Statistics, 2009; Office of National Statistics, 2004; Statistics Canada, 2002). Such quality dimensions provide a framework for evaluating the usefulness of a methodological change such as a stochastic approach to population projections. For projections, the quality dimensions can be described as follows: a. Relevance. Do the projections cover the necessary geographic areas, demographic characteristics (eg age, sex, ethnicity), and future time periods as required by different users? Are the projections produced to satisfy the expectations and aspirations of individuals or groups, or are they based on an objective assessment of demographic trends? b. Timeliness. Are the projections updated and available when they are needed? c. Coherence. Is the choice of methods, data, and assumptions consistent with accepted practices and do they account for the relevant factors? Are the projection results plausible given known constraints and limitations? d. Accessibility. Is the information readily available to everyone? Are there costs to access? e. Interpretability. Is the information about the projections (eg methods, assumptions, results) available, understandable, and even replicable? Do the projections provide measures of uncertainty? 37 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan f. Accuracy. How do the projected trends compare with actuality? Do the projections adequately illustrate changing demographic patterns? A stochastic approach would enhance the interpretability of projections, while the only quality dimension that could be compromised is that of timeliness. Inevitably, producing stochastic projections would require more person-hours, as the additional parameters of uncertainty would need to be calculated and reassessed in future projections. Running multiple simulations and subsequent summarising and dissemination could be expected to add work time to the production cycle. 2. Computing capacity. A stochastic approach using multiple simulations involves more time and larger datasets than the conventional deterministic approach, although this is becoming less of an issue with rapidly progressing technology (eg virtual memory). This paper illustrates how as few as 1,000 simulations can enhance the projections with robust measures of uncertainty. A larger number of simulations would increase the smoothness of the probability distributions (eg for growth rates and natural increase). 3. Managing user expectations. A stochastic approach does not necessarily mean that projections are more 'accurate'. The advantages of a stochastic approach are in quantifying uncertainty and in more realistic, variable projection trajectories. But the estimates of uncertainty are themselves uncertain. 4. Applying stochastic methods to subgroup populations. Most users of Statistics NZ demographic projections want additivity or internal consistency. For example, that subnational projections sum consistently to national projections. Or, that ethnic population projections are consistent with those of the total population. There may be expectations that a stochastic approach could be readily applied to national projections of labour force, families and households, and ethnic populations. Because there are more parameters to model, these would be more difficult than the application to national population projections. Similarly, stochastic projections for subnational areas need to deal with multiple geographic areas with small populations. The implications for other projections are discussed in the following section. 5. Effective dissemination of methods and results. Any projection methodology and underlying assumptions need to be transparent to users. Stochastic methods are, arguably, more difficult to explain concisely, especially in a non-technical way. If and when stochastic projections are produced, their use and understanding would be helped by appropriate explanatory notes and metadata that typically accompany any Statistics NZ release. There are additional issues in summarising and presenting the results of thousands of projection trajectories or simulations (eg graphical representation of uncertainty bands). In addition, many projection results (eg percentage age distributions) would need to be explicitly provided for users, as they cannot be accurately derived from conventional results (eg population by age). However, this paper illustrates how stochastic assumptions and projections can be presented using fan charts, developed using R and Excel statistical software, although it is difficult to display multiple variables (eg births and deaths) in one chart. 38 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Furthermore, all the stochastic results presented in this paper (section 3) can be disseminated through existing tools, such as Statistics NZ’s Table Builder (Statistics New Zealand, 2010e). Alternative percentiles that summarise the probability distributions could be published in preference to the current practice of publishing selected deterministic series. Implications for other projections Statistics NZ produces an integrated and comprehensive suite of demographic projections for national and subnational areas. Advocates of stochastic projections rarely comment on the implications and methodology for the full range of projections typically produced by national statistical organisations, tending to focus on one dimension (eg national population projections or subnational population projections). And yet, this consistency of assumptions and projections across the suite of demographic projections is critical for data users, who expect additivity and plausibility at all levels. In principle, a similar stochastic approach as that applied to the national population projections, could be applied to the other demographic projections produced by Statistics NZ. Some practical aspects are discussed here. National labour force projections Projections of the labour force are derived by applying labour force participation rates (LFPRs) to population projections, by age-sex. The most recent national labour force projections assumed LFPRs remain constant at most ages beyond 2021 (Statistics New Zealand, 2010d). The major exception to this was at ages above 55 years, where current trends and international comparisons suggest continued increases in LFPRs are likely. This reflects increasing flexibility in the retirement age (with no compulsory retirement age), and increasing life expectancy and well-being at the older ages. However, as with all projection assumptions, any constancy does not signify an expectation of actual stability, but is merely a simplification of a complex reality. The constancy also reflects that the level and trend of LFPRs is uncertain in the long term. A stochastic approach to labour force projections would therefore seem to enhance their interpretation. Historical LFPR time series are available from two sources, although with some definitional differences and changes over time: 1. the five-yearly Census of Population and Dwellings, which provides single-year of age detail 2. the quarterly Household Labour Force Survey, which provides official measures of the labour force, but with sampling error even for grouped ages. Estimates of LFPR uncertainty would therefore need to draw on both data sources, perhaps with expert judgement being applied. 39 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan National ethnic population projections These projections include two additional parameters, namely paternity rates by age of father (to allow for births where the mother is not of the specified ethnic group, but the father is) and inter-ethnic mobility rates by age-sex (to allow for people changing their ethnic identification over time). One issue with all ethnic parameters, including those related to the conventional fertility, mortality, and migration assumptions, is the much shorter time series of historical data compared with that of the total New Zealand population. Moreover, the measurement of ethnicity has changed over time in many collections, while it is not captured at all in some collections (eg external migration data). Fertility rates for the broad Pacific and Asian populations are only available for 1996, 2001, and 2006. This partly reflects the limited availability of population estimates and the introduction of new birth (and death) registration forms in 1995. Official life tables do not exist for either population, reflecting their relatively small populations and few deaths. The limited ethnic time series may therefore require a more subjective approach to estimating parameter uncertainty. Nevertheless, ethnic population projections are a good example of where some quantification of uncertainty, even though uncertain, would assist interpretation of the projection results. Ethnic population projections are more uncertain than projections of the total population. But how much more uncertain? And ethnic population projections are produced with a shorter projection period – providing measures of uncertainty could help justify why the projection period is shorter. An example of stochastic methods applied to ethnic group projections is given for the United Kingdom population in Coleman and Scherbov (2005). One of the key issues relating to ethnic populations is their overlap, reflecting that people can and do identify with multiple ethnicities. Stochastic projections of individual ethnic groups could be produced independently. However, they could not be meaningfully combined or contrasted with stochastic projections of the total New Zealand population (eg to calculate ethnic shares), other than using the median projection which is deterministically derived. National family and household projections Statistics NZ currently projects families and households using a propensity method. In this method, living arrangement type rates (LATRs) by age-sex are applied to population projections to give projections of the population in 11 different living arrangement types. These projections are subsequently aggregated to give projections of families (by broad family type) and households (by broad household type). Uncertainty in the family and household projections can arise from the underlying population projections, the LATRs, and four additional univariate parameters: 1. average number of families per family household 2. average number of people per other multiperson household 3. proportion of two-parent families with dependent children 4. proportion of one-parent families with dependent children. 40 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Variances for all of these parameters could be modelled from historical census data. However, the data are essentially limited to a maximum of six censuses (1981 to 2006 at five-year intervals), where the requisite data are available electronically. Furthermore, changes to census collection and processing procedures from census to census mean that the census parameters are not necessarily consistent over time. This may lead to an overestimate of the respective variances. Alho and Keilman (2010) have developed a method that involves a random breakdown of the population according to household position (eg single, cohabiting, living with a spouse, living alone). These positions are analogous to LATRs, so the random share method could be incorporated into the propensity method to provide some quantification of uncertainty in LATRs in addition to the uncertainty in the underlying population projections. One aspect that could be problematic in stochastic family and household projections is the significantly different numbers of male and female partners (in ‘couple without children’ and/or ‘two-parent families’). The deterministic mid-range series is carefully formulated to project similar numbers of male and female partners, bearing in mind the numbers need not be identical because of same-sex partnerships and differences in self-identification (eg differences between partners as to whether they are usually partnered or even as to whether they usually live in New Zealand or at the same address). Without a reconciliation, the stochastic projections could be demographically implausible. Subnational demographic projections Statistics NZ produces various demographic projections for three key geographic units: 1. regional council areas (regions). Based on boundaries and estimated resident populations at 30 June 2010, there were 16 regions – ranging in size from Auckland (1.46 million people) down to West Coast (33,000)(Statistics New Zealand, 2010f). 2. territorial authority areas (TAs). The new Auckland Council area (effective 1 November 2010) reduces the number of TAs to 67, with a median size of 30,000 people. Three-quarters of TAs have a population between 7,000 and 60,000. 3. area units (AUs). In 2010, there were almost 2,000 AUs (equivalent to ‘suburbs’ in major urban areas), with a median size of 2,000 people. Three-quarters of AUs had a population between 100 and 4,000. AUs also form the building blocks for deriving projections for other geographic areas, such as wards and urban/rural areas. Projection accuracy is generally proportional to the population size of geographic areas (Keilman, 2005; Statistics New Zealand, 2008). That is, demographic uncertainty generally increases as the size of the geographic area decreases. Conveying this uncertainty to users of subnational population projections would seem to be prudent. 41 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan However, in practice, any estimates of uncertainty are likely to be much more uncertain than those at the national level. This is largely because of the much shorter historical time series available for subnational fertility, mortality, and migration measures. In turn, this partly reflects the major geographic boundary changes that have occurred. For example, the local government reorganisation that took effect on 1 November 1989 caused significant boundary changes when establishing regional council and TA areas. Historical time series for AUs are complicated by their ongoing proliferation, which saw the number of AUs increase from 1,775 in 1996 to 2,013 in 2011. Although fertility, mortality, and migration can be assumed to be largely independent at the national level, at the subnational level this is less true. For example, the migration of people in the prime reproductive ages, 20–44 years, is often associated with the loss/gain of reproductive potential – their offspring. Or, ‘greenfield’ developments that attract younger adult migrants can result in significant changes in local fertility patterns as family formation occurs. There are also some significant, although often complex, migrations of older people related to changes in their well-being and independence, which result in correlations between migration and mortality. A further limitation of stochastic subnational population projections is apparent for areas subject to high growth trajectories, either in the past or the future. For areas that have reached capacity or have very limited scope for further residential development, past demographic patterns may be a poor basis for formulating projection assumptions, let alone demographic uncertainty. Similarly, for areas of future high growth (eg greenfield developments), past demographic patterns are unlikely to indicate the uncertainty around the pace and timing of future population growth. Extrapolative techniques are likely to be unsuitable in such circumstances. Cameron and Poot (2010) outline a method for producing stochastic population projections for subnational areas. While this work seems promising, their projections are only for selected geographic areas and are unconstrained to higher-level geographies (eg national population projections). The uncertainty parameters are based on national data and are assumed to be the same for all ages, both sexes, and all geographic areas. Cameron and Poot suggest a stochastic approach can mitigate the inherent conservatism of projections, where the fastest growing areas tend to be under-projected and the slowest growing (or declining) areas tend to be over-projected. This appears to be achieved by using migration rates, rather than migration levels. The use of age-andsex-specific migration rates is already an implicit part of Statistics NZ's method, as evidenced by changing migration levels (although limited) over the projection period. Yet the use of explicit migration rates intuitively appeals. However, all the stochastic projections presented by Cameron and Poot are higher than the mid-range Statistics NZ deterministic projections, which suggests that slow growing (or declining) areas will be further over-projected and underscores the importance of constraining to plausible higher-level projections. If explicit migration rates were used, constraining to higher-level projections would not be as straightforward. 42 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Cameron and Poot also do not examine if and how a stochastic method is transferable to the AU geography, which remains a critical part of Statistics NZ's subnational population projections. One option, not discussed in their paper, is whether different methods are preferable for different geographies. For example, is a stochastic approach suitable for TAs, but unsuitable for AUs? An important consideration in this context may be the added time and cost of a more complex method, when producers of population projections are under pressure to reduce the time and costs of production. Conclusion Both deterministic and stochastic projections are a simplification or model of complex and variable phenomena, with projection assumptions designed to convey future average long-run patterns. However, deterministic scenarios give a poor indication of uncertainty for some key demographic characteristics. Stochastic projections offer the clear advantage of giving estimates of uncertainty. They therefore give users of projections a better indication of uncertainty and the associated probability intervals. Moreover, given the nature of projections, a stochastic approach is consistent with how statistical agencies would like projections to be conveyed and interpreted. 43 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan References Alders, M, & de Beer, J (2004). Assumptions on fertility in stochastic population forecasts. International Statistical Review, 72(1), p65–79. Alders, M, & de Beer, J (2005). An expert knowledge approach to stochastic mortality forecasting in the Netherlands. In N Keilman (ed), Perspectives on Mortality Forecasting: Probabilistic Models, Social Insurance Studies No. 2, Swedish Social Insurance Agency. Available from www.forsakringskassan.se Alders, M, Keilman, N, & Cuijsen, H (2007). Assumptions for long-term stochastic population forecasts in 18 European countries. European Journal of Population, 23, 33– 69. Alho, JM (1997). Scenarios, uncertainty and conditional forecasts of the world population. Journal of the Royal Statistical Society: Series A (Statistics in Society), 160(1), 71–85. Alho, JM (2005). Remarks on the use of probabilities in demography and forecasting. In N Keilman (ed), Perspectives on Mortality Forecasting: Probabilistic Models, Social Insurance Studies No. 2, Swedish Social Insurance Agency. Available from www.forsakringskassan.se Alho, J, & Keilman, N (2010). On future household structure. Journal of the Royal Statistical Society: Series A (Statistics in Society), 173(1), 117–143. Australian Bureau of Statistics (2009). ABS data quality framework. Available from www.abs.gov.au Australia Productivity Commission (2005). Economic implications of an ageing Australia. Available from www.pc.gov.au Booth, H (2004). On the importance of being uncertain: forecasting population futures for Australia. People and Place, 12(2), 1–12. Available from http://elecpress.monash.edu.au Booth, H (2006). Demographic forecasting: 1980 to 2005 in review. Working Papers in Demography 100, Australian National University. Available from http://adsri.anu.edu.au Booth, H, & Tickle, L (2008). Mortality modelling and forecasting: A review of methods. Australian Demographic and Social Research Institute working paper No 3. Available from http://adsri.anu.edu.au Bryant, J (2003). Can population projections be used for sensitivity tests on policy models? New Zealand Treasury Working Paper 03/07. Available from www.treasury.govt.nz Bryant, J (2005). What can stochastic population projections contribute to policy analysis? New Zealand Population Review, 31(1), 1–11. Cameron, M, & Poot, J (2010). A stochastic sub-national population projection methodology with an application to the Waikato region of New Zealand. Population Studies Centre Discussion paper No 70. Available from www.waikato.ac.nz 44 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Chatfield, C (2009). The analysis of time series: An introduction (6th edition): Chapman & Hall/CRC. Coleman, D, & Scherbov, S (2005). Immigration and ethnic change in low-fertility countries – towards a new demographic transition? Presented at the Population Association of America Annual Meeting, Philadelphia. Available from www.spsw.ox.ac.uk De Beer, J (2000). Dealing with uncertainty in population forecasting. Statistics Netherlands. Available from www.cbs.nl De Beer, J, & Alders, M (1999). Probabilistic population and household forecasts for the Netherlands. Statistics Netherlands. Paper for the European Population Conference EPC99, 30 August – 3 September 1999, The Hague, The Netherlands. Available from www.cbs.nl Dowd, K, Blake, D, & Cairns, A (2010). Facing up to uncertain life expectancy: The longevity fan. Demography, 47(1), 67–78. Dunstan, K, Howard, A, & Cheung, J (2006). A history of survival in New Zealand: Cohort life tables 1876–2004. Wellington: Statistics New Zealand. Available from www.stats.govt.nz Gneiting, T (2008). Editorial: Probabilistic forecasting. Journal of the Royal Statistical Society – Series A: General – statistics in society, 171(2), 319–321. Hartmann, M, & Strandell, G (2006). Stochastic population projections for Sweden. Statistics Sweden. Available from www.scb.se Human Mortality Database. www.mortality.org Hyndman, R J, & Booth, H (2006). Stochastic population forecasts using functional data models for mortality, fertility and migration. Working Papers in Demography 99, Australian National University. Available from http://adsri.anu.edu.au Keilman, NW (1990). Uncertainty in national population forecasting: issues, backgrounds, analyses, recommendations. Swets & Zeitlinger: Amsterdam. Keilman, N (1997). Ex-post errors in official population forecasts in industrialised countries. Journal of Official Statistics, 13, 245–277. Keilman, N (2005). Erroneous population forecasts. Social Insurance Studies No. 2, in N Keilman (ed), Perspectives on Mortality Forecasting: Probabilistic Models, Swedish Social Insurance Agency. Available from www.forsakringskassan.se Kynn, M (2008). The 'heuristics and biases' bias in expert elicitation. Journal of the Royal Statistical Society – Series A: General – statistics in society, 171(1), 239–264. Lee, RD, & Carter, LR (1992). Modeling and forecasting U.S. mortality. Journal of the American Statistical Association, 87, 659–675. Lee, RD & Miller, T (2001). Evaluating the performance of the Lee-Carter method for forecasting mortality. Demography, 38(4), 537–549. Lee, R & Tuljapurkar, T (1994). Stochastic population forecasts for the United States: Beyond high, medium, and low. Journal of the American Statistical Association, 89, 1175–1189. Available from www.jstor.org Long, JF, & Hollmann, FW (2004). Developing official stochastic population forecasts at the US Census Bureau, International Statistical Review, 72(2), 201–208. 45 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Lutz, W (2009). Toward a systematic, argument-based approach to defining assumptions for population projections. Interim Report IR-09-037, International Institute for Applied Systems Analysis, Austria. Available from www.iiasa.ac.at Lutz, W, Sanderson, W, & Scherbov, S (1996). Probabilistic population projections based on expert opinion. In W Lutz (ed), The future population of the world: What can we assume today? London: Earthscan. Lutz, W, & Scherbov, S (1998). An expert-based framework for probabilistic national population projections: the example of Austria, European Journal of Population, 14, 1– 17. Lutz, W, Sanderson, W, & Scherbov, S (2001). The end of world population growth, Nature, 412, 543–545. Office of National Statistics (2004). Guidelines for measuring statistical quality. Available from www.statistics.gov.uk Office of National Statistics (nd). Improving Migration and Population Statistics. Available from www.ons.gov.uk O'Hagan, T (2005). Elicitation. Significance, 84–86. Prommer, I (2006). Current practice of official population projections in the EU: Summary of the questionnaires on the use of expert opinions in assumption making for population projections in national statistical offices. Interim Report IR-06-026, International Institute for Applied Systems Analysis, Austria. Available from www.iiasa.ac.at Qiang, Li, Reuser, M, Kraus, C, & Alho, J (2009). Ageing of a giant: a stochastic population forecast for China, 2006–2060. Journal of Population Research, 26(1), 21– 50. Rowan, S, & Wright, E (2010). Developing stochastic population forecasts for the United Kingdom: Progress report and plans for future work. Paper to joint Eurostat/UNECE Work Session on Demographic Projections, 28–30 April 2010, Lisbon, Portugal. Available from www.unece.org Statistics Canada (2002). Statistics Canada's quality assurance framework. Available from www.statcan.gc.ca Statistics New Zealand (2007a). A report on the 2006 Post-enumeration Survey. Available from www.stats.govt.nz Statistics New Zealand (2007b). Principles and protocols for producers of tier 1 statistics. Available from www.statisphere.govt.nz Statistics New Zealand (2008). How accurate are population projections? An evaluation of Statistics New Zealand population projections, 1991–2006. Available from www.stats.govt.nz Statistics New Zealand (2009a). National population projections, 2009(base)–2061. Available from www.stats.govt.nz Statistics New Zealand (2009b). Birth and population trends: how predictable are they? Available from www.stats.govt.nz Statistics New Zealand (2009c). Cohort life tables [Data files]. Available from www.stats.govt.nz 46 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Statistics New Zealand (2009d). Period life tables [Data files]. Available from www.stats.govt.nz Statistics New Zealand (2010a). Information about the population estimates. Available from www2.stats.govt.nz Statistics New Zealand (2010b). Births: Age-specific fertility rates by single-year of age (Total population) [Data files]. Available from www.stats.govt.nz/infoshare Statistics New Zealand (2010c). Information about the demographic projections. Available from www2.stats.govt.nz Statistics New Zealand (2010d). National Labour Force Projections: 2006(base)–2061 update. Available from www.stats.govt.nz Statistics New Zealand (2010e). Population projections tables [Data files]. Available from www.stats.govt.nz Statistics New Zealand (2010f). Subnational population estimates tables [Data files]. Available from www.stats.govt.nz Tuljapurkar, S (2005). Stochastic forecasts of mortality, population and pension systems. In N Keilman (ed), Perspectives on Mortality Forecasting: Probabilistic Models, Social Insurance Studies No. 2, Swedish Social Insurance Agency. Available from www.forsakringskassan.se Wilson, T (2004). The application of a probabilistic framework to New Zealand’s official national population projections. Discussion Paper 2004/02, Queensland Centre for Population Research, University of Queensland. Wilson, T (2005). The application of a probabilistic framework to New Zealand’s official national population projections. New Zealand Population Review, 31(1), 51–75. Wilson, T, & Bell, M (2003). Probabilistic population forecasts for Australia and New Zealand, 2002–2051. Paper prepared for the Population Association of New Zealand conference, July 2003, Queensland Centre for Population Research, University of Queensland. Wilson, T, & Bell, M (2003). Subnational probabilistic population forecasts: the example of Australia. Discussion Paper 2003/06, Queensland Centre for Population Research, University of Queensland. Available from http://espace.library.uq.edu.au Wilson, T, & Bell, M (2004). Australia's uncertain demographic future. Demographic Research, 11(8), 195–234. Available from www.demographic-research.org Wilson, T, & Bell, M (2004). Probabilistic regional population forecasts: the example of Queensland, Australia. Discussion Paper 2004/04, Queensland Centre for Population Research, University of Queensland. Wilson, T, & Rees, P (2005). Recent developments in population projection methodology: a review. Population, Space and Place, 11, 337–360. 47 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Appendix 1: List of abbreviations ARIMA autoregressive integrated moving average ASFR age-specific fertility rate ASSR age-specific survival rate AU area unit de 0 female–male difference in life expectancy at birth e0 life expectancy at birth ERP estimated resident population HMD Human Mortality Database LATR living arrangement type rate LFPR labour force participation rate NCU net census undercount PES post-enumeration survey RTO resident temporarily overseas SE standard error TA territorial authority TFR total fertility rate 48 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Appendix 2: Examples of simulations Illustrations of simulation #1 (figure 30) and simulation #2 (figure 31) of 1,000 simulations, for selected assumptions and projection results. Figure 30 Example #1 of simulated total fertility rate, male and female life expectancy at birth, net migration, total population, births and deaths, and 65+ population 2010–2111 Male and female life expectancy at birth Total fertility rate Births per woman 3.0 Years of life 105 2.5 100 2.0 95 1.5 90 1.0 85 0.5 80 Female Male 0.0 75 2006 2016 2026 2036 2046 2056 2066 June year 2076 2086 2096 2106 2006 Net migration 2026 2036 2046 2056 2066 June year 2076 2086 2096 2106 2086 2096 2106 2086 2096 2106 Total population Thousand 50 2016 Million 8 40 7 30 6 20 5 10 4 0 3 -10 2 -20 1 -30 -40 0 2006 2016 2026 2036 2046 2056 2066 June year 2076 2086 2096 2106 2006 Births and deaths 120 2016 2026 2036 2046 2056 2066 June year 2076 65+ population Thousand 2.5 100 Million 2.0 80 1.5 Births 60 1.0 40 20 0.5 Deaths 0 2006 0.0 2016 2026 2036 2046 2056 2066 June year 2076 2086 2096 2006 2106 49 2016 2026 2036 2046 2056 2066 June year 2076 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Figure 31 Example #2 of simulated total fertility rate, male and female life expectancy at birth, net migration, total population, births and deaths, and 65+ population 2010–2111 Male and female life expectancy at birth Total fertility rate Births per woman 3.0 2.5 100 2.0 95 1.5 90 1.0 85 0.5 80 0.0 75 2006 2016 2026 2036 2046 2056 2066 June year 2076 2086 2096 2106 Female Male 2006 Net migration 2016 2026 2036 2046 2056 2066 June year 2076 2086 2096 2106 2086 2096 2106 2086 2096 2106 Total population Thousand 50 Years of life 105 Million 8 40 7 30 6 20 5 10 4 0 3 -10 2 -20 1 -30 -40 0 2006 2016 2026 2036 2046 2056 2066 June year 2076 2086 2096 2106 2006 Births and deaths 120 2016 2026 2036 2046 2056 2066 June year 2076 65+ population Thousand 2.5 100 Million 2.0 80 1.5 Births 60 1.0 40 20 Deaths 0.5 0 2006 0.0 2016 2026 2036 2046 2056 2066 June year 2076 2086 2096 2006 2106 50 2016 2026 2036 2046 2056 2066 June year 2076 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Appendix 3: Additional projection results Figure 32 Projected population aged 0–14 probability distribution 2009–2111 Figure 33 Projected percentage of population aged 0–14 probability distribution 2009–2111 51 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Figure 34 Projected population aged 15–39 probability distribution 2009–2111 Figure 35 Projected percentage of population aged 15–39 probability distribution 2009–2111 52 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Figure 36 Projected population aged 40–64 probability distribution 2009–2111 Figure 37 Projected percentage of population aged 40–64 probability distribution 2009–2111 53 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Figure 38 Projected population aged 85+ probability distribution 2009–2111 Figure 39 Projected percentage of population aged 85+ probability distribution 2009–2111 54 Experimental Stochastic Population Projections for New Zealand by Kim Dunstan Figure 40 Projected median age probability distribution 2009–2111 55
© Copyright 2026 Paperzz