Atmospheric Concentration of Chlorofluorocarbons: Addressing the

DEPARTMENT OF STATISTICS AND ACTUARIAL SCIENCE
University of Waterloo, 200 University Avenue West Waterloo, Ontario, Canada, N2L 3G1
519-888-4567, ext. 00000 | Fax: 519-746-1875 | www.stats.uwaterloo.ca
UNIVERSITY OF WATERLOO
DEPARTMENT OF STATISTICS AND ACTUARIAL SCIENCE
WORKING PAPER SERIES
2009-04
Atmospheric Concentration of
Chlorofluorocarbons: Addressing the
Global Concern with the Longitudinal
Bent-Cable Model
Shahedul Ahsan Khan,
University of Waterloo, e-mail:
[email protected]
Grace Chiu,
University of Waterloo, e-mail:
[email protected]
Joel A. Dubin,
University of Waterloo, e-mail:
[email protected]
Abstract
The recent steady decline in atmospheric chlorofluorocarbon (CFC) concentrations could be a direct result of the Montréal Protocol on Substances
That Deplete the Ozone Layer, in effect since 1987. To study the extent of the
decline, we introduce the longitudinal bent-cable model to describe CFC concentrations observed over a global detection network. The bent cable is a parametric regression model to study data that exhibit a trend change. It comprises
two linear segments to describe the incoming and outgoing phases, joined by
a quadratic bend to model the transition period. For longitudinal data, measurements taken over time are nested within observational units drawn from
some population of interest. Here, it is useful to develop a mixed-effects model
extension of existing (frequentist) bent-cable methodology for a single time series. We do so in a hierarchical Bayesian framework, where each observational
unit is associated with a random bent cable and within-unit serial correlation.
Our analysis using the longitudinal bent-cable model reveals a global decrease
in atmospheric levels of CFC-11. The global drop took place between January,
1989 and September, 1994 approximately, with November, 1993 being the estimated time point at which CFC-11 levels went from increasing to decreasing.
keywords: Longitudinal data; Bent-cable regression; Changepoint model; Bayesian
hierarchical model; Random effects.
1
Introduction
Many biological consequences such as skin cancer and cataracts, irreversible damage to plants, and reduction of drifting organisms (animals, plants, archaea, bacteria) in the ocean’s photic zone may result from the increased ultraviolet (UV) exposure due to ozone depletion. According to “Ozone Science: The Facts Behind the
Phaseout” by the U.S. Environmental Protection Agency (U.S. EPA, “http://www.epa.
gov/ozone/science/sc fact.html”), each natural reduction in ozone levels has been
followed by a recovery, though there is convincing scientific evidence that the
ozone shield is being depleted well beyond changes due to natural processes. In
particular, ozone depletion due to human activities is a major concern, and may be
controlled. One such human activity is the use of chloroflurocarbons (CFCs). As
cited in “The Ozone Hole Tour” by the University of Cambridge (“http://www.atm.
ch.cam.ac.uk/tour/part3.html”), the catalytic destruction of ozone by atomic chlorine and bromine is the major cause of the forming of polar ozone holes, and photodissociation of CFC compounds is the main reason for these atoms to be in the
stratosphere.
CFCs are nontoxic, nonflammable chemicals containing atoms of carbon, chlorine and fluorine. CFC-11 is one such compound. CFCs were extensively used
in air conditioning/cooling units, and as aerosol propellants prior to the 1980’s.
While CFCs are safe to use in most applications and are inert in the lower atmosphere, they do undergo significant reaction in the upper atmosphere. As cited in
The Columbia Encyclopedia, CFC molecules take an average of fifteen years to travel
1
270
265
260
250
255
CFC-11 (in ppt)
275
from the ground to the upper atmosphere, and can stay there for about a century.
Chlorine inside the CFCs is one of the most important free radical catalysts to destroy ozone. The destruction process continues over the atmospheric lifetime of
the chlorine atom (one or two years), during which an average of 100,000 ozone
molecules are broken down. Because of this, CFCs were banned globally by the
1987 Montréal Protocol on Substances That Deplete the Ozone Layer (“Montreal
Protocol” in The Columbia Encyclopedia). Since this protocol came into effect, the
atmospheric concentration of CFCs has either leveled off or decreased. For example, Figure 1 shows the monthly average concentrations of CFC-11 monitored
from a station in Mauna Loa, Hawaii (Global Monitoring Division of the National
Oceanic and Atmospheric Administration (NOAA)/Earth System Research Laboratory (ESRL)). We see roughly three phases: an initial increasing trend (incoming
phase), a gradual transition period, and a decreasing trend after the transition period (outgoing phase). This trend is representative of CFC-11 measurements taken
by stations across the world and illustrated in this paper.
1988
1990
1992
1994
1996
1998
2000
Time
Figure 1: Trend of the monthly average concentrations of CFC-11 in Mauna Loa,
Hawaii. (Data source: NOAA/ESRL global monitoring division)
The effects of CFCs in ozone depletion is a global concern. Although exploratory
data analyses reveal a decrease of CFCs in the earth’s atmosphere since the early
1990’s, so far no sophisticated statistical analysis has been conducted to evaluate the global trend. In addition, there are several other important questions regarding the CFC concentration in the atmosphere that could be useful not only to
policy makers, but also for human awareness. For example, (1) How long did it
take for the CFC concentration to show an obvious decline? (2) What were the
rates of change (increase/decrease) in CFCs before and after the transition period? (3) What was the critical time point (CTP) at which the CFC trend went
from increasing to decreasing? In this article, we will address these questions
2
statistically by fitting a special changepoint model for CFC-11 data. We focus
on CFC-11, because it is considered one of the most dangerous CFCs to reduce
the ozone layer in the atmosphere. In fact, it has the shortest lifetime of common CFCs, and is regarded as a reference substance in the definition of the ozone
depletion potential (ODP). The ODP of a chemical is the ratio of its impact on
ozone compared to the impact of a similar mass of CFC-11. Thus, the ODP is 1
for CFC-11 and ranges from 0.6 to 1 for other CFCs. (These facts about CFCs are
taken from the U.S. EPA websites, “http://www.epa.gov/ozone/defns.html” and
“http://www.epa.gov/ozone/science/ods/classone.html”.)
In a broader sense, we will comment in this article on (i) the global trend of
CFC-11, and (ii) the effectiveness of the Montréal Protocol on preserving the ozone
level by reducing the use of CFC-11. Our findings will also provide a rough idea
of how long it may take to diminish CFC-11 from the earth’s atmosphere. Note
that this article is associated with that of Khan, Chiu and Dubin (2009, to appear in
CHANCE) with more details on our proposed methodology.
2
Data
CFCs are monitored from different stations all over the globe by the NOAA/ESRL
global monitoring division (“ftp://ftp.cmdl.noaa.gov/hats/cfcs/cfc11/insituGCs/”),
and by the Atmospheric Lifetime Experiment/Global Atmospheric Gases Experiment/Advanced Global Atmospheric Gases Experiment (ALE/GAGE/AGAGE)
global network program (“http://cdiac.esd.ornl.gov/ndps/alegage.html”). Henceforth, we will refer to these two programs simply as NOAA and AGAGE.
Under the Radiatively Important Trace Species (RITS) program, NOAA began
measuring CFCs using in situ gas chromatographs at their four baseline observatories — Pt. Barrow (Alaska), Cape Matatula (American Samoa), Mauna Loa
(Hawaii) and South Pole (Antarctica) — and at Niwot Ridge (Colorado) in collaboration with the University of Colorado. We will label these five stations from 1 to 5
respectively. During the period of 1988-1999, a new generation of gas chromatography called Chromatograph for Atmospheric Trace Species (CATS) was developed
and has been used to measure CFC concentrations ever since.
The AGAGE program consists of three stages corresponding to advances and
upgrades in instrumentation. The first stage (ALE) began in 1978, the second
(GAGE) began during 1981-1985, and the third (AGAGE) began during 1993-1996.
The current AGAGE stations are located in Mace Head (Ireland), Cape Grim (Tasmania), Ragged Point (Barbados), Cape Matatula (American Samoa), and Trinidad
Head (California). These five stations will be labeled from 6 to 10 respectively.
We consider monthly mean data for our statistical analysis. Ideally, we wish
to have (a) full data for all stations, (b) a long enough period to capture all three
phases of the CFC trend, and (c) no change in instrumentation to avoid the elements of non-stationarity and biased measurement, if any. However, we do not
have the same duration of consecutive observations for all stations. Moreover, data
were recorded by instrumentation that switched from one type to another. Table 1
3
summarizes the availability of the consecutive observations, and the instrumentations used to record data.
Table 1: CFC-11 data summary
Station
Available consecutive observations
Instrumentation
Barrow (Station 1)
Nov, 1987 - Feb, 1999
RITS
Jun, 1998 - Aug, 2008
CATS
May, 1989 - Apr, 2000
RITS
Dec, 1998 - Aug, 2008
CATS
Jul, 1987 - Aug, 2000
RITS
Jun, 1999 - Aug, 2008
CATS
Jun, 1990 - Nov, 2000
RITS
Feb, 1998 - Aug, 2008
CATS
Feb, 1990 - Apr, 2000
RITS
May, 2001 - Jul, 2006
CATS
Feb, 1987 - Jun, 1994
GAGE
Mar, 1994 - Sep, 2007
AGAGE
Dec, 1981 - Dec, 1994
GAGE
Aug, 1993 - Sep, 2007
AGAGE
Aug, 1985 - Jun, 1996
GAGE
Jun, 1996 - Sep, 2007
AGAGE
Jun, 1991 - Sep, 1996
GAGE
Aug, 1996 - Sep, 2007
AGAGE
–
GAGE
Oct, 1995 - Sep, 2007
AGAGE
Cape Matatula (Station 2)
Mauna Loa (Station 3)
South Pole (Station 4)
Niwot Ridge (Station 5)
Mace Head (Station 6)
Cape Grim (Station 7)
Ragged Point (Station 8)
Cap Matatula (Station 9)
Trinidad Head (Station 10)
Thus, ideal statistical conditions are not achievable in this case. As a compromise, we remove Stations 9 and 10 from our analysis due to insufficient data, and
choose a study period in such a way that it can reflect the changing behavior of the
CFC-11 concentration in the atmosphere. The Montréal Protocol came into force on
January 1, 1989. So, we expect an increasing trend in CFC-11 prior to 1989 because
of its extensive use during that period. After the implementation of the protocol, we expect a change (either decreasing or leveling off) in the CFC-11 trend. To
characterize this change, we wish to have a study period starting from some point
before the implementation of the protocol. Moreover, we must have sufficient data
to observe the CFC-11 trend, if any. Thus, we settle for a relatively long study period of 152 months from January, 1988 to August, 2000, which is perhaps the most
reasonable to satisfy (a)-(c) as much as possible. In particular, it covers Stations 3-4
with a single measuring device, RITS. Stations 2 and 5 have RITS data until April,
2000, at which point we truncate their data so that only RITS is present for all of Sta4
260
250
240
Barrow (Station 1)
Cape Matatula (Station 2)
Mauna Loa (Station 3)
South Pole (Station 4)
Niwot Ridge (Station 5)
Mace Head (Station 6)
Cape Grim (Station 7)
Ragged Point (Station 8)
230
CFC-11 in parts-per-trillion (ppt)
270
280
tions 2-5 throughout the study period. Data for the remaining four stations during
this period were recorded by two measuring devices – RITS and CATS for Station
1, and GAGE and AGAGE for Stations 6-8 – each device occupying a substantial
range of the 152 months. Figure 2 shows the eight profiles of the corresponding
CFC-11 data. Specifically, each station constitutes an individual curve, which is
different from the others due to actual CFC-11 levels during measurement, exposure to wind and other environmental variables, sampling techniques, and so on.
Our objective is to assess the global CFC-11 concentration in the atmosphere, as
well as station-specific characterization of the trends.
1988
1990
1992
1994
1996
1998
2000
Time
Figure 2: CFC-11 profiles of eight stations (monthly mean data)
Note that in this paper, we do not take into consideration the effects of change
in instrumentation (RITS/CATS and GAGE/AGAGE). Modeling these effects is
currently in progress (see the Conclusion section), and a preliminary analysis reveals statistically insignificant results. However, between-station differences are
accounted for, at least partially, by the random components introduced in our modeling (see the Methods section).
3
Methods
We wish to unify information from each station to aid the understanding of the
global as well as station-specific behavior of CFC-11 in the atmosphere. We will
index the stations by i = 1, 2, . . . , 8, and the months from January, 1988 to August,
2000 by j = 1, 2, . . . , 152. Let tij denote the j th measurement occasion of the ith
station. We model the CFC-11 measurement for the ith station at time tij , denoted
5
by yij , by the relationship
yij = f (tij , θ i ) + ij
(1)
where θ i is a vector of regression coefficients for the ith station, f (·) is a function
of tij and θ i characterizing the trend of the station-specific data over time, and ij
represents the random error component.
Some remarks are required to explain tij . Recall that some stations do not have
data for all 152 months. We employ the following system for defining tij . For
example, the first and last months with recorded data by Station 3 are January, 1988
and August, 2000, respectively; thus, t3,1 = 1, t3,2 = 2, . . . , t3,152 = 152. In contrast,
Station 2 had its first and last recordings in May, 1989 and April, 2000, respectively;
hence, t2,1 = 17, t2,2 = 18, . . . , t2,132 = 148. The same approach is used to define tij ’s
for other i’s. Note that a few yij ’s (from 1 to 5) were missing between the first and
last months for a given station. We replace them by observations from another
data set (e.g. CATS or AGAGE) or by mean imputation based on neighboring time
points if not available from another data set. Though mean imputation can be
problematic resulting in biased estimates, if just a few missing values are replaced
by the mean, the deleterious effect of mean substitution is reduced (McKnight,
Figueredo and Sidani, 2007). So, we expect our findings to be minimally affected
by this replacement for so few time points.
Next, we will describe the complete formulation of the model given by (1). We
would like an expression for f that not only describes the CFC-11 profiles as shown
in Figure 2, but also gives useful information regarding the rates of change and the
transition. Although a simple quadratic model such as f (tij , θ i ) = β0i +β1i tij +β2i t2ij ,
where θ i = (β0i , β1i , β2i )0 , might be appealing to characterize the overall convexity
of the trend, it would not be expected to fit the observed data all that well, in
light of the apparent three phases: incoming and outgoing, joined by the curved
transition. A characterization of such a trend can be well-accomplished by the
so-called bent-cable function (Chiu, Lockhart and Routledge, 2006), given by
f (tij , θ i ) = β0i + β1i tij + β2i q(tij , γi , τi )
(2)
(tij − τi + γi )2
1{|tij − τi | ≤ γi } + (tij − τi )1{tij − τi > γi }.
q(tij , γi , τi ) =
4γi
(3)
where
A graphical description of this function is given in Figure 3. The model is parsimonious in that it has only five regression coefficients, and is appealing due to the
greatly interpretable regression coefficients. We will extend their model to account
for the between-station heterogeneity as suggested by the different profiles in Figure 2. Henceforth, we will denote f (tij , θ i ) and q(tij , γi , τi ) by simply fij and qij ,
respectively.
Overall, the bent-cable function, f , represents a smooth trend over time. The
random error components, ij ’s in (1), account for within-station variability in the
measurements around the regression f . Many standard assumptions for regression analyses do not hold for longitudinal data, including independence between
6
Bent-cable function, f
Incoming
phase
Outgoing phase
𝛾𝛾𝑖𝑖
𝜏𝜏𝑖𝑖 − 𝛾𝛾𝑖𝑖
𝛾𝛾𝑖𝑖
𝜏𝜏𝑖𝑖
𝜏𝜏𝑖𝑖 + 𝛾𝛾𝑖𝑖
Time
Figure 3: The bent-cable function, comprising two linear segments (incoming and
outgoing) joined by a quadratic bend. There are three linear parameters β0i , β1i
and β2i , and two transition parameters γi and τi , with θ i = (β0i , β1i , β2i , γi , τi )0 . The
intercept and slope for the incoming phase are β0i and β1i , respectively; the slope
for the outgoing phase is β1i + β2i ; and the center and half-width of the bend are τi
and γi , respectively. The critical time point (at the green line) is the point at which
the response takes either an upturn from a decreasing trend or a downturn from
an increasing trend.
all measurements (Fitzmaurice, Laird and Ware, 2004). Instead, it is necessary
to account for correlation between repeated measurements within the same unit
(here, station) over time. Though, for some data, the unit-specific coefficients θ i
may adequately account for this correlation, quite often there is additional serial
correlation remaining that can be accounted for by these ij ’s. An order-p (p > 0)
autoregressive (AR(p)) serial correlation structure (Box and Jenkins, 1994) may be
well-suited for many types of longitudinal data. Thus, we consider here an AR(p),
p > 0, model for the ij ’s, that is,
ij = φ1 i,j−1 + φ2 i,j−2 + . . . + φp i,j−p + uij
(4)
where φ1 , φ2 , . . . , φp are the AR(p) parameters, which we denote by the vector φ,
that is, φ = (φ1 , φ2 , . . . , φp )0 . We assume that uij ’s in (4) are independently and
2
identically normally distributed with mean 0 and variance σui
. Furthermore, we
assume that the initial p observations for each i are known. We develop a Bayesian
methodology conditional on these initial observations, that is, the posterior (Section
3.1) is conditioned on these initial observations. This assumption is customary in
the frequentist’s estimation procedures, and was considered by Chib (1993) in a
Bayesian approach. To analyze a single profile (measurements from a single station
over time) using a bent-cable model with AR(p) noise, the reader may refer to Chiu
and Lockhart ([3] and [4]).
7
In particular, an AR(1) model, being very parsimonious, is characterized by
an exponential decay of correlations as the separation between a pair of measurements increases, a feature that is common in many longitudinal data sets. For
our CFC-11 analysis, we assume an AR(1) structure with correlation parameter
ρ = corr(i,j , i,j±1 ), and variance parameters σi2 = var(ij ). To define a population
level of CFC-11, we wish to relate the station-specific coefficients θ i to the population coefficients in some meaningful way. Before describing such a relation, we
mention here one more assumption we make in developing the model: the linear
coefficients β i = (β0i , β1i , β2i )0 and transition coefficients αi = (γi , τi )0 are independent a priori (see Section 3.1). Practically, β i characterizes the intercepts and
the rates of change, while αi characterizes the structure of the transition. There
is generally no way to infer about αi from prior knowledge of β i , without first
seeing the data. For example, suppose the two linear phases are very steep. This
information alone is not sufficient to answer the question “what is the chance of αi
being, say, (30, 40)0 versus (35, 50)0 ?” Hence, it is reasonable to assume that β i and
αi are a priori independent. Now, we relate β i to the population coefficient vector
β = (β0 , β1 , β2 )0 plus a random component η 1i , that is, β i = β + η 1i , where η 1i is
a 3 × 1 column vector. Under the assumption that η 1i has mean 0 and a 3 × 3 covariance matrix D1 , conceptually we can regard stations as having their own linear
coefficients, and the population coefficients as the average across stations. Then,
the covariance matrix D1 provides information on the variability of the deviation of
the station-specific coefficient β i from the population coefficient β. As an extreme
example, a zero variation of the deviation between β1i and β1 indicates that the
station-specific and global incoming slopes are identical. In other words, rates of
change in the incoming phase are identical across stations. To complete the model
formulation for the linear coefficients, an assumption on the distribution of η 1i , or
equivalently β i , is necessary. Since (i) the dependence structure can be fully specified by the covariance matrix through an assumption of the multivariate normal
distribution, and (ii) it is convenient theoretically and computationally, we assume
that η 1i ∼ MVN (0, D1 ), or equivalently, β i ∼ MVN (β, D1 ).
Let us turn now to a model for the transition coefficients. Note that γi and τi
are positive. We assume that α∗i = (log (γi ), log (τi ))0 = α∗ + η 2i , α∗ = (γ ∗ , τ ∗ )0 ,
and η 2i is a 2 × 1 column vector with η 2i ∼ MVN (0, D2 ), so that E(α∗i ) = α∗ .
This assumption is equivalent to αi ∼ MVLN (α∗ , D2 ), i.e. a bivariate lognormal
distribution with α∗ and D2 being the mean vector and the covariance matrix of
∗
∗
α∗i , respectively. We may then define a population effect as (eγ , eτ )0 , which is
the median of αi under the lognormal distribution. Statistical assumptions for
ij , β i and αi as discussed above, together with Equations (1)-(4), constitute our
longitudinal bent-cable.
3.1
Statistical Inference
We employ a Bayesian approach for statistical inference. The main idea of Bayesian
inference is to combine data and prior knowledge on a parameter to determine its
posterior distribution (the conditional density of the parameter given the data). The
8
prior knowledge is supplied in the form of a prior distribution of the parameter,
which quantifies information about the parameter prior to any data being gathered. For example, recall the argument for the a priori independence of β i and
αi . When little is reliably known about the parameter, it is reasonable to choose a
fairly vague, minimally informative prior. For example, in analyzing the CFC-11
data, we assumed a multivariate normal prior for β with mean 0 and a covariance
matrix with very large variance parameters. This leads to a non-informative prior
for β, meaning the data, assuming a sufficient amount of data is collected, will
primarily dictate β’s resulting posterior distribution.
Any conclusions about the plausible value of the parameter are to be drawn
from the posterior distribution. For example, the posterior mean (median if the
posterior density is noticeably asymmetric) and variance can be considered an estimate of the parameter and the variance of the estimate, respectively. A 100(1−2p)%
Bayesian interval, or credible interval, is [c1 , c2 ], where c1 and c2 are the pth and
(1 − p)th quantiles of the posterior density, respectively. This credible interval has a
probabilistic interpretation. For example, for p = 0.025, the conditional probability
that the parameter falls in the interval [c1 , c2 ] given the data is 0.95.
The Bayesian model formulation of our longitudinal bent-cable model derived
from the above assumptions involves three levels (see Appendix 1 for details of the
first level):
(2) (1)
2
2
∼ MVN µi (β i , αi , φ), σui
Ii ,
Yi |yi , β i , αi , φ, σui



β i |β, D1 ∼ MVN (β, D1 ) 
,


∗
∗

αi |α , D2 ∼ MVLN (α , D2 )



β|h1 , H1 ∼ MVN (h1 , H1 ), α∗ |h2 , H2 ∼ MVN (h2 , H2 ), 




−1
−1
−1
−1
,
D1 ∼ W ν1 , (ν1 A1 ) , D2 ∼ W ν2 , (ν2 A2 ) ,






−2
a ab

φ|h3 , H3 ∼ MVN (h3 , H3 ), σui |a, b ∼ G( 2 , 2 )
(2)
(1)
(5)
(6)
(7)
0
where Yi = (Yi,p+1 , . . . , YP
= (yi1 , . . . , yip )0 , µP
i,ni ) , yi
i (β i , αi , φ) ≡ µi = (µi,p+1 ,
p
0
. . . , µi,ni ) , µij = β0i (1 − P k=1 φk ) + β1i xij + β2i rij + pk=1 φk yi,j−k , xij = tij −
P
p
p
k=1 φk ti,j−k , rij = qij −
k=1 φk qi,j−k (j = p + 1, . . . , ni ), and W and G stand for
Wishart and gamma distributions, respectively. The first two levels are (5) and (6)
which represent, respectively, the within-station and between-station characteristics, and the third level is (7), which is the specification of the prior distributions
with fixed, prespecified hyperparameters.
Bayesian inference is carried out by the Markov Chain Monte Carlo (MCMC)
method (Gilks, Richardson and Spiegelhalter, 1996). More specifically, we generate data by the Hybrid Gibbs (Smith and Roberts, 1993) (Metropolis within Gibbs)
9
algorithm to summarize the posterior distributions. We prefer conjugate priors
at level 3 to facilitate the straightforward implementation of the Gibbs sampler
because of the posterior’s closed form. While occasionally criticized to be too restrictive, conjugate priors are preferred here to reduce the extensiveness of computations involved in the MCMC method. Specification of the priors in Equation (7)
leads to the following full conditionals (see Appendix 2 for derivations):
β i |. ∼ MVN
−2
Mi σui
X0i
zi +
D−1
1
β , Mi ,
−1
β|. ∼ MVN U1 (D−1
β̃
+
H
h
),
U
1
1 ,
1
1
∗
−1
−1
α |. ∼ MVN U2 (D2 α̃ + H2 h2 ), U2 ,
m
hX
i−1 −1
0
D1 |. ∼ W m + ν1 ,
(β i − β) (β i − β) + ν1 A1
,
(8)
(9)
(10)
(11)
i=1
m
hX
i−1 −1
∗
∗
∗
∗ 0
D2 |. ∼ W m + ν2 ,
(αi − α ) (αi − α ) + ν2 A2
,
(12)
i=1
0
ni − p + a (zi − Xi β i (zi − Xi β i + ab
∼G
,
,
2
2
X
m
−2
0
−1
φ|. ∼ MVN V
σui Wi i + H3 h3 , V ,
−2
σui
|.
(13)
(14)
i=1
P
where zi = (zi,p+1 , . . . , zi,ni )0 with zij = yij − pk=1 φk yi,j−k , i = (i,p+1 , . . . , i,ni )0
Pm ∗
Pm
−2
= σui
X0i Xi + D−1
αi , M−1
with ij = yij − fij , β̃ =
1 ,
i
i=1P
i=1 β i , α̃ =
m
−1
−1
−1
−1
−1
−1
−1
−2
0
−1
U1 = mD1 + H1 , U2 = mD2 + H2 , V = i=1 σui Wi Wi + H3 , and




Pp
1
−
φ
x
r
i,p+1
i,p+1 
i,p−1
i,1

k=1 k
 i,p





Pp



xi,p+2 ri,p+2 
i,p+1
i,p
i,2 
 1 − k=1 φk


 , Wi = 
Xi = 
.


.
.
.
.
.
.


.
.
.
..
..
.. 

.
. 
 .






P
i,ni −1 i,ni −2 i,ni −p
1 − pk=1 φk
xi,ni
ri,ni
(15)
The full conditional for αi cannot be expressed in a closed form. It is proportional
to the following expression
n
o
n 1
o
1
1
∗
∗
exp − 2 (zi − Xi β i )0 (zi − Xi β i ) exp − (α∗i − α∗ )0 D−1
(α
−
α
)
.
2
i
γi τi
2σui
2
(16)
10
We use a random walk Metropolis (Givens, Hoeting and Sidani, 2005) for αi within
the Gibbs because of its nonstandard full conditional.
As little is known reliably on the station-specific profiles of the CFC-11 data
prior to data collection, values of the hyper-parameters in (7) are chosen to reflect
weak knowledge. We have written our own code in C to generate Monte Carlo
samples, and analyzed these in R using the coda package.
4
Results
240 245 250 255 260 265 270 275
CFC-11 (in ppt)
Figure 4 presents the estimated global CFC-11 concentrations using longitudinal
bent-cable regression assuming the AR(1) structure for within-station dependence.
The global drop (gradual) in CFC-11 took place between January, 1989 and September, 1994 approximately. Estimated incoming and outgoing slopes were 0.65 and
−0.12, respectively. Thus, the average increase in CFC-11 was about 0.65 ppt for a
one-month increase during the incoming phase (January, 1988 - December, 1988),
and the average decrease was about 0.12 ppt during the outgoing phase (October,
1994 - August, 2000). The 95% credible intervals indicated significant slopes for
the incoming/outgoing phases. Specifically, these intervals were (0.50, 0.80) and
(−0.22, −0.01), respectively, for the two linear phases, neither interval including
0. The estimated population critical time point (CTP) was November, 1993, which
implies that,
on average, CFC-11 went from increasing to decreasing around this
time. The corresponding 95% credible interval ranged from December, 1992 to
November, 1994.
CTP = Nov, 1993
Slope = ‐0.12
Slope = 0.65 End of transition = Sep, 1994 Beginning of transition = Jan, 1989 1988
1990
1992
1994
1996
1998
2000
Time
Figure 4: Global fit of the CFC-11 data using longitudinal bent-cable regression
assuming AR(1)
11
The station-specific fits, as well as the population fit are displayed in Figure 5.
It shows that the model fits the data well, with the observed data and individual
fits agreeing quite closely. Table 2 summarizes the fits numerically. We see significant increase/decrease of CFC-11 in the incoming/outgoing phases for all the
stations separately, as well as globally. The rates at which these changes occurred
(Columns 2 and 3) agree closely for the stations. This phenomenon was also evident in the estimate of D, showing small variation of the deviations between the
global and station-specific incoming/outgoing slope parameters (variance ≈ 0.02
for both cases).
1996
2000
1992
1996
2000
280
260
250
230
240
CFC-11 (in ppt)
270
280
270
260
250
CFC-11 (in ppt)
230
1988
1988
1992
1996
2000
1988
1992
1996
Time
Niwot Ridge
(Station 5)
Mace Head
(Station 6)
Cape Grim
(Station 7)
Ragged Point
(Station 8)
1996
Time
2000
1988
1992
1996
2000
270
260
250
CFC-11 (in ppt)
230
1988
Time
240
270
260
250
CFC-11 (in ppt)
230
240
270
260
250
CFC-11 (in ppt)
230
240
270
260
250
240
1992
2000
280
Time
280
Time
280
Time
230
1988
South Pole
(Station 4)
240
270
260
250
CFC-11 (in ppt)
230
240
270
260
250
CFC-11 (in ppt)
240
230
1992
280
1988
CFC-11 (in ppt)
Mauna Loa
(Station 3)
280
Cap Matatula
(Station 2)
280
Barrow
(Station 1)
1992
1996
Time
2000
1988
1992
1996
2000
Time
Figure 5: Station-specific fits (red) and population fit (green) of the CFC-11 data,
with estimated transition marked by vertical lines. The black line indicates observed data.
The above findings support the notion of constant rates of increase and decrease, respectively, before and after the enforcement of the Montréal Protocol, observable despite a geographically spread-out detection network. They also point
to the success of the widespread adoption and implementation of the Montréal
Protocol across the globe. However, the rate by which CFC-11 has been decreasing
(about 0.12 ppt per month, globally) suggests that it will remain in the atmosphere
throughout the 21st century, should current conditions prevail.
Let us turn now to the behavior of the transition of CFC-11 over time. The
transition periods and critical time points varied somewhat across stations (Ta12
Table 2: Estimated station-specific and global concentrtions of CFC-11 assuming
AR(1)
Incoming slope
(95% credible
interval)
Outgoing slope
(95% credible
interval)
Transition period
(Duration)
CTP
(99% credible
interval)
Global
0.65
(0.50, 0.80)
−0.12
(−0.22, −0.01)
Jan, 1989 - Sep, 1994
(69 months)
Nov, 1993
(Aug, 1992 to
May, 1995)
Barrow
(Station 1)
σ12 ≈ 2.97
0.55
(0.39, 0.72)
−0.19
(−0.24, −0.15)
Jan, 1989 - Aug, 1994
(68 months)
Mar, 1993
(Jul, 1992 to
Nov, 1993)
Cap Matatula
(Station 2)
σ22 ≈ 1.01
0.74
(0.56, 0.94)
−0.10
(−0.13, −0.07)
May, 1989 - Jan, 1995
(69 months)
May, 1994
(Oct, 1993 to
Feb, 1995)
Mauna Loa
(Station 3)
σ32 ≈ 1.81
0.67
(0.52, 0.83)
−0.12
(−0.16, −0.09)
Mar, 1989 - Jun, 1994
(64 months)
Aug, 1993
(Dec, 1992 to
May, 1994)
South Pole
(Station 4)
σ42 ≈ 0.30
0.60
(0.42, 0.77)
−0.12
(−0.15, −0.10)
Dec, 1988 - Nov, 1995
(84 months)
Sep, 1994
(Apr, 1994 to
Mar, 1995
Niwot Ridge
(station 5)
σ52 ≈ 0.82
0.56
(0.34, 0.79)
−0.11
(−0.13, −0.08)
Nov, 1988 - Jul, 1994
(69 months)
Aug, 1993
(Dec, 1992 to
May, 1994)
Mace Head
(Station 6)
σ62 ≈ 1.20
0.59
(0.44, 0.74)
−0.11
(−0.13, −0.08)
Sep, 1988 - Jan, 1994
(65 months)
Mar, 1993
(Jul, 1992 to
Dec, 1993)
Cape Grim
(Station 7)
σ72 ≈ 0.29
0.78
(0.68, 0.93)
−0.07
(−0.09, −0.06)
Mar, 1989 - Nov, 1994
(69 months)
Jun, 1994
(Jan, 1994 to
Oct, 1994)
Ragged Point
(Station 8)
σ82 ≈ 2.25
0.70
(0.55, 0.86)
−0.10
(−0.14, −0.07)
Jan, 1989 - Apr, 1994
(64 months)
Aug, 1993
(Nov, 1992 to
Jun, 1994)
ble 2). This may be due to the extended CFC-11 phase-out schedules contained
in the Montréal Protocol – 1996 for developed countries and 2010 for developing
countries. Thus, many countries at various geographical locations continued to
contribute CFCs to the atmosphere during the 152 months in our study period,
while those at other locations had stopped. Overall, the eight transitions began between September, 1988 and May, 1989, a period of only nine months. This reflects
the success and acceptability of the protocol all over the globe. Durations of the
13
transition periods are very similar among stations except for South Pole. Thus, it
took almost the same amount of time in different parts of the world for CFC-11
to start dropping linearly with an average rate of about 0.12 ppt per month. The
last column of the table indicates that the global estimate of the CTP is contained
in all the station-specific 99% credible intervals except for those of South Pole and
Cape Grim, with the lower bound for South Pole coming three months later than
that for Cape Grim, making South Pole a greater outlier. Specifically, the transition
for South Pole was estimated to take place over 84 months, an extended period
compared to the other stations. This could be due to the highly unusual weather
conditions specific to the location. CFCs are not disassociated during the long
winter nights in the South Pole. Only when sunlight returns in October does ultraviolet light break the bond holding chlorine atoms to the CFC molecule (Ozone
Facts, NASA, “http://ozonewatch.gsfc.nasa.gov/facts/hole.html”). For this reason, it may be expected for CFCs to remain in the atmosphere over the South Pole
for a longer period of time, and hence, an extended transition period. Indeed,
our findings for South Pole are very similar to those reported by Ghude, Jain and
Arya (2009). To evaluate the trend, the authors used the NASA EdGCM model –
a deterministic global climate model wrapped in a graphical user interface. They
found the average growth rate to be 9 ppt per year for 1983-1992, and about −1.4
ppt per year for 1993-2004, turning negative in the mid 1990’s. With our statistical
modeling approach, we estimated a linear growth rate of 0.6 ppt per month (7.2
ppt per year) prior to December, 1988, a transition between December, 1988 and
November, 1995, and a negative linear phase (−0.12 ppt per month, or −1.44 ppt
per year) after November, 1995.
The eight estimates of within-station variability (σi2 ) are given in the first column of Table 2. We noticed earlier from the profile plot (Figure 2) that Barrow
measurements are more variable, whereas Cape Grim and South Pole show little
variation over time. This is reflected in their within-station variance estimates of
2.97, 0.29, and 0.30, respectively. This also explains, at least partially, as to why
the credible intervals of the South Pole and Cape Grim CTPs did not contain the
estimate of the global CTP. As expected, we found a high estimated correlation between consecutive within-station error terms (AR(1) parameter ρ ≈ 0.81 with 95%
credible interval (0.77, 0.85)).
In summary, our methodology provides a satisfactory fit of the longitudinal
bent-cable model to the CFC-11 data under the assumption of an AR(1) withinstation correlation structure. We are currently investigating the goodness of this
fit in comparison to a higher order AR model. The generality of our methodology
with respect to the order of the AR process makes it a flexible tool in analyzing
different types of longitudinal changepoint data.
5
Conclusion
CFC-11 is a major source for the depletion of stratospheric ozone around the globe.
Since the Montréal Protocol came into effect, a global decrease in the CFC-11 has
14
been observed, a finding confirmed by our analysis. Our analysis, using the longitudinal bent-cable model assuming an AR(1) within-station dependence structure,
revealed a gradual rather than abrupt change, the latter of which is assumed by
most standard changepoint models. This makes scientific sense due to the fact
that CFC molecules can stay in the upper atmosphere for about a century, and
their breakdown does not take place instantaneously. The substantial decrease in
global CFC-11 levels after the gradual change shown by our analysis suggests that
the Montréal Protocol can be regarded as a successful international agreement to
reduce the negative impact of CFCs on the ozone layer.
One possible extension of the proposed bent-cable model for longitudinal data
is to incorporate individual-specific covariate(s) (e. g. effects of instrumentations
specific to different stations in measuring the CFC-11 data) to see if they could
partially explain the variations within and between the individual profiles. One
may incorporate this by modeling the individual-specific coefficients θ i ’s to vary
systematically with the covariate(s) plus a random component. This specification
makes the estimation method more complicated, and is currently in progress.
Acknowledgement
This work is partially supported by NSERC through Discovery Grants to G. Chiu
(RGPIN261806-05) and J.A. Dubin (RGPIN327093-06), and by the Government of
Ontario through an Ontario Graduate Scholarship to S.A. Khan (000113006). We
extend our appreciation to the NOAA/ESRL and ALE/GAGE/AGAGE global
network program, who made their CFC-11 data available to the public. We thank
Geoffrey S. Dutton, NOAA/ESRL, and Derek Cunnold, ALE/GAGE/AGAGE global
network program, for clarifications of their data used in this article. We also thank
Peter X. Song, Professor, Department of Statistics and Actuarial Science, University
of Waterloo, Waterloo, Ontario, for his temporary support through NSERC.
15
Appendix 1: Level 1 of the Hierarchical Model
We can write the first level of our hierarchical model as in Equation (5). This is
formulated from the regression model represented by Equations (1)-(3), and under
the assumptions for ij ’s, i.e.
yij = fij + ij , ij =
p
X
2
φk i,j−k + uij , uij ∼ N (0, σui
).
k=1
Here,
yij = fij + ij
= β0i + β1i tij + β2i qij + ij
= β0i + β1i tij + β2i qij + φ1 i,j−1 + φ2 i,j−2 + . . . + φ1 i,j−p + uij
= β0i + β1i tij + β2i qij + φ1 (yi,j−1 − β0i − β1i ti,j−1 − β2i qi,j−1 )+
φ2 (yi,j−2 − β0i − β1i ti,j−2 − β2i qi,j−2 ) + . . . +
φp (yi,j−p − β0i − β1i ti,j−p − β2i qi,j−p ) + uij
p
p
p
X
X
X
= β0i 1 −
φk + β1i tij −
φk ti,j−k + β2i qij −
φk qi,j−k +
k=1
k=1
k=1
p
X
φk yi,j−k + uij
k=1
= β0i 1 −
p
X
φk + β1i xij + β2i rij +
k=1
p
X
φk yi,j−k + uij
k=1
= µij + uij .
Therefore, we specify the first level as
(2)
(1)
2
2
Yi | yi , β i , αi , σui
, φ ∼ MVN (µi , σui
Ii ).
Appendix 2: Full Conditionals
−1
−2
Full conditionals for β i , β, α∗ , D−1
1 , D2 , σui , φ and αi are given in Equations
(8)-(14) and (16), respectively. Here we will present the derivations for β i , β, D−1
1 ,
−2
−1
∗
σui , φ and αi . Full conditionals for α and D2 can be derived in the same way
as for β and D−1
1 , respectively.
In general, full conditionals are derived from the joint distribution of the vari
0
(2)0
(2)0
ables of interest (Gilks, 1996). Letting Y(2) = Y1 , . . . , Ym , the joint distribu-
16
tion of our hierarchical model (5)-(7) is given by
−2
−2
−1
π(Y(2) , β 1 , . . . , β m , α1 , . . . , αm , σu1
, . . . , σum
, β, α∗ , D−1
1 , D2 , φ)
−1
−2
−2
, β, α∗ , D−1
, . . . , σum
= π Y(2) | β 1 , . . . , β m , α1 , . . . , αm , σu1
1 , D2 , φ) ×
−2
−2
−1
π(β 1 , . . . , β m | α1 , . . . , αm , σu1
, . . . , σum
, β, α∗ , D−1
1 , D2 , φ) ×
−1
−2
−2
, β, α∗ , D−1
, . . . , σum
π(α1 , . . . , αm | σu1
1 , D2 , φ) ×
2
2
−1
π(σu1
, . . . , σum
| β, α∗ , D−1
1 , D2 , φ) ×
−1
∗
−1
−1
π(β| α∗ , D−1
1 , D2 , φ) × π(α | D1 , D2 , φ) ×
−1
−1
π(D−1
1 | D2 , φ) × π(D2 | φ) × π(φ)
=
m
hY
m
i hY
i
(2)
2
Ii ) ×
MVN (Yi | µi , σui
MVN (β i | β, D1 ) ×
i=1
i=1
m
hY
m
i
hY
i
−2 a ab
MVLN (αi | α∗ , D2 ) ×
G σui
| ,
×
2
2
i=1
i=1
MVN β| h1 , H1 × MVN α∗ | h2 , H2 ×
−1
−1
W D−1
× W D−1
× MVN φ| h3 , H3 ) .
1 | ν1 , (ν1 A1 )
2 | ν2 , (ν2 A2 )
Now, to construct the full conditional of a variable, say β i , we need only to pick
out the terms in the joint density which involve β i . Note that any term which does
not depend on β i can be taken as a proportionality constant in the full conditional.
Full Conditional for β i
Picking out the terms in the joint density which involve β i , we get
π(β i | .) ∝ exp
n
−
o
0 (2)
1
1
(2)
0
−1
y
−
µ
y
−
µ
−
(β
−
β)
D
(β
−
β)
.
i
i
i
1
i
i
2
2σui
2 i
Here,
yij − µij = yij − β0i 1 −
p
X
φk − β1i xij − β2i rij −
k=1
p
p
X
φk yi,j−k
k=1
X = zij − β0i 1 −
φk − β1i xij − β2i rij for j = p + 1, . . . , ni .
k=1
17
(2)
In vector-matrix notation, yi − µi = zi − Xi β i . Therefore,
0
o
1 −2
0
−1
zi − Xi β i zi − Xi β i + (β i − β) D1 (β i − β)
σ
π(β i | .) ∝ exp −
2 ui
n 1
−2 0
−2 0
−2 0
−2 0
= exp −
zi Xi β i − σui
β i X0i zi + σui
β i X0i Xi β i +
σui
zi zi − σui
2
o
0
0
0
0
−1
−1
−1
−1
β i D1 β i − β i D1 β − β D1 β i + β D1 β
n
n 1
−2 0
−2 0
−2 0
∝ exp −
− σui
zi Xi β i − σui
β i X0i zi + σui
β i X0i Xi β i +
2
o
0
0
0
−1
−1
−1
β i D1 β i − β i D1 β − β D1 β i
−2 0
zi zi and β 0 D−1
proportionality follows because σui
1 β do not depend on β i
n 1
o
0
0
−2 0
−2 0
0
0
−1
−1
= exp −
− 2 σui β i Xi zi + σui β i Xi Xi β i + β i D1 β i − 2 β i D1 β
2
−2 0
σui zi Xi β i and β 0 D−1
1 β i are scalars
n 1
o
0
0
−2
−2
0
−1
0
−1
− 2β i σui Xi zi + D1 β + β i σui Xi Xi + D1 β i
= exp −
2
n 1
o
0
0
−2
−1
0
−1
− 2β i σui Xi zi + D1 β + β i Mi β i
= exp −
2
−2
0
−1
where M−1
i = σui Xi Xi + D1
n 1
0
−2
−2
0
−1
∝ exp −
− β 0i σui
X0i zi + D−1
1 β − σui Xi zi + D1 β β i +
2
0
o
−2
−2
0
−1
0
−1
β 0i M−1
β
+
σ
X
z
+
D
β
M
σ
X
z
+
D
β
i
i
i i
1
i i
1
i
ui
ui
h
−2
−2
β 0i σui
X0i zi + D−1
is a scalar, and so is β 0i σui
X0i zi + D−1
1 β
1 β =
0
0
−2
−2
0
−1
σui
X0i zi + D−1
1 β β i . Proportionality follows because σui Xi zi + D1 β Mi
−2
σui
X0i zi + D−1
1 β
does not depend on β i
i
n 1
0 −1 o
−2
−2
0
−1
= exp −
β i − Mi σui
X0i zi + D−1
β
M
β
−
M
σ
X
z
+
D
β
,
i
i
1
i i
1
i
ui
2
which is proportional to the
pdf of a trivariate normal distribution with mean vec−2
−1
0 ∗
tor Mi σui Xi yi + D1 β and covariance matrix Mi . Therefore,
−2
0
−1
β i | . ∼ MVN Mi σui Xi zi + D1 β , Mi .
18
Full Conditional for β
Picking out the terms in the joint density which involve β, we get
m
Y
o
n 1
o
1
0
−1
(β i − β)0 D−1
(β
−
β)
exp
−
(β
−
h
)
H
(β
−
h
)
1
1
i
1
1
2
2
i=1
m
io
n 1 hX
0
−1
(β i − β)0 D−1
(β
−
β)
+
(β
−
h
)
H
(β
−
h
)
= exp −
1
1
i
1
1
2 i=1
m
n 1 hX
0
0
0
−1
−1
−1
= exp −
(β 0i D−1
1 β i − β i D1 β − β D1 β i + β D1 β)+
2 i=1
io
0
−1
0
−1
0
−1
β 0 H−1
β
−
β
H
h
−
h
H
β
+
h
H
h
1
1
1
1
1
1
1
1
π(β| .) ∝
exp
n
−
m
n 1 hX
0
0
−1
−1
∝ exp −
(−β 0i D−1
1 β − β D1 β i + β D1 β)+
2 i=1
io
0
0
−1
−1
0
−1
β H1 β − β H1 h1 − h1 H1 β
0
−1
proportionality follows because β 0i D−1
1 β i and h1 H1 h1 do not depend on β
m
io
n 1h
X
0
0
0
−1
−1
−1
− 2β 0 D−1
β
+
m
β
D
β
+
β
H
β
−
2β
H
h
= exp −
1
i
1
1
1
1
2
i=1
0 −1
0
0
−1
−1
0
−1
β i D1 β and h01 H−1
1 β are scalars, and so are β i D1 β = β D1 β i , h1 H1 β =
β 0 H1−1 h1
n 1
o
0
−1
−1
= exp −
− 2 β 0 (D−1
β̃
+
H
h
)
+
β
U
β
1
1
1
1
2
m
X
−1
−1
where β̃ =
β i , and U−1
1 = m D1 + H1
i=1
n 1
0
−1
−1
−1
0
−1
− β 0 (D−1
∝ exp −
1 β̃ + H1 h1 ) − (D1 β̃ + H1 h1 ) β + β U1 β +
2
o
−1
0
−1
−1
(D−1
β̃
+
H
h
)
U
(D
β̃
+
H
h
)
1
1
1
1
1
1
1
h
0
−1
−1
−1
β 0 (D−1
1 β̃ + H1 h1 ) is a scalar, and so is β (D1 β̃ + H1 h1 ) =
0
−1
−1
0
−1
(D−1
1 β̃ + H1 h1 ) β. Proportionality follows because (D1 β̃ + H1 h1 ) U1
i
−1
(D−1
β̃
+
H
h
)
does
not
depend
on
β
1
1
1
n 1
o
−1
−1
−1
−1
0
−1
= exp −
β − U1 (D1 β̃ + H1 h1 )] U1 [β − U1 (D1 β̃ + H1 h1 ) ,
2
19
which is proportional to the pdf of a trivariate normal distribution with mean vec−1
tor U1 D−1
and covariance matrix U1 . Therefore,
1 β̃ + H1 h1
−1
β| . ∼ MVN U1 D−1
β̃
+
H
h
,
U
1
1 .
1
1
Full Conditional for D−1
1
Picking out the terms in the joint density which involve D1 , we get
π(D−1
1 | .) ∝
m
Y
−1
MVN (β i | β, D1 ) W(D−1
1 | ν1 , (ν1 A1 ) )
i=1
m Y
n 1
o
1
0
−1
∝
exp − (β i − β) D1 (β i − β)
1/2
|D
|
2
1
i=1
n ν
o
ν1 −3−1
1
−1
2
|D−1
|
exp
−
tr(A
D
)
1
1
1
2
m
1
1 X
0
−1
=
exp −
(β − β) D1 (β i − β)
|D1 |m/2
2 i=1 i
o
n ν
ν −3−1
1
−1
−1 1 2
tr(A1 D1 )
exp −
|D1 |
2
m
i
m+ν1 −3−1
1 hX
−1
0
−1
−1
2
= |D1 |
exp −
(β − β) D1 (β i − β) + tr(ν1 A1 D1 )
2 i=1 i
m
i
(m+ν1 )−3−1
1 hX
0
−1
−1
−1
2
exp −
tr (β i − β) (β i − β) D1 + tr(ν1 A1 D1 )
= |D1 |
2 i=1
if we let d1i = (β i − β)0 D−1
and d2i = (β i − β), then d1i d2i = tr(di1 di2 ) =
1
tr(d2i d1i ) by the property of the trace of a matrix
m
i
(m+ν1 )−3−1
1 h X
0
−1
−1
−1
2
exp −
tr
(β i − β) (β i − β) D1 + tr(ν1 A1 D1 )
= |D1 |
2
i=1
m
h
i
X
(m+ν1 )−3−1
1
−1
0
−1
2
= |D1 |
exp −
tr
(β i − β) (β i − β) + ν1 A1 D1
,
2
i=1
which is proportional to a Wishart pdf with df m+ν1 and scale matrix
−1
(β i − β)0 + ν1 A1 . Therefore,
D−1
1 |
. ∼ W m + ν1 ,
m
hX
i−1 (β i − β) (β i − β) + ν1 A1
.
0
i=1
20
Pm
i=1
(β i − β)
−2
Full Conditional for σui
2
Picking out the terms in the joint density which involve σui
, we get
−2
π(σui
|
.) ∝
∝
=
∝
=
a ab Ii ) G
,
2 2
n
0 (2)
o −2 a ab 1
1
(2)
G σui ,
exp − 2 yi − µi yi − µi
2
|σui
Ii |1/2
2σui
2 2
o
n
0
1
1
−2 a ab
z
−
X
β
z
−
X
β
G
σ
,
exp
−
i
i i
i
i i
ui
2
2
|σui
Ii |1/2
2σui
2 2
n
0
o
ni2−p
1
−2
σui
exp − 2 zi − Xi β i zi − Xi β i
2σui
a
ab −2 −2 2 −1
σui
σ
exp −
2 ui
0
ni −p+a
zi − Xi β i zi − Xi β i + ab −2
−1
−2
2
σui ,
σui
exp −
2
(2)
MVN (Yi |
2
µi , σui
−2 σui
which is proportional to a gamma pdf with shape parameter (ni − p + a)/2 and rate
0
parameter zi − Xi β i zi − Xi β i + ab /2. Therefore,
−2
σui
|
0
ni − p + a (zi − Xi β i (zi − Xi β i + ab
.∼G
,
.
2
2
Full Conditional for φ
Picking out the terms in the joint density which involve φ, we get
π(φ| .) ∝
m
Y
(2)
2
MVN (yi | µi , σui
Ii ) MVN (φ| h3 , H3 )
i=1
m
Y
n
n 1
o
0 (2)
o
1
(2)
0
−1
∝
exp − 2 yi − µi yi − µi exp − (φ − h3 ) H3 (φ − h3 ) .
2σui
2
i=1
Here, for j = p + 1, p + 2, . . . , ni ,
yij − µij = yij − β0i 1 −
p
X
φk − β1i xij − β2i rij −
k=1
= yij −
p
X
φ yi,j−k − β0i 1 −
β2i qij −
p
X
k=1
p
X
φk qi,j−k
φ yi,j−k
k=1
k=1
p
X
k=1
21
φk
p
X
− β1i tij −
φk ti,j−k −
k=1
= (yij − β0i − β1i tij − β2i qij ) − φ1 (yi,j−1 − β0i − β1i ti,j−1 − β2i qi,j−1 )−
φ2 (yi,j−2 − β0i − β1i ti,j−2 − β2i qi,j−2 ) − · · · −
φp (yi,j−p − β0i − β1i ti,j−p − β2i qi,j−p )
= ij − φ1 i,j−1 − φ2 i,j−2 − . . . − φp i,j−p
where ij = yij − β0i − β1i tij − β2i qij for j = p + 1, p + 2, . . . , ni . In vector-matrix
(2)
notation, yi − µi = i − Wi φ, where i = (i,p+1 , i,p+2 , . . . , i,ni )0 , and Wi is a
(ni − p) × 3 matrix as given in Equation (15). Thus,
m
Y
n
n 1
o
0 (2)
o
1
(2)
(φ
−
h
)
exp − 2 yi − µi yi − µi exp − (φ − h3 )0 H−1
3
3
2σui
2
i=1
m
n 1
o
n
Y
0
o
1
(φ
−
h
)
=
exp − 2 i − Wi φ i − Wi φ exp − (φ − h3 )0 H−1
3
3
2σui
2
i=1
m
i
0
1 h X −2
0
−1
= exp −
σ
i − Wi φ i − Wi φ + (φ − h3 ) H3 (φ − h3 )
2 i=1 ui
m
1 h X −2 0
σui i i − 0i Wi φ − φ0 W0i i + φ0 W0i Wi φ + φ0 H−1
= exp −
3 φ−
2 i=1
i
0
−1
0
−1
0
−1
φ H3 h3 − h3 H3 φ + h3 H3 h3
π(φ| .) ∝
m
i
1 h X −2
0
0
0
0
0
0
−1
−1
σ
− 2 φ Wi i + φ Wi Wi φ + φ H3 φ − 2φ H3 h3
∝ exp −
2 i=1 ui
0
0
0
0
0
−1
i Wi φ and h03 H−1
3 φ are sclars, and so are i Wi φ = φ Wi i , h3 H3 φ =
−2 0
0
−1
φ0 H−1
3 h3 . Proportionality follows because σui i i and h3 H3 h3 do not depend on φ
m
m
X
X
1h
−2
−2
− 2 φ0
σui
W0i i + φ0
σui
W0i Wi φ + φ0 H−1
= exp −
3 φ−
2
i=1
i=1
i
2φ0 H−1
3 h3
m
m
X
X
i
1h
0
0
−2
−2
0
−1
0
−1
= exp −
−2φ
σui Wi i + H3 h3 + φ
σui Wi Wi + H3 φ
2
i=1
i=1
m
X
i
1h
0
0
−2
0
−1
−1
−2φ
σui Wi i + H3 h3 + φ V φ
= exp −
2
i=1
22
−1
where V
=
m
X
−2
σui
W0i Wi + H−1
3
i=1
m
m
X
X
0
1h
0
−2
−2
0
−1
σui Wi i + H3 h3 −
−φ
σui
W0i i + H−1
h
∝ exp −
φ+
3
3
2
i=1
i=1
m
m
X
0 X
i
0
−2
−2
−1
0
−1
0
−1
φ V φ+
σui Wi i + H3 h3 V
σui Wi i + H3 h3
i=1
i=1
m
m
h X
X
0
−2
−2
−1
0
φ0
σui
W0i i + H−1
h
=
h
σ
+
H
W
3
3 φ because
3
3
i i
ui
i=1
φ0
m
X
i=1
−2
σui
W0i i + H−1
3 h3 is a scalar. Proportionality follows because
i=1
m
X
−2
σui
W0i
i +
H−1
3
h3
0
i=1
V
m
X
−2
σui
W0i i + H−1
3 h3
i
does not depend on φ
i=1
m
X
i0 −1
1h
−2
φ−V
σui
W0i i + H−1
h
V
= exp −
3
3
2
i=1
m
h
X
i
−2
0
−1
φ−V
σui Wi i + H3 h3
,
i=1
which isPproportional to the pdfof a p-variate normal distribution with mean vecm
−2
−1
0
tor V
and covariance matrix V. Therefore,
i=1 σui Wi i + H3 h3
X
m
−2
0
−1
φ| . ∼ MVN V
σui Wi i + H3 h3 , V .
i=1
Full Conditional for αi
Picking out the terms in the joint density which involve αi , we can express the full
conditional for αi only up to a proportionality constant, that is,
n
o
1
1
0
exp − 2 (zi − Xi β i ) (zi − Xi β i ) ×
π(αi | .) ∝
γi τi
2σui
n 1
o
∗
∗ 0
−1
∗
∗
exp − (αi − α ) D2 (αi − α ) .
2
References
[1] Box, G., Jenkins, G. M. (1994). Time Series Analysis: Forecasting and Control, 3rd
ed. Prentice Hall.
[2] Chib, S. (1993). Bayes Regression with Autoregressive Errors: A Gibbs Sampling Approach. Journal of Econometrics, 58, 275-294.
23
[3] Chiu, G., Lockhart, R. (2008). Some Mathematical and Algorithmic Details
for Bent-Cable Regression with AR(p) Noise. Working Paper Series No. 200607. Department of Statistics and Actuarial Science, University of Waterloo, Waterloo, Ontario. “http://www.stats.uwaterloo.ca/stats navigation/
techreports/06WorkingPapers/2006-07.pdf”
[4] Chiu, G., Lockhart, R. (submitted). Bent-Cable Regression with Autoregressive Noise.
[5] Chiu, G., Lockhart, R., Routledge, R. (2006). Bent-Cable Regression Theory
and Applications. Journal of the American Statistical Association, 101,542-553.
[6] Chlorofluorocarbons. In The Columbia Encyclopedia, Columbia University Press, New York, 6th ed. “http://www.encyclopedia.com/doc/1E1chlorofl.html”
[7] Fitzmaurice, G. M., Laird, N. M., Ware, J. W. (2004). Applied Longitudinal Analysis, Wiley, New Jersey.
[8] Ghude, S. D., Jain, S. L., Arya, B. C. (2009). Temporal Evolution of Measured
Climate Forcing Agents at South Pole, Antarctica. Current Science, 96, 49-57.
[9] Gilks, W. R. (1996). Full Conditional Distributions. In Markov Chain Monte
Carlo in Practice. (eds Gilks, W. R., Richardson, S., Spiegelhalter, D. J.), pp.
75-88. Chapman and Hall, London.
[10] Gilks, W. R., Richardson, S., Spiegelhalter, D. J. (1996). Introducing Markov
Chain Monte Carlo. In Markov Chain Monte Carlo in Practice. (eds Gilks, W. R.,
Richardson, S., Spiegelhalter, D. J.), pp. 1-19. Chapman and Hall, London.
[11] Givens, G. H., Hoeting, J. A., Sidani, S. (2005). Computational Statistics. Wiley,
New Jersey.
[12] Khan, S. A., Chiu, G., Dubin, J. A. (2009). Atmospheric Concentration of
Chlorofluorocarbons: Addressing the Global Concern with the Longitudinal
Bent-Cable Model. CHANCE, to appear.
[13] McKnight, P. E., Figueredo, A. J., Sidani, S. (2007). Missing Data: A Gentle
Introduction. Guilford Press, New York.
[14] Montreal Protocol. In The Columbia Encyclopedia, Columbia University Press,
New York, 6th ed. http://www.encyclopedia.com/doc/1E1-MntPrtcl.html
[15] Ozone Facts (2008). NASA. “http://ozonewatch.gsfc.nasa.gov/facts/
hole.html”
[16] Ozone Layer Depletion (2008). U.S. Environmental Protection Agecy. “http://
www.epa.gov/ozone/defns.html”
24
[17] Ozone Layer Depletion - Science (2008). U.S. Environmental Protection Agecy.
“http://www.epa.gov/ozone/science/ods/classone.html”
[18] Ozone Science: The Facts Behind the Phaseout (2008). U.S. Environmental
Protection Agecy. “http://www.epa.gov/ozone/science/sc fact.html”
[19] Smith, A. F. M., Roberts, G. O. (1993). Bayesian Computation Via Gibbs Sampler and Related Markov Chain Monte Carlo Methods. Journal of the Royal
Statistical Society. Series B, 55, 3-23.
[20] The Ozone Hole Tour (1998). University of Cambridge. “http://www.atm.ch.
cam.ac.uk/tour/part3.html”
25