A BAYESIAN ESTIMATOR OF THE LINEAR REGRESSION

A BAYESIAN ESTIMATOR OF
THE LINEAR REGRESSION MODEL
WITH AND~ICERTAIN INEQUALITY CONSTRAINT
W.E. Griffiths and A.T.K. Wan
No. 74 - May, 1994
ISSN
ISBN
0 157 0188
1 86389 191 9
A BAYESIAN ESTIMATOR OF THE LINEAR REGRESSION MODEL
WITH AN UNCERTAIN INEQUALITY CONSTRAINT
William E. Griffiths
Department of Econometrics
University of New England
and
Alan T.K. Wan
Department of Econometrics
University of New South Wales
March 1994
ABSTRACT
In this paper, we consider Bayesian estimation of the normal linear
regression model with an uncertain inequality constraint. We adopt a
non-informative prior and uncertainty concerning the inequality restriction
is represented by a prior odds ratio. Furthermore, we derive the sampling
theoretic risk of the resulting Bayesian inequality pre-test estimator and
numerically determine the optimal odds ratio according to a mini-max regret
criterion. An example taken from Geweke (1986) is used to illustrate our
methodology.
ADDRESS FOR CORRESPONDENCE : Professor William E. Griffiths, Department
of Econometrics, University of New England, Armidale, N.S.W. 2351,
Australia.
FAX : +61-67-733607;
PHONE : +61-67-732319
EMAIL : [email protected]
1.
INTRODUCTION
The problem of estimating the coefficients of a linear regression model
subject to inequality constraints has been a subject of interest for both
Studies from a sampling theory
sampling theorists and Bayesians.
standpoint (see, for example, Lovell and Prescott (1970), Judge and Yancey
(1981, 1986), Yancey and Bohrer (1992) and Wan (1993, 1994a)) examine the
properties of various inequality constrained and pre-test estimators and
the dependence of the pre-test estimators’ properties on test size.
Studies from a Bayesian standpoint include O’Hagan (19"/3), Davis (1978),
Geweke (1986, 198Ba, 1988b), Griffiths (1988), Barnett, Geweke and Yue
(1991) and Chalfant and Wallace (1992).
O’Hagan and Davis investigate
posterior quantities that result from a natural conjugate prior and single
The later
and multiple linear inequality restrictions, respectively.
papers are more computationally based and show how importance sampling can
be used to estimate posterior quantities for a number of more general
inequality restrictions, and noninformative priors.
One difficulty with inequality restricted estimation, whether it be
from a sampling theory or a Bayesian point of view, is that it assumes an
unfaltering belief in the prior information implied by the inequality
constraints. Few researchers are likely to be so stubborn that they would
continue to believe in the prior constraints, when faced with strong sample
information to the contrary.
Within the sampling theory framework,
allowing for the possibility that inequality restrictions might be
incorrect,
even if this possibility is remote, means using a statistical
test and
the resulting so called inequality pre-test estimator.
This
estimator has received considerable attention in the literature (see Wan
(1993, 1994a) for a list of references). However, to our knowledge, the
analogous Bayesian estimator has not been considered. Although Hasegawa
(1989) has considered a "Bayesian inequality pre-test’° estimator, this
estimator is defined by a rule which chooses between the Bayesian
inequality restricted and the unrestricted maximum likelihood estimators
based on the outcome of a statistical test. A more appropriate Bayesian
procedure is to allow for the (remote) possibility that the inequality
restrictions are incorrect by placing a nonzero prior probability on the
region of the parameter space not covered by the restrictions.
The
resulting Bayesian "inequality pre-test" estimator that recognizes this
nonzero prior probability will be a weighted average of the Bayesian
inequality restricted estimators from different regions of the parameter
space, with weights given by the posterior probabilities attached to each
region.
In this paper, we examine the nature and properties of this Bayesian
estimator. We adopt a non-informative prior with uncertainties concerning
the inequality hypotheses measured by a prior odds ratio defined over the
null and alternative hypotheses. In Section 2, for the case of a single
inequality constraint, expressions are derived for the posterior means
under each hypothesis and for the Bayesian inequality pre-test estimator
(the posterior mean that recognizes the hypothesis uncertainty).
In
Section 3 we compare the performance of this estimator with its sampling
theory counterpart; we derive and numerically evaluate its sampling
theoretic risk, and determine the optimal prior odds ratio according to a
mini-max regret criterion. Under this criterion, the Bayesian estimator is
1.7 times more efficient than its sampling theory counterpart. An example
taken from Geweke (1986) based on the Pindyck and Rubinfeld (1981) rental
data is used to illustrate the methodology for the more general case with
more than one inequality restriction.
2.
MODEL FRAMEWORK AND THE ESTIMATORS
Consider the standard linear regression model
y = X~ + c ;
c ~ N(o,mzl)
(1)
where y and ~ are n x 1 vectors; X is a n x k non-stochastic matrix of full
If we adopt a
rank; ~ is a k × I unknown coefficient vector.
noninformative prior p(~,~) ~ ~-i, and no prior inequalities, the marginal
posterior pdf for ~ is the multivariate t pdf
p(~ly) = c[v~-z + (~-~)’
S{~-~)]-(v*k}12
(2)
where S -- X’X, the location vector is the unrestricted maximum likelihood
estimator ~ = S-IX’y, v = n-k is the degrees of freedom parameter, ~r2 =
(y-X~)’ (y-X~)/v, the
precision matrix is [~rZS-1]-I, and c is the
normalizing constant.
2
Now let us introduce information about the coefficient vector /~, in
the form of a single inequality hypothesis:
(3)
where C’ is a 1 x k known vector and r is a known scalar. For a researcher
with the noninformative prior p(l~,~r) ~ ~r-1, and no special prior weight on
H: C’~ ~ r, the posterior probability of the inequality hypothesis being
o
true is
(4)
~(HolY) = f I(Ho) P(~IY)CI~
where I(Ho) is the indicator function which is equal to 1 when ~ satisfies
H and 0 otherwise. Given the noninformative nature of the prior pdf from
o
which ~(HolY) was derived, we can also view ~(HolY) as a measure of the
sample support for Hoo As popularized by Geweke (1986), we can estimate
~(HolY) by sampling (via computer) from the pdf in (2) and computing the
proportion
of observations
~ thatprior
satisfy
Ho.
Consider
a researcheron
whose
information
is such that H° holds
with certainty, but, within the region C’~ >- r, the prior for ~ is
noninformative. We write this prior pdf as
(5)
p(/S,o-~Ho) ¢x I(Ho) o"-~
The
corresponding marginal
posterior pdf for ~ is the truncated
multivariate t pdf
p(~ZI
I(Ho)P(~ly)
(6)
Y,Ho) =
~(HolY)
Given this posterior pdf, and a quadratic loss function, the posterior mean
E[~ I Y,Ho] is the Bayesian (inequality restricted) estimator for 8.
As
described by Geweke (1986) we can estimate this mean by sampling from
p(~ly) and computing the sample average of those observations on ~ that
satisfy H .
o
A useful analytical expression for E[~Iy, Ho] can be derived for the
case where we have an inequality on a single coefficient. Suppose that it
is the
element
of Cthe
that
is subject
to the inequality
that
C’first
= (I,0
..... O) and
restriction
becomes
E1 -> r. Inrestriction,
Appendix so
A it
is shown that E[~IIy,Ho] -- ~I’~o is given by
3
2 2 -]-(v-1)/2
all~mll + {r-~1)2/(v¢~ all)j
_
,
(7)
~I,Ho---- ~1 +
where m =
r(Z~-)v
a is the 1st diagonal element of S-1, the
F(_~) (~v) 1/2(v_i)’ 11
(~(r-~1)
probability P{HoIY) = 1 - ¢,~j measures the sample support for Ho, and
11
~ is the Student’s $ distribution function with v degrees of freedom. A
similar expression for estimating a normal mean is given by Hasegawa
O’Hagan gives an alternative expression in terms of beta
Note that, in (7), the unrestricted estimator ~1 is adjusted
functions.
(1989).
upwards to accommodate the restriction 61 -~ r. If ~i is much larger than r
(the sample strongly supports the restriction), the numerator in the
adjustment term will be approximately zero and the denominator
approximately one, leading to a negligible adjustment. For ~I much less
than r (the sample contradicts the restriction), both the numerator and the
denominator will approach zero, and the adjustment will depend on the
relative rates of convergence.
The posterior mean for elements in B that are not subject to inequality
restrictions is also derived in Appendix A. Using Sz as an example, its
posterior mean (inequality restricted estimator) is given by
~2,H0 = ~2 + h:1Ih12(~1,H0 ~I)
(8)
where h and h depend on the correlation structure of X’X.
II
See
12
Appendix A for their exact definitions. Note that the adjustment to the
unrestricted estimator depends on the correlation structure in X’X and the
difference between the restricted and unrestricted estimators for 6f
As mentioned in the introduction, one difficulty with the inequality
restricted estimator is that it assumes an unquestioning belief in Ho. If
~(HolY) is very small, we may want to question our prior information that
H is true. The analogous sampling theory situation is to (pre-)test the
o
-> r vs. Hi: ~i < r, set the level of significance ~
hypothesis
that
Ho: Bithan
to
something
greater
0 and to question our prior information if H° is
rejected.
This pre-testing procedure gives rise to the sampling theory
4
inequality pre-test estimator, which chooses between the estimators implied
by the null and the alternative hypotheses according to the outcome of the
statistical test.
A great
deal has been learnt about the sampling
properties of this estimator in recent years (see Wan (1993, 1994a) for
references).
The Bayesian alternative to this pre-testing procedure is to assign a
non-zero probability to H1.
Let this probability be denoted by P(H1) and
let P(Ho) = 1 - P(HI). Presumably, the prior odds ratio T = P(HI)/P(Ho) -~
1, as we think that H is at least as likely to be true as H. This
0
1
follows from the sampling theory procedure that sets the critical value, c,
such that c - r -~ 0. As soon as we recognize a pr~or~ that H may be true,
1
the appropriate "Bayesian inequality pre-test estimator", our counterpart
to the sampling theory inequality pre-test estimator, is
(9)
~ = P(Ho[Y)~H + P(HI[y)~H ,
0
1
where the elements in ~ are given in (7) and (8) and where
0
all~rn[l + (r -~l)2/(v~’2a~l)l-(v-1}/2
(10)
As an example of the other elements, the second one is given by
(11)
In equations (9), (I0) and (II) ~ is the Bayesian inequality restricted
1
estimator under HI, ~(H11y) = I - ~(HolY) measures the sample support for
H~, and P(Hjly), j = O, 1 are the posterior probabilities when P(Ho) ~
P(HI).
Noteon
that
view
~(HolY)
and ~(H~ly) as the posterior
probabilities
H° we
and can
HI when
P(H
o) = P(HI) (we do not discriminate
a pr~or~ between H
o
and H ).
I
Under these circumstances it seems reasonable
to define
~(H.]
, j = 0, I
P(H.~Iy) =
~(HolY)P(Ho) + ~(HllY)P(H1)
from which we obtain
S
(12)
(13)
P(Hl{y) ~(Hl{y) P(H1)
P(Ho{Y) ~(Ho{Y) P(Ho)
A more rigorous justification of (12) and (13), as limiting posterior
probabilities from proper prior pdfs is given in Appendix B.
One major difference between ~ and the sampling theory inequality
pre-test estimator is that the latter is a discontinuous function of the
data, while the former is not. Also, it can be seen from equations (7) to
(13) that if we regard H° and HI as a prlori equally likely (Le., P(Ho) =
P(HI) -- 0.S such that ¯ = I), then ~ = ~. The sampling theory counterpart
is to set c = r in which case the inequality pre-test estimator reduces to
However, if P(Ho) ~ P(HI), the sampling theory pre-test estimator and
~ are different. Considering the first element first we have, by
substituting (7) and (I0) into (9),
(14)
"[~(HoIY) "
Using (13), we can write (14) as
~1 ~1+ ~’allm [1
2 2 2"] -(v-1)/2
P(Ho[Y) { P(H1)1
This expression confirms that when P(H1) -- P(Ho)’- ~l = ~i" Also, when
P(H1) -- 0, P(Ho[Y) = i and ~l reduces to ~I,HO, the Bayesian inequality
restricted estimator under Ho. Furthermore, when P(Ho) > P(H1), we adjust
the unrestricted estimator upwards. This prior adjustment will be greater,
(a) the smaller the prior odds ratio P(HI)/P(Ho),
(b) the greater the value of P(HolY)/15(HolY)
[note that P(H1)/P(Ho) -< i
implies that P(Hol y)AS(H° I Y) -> I],
(c) the greater the value of ~rz, and
2
2 2 "1 -(v-1)/2
(d) the greater the value of 1 + (r-~I) /(Vall~r )J
.
This last term can be viewed as a measure of the strength of the sample
6
information. When r - ~i = 0, the sample information does not discriminate
-(v-l)/2
between H0 and HI, the term [i + (r-~l)z/(va:l~Z)] is maximized,
and the adjustment implied by the prior information is relatively large.
+ ~J
2
2 2 "l-(v-1)/2
declines, the impact
As Ir - ~ll increases, I
of the sample information is greater, and the prior adjustment away from ~i
is smaller.
Similarly, for other elements in ~, for example ~z’ we have
-(v-I)/z
z zl
P(H ly)[
mF1 + (r-@? z/va11;
F(HoJ[y)¯ 0/1
~2 = ~2 ÷ h-lh ;aII lZ 11 L
P(H1)I
(16)
Thus, the same kind of remarks can be made about ~2’ except that the
direction and magnitude of the adjustment will also depend on the
correlation structure of X’X.
Next, we examine the finite sample risk performance of the Bayesian
inequality pre-test estimator that we have derived.
3. THE PREDICTIVE RISK FUNCTION OF ~
Sampling theory properties do not have to be used to justify Bayesian
estimators to Bayesians.
It is nevertheless interesting to derive the
sampling theoretic risk of the Bayesian inequality pre-test estimator and
to compare this risk with that of its sampling-theory counterpart.
The
results given in the previous section primarily focus on the special case
where the constraint is in the form of 61 ~ r. To examine the risk of the
Bayesian inequality pre-test estimator under the more general constraint
C’~ ~ r given in (3), it is convenient to follow the procedure given in
Judge and Yancey (1981, 1986) and reparameterize (I), (3) and (S) as
y = HO + ~: ;
(17)
H : 0 -> r
(18)
o1
and
1
(19)
p(O, ~’) = I(Ho)/O"
respectively, where H = XS-I/2Q’; O = QS~/2B; r~
= r/hl; hl is the first
’
element of h’ = C’S-I/2Q and is assumed to be positive without loss of
7
generality; 0 is the first element of ~; and O is an orthogonal matrix
1
such that Q$-I/2c(C.S-Ic)-Ic.s-I/2Q. = (I 0"I.
O0
Using these results, the procedure described in the previous section,
1-~((r1-~)/~1
and noting that ~ = S-I/2Q’~ and P(HolY) T + (l-T) 1 it is straightforward to show that the
~i)/
Bayesian inequality pre-test
estimator of /~ can be written as
+ ~rmll + (r-~)2/V~2]-(v-1)/2
(20)
(k-l)
where ~ is the first element of ~, the unrestricted estimator of e; and
~(k-l) is a (k-i) × I vector containing the remaining elements.
Now, we consider the predictive risk function of ~ under squared error
loss defined as R(X~)= E[(X~ - X~)’(X~ - XB)]/~2.
We examine this
quantity rather than the risk of ~ itself so that
our results are
independent of the data matrix.
In terms of the
~ space, this is
equivalent to assuming orthonormal regressors.
Making use of (20) and the usual definition of risk under squared error
loss, the predictive risk function of the Bayesian inequality pre-test
estimator ~ can be written as
R(X~) -- E [(~-~)’ S(~-~)]/o"2
+ m2(1-T2) E
[n-w)
2
2 ~-(v-1)/2 ,~
(~-w) /q~
+ 2 m(l-z) El ql/2w[1 +
~
dq dw
dq dw { 21)
where w = (~1-ei)/~, q = v~.Z/o.2 and ~ = {r1-el)/~. When the inequality
restriction is correct, ~ -< O. The predictive risk of ~H’ the Bayesian
o
inequality restricted estimator under Ho, results when T = O.
To further examine the risk behavior of X~, we performed numerical
evaluation of (21) for v = 5, 15, 25 ..... 80; T = O, 0.I, 0.3, O.S, O.V, I;
and ~ -- [-6, 6]. The subroutines BETAI and GAMMQ from Press et at. (1986)
were used
to calculate the cumulative distribution function for the
Student’s f distribution and the Gamma function respectively. Numerical
integrations were carried out using the NAG (1991) subroutines DOIAHF and
D01AMF. These were incorporated into a FORTRAN program written by the
authors and executed on the VAXV610 and VAX6000 machines. Figures I and 2
illustrate some typical results.
For purposes of comparisons, the
predictive risk functions of the sampling theory inequality restricted and
pre-test estimators given in Wan (1993, 1994a) are also depicted in Figure
2. The inequality pre-test estimator was first derived by Judge and Yancey
(19B1) for the case in which ~2 is known.
Their results were later
generalized by Hasegawa (19B9) and Wan (1993, 1994a) to the ~z unknown
case.
From the diagrams, we observe the following features. First, the
predictive risk function of ~H is bounded and approaches that of ~ as @ ÷
o
-~, but it is unbounded as @ ÷ =. The predictive risk of ~ is bounded as
9
101
-> ~. When the direction of the inequality constraint is correct (Le.,
~ -~ 0),than
the the
predictive
risk
of ~ is smaller than that of ~, but it is
greater
risk of X~
H.
The risk of X~ reaches a maximum in the
0
region ~ > 0, then declines monotonically and approaches the predictive
risk of the unrestricted estimator as ~ increases. Also, for T close to i,
the predictive risk function of ~ more nearly approximates the predictive
risk of ~, and vice versa. These results concur qualitatively to the risk
performance of the corresponding sampling theory estimators.
However, contrary to the risk behavior of the sampling theory
inequality pre-test predictor, the predictive risk of the Bayesian
inequality pre-test estimator ~ is enveloped by the risks of X~H and X~
0
over the entire parameter space, except for a very small region near ~ =
0.
With the sampling theory estimators, there is always a finite, but
non-negligible range in 0, where the sampling theory pre-test estimator has
risk greater than the risk of both the unrestricted and the inequality
restricted maximum likelihood estimators. Regardless of the choice of T,
the intersection of the risk functions of X~ and X~ tends to occur in the
close neighborhood of ~ = 0, whereas the corresponding intersection point
for the sampling theory estimators is in the region ~ > 0. ~ is always
risk superior to ~H when ~ > 0, while the sampling theory inequality
o
pre-test estimator has smaller risk than the corresponding maximum
likelihood inequality restricted estimator only when @ is sufficiently
large.
If minimization of sampling theoretic risk is a major criterion for
estimator choice, we can ask the question whether there exists a prior odds
ratio T which is "optimal" in this or a related sense. This problem is
analogous to specifying an optimal critical value for the sampling theory
inequality pre-test estimator; it is explored in the next section.
4.
THE OPTIMAL ODDS RATIO FOR THE BAYESIAN INEQUALITY
PRE-TEST ESTIMATOR
In the pre-test literature, the mini-max regret criterion has been used
by many authors for selecting an optimal critical value of a pre-test (see
for instance, Brook (19"/6) and Wan (1993, 1994b)). Here we adopt this
I0
criterion to find an optimal prior odds ratio such that the maximum regret
of not being on the minimum risk boundary is minimized. Now, for 0 -~ ¯ -~
I, it is observed empirically that except in the close neighborhood of ~ =
O, min R(X~), the minimum of R(X~), is given by either R(X~[~=0) or
R(X~[~=I). That is, for almost all 0, minimum risk is achieved from either
the Bayesian inequality restricted estimator under H or the unrestricted
o
maximum likelihood estimator. Near @ = O, the risk of X~ can lie above or
below the minimum of R(X~) and R(X~ ), but, since this region is typically
o
very small, it does not influence the results.
For any given ~, the regret function is defined as
P.EG(~I) = R(X~) - min R(X~) .
(22)
The mini-max regret odds ratio, ~M’ is the odds ratio such that the maximum
regret over all values of @ is minimized. It is found that decreasing
reduces the maximum regret for ~ -~ O, but increases the maximum regret for
@ > O, and vice versa. Because of this monotonicity property, ¯ is the
M
value of ¯ which equalizes the maximum of the regret function in the
regions @ > 0 and @ - O. Values of ¯
for different degrees of freedom are
computed using the Golden Section Search Routine given in Press et
(1986), and are reported in Table 1.
From Table I, we observe that ¯ fluctuates only slightly over the
M
entire range of degrees of freedom, and is around 0.43 for moderate to
large degrees of freedom. In practical terms, this result means that if H
is believed to be more likely than HI,
o
but no further information is
available regarding the relative likelihoodof the two events, then from
the standpoint of sampling theoretic risk, an odds ratio of 0.43 will lead
to an optimal predictor according to a mini-max regret criterion.
For purposes of comparison, cM, the optimal critical value for the
sampling theory inequality pre-test estimator derived by Wan (1993, 1994b)
and the corresponding values of the regret function, are also given in
Table I. As with the Bayesian case, the optimal critical value is not very
sensitive to the degrees of freedom. Although neither the Bayesian nor the
sampling theory pre-test estimator strictly dominates each other in terms
of risk, at least for the cases that we have considered, ~ dominates c in
M
M
the sense that the maximum regret of using the Bayesian inequality pre-test
11
estimator
with
T --cT is
always smaller than the corresponding maximum
regret value
when
Mis chosen as the critical value for the sampling
M
theory inequality pre-test estimator.
Also, the difference in regret is
considerable. The Bayesian estimator is twice as good for small degrees of
freedom and 1.68 times better for large v.
Figure 2 illustrates these
results for the case of v = 15.
5.
AN EMPIRICAL EXAMPLE
The purpose of this section is to illustrate our methodology described
in Section 2 with an example taken from Geweke (1986) based on the rental
data provided by Pindyck and Rubinfeld (1981, p.44).
The estimated
equation attempts to explain rentals paid by students at the University of
Michigan and is given as follows :
(23)
+ 63(l-st)rt + 64stdt + 65(l-st)dt + ct’
Ytis--the
61 +
62strt
where Yt
rent
paid per
person; rt is the number of rooms per person;
d is the distance from campus in blocks; st is a sex dummy, one for male
t
and zero for female.
In what follows we present several estimates of posterior probabilities
and posterior means. These values were computed using the BAYES command in
SHAZAM (White ef al. (1993)). The number of antithetic replications for
Monte Carlo integration was 20,000 for each case. Corresponding estimates
in
Geweke
(1986) differ slightly because of differing random number
sequences.
The first hypothesis of interest is that where the coefficients have
the expected signs.
That is,
Ho: 6z-> 0, 63 >- 0, 64- 0, 6s-~ 0
We estimate the sample support for this hypothesis as ~(HolY) = 0.0477. A
question
naturallytoarises
Given for
thisthe
low
data-based
for Ho, dothat
we proceed
obtainis:estimates
6’s
assuming probability
H° holds
with certainty, or do we entertain the possibility that alternative signs
could be possible? If we are going to let the data persuade us to consider
other possible hypotheses, then we should not begin with the dogmatic prior
P(H0) = 1.
12
The other hypotheses that we consider are
H~: ~32 ~- 0, ~3 -~ 0, ~4 > 0 and ~5 > O;
H2:~2 -- O, ~3 >- 0, ~4 > 0 and ~s -~ 0;
H3:~2 -~ O, ~3 -~ 0, ~4 - 0 and ~s > 0;
The hypothesis H might be relevant if more distant housing has attractive
features
housing close
to campus
possess.
Hypothesis
could be that
an appropriate
modification
ofdoes
Hi if not
females
are not
prepared H
to 2
pay
for transport.
more distant
housing because
of security
problems
associated
withmore
public
Hypothesis
H3 might
be relevant
if there
was a
security problem in the campus neighborhood, but not with public transport.
Note that, because there is more than one inequality involved, we no
longer have 3ust two hypotheses, as was the case in the earlier sections of
= 16
this paper. In fact, with four inequality restrictions, there are
hypotheses, one for each different combination of inequalities. For our
illustration we are assuming all hypotheses other than Ho, Hi, Hz and H3
When we have multiple inequalities, and
have zero prior probabilities.
consequent multiple hypotheses, analytical expressions for the posterior
means, conditional on each H (see equations (7) and (I0)), are no longer
available.
However, these conditional posterior means can be routinely
estimated using SHAZAM’s BAYES command.
The Bayesian inequality pretest
estimator can then be computed from
(24)
where
3
P(H~ly) = ~(H]Iy)P(H])/ ~- ~(H, Iy)P(H,)
(25)
I=0
Table 2.1 contains the inequality restricted estimates under each of
the hypotheses and the sample based posterior probabilities ~(Hily). Note
that the hypotheses and corresponding estimates differ through the signs
they attach to ~ and ~s" The sample information strongly supports Hz
where ~4 > 0 and ~s ~ 0. The sum of the P(H|Iy) is greater than 0.99,
indicating little sample support for the other 12 regions to which we
attach zero prior probabilities.
Bayesian inequality pre-test estimates
are presented in Table 2.2 for three different sets of prior probabilities.
13
In Case I, we attach a prior probability of 0.8 to what we consider to be
the most probable hypothesis Ho, but we place some credence on the argument
that more distant housing is more attractive and so allocate a 0.2 prior
probability to HI, where ~4 > 0 and 65 > 0. Cases 2 and 3 introduce
nonzero prior probabilities for H2 and I-I3 and so allow for the two security
arguments that suggest females are more concerned with security than males.
In Case 2 the prior probability on H remains high (0.7), but it is
o
weakened
to
0.4
in
Case
3.
In Case I, because the data suggest Hi is even less likely than H° and
because we have excluded Hz, we find the sample information strengthens our
prior belief in Hsmall
the estimates
for (0.1)
64 and
65 are to
consequently
probability
attached
H2 in Case 2
o, andprior
negative. The
allows the data to dominate, leading to a posterior probability of P(Hzly)
= 0.718 and to signs for 64 and 65 that satisfy H2. This effect is more
pronounced when we increase the prior probability to P(H = 0.2.
2)
6.
CONCLUDING REMARKS
We have introduced a Bayesian inequality pre-test estimator as a
counterpart to the sampling theory inequality pre-test estimator.
Furthermore, the predictive risk function of the Bayesian inequality
pre-test estimator was derived and evaluated numerically. Although in many
ways, the predictive risk behavior of the Bayesian inequality pre-test
estimator bears resemblance to that of the analogous sampling theory
pre-test estimator, the two estimators differ quali.tati.vely in terms of the
risk comparisons with their corresponding component estimators.
Our
numerical calculation shows that over almost the entire ~ space,
the
predictive risk of the Bayesian inequality estimator is enveloped by the
risk of Bayesian inequality restricted and unrestricted predictors. On the
other hand, there is always a non-negligible region in ~ in which the
sampling theory inequality pre-test estimator has predictive risk higher
than the predictive risk of both the unrestricted and inequality restricted
estimators. We have also tabulated optimal odds ratios for the Bayesian
inequality pre-test estimator according to a mini-max regret criterion. We
find that the pptimal odds ratio is approximately constant for moderate to
large degrees of freedom. Also, the Bayesian pre-test estimator dominates
14
the sampling theory pre-test estimator in the sense that the maximum regret
¯ corresponding to the optimal odds ratio uniformly dominates the
corresponding maximum regret value when the sampling theory pre-test
estimator is applied with the optimal critical value. A numerical example
is given to illustrate our methodology. In particular, we show how the
methodology is modified for multiple inequality restrictions and how the
specification of prior probabilities representing the uncertainty of
hypotheses can affect the final estimates. Finally, our risk results were
confined to a single inequality restriction; whether or not these results
also hold for non-orthonormal regressors and multiple inequality
restrictions still needs pursuing.
Appendix A: Derivation of Inequality Restricted Estimator
For the posterior mean of BI’ the parameter subject to the inequality
constraint, we have
where p(~11y) is the univariate t distribution with location ~I’ degrees of
freedom
v and precision
(a~l~rZ)-1 where
a211 is the first diagonal element
S-I. Making
the transformation
81 = ~1
+ ~r a t where t is a
of
11
standardized t random variable and where we define t’ = (r-~
obtain
E[~x [Y’H°] =~(H-I° [y) ~’ (~I+ ~allt)p(t)dt
15
a11,
we
-(v+l)/2
dt
~r allC’V I
-(v-1)/~
= ~I + ~(HolY)(V_l) i +
N2 2 l-(V-1)/2
+
= ~I + an~ m[l
where c’ is the normalizing constant for the standardized t distribution
and m = c’v/(v-1).
For the posterior mean of a parameter not subject to the inequality
constraint, say ~z’ we have
_ 1 ~ ~z I(Ho)P(~llY)P(~zI~I’Y)P(~ ..... ~kI~’~z’y)cl~
~(HolY)
-
~(Hol y) r
~
~{H0lY) r
The elements h
II
and h are elements from the precision matrix for the
12
bivariate t pdf p(~1,~zly). To obtain them, let V = S-i = (X’X)-I and let
V. be the {2x2) submatrix obtained from the first two rows and columns of
V. Let H = V-1
¯ " Then, h and h are the elements in the first row of H.
Ii
12
16
Appendix B: Derivation of a Limiting Posterior Odds Ratio
In the context of the model in the paper, suppose that the prior pdf
for ~ is multivariate t and that ~ is a priori independent of ~ whose prior
pdf is inverted gamma. That is,
where
p(~) = c1[vI + (~-b)’A(/~-b)]-(k+P
i )/2
p(~) = c ~-Iv2 +I) exp(-v s2/2~z)
2
22
In these expressions cI and cz are normalizing constants and the prior
b, A, v2 and sz. This prior was suggested by Dickey
parameters are vl,
(197S) as an alternative to the more common natural conjugate prior which
has
some
Judge etthat
al. (198S,
p.lll)
for undesirable
details. Asproperties.
A ÷ 0 andSee
v2 Dickey
÷ 0 this(197S)
prior or
becomes
used in
the
p(~, ~)posterior
~ ~-i probabilities
Thus, of interest
usvwill
be the limiting
formpaper,
of consequent
as A ~ 0to
and
2 ÷ 0.
The hypotheses introduced in Section 2 were
H:
o
C’~ -> r
I
and H:
C’~ < r
Suppose that the prior mean b is chosen such that
fpC~)d~ = f p(~)d~ = O.S
{Ho~
{HI}
That is, t.he prior pdf for ~ is centered at the boundary of the two
hypotheses. A procedure for selecting such a b is outlined below. For
prior pdf’s for 6, conditional on H° and H~, we take truncated multivariate
t distributions, obtained by truncating p(~) at the hypothesis boundary.
That is,
-(k+P1)/2
p(~~Hj) = 2c1[PI + (~-b)’A(~-b)]
I(Hj) j = 0, 1
17
where I(Hj) is an indicator function equal to 1 when ~ is such that Hj is
satisfied. For ~, we assume
Our objective is to find expressions for the posterior probabilities
for each hypothesis
P{H])p(y[Hj)
P(Hj[y)- P(Ho)p(y]Ho) + P{H1)p(y[H1)
j=0,1
given prior probabilities P(Ho) and P(H1), and where
(n+12)/2-1
= 2ClC2(2~)-n/zF[(n+vz)/2] 2 Z Qj
where
-(k+P )/2
Substituting for f(y[Hj) in the expression for P(H][y) yields
P(Hj[y) = P(Ho)Q
P(Hj)Qj
° + P(HI)Q1
For the noninformative prior implied by A ÷ 0 and v2 ÷ O,
where c~ is a collection of constants that are common to both Qo and QI"
Making this substitution for Q] yields the result in equation (12), namely
~(H] [ y)P(H~)
P(H][y) =
~(HoIY)P(Ho} + ~(HllY).P(H1)
18
The remaining task is to indicate how the prior p(/3) can be chosen so
that it is centered at the hypothesis boundary. For this purpose we can
use the transformations introduced in Section 3 of the paper. That is, we
begin with a prior pdf for e that has the kernel
p(19) ~ IvI + (O-t)’A*(19-t)]-(k+l~ )/2
I
being determined by the hypothesis
, tk)’, rl
where t -- (rI, tz, t3 ....
boundary (see Section 3), and t2, t3, ..., tk being arbitrary prior means.
Then, using notation introduced in Section 3, we define ~3 = S-I/2Q’~ so
that the quadratic form in the kernel of the prior pdf becomes
[e-t)’A*[e-t) = [Qsl/Z~-t)’ A~[QSI/2~-t)
-- [J~-S-112Q’ t)’ S1/2Q’ A~Qs1/2[~-s-II2Q’ t)
= (B-b)’ A(~-b)
where A = SI/ZQ’A*QS1/z and, to make the prior pdf symmetric around the
hypothesis boundary we choose the prior mean b = S-I/2Q’t.
19
~=0.1
~=0.3
X=0.5
X=0.7
Figure h Predictive risk functions of ~ for n = 20 and k = 5
~ (mini-max)
inequality restricted
pre-test (mini.max)
Figure 2: Risk comparisons between ~ and sampling theory inequality
pre-test predictors for n ffi 20 and k ffi 5
20
Table h Optimal odds ratio and critical values according to the
mini-max regret criterion and the corresponding maximum regret
values
v
5
I0
15
20
25
30
35
40
45
50
55
60
65
70
75
80
~M
0.516
0.460
O. 445
O. 438
O. 434
O. 432
0.430
0.428
0.427
0.427
0.426
0.426
0.426
0.425
0.425
0.425
regret(~M)
0.189
0.224
O. 234
O. 239
O. 242
O. 243
0.245
0.246
0.247
0.247
0.248
0.248
0.249
0.249
0.249
0.250
C
-i
-I
-1
-i
-1
-i
-1
-i
-i
-i
-i
-i
-i
-i
118
122
124
125
126
127
127
127
127
127
127
127
127
128
-I 128
-i 128
regret(c~)
0 410
0 414
0 416
0 418
0 418
0 419
0 419
0 419
0 419
0 419
0 419
o
0
0
0
0
420
420
420
420
420
regret(z~) and regret(c~) denote the maximum value
of the regret function corresponding to TM and c~
respectively.
21
Table 2.1: Bayesian inequality restricted estimates
Hypothesis
H
0.0477
37.843
136.77
H1
H2
H
0.0244
0.9179
0.0014
36.104
38.104
42.182
104.57
102.43
133.61
o
123.57
99.76
123.26
92.15
-0.926
3.562
3.530
-i.011
-1.2016
0.2475
-i. 1933
0.2764
Table 2.2: Prior and posterior probabilities and Bayesian inequality
pre-test estimates
Hypothesis
Case 1
Ho
H1
H2
H
0.8
0.8867
0.2 0. 1133
0.0
0.0
0.0
0.0
37. 646
133.12 120.87 -0.4171 -1.0375
0.7
0. I
0. i
0. I
0.2618
0.0191
0.7180
0.0011
38. 002
I11.48
122.86
2.3612 -1.1662
0.4
0.2
0.2
0.2
0.0919
38. 038
105.67
122.70
3.1151 -1.1582
Case 2
H
Case 3
H
0.0235
0.8833
0.0013
REFERENCES :
Barnett, W.A., J. Geweke and P. Yue (1991), "Seminonparametric Bayesian
estimation of the asymptotically ideal model: The AIM consumer demand
system" in W.A. Barnett, J. Powell and G.E. Tauchen, eds.,
Nonparamefric and Semiparamefric Methods in Econometrics and
Statistics:
Proceedings of" the Fifth International Symposium in
Economic Theory and Econometrics, Cambridge: Cambridge University
Press, 127-173.
Brook, R.B. (1976), "On the use of a regret function to set significance
points in prior tests of significance", Journal of the Arnertcan
Stafisfical Iissoc~ation 71, 126-131.
Chalfant, J.A. and N.E. Wallace (1992), "Bayesian analysis and regularity
conditions on flexible functional forms: Application to the U.S. motor
carrier industry" in W.E. Griffiths, H. Ltitkepohl and M.E. Bock, eds.,
ReadinF, s in Economefric Theory and Pracfice: A I[olume tn Honor of
GeorEe JudEe, Amsterdam: North-Holland, 159-174.
Davis, W.W. (1978), "Bayesian analysis of the linear model subject to
linear inequality constraints", Journal of fhe Amerfcan Sfaf£sfical
Associafion 73, 573-579.
Dickey, J.M. (1975), "Bayesian alternatives to the F-test and least-squares
estimates in the normal linear model", in S.E. Fienberg and A. Zellner,
eds., Sfudies ~n Bayesian Economefrics and Sfafis~cs, Amsterdam:
North-Holland, 515-554.
Geweke, J. (1986), "Exact inference in the inequality constrained normal
linear regression model", Journal of Applied Economefr~cs i, 127-141.
Geweke, J. (1988a), "Exact inference in models with autoregressive
conditional heteroskedasticity", in W. Barnett, E. Berndt and H. White,
eds., Dynamic Economefric Model ~nE: Proceedin~,s of fhe Third
Infernafional Symposium in Economic Theory and Econornefrics, Cambridge:
Cambridge University Press, 73-104.
Geweke, J. (1988b), "The secular and cyclical behavior of real GDP in 19
OECD countries", Journal of Business and Economic Sfaffsf£cs 6,
479-486.
Griffiths, W.E. (1988), "Bayesian Econometrics and how to get rid of those
wrong signs", Review of Marketing and Agricultural Economics 56, 36-56.
Hasegawa, H. (1989), "On some comparisons between Bayesian and sampling
theoretic estimators of a normal mean subject to an inequality
constraint", Journal of the Japan Statistical Society 19, 167-177.
Judge G.G., W.E. Griffiths, R.C. Hill, H. Ltitkepohl and T.C. Lee (1985),
The Theory and Practice of Econometrics, 2nd ed., New York, Wiley.
Judge, G.G., and T.A. Yancey (1981), "Sampling properties of an inequality
restricted estimator", Economics Letters 7, 327-333.
Judge, G.G., and T.A. Yancey (1986), Improved Methods of Inference ~n
Econometrics, North-Holland, Amsterdam.
Lovell, M.C. and E. Prescott (1970), "Multiple regression with inequality
constraints: Pre-testing bias, hypothesis testing and efficiency",
Journal of the American Stat~.sf~cal ~ssoc~at~on 65, 913-92S.
Numerical Algorithm Group Ltd. (1991), "NAG FORTRAN Library Manual, Mark
15, Vol. I", NAG Inc., U.S.A.
O’Hagan, A. (1973), "Bayes estimation of a convex quadratic", B£ometr£ca
60, 565-571.
Pindyck, R.S. and D.L. Rubinfeld (1981), Econometric Models and Economic
Forecasts (2nd edition), McGraw-Hill, New York.
Press, W.H., B.P. Flannery, S.A. Teukolsky and W.T. Vettering (1986),
Numerical Recipes: The ~ of Scientific Computing, New York, Cambridge
University Press.
Wan, A.T.K. (1993), "Inequality restricted and pre-test estimation in a
mis-specified Econometric model", Ph.D thesis, Department of Economics,
University of Canterbury.
Wan, A.T.K. (1994a), "The sampling performance of inequality restricted and
pre-test estimators in a mis-specified linear model", Australian
Journal of Statistics, forthcoming.
Wan, A.T.K. (1994b}, "The optimal size of a preliminary test for an
inequality restriction in a mis-specified regression model", mimeo.,
Department of Econometrics, University of N.S.W.
White K.J., S.D. Wong, D. Whistler and S.A. Haun {1993}, SHAZAM user’s
reference manual: version 7.0.
24
Yancey, T.A. and R. Bohrer (1997-), "Risk and power for inequality pre-test
estimators: general case in two dimensions",
in W.E. Griffiths,
H. Ltitkepohl and M.E. Bock, eds., Readings in Econometric Theory and
Pracf~ce: A Volume ~n Honor of George Judge, Amsterdam: North Holland,
33-55.
WORKING PAPERS IN ECONOMETRICS AND APPLIED STATISTICS
~o2.~~ P.D%eaa Red~. Lung-Fei Lee and William E. Griffiths,
No. I - March 1979.
~minQ Ro~. Howard E. Doran and Rozany R. Deen, No. 2 - March 1979.
~ A/ate on ~ Za~ ~a~imntan ~n an ~ @aaan
William Griffiths and Dan Dao, No. 3 - April 1979.
D.S. Prasada Rao, No. 5- April 1979.
Red2/: M ~9ixw.az~daa o~ ~ o~ @~/~. Howard E. Doran,
No. 6 - June 1979.
" Reo~ Roda~a. George E. Battese and
Bruce P. Bonyhady, No. 7 - September 1979.
Howard E. Doran and David F. Williams, No. 8 - September 1979.
D.S. Prasada Rao, No. 9 - October 1980.
~u~ ~o2~ - 1979. W.F. Shepherd and D.S. Prasada Rao,
No. I0 - October 1980.
~aZa~Yi~ne-Yeaieaand ~aoaa-YecZLaa ~oiat~ $atimaZea ~aodur_Zh~a Yuaciiaa
aWiA ~o~aadNeq~Romu~ ~/~. W.E. Griffiths and
J.R. Anderson, No. II - December 1980.
Howard E. Doran
and Jan Kmenta, No. 12 - April 1981.
~inai Oadea ~u~on2~~~~. H.E. Doran and W.E. Griffiths,
No. 13 - June 1981.
Pauline Beesley, No. 14 - July 1981.
Yo~ ~oia. George E. Battese and Wayne A. Fuller, No. 15 - February
1982.
26
~)~. H.I. Tort and P.A. Cassidy, No. 16 - February 1985.
H.E. Doran, No. 17- February 1985.
J.W.B. Guise and P.A.A. Beesley, No. 18 - February 1985.
W.E. Griffiths and K. Surekha, No. 19- August 1985.
~n~ ~a~. D.S. Prasada Rao, No. 20 - October 1985.
H.E. Doran, No. 21- November 1985.
~ae-Fex~ gatim~~tAie~ru~ ~ede/. William E. Griffiths,
R. Carter Hill and Peter J. Pope, No. 22 - November 1985.
William E. Griffiths, No. 23 - February 1986.
~~ ~anduntinn ~unct2nn~o~ ~ane/ ~ata: ~tA ~p~tathe
Ma~ ~a~ ~~. T.J. Coelli and G.E. Battese. No. 24 February 1986.
~aadu~u~ I!~Mggn2-q~~ata. George E. Battese and
Sohail J. Malik, No. 25 - April 1986.
George E. Battese and Sohail J. Malik, No. 26 - April 1986.
~a~. George E. Battese and Sohail J. Malik, No. 27- May 1986.
George E. Battese, No. 28- June 1986.
Nam/~.n~. D.S. Prasada Rao and J. Salazar-Carrillo, No. 29 - August
1986.
~un~ ~ex~en 9n~ 8at2~~na~M~(1) 8anna~edet. H.E. Doran,
W.E. Griffiths and P.A. Beesley, No. 30 - August 1987.
William E. Griffiths, No. 31 - November 1987.
27
Chris M. Alaouze, No. 32 - September, 1988.
G.E. Battese, T.J. Coelli and T.C. Colby, No. 33- January, 1989.
8~n~ Ze ~ gcon~-1~ide ~ade!2~. Colin P. Hargreaves,
No. 35 - February, 1989.
William Griffiths and ~orge Judge, No. 36 - February, 1989.
~%e~anag~o~ 8~ ~oiea Dan~ Druma~Z. Chris M. Alaouze,
No. 37 - April, 1989.
~ ~ 1~aZen M22~ ~r~. Chris M. Alaouze, No. 38 July, 1989.
Chris M. Alaouze and Campbell R. Fitzpatrick, No. 39 - August, 1989.
~a. Guang H. Wan, William E. Griffiths and Jock R. Anderson, No. 40 September 1989.
o~h~ Re~ Opt. Chris M. Alaouze, No. 41 - November,
1989.
~ ~he~ ozui ~~ ~. William Griffiths and
Helmut Lfitkepohl, No. 42 - March 1990.
Howard E. Doran, No. 43 - March 1990.
~!a2n~FheXa~ ~iAtea~ g~~uA-~apa~. Howard E. Doran,
No. 44 - March 1990.
Howard Doran, No. 45 - May, 1990.
Howard Doran and Jan Kmenta, No. 46 - May, 1990.
and $a/~ ~nizea. D.S. Prasada Rao and E.A. Selvanathan,
No. 47 - September, 1990.
28
~con~ YZudZe~uiZha ~~ o~ Nea~ ~ng/~. D.M. Dancer and
H.E. Doran, No. 48 - September, 1990.
D.S. Prasada Rao and E.A. Selvanathan, No. 49 - November, 1990.
~ ~n ~ gC~. George E. Battese,
No. 50 - May 1991.
Y~leann~ gannn~o~ ~~ ~oa~n. Howard E. Doran,
No. 51 - May 1991.
Fe~ Non-N~ ~ed2.2~. Howard E. Doran, No. 52 - May 1991.
~~ M~v~. C.J. O’Donnell and A.D. Woodland,
No. 53 - October 1991.
A(~ Yer_~. C. Hargreaves, J. Harrlngton and A.M.
Siriwardarna, No. 54- October, 1991.
Colin Hargreaves, No. 55 - October 1991.
~Jx~~a 90z&i9 ~vune~n $na~. G.E. Battese and T.J. Coelli,
No. 56 - November 1991.
2.0. T.J. Coelli, No. 57 - October 1991.
Barbara Cornellus and Colin Hargreaves, No. 58 - October 1991.
Barbara Cornellus and Colin Hargreaves, No. 59 - October 1991.
Duangkamon Chotlkapanich, No. 60 - October 1991.
Colin Hargreaves and Melissa Hope, No. 61 - October 1991.
" " ~ec~. Colln Hargreaves, No. 62 - November 1991.
O&a~ ~ EunA~e. Duangkamon Chotlkapanich, No. 63 - May 1992.
29
V~. G.E. Battese and G.A. Tessema, No. 64- May 1992.
~~ ~u~2.hO%~. C.J. O’Donnell, No. 65- June 1992.
~rmp~. Guang H. Wan and George E. Battese, No. 66 - June 1992.
~nz.on~n~fl~ 0% ~~ia, 1960-1985: ~ ~nAog~b~o~
Ma. Rebecca J. Valenzuela, No. 67 - April, 1993.
~o2~ D% tAe ~.F. Alicia N. Rambaldi, R. Carter Hill and
Stephen Father, No. 68 - August, 1993.
@a~izi~ ~recZa. G.E. Battese and T.J. Coelli, No. 69 - October,
1993.
Sex~ ~~. Tim Coelli, No. 70 - November, 1993.
~h~in W~ ~ ~. Tim J. Coelli, No. 71 December, 1993.
G.E. Battese and M. Bernabe, No. 72 - December, 1993.
Getachew Asgedom Tessema, No. 73 - April, 1994.
$aequz/iZ~ ~o~. W.E. Grlfflths and A.T.K. Wan, No. 74 May, 1994.
3O