The Rationality and Heterogeneity of Survey Forecasts of the Yen

The Rationality and Heterogeneity of Survey Forecasts of the
Yen-Dollar Exchange Rate: A Reexamination
Richard Cohen, Shigeyuki Abe, and Carl Bonham
September 15, 2005 DRAFT Do Not Quote
Abstract This paper examines the rationality and diversity of industry-level forecasts of
the yen-dollar exchange rate collected by the Japan Center for International Finance. In
serveral ways we update and extend the seminal work by Ito (1990). By using partially
disaggregated forecasts to test for unbiasedness, weak efficiency, and micro-homogeneity of
coefficients across industry groups, we minimize the effects of inconsistent inference due to
aggregation or pooling. We undertake a two-way comparison between univariate (simple)
and bivariate (joint) tests of unbiasedness, on the one hand, and differenced and level specifications on the other. In our systems estimation and testing, we use an innovative GMM
technique (Bonham and Cohen, 2001) that allows for forecaster cross-correlation due to the
existence of common shocks and/or herd effects. Micro-homogeneity test results for correct
unbiasedness specifications are uniformly negative. There does not, however, appear to be
a close relationship between patterns of individual deviations from unbiasedness and microhomogeneity test results. We find that integration accounting and regression specification
matter for rationality testing. We argue that joint (bivariate) tests of unbiasedness and efficiency should be abandoned in favor of separate tests for each. Using correct specifications,
we find that simple unbiasedness deteriorates with forecast horizon, and weak efficiency is
widely rejected at all horizons.
Keywords: Rational Expectations, Heterogeneity, Exchange Rate, Survey.
Richard Cohen
College of Business and Public Policy
University of Alaska Anchorage
3211 Providence Drive
Anchorage, AK 99508
[email protected]
Shigeyuki Abe
Center for South East Asian Studies
Doshisha University
[email protected]
Carl S. Bonham
Department of Economics
University of Hawaii at Manoa
Honolulu, HI 96822
[email protected]
1
Overview
This paper examines the rationality of industry-level survey forecasts of the yen-dollar
exchange rate collected by the Japan Center for International Finance (JCIF).1 Because the
forecasts and targets are integrated, we test for unbiasedness using both differenced and
level forms of these variables. Although the differenced specification is most common, if
the forecast and target are integrated and cointegrated, then the differenced specification
is misspecified as it omits all long run informtion. For both differenced and level forms of
the variables, we compare test results from univariate regressions (of the forecast error on a
constant), where the null hypothesis is that the intercept equals zero, with joint test results
from the more commonly used bivariate regressions (of the target on the forecast, with an
intercept), where the null hypothesis is that the intercept equals zero and the slope equals
one. This distinction may be significant for two reasons. First, we show that the joint
hypothesis of the bivariate regression tests not only unbiasedness but also a type of weak
efficiency, i.e., lack of correlation between forecast and forecast error. To distinguish between
tests for unbiasedness and this type of weak efficiency, we also conduct a battery of other
weak efficiency tests on the forecast error. Second, the bivariate differenced specification
may also suffer from inconsistency if the forecasts are aggregated (see Figlewski and Wachtel
(1983)) or pooled (see Zarnowitz (1985)). Neither of these critiques of inconsistency apply
when the unbiasedness test is conducted either in a univariate regression or a cointegrated
framework. We are not aware of any other rationality studies which make this two-way
comparison between differenced and level specifications, on the one hand, and univariate
and bivariate specifications, on the other.
We also examine the diversity of forecaster performance by testing for micro-homogeneity
in the unbiasedness and weak efficiency equations. The extent of diversity has implications
for rationality, since rational forecasts should satisfy cross-forecaster homogeneity restric1
See Appendix use reference 1 for description of the data.
1
tions for unbiasedness and efficiency. Similarly, a failure of rationality across forecasters
should be evidenced by a failure of micro-homogeneity, unless all forecasters are equally irrational. A failure of micro-homogeneity casts doubt on the theoretical underpinnings of the
representative agent assumption used in much of macroeconomic and financial modeling. It
also has implications for the market microstructure hypotheses.
We use the most disaggregated data available for our analysis. Few other researchers have
used disaggregated data on exchange rate expectations, for either forecast evaluation or forecast modeling, in large part because so few disaggregated databases exist. Notable examples
of disaggregated analysis include Ito (1990), who uses the JCIF data,2 and MacDonald
(1992), MacDonald and Marsh (1996), Chionis and MacDonald (1997), and Benassy-Quere
et al. (2003), all of which employ the Consensus Forecasts of London. In addition, Abou and
Prat (2000) used disaggregated stock price data from the Livingston survey.
In our tests for micro-homogeneity, it is reasonable to assume the existence of crossforecaster error correlation due to the possibility of common macro shocks and/or herd
effects in expectations. To this end, we incorporate two innovations not previously used by
investigators studying survey data on exchange rate expectations. First, we use Pesaran’s
(2004) CD test to verify the statistical significance of cross-sectional dependence of forecast
errors. Second, in our micro-homogeneity tests we use a GMM system that incorporates
a variance-covariance matrix with allowance for cross-sectional as well as moving average
and heteroscedastic errors.3 By more accurately describing the panel’s residual variance2
Ito had access to individual and industry-level forecasts, whereas Bryant (1995) had access only to
aggregated data. We have access only to industry-level (as well as aggregate) forecasts. Hence, there may
be some aggregation bias in our industry level aggregation of the data. However, it is noteworthy that,
using disaggregated data of stock market expectations from the Livingston survey, Abou and Prat (2000)
were able to conclude that their expectational models exhibited more heterogeneity over time and between
industry groups than within groups. See below for further discussion of aggregation bias.
3
Keane and Runkle (1990) first accounted for cross-sectional correlation (in price level forecasts) using a
GMM estimator on pooled data. Bonham and Cohen (2001) tested the pooling specification by extending
this GMM estimator to construct a Wald statistic for testing the micro-homogeneity of individual forecaster
regression coefficients in a system. Elliott and Ito (1999) used single equation estimation that incorporated
a White correction for heteroscedasticity and a Newey-West correction for serial correlation. See section 5.5
on Ito tests.
2
covariance structure, we expect this systems approach to improve the efficiency of our microhomogeneity test statistics.
2
Why test for rational expectations of the exchange rate?
The Rational Expectations Hypothesis (REH) states that economic agents know the
true data generating process (DGP) for the forecast variable. This implies that the market’s
subjective probability distribution of the variable is identical to the objective probability
distribution, conditional on a given information set, φt−1 . Equating first moments of the
market, Em (st |φt−1 ), and objective, E(st |φt−1 ), distributions,
Em (st |Φt−1 ) = E(st |Φt−1 ).
(1)
It follows that the REH implies that forecast errors have both unconditional and conditional means equal to zero. A forecast is unbiased if its forecast error has an unconditional
mean of zero. A forecast is efficient if its error has a conditional mean of zero. The condition
that forecast errors be serially uncorrelated is a subset of the efficiency condition where the
conditioning information set consists of past values of the forecast variable.
In financial markets such as those for foreign exchange, rational expectations plays a
crucial role in the formulation of market efficiency hypotheses. Market efficiency implies that
all available information is embodied in the current market price. Hence, market efficiency
implies an efficient allocation of information, and given this information set, it is impossible
to earn above-normal risk-adjusted profits.
The simplest form of foreign exchange market efficiency is a joint hypothesis that agents
are both risk neutral and form rational expectations of the exchange rate. This form of
market efficiency is embodied in the uncovered interest rate parity (UIP) condition
sem,t,h − st = it − i∗t ,
(2)
3
where st represents the natural logarithm of the nominal spot exchange rate at time t,4 i.e.,
price of a unit of foreign currency in units of the domestic currency, sem,t,h represents the
market’s time t expectation of the h-period ahead spot rate, it is the domestic interest rate
on a nominally riskless asset that matures in h periods, and i∗t is the corresponding interest
rate on the foreign asset.5
Transactions costs aside, UIP may fail to hold because either or both parts of the joint
hypothesis do not hold. First, the assumption that foreign and domestic assets are perfect
substitutes may fail. Thus, because their positions are uncovered, domestic investors may
require a (possibly time-varying) exchange risk premium (ρt ) for investing in foreign assets
that must be converted back to the domestic currency. Therefore, the UIP condition should
be respecified as
sem,t,h − st + ρt = it − i∗t .
(3)
Fama (1984) showed that using the (relatively less stringent) covered interest parity (CIP)
condition
ft,h − st = it − i∗t ,
(4)
together with the UIP condition, allows a decomposition of the forward premium, ρt into an
expected depreciation component and a risk premium component, i.e.,
ft,h − st = sem,t,h − st + ρt .
(5)
where ft,h is the forward rate at time t on an h-period contract.
Equivalently, we can solve for ρt = ft,h − sem,t,h . That is, the risk premium is the expected
excess return (over and above the expected spot rate) from taking a forward position. Using
4
Following other authors (e.g., Frankel and Froot (1987) and Liu and Maddala (1992)), we take natural
logarithms of the exchange rate variables, so as to avoid the Siegel paradox, which arises from Jensen’s
inequality.
5
Perhaps the most cited stylized fact about exchange rates in a floating exchange rate regime is that they
are typically indistinguishable from a random walk with drift, i.e., Et st+1 − st = constant, and therefore
unpredictable. However, a random walk (with drift) in the exchange rate is only consistent with UIP if 1)
rational expectations obtains, i.e., sem,t,1 = Et st+1 , and 2) the nominal exchange rate differential is constant
(not necessarily zero).
4
this definition of the risk premium, and defining the forecast error ηt+h as st+h − sem,t,h , Froot
and Frankel (1989) showed that β ≡ 1 − βρ + βη in the regressions
st+h − st = α + β(ft,h − st ) + εt+h ,
(6)
ft,h − st+h = αρ + βρ (ft,h − st ) + ερ,t+h ,
(7)
st − sem,t,h = αη + βη (ft,h − st ) + εη,t+h .
(8)
Equation (6) is often used to test the unbiasedness of the forward rate as a predictor of the
future spot rate. The interpretation of the beta identity is that any systematic deviation of
the forward rate from the future realized spot rate would produce a β that deviates from one.
This deviation can be explained by a time-varying risk premium (represented by a nonzero
covariance of ρ with the forward premium) in (7) and/or an expectational error (represented
by a nonzero covariance of η with the forward premium) in (8). Thus, tests of rational
expectations are necessary to explain whether the deviation from uncovered interest rate
parity (given covered interest rate parity) is due to the failure of a constant risk premium
and/or the failure of rational expectations.
The ability to decompose deviations from UIP into time-varying risk premium and systematic forecast errors components also has implications for policymakers. First, according
to the portfolio balance model, if a statistically significant time-varying risk premium component is found, this means that it − i∗t is time-varying, which in turn implies that foreign
and domestic bonds are not perfect substitutes; changes in relative quantities (which are
reflected in changes in current account balances) will affect the interest rate differential. In
this way, sterilized official intervention can have significant effects on exchange rates. Second,
if a statistically significant expectational error of the destabilizing (e.g., “bandwagon”) type
is found, and policymakers are more rational than speculators, a policy of “leaning against
5
the wind” could have a stabilizing effect on exchange rate movements. (See Cavaglia et al.
1994.)
More generally, monetary models of the exchange rate (in which the UIP condition is
embedded), which assume model-consistent (i.e., rational) expectations with risk-neutrality,
generally have not performed well empirically, especially in out-of-sample forecasting. (See,
e.g., Bryant 1995.) One would like to be able to attribute the model failure to some combination of a failure of the structural assumptions or a failure of the expectational assumption.
3
Why test rational expectations with survey forecast data?
It is clear that, to test rational expectations or risk neutrality, rather than maintain one or
the other as an untested assumption, one must be able to operationalize sem,t,h , the market’s
expectation of the future spot rate. Since this variable is unobservable, one must seek
reasonable proxies. Beginning with Frankel and Froot (1987) and Froot and Frankel (1989),
much of the literature examining exchange rate rationality in general, and the decomposition
of deviations from UIP in particular, has employed the representative agent assumption to
justify using the mean or median survey forecast as a proxy for the market’s expectation.
In both studies, Frankel and Froot found significant evidence of irrationality. Subsequent
research has found mixed results.
3.1 Why test rational expectations with disaggregated survey forecast data?
Liu and Maddala (1992, p. 366) articulate the mainstream justification for using aggregated forecasts in tests of the “Although ...data on individuals are important to throw light
on how expectations are formed at the individual level, to analyze issues relating to market
efficiency, one has to resort to aggregates.” In fact, Muth’s (1961)[p. 316] original definition
of rational expectations seemed to allow for the possibility that rationality could be applied
to an aggregate (e.g., mean or median) forecast. “. . . [E]xpectations of firms (or, more gen-
6
erally, the subjective probability distribution of outcomes) tend to be distributed, for the
same information set, about the predictions of the theory (or the ’objective’ probability distribution of outcomes.)” However, if individual forecasters have different information sets,
Muth’s definition does not apply. To take the simplest example, the (current) mean forecast
is not in any forecaster’s information set, since all individuals’ forecasts must be made before
a mean can be calculated. Thus, current mean forecasts contain private information (see
MacDonald, 1992) and therefore cannot be tested for rationality. Nevertheless, until the
1990s, few researchers tested for the rationality of individual forecasts, even when that data
was available.
Using the mean forecast also produces inconsistent parameter estimates. Figlewski and
Wachtel (1983) were the first to show that, in an unbiasedness equation, the presence of
private information variables in the mean forecast error sets up a correlation with the mean
forecast. This inconsistency occurs even if all individual forecasts are rational. In addition,
Keane and Runkle (1990) pointed out that, when some forecasters are irrational, using the
mean forecast may lead to false acceptance of the unbiasedness hypothesis, in the unlikely
event that offsetting individual biases allow parameters to be consistently estimated. (See
also Bonham and Cohen (2001), who argue that, in the case of cointegrated targets and
predictions, inconsistency of estimates in rationality tests using the mean forecast can be
avoided if corresponding coefficients in the individual rationality tests pass a test for microhomogeneity.)
A large theoretical literature examines how individual expectations are mapped into an
aggregate market expectation, and whether the latter leads to market efficiency. (See, e.g.,
Figlewski 1978, 1982, 1984; Kirman 1992; Haltiwanger and Waldman 1989.) In Haltiwanger
and Waldman’s (1989) theoretical model, a group of forecasters may exhibit rational expectations collectively, even when the members of the group do not. That is, “standard
[i.e., individual] and aggregate rational expectations typically yield different equilibria and
. . . the size of the difference depends positively on the degree of synergism [i.e., correlation
7
of forecast errors, even when there is no private information]” (Haltiwanger and Waldman,
1989, p. 619).6 For example, one could question whether the equal weights used to construct
the mean forecast were biased estimates of the weights in the true market expectation. If so,
the mean forecast contains measurement error–another source of inconsistency, in addition
to the existence of private information. We discuss sources of measurement error at the
individual level in section 4 below.7 This is a special case of the more general measurement
error issue related to the size and representativeness of the sample.8
4
Rational reasons for the failure of the Rational Expectations Hypothesis using
disaggregated data
Other than a failure to process available information efficiently, there are numerous ex-
planations for a rejection of the REH. One set of reasons relates to measurement error in the
individual forecast. (Recall from above that a separate measurement error problem may occur with aggregated forecasts.) Researchers have long recognized that forecasts of economic
variables collected from public opinion surveys should be less informed than those sampled
from industry participants. However, industry participants, while relatively knowledgeable,
may not be properly motivated to devote the time and resources necessary to elicit their
best responses.9,10 The opposite is also possible. Having devoted substantial resources to
produce a forecast of the price of a widely traded asset, such as foreign exchange, forecasters
6
Our paper focuses on individual rationality but allows for the possibility of synergism by incorporating
heteroscedasticity and autocorrelation consistent standard errors in individual rationality tests and also
cross-forecaster correlation in tests of micro-homogeneity.
7
Frankel and Froot (1987) argue that, if the measurement error is random over all forecasters, then, under
the null hypothesis of rationality, it will cancel in the aggregate.
8
Prat and Uctum (2000) show that, if the exchange rate expectation is lognormally distributed, then
using an arithmetic, rather than geometric, average of individual forecasts introduces another source of
measurement error–this time with a constant bias.
9
For example, Keane and Runkle (1990) claimed that their rationality tests, using respondents from (what
is now) the Survey of Professional Forecasters, would be less contaminated by weak incentives.
10
An implication of strong incentives to produce accurate forecasts is that forecasters should tend to delay
reporting their forecasts until close to the survey’s deadline date, so as to accumulate as much relevant
information as possible.
8
may be reluctant to reveal their true forecast before they have had a chance to trade for
their own account.11 Furthermore, reported individual forecasts may not represent the mean
of the forecaster’s subjective probability distribution if that distribution is skewed and the
forecaster reports another measure of central tendency, e.g., the median.
Second, some forecasters may not have the symmetric quadratic loss function embodied
in typical measures of forecast accuracy, e.g., minimum mean squared error. (See Zellner
1986; Stockman 1987; Batchelor and Peel 1998.) In this case, the optimal forecast may
not be the MSE. In one scenario, related to the incentive aspect of the measurement error
problem, forecasters may have strategic incentives involving product differentiation. For example, consider a forecast industry in which most forecasters tend to follow the herd and
produce relatively homogeneous forecasts. In this situation, it may pay for some forecasters
with sufficient reputational capital to produce extreme forecasts, which enhance the forecasters’ brand-name recognition more than it (subsequently) damages their record for forecast
accuracy. Laster et al. (1999) called this practice “rational bias.” Lamont (2002) found evidence that the tendency of forecasters to produce extreme forecasts increased with age and
experience. Ehrbeck and Waldmann (1996) tested hypotheses in which a less able forecaster
moderates his personal forecast by weighting it with the prediction pattern of more able
forecasters (see also Batchelor and Dua 1990a,b, 1992).
In addition to strategic behavior, another scenario in which forecasters may deviate from
the symmetric quadratic loss function is simply to maximize trading profits. This requires
predicting the direction of change, regardless of MSE.12 (See Elliott and Ito 1999; Boothe and
Glassman 1987; LeBaron 2000; Leitch and Tanner 1991; Lai 1990. Goldberg and Frydman
11
To mitigate the confidentiality problem in this case, the survey typically withholds individual forecasts
until the realization is known, or (as with the JCIF) masks the individual forecast by only reporting some
aggregate forecast (at the industry or total level) to the public.
12
This type of loss function may appear to be relevant only for relatively liquid assets such as foreign
exchange, but not for macroeconomic flows. However, the directional goal is also used in models to predict
business cycle turning points. Also, trends in financial engineering may lead to the creation of derivative
contracts in macroeconomic variables, eg., CPi futures.
9
(1996) and Pilbeam (1995) generated directional forecasts from structural models of the
exchange rate.)
Third, there may be a peso problem. The expectations formation process incorporates
a rational probability of a future shift in the exchange rate regime. It is significant that,
whichever regime is reflected in the realized exchange rate, the forecast error is biased and
serially correlated as long as expectations are a probability-weighted average of exchange
rates in two different regimes.
Fourth, and analogous to the peso problem, is the phenomenon of learning. The expectations formation process incorporates a rational probability of a past shift in the exchange
rate regime. Again, while learning whether the regime has shifted, the forecast error is biased
and serially correlated (See Lewis (1995)). Both the peso problem and rational learning are
small sample problems in the sense that, once sufficient learning has occurred that forecasters
have completely adjusted to one regime or the other (perhaps the new regime), the rational
forecast error will be unbiased.
Fifth, rational speculative bubbles may exist. In contrast to the previous theories for
the failure of the REH, which assumed a single representative agent model of the forecast
generating process, rational bubbles require some diversity in the FGP. Despite the fact that
informed traders know that the current exchange rate is above the equilibrium determined
by fundamentals,13 they recognize the market significance of momentum traders (either uninformed (i.e., “noise”) traders or technical traders (chartists)), who are characterized by
“bandwagon”, rather than regressive or rational, expectations. Fundamentalists then might
believe that the directional momentum of the exchange rate will continue in the short run.
At some point, when the deviation of the exchange rate from its fundamental value reaches
a threshold, regressive expectations will dominate, the bubble will burst, and the exchange
rate will fall rapidly. The difficult issue for traders is to figure out what the threshold exchange rate is that marks the transition from short run to long run expectations. The most
13
The fundamentals are represented by the solution to a particular structural model of the exchange rate.
10
obvious implication of rational bubbles is that the larger swings in the exchange rate will
produce excess volatility, relative to what would be expected without the bubble. However,
it is also true that the asymmetric probability distribution of the bubble component of the
exchange rate (i.e., slow expansion and rapid deflation) will be reflected in the implied rational expectation forecast errors. The skewness in the forecast errors is a violation of the
normality assumption of rational expectations forecast errors (see Sarno and Taylor, 2002,
p. 27).
Peso problems, learning about past regime shifts, and rational bubbles all have in common
an attempt by the forecaster to distinguish between temporary and permanent shifts in the
DGP. Flood and Hodrick (1990) note that, because all three theories are characterized by
skewness in rational forecast errors, the theories may be difficult to distinguish empirically.
Further, these explanations for the failure of the REH assume that deviations from rational
forecasts are temporary. However, Sarno and Taylor (2002) observe that the statistically
significant deviation from uncovered interest rate parity (as measured by a β in equation (6)
that is closer to minus one than plus one) cannot be explained as a temporary phenomenon.
Since nonrejection of the unbiasedness and efficiency tests is necessary for the rationality
of each forecaster, it follows that failure of the micro-homogeneity restrictions in unbiasedness
or efficiency tests, i.e., a minimum amount of diversity, indicates irrationality on the part
of some forecasters. The converse does not necessarily hold, since a failure to reject microhomogeneity could conceivably be due to the same degree of irrationality of each individual
in the panel. In the next section we investigate the empirical evidence for these conjectures.
5
Rationality tests
As discussed above, rational expectations is an important maintained hypothesis in many
representative agent models. If tests of these models produce rejections, it is useful to be
able to determine whether the failure is due to a failure of the structural assumption (e.g.,
uncovered interest parity) alone or a failure of the rational expectations assumption or both.
11
A natural progression of rationality testing ascends from unbiasedness to (weak) efficiency
with respect to the forecast variable’s own history to (strong) efficiency with respect to
other variables. Yet, it is important to note that the result from one type of rationality test
does not have implications for the results from any other types of rationality tests. Here
we test for unbiasedness and weak efficiency, leaving the more stringent tests of publicly
available information for future analysis. We also relate the “conventional” unbiasedness
test (bivariate regression of the realization on a constant and the forecast, with a null that
the slope equals one and the intercept equals zero) to the alternative specification (univariate
regression of the forecast error on a constant, with the null that the constant equals zero).
Finally, we compare tests in levels of the variables to tests in first differences.
5.1 Unbiasedness tests
Expressing the target and forecast in first difference form,14 the simple unbiasedness
equation is specified as
ηi,t,h = st+h − sei,t,h = αi,h + ǫi,t,h ,
(9)
where ηi,t,h is the forecast error of individual i, for an h-period-ahead forecast made at time
t.15 The results are reported in Table 1A. For the one-month horizon, unbiasedness cannot
be rejected at conventional significance levels for any group. For the three-month horizon,
unbiasedness is rejected only for exporters (at a p-value of 0.03). For the six-month horizon,
unbiasedness is rejected at the 5% level only for exporters, though it rejected at the 10% level
for all groups. The direction of the bias for exporters is negative; that is, they systematically
underestimate the value of the yen, relative to the dollar. Ito (1990) found the same tendency
using only the first two years of survey data (1985-1987). He characterized this depreciation
bias as a type of “wishful thinking” on the part of exporters.
14
Over the sample period the realized yen/dollar exchange rate is I(1). Unit root and cointegration test
results are in section 5.4 below.
15
Throughout this paper, we use the Newey-West (1987) heteroscedasticity and autocorrelation consistent
(HAC) variance-covariance matrix to capture the M A(h − 1) lag structure of individual regression residuals.
12
In addition to testing the unbiasedness hypothesis at the individual level, as described
in 3.1, we are interested in the degree of heterogeneity of coefficients across forecasters. The
null of micro-homogeneity is given by H0 : αih = αjh , for all i, j 6= i.
Before testing for homogeneous intercepts in equation (9) we must specify the form for
our GMM system variance-covariance matrix. We allow for cross-forecaster correlation,
including lagged cross-correlations through lag h − 1. Previous authors have modeled crosscorrelations without explicitly testing for their existence. They typically justify an M A(h−1)
VCV structure by appealing to information lags.16 In contrast, we use Pesaran’s (2004) CD
(cross-sectional dependence) test to check for contemporaneous correlations among pairs of
forecasters.
CD =
s
N
N
−1 X
X
2T
ρ̂ij ,
N (N − 1) i=1 j=1+1
(10)
where T is the number of time periods, N = 4 is the number of individual forecasters, and
ρ̂ij is the sample correlation coefficient between forecasters i and j, i 6= j. Under the null
hypothesis of no cross-correlation, CD ∼ N (0, 1).17 See Table 6 for CD test results. All
a
contemporaneous correlations are significant at the 0% level.18
In the four-equation system used to test the micro-homogeneity restrictions, we replace
Zellner’s (1962) SUR variance-covariance matrix with a GMM counterpart that incorporates the Newey-West single equation corrections plus allowances for corresponding crosscovariances, both contemporaneous and lagged. See Appendix 2 for details. Despite having
only two failures of unbiasedness at the 5% level over all three horizons, microhomogeneity is
rejected at a level of virtually zero for all horizons. The rejection of microhomogeneity at the
16
Keane and Runkle (1990) provide some empirical support for their modeling of cross-sectional correlations, noting that the average covariance between a pair of forecasters is 58% of the average forecast
variance.
17
Unlike Breusch and Pagan’s (1980) LM test for cross-sectional dependence, Pesaran’s (2004) CD test is
robust to multiple breaks in slope coefficients and error variances, as long as the unconditional means of the
variables are stationary and the residuals are symmetrically distributed.
18
CD tests for lagged cross-sectional dependencies of specific orders are also possible. We would expect
these tests to be statistically significant for lags of h - 1 or less. In fact, this is the assumption we make in
the system variance-covariance matrix, described next.
13
one-month horizon occurs despite the failure to reject unbiasedness for any of the industry
groups. We hypothesize that the consistent rejection of micro-homogeneity regardless of the
results of individual unbiasedness tests is the result of sufficient variation in individual bias
estimates as well as precision in these estimates.
5.2 Explicit tests of weak efficiency
The literature on rational expectations exhibits even less consensus as to the definition
of efficiency than it does for unbiasedness. In general, an efficient forecast incorporates
all available information—private as well as public. It follows that there should be no
relationship between forecast error and any information variables known to the forecaster
at the time of the forecast. Weak efficiency commonly denotes the orthogonality of the
forecast error with respect to functions of the target and prediction. For example, there is no
contemporaneous relationship between forecast and forecast error which could be exploited
to reduce the error. Strong efficiency denotes orthogonality with respect to the remaining
variables in the information set. Below we perform two types of weak efficiency tests. In the
first type, we regress each group’s forecast error on three sets of weak efficiency variables.
1. Single and cumulative lags of the mean forecast error (lagged one period):
st+h − sei,t,h = αi.h +
h+7
X
βi,t+h−k (st+h−k − sem,t+h−k,h ) + ǫi,t,h
(11)
k=h+1
2. Single and cumulative lags of mean expected depreciation (lagged one period)
st+h − sei,t,h = αi.h +
h+7
X
βi,t+h−k (sem,t+h−k,h − st−k ) + ǫi,t,h
(12)
k=h+1
3. Single and cumulative lags of actual depreciation
st+h − sei,t,h = αi.h +
h+6
X
βi,t+h−k (st+h−k − st−k ) + ǫi,t,h
k=h
14
(13)
Notice that the first two sets of weak efficiency variables include the mean forecast, rather
than the individual group forecast. Our intention is to allow a given group to incorporate
information from other groups’ forecasts via the prior mean forecast. This requires an extra
lag in the information set variables, relative to a contemporaneously available variable such
as the realized exchange rate depreciation.
For each group and forecast horizon, we regress the forecast error on the most recent
seven lags of the information set variable, both singly and cumulatively. We use a Wald test
of the null hypothesis αi,h = βi,t+h−k = 0 and report chi-square test statistics, with degrees
of freedom equal to the number of micro-homogeneity restrictions. If we were to perform
only simple regressions (i.e., on each lag individually), estimates of coefficients and tests of
significance could be biased toward rejection due to the omission of relevant variables. If
we were to perform only multivariate regressions, tests for joint significance could be biased
toward nonrejection due to the inclusion of irrelevant variables. It is also possible that
joint tests are significant but individual tests are not. This will be the case when the linear
combination of (relatively uncorrelated) regressors spans the space of the dependent variable,
but individual regressors do not.
In the only reported efficiency tests on JCIF data, Ito (1990) separately regressed the
forecast error (average, group, and individual firm) on a single lagged forecast error, lagged
forward premium, and lagged actual change. He found that, for the 51 biweekly forecasts
between May 1985 and June 1987, rejections increased from a relative few at the one- or threemonth horizons to virtual unanimity at the six-month horizon. When he added a second
lagged term for actual depreciation, rejections increased “dramatically” for all horizons.
The second type of weak efficiency tests uses the Breusch-Godfrey(1978) LM test for the
null of no serial correlation of order k=h or greater, up to order k=h+6, in the residuals of
15
the forecast error regression, equation (11).19 The LM test proceeds as follows. First, use
OLS to estimate the equation
st+h − sei,t,h = αi,h +
h−1
X
βi,k (st+h−k − sei,t−k,h ) + ǫi,t,h
(14)
k=1
Second, successively regress the residuals ǫ̂i,t,h from equation (14) on all the regressors in
that equation along with k = h, ..., h + 6 lags of ǫ̂i,t,h :
ǫ̂i,t,h = αi,h +
h−1
X
βi,k (st+h−k − sei,t−k,h ) +
k=1
h+6
X
φi,l ǫ̂i,t−l,h + ηi,t,h .
(15)
l=h
Third, successively compute the F-statistic for H0 : φi,h = . . . = φi,h+6 = 0.20 Results are
presented in Tables 3A-5D. For each group, horizon, and variable, there are seven individual
tests, i.e., on a single lag, and six joint tests, i.e., on multiple lags. These 13 tests are
multiplied by four groups times three horizons times three weak efficiency variables for a
total of 468 efficiency tests.
Using approximately nine more years of data than Ito (1990), we find many rejections.
In some cases, nearly all single lag tests are rejected, yet few if any joint tests are rejected.
(See, e.g., expected depreciation at the three- and six-month horizons.) In other cases, nearly
all joint tests are rejected, but few individual tests. (See, e..g, actual depreciation at the
three-month horizon and expected depreciation at the six-month horizon.) Remarkably, all
but one LM test for serial correlation at a specified lag produces a rejection at less than
a 10% level, with most at less than a 5% level. Thus, it appears that the generality of
the alternative hypothesis in the LM test permits it to reject at a much greater rate than
the conventional weak efficiency tests, in which the variance-covariance matrix incorporates
the Newey-West correction for heteroscedasticity and serial correlation. Finally, unlike Ito
(1990), we find no strong pattern between horizon length and number of rejections.
19
This is a general test, not only because it allows for an alternative hypothesis of higher-order serial
correlation of specified order, but also because it allows for serial correlation to be generated by AR, MA or
ARMA processes.
20
We use the F-statistic because the χ2 test statistics tend to over-reject, while the F-tests have more
appropriate significance levels (see Kiviet (1987)).
16
In addition to testing the weak efficiency hypothesis at the individual level, we are interested in the degree of heterogeneity of coefficients across forecasters. Here the null of
micro-homogeneity is given by H0 : φil = φjl , for l = h, . . . h + 6, for all i, j 6= i. There
are 468 tests/4 groups = 117 micro-homogeneity tests. The null hypothesis of equal coefficients is H0 : αi,h = αj,h , βi,t+h−k = βj,t+h−k for all i, j 6= i As with the micro-homogeneity
tests for unbiasedness, our GMM variance-covariance matrix accounts for serial correlation
of order h-1 or less, generalized heteroscedasticity, and cross-sectional correlation or order
h-1 or less. We report χ2 (n) statistics, where n is the number of coefficient restrictions, with
corresponding p-values. Rather than perform all 117 micro-homogeneity tests, we choose a
sample consisting of the shortest and longest lag for which there are corresponding individual and joint tests (i.e., for the k=h+1st and k=h+6th lag). Thus, there are 4 tests (two
individual and two corresponding joint tests) times three horizons times three variables for
a total of 36 tests. Every one of the micro-homogeneity tests are rejected at the 0% level.
5.3 Joint tests of unbiasedness and weak efficiency
Many, perhaps most, empirical tests of the “unbiasedness” of survey forecasts are conducted using the bivariate regression equation21
st+h − st = αi,h + βi,h (sei,t,h − st ) + εi,t,h .
(16)
It is typical for researchers to interpret their non-rejection of the joint null (αi,h , βi,h ) = (0, 1)
as evidence of simple unbiasedness. However, Holden and Peel (1990) show that this result
is a sufficient, though not a necessary, condition for unbiasedness. The intuition for the
lack of necessity comes from interpreting the right-hand-side of the bivariate unbiasedness
regression as a linear combination of two potentially unbiased forecasts: a constant equal
to the unconditional mean forecast plus a variable forecast, i.e., st+h − st = (1 − βi,h ) ×
E(sei,t,h − st ) + βi,h (sei,t,h − st ) + εi,t,h . Then the intercept is αi,h = (1 − βi,h ) × E(sei,t,h − st ).
21
Our variance-covariance matrices for both the individual equations and the 4-equation system parallel
those of the unbiasedness test in the previous subsection. Again, see Appendix 2 for details.
17
The necessary and sufficient condition for unbiasedness is that the unconditional mean of the
subjective expectation E[sei,t,h −st ] equal the unconditional mean for the objective expectation
E[st+h − st ]. However, this equality can be satisfied without αi,h being equal to zero, i.e.,
βi,h = 1.
Figure 1 shows that an infinite number of αi,h , βi,h estimates are consistent with simple
unbiasedness. The only constraint is that the regression line intersect the 45 degree ray
from the origin where the sample mean of the forecast and target are equal. Note that, in
the case of differenced variables, this can occur at the origin, so that αi,h = 0, but βi,h is
unrestricted (see Figure 2). It is easy to see why simple unbiasedness holds: in Figures 1
and 2 the sum of all horizontal deviations from the 45 degree line to the regression line, i.e.,
forecast errors, equal zero.22 To investigate the rationality implications of different values
for αi,h and βi,h , we follow Clements and Hendry (1998) and rewrite the forecast error in the
bivariate regression framework of (16) as
ηi,t,h = st+h − sei,t,h = αi,h + (βi,h − 1)(sei,t,h − st ) + εi,t,h
(17)
A special case of weak efficiency occurs when the forecast and forecast error are uncorrelated, i.e.,
E[ηi,t,h (sei,t,h − st )] = 0
(18)
= αi,h E(sei,t,h − st ) + (βi,h − 1)E(sei,t,h − st )2 + E[εi,t,h (sei,t,h − st )]
Thus, satisfaction of the joint hypothesis (αi,h , βi,h ) = (0,1) is also sufficient for weak efficiency with respect to the current forecast. However, it should be noted that (18) may
still hold even if the joint hypothesis is rejected. Thus, satisfaction of the joint hypothesis
represents sufficient conditions for both simple unbiasedness and this type of weak efficiency,
However, when αi,h 6= 0, and αi,h 6= (1 − βi,h ) × E(sei,t,h − st ), there is bias regardless of the value of βi,h .
See Figure 3, where the bias, E(st+h − sei,t,h ), is measured as the horizontal distance from the point on the
regression line where E(sei,t,h − st ) = E(st+h − st ) to the 45 degree line, implying systematic underforecasts.
22
18
but necessary conditions for neither.23 Finally, satisfaction of the joint hypothesis is neither
necessary nor sufficient for lack of serially correlated forecast errors—one requirement for
weak efficiency. To see this, note that
E[ηi,t,h ηi,t−1,h ]
n
o
= E [αi,h + (βi,h − 1)(sei,t,h − st ) + εi,t,h ][αi,h + (βi,h − 1)(sei,t−1,h − st−1 ) + εi,t−1,h ]
= αi,h 2 + αi,h (βi,h − 1)E(sei,t−1,h − st−1 ) + αi,h (βi,h − 1)E(sei,t,h − st ) +
(19)
(βi,h − 1)2 E[(sei,t,h − st )(sei,t−1,h − st−1 )] + E[εi,t,h εi,t−1,h ]
This expression may equal zero even if αi,h 6= 0 and βi,h 6= 1. On the other hand, if αi,h = 0,
and βi,h = 1, then serially correlated residuals imply serially correlated forecast errors.24
Recall that using the univariate regression to test the simple unbiasedness hypothesis, we
found only two rejections of unbiasedness at the 5% level (for exporters at the three- and sixmonth horizons). At the same time, we found strong rejections of microhomogeneity for all
forecasts at all horizons. Turning now to tests of the joint hypothesis (αi,h , βi,h ) = (0,1) in
Table 2, we find very different results. The joint hypothesis is rejected at the 5% significance
level for all groups at the one-month horizon (indicating the possible role of inefficiency with
respect to the current forecast), but only for the exporters at the three- and six- month
horizons. This stands in contrast to the simple unbiasedness results, where performance
deteriorated with increases in the horizon. Also in contrast to the simple unbiasedness
results, there are no rejections of microhomogeneity. Clearly, the results depend on which
definition of unbiasedness-nonrejection of the simple or joint hypothesis–one adopts!
23
If βi,h = 1, then, whether or not αi,h = 0, the variance of the forecast error equals the variance of
the bivariate regression residual, since then var(ηi,t,h ) = (βi,h − 1)2 var(sei,t,h − st ) + var(εi,t,h ) + 2(βi,h −
1)cov[(sei,t,h − st ), εi,t,h ] = var(εi,t,h ). Figure 3 illustrates this point. Mincer and Zarnowitz (1969) required
only that βi,h = 1 in their definition of forecast efficiency. If in addition to βi,h = 1, αi,h = 0, then the mean
square forecast error also equals the variance of the forecast. Mincer and Zarnowitz emphasized that, as long
as the loss function is symmetric, as is the case with a minimum mean square error criterion, satisfaction of
the joint hypothesis implies optimality of forecasts.
24
In Figure 3, where αi,h 6= 0, but βi,h = 1, serial correlation in the forecast error will occur unless
2
αi,h = −E[εi,t,h εi,t−1,h ].
19
The significance of the αi,h ’s in the joint regressions (16) generally follows the same
pattern as in the unbiasedness tests (9), except with slightly more evidence against the null
that this parameter equals zero. However, the null that βi,h = 1 is rejected for all groups
at the one-month horizon, but only for the exporters at the three- and six-month horizons.
This implies that weak efficiency with respect to the current forecast fails at the one-month
horizon, but not at the longer horizons, with only two exceptions. Thus, it appears that
tests of the joint hypothesis at the one-month horizon are rejected due to failure of this type
of weak efficiency, not simple unbiasedness.25 Tests of the joint hypothesis fare better at the
longer horizons primarily because this type of weak efficiency is rarely rejected, despite the
uniform deterioration in the simple unbiasedness statistics in Table 1. This longer-horizon
result is unusual, because weak efficiency is usually taken to be a more demanding criterion
than unbiasedness. However, the longer-horizon result may not be as conclusive as the βi,h
statistics suggest. For all tests at all horizons, the null hypothesis that βi,h equals zero
also cannot be rejected. Thus, for the longer two horizons (with just the one exception for
exporters at the three-month horizon), hypothesis testing cannot distinguish between the null
hypotheses that βi,h equals one or zero. Therefore, we cannot conclude that weak efficiency
with resepect to the forecast prevails when unbiasedness does not. The failure to precisely
estimate the slope coefficient also produces R2 s that are below 0.05 in all regressions. This
lack of power is at least consistent with the failure to reject microhomogeneity at all three
horizons. The conclusion is that testing only the joint hypothesis has the potential to obscure
the difference in performance between the unbiasedness and weak efficiency tests. This
conclusion is reinforced by an examination of figures 4 - 6, the scatter plots and regression
lines for the bivariate regressions.26 All three scatter plots have a strong vertical orientation.
25
For this reason, Mincer and Zarnowitz (1969) and Holden and Peel (1990) suggest that, if one begins by
testing the joint hypothesis, rejections in this first stage should be followed by tests of the simple unbiasedness
hypothesis in a second stage. Only if unbiasedness is rejected in this second stage should one conclude that
forecasts are biased. For reasons described immediately following, our treatment reverses the order, so that
unbiasedness and weak efficiency can be separately assessed for each forecast.
26
Note that, for illustrative purposes only, we compute the expectational variable as the four-group average
percentage change in the forecast. However, recall that, despite the failure to reject micro-homogeneity at
20
With this type of data, it is easy to find the vertical midpoint and test whether it is different
from zero. Thus, (one-parameter) tests of simple unbiasedness are feasible. However, it is
difficult to fit a precisely estimated regression line to this scatter, because the small variation
in the forecast variable inflates the standard error of the slope coefficient. This explains why
the βi,h ’s are so imprecisely estimated that the null hypotheses that βi,h = 1 and 0 are
simultaneously not rejected. Because of the correlation between the slope and intercept,
the intercepts also have relatively high standard errors. For all regressions except exporters
at the three- and six-month horizons, the standard error of the intercept is larger in the
bivariate regression than in the simple unbiasedness regression. This also explains why the
R2 s are so low. Thus, examination of the scatter plots also reveal why bivariate regressions
are potentially misleading about weak efficiency as well as simple unbiasedness. That is
why, in the previous subsection, we tested weak efficiency directly, using the past history
of exchange rate changes, forecast changes, and forecast errors.27 Thus, in contrast to both
Mincer and Zarnowitz (1969) and Holden and Peel (1990), we prefer to separate tests for
simple unbiasedness from tests for (all types of) weak efficiency at the initial stage. This
obviates the need for a joint test.28
More fundamentally, the relatively vertical scatter of the regression observations around
the origin is consistent with an approximately unbiased forecast of a random walk in exchange
rate levels. Other researchers (e.g., Bryant (1995) have found similar vertical scatters for
regressions where the independent variable, e.g., the forward premium/discount ft,h − st ,
the “exchange risk premium” ft,h − st+h , or the difference between domestic and foreign
any horizon, the Figlewski-Wachtel critique implies that these parameter estimates are inconsistent in the
presence of private information. (See the last paragraph in this subsection.)
27
As Nordhaus (1987, p. 673) puts it, “A baboon could generate a series of weakly efficient forecasts by
simply wiring himself to a random-number generator, but such a series of forecasts would be completely
useless.” For these unbiasedness regressions, however, forecast efficiency, even weak form, appears difficult
to support.
28
However, in the general case of biased and/or inefficient forecasts, Mincer and Zarnowitz (1969, p. 11)
also viewed the bivariate regression “as a method of correcting the forecasts . . . to improve [their] accuracy
. . . Theil (1966, p.33) called it the ‘optimal linear correction.”’ That is, the correction would involve 1)
subtracting αi,h , then 2) multiplying by 1/βi,h . Graphically, this is a translation of the regression line
followed by a rotation, until the regression line coincides with the 45 degree line.
21
interest rates (i − i∗ ), exhibits little variation. As Bryant (1995)[p. 51] lamented in reporting
corresponding regressions using a shorter sample from the JCIF, “the regression... is...not one
to send home proudly to grandmother.” Bryant (1995) drew the conclusion that “analysts
should have little confidence in a model specification [e.g., uncovered interest parity] setting
[the average forecast] exactly equal to the next-period value of the model...model-consistent
expectations...presume a type of forward-looking behavior [e.g., weak efficiency] that is not
consistent with survey data on expectations” (p. 40).
Finally, even if the bivariate regressions were correctly interpreted as joint tests of unbiasedness and weak efficiency with respect to the current forecast, and even if the regressions
had sufficient power to reject a false null, the micro-homogeneity tests would be subject
to additional econometric problems. Successfully passing a pre-test for micro-homogeneity
does not ensure that estimated coefficients from either aggregated or pooled bivariate unbiasedness regressions will be consistent. Figlewski and Wachtel (1983) showed that, even
if all forecasters produced unbiased forecasts, the existence of private information sets up a
nonzero correlation between the consensus forecast and the consensus forecast error, leading to inconsistent coefficient estimates. Their argument also holds if all forecasters are
“equally” irrational.29 This problem with consensus tests of the REH led a number of authors to argue for the pooling of disaggregated data.30 However, Zarnowitz (1985) showed
that, because each forecaster shares the same dependent variable in a bivariate regression,
there is cross-sectional dependence between the forecast and the forecast error, which again
29
The extent to which private information influences forecasts is more controversial in the foreign exchange
market than in the equity or bond markets. On the one hand, Chionis and MacDonald (1997, p. 211) claim
that “in the foreign exchange market there is little or no inside information that would account for the
heterogeneity of expectations.” On the other hand, they do cite the early exchange rate microstructure work
of Lyons. According to the latter author (Lyons, 2002, p. 61), “That order flow explains such a large
percentage of price moves underscores the inadequacy of this public info framework. The information the
FX market is aggregating is much subtler than the textbook models assume.” To the extent that one agrees
with the market microstructure emphasis on the importance of the private information embodied in dealer
order flow, the Figlewski-Wachtel critique remains valid.
30
These authors preferred pooling individual data for reasons of increased degrees of freedom and possibly
enhanced interpretability of estimated coefficients. This rationale is especially compelling for surveys in
which individuals’ forecasts frequently suffer from short time spans and typically contain large numbers of
missing observations. (See, e.g., Keane and Runkle (1990).)
22
leads to inconsistent parameter estimates. Thus, failure to reject micro-homogeneity does
not permit a test of unbiasedness using consensus or pooled data. On the other hand,
Bonham and Cohen (2001) showed that, if the realization and forecast were integrated and
cointegrated, both the Figlewski-Wachtel and Zarnowitz critiques break down asymptotically. In this case, micro-homogeneity could serve as a useful pretest for both aggregation
and pooling. Thus, we repeat the joint tests in the bivariate regression using the levels of
the forecast and realization.
5.4 Unbiasedness Tests on Levels
We undertake unbiasedness tests on levels for three reasons. Assume that the realization
and forcasts are integrated and cointegrated. First, as mentioned above, the cointegrating
(bivariate) regressions avoid both aggregation bias (due to Figlewski and Wachtel (1983)) and
pooling bias (due to Zarnowitz (1985)) present in bivariate regressions of stationary variables
(see Bonham and Cohen (2001)). Second, the differenced version of the joint unbiasedness
regression is misspecified. The Engel-Granger (1987) representation theorem proves that a
cointegrating regression such as the levels joint regression (16) has an error correction form
that includes both differenced variables and an error correction term in levels. Under the joint
null, the error correction term is the forecast error, and the differenced specification omits
this levels information. (See also Hakkio and Rush (1989).) Third, and most importantly, we
can investigate the possibility that the imprecise coefficient estimates in the longer horizon
bivariate regressions (16) are an artifact of a misspecified bivariate regression in differences.
We test for unbiasedness and weak efficiency in levels using Liu and Maddala’s (1992)
method of “restricted cointegration.” This specification imposes the joint restriction αi,h = 0,
βi,h = 1 on the bivariate regression
st+h = αi,h + βi,h sei,t,h + εi,t,h
(20)
and tests whether the residual (the forecast error) is non-stationary. In a bivariate regression,
any cointegrating vector is unique. Therefore, if we find that the forecast errors are station23
ary, then the joint restriction is not rejected, and (0,1) must be the unique cointegrating
vector.31
To test the null hypothesis of a unit root, we estimate the augmented Dickey-Fuller(1979)
(ADF) regression
∆yt+1 = α + βyt + γt +
p
X
θt δyt+1−p + ǫt+1
(21)
t=1
where y is the level and first difference of the spot exchange rate, the level and first difference
of each group forecast, including the mean forecast, and the forecast error (i.e., the residual
from the “restricted” cointegrating equation). The number of lagged differences to include
in (21) is chosen by adding lags until a Lagrange Multiplier test fails to reject the null
hypothesis of no serial correlation (up to lag 12). We test the null hypothesis of a unit root
(i.e., β = 0) with the ADF t and z tests. We also test the joint null hypothesis of a unit
root and no linear trend (i.e.,β = 0 and γ = 0).
A stationary forecast error is a necessary condition for forecast rationality. We interpret
unbiasedness plus lack of serial correlation in the residuals of the forecast error regression
(determined by LM efficiency tests discussed in section 5.2) as the necessary condition for
forecast rationality.
At the one-month horizon, the null of a unit root in the residual of the restricted cointegrating regression (i.e., the forecast error) is rejected at the 1% level for all groups. (See
Table 6.) Hence, inference in the simple unbiasedness tests in Table 1A is valid; we do
not reject unbiasedness for any group. However, for every group, all seven LM tests (for
cumulated lags two through eight) show evidence of serial correlation at less than a 1% level.
Thus, we reject weak efficiency. This is roughly consistent with the differenced bivariate
unbiasedness test results in Table 2A, where we rejected the joint hypothesis of α = 0, β = 1
31
Note that the Holden and Peel (1990) critique does not apply in the I(1) case; because the interecept
cannot be an unbiased forecast of a nonstationary variable. Thus, the cointegrating regression line of the
level realization on the level forecast must have both α = 0 and β = 1 for unbiasedness to hold. This differs
from Figure 2, the scatterplot in differences, where αi,h = 0 but βi,h 6= 1. Intuitively, the reason for the
difference in results is that the scatterplot in levels must lie in the first quadrant, i.e., no negative values of
the forecast or realization.
24
(which is a joint test of unbiasedness and efficiency with respect to the current forecast) at
less than the 10% level for all groups.
We find nearly identical results at the three-month horizon: the forecast error is stationary
for all groups, but all seven LM tests again find evidence of serial correlation for all groups,
this time at less than the 7% level.
In contrast, at the six-month horizon, the evidence is clearly in favor of a unit root in
the forecast error, for all four groups.32 However, the tests for the unrestricted cointegrating
residual reject the null of a unit root for all groups. Thus, we reject the null that the
forecast and realization cointegrate with a slope of unity, and we also reject the null of
simple unbiasedness because a forecast error with a unit root cannot be mean zero. Given
our finding of a unit root in the forecast errors, the simple unbiasedness tests in Table 1C
are invalid. Finally, the differenced bivariate regressions in Table 2C are misspecified as they
omit the general equilibrium error (ε̂i,t,h = st − α̂i − β̂i sei,t,h ). Interestingly, these regressions
provide somewhat less evidence against the joint hypothesis of unbiasedness and lack of
serially correlated forecasts than do the unit root and LM tests. The hypothesis is rejected
at the 5% level only for exporters, at the 10% level for exporters and life insurance & import
companies. However, as discussed above, this may be due to the misspecification.
Overall, it is reassuring that the restricted cointegration tests are generally consistent
with the simple unbiasedness tests on the differenced variables. The only exception is for the
six-month forecasts, in which regression of the nonstationary forecast error on a constant is
invalid.33
There is also a cautionary note about the joint tests in differences. They confound
the separation of tests of simple unbiasedness (which by themselves are invalidated by the
32
Banks and brokers, as well as life insurance and import companies, failed to reject a unit root at the
10% level in two out of three of the unit root tests. Exporters failed to reject in all three tests.
33
So are the efficiency regressions of the forecast error on lags of the three differenced information set
variables.
25
presence of a unit root in the forecast error) with tests of weak efficiency. This is not true
of bivariate regressions in levels.
5.5 Ito’s heterogeneity tests
In Table 3, we replicate Ito’s (1990) and Elliott and Ito’s (1999) test for forecaster group
“heterogeneity,” which regresses the deviation of the individual forecast from the crosssectional average forecast on a constant.34,35
sei,t,h − sem,t,h = (αi,h − αm ) + (εi,t,h − εm,t )
(22)
To achieve consistent inference in the presence of both heteroscedasticity and serial correlation in the regression residuals, we employ a general Newey-West variance covariance matrix
with the lag window a function of the sample size.
One may view Ito’s “heterogeneity” tests as complementary to our micro-homogeneity
tests. On the one hand, one is not certain whether a single (or pair of?) individual rejection(s) of, say, the null hypothesis of a zero mean deviation in Ito’s test would result in a
rejection of micro-homogeneity overall. On the other hand, a rejection of micro-homogeneity
does not tell us which groups are the most significant violators of the null hypothesis. It
turns out that Ito’s mean deviation test produces rejections at a level of 6% or less for all
groups at all horizons except for banks and brokers at the one-month horizon and life insurance and import companies at the six-month horizon.36 Thus, it is not surprising that
micro-homogeneity tests on the four-equation system of Ito equations produce rejections at
34
Algebraically, Ito’s regression can be derived from the individual forecast error regression by substracting
the the mean forecast error regression. Recall that our group results are not entirely comparable to Ito’s
(1990), since our dataset, unlike his, combines insurance companies and trading companies into one group,
and life insurance companies and import-oriented companies into another group.
35
Chionis and MacDonald (1997) performed an Ito-type test on individual expectations data from Consensus Forecasts of London.
36
Elliott and Ito (1999), who have access to forecasts for the 42 individual firms in the survey, find that,
for virtually the same sample period as ours, the null hypothesis of a zero deviation from the mean forecast
is rejected at the 5% level by 17 firms at the one-month horizon, 13 firms for the three-month horizon, and
12 firms for the six-month horizon. These authors do not report results by industry group.
26
a level of virtually zero for all three horizons. Since Ito’s regressions have a similar form
(though not a similar economic interpretation) to the tests for simple unbiasedness in Table
1, it is not surprising that they have similar results. The intercept is estimated with sufficient
precision to allow rejection of the null hypothesis.
6
Conclusions
In this paper, we undertake a reexamination of the rationality and diversity of JCIF
forecasts of the yen-dollar exchange rate. In several ways we update and extend the seminal
paper by Ito (1990). Whereas Ito (1990) and Elliott and Ito (1999) measured diversity
as a statistically significant deviation of an individual’s forecast from the cross-sectional
average forecast, we perform a separate test of micro-homogeneity for each type of rationality
test–unbiasedness as well as weak efficiency–that we first conducted at the industry level.
Demonstrating that individual forecasters differ systematically in their forecasts (and FGPs)
has implications for the market microstructure research program. As Frankel and Froot
(1990, p. 182) noted, “the tremendous volume of foreign exchange trading is another piece
of evidence that reinforces the idea of heterogeneous expectations, since it takes differences
among market participants to explain why they trade.” As pointed out by Bryant (1995),
a finding of micro-heterogeneity in unbiasedness and weak efficiency tests also casts doubt
on the assumption of a rational representative agent commonly used in macroeconomic and
asset pricing models.
In order to conduct the systems estimation and testing required for the micro-homogeneity
test, our GMM estimation and inference makes use of an innovative variance-covariance matrix that extends the Keane and Runkle (1990) counterpart from a pooled to an SUR-type
structure. Our variance-covariance matrix takes into account not only serial correlation
and heteroscedasticity at the individual level (via a Newey-West correction), but also forecaster cross-correlation up to h-1 lags. We document the statistical significance of the crosssectional correlation using Pesaran’s (2004) CD test.
27
We find that, for each group, the ability to produce unbiased forecasts deteriorates with
horizon length: no group rejects simple unbiasedness at the one-month horizon, but all groups
reject at the six-month horizon (at less than a 10% level). Exporters consistently perform
worse than the other industry groups, with a tendency toward depreciation bias. Using
only two years of data, Ito (1990) found the same result for exporters, which he described
a a type of “wishful thinking.” We also find that, irrespective of the ability to produce
unbiased forecasts at a given horizon, micro-homogeneity is rejected at virtually a 0% level
for all horizons. We find this result to be somewhat counterintuitive, in light of our prior
belief that micro-homogeneity would be more likely to obtain if there were no rejections of
unbiasedness. Evidently, there is sufficient variation in the estimated bias coefficient across
groups and/or high precision of these estimates to make the micro-homogeneity test quite
sensitive. The unbiasedness results are almost entirely reversed when we test the hypothesis
using the conventional bivariate specification. That is, the joint hypothesis of zero intercept
and unit slope is rejected for all groups at the one-month horizon, but only for exporters and
the three- and six-month horizons. Thus, in stark contrast to the univariate unbiasedness
tests, as well as Ito’s (1990) bivariate tests, forecast performance does not deteriorate with
increases in the horizon: there are no rejections at any horizon.
We hypothesize that at least two factors explain the contrasting results of the univariate
versus bivariate unbiasedness specifications. First, Holden and Peel (1990) have emphasized
that failure to reject the joint test in the bivariate regression is a sufficient, but not necessary,
condition for unbiasedness. This is because the joint hypothesis simultaneously tests weak
efficiency with respect to the current forecast forecast errors as well as unbiasedness. Failure
to reject the univariate test is, however, necessary and sufficient for unbiasedness. Thus,
we conjecture that, in the one-month case, the joint hypothesis fails due to the nonzero
correlation between forecast and forecast error.
This conjecture is borne out by two types of explicit tests for weak efficiency. In the first
type, we regress the forecast error on single and cumulative lags of mean forecast error, mean
28
forecasted depreciation, and actual depreciation. We find many rejections of unbiasedness.
In the second type, we use the Godfrey (1978) LM test for serial correlation of order h
through h+6 in the residuals of the forecast error regression. Remarkably, all but one LM
test at a specified lag length produces a rejection at less than a 10% level, with most at less
than a 5% level.
What, then, explains the failure to reject the bivariate unbiasedness hypothesis at the
longer horizon, where forecasting performance would be expected to deteriorate, as it did
in the univariate case? The key here, we conjecture, is the lack of power of the bivariate
test. Not only can we not reject the (0,1) joint hypothesis, we also cannot reject a (0,0)
hypothesis. Visually, the scatterplot for all three horizons resembles a vertically aligned
ellipse. This is what we would expect in a relatively efficient market: the forecasted change
is nearly unrelated to (and varies much less than) the actual change. At the longer horizons,
the difficulty of forecasting exchange rate changes may more than offset the failure of weak
efficiency with respect to the current forecast.
Finally, we conduct unbiasedness tests on levels to investigate the possibility that the imprecise coefficient estimates in the three- and six- month bivariate regressions are an artifact
of differencing. We use the Liu and Maddala (1992) method of “restricted cointegration,”
i.e., setting the coefficients in the cointegrating vector equal to their hypothesized values of
zero and one, then testing the restricted specification for a unit root in the residuals. We find
that the null of a unit root is rejected for all groups at the one- and three- month horizons,
i.e., the forecast error is stationary. Thus, the inference on simple unbiasedness in Tables
1A and 1B is consistent. However, for all industry groups there is much more evidence for
a unit root in the residuals at the six-month horizon. Thus, the forecast error appears to
be nonstationary. This result by itself rejects unbiasedness at the six-month horizon, and
implies that the inference on unbiasedness in Table 1C is invalid. Interestingly, all tests for
the null of a unit root in the residuals of the unrestricted cointegration regressions fail to
29
reject. This result implies that the bivariate regression in differnces is misspecified, since it
omits an error correction term.
In conclusion, we have attempted to explore the nature of rationality tests on integrated
variables. We compared the results from two common unbiasedness specifications using
differenced variables–univariate and bivariate regressions. We showed that the bivariate
specification not only tests a type of weak efficiency along with unbiasedness, but also has
less power to reject the null than the univariate specification. The latter result may be
related to the unpredictability of price changes in an efficient market.
We also compared unbiasedness tests in differences and levels. All (unrestricted) cointegration tests rejected the null of a unit root in the regression residuals. This points to
another problem with the bivariate regression–misspecification. For all these reasons, we
recommend that simple unbiasedness and weak efficiency tests be undertaken using separate
specifications at the outset (rather than only if the joint test is rejected, as was suggested by
Mincer and Zarnowitz (1969) and Holden and Peel (1990).37 Before conducting such rationality tests, one should test the restricted cointegrated regression residuals, i.e., the forecast
error, for stationarity. Results of such tests presented here cast doubt on the validity of all of
the joint tests, perhaps obviating the need to reconcile the short and long-horizon joint test
results. Clearly, integration accounting and regression specification matter for rationality
testing.
37
Of course, should one wish to undertake a joint test in the bivariate regression, the cointegrating bivariate
regression in levels is an alternative to the misspecified differences regression. A variety of methods, such as
those due to Saikkonen (1991) or Phillips and Hansen (1990), exist that allow for inference in cointegrated
bivariate regressions.
30
st+h − st
b bbbbbbbb
b
b
b
b bbbbbbbbbbbbbbbb bbb
b
b bbb b
b
b
bbb bbbb bbbbbbbbbbbb
bb bbb
b bbbbbbbbbbbbbb
b b bb bb b b bbb
E(st+h − st )
bbbbbbb bbb bbbb bbbbbb
b
b
b
bb bbbbbbb
b b bbbb bbbbb bb
b b b bbbbbbbbbbbbbbbb bb
b b bbb
bbbb bbbbbbbbbbbbbb
b
b
b
b
bbb bb bbb
bbbbbbb b
b bbbbbb
b bb bbbbbbbb bbbb
E(sei,t,h − st )
sei,t,h − st
45o
Figure 1: Unbiasedness with α 6= 0, β 6= 1
st+h − st
0
E(st+h
45o
b bbb
b bbbbbbb
bb
b
b bbbbbbbbbbb
b
b
b bbbb b b
bbbbbbbbbbbbb b
bbb
bbbbbbbbbbbb b
b
b
bbbbbbbbbb
bbbbbbbbbbbbb
bbb bbbb
b b
bbbbbbbbbbb bbb
b
b
b
b
b bb b
− st )
bbbbbbbbbbbbb
b
b
bb bbbbb
bbbb
bbbbbbb
bbb bbbbbbbbb
b bbbbb
bb
b bbbbbbb b
bb
bb
bbb bbbbb
bbbbbbbbb
b
bb
bbb
bbbbbbbbbbb
bb
b
b
b
bbbbb
E(sei,t,h − st )
bbb bbbbbb
0
sei,t,h − st
Figure 2: Unbiasedness with α = 0, β 6= 1
31
st+h − st
E(st+h
b b b bb b
b bbbbbb
bb b
b
b b bb bbbbbb b
b
b
b
b
b
b bbb bb
b b bbb bbbbb bb b
b b bb b
b b b bbb bbbbbb b
b b
b b bbb bbb bbbbb
bbbbb bbbbb bb
b bb b bb b
bb b b
b b bb bbbb b bb
b bbb bbb bbbb b
− st )
b bbbbbbbbb bb b b bb bbbbbbb b bias
b bbbbbbbb bb
b
b bb
bbb bbbbbb bb
b
b
b bbb
b b bbb bb
b bb b bbbb b
b
b b bbbbbb
b bb bbbb b
bbb
bb
E(sei,t,h − st )
45o
sei,t,h − st
Figure 3: Bias with α 6= 0, β = 1
st+h − st
0.1
b
b
b bb
b
bbb bb
bb b b
b bb bbbbbbbbb
bb b b
b bbbbbbbbbbbbbbbbbbbbbb
b
bbbb bbbbbbbbbbbbbbbbbbbbbb
bbbbbbbbbbbbbbb b
bb bbbbbbb b
b b
b bbbbbbbbbbbbbbb b
bbb bb
bbbbb b
bb b bb
b bb b bbb bb
b b bb
bb
bb b
0.05
0
-0.05
-0.1
-0.15
-0.15
-0.1
-0.05
0
0.05
0.1
sei,t,h − st
Figure 4: Actual vs Expected Depreciation, 1 Month-Ahead Forecast
32
st+h − st
0.2
bb
bb
0.15
b
b
bb
bbb b b
b
b bbbbbb bbb b
b bbbbbbbbbbbbbbb
b bb b b
bbb bb bbbbb b
bbb bbbbbbbbbbbbbbbb
b
b bbbbbbbbbbbbbbbbbbbb
b b bb bbbbbbbbbbb
bb
bbb b bb bb
bbbbbbbb bbbb bbb b
b bbbb bbb
bb b bb
b bb b
b bb
b bb
b
0.1
0.05
0
-0.05
-0.1
-0.15
-0.2
-0.2 -0.15 -0.1 -0.05 0 0.05 0.1 e0.15 0.2
si,t,h − st
Figure 5: Actual vs Expected Depreciation, 3 Month-Ahead Forecast
st+h − st
0.3
0.2
b b bbb
bb
b
b bbb b
bb b bb b
b
b b bbbbbbb b
b bbb
b bbb bbb
b bbbbb b
bbbb bbbbb bbbbbbbb b b
b bbbbbbbbbbbbb
bb bbb bbbbbbbbbbb
b b
b bbbbbbbbbbbbbbbbb
b b b bb
b bbbbbbbbbb
b
bb b b bbb
bbbb bbbb b
b bb
bb
0.1
0
-0.1
-0.2
-0.3
-0.3
-0.2
-0.1
0
0.1
0.2 0.3
sei,t,h − st
Figure 6: Actual vs Expected Depreciation: 6 Month-Ahead Forecast
33
0.1
0.08
0.06
0.04
0.02
0
-0.02
-0.04
-0.06
-0.08
sei,t,h − st
st+h − st
-0.1
86/05/14
91/01/16
96/10/15
Figure 7: Actual & Expected Depreciation, 1 Month-Ahead Forecast
0.2
0.15
0.1
0.05
0
-0.05
-0.1
-0.15
sei,t,h − st
st+h − st
-0.2
86/05/14
91/01/16
96/07/30
Figure 8: Actual & Expected Depreciation, 3 Month-Ahead Forecast
34
0.25
0.2
0.15
0.1
0.05
0
-0.05
-0.1
-0.15
-0.2
sei,t,h − st
st+h − st
-0.25
86/05/14
90/11/27
96/05/15
Figure 9: Actual & Expected Depreciation, 6 Month-Ahead Forecast
35
Table 1A: Forecast Error (FE) Tests (1 month forecasts)
Individual regresions
st+h − sei,t,h = αi,h + εi,t,h for h = 2
(9)
degrees of freedom = 261
i=1
i=2
i=3
i=4
(banks & (insurance &
(export
(life ins. &
brokers) trading cos.) industries) import cos.)
αi,h
0.000
0.001
-0.002
0.002
t (NW)
0.115
0.524
-0.809
0.720
MSL(NW)
0.909
0.600
0.418
0.472
MH tests H0 : αi,h = αj , for all i, j 6= i
χ2 (GMM)
41.643
MSL(GMM)
0.000
Table 1B: Forecast Error (FE) Tests (3 month forecasts)
Individual regresions
st+h − sei,t,h = αi,h + εi,t,h for h = 6
(9)
degrees of freedom = 257
i=1
i=2
i=3
i=4
(banks & (insurance &
(export
(life ins. &
brokers) trading cos.) industries) import cos.)
αi,h
-0.010
-0.008
-0.019
-0.010
t (NW)
-1.151
-0.929
-2.165
-1.121
MSL(NW)
0.250
0.353
0.030
0.262
MH tests H0 : αi,h = αj , for all i, j 6= i
χ2 (GMM)
40.160
MSL(GMM)
0.000
36
Table 1C: Forecast Error (FE) Tests (6 month forecasts)
Individual regresions
st+h − sei,t,h = αi,h + εi,t,h for h = 12
(9)
degrees of freedom = 257
i=1
i=2
i=3
i=4
(banks & (insurance &
(export
(life ins. &
brokers) trading cos.) industries) import cos.)
αi,h
-0.031
-0.031
-0.045
-0.034
t (NW)
-1.736
-1.779
-2.441
-1.887
MSL(NW)
0.083
0.075
0.015
0.059
MH tests H0 : αi,h = αj , for all i, j 6= i
χ2 (GMM)
24.426
MSL(GMM)
0.000
37
Table 2A: Unbiasedness Tests (1 month forecasts)
Individual unbiasedness regressions
st+h − st = αi,h + βi,h (sei,t,h − st ) + εi,t,h for h = 2
(16)
degrees of freedom = 260
i=1
i=2
i=3
i=4
(banks & (insurance &
(export
(life ins. &
brokers) trading cos.) industries) import cos.)
αi,h
-0.003
-0.004
-0.007
-0.006
t (NW)
-1.123
-1.428
-2.732
-1.903
MSL(NW)
0.262
0.153
0.006
0.057
βi,h
t (NW)
0.437
1.674
H0 : βi,h = 1, for i = 1, 2, 3, 4
χ2
4.666
MSL
0.031
0.289
1.382
-0.318
-1.237
0.008
-0.038
4.666
0.001
4.666
0.000
4.666
0.000
Unbiasedness Tests: H0 : αi,h = 0, βi,h = 1, for i = 1, 2, 3, 4
χ2 (NW)
4.696
11.682
29.546
19.561
MSL
0.096
0.003
0.000
0.000
MH tests H0 : αi,h = αj , βi,h = βj for all i, j 6= i
χ2 (GMM)
9.689
MSL
0.138
38
Table 2B: Unbiasedness Tests (3 month forecasts)
Individual unbiasedness regressions
st+h − st = αi,h + βi,h (sei,t,h − st ) + εi,t,h for h = 6
(16)
degrees of freedom = 256
i=1
i=2
i=3
i=4
(banks & (insurance &
(export
(life ins. &
brokers) trading cos.) industries) import cos.)
αi,h
-0.013
-0.011
-0.017
-0.014
t (NW)
-1.537
-1.362
-2.060
-1.517
MSL(NW)
0.124
0.173
0.039
0.129
βi,h
t (NW)
0.521
1.268
H0 : βi,h = 1, for i = 1, 2, 3, 4
χ2
1.362
MSL
0.243
0.611
1.868
0.082
0.215
0.484
1.231
1.415
0.234
5.822
0.016
1.728
0.189
Unbiasedness Tests: H0 : αi,h = 0, βi,h = 1, for i = 1, 2, 3, 4
χ2 (NW)
2.946
2.691
11.156
3.023
MSL
0.229
0.260
0.004
0.221
MH tests H0 : αi,h = αj , βi,h = βj for all i, j 6= i
χ2 (GMM)
5.783
MSL
0.448
39
Table 2C: Unbiasedness Tests (6 month forecasts)
Individual unbiasedness regressions
st+h − st = αi,h + βi,h (sei,t,h − st ) + εi,t,h for h = 12
(16)
degrees of freedom = 256
i=1
i=2
i=3
i=4
(banks & (insurance &
(export
(life ins. &
brokers) trading cos.) industries) import cos.)
αi,h
-0.032
-0.032
-0.039
-0.034
t (NW)
-1.879
-1.831
-2.099
-1.957
MSL(NW)
0.060
0.067
0.036
0.050
βi,h
t (NW)
0.413
0.761
H0 : βi,h = 1, for i = 1, 2, 3, 4
χ2
1.166
MSL
0.280
0.822
1.529
0.460
0.911
0.399
-0.168
0.110
0.740
1.147
0.284
1.564
0.211
Unbiasedness Tests: H0 : αi,h = 0, βi,h = 1, for i = 1, 2, 3, 4
χ2 (NW)
4.332
3.5
7.899
5.006
MSL
0.115
0.174
0.019
0.082
MH tests H0 : αi,h = αj , βi,h = βj for all i, j 6= i
χ2 (GMM)
7.071
MSL
0.314
40
Table 3A: Weak Efficiency Tests (1 Month Forecasts)
st+h − sei,t,h = αi,h +
Ph+6
p=h
Banks &
Brokers
Lags
χ2
Pvalue
βi,t+h−p (st+h−p − sem,t+h−p,h ) + ǫi,t,h for h = 2
Ins. &
Trading Cos.
χ2
Export
Industry
χ2
Pvalue
Pvalue
(11)
Life Ins.&
Import Cos.
χ2
Pvalue
Single
2
3
4
5
6
7
8
0.132
-0.914
0.160
0.450
0.046
0.002
0.091
0.895
0.361
0.689
0.502
0.831
0.967
0.763
0.139
1.971
0.006
0.050
0.104
0.282
0.436
0.709
0.160
0.938
0.823
0.747
0.595
0.509
1.871
0.027
1.634
1.749
0.686
0.069
0.022
0.171
0.869
0.201
0.186
0.408
0.793
0.883
0.334
0.186
0.714
1.180
0.188
0.001
0.300
0.563
0.667
0.398
0.277
0.665
0.970
0.584
Cum.
3
4
5
6
7
8
0.765
4.626
4.747
5.501
6.252
5.927
0.682
0.201
0.314
0.358
0.396
0.548
1.778
3.463
4.382
5.592
6.065
5.357
0.411
0.326
0.357
0.348
0.416
0.617
1.746
8.763
7.680
7.652
8.879
8.390
0.418
0.033
0.104
0.176
0.180
0.299
0.585
5.349
5.081
5.768
6.677
6.087
0.746
0.148
0.279
0.329
0.352
0.530
Selected micro-homogeneity tests
H0 : αi,h = αj,h , βi,t+h−p = βj,t+h−p for all i, j 6= i
χ2 (n) Pvalue
n
Single
2
8
122.522
43.338
0.000
0.000
6
6
Cum.
3
8
136.830
201.935
0.000
0.000
9
24
VCV matrix incorporates Newey-West correction for serial correlation
χ2 statistics for mean forecast error regressions (p-value underneath)
Degrees of freedom (n) represent number of regressors, excluding intercept
(n=1 for single lag, n=max. lag -2 for cumulative lags)
41
Table 3B: Weak Efficiency Tests (1 Month Forecasts)
st+h − sei,t,h = αi.h +
Ph+6
p=h
Banks &
Brokers
Lags
χ2
Pvalue
βi,t+h−p (sem,t+h−p,h − st−p ) + ǫi,t,h for h = 2
Ins. &
Trading Cos.
χ2
Export
Industry
χ2
Pvalue
(12)
Life Ins.&
Import Cos.
χ2
Pvalue
Pvalue
Single
2
3
4
5
6
7
8
-2.325
4.482
3.162
3.956
6.368
8.769
5.451
0.020
0.106
0.075
0.047
0.012
0.003
0.020
5.641
3.519
2.580
2.993
4.830
6.786
4.114
0.018
0.061
0.108
0.084
0.028
0.009
0.043
3.658
3.379
2.805
3.102
5.952
7.755
4.417
0.056
0.066
0.094
0.078
0.015
0.005
0.036
7.011
5.877
4.911
7.467
9.766
12.502
7.564
0.008
0.015
0.027
0.006
0.002
0.000
0.006
Cum.
3
4
5
6
7
8
5.592
5.638
5.189
6.025
7.044
10.093
0.061
0.131
0.268
0.304
0.317
0.183
6.138
5.896
4.964
5.068
5.746
8.494
0.046
0.117
0.291
0.408
0.452
0.291
4.116
4.283
3.784
4.847
5.940
7.919
0.128
0.232
0.436
0.435
0.430
0.340
7.508
7.888
8.009
8.401
9.434
12.530
0.023
0.048
0.091
0.136
0.151
0.084
Selected micro-homogeneity tests
H0 : αi,h = αj,h , βi,t+h−p = βj,t+h−p for all i, j 6= i
χ2 (n) Pvalue
n
Single
2
8
40.462
30.739
0.000
0.000
6
6
Cum.
3
8
42.047
46.124
0.000
0.004
6
24
VCV matrix incorporates Newey-West correction for serial correlation
χ2 statistics for mean forecast error regressions (p-value underneath)
Degrees of freedom (n) represent number of regressors, excluding intercept
(n=1 for single lag, n=max. lag -2 for cumulative lags)
42
Table 3C: Weak Efficiency Tests (1 Month Forecasts)
st+h − sei,t,h = αi.h +
Ph+6
p=h
Banks &
Brokers
Lags
χ2
Pvalue
βi,t+h−p (st+h−p − st−p ) + ǫi,t,h for h = 2
Ins. &
Trading Cos.
χ2
Export
Industry
χ2
Pvalue
Pvalue
(13)
Life Ins.&
Import Cos.
χ2
Pvalue
Single
2
3
4
5
6
7
8
-0.328
1.621
0.002
0.086
0.165
0.850
0.597
0.743
0.203
0.964
0.770
0.685
0.357
0.440
0.639
3.060
0.335
0.042
0.916
1.861
1.088
0.424
0.080
0.562
0.837
0.339
0.172
0.297
1.249
0.000
0.819
1.001
0.029
0.329
0.317
0.264
0.993
0.366
0.317
0.864
0.566
0.574
0.023
0.550
0.146
0.344
0.095
1.152
1.280
0.879
0.458
0.702
0.557
0.758
0.283
0.258
Cum.
3
4
5
6
7
8
1.978
3.304
3.781
3.651
4.493
5.619
0.372
0.347
0.436
0.601
0.610
0.585
3.169
3.501
4.248
4.646
5.609
6.907
0.205
0.321
0.373
0.461
0.468
0.439
1.940
5.567
5.806
5.756
6.608
7.907
0.379
0.135
0.214
0.331
0.359
0.341
1.132
3.318
3.598
3.819
5.040
6.521
0.568
0.345
0.463
0.576
0.539
0.480
Selected micro-homogeneity tests
H0 : αi,h = αj,h , βi,t+h−p = βj,t+h−p for all i, j 6= i
χ2 (n) Pvalue
n
Single
2
8
150.698
45.652
0.000
0.000
6
6
Cum.
3
8
161.950
214.970
0.000
0.000
9
24
VCV matrix incorporates Newey-West correction for serial correlation
χ2 statistics for mean forecast error regressions (p-value underneath)
Degrees of freedom (n) represent number of regressors, excluding intercept
(n=1 for single lag, n=max. lag -2 for cumulative lags)
43
Table 3D: LM Test for Serial Correlation (1 Month Forecasts)
H0 : βi,t−h = ... = βi,t−h−6 = 0, for h = 2
P
Ph+6
e
in ǫ̂i,t,h = αi,h + h−1
k=1 βi,k (st+h−k − si,t−k,h ) +
l=h φi,l ǫ̂i,t−l,h + ηi,t,h , (15)
where ǫ is generated from
P
e
st+h − sei,t,h = αi,h + h−1
(14)
k=1 βi,k (st+h−k − si,t−k,h ) + ǫi,t,h
Cum. lags (k)
n−k
2
219
3
205
29.415
0.000
18.339
0.000
F (k, n − k)
p-value
30.952
0.000
Insurance & Trading Cos.
19.506 15.372 11.661 9.695 8.120 7.050
0.000 0.000 0.000 0.000 0.000 0.000
F (k, n − k)
p-value
32.387
0.000
20.691
0.000
29.694
0.000
Life Ins. & Import Cos.
18.606 14.596 11.093 9.586 9.154 7.937
0.000 0.000 0.000 0.000 0.000 0.000
F (k, n − k)
p-value
F (k, n − k)
p-value
4
192
5
179
6
166
7
153
8
144
Banks & Brokers
14.264 11.180 9.699 7.922 6.640
0.000 0.000 0.000 0.000 0.000
Export Industries
16.053 12.951 10.628 9.418 7.520
0.000 0.000 0.000 0.000 0.000
44
Table 4A: Weak Efficiency Tests (3 Month Forecasts)
st+h − sei,t,h = αi,h +
Ph+6
p=h
Banks &
Brokers
Lags
χ2
Pvalue
βi,t+h−p (st+h−p − sem,t+h−p,h ) + ǫi,t,h for h = 6
Ins. &
Trading Cos.
χ2
Export
Industry
χ2
Pvalue
Pvalue
(11)
Life Ins.&
Import Cos.
χ2
Pvalue
Single
6
7
8
9
10
11
12
0.667
0.052
0.006
0.055
0.264
0.299
0.172
0.414
0.820
0.940
0.814
0.607
0.585
0.678
0.954
0.071
0.010
0.043
0.278
0.381
0.336
0.329
0.789
0.921
0.836
0.598
0.537
0.562
4.493
1.434
0.382
0.140
0.001
0.020
0.011
0.034
0.231
0.537
0.708
0.980
0.888
0.918
1.719
0.268
0.001
0.060
0.432
0.598
0.633
0.190
0.605
0.976
0.806
0.511
0.439
0.426
Cum.
7
8
9
10
11
12
8.966
12.288
11.496
8.382
11.596
11.527
0.011
0.006
0.022
0.136
0.072
0.117
11.915
16.263
15.528
12.136
18.128
15.983
0.003
0.001
0.004
0.033
0.006
0.025
19.663
23.290
22.417
16.839
23.782
21.626
0.000
0.000
0.000
0.005
0.001
0.003
12.350
15.146
14.778
12.014
15.330
13.038
0.002
0.002
0.005
0.035
0.032
0.071
Selected micro-homogeneity tests
H0 : αi,h = αj,h , βi,t+h−p = βj,t+h−p for all i, j 6= i
χ2 (n) Pvalue
n
Single
6
188.738
12
63.364
0.000
0.000
6
6
Cum.
7
12
0.000
0.000
9
24
217.574
229.567
VCV matrix incorporates Newey-West correction for serial correlation
χ2 statistics for mean forecast error regressions (p-value underneath)
Degrees of freedom (n) represent number of regressors, excluding intercept
(n=1 for single lag, n=max. lag -2 for cumulative lags)
45
Table 4B: Weak Efficiency Tests (3 Month Forecasts)
st+h − sei,t,h = αi.h +
Ph+6
p=h
Banks &
Brokers
Lags
χ2
Pvalue
βi,t+h−p (sem,t+h−p,h − st−p ) + ǫi,t,h for h = 6
Ins. &
Trading Cos.
χ2
Export
Industry
χ2
Pvalue
Pvalue
(12)
Life Ins.&
Import Cos.
χ2
Pvalue
Single
6
7
8
9
10
11
12
3.457
4.241
5.748
6.073
8.128
8.511
6.275
0.063
0.039
0.017
0.014
0.004
0.004
0.012
2.947
3.834
5.177
5.843
7.868
8.004
6.691
0.086
0.050
0.023
0.016
0.005
0.005
0.010
3.470
4.390
5.410
5.968
7.845
8.308
6.635
0.062
0.036
0.020
0.015
0.005
0.004
0.010
3.681
4.370
6.053
6.474
8.521
8.429
6.079
0.055
0.037
0.014
0.011
0.004
0.004
0.014
Cum.
7
8
9
10
11
12
4.717
5.733
5.195
7.333
8.539
8.758
0.095
0.125
0.268
0.197
0.201
0.271
4.985
5.209
5.411
9.245
6.658
6.747
0.083
0.157
0.248
0.100
0.354
0.456
4.954
5.045
5.112
9.456
7.488
7.796
0.084
0.168
0.276
0.092
0.278
0.351
4.928
6.736
6.053
7.872
7.955
8.698
0.085
0.081
0.195
0.163
0.241
0.275
Selected micro-homogeneity tests
H0 : αi,h = αj,h , βi,t+h−p = βj,t+h−p for all i, j 6= i
χ2 (n) Pvalue
n
Single
6
12
57.130
58.230
0.000
0.000
6
6
Cum.
7
12
63.917
126.560
0.000
0.000
9
24
VCV matrix incorporates Newey-West correction for serial correlation
χ2 statistics for mean forecast error regressions (p-value underneath)
Degrees of freedom (n) represent number of regressors, excluding intercept
(n=1 for single lag, n=max. lag -2 for cumulative lags)
46
Table 4C: Weak Efficiency Tests (3 Month Forecasts)
st+h − sei,t,h = αi.h +
Ph+6
p=h
Banks &
Brokers
Lags
χ2
Pvalue
βi,t+h−p (st+h−p − st−p ) + ǫi,t,h for h = 6
Ins. &
Trading Cos.
χ2
Export
Industry
χ2
Pvalue
Pvalue
(13)
Life Ins.&
Import Cos.
χ2
Pvalue
Single
6
7
8
9
10
11
12
0.268
0.055
0.331
0.513
1.038
1.335
1.184
0.604
0.814
0.565
0.474
0.308
0.248
0.276
0.450
0.037
0.305
0.482
1.077
1.532
1.620
0.502
0.848
0.581
0.488
0.299
0.216
0.203
3.657
0.599
0.029
0.022
0.318
0.563
0.616
0.056
0.439
0.864
0.883
0.573
0.453
0.433
1.065
0.003
0.230
0.577
1.344
1.872
1.979
0.302
0.957
0.632
0.448
0.246
0.171
0.159
Cum.
7
8
9
10
11
12
6.766
8.752
8.654
9.421
9.972
8.581
0.034
0.033
0.070
0.093
0.126
0.284
8.767
11.784
11.588
12.890
13.137
11.823
0.012
0.008
0.021
0.024
0.041
0.107
15.683
18.330
18.929
19.146
19.597
17.670
0.000
0.000
0.001
0.002
0.003
0.014
10.052
11.162
11.309
12.275
13.003
11.431
0.007
0.011
0.023
0.031
0.043
0.121
Selected micro-homogeneity tests
H0 : αi,h = αj,h , βi,t+h−p = βj,t+h−p for all i, j 6= i
χ2 (n) Pvalue
n
Single
6
151.889
12
66.313
0.000
0.000
6
6
Cum.
7
12
0.000
0.000
9
24
164.216
193.021
VCV matrix incorporates Newey-West correction for serial correlation
χ2 statistics for mean forecast error regressions (p-value underneath)
Degrees of freedom (n) represent number of regressors, excluding intercept
(n=1 for single lag, n=max. lag -2 for cumulative lags)
47
Table 4D: LM Test for Serial Correlation (3 Month Forecasts)
H0 : βi,t−h = ... = βi,t−h−6 = 0, for h = 6
P
Ph+6
e
in ǫ̂i,t,h = αi,h + h−1
k=1 βi,k (st+h−k − si,t−k,h ) +
l=h φi,l ǫ̂i,t−l,h + ηi,t,h ,
where ǫ is generated from
P
e
st+h − sei,t,h = αi,h + h−1
k=1 βi,k (st+h−k − si,t−k,h ) + ǫi,t,h
Cum. lags (k)
n−k
F (k, n − k)
p-value
6
126
7
117
3.452
0.003
8
108
9
99
(15)
(14)
10
94
11
89
12
84
2.856
0.009
Banks & Brokers
3.023
2.951
2.599
0.004
0.004
0.008
2.652
0.006
2.921
0.002
2.584
0.007
2.341
0.012
F (k, n − k)
p-value
3.499
0.003
2.850
0.009
Insurance & Trading Cos.
3.408
2.907
2.492
0.002
0.004
0.011
F (k, n − k)
p-value
4.687
0.000
3.956
0.001
Export Industries
4.409
3.572
2.928
0.000
0.001
0.003
2.819
0.003
2.605
0.005
2.482
0.021
Life Ins. & Import Cos.
2.501
2.168
1.866
0.016
0.031
0.060
1.794
0.067
1.811
0.059
F (k, n − k)
p-value
2.352
0.035
48
Table 5A: Weak Efficiency Tests (6 Month Forecasts)
st+h − sei,t,h = αi,h +
Ph+6
p=h
Banks &
Brokers
Lags
χ2
Pvalue
βi,t+h−p (st+h−p − sem,t+h−p,h ) + ǫi,t,h for h = 12
Ins. &
Trading Cos.
χ2
Export
Industry
χ2
Pvalue
Single
12
13
14
15
16
17
18
0.597
0.612
0.811
0.944
1.550
2.192
3.020
0.440
0.434
0.811
0.331
0.213
0.139
0.082
0.787
0.761
0.906
1.017
1.556
2.254
2.920
0.375
0.383
0.341
0.313
0.212
0.133
0.088
Cum.
13
14
15
16
17
18
2.305
5.152
5.294
8.594
12.489
14.596
0.316
0.161
0.258
0.126
0.052
0.042
1.618
5.389
6.576
8.766
12.736
15.091
0.445
0.145
0.160
0.119
0.047
0.035
2.620
2.414
2.556
2.419
3.168
3.771
4.416
Pvalue
101.660
43.721
0.000
0.000
6
6
Cum.
13
18
187.021
449.252
0.000
0.000
9
24
Pvalue
0.344
0.354
0.343
0.331
0.239
0.176
0.128
3.041 0.219
6.472 0.091
7.220 0.125
11.495 0.042
17.611 0.007
21.041 0.004
1.419
3.359
4.565
8.325
10.881
12.861
0.492
0.339
0.335
0.139
0.092
0.076
VCV matrix incorporates Newey-West correction for serial correlation
χ2 statistics for mean forecast error regressions (p-value underneath)
Degrees of freedom (n) represent number of regressors, excluding intercept
(n=1 for single lag, n=max. lag -2 for cumulative lags)
49
χ2
0.894
0.858
0.899
0.945
1.385
1.827
2.318
n
Single
12
18
Life Ins.&
Import Cos.
0.105
0.120
0.110
0.120
0.075
0.052
0.036
Selected micro-homogeneity tests
H0 : αi,h = αj,h , βi,t+h−p = βj,t+h−p for all i, j 6= i
χ2 (n) Pvalue
(11)
Table 5B: Weak Efficiency Tests (6 Month Forecasts)
st+h − sei,t,h = αi.h +
Ph+6
p=h
Banks &
Brokers
Lags
χ2
Pvalue
βi,t+h−p (sem,t+h−p,h − st−p ) + ǫi,t,h for h = 12
Ins. &
Trading Cos.
χ2
Export
Industry
χ2
Pvalue
Pvalue
Single
12
13
14
15
16
17
18
3.022
2.107
1.637
0.985
1.550
0.232
0.023
0.082
0.147
0.201
0.321
0.213
0.630
0.881
3.109
2.380
1.989
1.391
0.973
0.530
0.168
0.078
0.123
0.158
0.238
0.324
0.467
0.682
3.601
3.196
2.665
1.894
1.577
1.068
0.471
0.058
0.074
0.103
0.169
0.209
0.301
0.492
Cum.
13
14
15
16
17
18
4.364
9.598
10.787
8.594
15.909
13.066
0.013
0.022
0.029
0.126
0.014
0.071
6.684
8.138
8.641
8.971
11.663
13.804
0.035
0.043
0.071
0.110
0.070
0.055
6.140
8.395
9.152
10.073
12.624
14.644
0.046
0.039
0.057
0.073
0.049
0.041
Selected micro-homogeneity tests
H0 : αi,h = αj,h , βi,t+h−p = βj,t+h−p for all i, j 6= i
χ2 (n)
Pvalue
n
Single
12
18
38.198
55.915
0.000
6
6
Cum.
13
18
46.234
98.324
0.000
0.000
9
24
VCV matrix incorporates Newey-West correction for serial correlation
χ2 statistics for mean forecast error regressions (p-value underneath)
Degrees of freedom (n) represent number of regressors, excluding intercept
(n=1 for single lag, n=max. lag -2 for cumulative lags)
50
(12)
Life Ins.&
Import Cos.
χ2
2.830
2.131
1.700
1.064
0.710
0.387
0.109
Pvalue
0.093
0.144
0.192
0.302
0.198
0.534
0.741
7.389 0.025
8.839 0.032
10.537 0.032
10.917 0.053
15.762 0.015
12.714 0.014
Table 5C: Weak Efficiency Tests (6 Month Forecasts)
st+h − sei,t,h = αi.h +
Ph+6
p=h
Banks &
Brokers
Lags
χ2
Pvalue
βi,t+h−p (st+h−p − st−p ) + ǫi,t,h for h = 12
Ins. &
Trading Cos.
χ2
Export
Industry
χ2
Pvalue
Pvalue
(13)
Life Ins.&
Import Cos.
χ2
Pvalue
Single
12
13
14
15
16
17
18
0.103
0.149
0.292
0.654
0.575
2.519
4.130
0.748
0.699
0.589
0.419
0.448
0.113
0.042
0.198
0.203
0.319
0.625
1.136
2.243
3.536
0.656
0.652
0.572
0.429
0.286
0.134
0.060
1.231
1.131
1.310
1.794
2.398
3.422
4.705
0.267
0.287
0.252
0.180
0.121
0.064
0.030
0.256
0.250
0.326
0.585
1.042
1.865
2.778
0.613
0.617
0.568
0.444
0.307
0.172
0.096
Cum.
13
14
15
16
17
18
1.445
4.226
7.182
9.752
11.554
12.322
0.486
0.238
0.127
0.083
0.073
0.090
0.548
4.048
6.226
7.180
11.194
11.062
0.761
0.256
0.183
0.208
0.083
0.136
1.140
5.345
7.801
8.608
13.429
14.223
0.566
0.148
0.099
0.126
0.037
0.047
0.531
2.398
5.732
6.435
7.984
7.741
0.767
0.494
0.220
0.266
0.239
0.356
Selected micro-homogeneity tests
H0 : αi,h = αj,h , βi,t+h−p = βj,t+h−p for all i, j 6= i
χ2 (n) Pvalue
n
Single
12
18
113.567
37.223
0.000
0.000
6
6
Cum.
13
18
260.006
409.126
0.000
0.000
9
24
VCV matrix incorporates Newey-West correction for serial correlation
χ2 statistics for mean forecast error regressions (p-value underneath)
Degrees of freedom (n) represent number of regressors, excluding intercept
(n=1 for single lag, n=max. lag -2 for cumulative lags)
51
Table 5D: LM Test for Serial Correlation (6 Month Forecasts)
H0 : βi,t−h = ... = βi,t−h−6 = 0, for h = 12
P
Ph+6
e
in ǫ̂i,t,h = αi,h + h−1
k=1 βi,k (st+h−k − si,t−k,h ) +
l=h φi,l ǫ̂i,t−l,h + ηi,t,h ,
where ǫ is generated from
P
e
st+h − sei,t,h = αi,h + h−1
k=1 βi,k (st+h−k − si,t−k,h ) + ǫi,t,h
Cum. lags (k)
n−k
12
54
13
52
2.798
0.000
3.618
0.000
F (k, n − k)
p-value
2.321
0.018
Insurance & Trading Cos.
2.446
2.112
2.243
1.667
0.011
0.027
0.018
0.089
F (k, n − k)
p-value
2.747
0.006
3.134
0.002
1.687
0.096
F (k, n − k)
p-value
F (k, n − k)
p-value
14
50
15
48
(15)
(14)
16
46
17
44
18
42
2.578
0.006
2.671
0.005
2.762
0.003
1.674
0.086
1.672
0.085
Export Industries
2.685
2.709
1.818
0.005
0.005
0.052
1.844
0.052
1.871
0.048
Life Ins. & Import Cos.
2.068
0.393
2.739
1.749
0.033
0.971
0.004
0.070
1.839
0.053
1.987
0.034
Banks & Brokers
3.462
3.926
0.001
0.000
52
Table 6A: Unit Root Tests (1 Month Forecasts)
∆yt = α + βyt−1 + γt +
Pp
k=1 θt ∆yt−k
Log of spot rate (n=276)
H th difference of log of spot rate
+ ǫt for h = 2
(21)
Lags
ADF t test
ADF z test
Joint test
0
12
-2.828
-4.306***
-9.660
-167.473***
6.820**
9.311***
Group 1 (Banks & brokers)
Log of forecast
H th difference of log of forecast
Forecast error
Restriced CI eq.
Unrestriced CI eq.
0
0
-2.274*
-7.752***
-5.072
-96.639***
6.327**
30.085***
1
1
-11.325***
65.581***
-249.389***
64.676***
Group 2 (Insurance & trading cos.)
Log of forecast
H th difference of log of forecast
Forecast error
Restriced CI eq.
Unrestriced CI eq.
0
0
-2.735*
-7.895***
-5.149
-94.986***
6.705***
31.252***
1
1
-11.624***
-11.750***
-270.302***
68.053***
Group 3 (Export industries)
Log of forecast
H th difference of log of forecast
Forecast error
Restriced CI eq.
Unrestriced CI eq.
1
0
-2.372
-8.346***
-4.806
-111.632***
5.045**
34.889***
1
1
-10.324***
-10.392***
-211.475***
53.757***
Group 4 (Life ins. & import cos.)
Log of forecast
H th difference of log of forecast
Forecast error
Restriced CI eq.
Unrestriced CI eq.
* rejection at 10% level
** rejection at 5% level
*** rejection at 1% level
0
1
-2.726*
-5.216***
-5.009
-52.911***
6.438**
3.630***
1
1
-10.977***
-10.979***
-231.837***
60.820***
53
Table 6B: Unit Root Tests (3 Month Forecasts)
∆yt = α + βyt−1 + γt +
Pp
k=1 θt ∆yt−k
Log of spot rate (n=276)
H th difference of log of spot rate
+ ǫt for h = 6
Lags
0
2
ADF t test
-2.828
-4.760***
(21)
ADF z test
Joint test
-9.660
-49.769***
6.820**
11.351***
Group 1 (Banks & brokers)
Log of forecast
H th difference of log of forecast
Forecast error
Restriced CI eq.
Unrestriced CI eq.
0 -2.840*
0 -5.092***
-4.852
-48.707***
7.610***
12.990***
6
6
-29.429***
4.673**
-3.022**
-3.343*
Group 2 (Insurance & trading cos.)
Log of forecast
H th difference of log of forecast
Forecast error
Restriced CI eq.
Unrestriced CI eq.
0 -2.778*
0 -6.514***
-4.533
-71.931***
8.858***
21.588***
6
2
-31.038***
4.956**
-3.068**
-4.539***
Group 3 (Export industries)
Log of forecast
H th difference of log of forecast
Forecast error
Restriced CI eq.
Unrestriced CI eq.
0 -3.105**
1 -4.677***
-4.549
-41.524***
9.090***
10.944***
6
5
-31.207***
5.659**
-3.317**
-5.115***
Group 4 (Life ins. & import cos.)
Log of forecast
H th difference of log of forecast
Forecast error
Restriced CI eq.
Unrestriced CI eq.
* rejection at 10% level
** rejection at 5% level
*** rejection at 1% level
0 -2.863*
1 -4.324***
5
4
54
-4.825***
-5.123***
-4.400
-39.870***
8.161***
9.352***
-118.586***
11.679***
Table 6C: Unit Root Tests (6 Month Forecasts)
∆yt = α + βyt−1 + γt +
Pp
k=1 θt ∆yt−k
Log of spot rate (n=276)
H th difference of log of spot rate
+ ǫt for h = 12
(21)
Lags ADF t test
ADF z test
Joint test
0 -2.828
17 -3.189**
-9.660
-26.210***
6.820**
5.500**
Group 1 (Banks & brokers)
Log of forecast
H th difference of log of forecast
Forecast error
Restriced CI eq.
Unrestriced CI eq.
0 -2.947**
0 -4.772***
-4.254
-44.018***
1 -2.373
7 -3.285**
-13.577*
9.131***
11.389***
2.947
Group 2 (Insurance & trading cos.)
Log of forecast
H th difference of log of forecast
Forecast error
Restriced CI eq.
Unrestriced CI eq.
0 -2.933**
0 -6.007***
-4.004
-64.923***
1 -2.114
1 -2.684*
-11.464*
9.531***
18.044***
2.399
Group 3 (Export industries)
Log of forecast
H th difference of log of forecast
Forecast error
Restriced CI eq.
Unrestriced CI eq.
0 -3.246**
12 -4.961***
12 -1.515
0 -2.931
-4.059
-44.532***
-5.601
10.704***
12.331***
1.466
Group 4 (Life ins. & import cos.)
Log of forecast
H th difference of log of forecast
Forecast error
Restriced CI eq.
Unrestriced CI eq.
* rejection at 10% level
** rejection at 5% level
*** rejection at 1% level
0 -3.133**
0 -4.795***
-4.196
-44.062***
2 -2.508
1 -2.851*
-14.535**
55
9.549***
11.537***
3.148
Table 7: CD Tests for Contemporaneous Cross-Sectional
Dependendence of Forecast Errors
st+h − sei,t,h = αi.h + ǫi,t,h
(9)
Horizon (h)
q
CD
1 month
MSL
37.945
0.000
3 months
MSL
31.272
0.000
6 months
MSL
25.571
0.000
a
N
N −1
2T
CD = N (N
j=1+1 ρ̂ij ∼ N (0, 1)
i=1
−1)
N = 24, T = 276, ρ̂ij is the sample correlation coefficient
between forecaster i and j, i 6= j.
P
P
56
References
Alain Abou and Georges Prat. Modelling stock price expectations: lessons from microdata.
In Francois Gardes and Georges Prat, editors, Price Expectations inGoods and Financial
Markets, chapter 12, pages 271–294. Edward Elgar (Cheltenham, UK), 2000.
Roy Batchelor and Pami Dua. Forecaster ideology, forecasting technique, and the accuracy
of economic forecasts. International Journal of Forecasting, 6:3–10, 1990a.
Roy Batchelor and Pami Dua. Product differentiation in the economic forecasting industry.
International Journal of Forecasting, 6:311–316, 1990b.
Roy Batchelor and Pami Dua. Conservatism and consensus-seeking among economic forecasters. Journal of Forecasting, 11:169–181, 1992.
Roy Batchelor and David A. Peel. Rationality testing under asymmetric loss. Economics
Letters, 61:49–54, 1998.
Agnes Benassy-Quere, Sophie Larribeau, and Ronald MacDonald. Models of exchange rate
expectations: How much heterogeneity? Journal of International Financial Markets,
Institutions and Money, 2003.
Carl S. Bonham and Richard H. Cohen. To aggregate, pool, or neither: Testing the rationalexpectations hypothesis using survey data. Journal of Business & Economic Statstics, 19
(3):278–291, July 2001.
Carl S. Bonham and Richard H. Cohen. Testing the rationality of price forecasts: Comment.
American Economic Review, 85(1):284–289, March 1995.
Paul Boothe and Debra Glassman. Comparing exchange rate forecasting models. International Journal of Forecasting, 3:65–79, 1987.
T. S. Breusch. Testing for auitocorrelation in dynamic linear models. Australian Economic
Papers, 17:334–355, 1978.
T. S. Breusch and A. R. Pagan. The lagrange multiplier test and its application to model
specifications in econometrics. Review of Economic Studies, 47:239–253, 1980.
Ralph C. Bryant. The ”exchange risk premium,” uncovered interest parity, and the treatment
of exchange rates in multicountry macroeconomic models. Technical report, The Brookings
Institution, January 1995.
Stefano M. F. G. Cavaglia, Willem F. C. Verschoor, and Christian C. P. Wolff. On the
biasedness of forward foreign exchange rates: Irrationality or risk premia? Journal of
Business, 67(3):321–343, July 1994.
Dionysios Chionis and Ronald MacDonald. Some tests of market microstructure hypotheses
in the foreign exchange market. Journal of Multinational Financial Management, 7:203–
229, 1997.
57
A. Davies and K. Lahiri. A new framework for testing rationality and measuring aggregate
shocks using panel data. Journal of Econometrics, 68:205–227, 1995.
D. A. Dickey and W. A. Fuller. Distribution of the estimators for autoregressive time series
with a unit root. Journal of the American Statistical Association, 74:427–31, 1979.
Tilman Ehrbeck and Robert Waldmann. Why are professional forecasters biased? agency
versus behavioral explanations. The Quarterly Journal of Economics, 111(1):21–40, February 1996.
Graham Elliott and Takatoshi Ito. Heterogeneous expectations and tests of efficiency in the
yen/dollar forward exchange rate market. Journal of Monetary Economics, 43:435–456,
1999.
Robert Engle and Clive W. S. Granger. Cointegration and error correction representation,
estimation and testing. Econometrica, 55:251–276, March 1987.
Eugene Fama. Forward and spot exchange rates. Journal of Monetary Economics, 14:
319–338, 1984.
Stephen Figlewski. Market ”efficiency” in a market with heterogeneous information. Journal
of Political Economy, 86(4):581–597, August 1978.
Stephen Figlewski. Information diversity and market behavior. Journal of Finance, 37(1):
87–102, March 1982.
Stephen Figlewski. Information diversity and market behavior: A reply. Journal of Finance,
39(1):299–302, March 1984.
Stephen Figlewski and Paul Wachtel. Rational expectations, informational efficiency, and
tests using survey data: A reply. Review of Economics and Statistics, 65:529–531, 1983.
Jeffey A. Frankel and Kenneth A. Froot. Chartists, fundamentalists, and trading in the
foreign exchange market. American Economic Review, 80(2):181–185, May 1990.
Jeffrey A. Frankel and Kenneth A. Froot. Using survey data to test standard propositions
regarding exchange rate expectations. American Economic Review, 77(1):133–153, March
1987.
Kenneth A. Froot and Jeffrey A. Frankel. Forward discount bias: Is it an exchange risk
premium? Quarterly Journal of Economics, 104(1):139–161, February 1989.
L. G. Godfrey. Testing for higher order serial correlation in regression equations when the
regressors include lagged dependent variables. Econometrica, 46:1303–1310, 1978.
Michael D. Goldberg and Roman Frydman. Imperfect knowledge and behaviour in the
foreign exchange market. Economic Journal, 106(437):869–893, July 1996.
58
Craig S. Hakkio and Mark Rush. Market efficiency and cointegration: an application to the
sterling deutschemark exchange markets. Journal of International Money and Finance, 8:
75–88, 1989.
John C. Haltiwanger and Michael Waldman. Rational expectations in the aggregate. Economic Inquiry, XXVII:619–636, 1989.
K. Holden and D. A. Peel. On testing for unbiasedness and efficiency of forecasts. Manchester
School, LVIII(2):120–127, June 1990.
Takatoshi Ito. Foreign exchange rate expectations: Micro survey data. American Economic
Review, 80(3):434–449, June 1990.
Takatoshi Ito. Short-run and long-run expectations of the yen/dollar exchange rate. Journal
of the Japanese and International Economies, 8:119–143, 1994.
Michael P. Keane and David E. Runkle. Testing the rationality of price forecasts: New
evidence from panel data. American Economic Review, 80(4):714–735, September 1990.
Alan P. Kirman. Whom or what does the representative individual represent? Journal of
Economic Perspectives, 6(2):117–136, Spring 1992.
J. F. Kiviet. Testing Linear Econometric Models. University of Amsterdam, 1987.
Kon Lai. An evaluation of survey exchange rate forecasts. Economics Letters, 32:61–65,
1990.
Owen Lamont. Macroeconomic forecass and microeconomic forecasters. Journal of Economic
Behavior and Organization, 48(3):265–280, 2002.
David Laster, Paul Bennett, and In Sun Geoum. Rational bias in macroeconomic forecasts.
Quarterly Journal of Economics, pages 293–318, February 1999.
Blake LeBaron. Technical trading profitability in foreign exchange markets in the 1990’s.
Technical report, Brandeis U., July 2000.
Gordon Leitch and J. Ernest Tanner. Economic forecast evaluation: Profits versus the
conventional error measures. The American Economic Review, 81(3):580–590, June 1991.
Karen K. Lewis. Puzzles in international financial markets. In G. Grossman and K. Rogoff,
editors, Handbook of International Economics, volume III, chapter 37, pages 1913–1971.
Elsevier Science, 1995.
Peter Liu and G. S. Maddala. Rationality of survey data and tests for market efficiency in
the foreign exchange markets. Journal of International Money and Finance, 11:366–381,
1992.
Richard K. Lyons. Foreign exchange: Macro puzzles, micro tools. Economic Review, Federal
Reserve Bank of San Francisco, 2002.
59
Ronald MacDonald. Exchange rate survey data: A disaggregated g-7 perspective. The
Manchester School Vol LX Supplement, pages 47–62, June 1992.
Ronald MacDonald and Ian W. Marsh. Currency forecasters are heterogeneous: confirmation
and consequences. Journal of International Money and Finance, 15(5):665–685, 1996.
J. Mincer and V. Zarnowitz. The evaluation of economic forecasts. In Economic Forecasts
and Expectations. New York: National Bureau of Economic Research, 1969.
John Muth. Rational expectations and the theory of price movements. Econometrica, 29:
315–335, 1961.
W. Newey and K. West. A simple positive semi-definite heteroscedasticity and autocorrelation consistent covariance matrix. Econometrica, 55:703–708, May 1987.
W. D. Nordhaus. Forecasting efficiency: concepts and applications. Review of Economics
and Statistics, 69:667–674, 1987.
M. Hashem Pesaran. General diagnostic tests for cross section dependence in panels. URL
http://www.econ.cam.ac.uk/faculty/pesaran/wp.htm#gdt. June 2004.
Peter Phillips and Bruce E. Hansen. Statistical inference in instrumental variables regression
with i(1) processes. Review of Economic Studies, 57:99–125, 1990.
Keith Pilbeam. Exchange rate models and exchange rate expectations: an empirical investigation. Applied Economics, 27:1009–1015, 1995.
Georges Prat and Remzi Uctum. The evidence of a mixed expectation generating process
in the foreign exchange market. In Francois Gardes and Georges Prat, editors, Price
Expectations in Goods and Financial Markets, chapter 11, pages 251–270. Edward Elgar
(Cheltenham, UK), 2000.
Pentti Saikkonen. Asymptotically efficient estimation of cointegration regressions. Econometric Theory, 7:1–21, 1991.
Lucio Sarno and Mark P. Taylor. The Economics of Exchange Rates. Cambridge U. Press,
2002.
Alan C. Stockman. Economic theory and exchange rate forecasts. International Journal of
Forecasting, 3:3–15, 1987.
Henri Theil. Applied Economic Forecasting. North-Holland, Amsterdam, 1966.
Victor Zarnowitz. Rational expectations and macroeconomic forecasts. Journal of Business
& Economic Statistics, 3:293–311, 1985.
Arnold Zellner. Biased predictors, rationality and the evaluation of forecasts. Economics
Letters, 21:45–48, 1986.
60
Appendix 1: Data Description and Variable Definitions
Every two weeks, the JCIF in Tokyo conducts telephone surveys of yen/dollar exchange
rate expectations from 44 firms. The forecasts are for the future spot rate at horizons
of one month, three months, and six months. Our data cover the period May 1985 to
March 1996. This data set has very few missing observations, making it close to a true
panel. For reporting purposes, the JCIF currently groups individual firms into four industry
categories: 1) banks and brokers, 2) insurance and trading companies, 3) exporters, and 4)
life insurance companies and importers. On the day after the survey, the JCIF announces
overall and industry average forecasts. (For further details concerning the JCIF database,
see the descriptions in Ito (1990, 1994), Bryant (1995), and Elliott and Ito (1999).)
61
Appendix 2: Testing Micro-homogeneity with Survey Forecasts
The null hypothesis of micro-homogeneity is that the slope and intercept coefficients in the
equation of interest are equal across individuals. This paper considers the case of individual
unbiasedness regressions such as equation (16) in the text, repeated here for convenience.
st+h − st = αi,h + βi,h (sei,t,h − st ) + εi,t,h
(A2.1)
and tests H0 : α1 = α2 = . . . = αN , and β1 = β2 = . . . = βN .
Stack all N individual regressions into the Seemingly Unrelated Regression system
S = Fθ + ε
(A2.2)
where S is the N T × 1 stacked vector of realizations, st+h , and F is an N T × 2N block
diagonal data matrix

F1

F=


...
FN

.

(A2.3)
Each Fi = [ι sei,t,h ] is a T ×2 matrix of ones and individual i’s forecasts, θ = [α1 β1 . . . αN βN ]′ ,
and ε is an N T × 1 vector of stacked residuals. The vector of restrictions, Rθ = r,
corresponding to the null hypothesis of micro-homogeneity is normally distributed, with
Rθ − r ∼ N [0, R(F′ F)−1 F′ ΩF(F′ F)−1 R′ ], where R is the 2(N − 1) × 2N matrix



R=



1
0
−1
0
...

0
.. 
. 

0 1
0 −1 0
,
..
... ... ...

0 
. 0
0 ... 0
1
0 −1
(A2.4)
and r is a 2(N − 1) × 1 vector of zeros. The corresponding Wald test statistic,
(Rθ̂ − r)′ [R(F′ F)−1 F′ Ω̂F(F′ F)−1 R′ ](Rθ̂ − r), is asymptotically distributed as a chi-square
random variable with degrees of freedom equal to the number of restrictions, 2(N − 1).
For most surveys, there are a large number of missing observations. Keane and Runkle
(1990), Davies and Lahiri (1995), Bonham and Cohen (1995, 2001), and to the best of our
knowledge all other papers which make use of pooled regressions in tests of the REH have
dealt with the missing observations using the same approach. The pooled or individual regression is estimated by eliminating the missing data points in both the forecasts and the
realization. The regression residuals are then padded with zeros in place of missing observations to allow for the calculation of own and cross-covariances. As a result, many individual
variances and cross-covariances are calculated with relatively few pairs of residuals. These
individual cross-covariances are then averaged. In Keane and Runkle (1990) and Bonham
and Cohen (1995, 2001) the assumption of 2(k + 1) second moments, which are common
to all forecasters, is made for analytical tractability and for increased reliability. In contrast to the forecasts from the Survey of Professional Forecasters used in Keane and Runkle
(1990) and Bonham and Cohen (1995, 2001), the JCIF data set contains virtually no missing observations. As a result, it is possible to estimate each individual’s variance covariance
62
matrix (and cross-covariance matrix) rather than average over all individual variances and
cross-covariance pairs as in the aforementioned papers.
We assume that for each forecast group i,
2
E[εi,t,h εi,t,h ] = σi,0
for all i, t,
2
E[εi,t,h εi,t+k ] = σi,k for all i, t, k such that 0 < k ≤ h,
E[εi,t,h εi,t+k ] = 0 for all i, t, k such that k > h,
(A2.5)
Similarly, for each pair of forecasters i and j we assume
E[εi,t,h εj,t ] = δi,j (0) ∀i, j, t,
E[εi,t,h εj,t+k ] = δi,j (k) ∀i, j, t, k such that k 6= 0, and −h ≤ k ≤ h.
E[εi,t,h εj,t+k ] =
0
∀i, j, t, k such that k > |h|.
(A2.6)
Thus, each pair of forecasters has a different T × T cross-covariance matrix,

Pi,j




=



δi,j (0) δi,j (−1)
...
δi,j (−h)
0
δi,j (1) δi,j (0) δi,j (−1)
...
0
..
..
...
...
...
.
.
...
δi,j (1)
δi,j (0) δi,j (−1)
0
δi,j (h)
...
δi,j (1)
δi,j (0)





,



(A2.7)
′
Finally, note that Pi,j 6= Pj,i , rather Pi,j
= Pj,i . The complete variance-covariance matrix,
denoted Ω, has dimension N T × N T , with matrices Qi on the main diagonal and Pi,j off
the diagonal.
The individual Qi , variance covariances matrices are calculated using the Newey and West
(1987) heteroscedasticity-consistent, MA(j) corrected form. The Pi,j matrices are estimated
in an analogous manner,
63