Journal of Forecasting J. Forecast. (2011) Published online in Wiley Online Library (wileyonlinelibrary.com) DOI: 10.1002/for.1231 Price–Dividend Ratios and Stock Price Predictability JYH-LIN WU1,2* AND YU-HAU HU3 1 Institute of Economics, National Sun Yat-Sen University, Kaohsiung, Taiwan 2 Department of Economics, National Chung-Cheng University, Chia-Yi, Taiwan 3 Department of International Trade, Cheng-Shiu Technological University, Kaohsiung, Taiwan ABSTRACT A long-standing puzzle to financial economists is the difficulty of outperforming the benchmark random walk model in out-of-sample contests. Using data from the USA over the period of 1872–2007, this paper re-examines the out-ofsample predictability of real stock prices based on price–dividend (PD) ratios. The current research focuses on the significance of the time-varying mean and nonlinear dynamics of PD ratios in the empirical analysis. Empirical results support the proposed nonlinear model of the PD ratio and the stationarity of the trend-adjusted PD ratio. Furthermore, this paper rejects the non-predictability hypothesis of stock prices statistically based on in- and out-of-sample tests and economically based on the criteria of expected real return per unit of risk. Copyright © 2011 John Wiley & Sons, Ltd. KEY WORDS price–dividend ratios; time-varying means; out-of-sample predictability; random walks; long-horizon regression tests, economic significance INTRODUCTION Since the seminal papers by Campbell and Shiller (1988) and Fama and French (1988), the predictability of stock returns has become a central research topic in empirical financial economics over the past two decades. The predictability of stock returns is typically analyzed based on a long-horizon predictive regression equation with a price–dividend (PD) ratio as a regressor.1 This is because a stationary PD ratio suggests predictable movements of stock prices. Almost all of the empirical evidence related to return or stock price predictability has relied on long-horizon regression equations, and the evidence with regard to predictability is interpreted based on the significance of t- or F -statistics and high R2 values. Based on in-sample tests, several researchers find that the PD ratio * Correspondence to: Jyh-Lin Wu, Institute of Economics, National Sun Yat-Sen University, Kaohsiung, Taiwan. E-mail: [email protected] 1 Useful survey papers offered by Campbell (2000) and Campbell et al. (1997) provide detailed discussions. Copyright © 2011 John Wiley & Sons, Ltd. J.-L. Wu and Y.-H. Hu has little forecasting power for stock price changes when the forecast horizon is 1 year, but its forecasting power increases significantly when forecast horizons are long (Campbell and Shiller, 1998, 2001).2 However, Campbell and Yogo (2006) and Torous et al. (2004) point out that the conventional t-test is incorrect when regressors in the prediction equation are nearly integrated. After adopting the framework of local-to-unity, they find evidence of in-sample return predictability. On the other hand, there is no convincing evidence for out-of-sample predictability of stock price movement in the literature (Goyal and Welch, 2003; Lettau and Van Nieuwerburgh, 2008; Welch and Goyal, 2008). Avramov (2002) find that Bayesian methodology reveals the existence of in- and out-of-sample predictability of stock returns; however, their results also indicate that dividend yields have relatively small posterior probabilities of being correlated with future returns. The above results give rise to the puzzle of why PD ratios do not provide a strong and stable predictive power for future movements of stock prices in out-of-sample prediction tests. A key assumption embedded in conventional long-horizon regressions is the stability of the PD ratio, which implies that the means of the ratio are relatively unchanged over time. Figure 1 plots the US historical PD ratio over the relevant sample period, and reveals that in this 136-year period PD ratios fluctuate around 2.97 before 1955. There is an unprecedented rise in the PD ratio from 1955 to 1982 and another rise from 1982 all the way until 2000, after which the PD ratio goes down slightly. The average PD ratio is 2.97 from 1872 to 1954, 3.32 from 1955 to 1981, and 3.71 from 1982 to 2007. The mean of the PD ratio varies in the above three periods and there appears to have been an upward trend in the mean. Several papers find significant structural changes in PD ratios and predictive regression models (Rapach and Wohar, 2005b; Coakley and Fuertes, 2006; Paye and Timmermann, 2006; Lettau and Van Nieuwerburgh, 2008). In such a case, out-of-sample prediction of stock price changes is difficult for investors since they need to estimate a break date and the new mean of the PD ratio if a new break is detected (Lettau and Van Nieuwerburgh, 2008). Instead of focusing on discrete structural changes, this paper argues that the long-run equilibrium mean of the ratio is time varying and hence the PD ratio reverts to its time-varying mean. Moreover, this work focuses on the nonlinear adjustments of the ratio toward its time-varying mean. The current research adopts an exponential smooth transition autoregressive (ESTAR) model to describe the nonlinear dynamics of PD ratios and a nonlinear deterministic trend function to model the timevarying mean of the ratio. The above specification on the dynamics of PD ratios is motivated by the view that both fundamental and non-fundamental components affect stock prices. The existence of a time-varying mean of PD ratios renders a long-horizon prediction equation unstable if such behavior of the mean is not taken into account and could be one of the reasons for finding poor in- and out-of-sample predictability in stock prices. If our argument of a time-varying mean is correct then we expect to find evidence for stock price predictability at different forecast horizons based on bootstrapped critical values constructed from an appropriate representation of data. This paper therefore performs in- and out-of-sample tests to investigate the predictability of stock returns statistically and economically. Based on data from the USA over the period of 1872–2007, this paper obtained several results of interest. First, an ESTAR model with a time-varying mean well describes the dynamics of the PD ratio and the PD ratio could be a unit root process whereas the trend-adjusted PD ratio is stationary. The 2 The weakness of the PD ratio as a predictor may be due to its exclusion of non-dividend cash flows. Robertson and Wright (2006) show that a new cash-flow yield including both dividend and non-dividend cash flows provides a strong and stable in-sample predictive power for aggregate returns. Copyright © 2011 John Wiley & Sons, Ltd. J. Forecast. (2011) DOI: 10.1002/for Stock Price Predictability 4.5 4.0 3.5 3.0 2.5 2.0 1880 1890 1900 1910 1920 1930 1940 1950 1960 1970 1980 1990 2000 Figure 1. PD ratios (—) and simulated PD ratios (---) generated based on a non-parametric bootstrap above results indicate that modeling the impacts of fundamental and non-fundamental components in stock prices is crucial in empirical analysis. Second, assuming a constant mean in the conventional linear dynamics of PD ratios provides no evidence of stock price predictability based on in- and outof-sample bootstrap tests. Furthermore, the local-to-unity framework of Campbell and Yogo (2006) fails to reject the hypothesis of no in-sample predictability of stock returns, which is consistent with our results from the bootstrap t-test. Third, imposing the time-varying mean in ESTAR dynamics of the PD ratio, our bootstrap tests provide strong in- and out-of-sample evidence of stock price predictability. The above results of beating the random walk forecast model are crucial since Lettau and Van Nieuwerburgh (2008), Welch and Goyal (2008) and Goyal and Welch (2003) fail to beat the random walk model in their out-of-sample contests. Our findings reconcile inconsistent results of in- and out-of-sample predictive tests in extant literature. Fourth, applying the criteria of expected return per unit of risk, the economic significance of the proposed nonlinear model is supported. Fifth, our bootstrap tests achieve reasonable power, but suffer a slight size distortion for in-sample statistics when forecast horizons are less than 3 years. Nevertheless, our conclusions on stock price predictability are not affected by the slight size distortion. The next section provides a nonlinear ESTAR model with a time-varying mean to describe the dynamics of PD ratios and discuss its properties. The third section discusses in- and out-of-sample bootstrap tests of a competitive model and the random walk with drift, respectively. The fourth section reports empirical results and the fifth section examines the economic significance of models under consideration. The sixth section examines the power and size of the bootstrap tests. Finally, the seventh section concludes our discussion. THE NONLINEAR ESTAR MODEL OF PD RATIOS This paper argues that there exist both short-run nonlinear fluctuations and long-run persistent movements in PD ratios. Theoretically, the long-run value of the PD ratio (the mean of the PD ratio) is Copyright © 2011 John Wiley & Sons, Ltd. J. Forecast. (2011) DOI: 10.1002/for J.-L. Wu and Y.-H. Hu affected by the mean growth rate of real dividends and the mean real discount rate as discussed in Poterba and Summers (1986). Facing permanent regime switches in stock markets, investors will continuously revise their expectations of the mean of the previous two fundamental factors resulting in persistent movements in the long-run PD ratio.3 Several permanent fundamental changes are usually offered as explanations for the upward movement of the mean growth rate of real dividends or the downward movement of the mean real discount rate. These include increased expected future economic growth brought about by the information revolution (Hobijn and Jovanovic, 2001); institutional changes such as shifting payout policies away from paying dividends to share repurchases (Cole et al., 1996; Liang and Sharpe, 1999; Siegel, 2002); a decrease in the equity premium due to lower transaction costs resulting from the advance of information technology (Siegel, 1999); broader participation in financial markets and greater diversification in portfolios (Vissing-Jorgenson, 1998; Heaton and Lucas, 1999); demographic effects of the baby boomers (Siegel, 2002); and a decline in inflation (Ritter and Warr, 2002; Sharpe, 2002). In other words, the assumption of the stability of the PD ratio is challenged when stock markets witness consecutive permanent regime switches. Fama and French (2002) also find that the downward trend in dividend yields is permanent. Our arguments are consistent with the fact that financial markets experience consecutive permanent regime switches even though those changes are small. We set up an ESTAR model with a time-varying mean to describe the dynamics of the PD ratio: t D t C .t 1 t 1 /F Œt 1 , , t 1 C ut (1) F Œt 1 , , t 1 D expŒ.t 1 t 1 /2 , ut i.i.d..0, 2 /, < 0, t D h.t/ where t is the PD ratio, defined as the deviation of the log of real S&P 500 price index, pt , from the log of real dividend, dt , and hence t D pt dt . The term t is the time-varying mean of the PD ratio, and its trend specification, h.t/, is data determined and hence will vary with sample periods. The trend specification of t in the PD ratio can be justified by the ‘new era’ explanation in the literature. The development of new technology results in investors expecting higher future earning and dividends that shift the growth rate of real dividends upward. The increased ease of trade results in investors reducing the real discount rate. In other words, consecutive regime switches in financial markets shift the mean growth rate of real dividends upward or the mean real discount rate downward which in turn results in an upward trend movement of the mean of the PD ratio (Balke and Wohor, 2001, 2002; Caporale and Gil-Alana, 2004; Psaradakis, 2004). In addition, the trend specification of t in equation (1) is also consistent with our finding below (‘The stationarity of the PD ratio’) that the trend-adjusted PD ratio is stationary. Based upon the lengthy period from 1872 to 2007, the quadratic trend is identified among four different trend specifications: constant, linear, quadratic and cubic formulations. The time-varying mean, t , in equation (1) is therefore specified as t D a + bt + ct2 . It is worth noting that the specification of t is re-identified when more data are available. As pointed out by Lettau and Van Nieuwerburgh (2008), the uncertainty of estimating the time-varying mean of PD ratios is responsible for the difficulty of forecasting stock returns in real time. If our trend measure of the time-varying mean of the PD ratio is at variance with the true mean, then short-run deviations of the PD ratio in equation (1) 3 The property of time-varying PD ratio is discussed in Poterba and Summers (1986) in which the driving forces of the model are the growth rate of real dividends and the real discount rate. The derivation of the model is not reported here but is available upon request from the authors. Copyright © 2011 John Wiley & Sons, Ltd. J. Forecast. (2011) DOI: 10.1002/for Stock Price Predictability should also be incorrectly measured and will result in the failure of beating the benchmark random walk model in out-of-sample contests. Therefore, results from out-of-sample prediction tests allow us to answer whether our trend specification of t provides a reasonable measure of the time-varying mean of the PD ratio. The transition function, F Œt 1 , , t 1 , captures the nonlinearity of the model, which is a function of the lagged PD ratio and its time-varying mean. The parameter (<0) is the parameter that determines the extent of nonlinearities. It also denotes the speed of transition between regimes. The model in equation (1) degenerates into an ESTAR model with a constant mean if t D a. The value of the transition function is bounded between zero and one. In the case where the deviation of the lagged PD ratio from its long-run equilibrium is small then the ratio is highly persistent since the value of F is close to one. As the ratio moves further away from equilibrium, the transition function F moves toward zero and hence the PD ratio follows a stationary process. The PD ratio, as equation (1) indicates, is stationary overall although it is highly persistent when the ratio is very close to its mean at time t. This could be the reason for failing to reject the unit root hypothesis for the PD ratio in the literature. In addition, if the parameter determining the extent of nonlinearities, , is zero then the model degenerates to a unit root process. Otherwise it is a nonlinear stationary process. Therefore, testing the hypothesis of D 0 is the same as testing a linear model against a nonlinear ESTAR model.4 There are two theoretical justifications for a smooth transition autoregressive specification for short-run deviations of the PD ratio. The first one is the significance of noise trading on asset price dynamics as suggested by Poterba and Summers (1986) and Shleifer and Summers (1990). Given the actions of noise traders, the perceived deviations of asset prices from their fundamental value represent risky arbitrage opportunities. Small deviations from fundamentals may not be arbitraged since the perceived gains may be too small to outweigh this risk. However, arbitrage increases as the degree of deviations from the fundamentals increases. The risk arbitrage hypothesis implies nonlinear dynamics of asset prices (Gallagher and Taylor, 2001; McMillan, 2007). The second one is the existence of transaction costs in financial markets. Facing investment uncertainties, the existence of transaction costs leads investors to wait for large arbitrage opportunities before entering the market. Theoretical justifications for the argument of transaction costs are provided by Dumas (1992) and Sercu et al. (1995). EMPIRICAL METHODOLOGY In-sample tests The long-horizon predictive regression model is typically applied in examining the predictability of stock prices, which is given as follows: k pt D ˛k C ˇk .t t / C "t Ck , k D 1, 2, 3, : : : , 10 (2) where t t is the deviation of the PD ratio from its long-run equilibrium value, and k pt pt Ck pt . Equation (2) differs from a conventional long-horizon regression equation in that t t , instead of t , is used to serve as an error correction component. It is t t that reflects deviations from long-run equilibrium given that the PD ratio mean reverts to its time-varying mean. 4 The critical values of testing the hypothesis of D 0 is constructed through bootstrap. Copyright © 2011 John Wiley & Sons, Ltd. J. Forecast. (2011) DOI: 10.1002/for J.-L. Wu and Y.-H. Hu Based on equation (2), the non-predictability of stock price changes can be examined by testing the hypothesis of ˇk D 0 versus ˇk < 0 for a given forecast horizon of k. We adopt the Newey and West (1987) method to construct a consistent estimate of the covariance matrix when the forecast horizon is greater than one. The lag order in constructing the Newey–West covariance matrix is determined based upon Andrews’ (1991) procedure. We also apply a joint test to examine whether the smallest t ratio among the 10 horizons, tmin D min ft.k/ W k D 1, 2, 3, : : : , 10}, is significant. It is well known that the estimate of ˇk in equation (2) is subject to the problem of finite-sample bias since the innovations of PD ratios and price changes may be correlated (Keim and Stambaugh, 1986; Richardson and Stock, 1989). Therefore the inference of ˇk based on a conventional t-test may not be appropriate (Campbell and Yogo, 2006). The inference of ˇk in the paper is conducted based on a bootstrapped t-test in which the critical values of the tˇ statistic are constructed through bootstrap (Kilian and Taylor, 2003; Rapach and Wohar, 2005a). In addition, this paper investigates the inference of ˇk based on a local-to-unity framework below (‘Empirical Investigation’), which assumes that PD ratios follow a process with a root near unity. k Out-of-sample tests To evaluate the out-of-sample predictability of the long-horizon predictive equation relative to the random walk with drift, this work constructs a sequence of recursive forecasts from the above two models, respectively. The first 30-year observations are reserved for estimation and hence the out-ofsample forecast period starts from 1902. This paper re-identifies the trend function and re-estimates the long-horizon regression equation each time we add one observation to our sample and thus use only data available up to the date of forecast. Conventionally, researchers compare the value of root mean square errors (RMSE) from different models to conclude for the superiority of a model in out-of-sample contests if the model has the smallest RMSE. Simply comparing the values of RMSE does not give any idea of the significance of differences. In our case, the null model is a random walk and it is nested with the alternative of a long-horizon regression equation. This paper therefore adopts the CW test provided by Clark and West (2006, 2007) to examine the predictability of stock price movements. The Newey–West method is applied to construct an autocorrelation consistent standard error. If the p-value of the CW statistic is less than 0.1 then this paper asserts that the long-horizon predictive equation outperforms random walk with drift in out-of-sample contests. A joint test is also applied to examine whether the largest CW statistic among the 10 horizons, CWmax D max fCW.k)I k D 1, 2, 3, 4, : : : , 10g, is significant. The critical values of the CW statistic are constructed through a non-parametric bootstrap. Results from out-of-sample tests allow us to examine the significance of the assumptions of time-varying mean and ESTAR dynamics of PD ratios. EMPIRICAL INVESTIGATION Estimation results of the ESTAR model Annual data for the US consumer price index (CPI), nominal stock prices and dividends for S&P 500 from 1872 to 2004 are obtained from Robert Shiller’s web page, and are updated to 2007. The CPI is used to convert the series of nominal stock prices into real terms. The PD ratio is defined as the log of real S&P 500 price index minus the log of real dividend paid on the portfolio of the previous year. This paper applies an ESTAR(1) model to describe short-run deviations of PD ratios. The estimation results, reported in Table I, reveal that all estimates are significant at the 5% level. As for Copyright © 2011 John Wiley & Sons, Ltd. J. Forecast. (2011) DOI: 10.1002/for Stock Price Predictability Table I. Estimation of the ESTAR model t D t C fexpŒ1.567 .t1 t1 /2 g.t1 t1 / C "t .3.596/ Œ0.005 t D 3.151 0.013 t C 0.0002 t2 .27.64/ .3.325/ .5.195/ Q(1) = 3 105 [0.995], LM(1) = 1.5 105 [0.997], Q2 (1) = 2.851 [0.091], ARCH(1) = 2.798 [0.097], Reset(1) = 0.013 [0.909] ; ET = 0.242 [ 0.623 ] Note: t is the PD ratio. A number in parentheses under an estimate is its t statistic, and a number in square brackets is a p-value. LM(p/ and Q(p/ are respectively the Lagrange multiplier and the Ljung–Box autocorrelation tests for up to a pth-order autocorrelation. They are 2 distributions with p degrees of freedom. Q2 .p/ is the Ljung–Box autocorrelation test for squared residuals for up to a pth-order autocorrelation, which has a 2 distribution with p degrees of freedom. ARCH(p/ is a test statistic for up to a pth-order autoregressive conditional heteroscedasticity. It has 2 distribution with p degrees of freedom. RESET is the Ramsey regression specification error test, which has an F distribution. ET is the statistic for firstorder residual autocorrelation in a nonlinear model provided by Eitrheim and Teräsvirta (1996). The p-value of the estimated transition parameter, , is constructed based on a non-parametric bootstrap. residual diagnostic tests, the hypothesis of no serial correlation of residuals is examined using Q, LM and ET statistics.5 The Q statistic on squared residuals, Q2 , and the ARCH statistic of Engle (1982) are applied to examine the autoregressive conditional heteroscedasticity of residuals. Finally, the RESET statistic by Ramsey (1969) is applied to examine the misspecification of the model. Based on these statistics, we find that estimated residuals are well behaved since there is no evidence of serial correlation or conditional heteroscedasticity as indicated by Q, LM, Q2 , and ARCH statistics, respectively. In addition, there is also no evidence of model misspecification as indicated by the RESET statistic. The current research simulates PD ratios based on a non-parametric bootstrap and plots the simulated and realized ratios in Figure 1. Simulated PD ratios fit realized ratios well and the simulated ratio also captures the unprecedented rise of the PD ratio during the period of 1955-1970 and that of post 1982. This further strengthens the appropriateness of the ESTAR model in equation (1) to describe the dynamics of PD ratios. The stationarity of the PD ratio Based on equation (2), the stationarity of the trend-adjusted PD ratio is crucial since it makes little sense to predict price changes with I (1) variables. In addition, neglecting the unit-root problem of regressors leads to spurious results of the predictability of stock returns (Lanne, 2002). This paper applies two different linear unit root tests provided by Dicky and Fuller (1979, ADF) and Elliott et al. (1996, DF-GLS), respectively, to examine the stationarity of PD ratios. In addition to linear unit root tests, this work also applies the nonlinear unit root test provided by Kapetanios et al. (2003, KSS). The KSS test examines the unit-root hypothesis against the hypothesis of a stationary nonlinear smooth transition autoregressive process. The model includes a constant but no trend, and the lag order of the model is set to one in the KSS test based on the Akaike information criteria (AIC) and it is selected based on the modified AIC rule in the ADF and DF-GLS tests as suggested by Ng and Perron (2001). It is worth noting that applying the critical values of the abovementioned unit root tests will bias test results toward rejecting the unit root hypothesis since the 5 The Q and LM statistics are Ljung–Box and Breusch–Goldfrey statistics for residual autocorrelation. The ET statistic, provided by Eitrheim and Teräsvirta (1996), tests for first-order residual autocorrelation in a nonlinear model. Copyright © 2011 John Wiley & Sons, Ltd. J. Forecast. (2011) DOI: 10.1002/for J.-L. Wu and Y.-H. Hu trend-adjusted PD ratio is an estimated series rather than an observed series. To correct the involved estimation uncertainty in our case, we construct the finite sample distribution of unit root tests using a non-parametric bootstrap. Results from Table II indicate that the unit root hypothesis of PD ratios is not rejected at the 5% level by all three statistics. However, the unit root hypothesis of the trend-adjusted ratio (t t / is rejected at the 10% level by the ADF statistic and is rejected at the 5% level by DF-GLS and KSS statistics. In addition, results from Table I indicate that PD ratios exhibit significant nonlinear trends. Results from Tables I and II imply that the PD ratio is stationary and has a time-varying mean. Our results are crucial for several reasons. First, there are several articles that support the unit root hypothesis of the PD ratio and hence reject the present value model (Froot and Obstfeld, 1991; Lamont, 1998; Balke and Wohar, 2002). Several articles in the literature explain the apparent failure of the present value model through the presence of non-fundamental components. A typical non-fundamental component has been ascribed to the presence of a rational bubble, but there are serious theoretical doubts regarding its existence (Campbell et al., 1997, pp. 259–260). An interesting alternative is to consider a nonlinear modeling approach arising from noise traders or the existence of transaction costs (Shleifer, 2000; McMillan, 2007; Gallagher and Taylor, 2001). Following this alternative approach, we model the PD ratio by an ESTAR model with a trend, and find that the trend-adjusted PD ratio is stationary. In contrast to existing findings, our results support that stock prices include both fundamental and non-fundamental components. It also indicates that capturing the impact of non-fundamental components is crucial in empirical analysis. Second, Campbell and Shiller (2001) argue that it is unwise to reject the mean reversion hypothesis of the PD ratio since no pure empirical evidence can indicate that the mean of the ratio has changed. Results of rejecting the unit root hypothesis for the trend-adjusted PD ratio indicate that the long-run PD ratio is time varying, and hence it could be misleading to treat it as a constant. Third, our results point out that conventional long-horizon regression equations using the level of PD ratio, t , as an error correction component are not justified empirically. Stock price predictability under the ESTAR dynamics of PD ratios Given that the ESTAR model with a time-varying mean fits the data well, it is interesting to question the significance of nonlinearities in a data-generating process in improving the out-of-sample predictability of stock prices. This paper estimates predictive regressions at different forecast horizons, Table II. Unit root tests of the PD ratio ADF t Statistics CV 5% CV 10% 1.577 2.883 2.578 DF-GLS t t 3.458 3.691 3.400 t 1.203 1.943 1.615 KSS t t 2.672 2.659 2.353 t t t 1.250 2.930 2.660 4.173 4.055 3.774 Note: t and t t are the PD ratio and trend-adjusted PD ratio, respectively. CV 5% and CV 10% are 5% and 10% critical values. Asterisks (*) and (**) indicate significance at the 10% and 5% level, respectively. ADF is the augmented Dickey– Fuller test, DF-GLS is the unit root test provided by Elliott et al. (1996) and KSS is the nonlinear stationarity test provided by Kapetanios et al. (2003). The lag order of the KSS test is determined based on the Akaiki information rule but the lag order of ADF and DF-GLS tests is selected based on the MAIC rule as suggested by Ng and Perron (2001). Critical values of ADF, DF-GLS and KSS tests for trend-adjusted PD ratio are constructed using a non-parametric bootstrap. Copyright © 2011 John Wiley & Sons, Ltd. J. Forecast. (2011) DOI: 10.1002/for Stock Price Predictability as in Campbell and Shiller (1988, 1998), and then examine the hypothesis of no return predictability under the ESTAR dynamics of PD ratios. This work first examines stock price predictability over the period 1872–2007. Table III reports the marginal significance level (p-value) of the t-statistic at different forecast horizons under the hypothesis that stock prices follow the process of random walk with drift. The finite sample distribution of the t-statistic under different forecast horizons is obtained through a bootstrap procedure in which the DGP of the PD ratio is given by equation (1) with a quadratic function, t D aCbt Cct 2 . Results from the last column of Table III indicate that the bootstrap t-test rejects, at the 5% level, the no-predictability hypothesis of stock price changes for all forecast horizons. Next, the current research examines the out-of-sample predictability of stock prices. Based on the longhorizon regression equation and the model of the random walk with drift, this work computes the ratio of root mean square error (RMSE) from the above two models. The bootstrap CW test is then applied to investigate the hypothesis of no significant difference in the accuracy of two competing forecasts. Results found in the last two columns on the nonlinear DGP panel of Table IV indicate that the ratios of RMSEs are less than one and CW statistics reject the no-predictability hypothesis at the 10% level when the forecast horizon is greater than 1 year. In addition, the joint statistic is also significant at the 5% level. These results are interesting since they reveal both in- and out-of-sample evidence of predictability of stock prices even when the forecast horizon is short. Although RMSE ratios are less than but close to one when the forecast horizon is greater than 1 year under the assumptions of time-varying means and ESTAR nonlinearities of PD ratios, these ratios are significant statistically. These results reveal statistical significance of the above two assumptions in predicting US stock prices. Table III. In-sample predictability test of stock price changes pt ptk D ˛k C ˇk .tk tk / C "t k (year) 1 2 3 4 5 6 7 8 9 10 JS Linear DGP 1872–1997 Nonlinear DGP 1872–2007 1872–1997 1872–2007 CM TVM CM TVM CM TVM CM TVM 0.338 0.182 0.234 0.178 0.080 0.078 0.060 0.014 0.020 0.036 0.036 0.146 0.040 0.098 0.050 0.024 0.026 0.016 0.004 0.008 0.018 0.018 0.480 0.330 0.358 0.292 0.266 0.320 0.372 0.404 0.458 0.490 0.416 0.084 0.024 0.028 0.012 0.010 0.010 0.010 0.002 0.010 0.010 0.018 0.332 0.172 0.244 0.134 0.062 0.048 0.038 0.012 0.026 0.042 0.028 0.112 0.026 0.066 0.032 0.014 0.010 0.010 0.004 0.004 0.008 0.012 0.516 0.330 0.372 0.316 0.284 0.334 0.374 0.402 0.454 0.474 0.430 0.046 0.010 0.018 0.006 0.004 0.004 0.002 0.002 0.002 0.006 0.002 Note: The number in the table is the bootstrapping P -value of in-sample t -statistic, in which the finite sample distribution of the statistic is constructed through bootstrap. The t statistics are constructed based on serial correlation robust standard errors. If a number in the table is less than 0.1 then we claim that the in-sample predictability of stock price changes is rejected at the 10% level of significance. DGP, TVM and CM are a data-generating process, a time-varying mean and a constant mean, respectively. k is the forecast horizon and JS indicates a joint statistic. Numbers in bold indicate significance at the 10% level. Copyright © 2011 John Wiley & Sons, Ltd. J. Forecast. (2011) DOI: 10.1002/for J.-L. Wu and Y.-H. Hu Table IV. Out-of-sample predictability tests of stock prices k 1872–1997 (year) CM RMSE 1872–2007 TVM CM TVM CW(k/ RMSE CW(k ) RMSE CW(k ) RMSE CW(k ) Nonlinear DGP 1 1.0044 2 0.9967 3 1.0005 4 0.9951 5 0.9825 6 0.9773 7 0.9716 8 0.9446 9 0.9351 10 0.9309 JS – 0.408 0.142 0.196 0.130 0.096 0.104 0.104 0.046 0.056 0.074 0.100 1.0010 0.9882 0.9931 0.9862 0.9784 0.9698 0.9680 0.9439 0.9334 0.9347 – 0.284 0.054 0.009 0.068 0.048 0.058 0.056 0.018 0.014 0.030 0.036 1.0083 1.0156 1.0238 1.0292 1.0311 1.0326 1.0368 1.0342 1.0285 1.0215 – 0.540 0.302 0.370 0.272 0.208 0.248 0.276 0.260 0.314 0.332 0.388 1.0025 0.9964 0.9987 0.9949 0.9920 0.9856 0.9844 0.9688 0.9579 0.9552 – 0.266 0.072 0.104 0.076 0.054 0.062 0.052 0.024 0.024 0.042 0.046 Linear DGP 1 1.0044 2 0.9967 3 1.0005 4 0.9951 5 0.9825 6 0.9773 7 0.9716 8 0.9446 9 0.9351 10 0.9309 JS – 0.416 0.152 0.214 0.146 0.104 0.126 0.090 0.058 0.060 0.066 0.126 1.0048 0.9957 1.0028 1.0009 0.9952 0.9857 0.9857 0.9637 0.9530 0.9536 – 0.528 0.092 0.208 0.152 0.102 0.074 0.074 0.028 0.020 0.028 0.052 1.0083 1.0156 1.0238 1.0292 1.0311 1.0326 1.0368 1.0342 1.0285 1.0215 – 0.506 0.294 0.352 0.246 0.208 0.264 0.288 0.268 0.298 0.316 0.358 1.0059 1.0027 1.0071 1.0086 1.0088 1.0029 1.0048 0.9925 0.9823 0.9797 – 0.482 0.120 0.196 0.130 0.084 0.088 0.092 0.054 0.058 0.068 0.128 Note: DGP, TVM and CM are a data-generating process, a time-varying mean and a constant mean, respectively. k is the forecast horizon and JS indicates a joint statistic. The numbers under the ‘RMSE’ column are the ratio of the root mean square error from the predictive equation to that from the random walk with drift under different forecast horizons. If the value of RMSE ratios is less than one then we claim that the long-horizon predictive equation provides more accurate forecasts than random walks. CW(k/ is the statistic provided by Clark and West (2007) under the forecast horizon k in which the truncated lag in constructing the Newey–West covariance matrix is determined based on Andrews’ (1991) procedure. The number under the ‘CW(k/’ column is the bootstrapping p-value of the CW statistic, in which its finite sample distribution is constructed through a bootstrap method. If the value of CW(k/ is less than 0.1 then we claim that the long horizon predictive equation outperforms random walks in out-of-sample contests. ‘–’ indicates that a statistic is not available. Numbers in bold indicate significance at the 10% level. Under the assumption of a constant mean, the trend function in the dynamics of PD ratio, equation (1) is assumed to be a constant, t D a. Results from the eighth column of Table III indicate that the hypothesis of no predictability of stock returns is not rejected by the bootstrap t-test when a constant mean of PD ratio is assumed. Under the assumption of stable PD ratios, the sixth and seventh columns on the nonlinear DGP panel of Table IV report results of out-of-sample tests. These findings reveal that RMSE ratios are greater than one and CW statistics are insignificant at all forecast horizons. In short, there is no evidence of stock price predictability based on both in- and out-of-sample tests under the assumption of stable PD ratios. These results challenge the appropriateness of the stability assumption of PD ratios. Copyright © 2011 John Wiley & Sons, Ltd. J. Forecast. (2011) DOI: 10.1002/for Stock Price Predictability Shiller (2005) points out that the period after 1997 is full of institutional changes in stock markets. Top capital gains tax rate was cut from 28% to 20% at 1997 and a large number of users did not discover the Web until 1997. In addition, PD ratios reveal an enormous spike at 2000. The 10-year average of PD ratios (without taking logs) over 1998–2007 is 67, which is about two to three times the historical average by this measure.6 This paper therefore excluded the period 1998–2007 and then applied this sub-period to check the robustness of our results. Results from the seventh column of Table III reject the no-predictability hypothesis of stock returns as long as the forecast horizon is greater than 1 year when the time-varying mean of PD ratios is allowed. RMSE ratios and CW statistics found in the fourth and fifth columns on the nonlinear DGP panel of Table IV indicate evidence of out-of-sample predictability when forecast horizons are greater than 1 year. In addition, results found in the sixth column of Table III and the second and third columns on the nonlinear DGP panel of Table IV support the evidence of stock price predictability under the stability hypothesis of PD ratios only when the forecast horizon is long. Based on the nonlinear DGP panel of Tables III and IV, this paper finds two interesting results. First, the evidence of stock price predictability is sensitive to sample periods under a constant mean assumption of PD ratios yet it is robust to sample periods when the assumption of time-varying means is made. Second, the assumption on the time-varying mean of PD ratio is helpful to strengthen the inand out-of-sample evidence of return predictability. Lettau and Van Nieuwerburgh (2008) challenge the assumption of a fixed long-run value of the PD ratio. Allowing for structural changes in the predictive equation, they find support for the in-sample predictability of stock returns, but fail to find evidence of beating the benchmark random walk model in out-of-sample contests. Results from the nonlinear DGP panel of Tables III and IV indicate both in- and out-of-sample predictability of US stock prices regardless of sample periods when the timevarying mean and nonlinearity of PD ratios are taken into account. The result of beating the random walk model is crucial since a general consensus in extant literature indicates the failure of outperforming the random walk model in out-of-sample tests (Goyal and Welch, 2003; Lettau and Van Nieuwerburgh, 2008; Welch and Goyal, 2008). A last question further arises: Why is it that outof-sample evidence of stock price predictability is much weaker than in-sample evidence in the literature? Our results show that these inconsistent results can be reconciled once the time-varying mean and nonlinearities of PD ratios are controlled. Stock price predictability under a linear AR model for PD ratios For the purpose of comparison, this paper estimates predictive regressions at different forecast horizons and then examines the hypothesis of no return predictability under a linear AR model of PD ratios. The linear dynamic of PD ratios is described as follows: t D t C m P ut i.i.d..0, 2 / i .t i t i / C ut , i D1 (3) t D a C bt C ct 2 where m is set to one based on the Akaike information rule. This research first examines stock price predictability over the period 1872–2007. Results from the fifth column of Table III indicate that the hypothesis of no predictability of stock prices is rejected at the 10% level by the bootstrap t-test 6 In addition, the average of the 10-year average of PD ratios (without taking logs) over 1872–1997 is 23. Copyright © 2011 John Wiley & Sons, Ltd. J. Forecast. (2011) DOI: 10.1002/for J.-L. Wu and Y.-H. Hu regardless of forecast horizons. As for out-of-sample tests, results found in the last two columns on the linear DGP panel of Table IV indicate that RMSE ratios are less than one when the forecast horizon is greater than 7 years. The CW statistic rejects the no-predictability hypothesis of stock prices, at the 10% level, when the forecast horizon is greater than 4 years. However, the joint CW statistic has a P -value of 0.128, which fails to reject the no-predictability hypothesis. It is worth noting that RMSE ratios are greater than 1.0 but the P -values of CW statistic are less than 0.1 when the forecast horizon is 5, 6 and 7 years. This is possible since the CW statistic uses the MSPE adjusted series, instead of MSPE series, to examine the no-predictability hypothesis of stock prices. Based on CW statistics and RMSE ratios, this research concludes that the no-predictability hypothesis of stock prices is rejected, at the 10% level, when the forecast horizon is greater than 7 years. Our results also reveal that it might be misleading to reject the no-predictability hypothesis solely based on the CW statistic. Under the assumption of a constant mean, the t-statistics from column four of Table III indicate no evidence of in-sample return predictability, at the 10% level, regardless of forecast horizons. Results found in the six and seventh columns on the linear DGP panel of Table IV indicate that RMSE ratios are greater than one and that CW statistics fail to reject the no-predictability hypothesis of stock prices, at the 10% level, regardless of forecast horizons. Are the foregoing results sensitive to sample periods? To check the robustness of our findings to the sample periods, this paper repeats the experiment of stock price predictability over the period 1872–1997. Results from second and third columns of Table III indicate that the hypothesis of no predictability of stock price changes is rejected by the bootstrap t-test for the forecast horizon being greater than 1 year when the time-varying mean of the PD ratio is assumed. However, the above hypothesis is rejected for the forecast horizon being greater than 4 years under the assumption of stable PD ratio. Under the time-varying mean assumption of PD ratios, results found in columns four and five on the linear DGP panel of Table IV indicate that RMSE ratios are less than one when the forecast horizon is greater than 4 years. In addition, CW statistics are significant when the forecast horizon is greater than 5 years and the joint CW statistic has a P -value of 0.052, rejecting the no-predictability hypothesis. Under the assumption of a constant mean, results found in column two on the linear DGP panel of Table IV indicate that RMSE ratios are less than one when the forecast horizon is greater than 3 years. Results found in the third column on the linear DGP panel of Table IV reveal that CW statistics are significant when the forecast horizon is greater than 6 years. In other words, under the assumption of a linear AR model for PD ratios, the evidence of out-of-sample predictability on stock prices is observed only when the forecast horizon is long. Overall, results from Tables III and IV are interesting. First, the evidence of stock price predictability is strengthened once the nonlinearity and the time-varying mean of the PD ratio are taken into account and the evidence is robust to sample periods. Second, results of stock price predictability are sensitive to sample periods when the mean of PD ratios is assumed to be a constant even though the nonlinearity of the ratio has been taken into account. Why are the results of stock price predictability sensitive to sample periods under the assumption of stable PD ratios? Our empirical investigation points out that the reason could be structural changes in the predictive equation when the mean of PD ratios is time varying but is assumed to be a constant. Return predictability within a local-to-unity framework The inference in predictive regressions depends critically on the assumed stochastic properties of the PD ratio. Although this paper fails to reject the unit root hypothesis of PD ratios as found in Table II, it is very likely that we are not able to distinguish whether the ratio has a unit root. To Copyright © 2011 John Wiley & Sons, Ltd. J. Forecast. (2011) DOI: 10.1002/for Stock Price Predictability take this uncertainty into account, several recent articles re-examine the evidence of in-sample return predictability using tests that are available within a local-to-unity framework (Cavanagh et al., 1995; Torous et al., 2004; Campbell and Yogo, 2005, 2006). Local-to-unity asymptotics provide an accurate approximation to the finite-sample distribution of test statistics when the predictor variables are highly persistent. This paper applies a new Bonferroni procedure, based on the DF-GLS test and the Q test, provided by Campbell and Yogo (2006), to re-examine the in-sample predictability of stock price changes. Table V reports the correlation coefficients (ı/ between the innovation of the PD ratio and price changes, the 95% confidence interval of the autoregressive root ( / for the PD ratio, the estimate of the slope coefficient (ˇk / in the predictive equation, the t-statistic of ˇk constructed based on a serial correlation robust standard error, and the 90% confidence interval of ˇk based on DF-GLS and Q statistics. Based on the period 1872–2007, results found in column two on the 1872–2007 panel of Table V indicate that the correlation coefficients are positive and large when the forecast horizon is short, and they decrease with the forecast horizon. This is because movements in stock price changes and PD ratios mostly come from the movement in stock prices when the forecast horizon is short. Table V. Tests of return predictability within the local-to-unity framework pt ptk D ˛k C ˇk tk C "t t D ! C t1 C ut D 1 C c=T with c a fixed constant and T is the number of observations k (year) ı ˇk t statistic 95% CI: 90% CI ˇk : Q-test 1872–2007 1 2 3 4 5 6 7 8 9 10 0.831 0.645 0.446 0.457 0.384 0.283 0.285 0.314 0.240 0.201 0.028 0.065 0.070 0.104 0.128 0.124 0.121 0.143 0.150 0.154 0.610 1.208 1.138 1.472 1.620 1.395 1.177 1.204 1.206 1.240 [0.887,1.012] [0.889,1.013] [0.889,1.014] [0.869,1.006] [0.894,1.016] [0.902,1.019] [0.900,1.018] [0.873,1.010] [0.832,0.993] [0.795,0.977] [0.063,0.089] [0.124,0.058] [0.152,0.057] [0.198,0.038] [0.235,0.025] [0.252,0.042] [0.269,0.069] [0.313,0.077] [0.335,0.077] [0.343,0.070] 1872–1997 1 2 3 4 5 6 7 8 9 10 0.828 0.647 0.428 0.420 0.332 0.229 0.227 0.251 0.162 0.140 0.063 0.137 0.154 0.216 0.262 0.252 0.255 0.302 0.311 0.318 1.087 1.857 1.739 2.352 3.175 3.252 3.684 5.367 5.004 4.330 [0.766, 0.964] [0.910, 1.025] [0.910, 1.025] [0.898, 1.022] [0.895, 1.022] [0.882, 1.017] [0.885, 1.019] [0.883, 1.019] [0.878, 1.017] [0.876, 1.017] [0.093, 0.119] [0.128, 0.102] [0.216, 0.073] [0.285, 0.016] [0.332, 0.058] [0.334, 0.075] [0.325, 0.092] [0.345, 0.152] [0.381, 0.172] [0.411, 0.166] Note: ı indicates the correlation between the innovations of PD ratio and price changes. CI indicates a confidence interval. t .k/ is the t statistic of ˇk under the forecast horizon k and it is constructed based on serial correlation-robust standard errors. A bold confidence interval of ˇk indicates rejection of the hypothesis of no-return predictability at the 10% level of significance. Copyright © 2011 John Wiley & Sons, Ltd. J. Forecast. (2011) DOI: 10.1002/for J.-L. Wu and Y.-H. Hu The confidence interval of does contain a unit root except when the forecast horizon is greater than 8 years. Results found in the fourth column of the 1872–2007 panel of Table V indicate that the no-predictability hypothesis is not rejected by t statistics at all forecast horizons. The last column in the 1872–2007 panel of Table V indicates that a 90% Bonferroni confidence interval for ˇk , based on a Q test, does not lie strictly below zero for all forecast horizons. Hence the current research fails to reject the hypothesis of no in-sample predictability of stock price changes at the 10% level regardless of forecast horizons. These results are consistent with those from the bootstrap t-test under the constant mean assumption of PD ratios. With the period 1872–1997, results found in the 1872–1997 panel of Table V indicate that the contemporaneous correlation of innovations is high with a short horizon and decreases with the length of the forecast horizon. The confidence interval of does contain a unit root except for the forecast horizon of 1 year. The t statistics reject the no-predictability hypothesis at the 10% level when the forecast horizon is greater than 1 year. However, a 90% Bonferroni confidence interval for ˇk based on a Q test lies strictly below zero when the forecast horizon is greater than 4 years, revealing evidence of long-horizon predictability of stock price changes. Again, this paper finds that results from a linear model within the local-to-unity framework are sensitive to sample periods. Campbell and Yogo (2006) examine one-step-ahead return predictability based on several financial ratios within the framework of nearly integrated regressors. They fail to find evidence of return predictability based on PD ratios. Our results are consistent with theirs.7 It is interesting to find that results from the last column of Table V are consistent with those from the second and fourth columns of Table III. This further confirms the appropriateness of applying bootstrap tests to examine the predictability of stock returns. ECONOMIC SIGNIFICANCE Given the statistical evidence of rejecting the no-predictability hypothesis of stock prices, it is interesting to ask if a long-horizon predictive equation is helpful for investors to make a profit in stock markets. This paper investigates the economic significance of the following four models: the random walk with drift (RW), a linear long horizon predictive equation with a constant mean (MD1) and a linear long horizon predictive equation with a time-varying mean in the linear autoregressive process of PD ratios (MD2) and the ESTAR process of PD ratios (MD3), respectively. The economic significance of a model is evaluated based on the following two different criteria: expected real return per unit of risk and ex post real profit. The annualized expected real return per unit of risk is obtained by dividing the conditional p real return (Et .pt Ch pt //h/ by the conditional standard deviation of the return at time t vart .pt Ch pt /= h . To construct ex post real returns, this paper sets up the following trading rule. Given the information set at time t, an investor predicts h-period-ahead log real stock price (pOt Ch / based on the long horizon prediction equation in (1). If an investor predicts an increase of the real stock price (pOt Ch > pt /, then the investor will buy one unit of portfolio at time t and then sell the unit at t C h. The annualized ex post real return after trading at t C h is .pt Ch pt /= h. Otherwise, the investor will sell a unit of portfolio at time t and then buy back a unit of portfolio at time t C h. In such a case, the ex post real return at t C h is .pt pt Ch /= h. For each criterion, this paper computes the expected real return and the ex post real return for each out-of-sample period and then reports their 7 It is worth noting that the definitions of return and sample periods in Campbell and Yogo (2006) are different from ours. Copyright © 2011 John Wiley & Sons, Ltd. J. Forecast. (2011) DOI: 10.1002/for Stock Price Predictability means over the out-of-sample period under a specific forecast horizon in Table VI. The significance of these means is examined using a simple t-test. Results from the top panel of Table VI report the economic significance of models based on the expected real return per unit of risk. The means of expected real return per unit of risk under different forecast horizons are positive for all models under consideration. The proposed nonlinear model (MD3) results in the highest mean of expected real return per unit of risk regardless of forecast horizons in the sub period (1872–1997), which is also true when the forecast horizon is greater than 4 years throughout the entire period (1872–2007). Results from the bottom panel of Table VI indicate that the means of ex post real return from different forecast horizons based on the constant mean model (MD1) are positive for 9 out of 10 horizons in the sub-period, but the means are all negative in the whole period. The random walk model reveals the highest mean of ex post real return among the four models for most horizons regardless of sample periods. Although the means of ex post real return based on MD2 and MD3 are all positive, the value of significance based on MD3 is much larger than that from MD2 regardless of sample periods. Table VI. Economic significance RW: k pt D ˛k C "tCk , MD1: k pt D ˛k C ˇk .t c/ C "tCk I k D 1, 2, 3, : : : , 10 MD2: k pt D ˛k C ˇk .t t / C "tCk I t D t C 1 .t1 t1 / C ut , t D h.t/, k D 1, 2, 3, : : : , 10 MD3: k pt D ˛k C ˇk .t t / C "tCk I t D t C .t1 t1 / expŒ.t1 t1 /2 C ut I t D h.t/, k D 1, 2, 3, : : : , 10 k 1872–1997 RW MD1 1872–2007 RW MD1 MD2 MD3 1 2 3 4 5 6 7 8 9 10 Annualized expected real return per unit of risk 0.093 0.058 0.098 0.101# 0.094 0.044 0.108 0.113# 0.107 0.061 0.115 0.118# 0.111 0.057 0.133 0.136# 0.118 0.069 0.155 0.160# 0.137 0.098 0.170 0.173# 0.148 0.115 0.184 0.187# 0.150 0.123 0.191 0.194# 0.152 0.128 0.186 0.187# 0.155 0.134 0.186 0.187# 0.096 0.096 0.108 0.112 0.117 0.135 0.145 0.146 0.147 0.149 0.053 0.025 0.045 0.031 0.034 0.061 0.079 0.084 0.094 0.103 0.084 0.083 0.092 0.104 0.121 0.138 0.156 0.165 0.164 0.168 0.089 0.091 0.098 0.110 0.128# 0.144# 0.160# 0.169# 0.168# 0.172# 1 2 3 4 5 6 7 8 9 10 Annualized ex post real returns 0.039 0.020 0.005 0.036 0.007 0.030 0.033 0.001 0.007 0.031 0.009 0.008 0.029 0.011 0.010 0.028 0.013 0.012 0.027 0.014 0.011 0.026 0.023 0.013 0.025 0.023 0.018 0.024 0.024 0.019 0.057 0.055 0.055 0.054 0.051 0.052 0.055 0.058 0.060 0.059 0.062 0.017 0.034 0.018 0.015 0.015 0.018 0.014 0.016 0.016 0.023 0.025 0.009 0.013 0.005 0.001 0.028 0.004 0.038 0.039 MD2 MD3 0.014 0.048# 0.022 0.029 0.027 0.031# 0.016 0.023 0.024 0.022 0.032 0.044 0.029 0.033 0.032 0.041 0.032 0.038 0.044 0.047 Note: k indicates the forecast horizon. Numbers in the table are means of expected real return per unit of risk and ex post real return under different models and forecast horizons. Numbers in bold indicate significance at the 10% level. ‘#’ indicates that the number is greater than that from the random walk. Copyright © 2011 John Wiley & Sons, Ltd. J. Forecast. (2011) DOI: 10.1002/for J.-L. Wu and Y.-H. Hu Overall, results from Table VI indicate that the expected real return per unit of risk from the proposed nonlinear model is higher than that from the random walk and other competitive models for most forecast horizons. Ex post real returns from the proposed nonlinear model are significantly positive if the forecast horizon is greater than 1 year. The above results are robust to sample periods. These results echo our findings from Tables III and IV indicating the significance of the time-varying mean and nonlinear dynamics of PD ratios in the empirical analysis. SIZE AND POWER OF BOOTSTRAP TESTS Given the rejection of the hypothesis of no stock price predictability as indicated in Tables III and IV, it is interesting to inquire whether our findings result from the problem of size distortion or from a reasonable power of bootstrap tests. To address this question, we adopt a three-step strategy to evaluate the power and size of bootstrap tests, which is available upon request from the authors. Results presented in Table VII indicate that the effective size of out-of-sample tests at different forecast horizons is close to 0.10. However, the effective size of the in-sample test is about 0.13 when the forecast horizon is less than 3 years, and is close to 0.10 for the rest of the horizons. Given that the P -values of the bootstrap t statistic in the last column of Table III are fairly low (less than 0.05), the slight size distortion will not affect our conclusion about in-sample predictability. Overall, this paper concludes that the rejection of the no-predictability hypothesis of stock prices found in the last column of Table III and the nonlinear DGP panel of Table IV is not likely due to the problem of size distortion. Following the three-step bootstrapping strategy, we report our results of power analysis in Table VII. The power of the in-sample bootstrap test is in general high and varies around 0.90. As for the power of the out-of-sample bootstrap test, it varies in the range of 0.50–0.67. Based on the size and power analysis, we conclude that the major reason for rejecting the non-predictability hypothesis of stock prices is the reasonable power of bootstrap tests. Results shown in the sixth and seventh columns of the nonlinear DGP panel of Table IV, respectively, indicate no evidence of stock price predictability when the assumption of a constant mean of PD ratios is imposed. Could these results be due to the fact of neglecting a time-varying mean of PD ratios in empirical analysis? If our conjecture is correct, then we should find low powers of bootstrap tests when the existing time-varying mean of PD ratios is neglected in empirical analysis. To verify this conjecture, we investigate the power of bootstrap tests where a time-varying mean of PD ratio exists in data but is treated as a constant in long-horizon prediction analysis and in bootstrapping the finite sample distribution of various statistics. The power and size of bootstrap tests for this case Table VII. Size and power of bootstrap tests k (year) Size t.k/ CW(k/ 1 2 3 4 5 6 7 8 9 10 JS 0.132 0.122 0.138 0.116 0.130 0.122 0.114 0.118 0.122 0.108 0.112 0.114 0.106 0.112 0.112 0.110 0.106 0.102 0.110 0.108 0.108 0.110 Power t.k/ CW(k/ 0.916 0.566 0.938 0.622 0.938 0.652 0.938 0.668 0.930 0.650 0.920 0.640 0.914 0.604 0.904 0.570 0.886 0.528 0.866 0.492 0.904 0.562 Note: k is the forecast horizon. t .k/ and CW(k/ are t and CW statistics under the forecast horizon k. The data-generating processes for size and power analysis are available upon request from the authors. Copyright © 2011 John Wiley & Sons, Ltd. J. Forecast. (2011) DOI: 10.1002/for Stock Price Predictability Table VIII. Size and power of bootstrap tests: without correcting the time-varying mean in the long-horizon regression equation k (year) Size t.k/ CW(k/ 1 2 3 4 5 6 7 8 9 10 JS 0.080 0.084 0.070 0.070 0.074 0.076 0.076 0.072 0.076 0.076 0.072 0.068 0.070 0.066 0.072 0.066 0.072 0.066 0.072 0.064 0.064 0.060 Power t.k/ CW(k/ 0.238 0.334 0.242 0.354 0.266 0.374 0.242 0.382 0.234 0.380 0.214 0.342 0.200 0.308 0.190 0.282 0.176 0.268 0.160 0.232 0.166 0.216 Note: k is the forecast horizon. t .k/ and CW(k/ are t and CW statistics under the forecast horizon k. Detailed simulation procedures are available upon request from the authors. are reported in Table VIII. The reported results show that the power of in-sample (out-of-sample) bootstrap tests varies from 0.16 to 0.27 (0.22 to 0.38).8 The effective size of in- and out-of-sample bootstrap tests varies from 0.06 to 0.08, given the nominal size of 10%, indicating a medium size distortion. In sum, the incorrect assumption of a constant mean in the PD ratio results in low power of bootstrap tests. This could be a major reason for failing to find out-of-sample predictability of stock prices in Table IV and existing literature. CONCLUSIONS There has never been convincing evidence that PD ratios were ever useful for out-of-sample predictability of stock returns (Goyal and Welch, 2003). Using US data from 1872 to 2007, this paper reinvestigates the predictability of stock prices based on PD ratios. In contrast to the conventional stability assumption of the PD ratio, we stress the significance of its time-varying mean and its ESTAR dynamics. Several interesting results are obtained from our empirical investigation. First, we find that the dynamics of the PD ratio can be well described by an ESTAR model with a time-varying mean and that the PD ratio could be a unit root process, whereas the trend-adjusted PD ratio is stationary. These results support the premise that stock prices include both fundamental and non-fundamental components and that modeling the impact of both components is crucial in empirical analysis. Second, our bootstrap tests, based on a nonlinear data-generating process with time-varying means, provide significant in- and out-of-sample evidence to reject the no-predictability hypothesis of stock prices. The hypothesis is not rejected when the stability of the PD ratio is assumed in the data-generating process. Third, the expected real return per unit of risk from the proposed nonlinear model is larger than that from the random walk and other competitive models for most forecast horizons. Ex post real profits from the proposed nonlinear model rather than the constant mean model are significantly positive if the forecast horizon is greater than 1 year. However, ex post real profits from the proposed nonlinear mode are less than those from the random walk in general. Fourth, the above results are robust to a subsample of 1872–1997. Fifth, our bootstrap tests have good size and achieve reasonable power. These results indicate that allowing for a time-varying mean in the PD ratio is crucial for predicting stock price movements and hence support our assertion that the mean of the US PD ratio is time varying. Our results also indicate that inconsistent results for in- and out-of-sample tests 8 It is meaningless to compare the power of tests in Table VII with that of Table VIII since their DGPs are different. Copyright © 2011 John Wiley & Sons, Ltd. J. Forecast. (2011) DOI: 10.1002/for J.-L. Wu and Y.-H. Hu in the extant literature are reconciled once the time-varying mean and nonlinearity of the PD ratio are taken into account in regressions. ACKNOWLEDGEMENTS For helpful comments we are grateful to an anonymous referee, Alan Taylor, as well as participants in seminars at UC Davis and National Taiwan University. Part of the paper was finished while Wu was visiting UC Davis, where the Department of Economics graciously provided him with superior research facilities. Wu is grateful for financial support from the National Science Council in Taiwan. REFERENCES Andrews D. 1991. Heteroscedasticity and autocorrelation consistent covariance matrix estimation. Econometrica 59: 817–858. Avramov D. 2002. Stock return predictability and model uncertainty. Journal of Financial Economics 64: 423–458. Balke NS, Wohar ME. 2001. Explaining stock price movements: is there a case for fundamentals? Federal Reserve Bank of Dallas Economic and Financial Review third quarter: 22–34. Balke NS, Wohar ME. 2002. Low-Frequency Movement in Stock Prices: A State-space Decomposition. Review of Economics and Statistics 84: 649–667. Campbell JY. 2000. Asset pricing at the millennium. Journal of Finance 55: 1515–1567. Campbell JY, Shiller RJ. 1988. The dividend–price ratio and expectations of future dividends and discount factors. Review of Financial Studies 1: 195–227. Campbell JY, Shiller RJ. 1998. Valuation ratios and the long-run stock market outlook. Journal of Portfolio Management 24: 11–26. Campbell JY, Shiller RJ. 2001. Valuation ratios and the long-run stock market outlook: an update. NBER Working Paper 8221. Campbell JY, Yogo M. 2005. Implementing the econometric methods in ‘Efficient tests of stock return predictability’. Working paper, University of Pennsylvania. Campbell JY, Yogo M. 2006. Efficient tests of stock return predictability. Journal of Financial Economics 81: 27–60. Campbell J, Lo AJ, Mackinlay AC. 1997. The Econometrics of Financial Markets. Princeton University Press: Princeton NJ. Caporale GM, Gil-Alana LA. 2004. Fractional cointegration and tests of present value models. Review of Financial Economics 13: 245–258. Cavanagh CL, Elliott G, Stock JH. 1995. Inference in models with nearly integrated regressors. Econometric Theory 11: 1131–1147. Clark TE, West KD. 2006. Using out-of-sample mean squared prediction errors to test the martingale difference hypothesis. Journal of Econometrics 135: 155–186. Clark TE, West KD. 2007. Approximately normal tests for equal predictive accuracy in nested models. Journal of Econometrics 138: 291–311. Coakley J, Fuertes AM. 2006. Valuation ratios and price deviations from fundamentals. Journal of Banking and Finance 30: 2325–2346. Cole K, Helwege J, Laster D. 1996. Stock market valuation indicators: is this time different? Financial Analysts Journal May/June: 56–64. Dickey DA, Fuller WA. 1979. Distribution of the estimators for autoregressive time series with a unit root. Journal of American Statistical Association 74: 427–431. Dumas B. 1992. Dynamic equilibrium and the real exchange rate in a spatially separated world. Review of Financial Studies 5: 153–180. Eitrheim O, Teräsvirta T. 1996. Testing the adequacy of smooth transition autoregressive models. Journal of Econometrics 74: 59–76. Elliott G, Rothenberg TJ, Stock JH. 1996. Efficient tests for an autoregressive unit root. Econometrica 64: 813–836. Copyright © 2011 John Wiley & Sons, Ltd. J. Forecast. (2011) DOI: 10.1002/for Stock Price Predictability Engle R. 1982. Autoregressive conditional heteroscedasticity with estimates of the variance of United Kingdom inflation. Econometrica 50: 987–1007. Fama EF, French KR. 1988. Dividend yields and expected stock returns. Journal of Financial Economics 25: 23–49. Fama EF, French KR. 2002. The equity premium. Journal of Finance 57: 637–659. Froot K, Obstfeld M. 1991. Intrinsic bubbles: the case of stock prices. American Economic Review 81: 922–930. Gallagher LA, Taylor MP. 2001. Risky arbitrage, limits of arbitrage, and nonlinear adjustment in the dividend-price ratio. Economic Inquiry 39: 524–536. Goyal A, Welch I. 2003. Predicting the equity premium with dividend ratios. Management Science 49: 639–654. Heaton J, Lucas D. 1999. Stock prices and fundamental. NBER Macroeconomic Annual: 213–241. Hobijn B, Jovanovic B. 2001. The information-technology revolution and the stock market: evidence. American Economic Review 91: 1203–1220. Kapetanios G, Shin Y, Snell A. 2003. Testing for a unit root in the nonlinear STAR framework. Journal of Econometrics 112: 359–379. Keim D, Stambaugh RF. 1986. Predicting returns in the stock and bond markets. Journal of Financial Economics 17: 357–390. Kilian L, Taylor MP. 2003. Why is it difficult to beat the random walk forecast of exchange rates? Journal of International Economics 60: 85–107. Lamont O. 1998. Earnings and expected returns. Journal of Finance 53: 1563–1587. Lanne M. 2002. Testing the predictability of stock returns. Review of Economics and Statistics 84: 407–415. Lettau M, Van Nieuwerburgh S. 2008. Reconciling the return predictability evidence. Review of Financial Studies 21: 1607–1652. Liang JN, Sharpe SA. 1999. Share repurchases and employee stock options and their implications for S&P 500 share retirements and expected returns. Finance and Economic Discussion Series 1999-59, Board of Governors of the Federal Reserve, November. McMillan DG. 2007. Bubbles in the dividend-price ratio? Evidence from an asymmetric exponential smoothtransition model. Journal of Banking and Finance 31: 787–804. Newey WK, West KD. 1987. A simple positive semi-definite heteroscedasticity and autocorrelation consistent covariance matrix. Econometrica 55: 703–708. Ng S, Perron P. 2001. Lag length selection and the construction of unit root tests with good size and power. Econometrica 69: 1519–1554. Paye BS, Timmermann A. 2006. Instability of return prediction models. Journal of Empirical Finance 13: 274–315. Poterba JM, Summers LH. 1986. Mean reversion in stock prices: evidence and implications. Journal of Financial Economics 22: 27–59. Psaradakis Z, Sola M, Spagnolo F. 2004. On Markov error-correction models, with an application to stock prices and dividends. Journal of Applied Econometrics 19: 69–88. Ramsey JB. 1969. Tests for specification errors in classical linear least squares regression analysis. Journal of the Royal Statistical Society 31: 350–371. Rapach DE, Wohar ME. 2005a. Valuation ratios and long-horizon stock price predictability. Journal of Applied Econometrics 20: 327–344. Rapach DE, Wohar ME. 2005b. Structural breaks and predictive regression models of aggregate U.S. stock returns. Journal of Financial Econometrics 4: 238–274. Richardson M, Stock JH. 1989. Drawing inferences from statistics based on multiyear asset returns. Journal of Financial Economics 25: 323–348. Ritter JR, Warr RS. 2002. The decline of inflation and the bull market of 1982–1999. Journal of Financial and Quantitative Analysis 37: 29–61. Robertson D, Wright S. 2006. Dividends, total cash flow to shareholders, and predictive return regressions. Review of Economics and Statistics 88: 91–99. Sercu P, Uppal R, Van Hulle C. 1995. The exchange rate in the presence of transaction costs: implications for tests of purchasing power parity. Journal of Finance 50: 309–319. Sharpe SA. 2002. Reexamining stock valuation and inflation: the implications of analysts’ earnings forecasts. Review of Economics and Statistics 84: 632–648. Shiller R. 2005. Irrational Exuberance, 2nd edn. Princeton University Press: Princeton, NJ. Shleifer A, Summers LH. 1990. The noise trader approach to finance. Journal of Economic Perspectives 4(2): 19–33. Copyright © 2011 John Wiley & Sons, Ltd. J. Forecast. (2011) DOI: 10.1002/for J.-L. Wu and Y.-H. Hu Shleifer A. 2000. Inefficient Markets: An Introduction to Behavioural Finance Clarendon Lectures in Economics. Oxford University Press: Oxford. Siegel J. 1999. The shrinking equity premium. Journal of Portfolio Management Fall: 10–17. Siegel J. 2002. Stocks for the Long-Run: The Definitive Guide to Financial Market Returns and Long-Term Investments Strategies, 3rd edn. McGraw-Hill: New York. Torous W, Valkanov R, Yan S. 2004. On predicting stock returns with nearly integrated explanatory variables. Journal of Business 77: 937–966. Vissing-Jorgenson A. 1998. Limited stock market participation. Dissertation, Massachusetts Institute of Technology. Welch I, Goyal A. 2008. A comprehensive look at the empirical performance of equity premium prediction. Review of Financial Studies 21: 1455–1508. Authors’ biographies: Jyh-Lin Wu graduated from Tamkang University in 1980 with a B.A. in international trade, and earned a PhD in economics from the Ohio State University in 1990. In 1990 he began to work at National Chung-cheng University until 2003. He worked at the Sun Yat-sen university since 2004. His research interests include empirical international finance and empirical macroeconomics. Yu-Hau Hu graduated from National Cheng-chi University in 1994 with a B.A. in economics, and earned a PhD in International Economics from National Chung-cheng University in 2007. In 1998 he began to work at the Cheng-Shiu Technological University. His research interests include empirical international finance and empirical macroeconomics. Authors’ addresses: Jyh-Lin Wu, Institute of Economics, National Sun Yat-Sen University, Kaohsiung, Taiwan; and Department of Economics, National Chung-Cheng University, Chia-Yi, Taiwan. Yu-Hau Hu, Department of International Trade, Cheng-Shiu Technological University, Kaohsiung, Taiwan. Copyright © 2011 John Wiley & Sons, Ltd. J. Forecast. (2011) DOI: 10.1002/for
© Copyright 2026 Paperzz