Analyzing the short-term predictability of the stock market Alex P. Taylor∗ Alliance Manchester Business School Jingya Wang† Alliance Manchester Business School February 14, 2016 ∗ Alex P. Taylor, Accounting and Finance Group, Alliance Manchester Business School, The University of Manchester, Booth Street West, Manchester, M15 6PB, England, e-mail: [email protected], Tel: +44(0)161 275 0441, Fax: +44(0)161 275 4023. † Corresponding Author: Jingya Wang, a PhD student in finance, Alliance Manchester Business School, The University of Manchester, Booth Street West, Manchester, M15 6PB, England, e-mail: [email protected] 1 Abstract This paper evaluates the ability of lagged commodity returns to forecast market returns. In order to exploit the predictive information from a relatively large amount of commodity returns, we apply the partial-leastsquares (PLS) method pioneered by Kelly and Pruitt (2013). We find that the commodity returns measured over previous twelve months show strong predictive power in monthly and three-month forecasts, in-sample and outof-sample. The findings are robust to controlling for risk factors such as momentum, Fama-French three factors and industry returns previously identified to be significant predictors of market returns (Hong, Torous and Valkanov, 2007). 2 1 Introduction A large array of predictors have been identified in the literature on forecasting aggregate market returns. Prominent amongst these predictors are valuation ratios, for example the dividend-price (dp) ratio, the dividend-yield (dy) ratio, the book-to-market (bm) ratio, the earnings-dividend (ed) ratio, the price-earnings (pe) ratio.1 Researchers find that these valuation ratios show good performance in forecasting market returns at long horizons due to their high persistence (e.g., Jegadeesh, 1991; Lettau & Ludvigson, 2001; Welch & Goyal, 2008; Cochrane, 2005, pp.391-395). Researchers also have linked the stock market with other markets, such as the bond market predictors (e.g., Campbell, 1987; Fama & French, 1989; Asness, 2003; Fama & French, 1989; Campbell, 1987; Hodrick, 1992), the exchange market (Amihud, 1994; Bartov & Bohnar, 1994; Obben, Pech, & Shakur, 2007). However, relatively little attention has been paid to whether the the commodity market contains information relevant for predicting stock market returns. The relationship between commodity returns and stock market returns has mostly been investigated by examining the contemporaneous correlation of stocks returns and commodity factors (e.g., Brooks, Fernandez-Perez, Miffre, & Nneji, 2014; Baker & Routledge, 2015). The literature primarily concentrates on oil commodity exposure (e.g., Chen, Roll, and Ross (1986), Hamilton (1983); Jones and Kaul (1996)), however, other commodities are seldom investigated. More recently, some studies have examined the role of commodity returns as predictors of stock returns, and find evidence that the commodity risk indeed matters for future investment opportunities (e.g., Jacobsen, Marshall, & Visaltanachoti, 2009, Hu & Xiong, 2013; Hong & Yogo, 2012; Bakshi, Gao, & Rossi, 2014; Miffre, 1 For dp ratio see Rozeff (1984), Fama and French (1988), Hodrick (1992), Campbell and Viceira (2002), Campbell and Yogo (2006), Lewellen (2004), Menzly and Veronesi (2004); for the book-to-market (bm) ratio see Kothari and Shanken (1997), Pontiff and Schall (1998); for the earnings-dividend (ed) ratio see Campbell and Shiller (1988, 1998), Lamont (1998); for the price-earnings (pe) ratio see Cole, Helwege, and Laster (1996), Campbell and Shiller (1998), Lander, Orphanides, and Douvogiannis (1997), Pu (2000), Welch and Goyal (2008). 3 Fuertes, & Fernandez-Perez, 2015). Among these studies, the work by Jacobsen, Marshall, and Visaltanachoti (2009), Hu and Xiong (2013) examines the predictive power of commodity returns for aggregate market returns. These studies both rely on univariate regressions of market returns on selected commodity returns, and do not evaluate the overall predictive ability of a collection of commodities. In this paper we consider a wide cross-section of commodities in order to establish the maximum amount of information about future returns that can be extracted from commodity prices. Our study is motivated by the findings of Kelly and Pruitt (2013) that there is a large amount of information about future stock returns hidden in the cross-section of disaggregate bm ratios. The main problem, when faced with the large pool of potential predictor variables, as is the case when considering a large cross-section of commodities, is the issue of dimensionality which means that standard approaches, such as an OLS regression of the market returns on the predictors variables, cannot be used. However, there are various techniques to overcome this problem by reducing the dimension of the predictors. The most well-known technique is the method of principal component analysis (PCA) (e.g., Stock & Watson, 2002; Ludvigson & Ng, 2007). More recently Kelly and Pruitt (2013, 2015) show that a partial-leastsquares (PLS) methodology can be used to extract the predictive information from a large group of potential predictor variables. They argue that because PCA relies on the structure of the covariance between predictors, rather than the covariance between predictors and the forecast target as in PLS, the estimated expected return may not be an optimal predictor.2 In this paper we use the PLS methodology to optimally extract forecasts of market returns from the past commodity returns. We use data on 16 commodities over the period from January 1950 to January 2015. The lagged returns of the commodities are used as potential predictor variables and the three-step process 2 See Kelly and Pruitt (2013, 2015) for details. 4 of the PLS methodology is used to extract an estimate of the expected return for both in-sample and out-of-sample tests. Various horizons of commodity returns are considered from one to sixteen months so we can identify the horizon over which the commodity returns show the strongest predictive power. Our results may be summarized as follows. First, we show that information contained in commodities prices can predict market returns at short horizons (e.g., monthly and three-month). The optimal predictor variable is constructed from lagged commodity returns measured over last twelve months. The R̄2 values for the in-sample forecast of monthly and three-month market returns reach 1.79% and 5.66%, respectively. Also out-of-sample forecasts show that the forecast performance of the commodity predictor outperforms the historical mean forecast of market returns. Moreover, the forecast performance holds in subsample tests. Second, the latent factor extracted from commodity returns retains its significance in the presence of other short-term predictors such as momentum, SMB, HML, lagged excess returns and industry returns.3 Our research contributes to the literature in a number of ways. Firstly, we show that the cross-section of commodity prices contain important information about future market returns and that this may be extracted using the PLS technique of Kelly and Pruitt (2013, 2015). The resulting estimate of the expected market return is a strong predictor both in-sample and out-of-sample. Secondly, compared to previous analysis, based on univariate regressions on individual commodity predictors, we are able to establish the overall predictive information from the cross-section of available commodities. Thirdly, the finding that the commodity returns remain significant after accounting for commonly used predictors indicates that the information we identify in commodity prices is novel and provides new evidence that market returns are predictable over the short horizon. The source of the predictability, whether it could be due to variation in risk or 3 Hong, Torous, and Valkanov (2007) show that various industry returns lead stock market returns. 5 mispricing, is difficult to analyze and we leave this question to future work.4 The rest of the paper is organized as below, section 2 introduces the PLS method along with the univariate predictive regression which has been widely used in conventional studies; section 3 describes the data; section 4 shows empirical findings and the results of robustness check; section 5 concludes. 2 Methodology In this section, we introduce the PLS methodology and univariate predictive regression. 2.1 The univariate predictive regression We firstly describe the univariate predictive regression conventionally used for testing the predictive power of commodity returns: e rt+M = β0 + βi ri,t−H:t + ωt+1 . (1) e is stock market returns in excess of the risk-free rate at a horizon Where rt+M of M months; ri,t−H:t is the commodity returns measured over last H periods; e βi reflects how rt+M varies with one-unit change in ri,t−H:t .5 The measure-of-fit (R̄2 ) indicates the amount of expected market returns that is predicted by each ri,t−H:t . 4 Various explanations of short term predictability have been proposed including the gradualdiffusion-hypothesis (Hong, Torous, & Valkanov, 2007). They show that information from industry returns can pass slowly to the aggregate stock market. Hong, Torous, and Valkanov (2007) provide empirical findings confirming the hypothesis that some industries show significant performance in forecasting one-month- and two-month-ahead stock market returns. One explanation for the hypothesis is that investors have to spend some time in reactions as the limited capacity for understanding all the available information (e.g., Shiller, 2000; Sims, 2001). 5 It appears that current literature about the sign of βi lacks explicit interpretations. However, empirical evidence suggests that the commodity market may be negatively related to the stock market. Hong, Torous, and Valkanov (2007) show that industry returns associated with industrial minerals returns are negatively related to market returns. Jacobsen, Marshall, and Visaltanachoti (2009) observe negative predictive relationship between commodity returns that are measured over various horizons and the stock market returns. 6 2.2 The PLS methodology Kelly and Pruitt (2013) apply the PLS methodology to extract a unique predictive factor from portfolio-level bm ratios. This method can be implemented as a threestep OLS regression-based procedure. In their study, the potential predictor variables are a set of disaggregate bm ratios and they assume that these variables are differentially exposed to the state variable driving variations in the expected market returns. In our research, the potential predictors are the past returns of various commodities and we assume that fluctuations in commodity returns are also driven by the same state variable. The PLS technique, which consists of three steps of OLS regressions, is able to isolate the component related to the expected returns, whilst filtering out factors related to cash-flow shocks. The first step of PLS procedure is to run a time-series regression of disaggregate commodity returns on excess market returns e ri,t−H:t = αi + βi rm,t+1:t+M + εi,t−H:t . (2) e Where αi is the intercept of regression; βi is the loading on rm,t+1:t+M . β̂i de- scribes the amount of variations in each ri,t−H:t with respect to one-unit change e . This regression is the empirical implementation of the folin expected rm,t+1:t+M e ] + εi,t−H:t , which measures the lowing relationship: ri,t−H:t = αi + βi Et [rm,t+1:t+M exposure of each of the potential predictor variables, ri,t−H:t , to expected market e returns Et [rm,t+1:t+M ]. The empirical implementation substitutes the expected returns with an unbiased proxy, the realized market returns. Using β̂i from the first-step regression we run the second-step cross-sectional regression at time t ri,t−H:t = φi + Ft β̂i + i,t−H:t . (3) where Ft is the estimated cross-sectional regression coefficient. Since β̂i approximately captures how each potential predictor variable, ri,t−H:t , is affected e by variations in Et [rm,t+1:t+M ], the cross-sectional regression of ri,t−H:t provides 7 e estimation of Ft which asymptotically equals the latent factor Et [rm,t+1:t+M ].6 The cross-sectional regression is repeated for all t to build up a time series of the expected returns, or the composite predictor variable, Ft . e By running the first and second steps of regressions, Et [rm,t+1:t+M ] represents the predictive factor exploited from ri,t−H:t and is a linear combination of disaggregate commodity returns. The weight of each ri,t−H:t shows contribution of each ri,t−H:t to the forecast.7 Finally the composite predictor variable, F̂t , is tested in a predictive regression e on F̂t of rm,t+1:t+M e rm,t+1:t+M = c + γt+M F̂t + ξt+1:t+M . (4) e Where γ̂ captures how rm,t+1:t+M varies with one-unit change in F̂t and γ̂t+M is asymptotically normally distributed 8 The measure-of-fit of equation (4) indicates the amount of expected rm,t+1:t+M that is predicted by F̂t . 2.2.1 The out-of-sample implementation In our analysis, we perform a recursive out-of-sample forecast procedure to examine whether the forecast from ri,t−H:t is reliable in a real-time manner. The procedure exactly follows the way suggested by Welch and Goyal (2008) in which the forecast of each time point only uses the information prior to the forecast point of date. This ensures that the forecast at each time point avoids look-ahead bias. We describe this procedure as below. We take the example of forecasting the three-month market returns from come modity returns measured over last four months.9 For simplicity, we define rm,τ +3 6 Kelly and Pruitt (2011) provide the matrix algebra in details. We provide the matrix algebra of calculating the weight in appendix. 8 Kelly and Pruitt (2015) provide statistical proofs with underlying assumptions in details. 9 The monthly forecast would not have look-ahead bias as the data is not overlapped. Regressions on overlapping data may suffer estimation bias. Hansen and Hodrick (1980), Hansen (1982) show that there may be serial correlation existed in residuals when the data is overlapping. Boudoukh, Richardson, and Whitelaw (2008), Amihud, Hurvich, and Wang (2009) show that when forecasting long-horizon market returns, which have overlapping observations, using 7 8 as three-month excess return measured over the period from τ + 1 to τ + 3 and ri,τ as commodity returns over the period from τ − 3 to τ . The full sample having observations {1, 2, 3, ..., T } is split at ttr and the training sample has observations {1, 2, 3..., ttr }. In the first step regression, the market returns used as independent e e e variable are observations {rm,7 , rm,8 , ..., rm,t }, and the commodity returns used tr as dependent variable are observations {ri,4 , ri,5 , ..., ri,ttr−3 }. In the second step, we run regressions of the cross-sectional commodity returns, which are observations {ri,4 , ri,5 , ..., ri,ttr−3 }, on β̂i estimated from the first-step regression. The estimated coefficient F̂ttr−3 corresponds to predictive information extracted up to the time point {4, 5, ..., ttr−3 }. Finally, we run a time-series regression of market e e e } on F̂ttr−3 . The forecast of rm,ttr+3 , ..., rm,t , rm,8 returns using observations {rm,7 tr equals sum of the constant and the product of last observation of F̂ttr−3 and the predictive loading estimated in the last step regression. We repeat the whole procedure by expanding the training sample period-by-period until ttr = T − 3. We compare the forecast with forecast equals historical mean of the market returns. The error of our forecast (errunrestr ) equals the difference between realized market returns and forecast from commodity returns, and the error of historical mean model (errrestr ) equals the difference between realized market returns and historical mean. The pseudo R2 is calculated as pseudo R2 = 1 − M SEunrestr . M SErestr Where M SEunrestr = err2 unrestr M SErestr = err2 restr . persistent predictors, the significance of predictive coefficients may be over-estimated and the R̄2 of predictive model may increase with the forecast horizon. We follow Kelly and Pruitt (2013) and report Newey-West corrected t-statistics to account for the serial correlation that may exist in residuals caused by overlapping data. 9 3 Data 〈Insert table 1 here〉 We obtain the daily spot prices of commodities from the Global Financial Data (GFD).10 We aim to form the dataset with the longest time series and have to omit commodities, such as energy products, which mostly become available after 1986, that start late. There are 16 commodities, which may be categorized into four industries: agriculture, industrial metals, energy and precious metals, included in the dataset. Table 1 presents the name and start date of each commodity along with the corresponding industry. We measure commodity price change over last H month (ri,t−H:t ) as the log-price difference between the last trading day in month t and the last trading day in month t − H + 1. We let the sample start from 1950 and form the dataset without missing values. Therefore, our sample period is January 1950-March 2015. We split the entire sample to investigate whether the stock market return predictability varies with sample period. The first subsample is from January 1950 to December 1984 and the second subsample is from January 1985 to March 2015. We use market returns available from the Kenneth R. French Data library. The stock market return equals the value-weighted market return based on all CRSP firms which become corporations in the US and are traded in NYSE, AMEX or NASDAQ, and the risk-free rate is one-month treasury bill rate which is from Ibbotson Associates.11 The three-month, six-month and twelve-month returns are computed from monthly data. Apart from the commodity returns, we use the predictors investigated in Welch and Goyal (2008) as benchmarks and the data file which ends in December 2013 is from Amit Goyal’s personal website. These alternative predictors include the price-dividend ratio (pd), price-to-earning ratio (pe), book-to-market ratio 10 By comparing GFD and Datastream (DS), we find that the commodity data included in GFD generally have longer series than the data in DS. 11 Please see Keneth R. French Data library for details. 10 (bm), the t-bill rate (tbl), the net equity expansion ratio (ntis), the long-term yield (lty), the inflation rate (inf l) the long-term rate of return (ltr) and stock variance (svar).12 In addition, we perform a robustness check with control variables that are frequently used as risk factors in multivariate asset pricing models, such as the momentum factor, SMB, HML, lagged excess returns, and industry returns that are identified to be significant predictors of market returns (Hong, Torous, & Valkanov, 2007). The data of risk factors and lagged values of excess returns is also from Kenneth R. French Data library. Following Hong, Torous, and Valkanov (2007), the data of industry returns is Fama-French 38 value-weighted industry returns and the data of real estate industry is returns on REIT index from NAREIT website. Apart from the data of real estate industry, which starts from January 1972, the data of other control variables all starts from January 1950 and ends in December 2014. We further test the predictability of industry returns from commodity returns. For this test, the industry returns are those given by the Fama-French 49 industry portfolios. We follow the definition of industry classification and use returns of agriculture industry, mines (regarded as the industrial metals industry), gold (regarded as the precious metals industry), and oil (regarded as energy industry) in analysis. 4 Empirical findings 4.1 Forecasting monthly excess returns from commodity returns 〈Insert table 2 here〉 The findings of Jacobsen, Marshall, and Visaltanachoti (2009) show that 12 We exclude the cross-section premium (csp) from the data file as it is no longer available from January 2003. 11 the predictive power of commodity returns may vary according to the horizon over which the commodity return is measured. Consequently we apply the PLS method separately to commodity returns measured at various horizons. We let the horizon be one, two, four, and up to sixteen months (ri,t−1:t , ri,t−2:t , ri,t−4:t ,..., ri,t−16:t ). Table 2 shows the prediction results for equation (4). The in-sample test provides estimation of predictive loading on the latent factor (γ̂t+M ), the measure-of-fit (R̄2 ) of the third-step regression, the t-statistic (t-statKP ) suggested by Kelly and Pruitt (2013, 2015) and Newey-West t-statistic (t-statN W ).13 The pseudo R2 of the out-of-sample forecast (Welch & Goyal, 2008), pseudo R2 = 1 − M SEunrestricted M SErestricted Where M SErestricted is the mean-squared-error (MSE) of the forecast which equals the average value of historical excess returns; M SEunrestricted is the MSE of the forecast from predictive model. A positive pseudo R2 means the forecast from the predictive model is more accurate than the forecast equals historical mean of excess returns. Since the value of pseudo R2 is sensitive to the size of training sample (e.g., Brennan & Xia, 2005), we report pseudo R2 s generated from two training samples: January 1950-December 1984 and January 1950-December 1994. Also, we perform the ENC-NEW test (e.g., Clark & MacCracken, 2001) under the null hypothesis that the historical mean of forecast target encompasses the predictive information carried by commodity returns. In panel A of table 2, we report the results estimated from the entire sample which covers a period over 65 years. Columns 2-10 correspond to regressions using ri,t−1:t , ri,t−2:t ,..., ri,t−16:t as predictors, respectively. Throughout these results, all predictors have t-statKP and t-statN W values consistently greater than 2.58, 13 Kelly and Pruitt (2013, 2015) provide statistical evidence that the predictive loading of the third-step regression is asymptotically normally distributed when the number of observations and predictors are large. Please see their paper for details. As the forecasts from commodity returns at horizons over two months consist of overlapping data, we use t-statistics with NeweyWest corrections to overcome heteroskedasticity and serial correlations existed in residuals. 12 suggesting that the predictors are significantly different from zero at the 1% significance level, excluding ri,t−1:t and ri,t−2:t of which values of t-statN W are slightly smaller than 2.58. We notice that the value of R̄2 generally follows a hump shape, starting from 0.86% (using ri,t−1:t as predictor), rising to the peak value 1.98% (using ri,t−14:t as predictor) and falling to 1.51% (using ri,t−16:t as predictor). The value of estimated predictive loading (γ̂t+M ), also shows the same trend that it reaches the peak point 2.10 (in percentage) when using ri,t−14:t as predictor, indicating the greatest impact of the predictor. However, in the last two rows of panel A, the out-of-sample forecast results provide little evidence that the commodity returns outperform the historical mean of market returns. Only the forecast from ri,t−12:t performs well, regardless of the choice of training sample. When the training sample period is January 1950December 1984, the pseudo R2 of forecast from rt−12:t is 0.52% with ENC-NEW statistic significantly rejects the null hypothesis at the 10% significance level; when the training sample expands to December 1994, the pseudo R2 increases to 1.74%, which is higher than values generated by other predictors, and the ENCNEW statistic significantly rejects the null hypothesis at the 1% level. Although ri,t−14:t shows the strongest predictive power in-sample, it fails to show convincing evidence that the out-of-sample forecast outperforms the forecast which equals average value of historical market returns with pseudo R2 s of -0.93% and 1.28% (the ENC-NEW statistic significantly rejects the null hypothesis at the 1% level) corresponding to the two training samples. In addition, we examine whether the significant predictive performance of commodity returns still holds in subsamples. The entire sample is split evenly into two parts: January 1950-December 1984 and January 1985-March 2015. Panels B and C report the results. Indeed, the prediction results from subsample tests show that all commodity returns are significant predictors with values of t-statKP and t-statN W that are at least greater than 1.65 (significant at the 10% level), and the value of R̄2 is somewhat higher than the value identified in full 13 sample test. The R̄2 ranges from 1.34% (using ri,t−1:t as predictor) to 2.59% (using ri,t−14:t as predictor) and from 2.44% (using ri,t−1:t as predictor) to 5.46% (using ri,t−2:t as predictor) when testing on the first and second subsamples, respectively. 4.2 Forecasting the stock market returns at three-month horizon 〈Insert table 3 here〉 Given that ri,t−12:t appears to be the most appropriate predictor of monthly market returns, in this section, we propose examining whether the ability of the strongest predictor still holds in three-month forecast. To achieve this, we regress three-month market returns on the same predictors studied in section 4.1 in-sample and out-of-sample, and compare the measure-of-fit of regressions. Table 3 presents the results. The results show that ri,t−12:t has the greatest ability in forecasting three-month market returns in-sample and out-of-sample. We begin with the forecast on entire sample (panel A), the value of R̄2 keeps the hump shape and ranges from 0.56% (using ri,t−1:t as predictor), peaks at 5.66% (using ri,t−12:t as predictor) and falls to 4.74% (using ri,t−16:t as predictor). As for the significance of predictors, only the coefficients estimated from regressions on ri,t−1:t and ri,t−2:t have values of t-statKP and t-statN W that are greater than 1.96 (significant at the 5% level), the coefficients on other commodity returns all have values of t-statKP and t-statN W that are greater than 2.58 (significant at the 1% level). Compared with the out-of-sample performance in monthly forecast, the performance in three-month forecast slightly improves. The pseudo R2 s are consistently positive with forecasts starting from January 1985 and January 1995 when using ri,t−10:t , ri,t−12:t and ri,t−14:t as predictors. Moreover, the ENC-NEW statistics are primarily significant at the 1% level, suggesting that the forecast performance from these predictors are significantly better than the performance of historical 14 mean. The findings in subsample tests (panels B and C) are similar with findings in monthly forecast: commodity returns measured over various horizons all predict market returns significantly and ri,t−12:t forecasts three-month market returns substantially with R̄2 s values of 6.63% and 11.01% in the first and second subsample tests. 4.3 Forecasting market returns at six-month and twelvemonth horizons We examine whether the predictive power continues to hold over longer horizons of six and twelve months in tables 4 and 5. 〈Insert table 4 here〉 Table 4 shows the results of forecasting six-month market returns. The findings in entire sample forecast (panel A) are somewhat similar with findings in monthly and three-month forecasts that, the commodity returns all show significant predictive impact, the value of R̄2 remains the hump shape and peaks at 10.02% when using ri,t−10:t as predictor; the out-of-sample forecasts from ri,t−6:t , ri,t−8:t , ri,t−10:t generate positive pseudo R2 s with forecasts starting from January 1985 and January 1995, and the ENC-NEW statistics are significant at 1% level. Interestingly, ri,t−12:t , the most appropriate predictor in monthly and three-month forecast, no longer has the strongest ability in six-month forecast. The measureof-fit (R̄2 ) of the model approximates to 9.69% which is slightly lower than the highest value generated from the model using ri,t−10:t as predictor. Moreover, the results of out-of-sample forecasts indicate that the forecast from ri,t−12:t is no longer reliable in a real-time manner, with inconsistent signs of pseudo R2 s: -1.32% (forecast starting from January 1985) and 0.33% (forecast starting from January 1995). 15 As reported in panels B and C, the significant predictive impact of commodity returns exists in subsample tests, except for ri,t−4:t of which t-statN W has a value of 1.45 that fails to reject the null hypothesis that the coefficient of predictor is zero. 〈Insert table 5 here〉 Table 5 presents results of forecasting twelve-month market returns. In panel A, the in-sample results estimated from the entire sample show that commodity returns measured over various horizons are significant predictors, excluding ri,t−16:t of which t-statN W is 1.49 which fails to reject the null hypothesis that the coefficient of predictor is zero, and that the value of R̄2 follows a clear hump shape, with R̄2 starting from 0.97% (using ri,t−1:t as predictor), peaking at 14.09% (using ri,t−12:t as predictor) and gradually diminishing when commodity returns are measured over 14 months or longer horizons. The steadily diminishing predictive ability is in line with changes in γ̂t+M and t-statistics. For instance, γ̂t+M estimated from regressions on ri,t−12:t has a value of 14.21 (in percentage) which is higher than the values estimated from regressions on other commodity returns, suggesting the greatest predictive impact; and the findings that t-statKP has a value of 9.50 (significant at the 1% level), t-statN W has a value of 1.87 (significant at the 5% level) consistently confirm the significance of the predictor. However, as the horizon of commodity returns extends to 14 months, γ̂t+M slightly reduces to 13.67 (in percentage) with smaller t-statKP (9.05) and t-statN W (1.72); as the horizon extends to 16 months, γ̂t+M reduces to 12.04 (in percentage) with even smaller t-statKP (8.38) and t-statN W (1.49), the inconsistency in t-statistics makes us believe that ri,t−16:t no longer has significant predictive impact on the twelve-month market returns. The out-of-sample forecast shows poor performance of commodity returns that forecasts from commodity returns over various horizons never generate positive pseudo R2 s when the forecast starts from either January 1985 or January 1995. The poor out-of-sample performance stands in contrast to the performance 16 identified in monthly, three-month and six-month forecasts that strong predictors in in-sample forecast always show good performance in out-of-sample forecast. Not surprisingly, the results in subsample tests show that the market returns are predictable by commodity returns, although the significant predictive performance may not exist in the first subsample test (e.g., values of t-statN W estimated from regressions on ri,t−1:t , ri,t−2:t , ri,t−4:t and ri,t−6:t are smaller than 1.65). 〈Insert figure 1 here〉 Given the superior performance of the predictor created from lagged 12-month commodity returns (ri,t−12:t ), we focus on this estimate of the expected return. We plot out the weight of each commodity i in the composite predictor in Figure 1.14 The figure clearly provides us clues about the contribution from each commodity returns to the forecast. In monthly forecast, apart from returns of aluminum, high grade copper, live hog and zinc special high grade, returns of other commodities all contribute to the predictability. Specifically, returns of soybean oil, Brazil Santos Arabicas, Chicago yellow corn, tin, sugar, soybeans, West Texas intermediate oil, Wheat and silver make great contributions. In three-month forecast, in addition to the commodity returns that have little to contribute to predictability, returns of soybean meal, gold and gold bullion (New York) also make small contributions. We note that, among the 16 commodities, returns of soybean oil, Chicago yellow corn and West Texas intermediate oil always contribute substantially to monthly and three-month forecasts. 4.4 Forecasts from other predictors In this section, we start with investigating predictive ability of individual commodity returns, and then examine predictive ability of the benchmark: predictor variables studied in Welch and Goyal (2008). 14 We show the matrix logarithm of calculating the coefficient B in Appendix. 17 4.4.1 Univariate regressions on individual commodity returns 〈Insert table 6 here〉 Table 6 presents forecast results from each ri,t−12:t . We report in-sample results including the estimated predictive loadings (β̂i ), t-statistics with Newey-West corrections in brackets along with the measure-of-fit (R̄2 ) of the predictive model, and the out-of-sample pseudo R2 of which the training sample is January 1950December 1984. We also perform the ENC-NEW test (e.g., Clark & MacCracken, 2001) to examine the significance of out-of-sample performance. Panels A and B report forecast results of monthly and three-month market returns, respectively. We begin with the monthly forecast (panel A). Combining the in-sample and out-of-sample forecast results together, when forecasting monthly market returns, only returns of soybean oil and West Texas intermediate oil show consistently significant predictive performance. The returns of a few commodities show significant in-sample predictive impact, but fail to outperform historical mean of market returns in out-of-sample forecast. For instance, returns of Chicago yellow corn, soybeans, have t-statistics of -3.28 (significant at 1% level) and -2.12 (significant at 5% level), however, the out-of-sample forecasts fail to outperform forecasts from historical mean significantly, with pseudo R2 s of -0.22% and 0.17%, respectively. The results in panel B show that returns of some commodities have stronger ability in forecasting three-month market returns. Commodity returns, such as soybean oil, Chicago yellow corn, soybeans, West Texas intermediate oil, exhibit significant forecast performance in-sample and out-of-sample. This finding is largely consistent with findings in figure 1, potentially suggesting that the PLS procedure efficiently extracts the relevant information from a cross-section of potential predictors. 4.4.2 A comparison with the benchmark 〈Insert table 7 here〉 18 Table 7 shows forecast results of benchmark predictors. Among the predictors that have been examined in Welch and Goyal (2008), only the long-term yield (lty) shows significant performance in forecasting three-month market returns insample and out-of-sample for which the first forecast starts from January 1985.15 Other predictors fail to show consistently significant performance in-sample and out-of-sample. 4.5 Robustness checks 〈Insert tables 8 here〉 The information about future market returns extracted from commodities may be correlated with existing predictor variables. To address this concern, we conduct a robustness check by adding control variables into the predictive regression. We let the well-known risk factors, the momentum factor and FamaFrench three factors, along with the industry returns that are identified to be significant predictors of monthly market returns (Hong, Torous, & Valkanov, 2007) be control variables. All factors, excluding the market returns and industry returns that are lagged values, are contemporaneous. Table 8 presents the results. For consistency, we test on the entire sample along with the two subsamples.16 Panels A and B report results from regressing on monthly and three-month market returns, respectively. The t-statistics with Newey-West corrections (Newey & West, 1987) are reported in brackets. We focus on the composite predictor created from lagged 12-month commodity returns (ri,t−12:t ). With findings in panels A and B, we find statistically strong evidence that the significant predictive impact of the factor (Fcommodity,t−1 ) extracted from ri,t−12:t always exists in tests on entire sample, the first and second subsamples, controlling 15 This finding is consistent with the monthly forecast results reported in Kelly and Pruitt (2013). 16 Since the commercial real estate returns are available from January 1972, we have to include the return (rRLEST,t−1 ) in the second subsample test. 19 for momentum factor, Fama-French three factors and industry returns. We notice that the industry returns provide no significant predictive power. This finding is not against our expectation as these industry returns are inherently correlated with fundamentals of stock market which the size and book-to-market stocks are proxying for (e.g., Hong, Torous, & Valkanov, 2007; Fama & French, 1995). 4.6 Forecasting industry returns from commodities 〈Insert table 9 here〉 As an extension of our research we examine whether commodities can predict industry returns, and report the results in table 9. As we show in table 1, the commodities are related to agriculture, industrial metals, energy and precious metals, returns of these industries may be predictable from commodity returns, ri,t−12:t . Therefore, we conduct regressions of monthly returns of related industry on the composite predictor extracted from ri,t−12:t . In in-sample tests, all industry returns are significantly predictable by the composite predictor with values of t-statKP and t-statN W that are consistently greater than 2.58 (significant at the 1% level), except that the values of t-statKP and t-statN W of the regression on agriculture industry returns are 2.35 and 2.43 (both significant at the 5% level). Specifically, the predictor exhibits the strongest power in forecasting industry returns of industrial metals with an R̄2 2.35% that is higher than R̄2 s generated from other forecasts, and the magnitude of estimated coefficient γ̂t+M , which is also the greatest, reaches 2.48 (in percentage). Indeed, the strong and significant predictive impact of ri,t−12:t is only robust for outof-sample forecast of industrial metal industry. The pseudo R2 s with forecasts starting from January 1985 and January 1995 are 0.39% and 1.15% with ENCNEW statistics significant at the 5% and 1% levels, respectively. Findings in forecasting other industry returns lack convincing evidence that the forecasts are reliable in a real-time manner. For example, the forecast of agriculture industry returns generates an extremely low pseudo R2 0.04% with ENC-NEW statistic 20 significant at the 10% level, whereas the pseudo R2 obtained from a smaller training sample (January 1950-December 1984) becomes negative (-1.30%). This inconsistency makes us believe that the positive pseudo R2 is insufficient for reliably forecasting agriculture industry returns. The situation is even worse in forecasting returns of energy and precious metals industries, the pseudo R2 s are never positive for any training sample, suggesting greater errors of forecasts from the composite predictor extracted from ri,t−12:t than forecasts from average value of historical industry returns. 5 Conclusion In this paper, we propose evaluating predictive power of disaggregate commodity returns that are measured over various horizons. By assuming that variations in commodity returns are driven by the state variable that determines expected market returns, we employ the PLS methodology (Kelly & Pruitt, 2013) to extract the latent factor from a cross-section of commodity returns. The main findings may be summarized as following: i) the amount of predictive information carried by commodity returns may be different, depending on the horizon over which the commodity returns are measured; ii) the predictabilities of monthly and three-month stock market returns are significantly improved by using commodity returns measured over a horizon of twelve months; iii) significant predictive impact of commodity returns measured over last twelve months holds in subsample tests; iv) in the forecasts of monthly and three-month market returns, returns of soybeans, Chicago yellow corn and West Texas intermediate oil typically make substantial contributions; v) the strong predictive ability of commodity returns measured over last twelve months appears to exist in monthly and three-month forecasts, rather than six-month and twelve-month forecasts; vi) the returns of industrial metals portfolio are significantly predictable by commodity returns measured over a horizon of twelve months. 21 To confirm that the success we have achieved is not by chance we examine the performance of out-of-sample forecasts by reporting pseudo R2 s obtained at different forecast starting points to lessen the possibility of accidentally reporting positive pseudo R2 . Finally, we perform a robustness check with control variables and find that our predictor is not proxying for any determinants of expected market returns captured by common predictors from the literature. 22 References Amihud, Y. (1994). Exchange rates and the valuation of equity shares. In Y. Amihud & R. Levich (Eds.), Exchange rates and corporate performance (p. 49-50). Irwin. Amihud, Y., Hurvich, C. M., & Wang, Y. (2009). Multiple-predictor regressions: hypothesis testing. Review of Financial Studies, 22 (1), 410 - 434. Asness, C. S. (2003). Fight the fed model. Journal of Portfolio Management, 30 (1), 11 - 24. Baker, S. D., & Routledge, B. R. (2015). The price of oil risk. Working paper. Bakshi, G., Gao, X., & Rossi, A. (2014). A better specified asset pricing model to explain the cross-section and time-series of commodity returns? Working paper. Bartov, E., & Bohnar, G. (1994). Firm valuations, earnings expectations and the exchange rate exposure effect. Journal of Finance, 49 , 1755 - 1785. Boudoukh, J., Richardson, M., & Whitelaw, R. F. (2008). The myth of longhorizon predictability. Review of Financial Studies, 21 (4), 1577 - 1605. Brennan, M. J., & Xia, Y. (2005). tay’s as good as cay. Finance Research Letters, 2 , 1 - 14. Brooks, C., Fernandez-Perez, A., Miffre, J., & Nneji, O. (2014). Commodity risk factors and the cross-section of equity returns. ICMA Centre. Campbell, J. Y. (1987). Stock returns and the term structure. Journal of Financial Economics, 18 (2), 373 - 399. Campbell, J. Y., & Shiller, R. J. (1988). Stock prices, earnings, and expected dividends. Journal of Finance, 43 (3), 661 - 676. Campbell, J. Y., & Shiller, R. J. (1998). Valuation ratios and the long-run stock market outlook. Journal of Portfolio Managements, 24 (2), 11 - 26. Campbell, J. Y., & Viceira, L. M. (2002). Strategic asset allocation: portfolio choice for long-term investors. Oxford: Oxford University Press. Campbell, J. Y., & Yogo, M. (2006). Efficient tests of stock return predictability. 23 Journal of Financial Economics, 81 (1), 27 - 60. Chen, N., Roll, R., & Ross, S. A. (1986). Economic forces and the stock market. Journal of Business, 59 , 383 - 403. Clark, T. E., & MacCracken, M. W. (2001). Tests of equal forecast accuracy and encompassing for nested models. Journal of Econometrics, 105 , 85 - 110. Cochrane, J. H. (2005). Asset pricing (Rev ed.). Princeton, NJ: Princeton University Press. Cole, K., Helwege, J., & Laster, D. (1996). Stock market valuation indicator: is this time different? Financial analysts Journal, 52 (3), 56 - 64. Fama, E. F., & French, K. R. (1988). Dividend yields and expected stock returns. Journal of Financial Economics, 22 (1), 3 - 25. Fama, E. F., & French, K. R. (1989). Business conditions and expected returns on stock and bonds. Journal of Financial Economics, 25 (1), 23 - 49. Fama, E. F., & French, K. R. (1995). Size and book-to-market factors in earnings and returns. Journal of Financie, 50 , 131 - 155. Hamilton, J. (1983). Oil and the macroeconomy since world war II. Journal of Political Economy, 91 , 228 - 248. Hansen, L. (1982). Large sample properties of generalized method of moments estimators. Economitrica, 1029-1054. Hansen, L., & Hodrick, R. (1980). Forward exchange rates as optimal predictors of future spot rates. Journal of Political Economy, 829-853. Hodrick, R. J. (1992). Dividend yields and expected stock returns: alternative procedures for inference and measurement. Review of Financial Studies, 5 (3), 257 - 286. Hong, H., Torous, W., & Valkanov, R. (2007). Do industries lead stock markets? Journal of Financial Economics, 83 , 367 - 396. Hong, H., & Yogo, M. (2012). What does futures market interest tell us abou tthe macroeconomy and asset prices? Journal of Financial Economics, 105 , 473-490. 24 Hu, C., & Xiong, W. (2013). The informational role of commodity futures prices. Working paper. Retrieved from http://www8.gsb.columbia.edu/ rtfiles/finance/misc/Information Commodity4 c.pdf Jacobsen, B., Marshall, B., & Visaltanachoti, N. (2009, May). Return predictability revisited. FMA Asian Conference. Jegadeesh, N. (1991). Seasonality in stock price mean reversion: evidence from the u.s. and u.k. Journal of Finance, 46 , 1427 - 1444. Jones, C. M., & Kaul, G. (1996). Oil and the stock markets. Journal of Finance, 51 , 463 - 491. Kelly, B., & Pruitt, S. (2011). Market expectations in the cross-section of present values. Working Paper. Kelly, B., & Pruitt, S. (2013). Market expectations in the cross-section of present values. Journal of Finance, 68 (5), 1721 - 1756. Kelly, B., & Pruitt, S. (2015). The three-pass regression filter: A new approach to forecasting using many predictors. Journal of Econometrics, 186 , 294 316. Kothari, S., & Shanken, J. (1997). Book-to-market, dividend yield, and expected market returns: a time-series analysis. Journal of Financial Economicss, 44 (2), 169 - 203. Lamont, O. (1998). Earnings and expected returns. Journal of Finance, 53 (5), 1563 - 1587. Lander, J., Orphanides, A., & Douvogiannis, M. (1997). Earnings forecasts and the predictability of stock returns: evidence from trading the s&p. Journal of Portfolio Management, 23 (4), 24 - 35. Lettau, M., & Ludvigson, S. (2001). Consumption, aggregate wealth and expected stock returns. Journal of Finance, 56 (3), 815 - 849. Lewellen, J. (2004). Predicting returns with financial ratios. Journal of Financial Economicss, 74 (2), 209 - 235. Ludvigson, S. C., & Ng, S. (2007). The empirical risk-return relation: a factor 25 analysis approach. Journal of Financial Economics, 83 , 171-222. Menzly, L. T., & Veronesi, P. (2004). Understanding predictability. Journal of Political Economy, 112 (1), 1 - 47. Miffre, J., Fuertes, A.-M., & Fernandez-Perez, A. (2015). Commodity risk factors and intertemporal asset pricing? Working paper. Retrieved from http:// papers.ssrn.com/sol3/papers.cfm?abstract id=2432884 Newey, W. K., & West, K. D. (1987). A simple, positive semi-definite, heteroskedasticity and autocorrelation consistent covariance matrix. Econometrica, 55 , 703 - 708. Obben, J., Pech, A., & Shakur, S. (2007). Analysis of the relationship between the share market performance and exchange rates in new zealand: a cointegrating var approach. Working paper. Pontiff, J., & Schall, L. D. (1998). Book-to-market ratios as predictors of market returns. Journal of Financial Economicss, 49 (2), 141 - 160. Pu, S. (2000). The p/e ratio and stock market performance. Economic Review of the Federal Reserve Bank of Kansas City, 4th Quarter, 23 - 36. Rozeff, M. S. (1984). Dividend yields are equity risk premiums. Journal of Portfolio Management, 11 (1), 68 - 75. Shiller, R. J. (2000). Irrational exuberance. New York: Broadway Books. Sims, C. A. (2001). Rational inattention: beyond the linear-quadratic case. American Economic Review, 96 (2), 158 - 163. Stock, J. H., & Watson, M. W. (2002). Forecasting using principal components from a large number of predictors. Journal of the American Statistical Association, 97 , 1167-1179. Welch, I., & Goyal, A. (2008). A comprehensive look at the empirical performance of equity premium prediction. Review of Financial Studies, 21 , 1455 - 1508. 26 Tables Table 1: Monthly forecast from commodity returns row 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 category agriculture industrial metals energy precious metals ticker BO1599D COFSAD C US2D SU1599D SYB TD SYM 4D W USSD IHXD CMALSD CU NYD MZN2MD SN NYD WTC D XAG HD XAU BD XAU D start date 31/12/1948 31/12/1946 02/01/1900 31/12/1948 31/12/1936 01/12/1947 02/01/1900 02/01/1900 31/12/1919 02/01/1900 31/12/1903 31/12/1911 31/12/1874 31/12/1919 08/09/1933 08/09/1933 name Soybean Oil Brazil Santos Arabicas Chicago Yellow Corn Sugar Soybeans Soybean Meal Wheat Live Hog Aluminum High Grade Copper Zinc Special High Grade Tin West Texas Intermediate Oil Silver Gold Spot Price-London PM Fixing Gold Bullion Price-New York Notes: This table lists the names of commodities which include commodities in agricultures, industrial metals, energy, and precious metals. The data is obtained from Global Financial Data. The start date of the commodity varies from each other but all the data ends in March 2015 at daily frequency. 27 Table 2: Monthly forecast from commodity returns ri,t−1:t β̂ (t-statKP ) (t-statN W ) R̄2 2 pseudo R198501 2 pseudo R199501 0.98∗∗ (2.95) (2.53) 0.86 −2.95 −3.53 β̂ (t-statKP ) (t-statN W ) R̄2 1.58∗∗ (2.23) (1.99) 1.34 β̂ (t-statKP ) (t-statN W ) R̄2 2.71∗∗ (2.41) (2.37) 2.44 ri,t−2:t ri,t−4:t ri,t−6:t ri,t−8:t ri,t−10:t ri,t−12:t ri,t−14:t ri,t−16:t Panel A: Entire sample: 195001-201503 1.01∗∗ 1.04∗∗∗ 1.09∗∗∗ 1.80∗∗∗ 2.07∗∗∗ 1.91∗∗∗ 2.10∗∗∗ 1.63∗∗∗ (2.78) (2.84) (2.84) (3.10) (3.93) (3.15) (3.95) (3.66) (2.55) (2.98) (2.86) (3.14) (3.99) (3.25) (3.89) (3.34) 0.89 0.92 0.97 1.67 1.95 1.79 1.98 1.51 ∗ −1.44 −1.27 0.02 −1.93 −1.07 0.52 −0.93 0.07 −0.32 −1.70 −0.74 −0.91 −0.20 1.74∗∗∗ 1.28∗∗∗ 1.12∗∗∗ Panel B: Subsample1 : 195001-198412 ∗∗∗ 2.09 2.03∗∗ 1.22∗∗ 2.19∗∗ 2.43∗∗∗ 2.18∗∗ 2.82∗∗∗ 2.01∗∗∗ (2.82) (2.80) (2.21) (2.91) (3.16) (2.49) (2.82) (2.71) (2.59) (2.34) (1.96) (2.14) (2.85) (2.30) (2.74) (2.75) 1.86 1.79 0.98 1.95 2.20 1.95 2.59 1.78 Panel C: Subsample2 : 198501-201503 ∗∗∗ 5.73 3.81∗∗∗ 4.02∗∗∗ 4.22∗∗∗ 4.42∗∗∗ 4.38∗∗∗ 4.04∗∗∗ 2.77∗∗∗ (4.39) (3.79) (3.27) (3.65) (4.07) (3.43) (3.43) (3.16) (3.60) (3.57) (2.47) (3.23) (3.53) (3.25) (3.07) (2.87) 5.46 3.55 3.75 3.96 4.15 4.12 3.78 2.50 Notes: This table reports results of forecasting monthly excess returns. The dependent variable is the return (in percentage) on CRSP value-weighted index for the US stock market in excess of the one-month Treasury bill rate, the independent variable is the latent factor (F̂t−H:t ) extracted by the PLS procedure (equation (3)) from commodity returns that are measured over horizons from last one to sixteen months (ri,t−1:t , ri,t−2:t ,..., ri,t−16:t ) and columns 2-10 show the results. Panels A-C report the forecast results on the entire sample January 1950-March 2015, the first subsample January 1950-December 1984 and the second subsample January 1985-March 2015, respectively. Each panel includes in-sample results which are the coefficients (β̂) of the latent factor (F̂t−H:t ), the t-statistic of the predictive loading in the third step regression (equation (4)) suggested by Kelly and Pruitt (2013, 2015) (t-statKP ) and t-statistic with Newey-West correction (Newey and West (1987)), and the R̄2 (in percentage). The last two rows in panel A show the pseudo R2 s generated from out-of-sample forecast, with the training sample January 1950-December 1984 and January 1985-March 2015, respectively. The ENC-NEW test (Clark and MacCracken (2001)) is performed with the null hypothesis that the forecast from historical mean model encompasses the forecast from commodity returns. *, **, *** denote significance at 10%, 5% and 1% levels. 28 Table 3: Three-month forecast from commodity returns ri,t−1:t β̂ (t-statKP ) (t-statN W ) R̄2 2 pseudo R198501 2 pseudo R199501 0.69∗∗ (2.06) (2.12) 0.56 −1.43 −1.58 β̂ (t-statKP ) (t-statN W ) R̄2 2.08∗∗ (2.64) (2.36) 1.84 β̂ (t-statKP ) (t-statN W ) R̄2 3.55∗ (2.83) (1.84) 3.28 ri,t−2:t ri,t−4:t ri,t−6:t ri,t−8:t ri,t−10:t ri,t−12:t ri,t−14:t ri,t−16:t Panel A: Entire sample: 195001-201503 1.57∗∗ 2.36∗∗∗ 3.55∗∗∗ 4.64∗∗∗ 5.14∗∗∗ 5.78∗∗∗ 5.07∗∗∗ 4.86∗∗∗ (2.68) (3.95) (4.42) (4.86) (5.30) (6.16) (6.26) (6.22) (2.40) (2.82) (3.10) (3.11) (3.14) (2.99) (3.06) (3.04) 1.45 2.24 3.42 4.51 5.02 5.66 4.95 4.74 ∗∗∗ ∗∗∗ ∗∗ −0.33 −1.57 −2.66 −0.58 2.67 1.44 0.69 −1.16 −1.28 −0.51 0.86∗∗ 1.82∗∗∗ 5.38∗∗∗ 4.83∗∗∗ 1.32∗∗∗ −1.24 Panel B: Subsample1 : 195001-198412 ∗∗ 3.70 3.54∗ 4.32∗ 5.89∗∗ 6.62∗∗ 6.86∗∗ 6.98∗∗∗ 5.98∗∗∗ (3.42) (3.43) (3.53) (4.35) (4.62) (4.49) (5.51) (5.23) (2.30) (1.82) (1.75) (2.06) (2.45) (2.24) (2.64) (2.76) 3.47 3.31 4.09 5.66 6.40 6.63 6.76 5.75 Panel C: Subsample2 : 198501-201501 ∗∗∗ 7.20 7.96∗∗∗ 9.56∗∗∗ 10.17∗∗∗ 10.43∗∗∗ 11.25∗∗∗ 8.45∗∗∗ 7.28∗∗∗ (4.81) (4.75) (4.65) (5.37) (5.29) (5.42) (5.12) (5.10) (4.41) (3.64) (3.17) (3.79) (5.20) (6.20) (5.73) (5.15) 6.95 7.70 9.31 9.92 10.18 11.01 8.19 7.02 Notes: This table reports forecast results of the three-month excess returns. The forecast target is the excess returns (in percentage) at three-month horizon that is transformed from the monthly data, the independent variable is the latent factor (F̂t−H:t ) extracted by the PLS procedure (equation (3)) from commodity returns that are measured over horizons from last one to sixteen months (ri,t−1:t , ri,t−2:t ,..., ri,t−16:t ) and columns 2-10 show the results. Results in panels A-C correspond to forecasts on the entire sample January 1950-March 2015, the first subsample January 1950-December 1984 and the second subsample January 1985-March 2015. In each panel, the reported in-sample test results include the coefficients (β̂) of the latent factor (F̂t−H:t ), the t-statistic of the predictive loading in the third step regression (equation (4)) suggested by Kelly and Pruitt (2013, 2015) (t-statKP ) and t-statistic with Newey-West correction (Newey and West (1987)), and the R̄2 (in percentage). The last two rows in panel A show the pseudo R2 s generated from the out-of-sample forecast, with the training sample January 1950-December 1984 and January 1985-March 2015, respectively. The ENC-NEW test is performed with the null hypothesis that the forecast from historical mean model encompasses the forecast from commodity returns. *, **, *** denote significance at 10%, 5% and 1% levels. 29 Table 4: Six-month forecast from commodity returns ri,t−1:t β̂ (t-statKP ) (t-statN W ) R̄2 2 pseudo R198501 2 pseudo R199501 0.61∗ (1.89) (1.98) 0.48 −2.28 −0.65 β̂ (t-statKP ) (t-statN W ) R̄2 1.22∗ (1.81) (1.71) 0.98 β̂ (t-statKP ) (t-statN W ) R̄2 8.31∗∗∗ (4.23) (2.73) 8.06 ri,t−2:t ri,t−4:t ri,t−6:t ri,t−8:t ri,t−10:t ri,t−12:t ri,t−14:t ri,t−16:t Panel A: Entire sample: 195001-201503 1.68∗∗ 3.90∗∗∗ 6.65∗∗∗ 8.37∗∗∗ 10.14∗∗∗ 9.81∗∗∗ 9.13∗∗ 8.45∗∗ (2.89) (4.02) (5.43) (6.26) (8.46) (8.38) (8.22) (7.83) (2.54) (2.86) (3.06) (3.38) (3.04) (2.73) (2.47) (2.19) 1.56 3.77 6.53 8.25 10.02 9.69 9.01 8.34 ∗∗∗ ∗∗∗ ∗∗∗ −2.84 −1.65 2.29 4.30 2.77 −1.32 −6.85 −10.33 −1.07 −1.15 1.96∗∗∗ 4.07∗∗∗ 4.25∗∗∗ 0.33 −4.57 −8.65 Panel B: Subsample1 : 195001-198412 ∗ 2.36 4.43 6.70 8.99∗∗ 11.05∗∗ 9.39∗∗ 10.93∗∗∗ 11.46∗∗∗ (2.78) (3.29) (4.31) (5.26) (6.19) (6.15) (7.75) (7.73) (1.91) (1.45) (1.64) (2.02) (2.40) (2.11) (2.66) (2.87) 2.12 4.20 6.47 8.77 10.83 9.17 10.71 11.24 Panel C: Subsample2 : 198501-201503 ∗∗∗ 16.01 18.38∗∗ 16.33∗∗ 15.53∗∗ 16.83∗∗∗ 19.26∗∗ 18.92∗∗ 16.34∗∗ (6.74) (6.89) (6.36) (6.53) (7.09) (7.78) (7.62) (7.06) (3.03) (2.46) (2.19) (2.21) (2.73) (2.31) (2.41) (2.10) 15.77 18.15 16.10 15.29 16.60 19.04 18.69 16.10 Notes: This table reports forecast results of the six-month excess returns. The forecast target is the excess returns (in percentage) at six-month horizon that is transformed from the monthly data, the independent variable is the latent factor (F̂t−H:t ) extracted by the PLS procedure (equation (3)) from commodity returns that are measured over horizons from last one to sixteen months (ri,t−1:t , ri,t−2:t ,..., ri,t−16:t ) and columns 2-10 show the results. Panels A-C report forecast results based on three testable samples: the entire sample January 1950-March 2015, the first subsample January 1950-December 1984 and the second subsample January 1985-March 2015. Each panel shows in-sample test results including the coefficients (β̂) of the latent factor (F̂t−H:t ), the t-statistic of the predictive loading in the third step regression (equation (4)) suggested by Kelly and Pruitt (2013, 2015) (t-statKP ) and t-statistic with Newey-West correction (Newey and West (1987)), and the R̄2 (in percentage). In addition, the last two rows in panel A show the pseudo R2 s generated from the out-of-sample forecast, with the training sample January 1950-December 1984 and January 1985-March 2015, respectively. The ENC-NEW test is performed with the null hypothesis that the forecast from historical mean model encompasses the forecast from commodity returns. *, **, *** denote significance at 10%, 5% and 1% levels. 30 Table 5: Twelve-month forecast from commodity returns ri,t−1:t β̂ (t-statKP ) (t-statN W ) R̄2 2 pseudo R198501 2 pseudo R199501 1.10∗∗ (2.36) (2.05) 0.97 −3.62 −2.47 β̂ (t-statKP ) (t-statN W ) R̄2 1.22 (2.00) (1.32) 0.97 β̂ (t-statKP ) (t-statN W ) R̄2 4.78∗ (2.57) (1.82) 4.52 ri,t−2:t ri,t−4:t ri,t−6:t ri,t−8:t ri,t−10:t Panel A: Entire sample: 195001-201503 2.71∗∗ 6.63∗∗ 9.52∗∗ 12.07∗∗ 14.17∗∗ (3.34) (6.84) (7.97) (8.90) (9.65) (2.14) (2.46) (2.33) (2.28) (2.07) 2.58 6.51 9.41 11.95 14.05 −6.31 −13.08 −18.52 −23.53 −27.74 −3.43 −6.49 −10.97 −16.54 −21.99 Panel B: Subsample1 : 195001-198412 2.12 5.53 8.39 10.37∗ 13.58∗∗ (2.18) (3.65) (5.02) (6.16) (7.62) (1.26) (1.18) (1.41) (1.65) (2.11) 1.88 5.30 8.16 10.15 13.37 Panel C: Subsample2 : 198501-201503 ∗∗ 11.35 18.77∗∗ 25.25∗∗∗ 29.85∗∗∗ 31.13∗∗∗ (5.04) (7.38) (9.25) (10.38) (10.49) (2.26) (2.13) (2.60) (3.03) (3.10) 11.10 18.54 25.05 29.66 30.94 ri,t−12:t ri,t−14:t ri,t−16:t 14.21∗ (9.50) (1.87) 14.09 −29.10 −24.14 13.67∗ (9.05) (1.72) 13.56 −34.57 −29.10 12.04 (8.38) (1.49) 11.92 −34.30 −29.78 18.37∗∗ (9.07) (2.52) 18.17 29.91∗∗ (10.48) (2.33) 29.72 23.88∗∗∗ 25.06∗∗∗ (10.84) (11.45) (3.23) (3.30) 23.69 24.87 28.51∗∗ (10.29) (2.19) 28.31 28.10∗∗ (10.17) (2.01) 27.90 Notes: This table reports forecast results of the twelve-month excess returns. The forecast target is the excess returns (in percentage) at twelve-month horizon that is transformed from the monthly data, the independent variable is the latent factor (F̂t−H:t ) extracted by the PLS procedure (equation (3)) from commodity returns that are measured over horizons from last one to sixteen months (ri,t−1:t , ri,t−2:t ,..., ri,t−16:t ) and columns 2-10 show the results. Panels A-C report forecast results based on three testable samples: the entire sample January 1950-March 2015, the first subsample January 1950-December 1984 and the second subsample January 1985March 2015. Each panel includes the in-sample test results, the coefficients (β̂) of the latent factor (F̂t−H:t ), the t-statistic of the predictive loading in the third step regression (equation (4)) suggested by Kelly and Pruitt (2013, 2015) (t-statKP ) and t-statistic with Newey-West correction (Newey and West (1987)), and the R̄2 (in percentage), and the last two rows in panel A show the pseudo R2 s generated from the out-of-sample forecast, with the training sample January 1950-December 1984 and January 1985-March 2015, respectively. The ENCNEW test is performed with the null hypothesis that the forecast from historical mean model encompasses the forecast from commodity returns. *, **, *** denote significance at 10%, 5% and 1% levels. 31 Table 6: Prediction results from individual commodity returns Panel A: monthly forecast β̂ R̄2 pseudo R2 ∗∗ soybean oil −1.28 0.78 1.47∗∗∗ (−2.20) aluminum −0.79 −0.02 −0.29 (−0.84) Brazil Santos Arabicas 0.38 −0.05 −0.43 (0.85) high grade copper −0.55 −0.05 1.63∗∗∗ (−0.73) Chicago yellow corn −1.99∗∗∗ 1.20 −0.22 (−3.28) live hog −0.43 −0.07 −1.02 (−0.60) zinc special high grade 0.33 −0.09 −0.63 (0.40) tin −1.11 0.26 0.20 (−1.51) sugar −0.46 0.09 0.05 (−1.16) soybeans −1.33∗∗ 0.37 0.17 (−2.12) soybean meal −0.52 −0.02 −0.06 (−0.70) West Texas intermediate oil −1.65∗∗ 0.92 0.79∗∗ (−2.33) wheat −1.06 0.23 0.36 (−1.21) silver −0.83 0.16 0.24 (−1.15) gold −0.89 0.04 0.19 (−1.00) gold bullion NY −0.81 0.01 0.23 (−0.92) predictor Panel B: three-month forecast β̂ R̄2 pseudo R2 ∗ −4.14 2.70 2.40∗∗∗ (−1.80) −2.12 0.11 −1.09 (−0.81) 1.44 0.23 −0.98 (1.01) −1.58 0.08 −0.93 (−0.91) −6.18∗∗∗ 3.72 0.69∗∗∗ (−2.89) −1.45 0.07 0.31∗ (−0.68) 1.04 −0.01 −1.76 (0.29) −4.09 1.45 1.34∗∗∗ (−1.58) −1.19 0.31 0.52∗∗ (−0.84) −3.89∗ 1.16 2.10∗∗∗ (−1.74) −1.97 0.31 0.88∗∗ (−0.73) −5.06∗∗ 2.84 3.04∗∗∗ (−2.33) −4.80∗∗ 2.10 3.24∗∗∗ (−2.00) −2.55 0.70 1.50∗∗∗ (−1.08) −3.21 0.52 1.07∗∗∗ (−0.99) −2.86 0.39 0.91∗∗ (−0.90) Notes: This table presents forecast results of monthly and three-month market returns from individual commodity returns measured over last twelve months (ri,t−12:t ). The estimation results are from the univariate regression of market returns on each commodity returns (equation (1)). The details of the commodities are listed in table 1. Panels A and B show results of forecasting monthly and three-month market returns, respectively. In each panel, the in-sample results include the estimated coefficient on the predictor (β̂), the measure-of-fit (R̄2 ) of the mode, and pseudo R2 of the out-of-sample forecast with the training sample January 1950-December 1984. The Newey-West corrected t-statistics are reported in parentheses. The ENC-NEW test is performed to examine significance of the out-of-sample forecast performance under the null hypothesis that the performance of the historical mean model encompasses the performance of predictive model. The sample period is January 1950-March 2015. The three-month excess returns are calculated from monthly data. *, **, *** denote significant at 10%, 5% and 1% levels. 32 Table 7: Prediction results from alternative predictors predictor dp pe bm tbl ntis lty inf l ltr svar Panel A: monthly forecast β̂ R̄2 pseudo R2 −0.00 0.08 −0.67 (−1.11) 0.00 0.12 −0.62 (1.26) 0.01 0.05 −0.39 (1.00) −0.28 0.00 −0.59 (−0.42) 0.02 −0.12 −1.71 (0.17) −0.07 0.05 −0.41 (−1.11) −0.76 0.21 −0.55 (−0.24) −0.02 −0.11 0.05 (−0.33) 0.04 −0.13 −0.50 (0.06) Panel B: three-month forecast β̂ R̄2 pseudo R2 −0.00 0.52 −2.00 (−1.00) 0.00 0.63 −1.87 (1.11) 0.02 0.46 −0.94 (1.00) −1.22 0.77 −1.01 (−1.34) 0.09 −0.09 −5.66 (0.14) −0.22 0.46 −1.38 (−1.41) −2.40 0.87 −1.86 (−1.02) 0.23∗∗∗ 0.51 0.71∗∗∗ (2.33) 0.72 0.01 −2.49 (0.79) Notes: This table reports results of examining predictive power of other alternative predictors including the dividend-price ratio (dp), earning-price ratio (ep), book-to-market ratio (bm), treasury bill rate (tbl), net equity expansion ratio (ntis), long-term yield (lty), inflation rate (inf l), return of the long-term rate (ltr) and stock variance (svar). The results include the insample test results which are the estimated predictive loading (β̂), the Newey-West corrected t-statistic (in parentheses) and R̄2 (in percentage) along with out-of-sample pseudo R2 (in percentage) for which the training sample is January 1950-December 1984. The NEW-ENC test is performed under the null hypothesis that the performance of historical mean model encompasses the performance of the predictive model. *, **, *** denote significance at 10%, 5% and 1% levels, respectively. The data for alternative predictors is obtained from Amit Goyal’s personal website over the period January 1950-February 2014. 33 Table 8: Robustness check with control variables e independent Panel A: dependent variable: rm,t+1 variable entire sample subsample1 subsample2 Fcommodity,t−1 (3.16) (2.25) (2.10) M omt (−2.32) (0.08) (0.34) SM Bt (4.14) (4.88) (−1.43) HM Lt (−3.82) (−1.98) (−0.40) e rm,t−1 (0.66) (0.49) (−1.34) rmines,t−1 (−0.70) (−0.42) (0.41) rstone,t−1 (−1.52) (−0.65) (0.15) rapprl,t−1 (0.14) (−0.38) (1.55) rprint,t−1 (0.33) (0.72) (0.45) rptrlm,t−1 (−1.90) (−1.55) (−0.66) rlethr,t−1 (0.89) (0.25) (−1.34) rmetal,t−1 (−0.71) (−1.28) (2.20) rtrans,t−1 (0.50) (0.61) (−1.68) rtv,t−1 (0.89) (0.59) (1.62) rutils,t−1 (0.12) (−0.06) (3.01) rrtail,t−1 (−1.24) (−0.88) (0.88) rmoney,t−1 (0.81) (1.75) (0.64) rsrv,t−1 (−0.22) (−1.35) (−0.34) rreit,t−1 (−1.55) R̄2 14.25 13.42 2.65 e Panel B: dependent variable: rm,t+3 entire sample subsample1 subsample2 (3.04) (2.04) (1.93) (−1.87) (−0.17) (1.36) (2.43) (2.84) (−0.83) (−3.87) (−1.81) (−0.42) (1.10) (2.34) (−0.91) (1.09) (−0.57) (0.29) (−1.16) (−0.16) (−0.05) (0.21) (0.85) (−0.44) (0.39) (−0.66) (0.23) (−2.06) (−2.32) (−1.50) (0.02) (0.99) (−1.18) (−1.33) (−2.29) (1.32) (0.30) (−0.14) (0.25) (0.50) (−0.65) (0.56) (0.33) (−0.54) (1.14) (−1.85) (−1.74) (0.70) (−0.16) (1.05) (1.02) (0.43) (−0.84) (0.79) (−0.22) 10.00 9.22 5.19 Notes: This table shows prediction results of monthly and three-month market returns from the latent factor (Fcommodity,t−1:t ) extracted from commodity returns measured over last twelve months controlled by the momentum factor (M omt ), SM Bt , HM Lt , lagged values of excess returns and industry returns. The results are estimated from the regression, e e rm,t+h = α0 + β1 Fcommodity,t−1:t + β2 M omt + β3 SM Bt + β4 HM Lt + rm,t−1 + β5 rmines,t−1 + β6 rstone,t−1 + β7 rapprl,t−1 + β8 rprint,t−1 + β9 rptrlm,t−1 + β10 rlethr,t−1 + β11 rmetal,t−1 + β12 rtrans,t−1 + β13 rtv,t−1 + β14 rutils,t−1 + β15 rrtail,t−1 + β16 rmoney,t−1 + β17 rsrv,t−1 + β18 rreit,t−1 The regression is performed on three samples: the entire sample January 1950-December 2014, the first subsample January 1950-December 1984 and the second subsample January 1985-March 2015. The data of industry returns that are identified to be significant predictor in Hong, Torous, and Valkanov (2007) is obtained from Fama-French 38 value-weighted industry portfolios which covers the period from January 1950 to December 2014, and these industry returns include mines, stone, apparel, print, petroleum, leather, metal, transportation, TV, utilities, retail, money and services. The data of commercial real estate is available from January 1972, therefore, we exclude the return when testing on the entire sample and the first subsample. Rows 1-19 report t-statistics with Newey-West corrections of the predictive loadings and the bottom row reports R̄2 . For the test on commodity returns (ri,t−12:t ), we report the t-statistics suggested in Kelly and Pruitt (2013) in brackets. 34 Table 9: Forecasting monthly industry returns from commodity returns β̂ (t-statKP ) (t-statN W ) R̄2 (%) pseudo R2 (%)198501 pseudo R2 (%)199501 agriculture 0.87 (2.35) (2.43) 0.75 −1.30 0.04∗ industrial metals 2.48 (3.94) (3.62) 2.35 0.39∗∗ 1.15∗∗∗ energy 2.16 (3.53) (4.06) 2.03 −1.96 −1.18 precious metals 1.70 (2.75) (4.36) 1.62 −4.20 −4.56 Notes: This table shows results of forecasting monthly industry returns from the compositive predictor extracted from commodity returns measured over last twelve months (ri,t−12:t ). The estimation results are obtained by implementing the PLS procedure. Columns 2-5 report the results of forecasting returns on four industries, agriculture, industrial metals, energy and precious metals, respectively. The results from the in-sample test include the predictive loading (β̂) estimated from equation (4), the t-statistic suggested by Kelly and Pruitt (2013, 2015) and Newey-West t-statistic (Newey and West (1987)) and the measure-of-fit (R̄2 ) of the model. The last two rows show pseudo R2 s of out-of-sample forecasts, with the training sample January 1950-December 1984 and January 1950-December 1994, respectively. *, **, *** denote significant at 10%, 5% and 1% levels, respectively. 35 Figures Predictive loadings on individual commodity returns 2 predictive loadings(%) ● ● 0 ● ● Forecast horizon ● ● ● ● ● 1 month 3 months ● ● −2 ● ● ● ● ● gold bullion NY gold silver wheat WT intermediate oil soybean meal soybeans sugar tin zinc special high grade live hog Chicago yellow corn high grade copper Brazil Santos Arabicas aluminum ● soybean oil −4 commodities Figure 1: This figure plots contribution of each commodity returns to monthly and three-month market return predictability. 36 Appendix We follow Kelly and Pruitt (2011, 2015) and show the matrix algebra of calculating the weight of each predictor as below ŷ = ιȳ + (JT XJN X0 JT y(y0 JT XJN X0 JT XJN X0 JT y)−1 y0 JT XJN X0 JT y) (A.1) Equation A.1 can be rewritten as ŷ = ιȳ + JT XJN B Where ŷ is the expected market return; X is the commodity returns used as predictors with the dimension (T × N ); ι is a T -vector of ones; B is the predictive loading on X which is also the weight of each commodity returns for forming the linear combination. Therefore, B = X0 JT y(y0 JT XJN X0 JT XJN X0 JT y)−1 y0 JT XJN X0 JT y Where JT = IT − T−1 ιT ι0T , IT is T -dimensional identity matrix. 37
© Copyright 2026 Paperzz