Realized skewness at high frequency and link to conditional market premium Zhi Liub,c , Kent Wanga , Junwei Liua a The Wangyanan Institute for Studies in Economics, Xiamen University, China b Department of Mathematics, University of Macau, Macau, China c Corresponding author: Zhi LIU, Email: [email protected] Abstract We propose a reliable new estimator for realized skewness which is robust to microstructure noise at ultra-high frequency level. Asymptotic theory for the new estimator has been de- rived. Simulation example verifies its superior performance. We apply the new estimator to tick data of the S&P 500 index for forecasting equity premium in the U.S. market from 1990-2011 and find that it has significant forecast-ability both in-sample and out-of-sample. We also show that the new skewness measure plus the variance risk premium provides right decomposition for the skewness risk. We thus provide evidence that realized skewness links to conditional market premium. Keywords: , High-frequency, Jump, Noise, Skewness, Stock return prediction 1. Introduction Traditional approaches of asset pricing concentrate mainly on the first and second moment of returns. But there is considerable evidence that the third moment may also be important. Barro (2009) argues that the inclusion of the possibility of rare disasters (extreme negative skewness) in an otherwise reasonably standard representative-agent economy, generates equity premia and risk free rates that accord with observation. The empirical literature for skewness going back to Kraus and Litzenberger (1976), and including more recently Das and Sundaram (1997), Harvey and Siddique (2000), Ang et al. (2006), Chen et al. (2006) and Xing et al. (2010), suggests that the asymmetry of the returns distribution both for individual stocks and for the equity market as a whole is important for asset pricing and investment management. Different skewness measures have been proposed in the literature but it is generally admitted that skewness is hard to measure precisely. Major existing skewness proxies with theoretical foundations include risk neutral skewness measure advocated by Bakshi et al., realized skewness measures used in such as Amaya et al. (2011)(ACJV (2011) hereafter)1 , Neuberger (2012) and skewness premium by combining implied and realized skewness. The difficulty for measuring skewness precisely 1 Note that the skewness measure in ACJV (2011) is standard skewness measure according to skewness definition but uses high frequency data. In the current study, ACJV (2011) is one example we use among many who apply such measure in empirical works. Preprint submitted to 2013 World Statistics Congress April 24, 2013 lies in the fact that people know little about the determinants or sources of skewness and the understanding of the issue comes from limited empirical evidence which are mainly based on hypothesis or even speculations2 . The recent advent of readily-available high-frequency data, however, has spurred a renewed interest into alternative ways for more accurately estimating skewness. It is generally found that measures based on high frequency data can improve the estimation and result in far better empirical performances (see, e.g. ACJV 2011). It is believed that one needs to be able to exploit high frequency data to improve the estimation of skewness if one seeks to understand the dynamics of skewness of period returns Neuberger (2012). However, the realized skewness measures proposed based on high frequency data are mostly just proxies with limited validity due to lack of theoretical foundation and empirical considerations. For example, the realized skewness measure appeared in ACJV (2011), which is in terms of the ratio of the third cumulant and the 1.5th power of the second cumulant, has not considered any of the common factors in the high frequency estimation such as microstructure noise and jumps etc. They neither provide any theoretical proof for the unbiasedness and consistency of the proposed empirical estimator.Neuberger (2012) provides a very interesting ex-post estimator for skewness and we believe it is by far the most theoretically justified measure for realized skewness. But Neuberger (2012) only provides unbiasedness proof for the estimator without the consistency validation. In addition, due to model setting, his measure does not accommodate important factors such as noise in the high frequency data and hence can not be applied to the tick data if needed. This leads us to believe that most of the current empirical measures for ex-post skewness based on high frequency data are in fact rough indicators for the “true” skewness in high frequency. There is great need to improve those estimators based on theoretical and empirical considerations, which is what we propose to do in the current study. Based on the existing realized skewness measure, we derive the asymptotic properties for it and then develop our own measure of realized skewness accounting for microstructure noise effect in the data. Simulation study is also performed to investigate finite sample properties of the new measure and it is found that the proposed new estimator can produce far more accurate estimation for the realized third moment that is quite close to the ideal situation. We then apply the new method to the tick data of the S&P 500 index from 1990-2011 in forecasting near future excess equity market returns. We find that the new estimator for realized skewness based on tick data significantly forecasts one-month-ahead excess equity market returns and such result is obtained controlling for commonly used stock return predictors in the literature including the powerful variance risk premium (VRP). The forecast ability of our skewness measure remains significant at 5% in the out-of-sample prediction and shows considerable economic importance. This sheds new light on the recognized puzzle that skewness measures only have cross-sectional pricing effects while little time series performance has been documented before. The result thus suggests that realized skewness is an effective predictor for stock market returns and 2 However, it is not too difficult to understand at intuitive level that asset return skewness is caused by the jumps in the returns along with the leverage effect. Such point of view can be found from works such as Das and Sundaram (1997) and Neuberger (2012). controlling for noise can realistically reveal such effectiveness. In a further attempt to explore the sources of documented predictability of the realized skewness in the current study, we find the new realized skewness measure subsumes to a certain degree the market momentum effect in the short run. The finding that both our new realized skewness measure and the variance risk premium are significant in predicting aggregate stock market premium indicates that the new measure provides an independent channel for stock return prediction and we supply the literature with a clean decomposition of the skewness risk into jump and leverage effect components. We therefore shed new light on both the time series pricing performance of the skewness and the hypothesis regarding the sources of the skewness risk. We organize the rest of the paper as following. Section 2 provides theoretical discussions for deriving asymptotic properties of the existing high frequency based estimator for skewness and kurtosis as in ACJV (2011) as well as theories for developing our new skewness measure that accounts for microstructure noise; Section 3 provides discussions for the simulation studies; Section 4 presents empirical illustrations for applying the new skewness measures in forecasting excess S&P 500 index returns; Section 5 provides further discussion for the results obtained and Section 6 concludes. We postpone all the necessary theoretical proof and justification for our methodology to the Appendix. 2. Methodology 2.1. The Model and Estimators We define two processes {Xt , t ≥ 0} and {Yt , t ≥ 0}, both are adapted to the filtered probability space (Ω, F, Ft , P ) as follows: Z t Z bs ds + Xt = 0 t Z σs dWs , Yt = 0 t Z bs ds + 0 t σs dWs + 0 X ∆s Y, (1) s≤t where {bt , 0 ≤ t ≤ T } is adapted locally bounded processes, {σt , 0 ≤ t ≤ T } is an deterministic càdlàg volatility process. ∆s Y = Ys − Ys− is the jump of Y at time s, if there is no jumps, it is simple zero. We assume that the jump of Y arrives via a finite activity jump process, which includes commonly used compound Poisson process as a special case. We observe the processes at some discrete time points ti , 0 = t0 ≤ ti ≤ tn = T , that is, we have the increments as ∆Zi = Zti+1 − Zti for either Z = X or Z = Y . Hence, in the following context, we use Z referring to either as X or Y , namely, when Z = X, it has continuous path, Z = Y means there is jumps in the path. 2.2. Microstructure-noise free estimator In this section, we investigate the properties of the estimator proposed by ACJV (2011) under both X and Y . We assume that the log price follows the process Z, and then the increments correspond to the log returns for each firm. ACJV (2011) proposed the realized daily skewness and realized kurtosis as the following: P √ Pn n ni=1 (∆Zi )4 n i=1 (∆Zi )3 RDSkew = , RDKurt = , (2) 3/2 RDV arT2 RDV arT where RDV art defines well-known realized volatility RDV arT = now study the asymptotic properties of the estimators. Pn 2 i=1 (∆Zi ) . We Theorem 1. Z defines a Itô process, and ∆Zi = Zti+1 − Zti . Then 1. When Z = X, we have n X i=1 2 Z T (∆Zi ) →P σt2 dt, n 0 Z n n X X 3 4 (∆Zi ) →D N, n (∆Zi ) →P 3 i=1 i=1 T σt4 dt. (3) 0 Where, N is a random variable with normal distribution, and →D defines the convergence in distribution and →P is the convergence in probability. 2. When Z = Y , we have Z n X (∆Zi )2 →P T σt2 dt + 0 i=1 X (∆t Y )2 , t≤T n n X √ X 3 n (∆Zi ) →P ∞, n (∆Zi )4 →P ∞. i=1 (4) (5) i=1 Remark 2. From the above results, we have that asymptotically, both of RDSkew and RDKurt are not well-defined in the jump-diffusion model (tend to infinity asymptotically). In the diffusion model, the RDSkew is not reasonable since RDSkew →P 0. Considering of jump is accepted both theoretically and empirically, instead of pursuing more sophistical estimator for the diffusion model, we aim to propose some reasonable estimators for jump-diffusion model. To deal with this problem, we propose to use the following new estimators for skewness and kurtosis: Pn Pn 4 3 i=1 (∆Zi ) i=1 (∆Zi ) , J − RDKurt := . (6) J − RDSkew := 3/2 RDV art2 RDV art Here we use J to emphasize the “jumps”. The following theorem specifically describes the asymptotic performance of the estimators proposed above. Theorem 3. Z = Y is defined as in (1), we have P 3 t≤T (∆t Z) J − RDSkew →P R T , P ( 0 σt2 dt + t≤T (∆t Z)2 )3/2 P 4 t≤T (∆t Z) J − RDKurt →P R T P ( 0 σt2 dt + t≤T (∆t Z)2 )2 (7) (8) We set the limits as the measure of skewness and kurtosis under the high frequency situation with the jump-diffusion driven underlying. We aim to estimate this quantity (note that both of them are random variables) using noisy high frequency data. 2.3. Estimation with microstructure noise The above proposed estimators can not typically be used to the financial data with higher frequency, say tick-by-tick, or even one-minute, because high frequency data inevitably is contaminated by microstructure noise, which is usually induced by bid-ask bounce, rounding error, etc. More specifically, we observe the noisy process Mi = Zi + i , rather than the latent process Z, at points ti , 0 ≤ i ≤ n. To deal with the noisy data, existing approaches including two-time scales method proposed by Zhang et al. (2005), preaveraging method studied by Jacod et al. (2009) and further developed by Jacod et al. (2010), and realized kernel method suggested by Barndorff-Nielsen et al. (2008). In this paper, we use the preaveraging approach as described in Jacod et al. (2010). Here we briefly describe the procedure, although it is well-known already. Suppose that the latent process or the log price of a security is Z, but we only can observe M = Z + , where ⊥ Z. As shown in Zhang et al. (2005), if the noise left untreated, will plays dominant role in the estimator. To reduce the effect of microstructure noise, a block with length kn is constructed. For example, in ith block, we use ∆Mi , ∆Mi+1 , · · · , ∆Mi+kn −1 to construct a sequence of preaveraging returns for 1 ≤ i ≤ n − kn : ∆ni,kn M (g) = kn X j=1 k g( n 2 X j j j−1 (g( ) − g( )∆Mi+j , ∆ni,kn M̄ (g) = ))∆Mi+j , kn kn kn j=1 with a non-negative piece-wise R 1 differentiable Lipschitz function g, satisfying g(x) = 0 when x 6∈ (0, 1) and ḡ(p) = 0 g p (x)dx > 0. Consequently, the effect of microstructure noise is significantly reduced to a matched order of the latent increment. Our estimator is constructed basing on preaveraging returns. We collect all the preaveraging returns by the following way: Un (M, p) = n−k Xn (∆ni,kn M (g))p , Ūn (M ) = i=1 n−k Xn ∆ni,kn M̄ (g) (10) i=1 We have the following asymptotics for the above statistic. Theorem 4. Z = Y is defined as in (1), we have X 1 Un (M, p) →P ḡ(p) (∆s Y )p For all p > 2, kn (11) s≤T Z T X 1 1 Un (M, 2) − Ūn (M ) →P ḡ(2)[Y, Y ]T = ḡ(2)( σs2 ds + (∆s Y )2 ).(12) kn 2kn 0 s≤T Inspired by above results, using the Slustky’s theorem, we obtain a consistent estimator of skewness and kurtosis which we set in the earlier section. Theorem 5. Z = Y is defined as in (1), we have P √ 3 kn Un (M, 3) (ḡ(2))3/2 s≤T (∆s Y ) → , (13) R P P T ḡ(3) (Un (M, 2) − Ūn (M )/2)3/2 ( 0 σs2 ds + s≤T (∆s Y )2 )3/2 P 4 (ḡ(2))2 kn Un (M, 4) s≤T (∆s Y ) → . (14) P RT P ḡ(4) (Un (M, 2) − Ūn (M )/2)2 ( σs2 ds + (∆s Y )2 )2 0 s≤T (9) References Amaya, D., Christoffersen, P., Jacobs, K., Vasquez, A., 2011. Does realized skewness and kurtosis predict the cross-section of equity returns? Working paper. Ang, A., Hodrick, R.J., Xing, Y., Zhang, X., 2006. The cross-section of volatility and expected returns. The Journal of Finance 61, 259–299. Bakshi, G., Kapadia, N., Madan, D., . Stock return characteristics, skew laws, and the differential pricing of individual equity options. Review of Financial Studies 16, 101–143. Barndorff-Nielsen, O.E., Hansen, P.R., Lunde, A., Shephard, N., 2008. Designing realised kernels to measure the ex-post variation of equity prices in the presence of noise. Econometrica 76, 1481–1536. Barro, R.J., 2009. Rare disasters, asset prices, and welfare costs. American Economic Review 99, 243–64. Chen, J., Ang, A., Xing, Y., 2006. Downside risk. Review of Financial Studies 19, 1191–1239. Das, S.R., Sundaram, R.K., 1997. Taming the skew: Higher-order moments in modeling asset price processes in finance. NBER working paper No.5967. Harvey, C.R., Siddique, A., 2000. Conditional skewness in asset pricing tests. The Journal of Finance 55, 1263–1295. Jacod, J., Li, Y., Mykland, P.A., Podolskij, M., Vetter, M., 2009. Microstructure noise in the continuous case: the pre-averaging approach. Stochastic Processes and Their Applications 119, 2249–2276. Jacod, J., Podolskij, M., Vetter, M., 2010. Limit theorems for moving averages of discretized processed plus noise. Ann. Statist. 38, 1478–1545. Kraus, A., Litzenberger, R.H., 1976. Skewness preference and the valuation of risk assets. Journal of Finance 31, 1085–1100. Neuberger, A., 2012. Realized skewness. Review of Financial Studies, forthcoming. Xing, Y., Zhang, X., Zhao, R., 2010. What does the individual option volatility smirk tell us about future equity returns. Journal of Financial and Quantitative Analysis 45, 641–662. Zhang, L., Mykland, P., Aı̈t-Sahalia, Y., 2005. A tale of two time scales: Determining integrated volatility with noisy high-frequency data. Journal of the American Statistical Association 100, 1394–1411.
© Copyright 2026 Paperzz