FO R M A T What Goes into Risk-Neutral Volatility? Empirical Estimates of Risk and Subjective Risk Preferences IN A N Y STEPHEN FIGLEWSKI IS TH C E U D O R EP R TO L A G IL LE IS IT FALL 2016 JPM-Figlewski.indd 29 LE volatility input. As an alternative to statistical estimation, market makers and many others rely on implied volatilities extracted from option market prices. Implied volatility (IV) makes a theoretical model consistent with pricing in the market, but one has to specify which model “the market” is using. Black–Scholes IV is nearly always chosen, even though an enormous amount of research has shown that a constant volatility lognormal diffusion for returns is greatly oversimplified.1 Constraining IV to be equal across strike prices, as the BS model requires, fails immediately in the face of the ubiquitous volatility “smile” (or “smirk” or “skew,” depending on its shape). Equity market volatility changes over time; returns are not really lognormal, especially in the tails; and there appear to be nondiffusive jumps in both returns and volatility. Time variation in the returns distribution should matter to investors, beyond the obviously greater difficulty of hitting a moving target, because stochastic volatility is unspanned (meaning it can’t be fully hedged by other securities). Once more complex returns processes are allowed, numerous possibilities arise for unspanned stochastic factors that are not in the BS model but that can affect actual options and may be priced in the market. The resulting risk premia may themselves vary stochastically over time. Implied volatility is a risk-neutral value, in which all risk premia in the market have R TI he Black–Scholes (BS) model showed that option value depends heavily on the volatility of the underlying stock, which is assumed to follow a constant-volatility logarithmic diffusion. With this process, a single parameter σ captures “risk” in all its manifestations. These include (1) instantaneous volatility, which determines delta hedging performance; (2) the standard deviation of the stock price around its mean on option expiration day, which is what implied volatility extracted from pricing model measures; (3) the stock’s probable trading range over the option’s lifetime, which determines whether a speculator can profit from a short-term trade and how much financing risk an arbitrage trade is exposed to; (4) the frequency and size of tail events, which risk managers worry about most; and more. Different types of investors care about some of these risks more than others. For example, a market maker hedging his trading book worries about very short-run f luctuations and price jumps that are of little consequence to a buy-and-hold investor. But under Black–Scholes, they all care about the same volatility parameter. Volatility in the BS model is a known parameter, but real world volatility varies randomly over time and is hard to predict accurately. And no matter how volatility is estimated, market prices invariably differ from model values computed with that A is a professor at the New York University Stern School of Business in New York, NY. [email protected] C T STEPHEN FIGLEWSKI THE JOURNAL OF PORTFOLIO M ANAGEMENT 29 18/10/16 11:13 am IS A R TI C LE IN A N Y FO R M A T address the question in the title of this article: “What Goes Into Risk-Neutral Volatility?” The market bases its volatility forecasts and required risk premia on current and historical data. But these estimates apply to the stock’s behavior over the future lifetime of the options involved. Thus, we are interested in both what information gleaned from historical data goes into the market’s RND and how that RND relates to the volatility actually realized in the future. We have divided the investigation into two parts. This article focuses on what past and current information appears to enter today’s volatility estimates and risk preferences. A second article will consider risk-neutral volatility as a forecast, addressing issues like the following: How accurate is it? How does it compare with other estimators, such as generalized autoregressive conditional heteroscedasticity (GARCH) and the CBOE Volatility Index (VIX)? How does the market’s forecasting horizon compare to the option’s remaining life? In this article, we address several broad questions. Which return- and volatility-related factors are most important to investors in forecasting the empirical probability density embedded in the RND? What factors influence the process of risk neutralization? Do the answers to these questions differ for long maturity versus short maturity options, or in different market conditions? Since risk premia are significant components of total return, there is considerable value in understanding risk neutralization just to be able to estimate how much extra return the market requires for bearing different sorts of risks; or on the flip side of the same question, how much would the market pay to hedge those risks? For example, Bollerslev and Todorov [2011] argued that much of the volatility risk premium during the fall of 2008 was compensation for lefttail “crash risk,” whereas Kozhan et al. [2010] found that about half of the implied volatility “skew” reflects a risk premium for skewness (i.e., the third moment of returns). We use the Breeden and Litzenberger [1978] technology to extract RNDs from daily S&P 500 Index option prices. The primary objective is not to build a formal model of option risk premia, but to establish a set of empirical stylized facts that any such model will need to be consistent with. Two significant problems, which we have addressed in a technical appendix (available online), are how to smooth and interpolate market option prices to limit the effect of pricing noise and how to extend the distribution into the tails beyond the range of traded strike prices. An enormous advantage of the IT IS IL LE G A L TO R EP R O D U C E TH been impounded into a modified probability distribution for returns, known as the risk-neutral density (RND). In an important paper, Harrison and Kreps [1979] proved that in a world free of profitable arbitrage opportunities, there will always be an RND that combines investors’ risk preferences with their objective forecast of the probability density for the stock price at option expiration. The RND is often called the “Q measure,” while the expected true probability distribution is the “P measure.” Implied volatility and other parameters computed under the Q measure will rarely be equal to their real world expected values under the P measure. Violations of the BS assumptions, such as fatter than lognormal tails, cause BS implied volatilities to exhibit the customary but theoretically anomalous smile shape. Researchers on volatility risk, for example, Carr and Wu [2008], have shown that the market places a negative risk premium on volatility risk. The market dislikes volatility, so expected returns are lower and prices are bid up for options and other securities that can hedge volatility risk, because they rise in price with higher volatility. They will be priced (under the Q measure) as if volatility were expected to be higher than investors really think. IVs that differ across strikes are inconsistent with BS model assumptions. But when a cross-section of option prices is available, the full RND can be extracted without specifying a pricing model. Breeden and Litzenberger [1978] showed how this could be done very simply, given prices for options with exercise prices that span a broad range of future stock prices. Option prices are now being used to extract not just BS implied volatilities, but the entire risk-neutral probability density for the price of the underlying on expiration day.2 The RND ref lects the current state of supply and demand for options with the same maturity and different exercise prices. It will impound the market’s beliefs about the true probabilities of all volatility-related properties of the underlying stock’s returns, such as time-variation in instantaneous volatility, fat tails, and jumps; and it will also ref lect risk premia on those factors and any others that investors care about. Risk-neutral volatility is the standard deviation under this density. We analyze these densities to explore how the RND, and by extension option pricing in the market, is affected by various volatility-related properties of the true returns distribution and by factors that may ref lect risk preferences. In so doing, we begin to 30 WHAT G OES INTO R ISK-NEUTRAL VOLATILITY? EMPIRICAL ESTIMATES OF R ISK AND SUBJECTIVE R ISK P REFERENCES JPM-Figlewski.indd 30 FALL 2016 18/10/16 11:13 am T The next section explores the model’s robustness by fitting it on data subsets, according to option maturity and time period. The final section offers some concluding comments. M A DATA IS A R TI C LE IN A N Y FO R The variable we wish to explain is the standard deviation of the S&P 500 Index on option expiration day under the risk-neutral probability density. The RND aggregates the individual risk-neutralized subjective probability beliefs within the investor population. The resulting density is not a simple transformation of the true but unobservable distribution of realized returns on the underlying asset, as in BS, nor should we expect it to obey any particular probability law. Our model-free fitting procedure allows maximum f lexibility to ref lect option market pricing. To compare RNDs from options with different maturities, the RND standard deviation is converted into an annualized percentage value. This is what we call risk-neutral volatility. The basic strategy of nonparametric RND extraction is well known by now, so to save space, the specifics of our implementation are sketched out in the Appendix, which provides more details about this and other technical aspects of the empirical work.5 IT IS IL LE G A L TO R EP R O D U C E TH RND is that it does not require us to know the market’s option pricing model. It is model-free. The VIX, which measures implied volatility for the overall stock market portfolio, is roughly based on the same model-free methodology that we employ. 3 However, the VIX focuses on a single 30-day maturity, whereas maturities in our sample range from 14 to 199 days. The VIX calculation also ignores several issues of data handling that we try to deal with more carefully, including extending the tails of the density beyond the range of available strike prices.4 Still, it is worth noting that over our 15-year sample period, the VIX averaged 22.1% while realized volatility was only 20.6% on average. This implies a volatility risk premium on index options equivalent to about 1.5 volatility points. Before moving on, we must consider how Ross’s [2015] recent theoretical breakthrough in this area applies to this research. It has long been thought that objective probability estimates and risk premia cannot be separately deduced from option prices without further information about either the market’s risk preferences or the true return-generating process. Ross proves a deep theoretical result showing this is not strictly true. But the assumptions required are strong enough that the immediate problem we are addressing remains, which is how to understand better how objective measures of different aspects of price risk and behavioral proxies for market “sentiment” get into real world option prices. The next section defines the variables we conjecture could inf luence investors’ demands for options. A practical issue rarely considered is how far back in time investors look in calculating volatility and other statistics from past returns. In this research, we have investigated different lag definitions as well as different cutoffs to define tail events and assume investors pay most attention to those that contain the most information about future volatility. Due to space limitations, this material is relegated to the online appendix. The following section first looks at univariate relationships in the form of simple correlations between the volatility measures and the explanatory variables and then presents a grand regression with all of the variables together. Among the interesting results from this exercise is strong evidence that several distinct aspects of volatility seem to be independently important and variables that were hypothesized to be correlated with investor risk attitudes do in fact appear to have significant explanatory power for RND volatility. FALL 2016 JPM-Figlewski.indd 31 RND and Realized Volatility Option prices. We take the midpoints of the closing bid and ask quotes for all traded S&P 500 Index options, downloaded from OptionMetrics that satisfied the following criteria: • Observation dates and option expirations between January 4, 1996, and April 29, 2011. • Option bid prices ≥ 0.50. The average index level in the sample is 1151. • Maturities 14 < T < 199 calendar days, allowing about three RND maturities per day. Interest rates and dividend yield. Riskless interest rates, interpolated to match the option maturity, and S&P 500 dividend yields were obtained from OptionMetrics. Realized volatility to expiration. An option’s price should embed the market’s best forecast of the volatility that will be realized from the present through option expiration, but this is not the volatility under the RND THE JOURNAL OF PORTFOLIO M ANAGEMENT 31 18/10/16 11:13 am T M A R FO N Y (3) IN A Like the absolute return for the day, this variable was included in our initial exploration, but the results seemed odd. In regressions with both the absolute return and the trading range, trading range comes in with a highly significant positive coefficient, but the coefficient on absolute return was consistently about the same size but strongly negative. Subsequently, market makers explained that managing their risk on a day when the market simply moves strongly up or down is much easier than on a day when there is high uncertainty and prices f luctuate over a broad range without finding a new equilibrium much different from the initial level. They suggested using the following instead. Date t trading range minus absolute return. The range minus the absolute change produced a single variable with a strongly positive and significant coefficient. Date t left- and right-tail returns. One of the most apparent differences between the empirical return distribution and the lognormal is that large absolute returns—tail events—are more common than the lognormal allows for, especially on the downside, so the occurrence and size of a tail event might be especially important to investors. But what constitutes a tail event? Rather than making ad hoc assumptions, as is usually done, we treated it as an empirical question, which we explored by comparing the explanatory power of alternative tail definitions, both in univariate regressions of RND volatility on a tail variable and also along with our other explanatory variables in multivariate specifications. Based on this investigation, as detailed in the online technical appendix, we classify the return on date t as a left- (right-) tail event if it falls in the 2% left(right-) tail (relative to a normal density). The cutoffs for tail returns are set at ±2.054 times the previous day’s one-day GARCH volatility forecast. If the return is not in the tail, the value of the tail variable is set to zero. IS This equation assumes 252 trading days in a year and adopts the standard approach of treating the daily mean return as equal to zero.6 Realized volatility is expressed in annualized percent, for example, 20% is 20.0. We consider explanatory variables in three sets. The first are variables related to volatility and extreme price movements that do not require analyzing historical data. In the second set are volatility-related variables calculated from historical data. The third set are variables chosen to ref lect behavioral factors that we feel might affect the risk-neutralization process. ⎛ S − St ,LOW ⎞ Range(t ) = 100 × log 1 + t ,HI ⎟ ⎝ ( t,, ,LOW )/2 ⎠ LE (1) C 252 T 2 ∑r . (T t ) t +1 τ R TI σ Realized = Date t trading range. The intraday trading range contains a lot of volatility information.8 Also the cost for market makers to hedge their “Greek letter risks” depends on the range over which the stock price travels in a day. For consistency with the other variable definitions, the date t range is converted into a logarithmic percentage in the following way: A when there is a volatility risk premium. We explore the real world differences by regressing RND volatility and future realized volatility on explanatory variables that we hypothesize are important to investors. Realized volatility for date T maturity is defined as TH Date t Volatility Variables σ t zt , zt ~ N (0,1) G A rt L TO R EP R O D U C E Date t return. The date t return is defined as 100 × log(St /St−1). Volatility, both empirical and implied, has a strong tendency to go up following a negative return. Models with separate stochastic factors for returns and volatility typically find correlation of about −0.7 between them. Absolute value of date t return. A large return shock of either sign is consistent with high current volatility, but this variable was replaced with a different but related one, as described in the following. GARCH model forecast. We adopt the Glosten, Jagannathan, and Runkle [GJR, 1993] specification: IL LE σ t2 = a b σ t2−1 + c rt2 + d 2 {rt − 1 < 0} t −1 r (2) IT IS where rt and σ t2 are the date t log return and conditional variance, and zt is a standard normal i.i.d. random variable. The return equation contains no constant term, consistent with Equation 1. The variance σ t2 is a function of the date t − 1 conditional variance and squared return shock, plus a term that increases σ t2 when rt−1 is negative. GJR-GARCH model forecasts were provided by the NYU Volatility Institute’s VLAB.7 32 WHAT G OES INTO R ISK-NEUTRAL VOLATILITY? EMPIRICAL ESTIMATES OF R ISK AND SUBJECTIVE R ISK P REFERENCES JPM-Figlewski.indd 32 FALL 2016 18/10/16 11:13 am T M A R FO Y N A IN Risk Attitude Variables A R TI C LE The variables discussed so far are all tied to statistical characteristics of returns. We now turn to the risk-neutralization process. We have selected a few factors that are potentially promising, but there are many others that it might make sense to explore. University of Michigan Index of Consumer Sentiment. This is a classic survey of consumers’ confidence level. It is not specifically tied to finance, but it is widely known and followed by investors. The daily value is set to the monthly sentiment index value for every day of the month. Baker–Wurgler Index of Investor Sentiment. Baker and Wurgler [2006] devised a measure of investor confidence based on factors such as the discount on closedend mutual funds, the first day returns on initial public offering (IPOs), etc.9 With all monthly series, the value is repeated for every day of the month. The price/earnings for the S&P 500 Index portfolio. The ratios of price to earnings (P/E) will be higher when investors anticipate favorable conditions and strong earnings growth in the future. We hypothesize that optimistic investors also accept lower volatility risk premia. The monthly P/E for the S&P Index was downloaded from Robert Shiller’s “Irrational Exuberance” website.10 The Baa–Aaa corporate bond yield spread. This daily credit spread series is the average yield spread between corporate bonds rated Baa by Moody’s and AAA bonds. It measures the default-risk premium on high-grade corporate debt. The series was downloaded from the St. Louis Fed’s FRED (Federal Reserve Economic Data) database.11 GARCH model error over a recent period equal to the option’s remaining lifespan. A positive (negative) value EP 252 t − 65 2 ∑ rτ . 65 t −1 R σ Hist = R O D U C E TH Past realized volatility and similar factors are computed from historical returns, but it is not known a priori what sampling horizon the market focuses on. As with tail definitions, we used the behavior of the RND as a window onto this issue. Most of the variables showed similar performance over a fairly broad range of sample lengths. In the end, we selected estimation windows that gave reasonable performance in the various exploratory regressions and that “made sense” in terms of the behavior one might expect from an intelligent investor. Details are given in the Appendix. The final choices are as follows. Left and right-tail events. The empirical distribution’s fat tails can be due to more frequent tail returns than under the normal and/or unexpectedly large tail outcomes conditional on falling in the tail. As mentioned previously, 2% and 98% cutoffs define a tail event. A historical window of two years (500 trading days) was chosen to gauge frequency for both tails. Return and absolute return from date t − n to t − 1. Recent index return is defined as 100 × log(St−1/St−n), but the “right” choice of n was not clear cut. We selected the return over just the last five trading days because it had good explanatory power and less chance of proxying for some other factor. Historical volatility. Historical volatility is computed from the squared log returns over the last 65 trading days, treating the expected drift as zero. Specifically, volatility uncertainty may require a larger risk premium when their forecasting model has been performing badly, so we incorporate two variables measuring historical accuracy of the GARCH model. A positive coefficient on the average GARCH error from date t − n to t − 1 means that RND volatility is higher when the GARCH model has been underpredicting realized variance, whereas a positive GARCH RMSE coefficient increases risk-neutral volatility following large model errors on either side. The historical sample length over which investors are assumed to gauge the GARCH model’s accuracy is set equal to the option’s remaining lifetime. IS Historical Volatility Variables (4) IT IS IL LE G A L TO Average daily range t − n to t − 1. The daily range is a proxy for potential hedging costs. Each day’s range is computed as in Equation 3. In the online technical appendix, we find that averaging the daily range over a window the same size as the option’s remaining lifetime performs distinctly better than any fixed horizon. Range over t − n to t − 1. If the underlying stock trades over a wide range during an option’s lifetime, an investor may make a short-term profit on an option early even if it ultimately expires out of the money. The range the index has traversed in the most recent 25 days is computed as a logarithmic percentage relative to the average price in the interval. GARCH error and GARCH RMSE (root mean squared error). Investors who dislike volatility and also FALL 2016 JPM-Figlewski.indd 33 THE JOURNAL OF PORTFOLIO M ANAGEMENT 33 18/10/16 11:13 am EXHIBIT 1 R TI C LE IN A N Y FO R M A T Sample Correlations A corresponds to average under- (over-) prediction of the squared returns. GARCH model errors in previous periods are more likely to affect current risk-neutral volatility through risk aversion than through the objective conditional forecast of future volatility. GARCH model RMSE. This variable is described earlier and in the online technical appendix. RND volatility premium on date t − 1. If “market sentiment” is persistent, perhaps because of excluded exogenous variables, that dependence can be captured by including the previous day’s volatility risk premium, which we measure as date t − 1 RND volatility minus the GARCH forecast of volatility to expiration. But simply “explaining” risk-neutral volatility by its own lagged value is not very revealing, so this variable is not included in the “all variables” specification. It is useful, however, to see if yesterday’s volatility risk premium contains information for predicting realized volatility and also to see how the coefficients on the other variables are affected if this variable is included in the regression. TH IS RESULTS Notes: The table shows the simple correlation coefficients among the variables considered in the study. The sample includes data from 10,152 date-option maturity pairs from January 4, 1996, through April 29, 2011, with bad data points excluded as described in the text. IL LE G A L TO R EP R O D U C E We first look at univariate correlations among the volatility-related dependent variables and the explanatory variables. This will show the typical explanatory power of each factor when considered in isolation. We then fit a grand regression on all variables together to gauge each one’s marginal contribution when included with the others. Because RND volatility impounds a forecast of volatility over an option’s remaining lifetime, there is potential correlation in residuals from any two options whose lifetimes overlap. This does not bias coefficient point estimates, but ordinary least squares (OLS) standard errors are inconsistent. The standard errors are corrected using an adaptation of the Newey and West [1987] model, as detailed in the online technical appendix. Simple Correlations IT IS Exhibit 1 shows simple correlations between the dependent variables and each of the explanatory variables. With more than 20 of them, multicollinearity is a concern, but only a few correlations are above 0.7, and these tend to be where one would expect them. For example, variables like historical volatility and the daily trading range are highly correlated with the GARCH forecast, 34 as one might expect. Yet, these correlations are not so high that they preclude precise coefficient estimates.12 Recall that in a one-variable regression, the correlation and regression coefficient have the same sign, and the R 2 is the square of the correlation coefficient. We first note that the correlation between RND volatility and realized volatility to expiration is only 0.613. The low value ref lects the fact that risk neutralization modifies the market’s prediction of the empirical density and, moreover, future volatility is just hard to forecast WHAT G OES INTO R ISK-NEUTRAL VOLATILITY? EMPIRICAL ESTIMATES OF R ISK AND SUBJECTIVE R ISK P REFERENCES JPM-Figlewski.indd 34 FALL 2016 18/10/16 11:13 am IS A R TI C LE IN A N Y FO R M A T The next four variables relate to the frequency and size of past extreme events. Seeing many returns in the tails should increase investors’ perceptions of volatility risk, and the exhibit shows positive correlations with the number of both left and right 2% tail events. The average size of a tail return is another dimension of tail risk. The average left-tail return is negative, so negative correlation means volatility increases when there is a large negative left-tail return. The right-tail return shows the expected positive correlation. Realized volatility is also related to these tail variables but with substantially lower correlations than RND volatility shows, in each case. The last section of Exhibit 1 contains variables related to the risk-neutralization process. Both RND and realized volatility are negatively correlated with the University of Michigan consumer sentiment survey and the P/E for the S&P index. The Baker–Wurgler measure, however, is less correlated with RND volatility and has the “wrong” sign for realized volatility, although the size is quite small. To the extent that these variables ref lect investor confidence, negative correlation with RND volatility is expected. The Baa–Aaa corporate bond yield spread measures market-perceived default risk. The variable is strongly positively correlated with the two volatility variables, which suggests that wider credit spreads in the bond market correspond to higher volatility in equity options. The next two variables capture how accurate the GARCH model forecast has been in the recent past. Investors perceive more volatility risk, and there seems to be more actual volatility, following a period in which volatility models exhibited large errors, especially when volatility was underpredicted during the sample period (i.e., mean GARCH error was positive). Both RND and future realized volatility are strongly correlated with GARCH RMSE. The final variable in Exhibit 1 is the previous day’s RND volatility premium. There is almost no correlation between yesterday’s volatility risk premium and today’s RND volatility level (although the premium itself is highly autocorrelated), and the correlation with realized volatility is actually negative. Thus, the difference between risk-neutral volatility and a good forecast of empirical volatility does not appear to contain information about future realized volatility or even about tomorrow’s risk-neutral volatility. The correlations in Exhibit 1 show that the factors expected to inf luence RND volatility and realized IT IS IL LE G A L TO R EP R O D U C E TH accurately—that is, the GARCH forecast and historical volatility show similar correlation levels with realized volatility. The next section contains the date t variables. Date t return is negatively correlated with the two dependent variables, but only weakly, probably because the exhibit reports correlations among levels, not changes. The absolute value of date t return is distinctly more important than the signed return, especially for RND volatility. The GARCH forecast of volatility through expiration is an extremely important variable. Its correlation with both RND and realized volatility is close to the largest of any of the exogenous variables.13 Extreme negative and positive “tail” returns show the expected signs—a large negative left-tail return leads to higher volatility and so does a positive return in the 2% right tail—but correlations are low. Next are the volatility-related variables computed from historical prices and returns. With negative correlations on last week’s return and positive correlations on the absolute return, when the market went up (down) over the past week, volatility fell (increased), but volatility was higher after a week with a large return of either sign. Historical volatility should be similar to GARCH, although less accurate as a forecast because it uses the same past returns, but in a less sophisticated way. Still, investors may well incorporate it in RND volatility because it is more accessible than a full-f ledged GARCH model to many market participants. As expected, the correlations between historical volatility over the last 65 days and the two dependent variables are very high; but for realized volatility, they are a little lower than the GARCH forecast. As we look backward, two different trading range concepts could be important. If the index moves over a broad range intraday, it signals high volatility and also greater hedging risk for market makers. The daily range averaged over a longer time period measures the latter effect in recent returns, and the correlations with the two volatility variables are strongly positive, more so for risk-neutral volatility.14 The second trading range concept is of more interest to a longer-term investor. Trading over a wide range during the option’s lifetime can be desirable if it provides an opportunity for the investor to gain a short-run profit, even when the option in question will eventually expire out of the money. Although RND volatility is very highly correlated with both range variables, they have less explanatory power for the volatility that is actually realized. FALL 2016 JPM-Figlewski.indd 35 THE JOURNAL OF PORTFOLIO M ANAGEMENT 35 18/10/16 11:13 am M A T columns feature key results and some interesting differences between RND volatility and realized volatility. Both show a significant leverage effect, with negative date t returns leading to higher volatility, and highly significant coefficients on the GARCH forecast. However, the coefficient on GARCH is more than twice as large for realized volatility as for RND volatility and both are well below 1.0. Trading on date t over a much wider range than the change from yesterday’s close is associated with higher risk-neutral volatility and even more strongly with subsequent realized volatility. But left- or righttail returns had no significant effect on subsequent Y N IN Exhibit 2 presents the results from four regressions run with the full set of explanatory variables, with t-statistics corrected for cross-correlation. The first four A Regressions with All Variables FO R volatility appear to do so with the anticipated signs. But correlation among the explanatory variables is high, so it is not clear how much any individual factor contributes beyond the fact that all of them are correlated with “volatility” broadly defined. To gauge the marginal inf luence of each variable, we now combine them all into a single grand regression. LE EXHIBIT 2 IT IS IL LE G A L TO R EP R O D U C E TH IS A R TI C Full Sample Regressions Notes: Results from OLS regressions of RND volatility and realized volatility from observation date through option expiration, regressed on the full set of explanatory variables. t-Statistics are corrected for cross-correlation caused by overlapping option lives. 36 WHAT G OES INTO R ISK-NEUTRAL VOLATILITY? EMPIRICAL ESTIMATES OF R ISK AND SUBJECTIVE R ISK P REFERENCES JPM-Figlewski.indd 36 FALL 2016 18/10/16 11:13 am R TI C LE IN A N Y FO R M A T we do not favor this specification for the RND volatility regression because this difference is a large part of what we are trying to explain with exogenous factors. We include it here simply to show that it has little effect on the other coefficients. There were changes in coefficient sizes and significance levels, but just a few variables changed sign and only the date t right-tail return went from significantly positive to significantly negative. The lagged RND volatility premium in the realized volatility regression is nearly significant but negative, and it contributes almost nothing to raising the regression R 2. This variable ref lects the risk-neutralization process, which is not a prediction of future volatility at all. A negative coefficient suggests that holding the other predictive variables constant, future realized volatility tends to be lower (higher) when the volatility risk premium is high (small). A SUBSAMPLE RESULTS IS This section explores the robustness of the relationships uncovered in the last section in subsamples broken down by maturity and in different economic environments. Exhibit 3 runs the regression of RND volatility on the full set of explanatory variables separately for contracts with less than 75 days to expiration and for longer maturity contracts. The results for short-term and longer-term options are remarkably similar. Only the average size of left-tail events changes sign and is insignificant in both subsamples. The average right-tail return in the last two years becomes significant in the longer maturity regression, while the coefficient on GARCH RMSE becomes insignificant (with the wrong sign in both regressions). Exhibit 4 compares regressions for both RND volatility and realized volatility on all variables over three different time periods. The first is January 1996–June 2003, which includes the Russian debt crisis and the Internet bubble, followed by the bear market of 2001– 2003. It was a period of medium volatility with a sharp run-up in stock prices and a sharp drop in the second half. The second subsample, July 2003–December 2007, was a time of persistently rising stock prices and extraordinarily low volatility. The third subperiod, January 2008–April 2011, includes the market crash in 2008, when volatility soared to extreme heights, followed by the aftermath. IT IS IL LE G A L TO R EP R O D U C E TH realized volatility. The t-statistics in the RND volatility regression are much higher, but the anomalous positive coefficient on a negative tail return implies that a shock to the downside reduced RND volatility. For RND volatility, all nine historical return variables except the average left-tail return were significant with the expected signs, although only three had significant explanatory power for realized volatility. A behavioral explanation is that different volatilityrelated aspects of the returns process may be independently relevant to, and valued by, certain classes of investors beyond their connection to future realized volatility. Market makers may focus especially on the recent daily range, whereas speculators may favor options on stocks that have traded over a wide range in the past month, and the most risk-averse investors might be particularly worried about tail events. Including historical volatility and GARCH in the same specification allows for the expected result that GARCH, as a superior forecaster, should dominate historical volatility in the realized volatility equation. Indeed, historical volatility gets a negative coefficient in the realized volatility equation, even though the simple correlation between the two is 0.589. Yet, historical volatility could be important to investors who are less comfortable with sophisticated econometrics. And in fact, both GARCH and historical volatility do get significantly positive coefficients in the RND equation, albeit with the GARCH coefficient being more than three times as large. Among the risk-neutralization variables, strength in the Michigan survey, the Baker–Wurgler investor confidence measure, and the market P/E were associated with lower RND volatility, although only the Michigan survey was significant. The Michigan sentiment measure was also significant for realized volatility, but Baker–Wurgler investor sentiment came out strongly significant with the wrong sign. The Baa–Aaa bond yield spread was not significant for RND volatility, but it had the expected positive sign and was quite large and nearly significant for realized volatility. In contrast to Exhibit 1, the two variables related to accuracy of the GARCH model were insignificant, and three of four had the wrong signs. The four rightmost columns present results from the same regressions with yesterday’s volatility premium added as an explanatory variable. Although yesterday’s volatility premium is obviously highly significant, FALL 2016 JPM-Figlewski.indd 37 THE JOURNAL OF PORTFOLIO M ANAGEMENT 37 18/10/16 11:13 am EXHIBIT 3 EP R O D U C E TH IS A R TI C LE IN A N Y FO R M A T Regressions of RND Volatility on All Variables for Short and Long Maturities Contracts R Note: The table reports results from the same RND volatility regressions shown in Exhibit 2, broken down according to option maturity. IT IS IL LE G A L TO Not surprisingly, there is considerable dispersion, but the important full sample results largely hold across subperiods for both RND and realized volatility. The differences tend to be more in statistical significance levels than in the fitted coefficients. The GARCH forecast remains highly significant, except for realized volatility in the second subperiod. Historical volatility is positive and significant in two of three RND volatility regressions, but insignificant in two subperiods for realized volatility, and significant but with a negative sign in the third. The date t and “last week” return variable coefficients were negative in nearly all cases and significant for most of them. These variables were consistently more signif icant in the RND volatility than the 38 realized volatility equations, suggesting that returns may affect risk attitudes in addition to objective volatility expectations. The date t trading range minus the absolute return was positive in all six regressions and highly significant in five of them. A large average daily range in the past increased RND volatility significantly in every subperiod but increased realized volatility only in the earliest one. The range covered by the index over the previous 25 days did not perform consistently. The coefficients in the RND volatility regressions were positive and significant overall in Exhibit 2 but only significant in the final subperiod; whereas for realized volatility, this variable was significantly positive in the earliest subperiod and significantly negative in the last. WHAT G OES INTO R ISK-NEUTRAL VOLATILITY? EMPIRICAL ESTIMATES OF R ISK AND SUBJECTIVE R ISK P REFERENCES JPM-Figlewski.indd 38 FALL 2016 18/10/16 11:13 am EXHIBIT 4 R O D U C E TH IS A R TI C LE IN A N Y FO R M A T Regressions of RND Volatility on All Variables over Three Subperiods EP Note: The table reports results from the same RND volatility regressions shown in Exhibit 2, broken down into subsamples by time period. IT IS IL LE G A L TO R The left- and right-tail variables did not perform well. A date t left-tail return received a significantly positive coefficient in all three RND volatility regressions, implying that an unusually large negative return was associated with a decrease in risk-neutral volatility. Frequent left-tail events in the past increased both riskneutral and realized volatility overall, but results varied widely across subperiods. Similarly, the average size of left-tail events showed inconsistent coefficient estimates and more than half had the wrong (i.e., positive) sign. Right-tail events showed a similar lack of consistency between RND volatility and realized volatility equations and over time. Among the risk-neutralization variables, a higher value for the Michigan survey reduced both RND volatility and realized volatility significantly, with a couple FALL 2016 JPM-Figlewski.indd 39 of exceptions, while the Baker–Wurgler measure of investor confidence was more ambiguous—with both volatility measures increasing when this confidence measure was higher in the middle subperiod. By contrast, the S&P 500 P/E had the right (negative) signs in Exhibit 2 but was not significant. Here, the coefficients are negative in all regressions, and half are significant. Overall, the coefficient on the bond yield spread was insignificant for RND volatility but positive and significant for realized volatility; whereas in Exhibit 4, it is significantly positive for realized volatility only in the final subperiod, consistent with default risk becoming a more important factor after 2008. Finally, the GARCH model errors and the GARCH RMSE continue to be largely insignificant and anomalous in the subsamples, as they were in Exhibit 2. THE JOURNAL OF PORTFOLIO M ANAGEMENT 39 18/10/16 11:13 am IS A R TI C LE IN A N Y FO R M A T beyond expected future volatility, such as the recent trading range, and the risk premia they require appear to be related to measures of confidence and sentiment. These non-BS factors are important determinants of how options are priced in the real world. A GARCH model forecast was found to be highly significant for both realized and RND volatility. As a less accurate predictor, historical volatility did not contribute significantly in forecasting realized volatility, but it did in the RND volatility regressions. Investors appear to pay attention to both estimators in pricing options. The well-known negative relationship of volatility with return was strongly manifested by risk-neutral volatility. Both the date t return and also the return over the previous week were estimated with highly significant negative coefficients overall and, with a single exception, in every subperiod regression. The larger and more significant coefficients in the RND volatility regressions suggest that empirical (P measure) volatility is correlated with returns, but transforming the P measure into the Q measure increases the effect. Trading over a wide range that produced little change in the closing index level was highly significant in increasing both RND and realized volatility. But a large average daily range in the recent past or a large total range also increased RND volatility substantially, while having no significant effect on realized volatility. These variables seem to be more strongly connected to investors’ and market makers’ risk aversion than to their objective forecasts of future volatility. Six variables were specifically chosen for their possible correlation with risk preferences. RND volatility was negatively correlated with the Michigan survey of consumer sentiment and the Baker–Wurgler index of investor sentiment, with the former showing strong performance for both RND volatility and realized volatility. It is not surprising to find high market volatility in times when consumers and investors are less confident. The effect appears distinctly stronger for RND volatility than for realized volatility, supporting the hypothesis that investor sentiment impacts risk neutralization. The P/E of the S&P 500 and the credit spread in the bond market are direct measures of the market’s willingness to bear risk. The consistently negative coefficient on the P/E suggests that it may also be a useful ref lection of market sentiment, although it was only significant in a few cases. However, the bond yield spread results were odd overall and inconsistent TO R EP R O D U C E TH A negative coefficient on GARCH RMSE implies that RND volatility is lower when the GARCH model has been less accurate in the recent past; and a negative coefficient on the GARCH error means that RND volatility was lower still when GARCH underpredicted volatility. To summarize the results of these subsample regressions, we note that there was not much difference between short- and long-maturity contracts in their sensitivities to the explanatory variables. More diverse results showed up when the sample was broken into subsamples by time period. High predicted volatilities from both the GARCH and historical volatility models were associated with significantly higher risk-neutral and future realized volatilities in most cases, although the relationship was weaker for historical volatility in the realized volatility regressions. The return variables showed the expected strongly negative correlation between return and volatility, whereas absolute return was more ambiguous. The variables relating to the trading range intraday and over a longer historical period were for the most part strongly positive: trading over a wider range increased RND volatility, and realized volatility too, in most cases. The tail event frequency and size showed inconsistent results, with some coefficients significant, but with possibly different signs in different subperiods and quite a few anomalous values. The results on proxies for the market’s risk tolerance were promising, though not overpoweringly strong. In most cases, positive sentiment seems to be associated with lower volatility. However, the Baa–Aaa bond yield spread and the two variables measuring the accuracy of the GARCH model did not add much explanatory power. L CONCLUDING COMMENTS IT IS IL LE G A We set out to develop stylized facts about what factors determine risk-neutral volatility. Let us summarize what we have uncovered. The first question was what return- and volatilityrelated factors are most important to investors. Under BS, this amounts to getting the most accurate estimate of the instantaneous volatility. We found that RND volatility is strongly, but not perfectly, correlated with realized volatility, with correlation around 0.6. Forecasts of realized volatility from a GARCH model are highly significant in explaining RND volatility. But investors seem to care about other features of the returns process 40 WHAT G OES INTO R ISK-NEUTRAL VOLATILITY? EMPIRICAL ESTIMATES OF R ISK AND SUBJECTIVE R ISK P REFERENCES JPM-Figlewski.indd 40 FALL 2016 18/10/16 11:13 am ENDNOTES 13 T Although the model is f itted on historical data, GARCH is a “date t” variable because ambiguity over how far back the historical data sample should go is not a major issue with a parametric model like GARCH. 14 Based on the analysis in the online technical appendix, the option’s remaining lifetime is selected as the horizon. M A in the breakdown across subperiods. Lastly, the two variables designed to measure investors’ confidence in the GARCH model’s accuracy did not add explanatory power. The coefficients were mostly insignificant with anomalous signs. FO N Y Baker, M., and J. Wurgler. “Investor Sentiment and the Cross-Section of Stock Returns.” Journal of Finance, 61 (2006), pp. 1645-1680. LE IN A Birru, J., and S. Figlewski. “Anatomy of a Meltdown: The Risk Neutral Density for the S&P 500 in the Fall of 2008.” Journal of Financial Markets, 15 (2012), pp. 151-180. R TI C Bliss, R., and N. Panigirtzoglou. “Testing the Stability of Implied Density Functions.” Journal of Banking and Finance, 26 (2002), pp. 381-422. IS A ——. “Option Implied Risk Aversion Estimates.” Journal of Finance, 59 (2004), pp. 407-446. Bollerslev, T., and V. Todorov. “Tails, Fears and Risk Premia.” Journal of Finance, 66 (2011), pp. 2165-2211. IT IS IL LE G A L TO R EP R O D U C E TH I thank the Q-Group and the NASDAQ OMX Derivatives Research Project at NYU for research support for this project. Thanks to Rob Capellini for preparing the GARCH forecasts. Valuable comments from Joanne Hill, Larry Harris, and participants at the Q-Group 2012 Spring Seminar, the University of Michigan finance seminar, and the ITAM Finance Conference are greatly appreciated. 1 See Poon and Granger [2003] or Jackwerth [2004] for reviews. 2 See, for example, Bliss and Panigirtzoglou [2002, 2004], Kozhan et al. [2010], Figlewski [2010], Christoffersen et al. [2011], or Giamouridis and Skiadopoulos [2012]. 3 See Chicago Board Options Exchange [2015]. 4 Jiang and Tian [2007] demonstrate that ignoring these features in the VIX methodology introduces substantial and unnecessary inaccuracy. 5 See also the appendix to Birru and Figlewski [2012] for full details on the RND extraction methodology. 6 The average daily return is only a few basis points, which is much smaller than the average sampling error on the mean, so it is generally felt that the small bias introduced by treating the mean as zero is preferable to the much larger errors that would result from using the sample mean in the calculation. 7 A GJR-GARCH model was f itted for every day during the sample and out-of-sample variance forecasts were generated for 1 to 252 trading days ahead. These were used to construct predictions of average variance from each observation date through option expiration. The resulting variances were converted to annualized percent volatilities. See vlab.stern.nyu.edu/. 8 Parkinson [1980] showed that a volatility estimator based on the intraday high and low can achieve the same accuracy as one using only close-to-close returns with five times as many observations. 9 The data were downloaded from Wurgler’s website, “Investor sentiment data (annual and monthly), 1934–2010,” people.stern.nyu.edu/jwurgler/. 10 www.econ.yale.edu/~shiller/data.htm. 11 research.stlouisfed.org/fred2/. 12 The full correlation matrix for all dependent and exogenous variables is shown in the Appendix. R REFERENCES FALL 2016 JPM-Figlewski.indd 41 Breeden, D., and R. Litzenberger. “Prices of StateContingent Claims Implicit in Option Prices.” Journal of Business, 51 (1978), pp. 621-652. Carr, P., and L. Wu. “Variance Risk Premiums.” Review of Financial Studies, 22 (2008), pp, 1311-1341. Christoffersen, P., K. Jacobs, and B.Y. Chang. “Forecasting with Option-Implied Information.” SSRN Working paper, 1969863, 2011. Chicago Board Options Exchange. “The CBOE Volatility Index – VIX.” http://www.cboe.com/micro/vix/vixwhite .pdf, 2015. Figlewski, S. “Estimating the Implied Risk Neutral Density for the U.S. Market Portfolio.” In Volatility and Time Series Econometrics: Essays in Honor of Robert F. Engle, edited by T. Bollerslev, J.R. Russell, and M. Watson. Oxford, U.K.: Oxford University Press, 2010. Giamouridis, D., and G. Skiadopoulos. “The Informational Content of Financial Options for Quantitative Asset Management.” In The Oxford Handbook of Quantitative Asset THE JOURNAL OF PORTFOLIO M ANAGEMENT 41 18/10/16 11:13 am Management, edited by B. Scherer and K. Winston, K. Oxford, U.K.: Oxford University Press, 2012. R M A T Glosten, L.R., R. Jagannathan, and D.E. Runkle. “On the Relation between the Expected Value and the Volatility of the Nominal Excess Return on Stocks.” Journal of Finance, 48 (1993), pp. 1779-1801. Y FO Harrison, J.M., and D.M. Kreps. “Martingales and Arbitrage in Multiperiod Securities Markets.” Journal of Economic Theory, 20 (1979), pp. 381-408. IN A N Jackwerth, J.C. Option-Implied Risk-Neutral Distributions and Risk Aversion. Charlottesville, VA: Research Foundation of AIMR, 2004. C R TI A C E TH Newey, W., and K. West. “A Simple, Positive Semi-definite, Heteroskedasticity and Autocorrelation Consistent Covariance Matrix.” Econometrica, 55 (1987), pp. 703-708. IS Kozhan, R., A. Neuberger, and P. Schneider. “Understanding Risk Premia in Index Option Prices.” Working paper, 2010. LE Jiang, G., and Y.S. Tian. “Extracting Model-Free Volatility from Option Prices: An Examination of the VIX Index.” The Journal of Derivatives, 14 (2007), pp. 35-60. O D U Parkinson, M. “The Extreme Value Method for Estimating the Variance of the Rate of Return.” Journal of Business, 53 (1980), pp. 61-68. R EP R Poon, S.-H., and C.W.J. Granger. “Forecasting Volatility in Financial Markets: A Review.” Journal of Economic Literature, 41 (2003), pp. 478-539. L TO Ross, S. “The Recovery Theorem.” Journal of Finance, 70 (2015), pp. 615-648. IT IS IL LE G A To order reprints of this article, please contact Dewey Palmieri at dpalmieri@ iijournals.com or 212-224-3675. 42 WHAT G OES INTO R ISK-NEUTRAL VOLATILITY? EMPIRICAL ESTIMATES OF R ISK AND SUBJECTIVE R ISK P REFERENCES JPM-Figlewski.indd 42 FALL 2016 18/10/16 11:13 am
© Copyright 2026 Paperzz