Information Asymmetry about the Firm and the Permanent Price Impact of Trades: Is there a Connection? Gideon Saar Lei Yu1 First Draft: July 2001 This Version: July 2002 1 Both authors are from the Stern School of Business, New York University, 44 West Fourth Street, Suite 9-190, New York, NY 10012-1126. Saar can be reached at Tel: 212-998-0318, Fax: 212-995-4233, E-mail: [email protected], and Yu can be reached at Tel: 212-998-0321, Fax: 212-995-4233, E-mail: [email protected]. The authors thank I/B/E/S for providing the data on analysts’ earnings forecasts, Frank Russell Company for data on the 1999 reconstitution of the Russell 1000 index, and Foster Provost for helping with the matching procedure used in the paper. The authors are grateful for helpful comments from Hendrik Bessembinder, Ekkehart Boehmer, Joel Hasbrouck, and seminar participants at Babson College, University of Utah, and NYU. Abstract We investigate whether permanent price impact measures describing intra-day price movements are helpful in characterizing information asymmetry about the fundamentals of firms. We conduct an event study of the Russell 1000 index reconstitution and find that the permanent price impact measures change around the event despite the fact that Russell 1000 membership is based on market capitalization and therefore the event is not associated with any change in private information about the firms. We also examine a large cross-sectional sample and find that the permanent price impact measures do not consistently reflect uncertainty about future earnings or relate to price informativeness about future earnings in a manner that is compatible with their use as proxies for information asymmetry about firms’ fundamentals. Information Asymmetry about the Firm and the Permanent Price Impact of Trades: Is there a Connection? The literature examining price movements on trades often makes a distinction between “temporary” and “permanent” price movements. Short-horizon temporary price movements have been attributed, for example, to inventory management on the part of market makers; the most common reason cited for the permanent price impact of trades has been the presence of investors with private information about the future cash flows of the firm who trade to profit from their information. A number of papers have put forward econometric specifications for estimating the permanent and temporary price impacts of trades using intra-day data comprised of trades and quotes (see, for example, Glosten, 1987; Glosten and Harris, 1988; Hasbrouck, 1988; Stoll, 1989; Hasbrouck, 1991a; Hasbrouck, 1991b; Madhavan and Smidt, 1991; Huang and Stoll, 1997; Madhavan, Richardson, and Roomans, 1997).1 While the aforementioned papers focused on quantifying the relative importance of temporary and permanent factors in explaining price movements, researchers have soon found a rich set of applications for these new tools. Since theoretically the permanent price impact of trades was associated with the incorporation into prices of private information about fundamentals, measures of the permanent price impact were used to quantify the extent of information asymmetry about a firm. These measures proved attractive in areas outside market microstructure (e.g., corporate finance) where researchers have been interested in characterizing the information asymmetry environment of a firm, which is inherently unobservable.2 There are two potential problems with using measures of permanent price impact estimated from trade and quote data to tell a story about the future cash flows, or fundamentals, 1 George, Kaul, and Nimalendran (1991) propose a methodology for estimating components of the spread that uses daily and weekly rather than intra-day data. 2 For example, these measures were used to investigate information asymmetry in the contexts of corporate events (Brooks, 1994; Jennings, 1994; Barclay and Dunbar, 1996; Krinsky and Lee, 1996; Desai, Nimalendran, and Venkataraman, 1998), the debt-equity mix (Illesy and Shastri, 2000), corporate diversification (Fee and Thomas, 1999), disclosure quality (Heflin, Shaw, and Wild, 2000), the opaqueness of banking firms’ assets (Flannery, Kwan, and Nimalendran, 2000), the size of the analyst following (Brennan and Subrahmanyam, 1995), and ownership structure (Heflin and Shaw, 2000a; Sarin, Shastri, and Shastri, 2000, Dennis and Weston, 2001; Dey and Radhakrishna, 2001). 1 of a firm. The first problem is that many elements in the trading environment of a stock can affect the estimation procedures and bias the estimates. For example, specific rules that govern trading, like the price continuity requirement at the NYSE, affect the adjustment of prices to order flow and therefore the measures of permanent price impact estimated from trades and quotes. A slowly moving target inventory level of NYSE specialists (see, for example, Hasbrouck and Sofianos, 1993) can also affect the estimates. This may partially explain the fact that, in our large cross-sectional sample, the magnitude of permanent price impact measures for NYSE stocks is on average three times that of NASDAQ stocks. Does it really mean that there is so much more information asymmetry about NYSE firms relative to Nasdaq firms? When elements that conceptually should not affect prices permanently get incorporated into the permanent price impact measures, it is unclear whether differences in the measures reflect differences in the information asymmetry environment or other influences. Even if we are certain that the permanent price impact measures indeed capture only things that should affect the price permanently, the second potential problem is that not only information asymmetry about the future cash flows of a firm can give rise to the permanent price impact of trades. The “discount rate” effect documented in the literature for longer horizon returns (e.g., Fama, 1990; Campbell and Ammer, 1993) demonstrates how return variation is partly caused by shocks to future expected returns, not just shocks to expected future cash flows.3 Saar (2001) shows in a sequential trade model how uncertainty about the distribution of preferences and endowments of investors affects the risk premium and generates a permanent price impact for trades even in the absence of information asymmetry about future cash flows. Therefore, the permanent price impact measures may be picking up information about investors that is unrelated to the fundamentals of firms. 3 Fama (1990), for example, finds that the discount rate effect accounts for about 30% of the variance of annual real returns on a value-weighted portfolio of NYSE stocks. Campbell and Ammer (1993) use a vector autoregression (VAR) decomposition of monthly stock excess returns and find that only 15% of the variance of returns is attributed to the variance of news about future dividends, while 70% is attributed to the variance of news about future excess returns. 2 Both problems weaken the link between the permanent price impact measures and information asymmetry about the firm. To have confidence in using these measures to make an inference about the level of information asymmetry, it is important to investigate empirically how close is the association between them. The objective of this paper is to examine the question whether measures constructed from trade and quote data to describe intra-day price movements are helpful in characterizing information asymmetry about the fundamentals of firms. Four papers in particular are related to our investigation: Neal and Wheatley (1998) and Clarke and Shastri (2001) are two papers looking at close-end funds; Clarke and Shastri (2000) and Van Ness, Van Ness, and Warr (2001) conduct “horse races” among different permanent price impact measures. Neal and Wheatley postulate that since the net asset value of a closed-end fund is reported every week, there can be little information asymmetry about its current liquidation value. They find large permanent price impact measures, and conclude that the estimates may be unreliable. Clarke and Shastri (2001) reach a different conclusion, arguing that uncertainty about private benefits paid to blockholders creates adverse selection costs. Clarke and Shastri (2000) use six econometric methodologies to estimate permanent price impact measures for 320 NYSE firms. They look at how these measures relate to each other, and also at the correlations between the measures and certain firm characteristics. The correlations with firm characteristics are statistically significant only for net sales and a dummy for regulated firms. Similarly, Van Ness, Van Ness, and Warr (2001) conduct a horse race of five permanent price impact measures using 856 NYSE stocks and examine their relations with corporate finance variables, volatility measures, and measures that involve analysts and institutional holdings. They find mixed results, with no single econometric methodology producing estimates that are consistently correlated with the variables they consider as information proxies. The strongest relation they document is between the permanent price impact measures and volatility. 3 Both “horse race” papers mentioned above are subject to the same conceptual problem. Since information asymmetry is unobservable, they use a variety of proxies about which they have stories of how the proxies relate to information asymmetry about fundamentals. However, since the permanent price impact measures also have a story that relates them to information asymmetry, the exercise turns into looking at relations among proxies without the ability to determine whether lack of relation between the permanent price impact measures and the other information proxies is the fault of the measures or the other proxies. The design in the present paper is more powerful since we use an event study that allows for a characterization of changes (or their absence) in the information asymmetry environment without the need to rely on proxies. We additionally provide a cross-sectional analysis for three reasons. First, it allows us to examine whether a weaker link exists between the measures and other information asymmetry proxies that would justify using the measures under some conditions. Second, we look at a much larger sample (2,797 stocks) than the other two papers and use stocks from the NYSE, AMEX and Nasdaq as opposed to only NYSE in these studies. Third, we are able to test the measures even without information asymmetry proxies by examining the cross-sectional relation between the measures and price informativeness about future earnings. While the measures used in the aforementioned papers have different names, such as the “permanent impact of the order flow innovation” or the “adverse selection component of the spread” or the “trade-correlated component of the efficient price variance,” they all use intra-day data to estimate the permanent changes in price induced by trades.4 Using any estimate of permanent price impact as a measure of information asymmetry about fundamentals is subject to the same two problems identified above. Thus instead of examining all possible measures, we choose two of the methodologies that impose different degree of structure on the trading process: a more structured trade-indicator model and a less structured variance decomposition procedure. The trade-indicator model, from Madhavan, Richard4 See Huang and Stoll (1997) for a discussion of the relationships among some of the measures. 4 son, and Roomans (1997), provides an estimate of the permanent price impact of the order flow innovation (henceforth, MRR). The variance decomposition procedure from Hasbrouck (1991b) provides two measures: (i) the trade-correlated efficient price variance component that is used as an “absolute” measure of information asymmetry (HASAB), and (ii) the ratio of the trade-correlated component to the total efficient price variance that is taken as a measure of the amount of private information relative to the total amount of information (HASR). We formulate the investigation of the association between the permanent price impact measures and information asymmetry about fundamentals in terms of “necessary” and “sufficient” links. The strongest possible association is when changes in information asymmetry about fundamentals are necessary to produce changes in the permanent price impact measures (holding constant the amount of uninformed trading). While such a claim is difficult to prove (as it requires considering all possible circumstances in which future cash flows can change), its converse is not. Showing that changes in private information about fundamentals are not necessary for the permanent price impact measures to change requires only one example. The Russell 1000 index reconstitution event provides a suitable environment in which we can test this issue. The composition of the index changes once a year and the new component stocks are determined solely based on their market capitalization at a single point in time (the last trading day of May), which is public information. Therefore, the event does not add information to what the market already knows, and we expect no change in information asymmetry about fundamentals. The permanent price impact measures should not change when controlling for the amount of uninformed trading if the measures indeed reflect information asymmetry about fundamentals. We look at how the permanent price impact measures change from the last trading week in May (before the new ranking of the Russell 1000 is established) to the first trading week in July (when the new index takes effect). We find that both MRR (the permanent price impact 5 of the order flow innovation) and HASAB (the trade-correlated component of the randomwalk variance of quote midpoint changes) decrease significantly around the event. The design of the test also enables us to demonstrate that these results cannot be attributed just to changes in liquidity trading or analyst activity around the event. We test the robustness of our findings with respect to alternative estimation intervals, price level effects, and possible changes in market-wide liquidity, and find that these do not change our conclusions. The event study is a strong test of the relation between the permanent price impact measures and information asymmetry about the firm since it does not require us to use other variables to independently characterize information asymmetry. But while the MRR and HASAB measures fail to pass the “necessary” test, it is still possible that a weaker “sufficient” relation exists. More specifically, we would like to know whether differences across stocks in the degree of information asymmetry about fundamentals are sufficient to generate differences in the estimates of the permanent price impact measures that are used as proxies for information asymmetry. Generally speaking, it is difficult to find good proxies for information asymmetry about future cash flows, a reason that no doubt contributed to the popularity of the permanent price impact measures. We choose for our cross-sectional tests only proxies that are constructed using earnings or their forecasts. Intuitively, variables that explicitly incorporate information about the firm’s future earnings should be more tightly connected to the firm’s future cash flows than measures that do not use such direct information (like the estimated permanent price impact measures). To implement the test we use uncertainty about future cash flows to represent information asymmetry since the greater the uncertainty, the more room (and incentives) there is for investors to acquire private information and trade on it profitably. Two of our proxies for uncertainty about future cash flows are based on dispersion in analysts’ earnings forecasts and two others are based on past earnings variability. We find that MRR provides conflicting results since it is negatively related to dispersion in analysts’ earnings forecasts but also negatively related to earnings predictability (as it should be). HASAB is related to one 6 of the analysts’ forecasts variables in the predicted direction in univariate tests, but gives conflicting results when the two types of proxies are used together. HASR is not related to any of the proxies for future cash flows uncertainty.5 We then conduct a cross-sectional test that bypasses the need for information asymmetry proxies by examining the relation between the permanent price impact measures and price informativeness about future earnings (i.e., how much of the information about future earnings is impounded into current prices). Ceteris paribus, the more trading on private information about the firm, the more informative should prices be with respect to future earnings (see, for example, Grossman and Stiglitz, 1980). We find that none of the permanent price impact measures relates to price informativeness in the predicted manner. By utilizing a variety of designs and tests, we offer a comprehensive assessment of the relation between the permanent price impact measures and information asymmetry about fundamentals. The results we present do not invalidate the use of measures estimated from intra-day data to examine the price impact of trades. However, they cast doubt on the use of very short-horizon price movements for making inferences concerning information asymmetry about the fundamentals of firms. The rest of the paper proceeds as follows. Section 1 describes the permanent price impact measures we use. Section 2 examines changes in the permanent price impact measures around the Russell 1000 index reconstitution. Section 3 presents cross-sectional analysis that relates the permanent price impact measures to proxies for uncertainty about future cash flows. Section 4 investigates the relation between the permanent price impact measures and price informativeness, and Section 5 is a conclusion. 5 For robustness, we also look at how the permanent price impact measures relate to analysts’ forecast errors, another common proxy for information asymmetry that uses earnings information. The analysis using this proxy points to the same conclusions as the analysis of uncertainty about future cash flows. 7 1 Permanent Price Impact Measures In this section we provide some details on the estimation of the permanent price impact measures. For a more comprehensive exposition of these methodologies we refer the readers to the original papers that developed them. 1.1 Madhavan, Richardson, and Roomans (1997) Let xt denote an indicator variable taking the value of 1 if the transaction at time t is buyer initiated and −1 if it is seller initiated, and let µt denote the post-trade expected value of a stock. The revision of beliefs following a trade is the sum of the change in beliefs due to public information and the change in beliefs due to the order flow innovation: µt = µt−1 + θ (xt − E[xt |xt−1 ]) + t (1) where θ is the permanent impact of the order flow innovation and is a measure of the degree of information asymmetry, and t is the innovation in beliefs between times t−1 and t due to public information. Let pt denote the transaction price at time t, and φ denote the market makers’ cost per share of supplying liquidity (compensating them for order processing costs, inventory costs, and so on). The transaction price can then be expressed as: pt = µt + φxt + ξt (2) where ξt captures the effects of stochastic rounding errors induced by price discreteness or possibly time-varying returns. Equations (1) and (2) can be used to obtain: ut = pt − pt−1 − (φ + θ)xt + (φ + ρθ)xt−1 (3) where ρ is the first-order autocorrelation of the trade initiation variable. Then, the measure of permanent price impact θ, alongside φ, ρ, λ (the unconditional probability that a transaction occurs within the quoted spread), and a constant α can be estimated using GMM applied 8 to the following moment conditions: E xt xt−1 − x2t−1 ρ |xt | − (1 − λ) ut − α (ut − α)xt (ut − α)xt−1 =0 (4) We use data from the TAQ database to estimate (4).6 Following Madhavan et al., the classification into buys and sells is done as follows: (i) if a trade price is greater than or equal to the prevailing ask then xt = 1, (ii) if the trade price is less than or equal to the bid then xt = −1, and (iii) if the trade price falls between the bid and the ask then xt = 0. We use the log transaction price for pt , and refer to the estimate of θ, the permanent impact of the order flow innovation, as the “MRR” measure.7 1.2 Hasbrouck (1991b) Hasbrouck models the quote midpoint qt as the sum of two unobservable components: qt = mt + st (5) where mt is the efficient price (i.e., expected end-of-trading security value conditional on all time-t public information) and st is a residual discrepancy term that is assumed to incorporate inventory control, price discreteness, and other influences that cause the midquote to 6 We apply various filters to clean the data. We only use trades for which TAQ’s CORR field is equal to either zero or one, and for which the COND field is either blank or equal to B, J, K, or S. We eliminate any trades with non-positive prices or where the change from the last trade price is greater than $5 ($20 for stocks with average price greater than $100). We also exclude a trade if its price is greater (less) than 150% (50%) of the price of the previous trade. We only use quotes from the exchange on which a stock is listed. We eliminate quotes for which TAQ’s MODE field is equal to 4, 5, 7, 8, 9, 11, 13, 14, 15, 16, 17, 19, 20, 27, 28, or 29. We exclude quotes with non-positive ask or bid prices, where the bid is greater than the ask, and where the ask is more than $9999. We require that the difference between the bid and the ask be smaller than 25% of the sum of the bid and the ask. We also eliminate a quote if the bid or the ask are greater (less) than 150% (50%) of the bid or ask of the previous quote. When signing trades, we employ yet an additional filter that requires the difference between the price and the prevailing midquote to be smaller than $8. 7 Madhavan et al. estimate (4) using the transaction price, rather than its logarithmic transformation, and therefore the components they estimate are in dollar terms (i.e., components of the dollar spread). In the event study (Section 2), we estimate two versions of θ: one using dollar prices and the other using log prices to get components in percentage terms (i.e., components of the relative spread). The results from both specifications are similar. In the cross-sectional part of the paper (both in Section 3 and Section 4), we use log transaction prices since a component of the relative spread seems to be better suited for comparisons across stocks. 9 deviate from the efficient price. The efficient price evolves as a random walk: mt = mt−1 + wt (6) where the innovation wt reflects updates to the public information set including the information in the order flow. The market’s signal of private information is the current trade innovation defined as xt − E[xt |Φt−1 ], where xt is signed volume and Φt−1 is the public information set prior to the trade. The impact of the trade innovation on the efficient price innovation is E [wt |xt − E[xt |Φt−1 ]]. Two measures of information asymmetry that Hasbrouck proposes are: 2 σw,x ≡ Var (E [wt |xt − E[xt |Φt−1 ]]) 2 Rw ≡ 2 σw,x Var (E [wt |xt − E[xt |Φt−1 ]]) = 2 σw Var(wt ) (7) (8) 2 where σw,x is the trade-correlated component of the random-walk variance of quote midpoint 2 changes representing an absolute measure of information asymmetry, and Rw is a measure of the amount of private information relative to the total amount of information. Let x0t be an indicator variable that takes the values {−1, 0, +1}, and define: x1t = log(xt + 1) (the log volume), and xt = {x0t , x1t }. Let rt = log qt − log qt−1 . The absolute and relative permanent price impact measures are estimated using the vector autoregressive (VAR) model: 5 rt = 5 ai rt−i + i=1 5 xt = bi xt−i + v1,t (9) Di xt−i + v2,t (10) i=0 5 ci rt−i + i=1 i=1 where bi is a 2 × 1 vector of coefficients, Di is a 2 × 2 matrix of coefficients, and v2,t is a 2 × 1 vector of error terms.8 8 Hasbrouck (1991b) uses a specification with volume and volume squared instead of log volume. Applying his exact specification to our analysis did not materially affect the results. We chose to present the specification using log volume because it seems to fit better the concave relation between trade size and price impact. 10 We use data from the TAQ database for the estimation of the VAR model in (9) and (10). Following Hasbrouck, the trade classification into buys and sells is done as follows: (i) if a trade price is greater than the prevailing midquote then x0t = 1, (ii) if the trade price is less than the prevailing midquote then x0t = −1, and (iii) if the trade price is at the prevailing midquote then x0t = 0. We estimate the VAR system using OLS. A finite VAR generally possesses a vector moving average (VMA) representation of an infinite order. The random walk variance and the trade-correlated component are calculated from the coefficients of the VMA representation truncated after 100 lags. Throughout the paper, we refer to the estimate of the absolute measure, expressed in the form of standard deviation per hour, as 2 “HASAB” and to the estimate of the relative measure (Rw ) as “HASR.”9 2 Event Study: Russell 1000 Reconstitution In this section we investigate whether changes in information about the firm are necessary for the permanent price impact measures to change, holding constant the amount of liquidity trading. The Russell 1000 index reconstitution provides us with a natural experiment for testing the “necessary” connection.10 The Russell 1000 index is comprised of the largest 1000 firms in terms of market capitalization.11 Membership in the Russell 1000 index is determined strictly by market capitalization at the end of the last trading day in May and reconstitution occurs once a year. During June, Frank Russell Company issues a list of firms to be added to and deleted from the indexes. The new indexes become effective on July 1. The more heavily studied changes to the S&P equity indexes, in contrast, involve a committee of nine people who make their decision taking into account attributes such as liquidity, profitability of the firm, market capitalization, 9 We also conduct the analysis using the raw estimates of the trade-correlated component of the randomwalk variance without conversion to standard deviation per hour. The results are very similar in both sign and significance level, and are therefore omitted from the presentation. 10 Madhavan (2001) examines return effects of the Russell reconstitution. 11 Excluded from all Russell indexes are foreign firms, limited partnerships, limited liability companies, royalty trusts, closed-ended investment management companies, ADRs, preferred stocks, pink-slipped companies, OTC bulletin board companies, and warrants and rights. The indexes also exclude stocks trading at below $1. 11 and sector representation of the stock.12 Revisions to the S&P indexes also occur continuously throughout the year on a stock-by-stock basis. As such, the Russell reconstitution differs from changes to the S&P indexes in that the criterion used is public information (market capitalization on May 28) and all changes occur at the same time. More importantly, additions to and deletions from the Russell 1000 index convey no new information about the firm, while S&P revisions might be viewed as expressing an opinion about a firm. Thus if the measures arise solely from private information about the fundamentals of firms, they should not change around the Russell 1000 reconstitution when controlling for the amount of uninformed or liquidity trading. If these measures are affected by other factors, like changes in the characteristics of investors that are unrelated to being informed or uninformed about the fundamentals of firms, we might detect changes around the reconstitution event. Controlling for the amount of uninformed trading is important because it has direct effect on the permanent price impact measures. However, it is difficult to identify trades as being initiated by liquidity traders (being uninformed is not an observable attribute). We use two variables in the regressions in an attempt to control for the amount of normal or liquidity trading. The analysis of both the addition and deletion samples, however, provides us with a more powerful robustness check on the effects of liquidity trading. The reason is that most stocks in our sample that were added to the Russell 1000 index (89 out of 105) were deleted from the Russell 2000 index, and all stocks that were deleted from the Russell 1000 index were added to the Russell 2000. Say we believe that a move from the Russell 1000 to the Russell 2000 index increases the amount of liquidity trading (perhaps more passive funds follow the Russell 2000 index). If the proxy for normal volume is not good enough, some of the increase in liquidity trading will be captured by the intercept in the regression we use to examine changes in the measures around the event, making it negative. At the same time, stocks that move from the Russell 2000 to the Russell 1000 should experience a decrease in liquidity trading. For them, insufficient control for liquidity trading will result in a positive 12 For papers examining changes to S&P equity indexes see Harris and Gurel (1986), Shleifer (1986), Jain (1987), Dhillon and Johnson (1991), and Lynch and Mendenhall (1997). 12 intercept. If the intercepts of the addition and the deletion samples have the same sign, however, the effect cannot be solely due to insufficient control for uninformed or liquidity trading. It is clear that the Russell reconstitution event is fairly transparent, and it is possible to predict prior to the last trading day in May which stocks are more likely to be added or deleted. In fact, firms like Investment Technology Group, Inc. offer the service of predicting the changes in order to assist fund managers who wish to begin rebalancing their portfolios in advance of the announced index changes. While some fund managers could be trading ahead of time based on a certain assessment as to which stocks will switch indexes, others could be trading based on a different assessment or waiting until the new composition is made known. Rebalancing before May 28 does not necessarily bias the results against finding any change between the permanent price impact measures estimated just before the end of May and immediately after July 1 if such behavior does not reduce the uncertainty about final membership. If the uncertainty is reduced, any effect that we detect is likely to be an underestimate of the real change in the measures. 2.1 Sample and Methodology The initial sample is constructed from a list of 157 stocks that were added to the Russell 1000 index in 1999 and 103 stocks that were deleted from the index.13 We restrict the sample to common stocks (for compatibility with the cross-sectional tests in Section 3 and Section 4) with data from April 1 through mid-July. We use CRSP and Dow Jones Interactive to identify firms with corporate events including mergers and acquisitions, share offerings, earnings announcements, dividends distributions, and stock splits within the sample period. These firms are eliminated from the sample in order not to confound the results (if a corporate event occurs within the sample period, the information environment of a firm can change). 13 The preliminary list of the changes was published by the Frank Russell Company on June 11 and the final list was published on July 7 (though the new composition of the index took effect on July 1). To construct the sample, we eliminate 12 stocks that were not present in both the preliminary and final lists. Note that the number of additions and deletions can be different due to bankruptcies and mergers that decrease the number of firms in the Russell 1000 index throughout the year. 13 The final sample consists of 105 additions and 70 deletions.14 Table 1 presents summary statistics for the firms used in the event study. There seems to be a slight increase in mean price between May 28 (the last trading day in May) and July 1 (when the new composition of the index takes effect). Average turnover increases slightly from the last week of May to the first week of July for both samples, but the cross-sectional mean of the average number of trades decreases for the addition sample. Dollar spreads and percentage spreads decrease for both additions and deletions. Our event study methodology is an adaptation of standard specifications.15 Let ψi,τ denote the permanent price impact measure for stock i estimated over time interval τ ∈ {pre, post} (i.e., either the pre-event interval or the post-event interval). The simplest specification assumes that the permanent price impact measure can be described as the sum of a stock-specific unconditional mean (µi ), an event effect (α), and an error term ( ψi,τ = µi + αδτ + i,τ ): (11) i,τ where δτ is an indicator variable that takes the value zero in the pre-event interval and one in the post-event interval, and i,τ is assumed i.i.d and normally distributed with mean zero and variance σi2 /2. Theoretical models that show how information asymmetry causes trades to have permanent price impacts also demonstrate how the price impact of trades can be affected by changes to normal or liquidity trading (i.e., volume due to uninformed traders that is unrelated to information events). A formulation that takes normal trading into account can be written as: ψi,τ = µi + αδτ + βVi,τ + i,τ (12) where Vi,τ is a proxy for the amount of normal volume. By examining differences between the pre- and post-event intervals, we can eliminate the 14 15 Similar results are obtained without eliminating firms with major corporate events. See, for example, Schipper and Thompson (1983) and Thompson (1985). 14 firm-specific mean. Equation 11 can be written in terms of differences as: ∆ψi ≡ ψi,post − ψi,pre = α + where i (13) i is normally distributed with mean zero and variance σi2 . Similarly, the formulation in terms of differences of Equation 12 is: ∆ψi = α + β ∆Vi + (14) i where ∆Vi ≡ Vi,post − Vi,pre .16 Testing that the Russell 1000 reconstitution event affects the permanent price impact measures is therefore equivalent to testing whether the intercept (α) of either Equation 13 or Equation 14 is statistically different from zero. Figure 1 shows the timeline for the event study. We use one week as the length of the interval over which the permanent price impact measures are estimated. Since the ranking of firms for the reconstitution is based on the closing prices on May 28, we use the week ending May 28 as the pre-event interval. Similarly, since the effective date of the changes is July 1, we use five trading days beginning July 1 as the post-event interval.17 For a start, we assume that the error terms are identically distributed across stocks and Equation 13 is implemented as a simple t-test. We also use the non-parametric Wilcoxon signed rank test to perform the analysis under less restrictive assumptions. Equation 14 is estimated using OLS with White’s heteroscedasticity-consistent standard errors. 16 We also used a specification with an interaction between the event dummy and the volume measure: ψi,τ = µi + αδτ + βVi,τ + γδτ Vi,τ + i,τ Thus, ∆ψi = α + β ∆Vi + γVi,post + i,τ Tests under this formulation give similar results to those without the interaction. 17 Unlike the pre-event interval, the five days beginning July 1 do not include all days of the week (due to Independence Day). Since there may be a day-of-the-week effect in the estimated permanent price impact measures (see, for example, Foster and Viswanathan, 1993), we also conducted the analysis with the first five days in July that comprise a full set of the days of the week (dates 1, 2, 6, 7, and 12). The results obtained were very similar to the results using the first five trading days in July, and are therefore omitted for brevity of exposition. 15 Figure 1: Timeline for Event Study 5-day pre-event interval 5-day post-event interval May 28 July 1 Market capitalization calculated from closing prices is used to determine membership in the new Russell 1000 index. New Russell 1000 index takes effect. We use two proxies to control for changes in normal volume: the average daily turnover and the average daily number of trades in the interval. We fully acknowledge that these may not be perfect controls. However, the internal check discussed in Section 2 enables us to test whether our results are driven by insufficient adjustment for liquidity trading by examining the intercepts of the addition and deletion samples to see if they have opposite signs. While throughout the paper we use the MRR measure estimated using log prices to get a measure that can be interpreted as a component of the percentage spread, in the event study part of the paper we also use the same methodology with dollar prices to get the permanent price impact measure in terms of dollars, MRR$. This is done to examine the robustness of our results to possible changes in the level of prices induced by the event that are unrelated to the information asymmetry environment. 2.2 Results Panel A of Table 2 presents the results of the event study for the addition sample. The t-tests indicate that the decrease in the mean measure between the pre- and post-event intervals is statistically different from zero for MRR$, MRR, and HASAB. The non-parametric Wilcoxon statistics are also highly significant. Normalizing by the pre-event magnitude of the measures, the median percentage changes in MRR$, MRR, and HASAB are −27.41%, −45.78%, and −25.80%, respectively. The fourth column of Panel A shows the intercept from the regression with AvgTurn as a proxy for normal volume and the sixth column shows the intercept from the regression with AvgTrd as a proxy for normal volume. The intercepts in the regressions 16 with MRR$, MRR, and HASAB are all negative and statistically significant. The mean, median, and intercepts in the HASR regressions are all positive but insignificant. The results for the deletion sample are presented in Panel B of Table 2. The means and medians of differences between the pre- and post-event measures are negative and statistically different from zero for the two MRR measures. HASAB also decreases after the event, though the t-test is only marginally significant (with a p-value of 0.0529); the Wilcoxon signed rank test is significant. Normalizing by the pre-event magnitude of the measures, the median percentage changes in MRR$, MRR, and HASAB are −21.50%, −17.44%, and −9.55%, respectively. The intercepts of the regressions with the proxies for normal volume tell the same story: significant negative intercepts for MRR$, MRR and HASAB. Again, HASR does not show any significant change around the event. The results from both the addition and deletion samples point to the same conclusion: the Russell 1000 reconstitution event is associated with a decrease in the permanent price impact measures (with the exception of HASR). This cannot be due to a change in private information about the fundamentals of the firms since the only criterion used for the reconstitution–market capitalization on May 28–is public information. The finding of a decrease in the measures for both the addition and deletion samples also rules out an explanation based on insufficient control for changes in liquidity trading. As the discussion in Section 2 points out, even if the proxies for normal volume are not perfect, a change in normal trading would produce opposite results in the addition and deletion samples. The reason is that if a membership in a certain index is associated with a particular amount of uninformed trading, we would expect that being added to the Russell 1000 and deleted from the Russell 2000 would produce the opposite effect to being added to the Russell 2000 and deleted from the Russell 1000. Finding that the permanent price impact measures decrease in both addition and deletion samples is evidence that the result is not being driven by changes in liquidity trading. This feature of the test also enables us to rule out that the effects on the permanent 17 price impact measures are due to changes in the production of fundamental information by analysts induced by changes in index membership. It is possible that analysts are more willing to follow stocks that are included in a certain index, say the Russell 1000. This can be due to more trading activity in these stocks by index funds, or just because of a larger market capitalization that induces more mutual funds to own the stock. If this is the case, it could be that the amount of fundamental information production would increase after inclusion in the Russell 1000 index. But at the same time, a stock leaving the Russell 1000 index should experience a decrease in fundamental information production. The fact that we get a similar decrease in the permanent price impact measures for both addition and deletion samples means that our results cannot be solely due to a change in fundamental information production by analysts. We conducted several tests to evaluate the robustness of our results. First, we used interval lengths other than five days. Two-day intervals displayed a similar significant decrease in the permanent price impact measures MRR$, MRR, and HASAB.18 We also used onemonth intervals and obtained results similar to those presented in Table 2 (i.e., a statistically significant decrease in MRR$, MRR, and HASAB).19 Since MRR$ and MRR behave in a similar fashion, it seems unlikely that the results are due to possible changes in the price level that may affect the estimated measures due to discreteness of the price gird. Nonetheless, we added the change in AvgPrc as a regressor to the specification in Equation 14 to account for changes in the price level, and found that it did not alter our results. We also used a dummy variable to examine whether the 16 stocks that were added to the Russell 1000 index but were not previously in the Russell 2000 index behave differently from the rest of the stocks. The coefficient of the dummy variable was not significant and similar results were obtained. For MRR$ and MRR, we repeated the tests 18 We used May 27 and 28 for the pre-event interval and July 1 and 2 for the post-event interval. Both intervals are comprised of the days Thursday and Friday, and hence the results are not due to a day-of-theweek effect. 19 The only difference was with respect to the HASR measure, which was significantly negative in the deletion sample and significantly positive in the addition sample. 18 using only positive estimates and only estimates that were statistically different from zero. This was done to eliminate the influence of stocks that did not have sufficient amount of trading to produce reliable estimates of the permanent price impact measures. The results were similar to those reported in Table 2. Lastly, we constructed a matched sample of stocks based on both market capitalization and average turnover in April 1999.20 The motivation behind constructing the matched sample was to investigate whether the Russell 1000 reconstitution event coincided with a market-wide change in liquidity that could have affected the estimates of the permanent price impact measures of all stocks. Unlike the results for the addition and deletion samples, all permanent price impact measures except one showed no statistically significant decrease in the matched sample. The decrease in HASAB in the addition-matched sample was significant but smaller in magnitude than the decrease in the addition sample, and a paired t-test (as well as a Wilcoxon signed rank test) showed that the difference between them was statistically significant. To summarize the results so far: the event study shows that the “necessary” link between information asymmetry about future cash flows and the permanent price impact measures does not exist. The estimated measures should not have changed around the Russell 1000 reconstitution event if they only reflect the information asymmetry environment, but we have found a significant decrease in all measures (except for HASR). The decrease in the measures is consistent with changes to the extent of uncertainty about the investor population in the spirit of Saar (2000, 2001). The Russell indexes are popular benchmarks of stock market performance in the United States. In 2001, an estimated 117 billion dollars in investment were indexed to the Russell 1000, 2000, and 3000 indexes.21 Therefore, the annual reconstitution event can generate buying and selling pressures on 20 The universe of stocks that was used for the matching consisted of all common stocks in the CRSP database with information from April 1 to July 12 that were not added to or deleted from the Russell 1000 index. 21 This estimate is taken from a report by Investment Technology Group, Inc. on the 2001 Russell index reconstitution. 19 stocks that are added to or deleted from the indexes. Uncertainty about whether a stock will end up in the Russell 1000 index or the Russell 2000 index during the last week of May creates uncertainty with respect to the characteristics of the investors (e.g., the amount of money in index funds) who trade the stock. This increased uncertainty for stocks that may be added to or deleted from the index increases the price impact of trades and is picked up by the permanent price impact measures. In the post-event interval, there is less uncertainty about the investor population since the changes to the index are announced and therefore the permanent price impact of trades decreases. 3 Uncertainty about Future Cash Flows In this section we conduct a cross-sectional investigation of the relation between the permanent price impact measures and proxies for uncertainty about future cash flows that we use to represent information asymmetry about the firm. We test whether differences across stocks in the degree of information asymmetry about future earnings are sufficient to generate differences in the estimates of the permanent price impact measures. 3.1 Sample The initial sample for the cross-sectional analysis is constructed from all common stocks with information in the CRSP database for the entire year of 1999 that had, on average, more than one analyst covering them in the I/B/E/S database. The reason for requiring more than one analyst is to allow the calculation of the standard deviation of analysts’ earnings forecasts. This screen leaves 3,144 stocks. The sample is further restricted to firms with information in TAQ, COMPUSTAT, and Value Line Investment Survey in order to allow the computation of the permanent price impact measures and the construction of other variables that will be explained in greater detail below. This requirement eliminates 268 firms from the sample. We also eliminate firms that switched exchanges during the sample period, leaving a final sample of 2,797 firms. 20 Table 3 presents summary statistics for the sample. The average daily market capitalization (AvgCap) over the sample period (January 4 through December 31, 1999) for firms in our sample ranges from $6.05M to $446.16B, testifying to its heterogeneous nature. The sample also spans a range of trading activity and price levels. The most active firm has on average 29,845.81 daily trades (AvgTrd), while the average firm has 459.26 daily trades, and the least actively traded firm in the sample has 3.28 trades per day. Average daily closing prices (AvgPrc) range from $0.24 to $550.05. The sample is also diverse with respect to analyst following and institutional holdings. NumEst is the average of 12 monthly observations (January through December, 1999) of the number of analysts with end-of-fiscal-year earnings forecasts for the current year in I/B/E/S. While the mean of NumEst is 8.80, it ranges from a minimum of 2 to a maximum of 40.18. The percentage institutional holdings from Value Line Investment Survey (InstHol) ranges from 0.23% to 99.59%. For eight stocks in the sample, the MRR estimates came out negative. While there is nothing in the estimation to constrain the MRR measure from being negative, it is unclear how to interpret these estimates. The negative measures do not affect our results as our findings are practically identical with and without these eight stocks. The estimation of the Hasbrouck (1991b) specification proved infeasible for two stocks in the sample for which the VMA could not converge. These stocks are eliminated from the sample for the tests involving the HASAB and HASR measures. 3.2 Methodology The first test we run examines whether cross-sectional differences in MRR, HASAB, and HASR reflect differences in information asymmetry about future cash flows. Since we do not have a way to directly measure information asymmetry, we focus instead on uncertainty about future cash flows. We use two definitions of dispersion of analysts’ forecasts to represent uncertainty about future cash flows: the standard deviation of analysts’ forecasts (StdEst) and the coefficient 21 of variation (CoefEst).22 The logic behind the test is that the greater the uncertainty about future cash flows, the more room (and incentives) there is for investors to acquire private information and trade on it profitably. A theoretical treatment of the information environment of analysts is provided by Barron, Kim, Lim, and Stevens (1998). They show that an increase in private information about the firm, ceteris paribus, will increase the dispersion of analysts’ forecasts. Monthly observations for the standard deviation and the mean of analysts’ earnings forecasts for the upcoming end-of-fiscal-year are taken from I/B/E/S. StdEst is the simple average of the monthly standard deviation observations for all months in 1999, and CoefEst is the ratio of StdEst to the absolute value of the average monthly mean observations. Time left until the end-of-fiscal-year seems to exert an influence on the monthly observations of the standard deviation of the forecasts in I/B/E/S (as well as the number of estimates). However, since we use twelve consecutive months of data, the average should have similar properties across firms with different end-of-fiscal-year dates. We also use two proxies for uncertainty about future cash flows constructed from past earnings information. The first is the earning predictability measure provided by Value Line Investment Survey (VLPred). This variable, available for 2016 stocks in our sample, is derived from the standard deviation of percentage changes in quarterly earnings over an eight-year period, with special adjustments for negative observations and observations around zero. VLPred takes the values 1 to 100, where higher values are associated with less variability of past earnings and hence higher predictability of future earnings. We also compute the standard deviation of past annual earnings from COMPUSTAT using the five years prior to 1999 (StdEPS).23 The idea behind both measures is that higher past earnings variability makes it more likely that future earnings cannot be predicted with great precision (allowing more room for private information production and profitable informed trading). 22 We also used the difference between the high and the low forecasts as an additional definition of uncertainty about future cash flows. The results using this definition were very similar to the results using the standard deviation of earnings’ forecasts, and are therefore omitted for brevity. 23 We repeated the analysis with the standard deviation of annual earnings computed using five years that include 1999. The results were very similar to those obtained using StdEPS. 22 We then estimate regressions with the permanent price impact measures as dependent variables and the proxies for uncertainty about future cash flows as independent variables using OLS with White’s Heteroscedasticity-consistent standard errors. The choice of control variables to add to the regressions is guided by the proxies we use for future cash flows uncertainty. More dispersion among analysts may reflect the nature of the industry or the number of analysts that typically follow firms in a specific industry. Therefore, we add to the regressions a set of 2-digit SIC dummies to control for industry effects. To relate our measures of uncertainty about future cash flows to information asymmetry we need to control for public information, and we (partially) do it by including the standard deviation of daily returns in 1999 from CRSP as a control variable. Similarly, there are more public information events (e.g., news articles) about larger firms. So, we add a second control for public information flow by including AvgCap, the average market capitalization in 1999 from CRSP (in log form), to the regressions. Two other controls are motivated by the market microstructure literature. First, the impact of informed trading on prices diminishes as the amount of liquidity trading or normal trading increases (see, for example, Kyle, 1985; Easley and O’Hara, 1987). Hence, we want to hold constant the amount of liquidity trading when we compare across firms. As a proxy for normal trading, we use the median daily turnover (daily number of shares traded over the number of shares outstanding) in 1999 from CRSP (MedTurn). The median is less sensitive to informational shocks that generate a lot of trading, and therefore is a better description of “normal” volume.24 Lastly, we use dummies for the three primary markets (NYSE, AMEX, and Nasdaq) to control for differences in the manner specialists or market makers set prices in these markets. The motivation behind this control is that on the NYSE, for example, the price continuity requirement affects the way in which specialists change prices, and this may be picked up by the spread decomposition and variance decomposition procedures (for a discussion of this point see Hasbrouck, 1991a). Table 4 presents summary statistics for 24 We use median turnover in log form, and add a small constant (0.00001) to the raw data before making the transformation to be able to accommodate stocks with zero median turnover. 23 the analysts’ forecasts and past earnings proxies, as well as correlations among the (log transformed) proxies and control variables that are used in the regressions. A potential econometric problem with the OLS estimation for the two proxies that use analysts’ forecasts is that the dispersion in forecasts can be endogenous.25 In other words, the amount of time analysts spend on producing their estimates (which may affect the dispersion), the number of estimates or the frequency of their updates may be influenced by how much profit can be made from this information. The profit opportunity, in turn, depends on the permanent price impact of trades–the same permanent price impact that the econometric methodologies estimate and that we use as a dependent variable. This rationale suggests that the dispersion of analysts’ forecasts may be correlated with the error term, causing OLS to be inconsistent. To examine this potential problem, we also estimate the cross-sectional relation using two-stage-least-squares (2SLS) and conduct Durbin-WuHausman tests.26 3.3 Results Panel A of Table 5 presents the results of the cross-sectional regressions with the permanent price impact of the order flow innovation, MRR, as the dependent variable. Of the two proxies using analysts’ forecasts, only StdEst is statistically significant, and it is negative. 25 This is not a problem with the proxies that use actual earnings data since it is hard to imagine how the permanent price impact of trades can affect the firms’ earnings per share. 26 See Wu (1973), Hausman (1978), and Davidson and MacKinnon (1989) for a discussion of this test. Using data in COMPUSTAT and CRSP, we construct six instruments for the dispersion in analysts’ forecasts. The first instrument is the log standard deviation of EPS for the five years prior to 1999 (available for 2767 firms in our sample). The second instrument is the one year lagged EPS growth (percentage change in EPS from 1997 to 1998), using the absolute value of EPS in the denominator to maintain consistent signs when EPS is negative (available for 2744 firms in our sample). The motivation for this instrument is that growth firms are associated with more uncertainty about future earnings. The third instrument is the average absolute value of quarterly sales growth in 1999 from COMPUSTAT (available for 2701 firms in our sample). This is a measure of the magnitude of change in the firm’s operations that presumably is correlated with uncertainty about its sales and earnings potential. The fourth instrument is the number of days with listing records in CRSP, which is a measure of the maturity of the firm. Presumably, it is easier to estimate the future earnings of more mature firms. The fifth instrument is the log number of SIC codes that are associated with the businesses of the firm (available for 2677 firms in our sample). Firms comprised of more businesses may be more difficult for analysts to evaluate, and this should increase the dispersion of the forecasts. The sixth instrument is the dividend payout ratio for 1999 (available for 2610 firms in our sample). The idea behind this instrument is that the more money is paid out, the less money is spent of new projects and therefore the less uncertainty there is about earnings. 24 This implies a greater permanent price impact for stocks with less uncertainty about future cash flows, which goes against the predicted direction. As for the two proxies using past earnings, VLPred is negative and (weakly) statistically significant. Here the direction is consistent with our expectation: higher predictability is associated with lower information asymmetry and therefore a smaller permanent price impact. Panel B of Table 5 presents the results with the trade-correlated component of the efficient price variance, HASAB, as the dependent variable. Here StdEst is negative but not significant by itself (though it gains weak statistical significance when it is in the regression alongside VLPred). CoefEst is positive and statistically significant. The positive relation between HASAB and uncertainty about future cash flows is in line with the role of this measure a proxy for information asymmetry. Both of the proxies using past earnings (VLPred and StdEPS) are negative but insignificant. Panel C of Table 5 presents the results using the amount of private information relative to the total amount of information, HASR, as the dependent variable. None of the proxies for uncertainty about future cash flows is statistically significant.27 The Durbin-Wu-Hausman exogeneity tests and the coefficients of the two-stage-leastsquares (2SLS) estimation are presented in Table 6. For all three measures, the exogeneity tests are insignificant (with very low chi-squared statistics). This gives us confidence that the results of the OLS estimations provide an accurate picture of the relationships among the variables. To further test the robustness of our results, we examined another variable constructed from earnings that had been used in the literature as a proxy for information asymmetry about future cash flows: analysts’ forecast errors.28 The findings using forecast errors were similar in nature to those reported in Table 5 for uncertainty about future cash flows. 27 Since HASR is a proportion, we also estimated the regressions with a logit transformation of HASR as the dependent variable. The results were qualitatively similar to those reported in Panel C and are therefore not presented here. 28 We define a forecast error as the absolute value of the difference between the actual annual earnings and the median forecast one month prior to the earnings announcement, deflated by the stock price five days prior to the announcement. See, for example, Thomas (2002). 25 To summarize the results of the analysis in this section: HASAB is related to one of the analysts’ forecasts proxies for uncertainty about future cash flows in the predicted direction in univariate tests, but gives conflicting results when the two types of proxies are used together. MRR provides conflicting results that are difficult to interpret, and HASR is unrelated to any of the proxies. 4 Price Informativeness about Future Earnings 4.1 Methodology We use the same sample to perform a different kind of cross-sectional analysis that does not require a proxy for information asymmetry about future cash flows. Our tests involve two measures of the informativeness of prices about future earnings that were developed in the accounting literature (see, Kothari, 1992; Kothari and Sloan, 1992; Collins, Kothari, Shanken, and Sloan, 1994; Durnev, Morck, Yeung, and Zarowin, 2001). Let rt = (Pt − Pt−1 )/Pt−1 be the current quarterly return and rt+τ be the return τ quarters ahead. Similarly, let ∆Et be the current quarterly earnings-per-share change and ∆Et+τ be the earnings change τ quarters ahead, where all earnings changes are scaled by the price at the beginning of the current quarter, Pt−1 . To measure stock price informativeness, we regress current stock returns on current and future earnings changes: rt = a + b0 ∆Et + bτ ∆Et+τ + τ cτ rt+τ + ut (15) τ Future stock returns are included among the regressors to control for the measurement error problem described by Collins et al. (1994). The two measures of price informativeness are: FERC ≡ bτ (16) τ 2 FINC ≡ Ra+b 0 ∆Et + τ bτ ∆Et+τ + τ cτ rt+τ +ut − R2a+b0 ∆Et +ut (17) FERC (Future Earnings Response Coefficients) is defined as the sum of the coefficients on the future earnings changes, and FINC (Future earnings INCremental) is the incremental 26 explanatory power of the future earnings changes (given that current earnings are already in the model). Both FERC and FINC are designed to detect the extent to which current stock prices capitalize information about future earnings. The general intuition behind using FERC and FINC to test the permanent price impact measures is that the more trading on private information, holding constant the extent of public information, the more informative are prices with respect to future earnings (for a theoretical justification see, for example, Grossman and Stiglitz, 1980). The need to control for public information arises since if we take the extreme case where all future cash flow information is revealed publicly, then both FERC and FINC will be at their maximum but there will be no information asymmetry. However, once we control for public information, the relation between trading on private information and FERC and FINC should be monotonically increasing. This is true in particular since MRR and HASAB (or HASR) are not just measures of information asymmetry, but are constructed as measures of trading based on private information that moves prices. Therefore, these measures relate directly to information incorporation into prices and thus should be more tightly related to FERC and FINC. Since we are interested in evaluating how much information about future earnings is incorporated into prices during our sample period, we use returns for the four calendar quarters in 1999. Our earnings data is taken from COMPUSTAT, and we have earnings up to and including the second quarter of 2000. To be able to use all four quarterly return observations, we set τ = 2 and estimate the information that is impounded into current prices about earnings two quarters in the future. Since each firm has only four observations, we cannot estimate FERC and FINC for individual firms. We group stocks with similar permanent price impact measures and estimate the specification in Equation (15) for each group. The motivation behind this procedure is that each group of stocks that have the same level of informed trading (holding public information constant) should have similar FERC and FINC estimates, and hence can be estimated together. 27 For example, to estimate FERC and FINC for the MRR measure we sort all stocks according to MRR and form 279 groups of ten stocks each.29 Each such group has approximately 40 firm-quarter observations that can be used for the estimation of FERC and FINC. We estimate Equation (15) using OLS with White’s heteroscedasticity-consistent standard errors. We then use the mean of the MRR coefficients of stocks in each group (MRRG ) to represent the level of permanent price impact for the group. FERCMRR and FINCMRR are computed according to (16) and (17). We perform separate estimations of Equation (15) for groups formed by sorting on HASAB and HASR. We then use correlations to examine the relationship between the price informativeness measures and the group measures of permanent price impact. Ideally, one should control for cross-sectional differences in public information and liquidity trading when looking at the relation between the permanent price impact measures and price informativeness. However, since we group stocks according to their permanent price impact measures, we cannot simply use a regression and plug in control variables. The mean of the market capitalization (or the volatility of returns) of stocks in a group has no real economic content. To exercise some control, we divide the stocks into four categories. First, we sort the 2,789 stocks according to their market capitalization. We then separate them into two categories, Low AvgCap and High AvgCap, with equal number of stocks. Each category is then sorted according to MedTurn and divided into two equal categories, Low MedTurn and High MedTurn. This procedure produces four control categories. Within each category, we sort according to the permanent price impact measure, create 70 groups of ten stocks each, and estimate FERC and FINC for the groups. We then compute the correlations separately for each category. Such a structure provides (partial) control for public information and liquidity trading. We use a similar procedure to control for public information and liquidity trading with StdRet and MedTurn, respectively. 29 We dropped the eight stocks for which the MRR measures were negative. Inclusion of these stocks in the groups does not materially affect the results. 28 4.2 Results The results of the FERC and FINC analysis are provided in Table 7. The predicted direction is that more informed trading is associated with a higher degree of price informativeness, which means larger magnitudes of FERC and FINC. The results of the unconditional analysis, where we look at the correlations for the entire sample, seem to go the other way. MRR and HASR are not significantly correlated with either FERC or FINC; HASAB has a negative correlation with FINC (-0.1316) that is statistically significant. The conditional analysis, where we sort the stocks into four categories according to two conditioning variables, should lessen the influence of differences across stocks in the degree of public information or the amount of liquidity trading. Panel B of Table 7 shows the conditional analysis for the MRR measure. When the controls are average market capitalization (AvgCap) and median turnover (MedTurn), the correlations of MRR with either FERC or FINC are not statistically different from zero. Similarly, when the controls are the standard deviation of daily returns (StdRet) and MedTurn, none of the correlations is statistically different from zero. Panel C of Table 7 shows the conditional analysis for the HASAB measure. When the controls are AvgCap and MedTurn, none of the four categories has correlations with FERC and FINC that are significantly different from zero, and most correlations are negative. When the controls are StdRet and MedTurn, the only correlation that is statically different from zero (and negative) is in the High StdRet and High MedTurn category. Panel D of Table 7 presents the conditional analysis for the HASR measure. There are both positive and negative correlations of HASR with FERC and FINC, but only one category (small, inactive stocks) has significant negative correlation with FINC. The bottom line of the FERC and FINC analysis is that despite our expectation that more trading by investors who are informed about future cash flows would increase the informativeness of prices with respect to future earnings, all permanent price impact measures seem to be either negatively correlated or not significantly correlated with the measures of 29 price informativeness. 5 Conclusion Our goal in this paper was to examine the question whether measures constructed from trades and quotes that describe very short-horizon price movements are helpful in characterizing information asymmetry about the fundamentals of firms. The most powerful test we conduct uses the Russell 1000 index reconstitution to look at whether changes in information asymmetry are necessary to produce changes in the estimated measures. We then investigate whether cross-sectional differences in uncertainty about future cash flows that is used as a proxy for information asymmetry are sufficient to produce differences in the estimated measures. Finally, we relate the permanent price impact measures directly to price informativeness about future earnings. In the introduction we mention two potential problems with the permanent price impact measures. One of the problems is that even if these methodologies capture only real permanent price changes, these can be due to uncertainty about investors or other factors that affect the risk premium and not necessarily due to information asymmetry about future cash flows. The results of the event study seem to suggest that this problem is real. The MRR (both in dollar and percentage terms) and the HASAB measures decreased significantly around the reconstitution event, and this cannot be attributed to mere changes in liquidity trading. These findings are consistent with an explanation of the permanent price impact of trades based on uncertainty about the preferences and endowments of investors rather than information asymmetry about the firm’s fundamentals. The cross-sectional results seem to suggest the presence of the other potential problem mentioned in the introduction: that the estimation procedures may incorporate into the permanent price impact measures influences of the trading environment that should not affect the price permanently. Because the various econometric specifications used to generate the measures differ in the assumptions they make, the resulting measures may incorporate 30 temporary price effects to a different degree. The MRR measure did not relate to uncertainty about future cash flows in a consistent fashion. HASAB performed slightly better in that it had a strong relation in the predicted direction with one proxy using analysts’ forecasts. Both HASAB and MRR, however, did not fare well in the price informativeness analysis, where more informed trading according to these permanent price impact measures was associated with less information in prices about future earnings. While HASR showed promise in the event study analysis in that it did not change around the Russell 1000 reconstitution, it did not have a significant relation with any of the proxies for uncertainty about future cash flows or with price informativeness. If information about the firm indeed affects prices in a more permanent fashion than other factors, then the better we are at identifying permanent price changes, the stronger will be the connection between the measures and future cash flow information. The methodology that generates HASAB uses five lags of price changes to identify permanent effects. The econometric model that generates the MRR measure essentially uses only one lag. As such, the MRR measure may incorporate more of the other factors that affect trading, reducing its effectiveness in identifying information about the firm. Further increasing the number of lags in the variance decomposition methodology, while preventing its use for thinly traded stock, may increase its effectiveness. This is especially important after the move to a decimal price grid, as the reduced tick size dramatically increases the number of quote changes. It is not clear, however, that very short-horizon price movements can at all be used effectively to identify information asymmetry about the firm. Callahan, Lee, and Yohn (1997) claim that the information asymmetry risk faced by a dealer, presumably the one reflected in the permanent price impact of trades, is not a measure of the long-term fundamental risk of investing in a particular firm. Market makers are concerned with order imbalances over very short horizons (often less than an hour), and these need not correspond to the creation of new information about the firm or the trading by investors with special knowledge about the firm. Even if information about the firm is one cause for these order flow imbalances, its 31 effects may be miniscule relative to other factors that influence minute-by-minute trading. Our findings suggest that the two potential problems we have identified with the use of the permanent price impact measures as proxies for information asymmetry about fundamentals may be at work. Without necessarily disqualifying the measures as tools for analyzing the price impact of trades, the overall weight of the evidence points to a conclusion that measures based on very short-horizon price movements may not be suitable for describing information asymmetry about the firm. 32 References Barclay, M. J., and C. G. Dunbar (1996): “Private Information and the Costs of Trading around Quarterly Earnings Announcements,” Financial Analysts Journal, NovemberDecember, 75—84. Barron, O. E., O. Kim, S. C. Lim, and D. E. Stevens (1998): “Using Analysts’ Forecasts to Measure Properties of Analysts’ Information Environment,” Accounting Review, 73(4), 421—433. Brennan, M. J., and A. Subrahmanyam (1995): “Investment Analysis and Price Formation in Securities Markets,” Journal of Financial Economics, 38, 361—381. Brooks, R. M. (1994): “Bid-Ask Spread Components Around Anticipated Announcements,” Journal of Financial Research, 42(3), 375—386. Callahan, C. M., C. M. Lee, and T. Lombardi Yohn (1997): “Accounting Information and Bid-Ask Spreads,” Accounting Horizons, 11(4), 50—60. Campbell, J. Y., and J. Ammer (1993): “What Moves the Stock and Bond Markets? A Variance Decomposition for Long-Term Asset Returns,” Journal of Finance, 48(1), 3—37. Clarke, J., and K. Shastri (2000): “On Information Asymmetry Metrics,” working paper, Katz Graduate School of Business, University of Pittsburgh. (2001): “Adverse Selection Costs and Closed-End Funds,” working paper, Katz Graduate School of Business, University of Pittsburgh. Collins, D. W., S. P. Kothari, J. Shanken, and R. G. Sloan (1994): “Lack of Timeliness and Noise as Explanations for the Low Contemporaneous Return-Earnings Association,” Journal of Accounting and Economics, 18, 289—324. Davidson, R., and J. G. MacKinnon (1989): “Testing for Consistency Using Artificial Regressions,” Econometric Theory, 5, 363—384. Dennis, P. J., and J. P. Weston (2001): “Who’s Informed? An Analysis of Stock Ownership and Informed Trading,” working paper, McIntire School of Commerce, University of Virginia. Desai, A. S., M. Nimalendran, and S. Venkataraman (1998): “Changes in Trading Activity Following Stock Splits and their Effect on Volatility and the Adverse-Information Fomponent of the Bid-Ask Spread,” Journal of Financial Research, 21(2), 159—183. Dey, M. K., and B. Radhakrishna (2001): “Institutional Trading, Trading Volume, and Spread,” working paper, Carlson School of Management, University of Minnesota. Dhillon, U., and H. Johnson (1991): “Changes in the Standard and Poor’s 500 List,” Journal of Business, 64(1), 75—85. 33 Durnev, A., R. Morck, B. Yeung, and P. Zarowin (2001): “Does Greater Firmspecific Variation Mean More or Less Informed Stock Pricing?,” working paper, Stern School of Business, New York University. Easley, D., and M. O’Hara (1987): “Price, Trade Size, and Information in Securities Markets,” Journal of Financial Economics, 19, 69—90. Fama, E. F. (1990): “Stock Returns, Expected Returns, and Real Activity,” Journal of Finance, 45(4), 1089—1108. Fee, C. E., and S. Thomas (1999): “Corporate Diversification, Asymmetric Information, and Firm Value: Evidence from Stock Market Trading Characteristics,” working paper, Eli Broad College of Business, Michigan State University. Flannery, M. J., S. H. Kwan, and M. Nimalendran (2000): “Market Evidence on the Opaqueness of Banking Firms’ Assets,” working paper, Graduate School of Business Administration, University of Florida. George, T. J., G. Kaul, and M. Nimalendran (1991): “Estimation of the Bid-Ask Spread and its Components: A New Approach,” Review of Financial Studies, 4(4), 623— 656. Glosten, L. R. (1987): “Components of the Bid-Ask Spread and the Statistical Properties of Transaction Prices,” Journal of Finance, 42, 1293—1307. Glosten, L. R., and L. E. Harris (1988): “Estimating the Components of the Bid-Ask Spread,” Journal of Financial Economics, 21, 123—142. Grossman, S. J., and J. E. Stiglitz (1980): “On the Impossibility of Informationally Efficient Markets,” American Economic Review, 70(3), 393—408. Harris, L., and E. Gurel (1986): “Price and Volume Effects Associated with Changes in the S&P 500 List: New Evidence for the Existence of Price Pressures,” Journal of Finance, 41(4), 815—829. Hasbrouck, J. (1988): “Trades, Quotes, Inventories, and Information,” Journal of Financial Economics, 22, 229—252. (1991a): “Measuring the Information Content of Stock Trades,” Journal of Finance, 46(1), 179—207. (1991b): “The Summary Informativeness of Stock Trades: An Econometric Analysis,” Review of Financial Studies, 4(3), 571—595. Hasbrouck, J., and G. Sofianos (1993): “The Trades of Market Makers: An Empirical Analysis of NYSE Specialists,” Journal of Finance, 48(5), 1565—1593. Hausman, J. A. (1978): “Specification Tests in Econometrics,” Econometrica, 46, 1251— 1272. 34 Heflin, F., and K. W. Shaw (2000a): “Blockholder Ownership and Market Liquidity,” Journal of Financial and Quantitative Analysis, 35(4), 621—633. Heflin, F., K. W. Shaw, and J. J. Wild (2000): “Disclosure Quality and Market Liquidity,” working paper, Krannert Graduate School of Management, Purdue University. Huang, R. D., and H. R. Stoll (1997): “The Components of the Bid-Ask Spread: A General Approach,” Review of Financial Studies, 10(4), 995—1034. Illessy, J. K., and K. Shastri (2000): “The Debt-Equity Mix in Preferred Stock and Adverse Selection Costs: An Empirical Investigation,” working paper, University of Pittsburgh. Jain, P. C. (1987): “The Effect on Stock Price of Inclusion in or Exclusion from the S&P 500,” Financial Analyst Journal, pp. 58—65. Jennings, R. (1994): “Intraday Changes in Target Firms’ Share Price and Bid-Ask Quotes Around Takeover Announcements,” Journal of Financial Research, 17(2), 255—270. Kothari, S. P. (1992): “Price-Earnings Regressions in the Presence of Prices Leading Earnings,” Journal of Accounting and Economics, 15, 173—202. Kothari, S. P., and R. G. Sloan (1992): “Information in Prices about Future Earnings: Implications for Earnings Reseponse Coefficients,” Journal of Accounting and Economics, 15, 143—171. Krinsky, I., and J. Lee (1996): “Earnings Announcements and the Components of the Bid-Ask Spread,” Journal of Finance, 51(4), 1523—1535. Kyle, A. S. (1985): “Continuous Auctions and Insider Trading,” Econometrica, 53(6), 1315—1335. Lynch, A., and R. R. Mendenhall (1997): “New Evidence on Stock Price Effects Associated with Changes in the S&P 500 Index,” Journal of Business, 70(3), 351—383. Madhavan, A. (2001): “The Russell Reconstitution Effect,” Manuscript. Madhavan, A., M. Richardson, and M. Roomans (1997): “Why Do Security Prices Change? A Transaction-Level Analysis of NYSE Stocks,” Review of Financial Studies, 10(4), 1035—1064. Madhavan, A., and S. Smidt (1991): “A Bayesian Model of Intraday Specialist Pricing,” Journal of Financial Economics, 30, 99—134. Neal, R., and S. M. Wheatley (1998): “Adverse Selection and Bid-Ask Spreads: Evidence from Closed-End Funds,” Journal of Financial Markets, 1, 121—149. Saar, G. (2000): “Prices and Spreads in Sequential Markets with Information Imperfections,” working paper, Stern School of Business, New York University. 35 (2001): “Investor Uncertainty and Order Flow Information,” working paper, Stern School of Business, New York University. Sarin, A., K. A. Shastri, and K. Shastri (2000): “Ownership Structure and Stock Market Liquidity,” working paper, Katz Graduate School of Business, University of Pittsburgh. Schipper, K., and R. Thompson (1983): “The Impact of Merger-Related Regulations on the Shareholders of Acquiring Firms,” Journal of Accounting Research, 21(1), 184—221. Shleifer, A. (1986): “Do Demand Curves for Stocks Slope Down?,” Journal of Finance, 41(3), 579—590. Stoll, H. R. (1989): “Inferring the Components of the Bid-Ask Spread: Theory and Empirical Tests,” Journal of Finance, 44(1), 115—134. Thomas, S. (2002): “Firm Diversification and Asymmetric Information: Evidence from Analysts’ Forecasts and Earnings Announcements,” Forthcoming in the Journal of Financial Economics. Thompson, R. (1985): “Conditioning the Return-Generating Process on Firm-Specific Events: A Discussion of Event Study Methods,” Journal of Financial and Quantitative Analysis, 20, 151—168. Van Ness, B. F., R. A. Van Ness, and R. S. Warr (2001): “How Well Do Adverse Selection Components Measure Adverse Selection?,” Financial Management, 30(3), 77—98. Wu, D.-M. (1973): “Alternative Tests of Independence Between Stochastic Regressors and Disturbances,” Econometrica, 41(4), 733—750. 36 Table 1 Summary Statistics for the Event-study Sample The initial sample for the event study consists of all stocks that were added to or deleted from the Russell 1000 index in 1999. We restrict the sample to common stocks with trading data between April and mid-July, and eliminate firms about which we find news reports discussing mergers and acquisitions, share offerings, earnings announcements, dividends distributions, and stock splits in the months around the Russell reconstitution dates. The following variables are calculated for each stock using data in CRSP: CapMay28 is the market capitalization on May 28 (the number of shares outstanding multiplied by the closing price), PrcMay28 and PrcJuly1 are the closing prices on these two dates, BegTurn is the average daily turnover (the number of shares traded divided by the number of shares outstanding) for the pre-event interval (May 24–28), and EndTurn is the average daily turnover for the post-event interval (July 1–8). The following variables are calculated for each stock using data in TAQ: BegTrd and EndTrd are the average daily number of trades in the pre- and post-events intervals, Beg$Sprd and End$Sprd are the average dollar spreads in the pre- and post-event intervals, and Beg%Sprd and End%Sprd are the average relative spreads (dollars spreads divided by the mid-quotes) in the pre- and post-event intervals, respectively. Panel A presents the statistics for the 105 stocks in the addition sample, and Panel B presents the statistics for the 70 stocks in the deletion sample. Panel A: Addition Sample CapMay28 PrcMay28 PrcJuly1 (in $) (in million $) (in $) Mean Median Std. Dev. Min. Max. Obs. 2,456.23 1,704.72 2,628.09 197.75 21,402.83 105 48.98 44.50 25.78 9.88 177.19 105 50.81 46.50 26.88 10.06 149.66 105 BegTurn (in %) 1.54 0.76 1.77 0.03 8.52 105 EndTurn (in %) 1.69 0.69 3.24 0.07 25.58 105 BegTrd EndTrd 1,349.89 417.00 3,020.15 22.00 23,130.60 105 1,252.56 373.00 2,325.53 28.40 12,510.20 105 BegTrd EndTrd Beg$Sprd End$Sprd Beg%Sprd End%Sprd (in $) (in $) (in %) (in %) 0.27 0.22 0.18 0.09 1.13 105 0.22 0.19 0.13 0.08 0.72 105 0.60 0.53 0.28 0.19 2.49 105 0.48 0.43 0.23 0.14 1.97 105 Panel B: Deletion Sample CapMay28 PrcMay28 PrcJuly1 (in million $) (in $) (in $) Mean Median Std. Dev. Min. Max. Obs. 982.01 1,071.70 302.92 310.08 1,346.02 70 19.32 18.94 9.11 3.25 42.69 70 20.44 19.13 9.72 4.81 47.20 70 BegTurn (in %) 0.61 0.47 0.61 0.02 3.91 70 EndTurn (in %) 0.65 0.48 0.58 0.07 2.94 70 37 271.25 151.40 337.65 15.80 1,851.60 70 292.45 167.90 397.17 27.20 2,693.80 70 Beg$Sprd End$Sprd Beg%Sprd End%Sprd (in $) (in $) (in %) (in %) 0.15 0.13 0.07 0.06 0.46 70 0.13 0.11 0.04 0.06 0.31 70 0.87 0.70 0.45 0.41 3.23 70 0.73 0.62 0.30 0.28 1.69 70 Table 2 Event Study Analysis This table presents the analysis of changes in the permanent price impact measures around the Russell 1000 reconstitution in 1999. The pre-event interval is May 24–28 and the post-event interval is July 1–8 (each contains 5 trading days). ∆MRR$ is the change in the MRR$ measure (similar to MRR but estimated with price differences rather than log price differences) between the post- and pre-event intervals. Similarly, ∆MRR is the change in the MRR measure, ∆HASAB is the change in the HASAB measure, and ∆HASR is the change in the HASR measure. The second column reports the mean of the change in measures and the p-value (in parentheses) of a t-test against the hypothesis of zero change. The third column reports the median of the change in measures and the p-value (in parentheses) of a Wlicoxon signed rank test against the hypothesis of zero change. Regressions 1 and 2 (columns four through seven) both have the following form: ∆ψ i = α + β ∆Vi + ε i where ∆ψi is the change in the permanent price impact measure and ∆Vi is the change in a proxy for normal volume. In Regression 1, the proxy for normal volume is the average daily turnover over the interval (i.e., ∆Vi = EndTurn BegTurn). In Regression 2, the proxy for normal volume is the average daily number of trades over the interval (i.e., ∆Vi = EndTrd - BegTrd). The estimation of the regressions is done using OLS, and p-values (in parentheses) are calculated using White's heteroskedasticity-consistent standard errors. Panel A presents the results for the 105 stocks in the addition sample, and Panel B presents the results for the 70 stocks in the deletion sample. "**" indicates significance at the 1% level and "*" indicates significance at the 5% level (both against a two-sided alternative). Panel A: Addition Sample Dependent Variable ∆MRR$ ∆MRR ∆HASAB ∆HASR Mean (p-value of t-test) -0.004127** (0.0040) -0.009407** (0.0031) -0.144830** (0.0000) 0.000829 (0.9245) Median (p-value of Wilcoxon test) -0.001497** (0.0029) -0.005500** (0.0002) -0.126247** (0.0000) 0.003620 (0.8629) Regression 1 α (p-value) -0.004100** (0.0046) -0.009341** (0.0037) -0.146163** (0.0000) 0.001268 (0.8854) β(Turn) (p-value) -0.000179 (0.2825) -0.000433 (0.2250) 0.008628 (0.4067) -0.002844** (0.0014) Regression 2 α (p-value) -0.004183** (0.0037) -0.009525** (0.0030) -0.143406** (0.0000) 0.000421 (0.9618) β(Trd) (p-value) -0.000001* (0.0130) -0.000001** (0.0044) 0.000015 (0.2207) -0.000004 (0.0709) Panel B: Deletion Sample Dependent Variable ∆MRR$ ∆MRR ∆HASAB ∆HASR Regression 1 Regression 2 Mean (p-value of t-test) Median (p-value of Wilcoxon test) α (p-value) β(Turn) (p-value) α (p-value) β(Trd) (p-value) -0.004670** (0.0020) -0.029490** (0.0010) -0.059153 (0.0529) -0.030318 (0.0701) -0.001836** (0.0002) -0.012100** (0.0000) -0.076776* (0.0147) -0.014745 (0.0782) -0.004608** (0.0023) -0.029137** (0.0011) -0.062035* (0.0354) -0.029879 (0.0743) -0.001603 (0.1534) -0.009135* (0.0404) 0.074634 (0.3375) -0.011379 (0.6531) -0.004666** (0.0021) -0.029322** (0.0011) -0.063433* (0.0323) -0.030479 (0.0703) -0.000000 (0.8929) -0.000008 (0.4310) 0.000202* (0.0322) 0.000008 (0.6354) 38 Table 3 Summary Statistics for the Cross-sectional Sample The cross-sectional sample is constructed by including all common stocks with information in the CRSP database between January 4 and December 31, 1999 (the sample period) that have at least one analyst covering them in the I/B/E/S database. The sample is further restricted to stocks with information in TAQ, COMPUSTAT, and Value Line Investment Survey. The following variables are calculated for each stock over the sample period using data in CRSP: AvgCap is the average daily market capitalization (the number of shares outstanding multiplied by the daily closing price), StdRet is the standard deviation of daily returns, MedTurn is the median daily turnover (the number of shares traded divided by the number of shares outstanding), AvgVol is the average daily number of shares traded, and AvgPrc is the average daily closing price of the stock. From the TAQ database, AvgTrd is calculated as the average daily number of trades, and %Sprd is the average relative spread (the bid-ask spread divided by the midquote) calculated from all quote observations. NumEst is the average of 12 monthly observations in 1999 of the number of analysts with end-of-fiscal-year earnings forecasts for the current year in I/B/E/S. InstHol is the percentage of shares held by institutions from Value Line Investment Survey. The table presents summary statistics for the entire sample, as well as means for four equal groups sorted by AvgCap, StdRet, and MedTurn, respectively. Entire Sample Means For AvgCap Groups Means for StdRet Groups Means for MedTurn Groups Mean Median Std. Dev. Min. Max. Obs. Q1 (Low) Q2 Q3 Q4 (High) Q1 (Low) Q2 Q3 Q4 (High) Q1 (Low) Q2 Q3 Q4 (High) AvgCap (in million $) 4,403.02 549.40 19,124.85 6.05 446,161.27 2,797 105.54 349.29 1,033.10 16,107.41 7,236.77 7,347.18 2,138.52 894.62 3,410.40 5,773.78 4,894.88 3,534.26 StdRet MedTurn (in %) (in %) 3.62 0.50 3.32 0.31 1.70 0.65 0.50 0.00 25.73 15.64 2,797 2,797 4.46 0.37 3.90 0.50 3.27 0.55 2.86 0.59 1.89 0.20 2.81 0.31 3.90 0.53 5.89 0.97 2.84 0.11 2.99 0.24 3.69 0.42 4.98 1.24 AvgVol (in 100s) 5,152.00 1,606.85 14,028.88 11.00 280,833.61 2,797 829.05 1,812.48 3,014.31 14,938.17 3,602.80 6,023.61 5,775.37 5,206.16 1,180.33 3,818.48 5,150.48 10,451.15 39 AvgPrc (in $) 26.59 20.84 24.58 0.24 550.05 2,797 10.11 18.53 28.98 48.71 36.50 28.37 21.56 19.96 21.50 26.33 27.34 31.18 AvgTrd 459.26 121.50 1,614.92 3.28 29,845.81 2,797 75.78 163.50 258.08 1,338.40 239.24 446.16 499.57 651.79 83.08 240.11 346.37 1,166.46 %Sprd NumEst (in %) 8.80 1.25 6.45 0.88 6.78 1.19 2.00 0.07 40.18 19.47 2,797 2,797 3.87 2.63 5.71 1.26 8.58 0.74 17.01 0.38 10.28 0.70 10.01 1.02 8.39 1.43 6.50 1.85 5.84 1.86 9.55 1.17 9.79 1.12 0.86 10.00 InstHol (in %) 50.30 51.06 22.70 0.23 99.59 2,797 36.71 47.86 55.52 61.10 51.34 56.31 52.63 40.94 36.98 52.87 57.43 53.91 Table 4 Summary Statistics for Future Cash Flows Uncertainty Proxies We use as proxies for uncertainty about future cash flows two definitions of dispersion of analysts' earnings forecasts: log of the standard deviation of analysts' forecasts (StdEst) and log of the coefficient of variation of analysts' forecasts (CoefEst). I/B/E/S provides monthly observations of the standard deviation and mean of the analysts' forecasts for the current end-of-fiscal-year. StdEst is the simple average of the monthly standard deviation observations for all months in 1999, and CoefEst is the ratio of the average standard deviation to the average absolute mean forecast. We also use two proxies for uncertainty about future cash flows based on past earnings: VLPred is an earning predictability measure provided by Value Line Investment Survey derived from the standard deviation of percentage changes in quarterly earnings over an eight-year period, and StdEPS is log of the standard deviation of annual earnings for the five years prior to 1999. Panel A presents summary statistics for the entire sample (before taking the log transformations), as well as for four equal groups sorted by AvgCap (average daily market capitalization), StdRet (the standard deviation of daily returns), and MedTurn (the median daily turnover), respectively. Panel B presents pairwise correlations and p-values of the four future cash flows uncertainty proxies and the three control variables used in the cross-sectional regressions (where LAvgCap and LMedTurn are the log transformations of AvgCap and MedTurn, respectively). The two-sided p-values for the asymptotic test of zero correlations are shown in parentheses. "**" indicates significance at the 1% level and "*" indicates significance at the 5% level. Panel A: Summary statistics of untransformed variables StdEst CoefEst Mean Entire 0.0875 0.3825 Median Sample 0.0482 0.0405 Std. Dev. 0.1254 7.1752 Min. 0.0008 0.0008 Max. 2.5400 363.0000 Obs. 2,797 2,797 Q1 (Low) Means 0.0939 1.0636 Q2 for 0.0863 0.2035 Q3 AvgCap 0.0783 0.1374 Q4 (High) Groups 0.0913 0.1260 Q1 (Low) 0.0483 Means 0.0816 Q2 for 0.0706 0.0844 Q3 StdRet 0.0850 0.7530 Q4 (High) Groups 0.1127 0.6440 Q1 (Low) Means 0.0747 0.1361 Q2 for 0.0764 0.1517 Q3 MedTurn 0.0981 0.9204 Q4 (High) Groups 0.1006 0.3220 40 VLPred 47.6984 45.0000 28.3023 5.0000 100.0000 2,016 33.3788 40.6672 51.3684 59.7168 62.2476 51.4136 38.9774 26.9558 54.1795 53.6630 45.3531 35.5310 StdEPS 0.9144 0.5500 1.2750 0.0100 22.0300 2,767 0.6920 0.8209 0.9022 1.2416 0.9278 0.9567 0.8688 0.9046 0.7267 0.8385 1.0180 1.0726 Panel B: Unconditional correlations of transformed variables used in the regressions Control Variables StdEst CoefEst VLPred StdEst (p-value) _ CoefEst (p-value) 0.6757** (0.0000) _ VLPred (p-value) -0.4081** (0.0000) -0.5739** (0.0000) _ StdEPS LAvgCap StdRet LMedTurn 41 StdEPS (p-value) 0.3784** (0.0000) 0.1582** (0.0000) -0.3612** (0.0000) _ LAvgCap (p-value) -0.0693** (0.0003) -0.3141** (0.0000) 0.3844** (0.0000) 0.2117** (0.0000) _ StdRet LMedTurn (p-value) (p-value) 0.1145** 0.0468* (0.0000) (0.0174) 0.4512** 0.1408** (0.0000) (0.0000) -0.4077** -0.2185** (0.0000) (0.0000) -0.0267 0.1339** (0.2118) (0.0000) -0.3738** 0.1752** (0.0000) (0.0000) _ 0.4686** (0.0000) _ Table 5 Uncertainty about Future Cash Flows Results This table provides the results of cross-sectional regressions where the permanent price impact measures are the dependent variables and the proxies for uncertainty about future cash flows (alongside controls) are the independent variables. We use as proxies for uncertainty about future cash flows two definitions of dispersion of analysts' earnings forecasts: log of the standard deviation of analysts' forecasts (StdEst) and log of the coefficient of variation of analysts' forecasts (CoefEst). I/B/E/S provides monthly observations of the standard deviation and mean of the analysts' forecasts for the current end-of-fiscal-year. StdEst is the simple average of the monthly standard deviation observations for all months in 1999, and CoefEst is the ratio of the average standard deviation to the absolute value of the average monthly mean observations. We also use two proxies for uncertainty about future cash flows based on past earnings: VLPred is an earning predictability measure provided by Value Line Investment Survey that is derived from the standard deviation of percentage changes in quarterly earnings over an eight-year period, and StdEPS is log of the standard deviation of annual earnings for the five years prior to 1999. The following set of control variables is used in each regression: the logarithmic transformation of the average daily market capitalization (LAvgCap), the standard deviation of daily returns (StdRet), and the logarithmic transformation of the median daily turnover (LMedTurn). The regressions also include (but they are not shown in the table) a set of two-digit SIC dummies to control for industry effects, and dummies for the three primary markets (NYSE, AMEX, and Nasdaq) to control for institutional characteristics of the markets. Panel A presents the results for the MRR measure, the permanent price impact of the order flow innovation, estimated using the methodology in Madhavan, Richardson, and Roomans (1997). Panel B presents the results for the HASAB measure, the trade-correlated component of the efficient price variance (in standard deviation per hour form), estimated using the methodology in Hasbrouck (1991b). Panel C presents the results for HASR, the ratio of the trade-correlated component to the total efficient price variance, also from Hasbrouck (1991b). All estimations are done using OLS, and we report p-values in parenthesis calculated using White’s heteroskedasticity-consistent standard errors. "**" indicates significance at the 1% level and "*" indicates significance at the 5% level. Panel A: MRR (dependent variable) Control variables StdEst (p-value) CoefEst (p-value) VLPred (p-value) StdEPS (p-value) LAvgCap (p-value) -0.00468** (0.0001) -0.0263** (0.0000) -0.0260** (0.0000) -0.0238** (0.0000) -0.0258** (0.0000) -0.0240** (0.0000) -0.0240** (0.0000) -0.0265** (0.0000) -0.0261** (0.0000) -0.00130 (0.1527) -0.00011* (0.0372) -0.00023 (0.8506) -0.00405** (0.0076) -0.00221 (0.0630) -0.00506** (0.0001) -0.00138 (0.1382) -0.00015** (0.0048) -0.00014** (0.0067) 0.00143 (0.2848) 0.00008 (0.9470) 42 StdRet (p-value) 0.6288** (0.0000) 0.6263** (0.0000) 0.4969** (0.0024) 0.5453** (0.0000) 0.5278** (0.0012) 0.5668** (0.0014) 0.5868** (0.0000) 0.5873** (0.0000) LMedTurn (p-value) AdjR2 (in %) Obs. -0.0360** (0.0000) -0.0365** (0.0000) -0.0377** (0.0000) -0.0357** (0.0000) -0.0375** (0.0000) -0.0379** (0.0000) -0.0354** (0.0000) -0.0357** (0.0000) 63.21 2788 63.02 2788 65.23 2011 63.18 2758 65.37 2011 65.29 2011 63.39 2758 63.20 2758 Panel B: HASAB (dependent variable) Control variables StdEst (p-value) CoefEst (p-value) VLPred (p-value) StdEPS (p-value) -0.00047 (0.8380) 0.00957** (0.0000) -0.00020 (0.1514) 0.00150 (0.5692) -0.00673* (0.0193) 0.01030** (0.0014) -0.00028 (0.0604) -0.00002 (0.8764) -0.00122 (0.6418) 0.00190 (0.5201) -0.00068 (0.8104) 0.00953** (0.0001) LAvgCap (p-value) -0.0256** (0.0000) -0.0237** (0.0000) -0.0204** (0.0000) -0.0254** (0.0000) -0.0206** (0.0000) -0.0195** (0.0000) -0.0256** (0.0000) -0.0235** (0.0000) StdRet (p-value) LMedTurn (p-value) AdjR2 (in %) Obs. 0.0268** (0.0000) 0.0263** (0.0000) 0.0307** (0.0000) 0.0270** (0.0000) 0.0310** (0.0000) 0.0318** (0.0000) 0.0271** (0.0000) 0.0270** (0.0000) 67.74 2786 68.04 2786 68.57 2009 67.77 2756 68.63 2009 68.83 2009 67.76 2756 68.05 2756 LMedTurn (p-value) AdjR2 (in %) Obs. 0.0092** (0.0000) 0.0091** (0.0000) 0.0118** (0.0000) 0.0095** (0.0000) 0.0118** (0.0000) 0.0117** (0.0000) 0.0096** (0.0000) 0.0095** (0.0000) 75.26 2786 75.26 2786 72.15 2009 75.37 2756 72.14 2009 72.14 2009 75.37 2756 75.38 2756 7.9227** (0.0000) 7.62220* (0.0000) 8.4458** (0.0000) 7.9068** (0.0000) 8.4970** (0.0000) 8.1206** (0.0000) 7.9169** (0.0000) 7.6174** (0.0000) Panel C: HASR (dependent variable) Control variables StdEst (p-value) CoefEst (p-value) VLPred (p-value) StdEPS (p-value) -0.00093 (0.4650) -0.00112 (0.2206) 0.00001 (0.8170) -0.00008 (0.9522) -0.00054 (0.7636) -0.00099 (0.4412) -0.00112 (0.3929) -0.00127 (0.1683) 0.00001 (0.8986) -3.33e-06 (0.9587) 0.00029 (0.8296) 0.00021 (0.8724) LAvgCap (p-value) -0.0049** (0.0001) -0.0050** (0.0000) -0.0040** (0.0094) -0.0048** (0.0001) -0.0040** (0.0090) -0.0041** (0.0081) -0.0050** (0.0000) -0.0051** (0.0000) 43 StdRet (p-value) -0.6509** (0.0000) -0.6248** (0.0000) -0.5386** (0.0002) -0.6724** (0.0000) -0.5345** (0.0003) -0.5075** (0.0008) -0.6632** (0.0000) -0.6338** (0.0000) Table 6 Durbin-Wu-Hausman Tests and Cross-sectional 2SLS Estimation This table provides the results of the Durbin-Wu-Hausman tests as well as cross-sectional two-stage-least-squares (2SLS) regressions where the permanent price impact measures are the dependent variables and the analyst proxies for uncertainty about future cash flows (alongside controls) are the independent variables. The following instruments are used for analysts' earnings forecasts dispersion: (i) the log standard deviation of EPS for the five years prior to 1999 from COMPUSTAT, (ii) the one-year lagged growth of EPS, (iii) the average absolute value of quarterly sales growth in 1999 from COMPUSTAT, (iv) the number of days with listing records in CRSP, (v) the log of the number of SIC codes associated with the firm, and (vi) the dividend payout ratio for 1999 from COMPUSTAT. We use as proxies for uncertainty about future cash flows two definitions of dispersion of analysts' earnings forecasts: log of the standard deviation of analysts forecasts (StdEst) and log of the coefficient of variation of analysts forecasts (CoefEst). I/B/E/S provides monthly observations of the standard deviation and mean of the analysts' forecasts for the current end-of-fiscal-year. StdEst is the simple average of the monthly standard deviation observations for all months in 1999, and CoefEst is the ratio of the average standard deviation to the absolute value of the average monthly mean observations. The following set of control variables constructed over the sample period is used in each regression: the logarithmic transformation of the average daily market capitalization (LAvgCap), the standard deviation of daily returns (StdRet), and the logarithmic transformation of the median daily turnover (LMedTurn). The regressions also include (but they are not shown in the table) a set of two-digit SIC dummies to control for industry effects, and dummies for the three primary markets (NYSE, AMEX, and Nasdaq) to control for institutional characteristics of the markets. The table presents the regression results for the MRR measure, the permanent price impact of the order flow innovation, estimated using the methodology in Madhavan, Richardson, and Roomans (1997), the HASAB measure, the trade-correlated component of the efficient price variance (in standard deviation per hour form), estimated using the methodology in Hasbrouck (1991b), and HASR, the ratio of the trade-correlated component to the total efficient price variance, also from Hasbrouck (1991b). "**" indicates significance at the 1% level and "*" indicates significance at the 5% level. Control variables Dependent variables MRR HASAB StdEst (p-value) CoefEst (p-value) LAvgCap (p-value) StdRet (p-value) LMedTurn (p-value) Durbin-WuHausman statistic (p-value) 0.6052** (0.0000) 0.5990** (0.0030) -0.0332** (0.0000) -0.0339** (0.0000) 0.17725 (0.6737) 0.00106 (0.9740) 2387 -0.00148 (0.7646) -0.0266** (0.0000) -0.0262** (0.0000) 7.7102** (0.0000) 7.4620** (0.0000) 0.0289** (0.0000) 0.0284** (0.0000) 0.06131 (0.8044) 0.07384 (0.7858) 2387 0.00715 (0.4295) -0.0262** (0.0000) -0.0245** (0.0000) -0.5204** (0.0000) -0.4872** (0.0072) 0.0091** (0.0000) 0.0088** (0.0000) 0.15898 (0.6901) 0.01534 (0.9014) 2387 -0.00171 (0.7207) -0.0057** (0.0001) -0.0057** (0.0010) -0.00628 (0.1073) -0.00244 (0.7616) -0.00251 (0.5405) HASR 44 Obs. 2387 2387 2387 Table 7 Price Informativeness Analysis This table provides estimates of correlations between the permanent price impact measures and two measures of price informativeness about future earnings, FERC and FINC. To construct these measures, we run the following two regressions: 2 2 τ =1 τ =1 rt = a + b0 ∆E t + ∑ bτ ∆E t +τ + ∑ cτ rt +τ + u t rt = a + b0 ∆E t + v t where rt is the quarterly return at time t, rt+τ is the return τ quarters ahead, ∆Et is the change in EPS at time t, and ∆Et+τ is the change in EPS t quarters ahead. All earnings changes are scaled by the price at time t-1. We construct the returns for four quarters in 1999 using CRSP, and take EPS data from COMPUSTAT for 1999 and the first two quarters of 2000 (we set τ = 2). The definitions of the price informativeness measures are: FERC ∫ ∑ bτ τ FINC ∫ R 2 r =a +b0 ∆Et + ∑ bτ ∆Et +τ +∑ cτ rt +τ + u t − R 2 rt = a +b0∆Et + vt 2 2 τ =1 τ =1 Since each firm has only four observations, we group stocks with similar permanent price impact measures and estimate for each group a common price informativeness measure. The permanent price impact measure for the group is the average of the measures of the stocks in the group. For the MRR measure, we sort the sample firms according to MRR, form 279 groups of 10 stocks each, estimate the two equations above for each group separately, and calculate FERC and FINC for the group. A similar procedure is applied to the HASAB and HASR measures. Panel A presents the Pearson correlations between the groups' permanent price impact measures (where a group measure is the mean of the measures of the individual stocks in the group) and both FERC and FINC. Panels B, C, and D present the correlations and p-values of the three information asymmetry measures with FERC and FINC controlling for the amount of public information and liquidity trading by dividing the stocks into four categories. For the MRR measure, we sort the stocks according to AvgCap, and separate into two categories (Low and High) with equal number of stocks. Each category is sorted according to MedTurn and divided into two equal categories. We then sort the stocks within each category according to MRR, and form groups of 10 stocks each. We estimate FERC and FINC for each group and compute the correlations separately for each category. The process is repeated with control variables StdRet and MedTurn. A similar procedure is applied to the HASAB and HASR measures. In all panels, the two-sided p-values for the asymptotic test of zero correlations are shown in parentheses. "**" indicates significance at the 1% level and "*" indicates significance at the 5% level. Panel A: Unconditional correlations MRR HASAB HASR FERC (p-value) -0.0549 (0.2467) -0.0501 (0.2638) -0.0035 (0.9624) FINC (p-value) 0.0223 (0.7134) -0.1316* (0.0147) -0.0899 (0.1349) 45 Obs. 279 279 279 Panel B: MRR Conditional Correlations Control Categories Low AvgCap Low MedTurn Low AvgCap High MedTurn High AvgCap Low MedTurn High AvgCap High MedTurn FERC (p-value) -0.0929 (0.3433) -0.0857 (0.2996) -0.1495 (0.1854) -0.1236 (0.2312) FINC (p-value) -0.1398 (0.2744) 0.1344 (0.2932) 0.1847 (0.2527) -0.1016 (0.3208) Obs. 70 70 70 70 Control Categories Low StdRet Low MedTurn Low StdRet High MedTurn High StdRet Low MedTurn High StdRet High MedTurn FERC (p-value) 0.0249 (0.8508) -0.0491 (0.6863) -0.1875 (0.2489) 0.0097 (0.8959) FINC (p-value) 0.2437 (0.1179) 0.0226 (0.8679) -0.0220 (0.8057) 0.1247 (0.5390) Obs. Control Categories Low StdRet Low MedTurn Low StdRet High MedTurn High StdRet Low MedTurn High StdRet High MedTurn FERC (p-value) 0.0659 (0.5086) -0.0452 (0.6672) -0.1749 (0.0654) -0.2213 (0.0967) FINC (p-value) -0.0233 (0.8443) -0.0063 (0.9398) 0.0770 (0.4581) -0.3831** (0.0006) Obs. Control Categories Low StdRet Low MedTurn Low StdRet High MedTurn High StdRet Low MedTurn High StdRet High MedTurn FERC (p-value) -0.0370 (0.7668) 0.1021 (0.2980) 0.0229 (0.8188) 0.0713 (0.3200) FINC (p-value) 0.1489 (0.2492) -0.0462 (0.6388) -0.0727 (0.4296) 0.0877 (0.2855) Obs. 70 70 70 70 Panel C: HASAB Conditional correlations Control Categories Low AvgCap Low MedTurn Low AvgCap High MedTurn High AvgCap Low MedTurn High AvgCap High MedTurn FERC (p-value) -0.0459 (0.6619) -0.0655 (0.3895) 0.0592 (0.4164) -0.0246 (0.8636) FINC (p-value) 0.0133 (0.8977) -0.1653 (0.1301) -0.0397 (0.6814) -0.0778 (0.5078) Obs. 70 70 70 70 70 70 70 70 Panel D: HASR Conditional correlations Control Categories Low AvgCap Low MedTurn Low AvgCap High MedTurn High AvgCap Low MedTurn High AvgCap High MedTurn FERC (p-value) -0.0839 (0.4768) -0.0011 (0.9886) -0.0164 (0.8863) -0.0682 (0.6454) FINC (p-value) -0.3826** (0.0005) 0.0286 (0.7630) -0.0170 (0.8948) 0.0751 (0.5108) Obs. 70 70 70 70 46 70 70 70 70
© Copyright 2026 Paperzz