Reverse Survivorship Bias - The University of Chicago Booth School

THE JOURNAL OF FINANCE • VOL. LXVIII, NO. 3 • JUNE 2013
Reverse Survivorship Bias
JUHANI T. LINNAINMAA∗
ABSTRACT
Mutual funds often disappear following poor performance. When this poor performance is partly attributable to negative idiosyncratic shocks, funds’ estimated alphas
understate their true alphas. This paper estimates a structural model to correct for
this bias. Although most funds still have negative alphas, they are not nearly as low
as those suggested by the fund-by-fund regressions. Approximately 12% of funds have
net four-factor model alphas greater than 2% per year. All studies that run fund-byfund regressions to draw inferences about the prevalence of skill among mutual fund
managers are subject to reverse survivorship bias.
THE TYPICAL SURVIVORSHIP BIAS argument starts from the observation that
mutual funds often disappear following poor performance. Thus, a study
that conditions on fund survival overstates performance.1 In this paper, I
show that the correlation between performance and survival induces another bias with the opposite sign: mutual fund alphas estimated from a survivorship bias-free data set are biased downward relative to the true alpha
distribution.
The mechanics of this bias are transparent in an example in which investors
learn about fund alphas. Suppose, first, that investors have a prior belief that
a fund’s alpha is zero; second, that the true alpha is fixed; and third, that
investors abandon a fund (and the fund shuts down) when their posterior belief about the fund’s alpha is lower than −ᾱ. Investors update their beliefs
by observing risk-adjusted returns. The posterior mean always lies between
∗ Juhani Linnainmaa is at University of Chicago Booth School of Business. I thank Alan Bester,
John Cochrane, Phil English, Gene Fama, Andrea Frazzini, Bobbie Goettler, Mark Grinblatt,
John Heaton, Markku Kaustia, Bryan Kelly, Matti Keloharju, Ralph Koijen, Annette Larson
(Morningstar), Robin Lumsdaine, Toby Moskowitz, Robert Novy-Marx, Ľuboš Pástor, Antti
Petäjistö, Alexi Savov, Clemens Sialm (NBER discussant), Rob Stambaugh, Matt Taddy, Luke
Taylor, Pietro Veronesi, Z. Jay Wang (WFA discussant), and Russ Wermers for helpful discussions.
I also thank seminar and conference participants at the University of Chicago, the Norwegian
School of Economics and Business Administration (Bergen), American University, BI Norwegian
School of Management (Oslo), Copenhagen Business School, Aalto University (Helsinki), the University of Texas at Austin, the University of Illinois at Chicago, the University of Missouri, the
NBER Asset Pricing working group conference (Winter 2010), and the 2010 Western Finance Association meetings for comments on earlier drafts. Finally, I am especially grateful for the comments
of an anonymous referee, an Associate Editor, and the Editor, Campbell Harvey.
1 See, for example, Brown et al. (1992), Elton, Gruber, and Blake (1996), Carpenter and Lynch
(1999), and Carhart et al. (2002).
DOI: 10.1111/jofi.12030
789
The Journal of FinanceR
790
α
Prior distribution about alpha
Posterior mean: E [α rte, rte – 1 ,..., prior]
0
t
Fund disappears
–αˉ
Reverse
survivorship
bias
Average risk-adjusted return
Figure 1. Illustrating reverse survivorship bias. This figure illustrates the downward bias
in estimated alphas. Investors have a prior distribution about a fund’s alpha centered around zero
and they abandon a fund if the posterior mean falls below −ᾱ. Investors learn about alphas from
risk-adjusted returns. Because the posterior mean is the weighted average of the prior mean and
the signal, a fund can only disappear if the average risk-adjusted return (= the estimated alpha)
is strictly less than −ᾱ. The gap between the true alpha and the estimated alpha is the reverse
survivorship bias.
the prior mean and the average risk-adjusted return. As a consequence, if the
posterior mean ever hits the −ᾱ boundary and the fund disappears, then the
estimated alpha must have been strictly lower than −ᾱ. If not, the posterior
mean could not have crossed the threshold. I call this positive gap between the
true and estimated alphas reverse survivorship bias.2 This bias corresponds
to a counterfactual absent from the data: if the disappearing funds were allowed to survive, their average alpha would increase, converging toward −ᾱ.
Figure 1 illustrates this example.
Reverse survivorship bias arises from the correlation between performance
and fund survival. It does not matter whether this correlation involves learning
or not. To appreciate this bias as a statistical result, suppose a mutual fund
has a fixed alpha α and that its risk-adjusted returns are r̃te ≡ α + ε̃t , where ε̃t
is mean zero and independently distributed. A fund survives for a random T̃
number of months. The expected sum of ε̃t s is zero:3
⎡
E⎣
T̃
⎤
ε̃t ⎦ = 0.
(1)
t=1
2 Pástor, Taylor, and Veronesi (2009) make a similar argument in the IPO literature. If a firm
has an IPO only if the posterior mean about its profitability hits some threshold, then its pre-IPO
profitability must have strictly exceeded this threshold. The market thus rationally expects the
firm to experience a post-IPO drop in profitability.
3 See, for example, Williams (1991).
Reverse Survivorship Bias
791
It follows immediately from equation (1) that the expected average riskadjusted return is
⎡
⎤
⎛
⎞
T̃
T̃
1
1
E⎣
(2)
r̃te ⎦ = α + cov ⎝ ,
ε̃t ⎠ .
T̃ t=1
T̃ t=1
If the survival probability is increasing in the risk-adjusted return, the average
risk-adjusted return is a downward-biased measure of the fund’s true alpha,
T̃ e
r̃t ] < α.
E[ T̃1 t=1
Intuitively, the bias arises from the ambiguity about whether a realized
return is low because the true alpha is low or because the idiosyncratic shock,
ε̃t , is low. A fund manager may have a small positive alpha, but negative
idiosyncratic shocks result in low returns. If such a fund dies, it leaves behind
an alpha estimate that is too low. No mechanism eliminates just-lucky mutual
funds to offset this bias.
This bias is related to, but not an example of, the “baby-boy fallacy”: if parents
stop having children after their first son, there will not be more boys than girls
in the population. The difference between this fallacy and reverse survivorship
bias is the same as the difference between the sums and averages in equations
(1) and (2): do we count the number of boys and girls, or do we compute the
average fraction of girls in each family? If parents follow a stop-at-a-boy rule,
the expected number of boys and girls is the same but the expected fraction
of girls is less than 12 . The fraction of girls will be zero in 12 of families, 12
in 14 of families (girl−boy), 23 in 18 of families (girl–girl–boy), and so forth.
Continuing ad infinitum, the expected fraction of girls in a family is then
( 12 )1 10 + ( 12 )2 12 + · · · = 0.307. Analogously, if mutual funds are more likely to
disappear following “bad” outcomes then, conditional on a fund dying, realized
returns do not constitute a random sample. These returns contain on average
T̃
ε̃t ] < 0, and this negative average
more negative than positive errors, E[ T̃1 t=1
error bleeds into the alpha estimate.
This paper adjusts for reverse survivorship bias by estimating a structural
model with heterogeneous alphas and a correlation between performance and
survival. In estimation I match, between the data and model, the average alphas of both surviving and disappearing funds, the changes in these averages
as funds mature, mutual fund survival rates, the mean and standard deviation of the distribution of estimated alphas, and the volatility of idiosyncratic
shocks. The estimates indicate that reverse survivorship bias is economically
important. Whereas the average true four-factor model alpha is −0.49% after
accounting for reverse survivorship bias, the simple average alpha estimated
from the data is −0.92% per year. The difference, 43 basis points, is the estimate of the size of the reverse survivorship bias in the mean. An alternative regression-based approach places the bias in the mean around 62 basis
points. Because idiosyncratic shocks are very volatile compared to the crosssectional variation in alphas, var(ε̃ j,t ) var(α̃ j ), funds with positive alphas also
frequently disappear because of a run of bad luck. As a consequence, the
792
The Journal of FinanceR
distribution of estimated alphas shows less skill in its right tail than what
there actually is. The structural model estimates suggest, for example, that
12% of funds have true (after-fees) four-factor model alphas greater than 2%
per year.
It is important to understand both the scope and limits of reverse survivorship bias. For example, the return on a strategy that invests the same amount
in each actively managed fund is unaffected by this bias. Its profitability does
not depend on the counterfactual of how well a fund would have performed had
it not disappeared. Reverse survivorship bias is a concern, however, when we
draw inferences about the prevalence of skill among mutual fund managers.
Because the field has converged, by my reading, to the view that the average
dollar invested in actively managed funds probably has a negative alpha,4 its
focus has shifted toward studying heterogeneity in skill. Reverse survivorship
bias becomes an issue when we estimate fund-by-fund regressions to draw
inferences about this heterogeneity.
Several recent papers measure the prevalence of skill (or lack thereof) among
mutual fund managers.5 Kosowski et al. (2006) find that a “sizable minority”
of managers pick stocks well enough to more than cover their costs. Barras,
Scaillet, and Wermers (2010) suggest that the number of negative-alpha funds
(24%) far outweighs the number of positive-alpha funds (0.6%). Fama and
French (2010) set estimated alphas to zero and run simulations to separate luck
from skill. They find only weak evidence of superior skill in the extreme right
tail of the net alpha distribution. Related to my study, they find that the actual
alpha distribution’s left tail is thicker than that of the simulated distribution.
Although this thick tail could arise from either negative stock-picking skills
or high trading costs, it may also signify the presence of reverse survivorship
bias. The average alpha estimate is too low for funds that disappear and so
they end up thickening the distribution’s left tail.
The paper is organized as follows. Section I describes the data and examines
the correlation between survival and performance. Section II estimates the bias
in the mean of the alpha distribution and uses simulations to demonstrate the
distortion in the shape of this distribution. Section III estimates a structural
model to measure the prevalence of skill among actively managed mutual
funds. Section IV concludes and discusses the broader implications of reverse
survivorship bias.
I. Data and Correlation between Performance and Survival
A. Data and Performance Measurement
I use mutual fund data from the Center for Research in Security Prices
(CRSP). Following French (2008), Fama and French (2010), and Barras,
Scaillet, and Wermers (2010), I first include only actively managed funds that
4
See, for example, Fama and French (2010) for a discussion.
Although here I discuss the recent literature, the tradition of estimating alphas fund by fund,
and not for portfolios of funds, goes back to Jensen (1968).
5
Reverse Survivorship Bias
793
invest in U.S. common stocks and then combine different share classes of the
same fund into a single fund. I obtain information on funds’ objectives and
multiple share classes by merging the CRSP database with the Thomson/CDA
database using the MFLINKS product of Wharton Research Data Services.
Barras, Scaillet, and Wermers (2010) suggest that the Thomson fund investment objective information is more consistent over time compared to the CRSP
codes. Moreover, identification of multiple share classes using the CRSP data
set is inadvisable because it is based on an examination of error-prone fund
names. The downside of the Thomson/CDA database is that, if a fund exists
only for a year or two, it may never get added to these data.6
I restrict the sample to mutual funds that start on January 1984 or later
because the earlier data are unreliable.7 A mutual fund enters the sample
after its combined net asset value across all share classes exceeds $5 million
in December 2006 dollars. Once a fund has exceeded this threshold, I keep
the fund in the sample no matter what happens to its net asset value to avoid
selection bias. This procedure guards against the incubation bias of Evans
(2010).8 If a monthly return is missing, I delete the following-month return
because this represents the cumulative return over the reporting gap in the
CRSP database. Similarly, I delete any leading-zero return observations from
fund returns because these appear to be incorrect entries in the CRSP database.
I measure performance throughout this study using the CAPM alpha, the
three-factor model alpha (Fama and French (1993)), and the four-factor model
alpha (Carhart (1997)) by estimating regressions such as
r j,t − rf ,t = α j + b j (Rm,t − rf ,t ) + s j SMBt + h j HMLt + mj MOMt + e j,t ,
(3)
where r j,t is fund j’s month-t return, rf ,t is the risk-free rate, Rm,t is the month-t
return on the value-weighted CRSP index, and SMBt , HMLt , and MOMt are
month-t returns on long-short portfolios for size, value, and momentum. I require a fund to have at least 8 months of return data to estimate its alpha.
The CRSP mutual fund files contain monthly returns up to the end of December 2010. My sample contains data on 1,853 mutual funds that have at least
8 months of return data.
6 I account for this possibility when estimating the structural model by choosing moment conditions that are not influenced by the omission of some funds that do not survive for more than
2 years. I thank Russ Wermers for his notes on the issues surrounding the CRSP mutual fund data
set, MFLINKS, and the Thomson/CDA database.
7 See, for example, Elton, Gruber, and Blake (2001) and Fama and French (2010). The average
returns in the pre-1984 part of the data are significantly higher for mutual funds that report
monthly returns than for those that report annual returns.
8 Incubation bias arises when fund management companies provide seed money to new private
funds and then selectively open the funds with the best histories to the public. When funds can
backfill their return histories for the incubation period, a positive wedge emerges between the
fund’s past performance, which is selected to be good, and its expected performance. Fama and
French (2010) use the same $5 million threshold as their main specification.
794
The Journal of FinanceR
B. Correlation between Performance and Survival
The correlation between fund returns and disappearance is important for
the size of the reverse survivorship bias (see equation (2)). Although a number
of papers9 find a positive correlation between performance and survival, I also
study this correlation because, first, my sample differs from those used in these
earlier studies, and second, these estimates serve as inputs for the structural
model estimation.
Table I reports on average alphas for mutual funds that either survive or do
not survive through the T th year of their lives, conditional on having survived
through year T − 1. If a fund disappears by the end of year T , I compute its
alpha using all available return data. If the fund is still alive at the end of year
T , I compute its alpha using data up to the end of year T . Each column in
Table I reports on average alphas for funds that either survive or disappear in
year T . The four-factor model estimates show, for example, that the average
alpha is −4% per year for a fund that disappears during its fifth year but 0.19%
for those that survive through this year.
The last column reports the cumulative survival rates. If a mutual fund is
T years old at the end of the CRSP mutual fund data set, I count the fund
as a survivor in year 1, . . . , T computations but ignore it in year T +1, . . . , 10
computations. Ten percent of mutual funds disappear before reaching the age
of 4 and approximately one-third disappear by the end of the 10th year. The
proper treatment of dead funds in the estimation of alphas is thus important.
The high survival rates for the first 2 years are consistent with the view that
the Thomson/CDA database may not have data on some of the most short-lived
funds. Whereas the conditional disappearance probabilities for years 3 through
) to 0.051, these probabilities are just 0.005 and
10 range from 0.034 (= 1 − 0.943
0.976
0.019 for the first 2 years.
The alpha estimates in Table I suggest that mutual funds that disappear
perform considerably worse than surviving mutual funds. This conclusion is
not sensitive to the year in which the comparison is made or to the choice of the
asset pricing model used to estimate alphas. The disappearance of poorly performing funds is potentially a significant factor in performance evaluation via
the reverse survivorship bias channel. For example, the reverse survivorship
bias argument suggests the four-factor model alpha estimate of −4% for the
funds that die in their fifth year probably understates these funds’ true alphas.
This average may be low not only because these funds’ true alphas are low but
also because some of them experienced bad luck. It is possible that the average
expected alpha for these funds at the time of disappearance was higher than
−4%. That is, had these funds been allowed to survive, their alpha estimates
would probably have shown some recovery.
Figure 2 plots cumulative risk-adjusted returns for surviving and disappearing funds to show how these returns vary from year to year leading up to fund
9 See, for example, Brown and Goetzmann (1995), Khorana (1996, 2001), Chevalier and Ellison
(1999), and Lynch and Musto (2003).
Reverse Survivorship Bias
795
Table I
Mutual Fund Alpha Estimates Conditional on Fund Survival,
Percentage Points per Year
This table reports mutual fund alpha estimates conditional on a fund surviving or not surviving
through year T . The sample includes 1,853 actively managed mutual funds that (1) invest primarily
in U.S. common stocks, (2) first appear in the CRSP data in January 1984 or later, and (3) have at
least 8 months of return data. The sample period ends in December 2010. Different share classes
of the same fund are combined into a single fund using net asset value weights. I estimate alphas
using a time-series regression that includes monthly returns up to the end of year T (if the fund
survives) or all available data (if the fund disappears). A fund that disappears during year T is
assigned into the year-T “Do Not Survive” pool of funds; funds still alive at the end of year T are
assigned into the year-T “Survive” pool. Alphas are estimated using the CAPM, the three-factor
model, and the four-factor model. I report t-values in parentheses. The last column reports the
fraction of funds that survive for at least T years. These survival rate computations also include
those funds that do not have 8 months of return data.
Fund
Age
1
2
3
4
5
6
7
8
9
10
Survive
Do Not Survive
CAPM
FF
FF+MOM
CAPM
FF
FF+MOM
− 0.62
(− 2.11)
− 1.16
(− 4.97)
− 0.75
(− 3.89)
− 0.11
(− 0.67)
0.15
(0.98)
0.35
(2.67)
0.54
(4.58)
0.73
(6.73)
0.72
(7.03)
0.75
(7.66)
0.08
(0.32)
− 0.21
(− 1.03)
0.02
(0.13)
0.26
(1.66)
0.35
(2.40)
0.32
(2.55)
0.26
(2.30)
0.30
(2.82)
0.21
(2.02)
0.18
(1.78)
− 0.09
(− 0.37)
− 0.36
(− 1.92)
− 0.15
(− 0.91)
0.12
(0.82)
0.19
(1.45)
0.17
(1.48)
0.10
(0.99)
0.15
(1.48)
0.05
(0.54)
0.03
(0.34)
− 5.92
(− 3.77)
− 3.66
(− 2.68)
− 5.42
(− 6.11)
− 4.25
(− 3.84)
− 4.37
(− 6.25)
− 2.73
(− 2.73)
− 2.31
(− 4.96)
− 2.83
(− 4.40)
− 1.62
(− 3.94)
− 2.14
( − 6.04)
− 5.87
(− 3.96)
− 2.98
(− 3.40)
− 3.88
(− 5.67)
− 4.28
(− 4.86)
− 4.48
(− 7.07)
− 2.72
(− 2.90)
− 2.20
(− 5.34)
− 2.90
(− 5.51)
− 1.75
(− 4.74)
− 1.89
( − 5.42)
− 4.57
(− 4.41)
− 2.88
(− 3.02)
− 3.49
(− 5.44)
− 4.74
(− 5.83)
− 4.03
(− 6.78)
− 2.66
(− 3.08)
− 1.97
(− 5.01)
− 2.88
(− 5.11)
− 1.73
(− 4.46)
− 2.04
( − 5.62)
Survival
Rate
0.995
0.976
0.943
0.908
0.868
0.830
0.789
0.754
0.717
0.681
disappearance (or survival). I compute risk-adjusted returns using both the
CAPM and the four-factor model. The three-factor model estimates are similar to the four-factor model estimates and thus are not reported. The month-t
risk-adjusted return from the four-factor model is
r̂ ej,t = R j,t − rf ,t − b̂ j (Rm,t − rf ,t ) − ŝ j SMBt − ĥ j HMLt − m̂j MOMt ,
(4)
where b̂ j , ŝ j , ĥ j , and m̂j are fund j’s loadings on the market, size, value, and
momentum factors. If a fund disappears in year T , I estimate its factor loadings by using all data up to the month of disappearance. If the fund survives, I use all data up to the end of year T . Figure 2 reports on cumulative
risk-adjusted returns for funds that survive for at least 2, 4, 6, 8, or 10 years,
The Journal of FinanceR
796
Panel B: Four-Factor Model
Cumulative Risk-Adjusted Return (%)
Cumulative Risk-Adjusted Return (%)
Panel A: CAPM
Do Not Survive
Survive
Fund Age
Do Not Survive
Survive
Fund Age
Figure 2. Cumulative risk-adjusted returns conditional on fund survival. I estimate
CAPM and four-factor model alphas for all mutual funds that either survive or do not survive
through year T . This figure uses these estimates to plot the cumulative risk-adjusted returns up
to the end of year T or the month in which the fund disappears. This figure plots estimates only for
even-numbered years. The sample includes 1,853 actively managed mutual funds that (1) invest
primarily in U.S. common stocks, (2) first appear in the CRSP data in January 1984 or later, and
(3) have at least 8 months of return data. The sample period ends in December 2010. Different
share classes of the same fund are combined into a single fund with net asset value weights. Solid
(dashed) lines denote average cumulative risk-adjusted returns of surviving (disappearing) funds.
For example, the solid (dashed) year-10 line represents average cumulative risk-adjusted returns
of funds that survive through (disappear during) year 10.
as well as for funds that disappear in years 2, 4, 6, 8, or 10. The estimates
suggest mutual funds often disappear following a string of low returns. Mutual
funds that disappear during their sixth year, for example, cumulatively lose
approximately 15% on a risk-adjusted basis in both specifications.
C. Cox Proportional Hazards Model of Mutual Fund Survival
Table II reports Cox proportional hazards model estimates that measure how
sensitive mutual fund survival is on performance. Each fund enters the sample
for every month of its existence. The time-varying covariate in Panel A is the
fund’s alpha (the first three columns) or the t-value associated with the fund’s
alpha (the last three columns), estimated using monthly returns up to the
month of the observation. Panel B reports on otherwise identical regressions
but lags the covariates by 1 year. Panel C includes Morningstar ratings and
assets under management as additional covariates. I cluster standard errors
by fund to adjust for within-fund correlation in observations.10
10 These regressions are not free of the errors-in-variables problem because the regressor is
a first-stage alpha estimate. The average alphas reported in Table I, however, suggest that the
errors-in-variables problem does not materially influence the inferences.
Reverse Survivorship Bias
797
Table II
Cox Proportional Hazards Model for Mutual Fund Survival
This table reports Cox regressions that measure sensitivity of mutual fund survival to performance,
as measured by funds’ alphas or t-values associated with these alphas. The sample includes 1,853
actively managed mutual funds that (1) invest primarily in U.S. common stocks, (2) first appear
in the CRSP data in January 1984 or later, and (3) have at least 8 months of return data. The
sample period ends in December 2010. Different share classes of the same fund are combined into
a single fund using net asset value weights. I exclude the first 7 months of data for each fund
because only funds that survive at least this long enter the sample. The data consist of 240,784
fund-month observations. Panel A estimates fund alphas and t-values using all data up to the
current month. Panel B lags the alpha estimates 1 year. Alphas are estimated using the CAPM,
the three-factor model (FF), and the four-factor model (FF+M). Alpha estimates are multiplied by
100 so that “1” stands for an annualized alpha of 1%. Panel C reports regressions estimated using
a Morningstar database (see text for details). These regressions include, as additional covariates,
dummy variables for funds’ Morningstar ratings (the five-star category is the omitted dummy) and
the log-value of assets under management (AUM). Standard errors, reported in square brackets,
are clustered by fund.
Panel A: Current Alpha Estimates as Time-Varying Covariates
Covariate: α̂
Hazard ratio
SE
Pseudo R2
Covariate: t(α̂)
CAPM
FF
FF+M
CAPM
FF
FF+M
0.913
[0.005]
0.019
0.900
[0.006]
0.020
0.891
[0.006]
0.022
0.642
[0.019]
0.024
0.713
[0.016]
0.015
0.707
[0.018]
0.015
Panel B: 1-Year Lagged Alpha Estimates as Time-Varying Covariates
Covariate: α̂
Hazard ratio
SE
Pseudo R2
Covariate: t(α̂)
CAPM
FF
FF+M
CAPM
FF
FF+M
0.938
[0.005]
0.013
0.927
[0.005]
0.013
0.922
[0.006]
0.014
0.641
[0.019]
0.024
0.710
[0.016]
0.015
0.704
[0.018]
0.015
Panel C: Fund Survival in the Morningstar Database
Covariate
(1)
t(CAPM α̂)
(2)
(3)
0.616
[0.014]
t(FF α̂)
0.660
[0.013]
(6)
★✩✩✩✩
★★★✩✩
★★★★✩
14.144
[2.909]
8.704
[1.760]
4.218
[0.860]
1.634
[0.359]
ln(AUM, millions)
0.034
0.027
(7)
0.870
[0.022]
0.649
[0.014]
★★✩✩✩
Pseudo R2
(5)
0.862
[0.026]
t(FF + Mα̂)
Morningstar rating:
(4)
0.023
0.023
3.774
[0.850]
3.369
[0.721]
2.198
[0.460]
1.083
[0.247]
0.661
[0.017]
0.091
3.979
[0.887]
3.446
[0.723]
2.201
[0.457]
1.073
[0.244]
0.659
[0.017]
0.091
0.851
[0.021]
3.855
[0.859]
3.356
[0.700]
2.155
[0.445]
1.062
[0.241]
0.659
[0.017]
0.091
798
The Journal of FinanceR
Table II, Panel A, shows that the probability a mutual fund disappears
decreases significantly as the fund’s alpha estimate increases. To put these
estimates in perspective, consider mutual funds that have survived for 4 years.
The unconditional probability that a fund disappears sometime during its fifth
year is 0.044 and the standard deviation of the distribution of estimated CAPM
alphas of these funds is approximately 5%. The hazard ratio of 0.913 in Panel
A for the CAPM alpha suggests that, if a fund’s alpha is one standard deviation above the average, the disappearance probability decreases to 0.028
(= 0.044 × (0.913)5 ). Conversely, a one standard deviation decrease in the alpha estimate increases the disappearance probability to 0.07. The results are
similar when alpha estimates are replaced with the t-values associated with
these alphas. For example, if the t-value associated with the CAPM alpha increases by two, the disappearance probability decreases from its unconditional
value of 0.044 to 0.018. Panel B shows the results are largely the same when
current alpha estimates are replaced with estimates lagged by 1 year. The correlation between performance and survival is thus not only due to exceptionally
poor performance just before a fund disappears.
Panel C includes Morningstar ratings and the log-amount of assets under
management as additional covariates. I use a January 1984 through December 2010 Morningstar mutual fund database for this analysis. The sampleconstruction rules described in Section I.A apply to these data as well. For
example, I again delete a fund’s return history until it has at least $5 million
in assets, and funds have to start their operations in 1984 or later. If a fund
has multiple share classes, the combined fund’s star rating is the (rounded)
value-of-assets weighted average of individual classes’ star ratings. Although
the Morningstar sample is constructed using the same rules as the main sample, they are not identical because the Morningstar and CRSP fund populations
do not overlap perfectly—each database has some funds not available in the
other.11 Columns two through four replicate Panel A’s “t-values” regressions
using the Morningstar sample for reference.
Panel C shows that, in addition to alphas, Morningstar ratings predict the
disappearance of mutual funds. The hazard ratio increases monotonically as
the star rating decreases. While the average monthly disappearance rate is
1.5% for one-star funds in the data, it is just 0.1% for five-star funds. The
first column’s regression with the star ratings has a slightly higher explanatory power than the second column’s regression, which uses the t-value associated with the CAPM alpha. Both the star ratings and alpha t-values remain
significant in the regressions that contain both covariates simultaneously.12
The regressions that include everything—the star ratings, alpha t-values, and
11 The databases diverged in 2008 when CRSP rebuilt its mutual fund database using Lipper
Global Holdings Feed as the data source. The discrepancies and efforts to rebuild the database are
detailed in the database release notes, available at www.crsp.com.
12 Morningstar computes its ratings using up to 10 years of style-specific risk-adjusted returns.
Funds younger than 3 years old are not rated. See www.morningstar.com/help/data.html for additional details. A referee suggests that 401(k) plan sponsors review Morningstar (and Lipper)
Reverse Survivorship Bias
799
log-value of assets under management—have significantly higher pseudo R2
values than the regressions with just some of the covariates.
Panel A’s results are the most relevant for understanding the reverse survivorship bias. This bias arises from the “raw” correlation between the performance evaluation regression’s error term and survival, not from the correlation
that remains after conditioning on other attributes of survival. The other attributes matter for the bias only indirectly through their correlations with
estimated alphas. Suppose, for example, that we run the four-factor model regression for each fund and study the distribution of estimated alphas. The bias
in this distribution is due to the correlation between this regression’s error
term and survival. Because the usual performance evaluation regressions do
not include any variables other than the pricing factors, the partial correlations of survival with variables such as Morningstar ratings do not influence
the bias. The results in Panel C, however, remind us that one could think of this
bias as an omitted variable problem. If there is a variable that correlates with
survival but not with funds’ true alphas, we could use such a variable to reduce
the bias. The fundamental problem, similar to the one we face when searching
for instrumental variables, is that such variables are not easy to find. Morningstar ratings and assets under management, for example, are unsuitable for
this purpose as they probably also correlate with the true alphas.
The Cox regression estimates are similar across the three asset pricing models in all three panels. Table II indicates that the t-values associated with
the CAPM alphas have the most predictive power of fund disappearances, as
measured by the pseudo R2 . Two effects may make it difficult, however, to empirically distinguish between these models. First, as the asset pricing model
becomes more complicated, more data are needed to maintain the precision of
the alpha estimates. If the amount of data is fixed, then the addition of a new
factor decreases the precision at which alphas are estimated. For example, even
if the four-factor model was the model used by the market to evaluate funds,
the noisiness of the four-factor model alphas relative to the CAPM alphas could
tilt the Cox regressions to favor the CAPM. Second, if fund disappearance is
related to investors’ inferences about alphas, the relevant question is not which
model is the true asset pricing model but rather which model was used by investors during the sample period. Before Carhart (1997), for example, mutual
fund performance studies did not typically include the momentum portfolio as
a risk factor (or as a passive benchmark). Cremers, Petäjistö, and Zitzewitz
(2008), moreover, note that practitioners commonly evaluate fund managers
by comparing their returns against long-only benchmark indices. Nevertheless, the estimates in Tables I and II indicate that, no matter which asset
pricing model we choose, performance correlates significantly with fund survival. These findings in turn suggest that reverse survivorship bias may distort
inferences about the prevalence of skill among fund managers.
performance reports as the basis for putting funds on a “watch” list or dropping them. The fact
that Morningstar ratings are significant determinants of disappearance after controlling for alphas
is consistent with the view that the market uses additional information to make decisions about
investing in mutual funds.
The Journal of FinanceR
800
II. Measuring Reverse Survivorship Bias
A. Bias in the Mean of the Distribution
Because some good managers experience poor returns because of bad luck,
and their funds die, the estimated alpha distribution does not show enough
skill. Before using a structural model to extract the full distribution of alphas,
it is instructive to first measure the bias in the mean of the estimated alpha
distribution. The benefit here is that this gap can be measured with relatively
few assumptions about fund returns. In contrast, more structure is needed to
extract the full distribution of alphas from the data.
The shift in the mean of the alpha distribution can be estimated by comparing
two sets of alpha estimates. I assume that funds’ returns conform to a factor
model:
r j,t − rf ,t = α j + Ft β j + ε j,t ,
(5)
where Ft is a 1 × K vector of date-t factor realizations and β j is a K × 1 vector
of fund j’s risk loadings. I assume that neither the fund’s alpha nor its K × 1
risk loadings change over time. If equation (5) describes fund returns, the
N
average fund alpha N1
j=1 α j can be estimated in two ways: (1) run one timeseries regression for each fund to estimate α̂ j s and compute the average alpha,
1 N
j=1 α̂ j , or (2) compute the average fund return for each month and estimate
N
one time-series regression for this average. That is, average equation (5) across
funds,
N
N
N
N
1 1 1 1 r j,t −rf ,t =
α j +Ft
βj +
ε j,t ,
N
N
N
N
j=1
j=1
j=1
j=1
r̄t
ᾱ
β̄
(6)
ε̄t
and estimate the average fund alpha, ᾱ, by regressing the average return
against the factors.
Both methods should yield consistent estimates of the average alpha given
equation (5). The reverse survivorship bias argument, however, suggests that
the fund-by-fund regression approach will produce a lower estimate. Because
fund j’s time series is sometimes cut short because ε j,t is low, the average
T
dead fund’s returns contain more negative than positive errors, E[ T1 t=1
ε j,t ] <
0, and this biases the alpha estimate downward. A comparison of these two
methods can thus be used to measure the size of the reverse survivorship bias
in the mean.
Because the sample only includes funds that start on January 1984 or later,
only a handful of funds are present for the first few years. To facilitate a fair
comparison between the two methods, I begin the sample here in January 1987.
If a fund is born before this date, I use only post-January 1987 return data to
compute its alpha. This approach ensures that the monthly average return in
equation (6) is computed over at least 209 funds, which is the number of funds
present in January 1987.
Reverse Survivorship Bias
801
The two procedures yield very different estimates of the average alpha. The
average annualized CAPM, three-factor, and four-factor model alpha estimates
from the fund-by-fund regressions are −0.69% (t-value = −6.4), −1.10% (t-value
= −12.0), and −1.14% (t-value = −13.2), respectively.13 The alpha estimates
based on monthly returns averaged across funds are notably higher: −0.16%
(t-value = −0.3), −0.51% (t-value = −1.4), and −0.53% (t-value = −1.4). These
estimates thus place the bias in the mean of the distribution around 0.46%
to 0.62% per year, depending on the asset pricing model. Although this comparison indicates that reverse survivorship bias is an economically significant
consideration, it is uninformative about the distortion in the right tail of the
distribution where the skilled managers reside.
B. Simulation Evidence on the Magnitude of the Bias
I run simulations to investigate the distortion in the shape of the alpha distribution. By fixing the true alpha distribution, I can quantify reverse survivorship using a variety of metrics. I assume that funds’ true annual alphas are
normally distributed with mean zero and standard deviation of 1.25%, which
is the same estimate as that used by Fama and French (2010) to frame their
discussion about the amount of skill among managers. I assume that mutual
fund returns are again governed by the factor model in equation (5) and that
investors back out risk-adjusted returns, r ej,t , from raw returns,
r̃ ej,t = α j + ε̃ j,t ,
(7)
where ε̃ j,t ∼ N(0, σ j2 ) and σ j is fund j’s idiosyncratic volatility. I use the CRSP
data to generate a distribution for σ j2 . I estimate monthly CAPM regressions
for all sample funds and record their estimated idiosyncratic variances. I then
fit a gamma distribution to this empirical distribution via maximum likelihood,
and draw σ j2 s from this distribution in the simulations. The average fund has
an annualized idiosyncratic volatility of 7.1%. The 10th and 90th percentiles
are 3.1% and 12.0%, respectively.
A fund’s survival is related to the market’s belief about the fund’s alpha.
Funds are initially indistinguishable from each other and hence the rational
prior distribution about each fund’s alpha is the same as the distribution from
which alphas are drawn, α̂ j,0 ∼ N(0, 0.01252 ). The market knows each fund’s
idiosyncratic volatility, σ j , and uses Bayes’s rule to update from the prior distribution to the posterior distribution based on each return realization:
mj,t+1 = (1 − w j,t )mj,t + w j,t r ej,t+1 ,
(8)
v j,t+1 = σ j2 w j,t ,
(9)
13 The t-values here assume independence across funds and should thus be interpreted cautiously. See Fama and French (2010) for a discussion on across-fund correlations in residuals. The
estimates reported here are higher than those reported in Fama and French (2010, Table II) for an
equal-weighted portfolio of funds because of differences in sample construction and time period.
The Journal of FinanceR
802
where w j,t =
v j,t
v j,t +σ j2
is the weight Bayes’s rule places on month t + 1’s risk-
adjusted return.
I consider two alternative rules of fund disappearance. Under the first rule
a fund disappears if its posterior mean, mj,t , falls below some fixed threshold. I
choose this threshold so that the fraction of funds surviving for at least 10 years
is the same between the simulations and data, 68% (see Table I). This rule sets
the critical threshold at ᾱ = −0.55%. Under the second rule the market uses a
statistical test to determine fund survival. A fund disappears if the probability
that the fund is “good enough” falls below 0.05. To match the 10-year survival
rates between the simulations and the data, I set the threshold at ᾱ = 1.3%,
that is, a fund disappears if the probability that its alpha is at least 1.3% falls
below 0.05.
I simulate data for 10 million funds as follows. I first draw funds’ alphas
independently from each other from the true alpha distribution and generate
sequences of risk-adjusted monthly returns. I then process this sequence and
check for fund survival using one of the two exit rules described above. If a fund
T j e
r j,t , where T j is the
disappears, I record the fund’s estimated alpha, T1j t=1
month fund j disappears. For each fund, I generate a maximum of 10 years of
data. I also run a second set of simulations to generate a randomized-exit alpha
distribution. In these simulations, which use the same draws of alphas and riskadjusted monthly returns as the actual simulations, funds disappear randomly.
When a fund disappears in the actual simulation, a randomly selected fund
closes in the shadow simulation. The funds in the shadow simulations thus
disappear from the data at exactly the same rate as they do in the actual
simulations.
Figure 3 quantifies the reverse survivorship bias in the simulated data using
three measures. Because the results are similar for the two exit rules, here I
only discuss those based on the first rule. The left-hand panel compares the distribution of estimated alphas to the randomized-exit alpha distribution. This
comparison is useful because, even with 10 years of data, alpha estimates are
very noisy. However, the true and randomized-exit distributions have the same
amount of noise, so they are comparable. The left panel shows how the correlation between performance and survival pushes mass toward the distribution’s
left tail. The resulting distortions are significant. First, the mean of the realized
distribution is −1.1% as opposed to the true mean of zero. Whereas 50% of funds
have positive true alphas, the fraction of positive estimated alphas is 45.9%.
Second, the amount of mass that disappears from the distribution’s right tail is
nontrivial. For example, 7.3% of the funds in the randomized-exit alpha distribution have estimated alphas greater than 5% per year. This fraction is 4.2% in
the actual estimated alpha distribution. These numbers are suggestive of the
amount of skill we may not see in the distribution of estimated alphas due to
reverse survivorship bias. In this example, 43% of the mass of the distribution
is missing above the 5% threshold.
While the left panel is useful in suggesting that the realized alpha distribution is indeed distorted, it is less useful in revealing how significantly the
Reverse Survivorship Bias
803
Panel A: Fund disappears if mj,t <−0.55%
Realized Alpha Distribution
Estimated Alpha (%)
All Funds
Survive > 1 Year
All Funds
Survive > 1 Year
Fraction of Alphas Understated (%)
Observed minus True Alpha (%)
Density
Fraction of Alphas Understated
Reverse Survivorship Bias by True Alpha
Observed
Randomized
True Alpha (%)
True Alpha (%)
Panel B: Fund disappears if Pr(αj >1.3% | mj, t , vj, t )< 0.05
Estimated Alpha (%)
Reverse Survivorship Bias by True Alpha
All Funds
Survive > 1 Year
True Alpha (%)
Fraction of Alphas Understated
Fraction of Alphas Understated (%)
Density
Observed
Randomized
Observed minus True Alpha (%)
Realized Alpha Distribution
All Funds
Survive > 1 Year
True Alpha (%)
Figure 3. Reverse survivorship bias in simulated fund data. This figure illustrates the
reverse survivorship bias using simulated data for 10 million funds. Each fund’s alpha is drawn
from N(0, 0.01252 ) and the idiosyncratic volatility distribution is matched to the CRSP mutual
fund data for U.S. equity funds. Upon observing each month’s risk-adjusted return, the market
uses Bayes’s rule to update beliefs about each fund’s alpha, α j , starting from the prior of α j ∼
N(0, 0.01252 ). In Panel A, a fund disappears if the posterior mean about its alpha falls below
−0.55%; in Panel B, a fund disappears if the probability that its alpha is above 1.3% falls below
0.05. These cut-offs are chosen to match the 10-year survival rate between the simulations and data.
The left-hand panel on each row compares the distribution of estimated alphas to a distribution
that would obtain if funds were to disappear randomly at exactly the same rate as they do in
simulations. The middle panel plots the difference between the estimated alpha and the true
alpha, α̂ j − α j , as a function of the true alpha. The right panel measures the frequency with which
the estimated alpha is lower than the true alpha, α̂ j < α j . If the alpha estimates were unbiased,
the line in the middle panel would be flat at zero and the one in the right panel would be flat at
50%. In the middle and right-hand panels, the dashed line excludes funds that disappear before
turning 1 year old.
observed alpha distribution deviates from the true alpha distribution after the
former is stripped clean of estimation noise. The middle and right panels in
Figure 3 focus on this comparison. The middle panel plots the average difference between the estimated and true alphas, α̂ j − α j , as a function of the true
alpha. The logic of this computation is as follows. If we pick α j = 0 on the
x-axis, we consider generating many zero-alpha funds and putting them
The Journal of FinanceR
804
through the simulations. For each fund, we obtain a noisy alpha estimate,
α̂ j . If these estimates were unbiased, the average estimated alpha would conN
verge to the true alpha, that is, lim N→∞ N1
j=1 α̂ j = 0 for zero-alpha funds.
However, as the middle panel shows, the bias is substantial. The average zeroalpha fund’s alpha estimate, for example, is −1.2%. The bias decreases as we
move toward the right tail of the distribution. As we increase the true alpha,
α j , a fund is less likely to die because of bad luck and thus have too low an
alpha estimate. The bias does not, however, disappear. Even for funds with a
true alpha as high as 2%—a value that is 1.6 standard deviations above the
mean—the bias is −0.54%.
The reason the bias exists even in the right tail of the distribution is rooted
in the relative amounts of luck and skill. Whereas the standard deviation of the
true alpha distribution is just 1.25%, the average fund’s idiosyncratic volatility
is almost 10% per year. It is thus not too uncommon an occurrence that a fund
with a positive alpha experiences sufficiently bad luck that it disappears.
The rightmost panel in Figure 3 reports how often the estimated alpha is
lower than the true alpha, α̂ j < α j . If the estimated alphas were unbiased,
they would fall evenly above and below the true alphas. The estimated alpha
is too low for 6% of the zero-alpha funds and for 2% of the funds with a true
alpha of 1% per year. These are funds that disappear because of bad luck. Their
estimated alphas are far lower than their true alphas—if not, they could not
have crossed the disappearance threshold.
The dashed lines in the middle and right-hand panels include funds that
survive for at least a year. Although reverse survivorship bias decreases, its
importance is unchanged. The problem with trying to control for reverse survivorship bias by throwing out short-lived funds is that one then introduces
standard survivorship bias. These simulations suggest that reverse survivorship bias is an important consideration when measuring the prevalence of skill
among mutual fund managers. In addition to having to strip out estimation
noise from the distribution of estimated alphas, we also need to adjust for the
fact that individual alpha estimates may be biased downward.
III. Simulated Method of Moments Estimation
A. Structural Model
In this section, I estimate a structural model of mutual fund survival using
the CRSP mutual fund data. The Internet Appendix constructs and calibrates
an alternative model that endogenizes fund survival.14 This structural model
is related to simulations given in Section II. The difference is that, instead of
making assumptions about the alpha distribution and the disappearance of
funds, I use the model to extract these objects from the mutual fund data.
I assume that each mutual fund’s alpha is drawn from a normal distribution with mean μ and variance σ 2 . The market knows the parameters of this
14
An Internet Appendix may be found in the online version of this article.
Reverse Survivorship Bias
805
distribution. Funds are initially indistinguishable from each other and hence
the rational prior distribution about each fund’s alpha is the same as the distribution from which alphas are drawn, α̂ j,0 ∼ N(μ, σ 2 ). Each mutual fund generates monthly return observations. The market uses some asset pricing model to
obtain an estimate of the fund’s monthly risk-adjusted return, r̃ ej,t = α j + ε̃ j,t ,
where ε̃ j,t is normally distributed with mean zero and variance σe2 . Idiosyncratic variances are the same across funds for simplicity. The market’s asset
pricing model here is correct in that E[r̃ ej,t ] = α j .
The market uses each return realization to update from the prior distribution
to the posterior distribution. Because both the alphas and returns are normally
distributed, the mean (mj,t ) and variance (vt ) of the posterior distribution about
fund j’s alpha evolve according to
where wt =
vt
.
vt +σ j2
mj,t+1 = (1 − wt )mj,t + wt r ej,t+1 ,
(10)
vt+1 = σe2 wt ,
(11)
The posterior variance does not have the fund subscript j
because its process is deterministic and shared by all funds.
I assume that the probability a fund disappears is a function of the fund’s
posterior distribution. I construct the hazard rate as follows. First, let ᾱ denote
some critical level of alpha and record the probability p j,t that fund j’s alpha
ᾱ
is below this level, p j,t ≡ −∞ φ(α; mj,t , vt )dα, where φ() is the normal density.
The term p j,t can be interpreted as a summary statistic about fund quality. A
fund disappears after month t with probability
(12)
H( p j,t , t) = Pr ũ j,t ≤ γ0 + γ1 p j,t + γ2 p2j,t + γ3 t ,
where ũ is an i.i.d. draw from a uniform distribution on the interval [0, 1].
This hazard rate approach avoids the need to specify the economic mechanism
that drives the disappearance of mutual funds. Moreover, because equation
(12) does not force the exit probability to decrease in alpha, it can permit some
successful funds to close as well.15 The structural model can, in fact, “undo”
the built-in learning process if the data suggest such a reversal is warranted.
The eight structural parameters of the problem are the mean and variance of
the alpha distribution, μ and σ 2 ; the variance of risk-adjusted returns, σe2 ; and
the hazard rate parameters, ᾱ, γ0 , γ1 , γ2 , and γ3 .
B. Identifying Structural Parameters
I use 15 moment conditions to identify these eight structural parameters.
The first eight moment conditions are the average alphas of funds that either
survive or do not survive through years 4, 6, 8, and 10. These are the same
15 Kostovetsky (2009), for example, finds evidence of a flight from top-performing mutual funds
to the hedge fund industry. Deuskar et al. (2011), however, caution that it is predominantly the poor
performers who leave the mutual fund industry for smaller and younger hedge fund companies.
806
The Journal of FinanceR
statistics as those reported in Table I except I aggregate the data to the biennial
frequency to reduce noise and I omit the first 2 years because, as discussed
in Section I.A, the fund data here may not be representative. The next four
moment conditions are the drop-out rates for the same 2-year periods, that is,
the fraction of funds that disappear in years 3 and 4, 5, and 6, 7 and 8, and 9
and 10. These first 12 moments conditions are instructive about the hazardrate function parameters. Funds have to drop out from the model at the correct
rate and the drop outs (and survivors) have to have alphas that match those
observed in the data.
The next two moment conditions are the mean and standard deviation of the
estimated alpha distribution.16 These two moments instruct the model to push
the distribution of estimated alphas close to the distribution in the data. The final moment condition is the average standard deviation of idiosyncratic shocks,
which is represented by σe2 in the model. This moment tells the model to make
sure that risk-adjusted returns (and in turn alpha estimates) are just as noisy
as they are in the data. I use these 15 moments to estimate the eight structural
parameters (μ, σ 2 , σe2 , ᾱ, γ0 , γ1 , γ2 , γ3 ) to obtain an overidentified model.
Because the structural model does not yield closed-form estimation equations, I use simulated method of moments (SMM) for indirect inference. I draw
random funds from the true alpha distribution, generate monthly risk-adjusted
returns, update the market’s beliefs using equations (10) and (11), and then
compute the probability that a mutual fund disappears from the data. The
SMM estimator θ̂ is
θ̂ = arg min( M̂ − M(θ ))
W( M̂ − M(θ )),
θ
(13)
where θ is an eight-by-one vector of structural parameters, M̂ is the vector of estimated moments from the data, M(θ ) is the vector of model-implied moments,
and W is an arbitrary positive-definite weighting matrix. I use the optimal
weighting matrix, which is the inverse of the estimated covariance of moments
M.17
The downside of SMM is that the simulated moments inherit simulation
noise. Instead of applying the simulated annealing method to minimize (13), I
exploit the problem’s specific structure to smooth the simulated moments. If the
simulations are for N funds, I first generate N × (120 + 1) draws of standard
normal variables. Every time I evaluate equation (13) for some parameter
16 I compute each fund’s alpha using up to 10 years of data. Because some funds in the CRSP
data start within the last 10 years of the sample, I run the simulations so that the simulated funds
have the same probability of getting truncated as they do in the data. For example, if 10% of funds
in the data can appear in the data for at most 4 years, I let 10% of funds in the simulations appear
for at most 4 years.
17 Some elements of cov(M) cannot be estimated from the data. Funds, for example, do not
have both “survive” and “do not survive” alphas for the same period. Because many off-diagonal
elements, such as those corresponding to covariances between different drop-out rates, are also
identically zero, I diagonalize the estimated cov(M). This procedure, unlike the alternative that
constructs the matrix from pairwise covariances, ensures that the estimated covariance matrix is
positive definite.
Reverse Survivorship Bias
807
vector θ , I recall this set of draws to generate the N alphas and the N × 120
monthly returns. For example, as the structural parameters μ and σ 2 change
slightly, the same primitive draws now correspond to (slightly) different fund
alphas. This approach thus smooths the simulated moments, improving the
behavior of the objective function in equation (13). Because fund exits are
probabilistic, every fund can also be used to generate two observations for each
month, one for the fund that survives through this month and other for the
fund that disappears. I combine these 2 × 120 distinct per-fund observations
with their respective probabilities of being realized to compute M(θ ) in equation
(13). I estimate the model by simulating data for 100,000 funds.
C. Estimation Results
Table III compares the moment conditions between the data and the structural model (Panel A) and reports on the estimates of the eight structural
parameters (Panel B). The structural model matches the salient features of the
mutual fund data. First, the observed alphas are low for funds that disappear
early on but then increase monotonically as funds grow older. For example,
the average four-factor model alpha for funds that disappear in years 3 or 4 is
−4.1% in the data and −4.6% in the model. These averages increase to −1.9%
and −1.5% for funds that disappear in years 9 or 10. Second, the alphas of
the surviving funds also behave similarly in the data and simulations. Third,
the drop-out rates are very close to each other in the model and the data. The
discrepancies between the model and data are never greater than 0.7%. The
last three moment conditions indicate that the model also matches the mean
and volatility of the distribution of estimated alphas and that it is able to keep
the amount of idiosyncratic volatility in line with the data.
The first two parameters in Panel B describe the shape of the true alpha
distribution. Both the mean and volatility of this distribution are higher in the
CAPM than they are in the four-factor model. I limit my discussion here to the
four-factor model results because the CAPM estimates are usually deemed
inappropriate for evaluating funds’ stock-picking abilities.18 The estimated
volatility of the true alpha distribution in the four-factor model, 2.1%, is close
to the upper bound suggested by Fama and French (2010).
Panel C of Table III reports on the means and percentiles of three different alpha distributions. The “Data” row of each block reports the empirical
distribution of alphas in the data. The average alpha over all sample mutual
funds is −0.92% per year in the four-factor model. The second row reports the
distributions of estimated alphas in the model. The mean, −0.91%, is close to
that seen in the data. In addition to the mean, the percentiles of the data- and
model-based distributions are very similar. The 5th and 95th percentiles, for
example, are −7.7% and 5.2% in the data and −8.0% and 4.7% in the model.
18 See, for example, Carhart (1997) and Fama and French (2010). The latter, for example,
comment on this issue by writing that “the CAPM tests are a lesson about how failure to account
for common patterns in returns and average returns can affect inferences about the skill of fund
managers” (p. 1946).
The Journal of FinanceR
808
Table III
Estimates of the Structural Learning Model
This table reports the estimates of a structural model of mutual funds. The sample includes
1,853 actively managed mutual funds that (1) invest primarily in U.S. common stocks, (2) first
appear in the CRSP data in January 1984 or later, and (3) have at least 8 months of return
data. The sample period ends in December 2010. Different share classes of the same fund are
combined into a single fund using net asset value weights. In the model each mutual fund’s alpha
is drawn from a normal distribution with mean μ and variance σ 2 and generates monthly riskadjusted returns r̃ ej,t = α j + ε̃ j,t , where ε̃ j,t ∼ N(0, σe2 ). The market updates its beliefs about α j
from each return realization. The hazard rate for fund j disappearing after month t is H( p j,t , t) =
Pr(ũ j,t ≤ γ0 + γ1 p j,t + γ2 p2j,t + γ3 t), where ũ j,t is an i.i.d. draw from a uniform distribution, p j,t ≡
ᾱ
−∞ φ(α; mj,t , vt )dα, φ() is the normal density, and mj,t and vt are the mean and variance of the
posterior distribution. The eight structural parameters of the problem are the mean and variance
of the alpha distribution, μ and σ 2 ; the variance of risk-adjusted returns, σe2 ; and the hazard
rate parameters, ᾱ, γ0 , γ1 , γ2 , and γ3 . These parameters are estimated using Simulated Method
of Moments with the optimal weighting matrix. The 15 moment conditions are: (1) four moment
conditions for the average alphas of funds that disappear in years 3 or 4, 5 or 6, 7 or 8, and 9 or
10; (2) four moment conditions for the average alphas of the funds that survive through years 4,
6, 8, and 10; (3) four moment conditions for the fraction of funds that disappear in years 3 or 4,
5 or 6, 7 or 8, and 9 or 10; (4) two moment conditions for the mean and standard deviation of the
distribution of estimated alphas, where alphas are estimated using at most 10 years of data and
the censoring in the model is set to match that in the data; and (5) one moment condition for the
average standard deviation of the idiosyncratic shock. All moment conditions and parameters μ,
σ , ᾱ, and σe are annualized in this table. I estimate the model using moments computed from the
CAPM, the three-factor model, and the four-factor model. Panel A compares simulated moments to
sample moments, Panel B reports the structural parameter estimates, and Panel C tabulates the
distribution of estimated alphas in the data (“data” row) and the model (“model, observed” row),
and the distribution of true alphas in the model (“model, true” row).
Panel A: Actual Moments versus Simulated Moments
CAPM
Moment Condition
Data
Model
Alpha Estimates for Disappearing Funds
Years 3–4
− 4.830
− 5.040
Years 5–6
− 3.583
− 3.401
Years 7–8
− 2.552
− 2.414
Years 9–10
− 1.881
− 1.719
Alpha Estimates for Surviving Funds
Years 3–4
− 0.112
0.269
Years 5–6
0.355
0.451
Years 7–8
0.726
0.596
Years 9–10
0.754
0.697
Fraction of Funds Disappearing
Years 3–4
6.760
6.301
Years 5–6
8.143
8.194
Years 7–8
8.543
8.852
Years 9–10
9.056
9.020
Observed Alpha Distribution
Mean
−0.546
−0.587
Standard Deviation
4.835
4.999
SD of Idiosyncratic Risk
9.357
9.340
Three-Factor
Four-Factor
Data
Model
Data
Model
− 4.081
− 3.640
− 2.521
− 1.817
− 4.681
− 3.159
− 2.220
− 1.561
− 4.124
− 3.374
− 2.396
− 1.883
− 4.562
− 3.026
− 2.131
− 1.502
0.265
0.323
0.302
0.176
− 0.016
0.154
0.279
0.360
0.120
0.172
0.146
0.031
− 0.128
0.023
0.132
0.200
6.760
8.136
8.534
9.059
6.775
8.309
8.513
8.370
6.767
8.139
8.537
9.058
6.730
8.289
8.494
8.368
−0.847
4.318
7.355
−0.816
4.304
7.366
−0.920
4.031
7.131
−0.910
4.007
7.142
(Continued)
Reverse Survivorship Bias
809
Table III—Continued
Panel B: Structural Parameter Estimates
CAPM
Parameter
μ
σ
ᾱ
σe
γ0
γ1
γ2
γ3
J-test, χ 2 ( p-value)
EST
−0.129
2.756
−2.846
9.340
0.286
−0.035
0.001
−0.043
9.78
Three-Factor
SE
(0.079)
(0.452)
(3.931)
(0.125)
(1.044)
(0.102)
(0.000)
(0.120)
(0.202)
Four-Factor
EST
SE
EST
SE
−0.406
2.478
−2.173
7.366
0.664
−0.058
0.001
−0.111
13.26
(0.132)
(0.566)
(4.448)
(0.097)
(2.828)
(0.176)
(0.001)
(0.400)
(0.066)
−0.494
2.112
−2.098
7.142
0.767
−0.073
0.002
−0.119
12.92
(0.092)
(0.375)
(2.195)
(0.095)
(1.968)
(0.126)
(0.001)
(0.249)
(0.074)
Panel C: Distributions of Estimated and True Alphas
Percentiles
Model
CAPM
Data
Model
Observed
True
Three-Factor Model
Data
Model
Observed
True
Four-Factor Model
Data
Model
Observed
True
Mean
1%
5%
25%
50%
75%
95%
99%
−0.547
−16.048
−8.568
−2.567
−0.242
1.908
5.969
9.725
−0.587
−0.129
−15.284
−6.539
−9.293
−4.662
−3.261
−1.988
−0.188
−0.129
2.615
1.730
6.662
4.404
9.915
6.282
−0.847
−13.292
−7.719
−2.937
−0.758
1.169
5.720
9.804
−0.816
−0.406
−13.393
−6.170
−8.369
−4.482
−3.051
−2.077
−0.444
−0.406
1.908
1.266
5.369
3.670
8.343
5.359
−0.920
−11.889
−7.665
−2.848
−0.767
0.956
5.202
8.993
−0.910
−0.494
−12.851
−5.407
−7.988
−3.968
−2.921
−1.919
−0.539
−0.494
1.606
0.930
4.746
2.980
7.533
4.419
Given that the distributions of estimated alphas are similar between the data
and model, it is reasonable to ask how the model’s true alpha distribution differs
from the distribution of estimated alphas. That is, what underlying true distribution of alphas is responsible for the distribution of estimated alphas? The
third row of each block, which reports on the true distribution, shows that the
differences between the true and observed distributions are economically significant. In the four-factor model, the true mean alpha is −0.49% per year as opposed to the alpha estimate of −0.92% in the data. The gap between these two,
43 basis points, is comparable to the regression-based estimate in Section II.A.
The estimates of the true alpha distribution suggest that a minority of mutual
managers may pick stocks well enough to cover the fees they impose on their
investors. In the four-factor model, 24% of managers have true alphas greater
810
The Journal of FinanceR
than 1% per year and 12% have alphas greater than 2% per year. These estimates of the size of the skilled-manager group lie between the estimates in
Kosowski et al. (2006) (who give a higher estimate) and Barras, Scaillet, and
Wermers (2010) and Fama and French (2010) (who give lower estimates). My
estimates suggest that a minority of managers may be able to pick stocks well
enough to cover the costs they impose on their investors.
Although my estimates suggest that some fund managers have skill, they offer a rather bleak assessment about one’s ability to find them using past returns
alone. The problem is that the average fund has a negative alpha (−0.49% per
year in the four-factor model) and returns are so noisy compared to the amount
of cross-sectional variation in alphas. As a consequence, the market resolves
uncertainty about alphas very slowly. In the underlying normal–normal updating problem, the variance of the date-T posterior is
vT =
v0 σe2
.
T v0 + σe2
(14)
The posterior variance thus reaches some level vT in T = ( v1T − v10 )σe2 years,
an amount of time that increases linearly in the variance of returns. Based
on the four-factor model estimates, it takes 11.4 years for the variance of the
belief distribution to decrease by half. My results offer no advice on how to
find these skilled managers. To do so, one may have to study fund managers’
characteristics (Chevalier and Ellison (1999)), the way they tilt their portfolios
(Coval and Moskowitz (2001)), and the social networks they may be able to
benefit from (Cohen, Frazzini, and Malloy (2008)).
IV. Conclusions
Reverse survivorship bias affects the measurement of mutual fund managers’
stock-picking abilities. Because mutual funds often disappear following poor
performance, some funds disappear because of bad luck and not because their
true alpha is low. A fund that disappears because of a negative idiosyncratic
shock leaves behind an alpha estimate that is too low. Because no mechanism
eliminates just-lucky mutual funds, the observed distribution of alphas gives
too pessimistic a view about the prevalence of skill among actively managed
funds. In this paper I estimate a structural model to measure the size of the
bias using CRSP mutual fund data.
The size of the reverse survivorship bias in the mean, around 60 basis points
per year, is similar in magnitude but opposite in direction to that of the direct
survivorship bias that plagued early mutual fund databases.19 Thus, if we
take a database that excludes all dead funds and then start adding them back
19 Grinblatt and Titman (1989) estimate that the direct (upward) survivorship bias is “relatively
small” and between 10 and 30 basis points per year; Brown and Goetzmann (1995) get estimates
between 20 and 80 basis points; Elton, Gruber, and Blake (1996) find estimates between 71 and 91
basis points based on three-factor model alphas; and Carhart et al. (2002) show that the bias can
be as large as 1% in samples longer than 15 years.
Reverse Survivorship Bias
811
in, the (positive) direct survivorship bias decreases but the (negative) reverse
survivorship bias begins to drag fund-specific alpha estimates down. When all
dead funds are in, the mean alpha estimate is too low by approximately 60
basis points per year.
The question of whether any active fund managers have valuable information
is a central question in the field of empirical asset pricing. Although the average actively managed dollar must lose money because of the market-clearing
constraint and transaction costs,20 nothing rules out the possibility that some
active investors profit at the expense of others. Some mutual funds, for example, could be privy to a stream of signals that lets them trade profitably against
other mutual funds, individual investors, or pension funds. Reverse survivorship is important in examining whether any mutual fund managers trade on
valuable information. To properly measure the prevalence of skill among mutual fund managers, we need to consider the counterfactual of what would have
happened had no fund disappeared following poor performance.
I do not consider the equilibrium effects of money flows on alphas. In Berk
and Green (2004), alphas get pushed to zero because investors can perfectly
diversify all nonmarket risk and because the model’s mutual fund monopolistically sets its fee to choose its size. In Pástor and Stambaugh (2012), by contrast,
equilibrium alphas do not equate to zero because, first, investors cannot diversify away all risk, second, the active management industry is competitive, and
third, there is a finite number of mutual fund investors. Nevertheless, even
if expected alphas are sensitive to money flows, equilibrium considerations do
not alter the premise of reverse survivorship bias. As long as poorly performing mutual funds still shut down, as they do in the data, estimated alphas are
biased downward relative to the true alphas.
Reverse survivorship bias has implications beyond assessing the abilities
of mutual fund managers. First, hedge funds, pension funds, and individual
investors also often cease trading following poor performance.21 The reverse
survivorship bias argument applies to inferences about these investors’ abilities
as well. Individual investors’ true alphas, for example, are not necessarily as
low as what the distribution of estimated alphas suggests. Second, two-stage
inferences about the determinants of skill become problematic. Consider, for
example, a study that seeks to understand variation in CEO skill. If CEOs’
survival correlates with performance, a panel regression of performance with
CEO fixed effects22 will produce coefficient estimates that exhibit the same
reverse survivorship bias. This bias plays a further role if these estimates are
then used as inputs to investigate determinants of ability. If there is a variable
that correlates with the sensitivity of CEO survival to bad performance, this
variable will seem to explain variation in CEOs’ abilities because of the errorsin-variables problem. The same concern extends to any two-stage analysis of
mutual funds, hedge funds, and individual investors.
20
See French (2008) for a discussion.
See, for example, Brown, Goetzmann, and Park (2001), Seru, Shumway, and Stoffman (2010),
and Linnainmaa (2011).
22 See, for example, Bertrand and Schoar (2003).
21
812
The Journal of FinanceR
No simple resolution to reverse survivorship bias seems to exist. One cannot
resolve this bias, for example, by omitting funds that survive for fewer than T
years. The issue is that, although fund performance improves as T increases,
we do not know a priori what T should be. Moreover, even if we could choose
T so that the average estimated alpha coincides with the true mean, the distribution of estimated alphas would still be distorted. This distortion would
render inferences about the prevalence of skill invalid. A viable resolution to
the bias would be to estimate fund alphas by using as little data as possible
from the very beginning of funds’ lives, before any funds shut down. However,
even though in principle this approach circumvents the bias, the problem is
that these alpha estimates would be based on just a handful of data points and
thus would be too noisy to be of any use. One could also approach this bias as
an omitted variable problem and search for a variable that correlates with survival but not with true alphas. It is not obvious, however, that such a variable
exists. Another partial remedy might be to study the distribution of t-values
associated with the alpha estimates and not the alpha estimates themselves.
However this approach is similar to discarding short-lived funds. By construction, short-lived funds’ t-values are pulled toward zero because their alphas are
estimated with greater noise. It seems necessary to impose some structure to
draw valid inferences about the true alpha distribution. The problem of finding
a remedy that is not only effective and robust but also simple to implement is
left for future research.
Initial submission: May 9, 2011; Final version received: February 28, 2012
Editor: Campbell Harvey
REFERENCES
Barras, Laurent, Olivier Scaillet, and Russ Wermers, 2010, False discoveries in mutual fund
performance: Measuring luck in estimated alphas, Journal of Finance 65, 179–216.
Berk, Jonathan B., and Richard C. Green, 2004, Mutual fund flows and performance in rational
markets, Journal of Political Economy 112, 1269–1295.
Bertrand, Marianne, and Antoinette Schoar, 2003, Managing with style: The effect of managers
on firm policies, Quarterly Journal of Economics 118, 1169–1208.
Brown, Stephen J., and William N. Goetzmann, 1995, Performance persistence, Journal of Finance
50, 679–698.
Brown, Stephen J., William Goetzmann, Roger G. Ibbotson, and Stephen A. Ross, 1992, Survivorship bias in performance studies, Review of Financial Studies 5, 553–580.
Brown, Stephen J., William N. Goetzmann, and James Park, 2001, Careers and survival: Competition and risk in the hedge fund and CTA industry, Journal of Finance 56, 1869–1886.
Carhart, Mark M., 1997, On persistence in mutual fund performance, Journal of Finance 52,
57–82.
Carhart, Mark M., Jennifer N. Carpenter, Anthony W. Lynch, and David K. Musto, 2002, Mutual
fund survivorship, Review of Financial Studies 15, 1439–1463.
Carpenter, Jennifer N., and Anthony W. Lynch, 1999, Survivorship bias and attrition effects in
measures of performance persistence, Journal of Financial Economics 54, 337–374.
Chevalier, Judith, and Glen Ellison, 1999, Career concerns of mutual fund managers, Quarterly
Journal of Economics 114, 389–432.
Reverse Survivorship Bias
813
Cohen, Lauren, Andrea Frazzini, and Christopher Malloy, 2008, The small world of investing:
Board connections and mutual fund returns, Journal of Political Economy 116, 951–979.
Coval, Joshua D., and Tobias J. Moskowitz, 2001, The geography of investment: Informed trading
and asset prices, Journal of Political Economy 109, 811–841.
Cremers, Martijn K. J., Antti Petäjistö, and Eric Zitzewitz, 2008, Should benchmark indices have
alpha? Revisiting performance evaluation, Critical Finance Review, forthcoming.
Deuskar, Prachi, Joshua Matthew Pollet, Zhi Jay Wang, and Lu Zheng, 2011, The good or the bad?
Which mutual fund managers join hedge funds? Review of Financial Studies 24, 3008–3024.
Elton, Edwin J., Martin J. Gruber, and Christopher R. Blake, 1996, Survivorship bias and mutual
fund performance, Review of Financial Studies 9, 1097–1120.
Elton, Edwin J., Martin J. Gruber, and Christopher R. Blake, 2001, A first look at the accuracy of
the CRSP mutual fund database and a comparison of the CRSP and Morningstar mutual fund
databases, Journal of Finance 56, 2415–2430.
Evans, Richard B., 2010, Mutual fund incubation, Journal of Finance 65, 1581–1611.
Fama, Eugene F., and Kenneth French, 1993, Common risk factors in the returns of stocks and
bonds, Journal of Financial Economics 33, 3–56.
Fama, Eugene F., and Kenneth French, 2010, Luck versus skill in the cross section of mutual fund
returns, Journal of Finance 65, 1915–1947.
French, Kenneth R., 2008, Presidential address: The cost of active investing, Journal of Finance
63, 1537–1573.
Grinblatt, Mark, and Sheridan Titman, 1989, Mutual fund performance: An analysis of quarterly
portfolio holdings, Journal of Business 62, 393–416.
Jensen, Michael C., 1968, The performance of mutual funds in the period 1945–1964, Journal of
Finance 23, 389–416.
Khorana, Ajay, 1996, Top management turnover: An empirical investigation of mutual fund managers, Journal of Financial Economics 40, 403–426.
Khorana, Ajay, 2001, Performance changes following top management turnover: Evidence from
open-end mutual funds, Journal of Financial and Quantitative Analysis 36, 371–393.
Kosowski, Robert, Allan Timmermann, Russ Wermers, and Hal White, 2006, Can mutual fund
“stars” really pick stocks? New evidence from a bootstrap analysis, Journal of Finance 61,
2551–2595.
Kostovetsky, Leonard, 2009, Human capital flows and the financial industry, Working paper,
University of Rochester.
Linnainmaa, Juhani, 2011, Why do (some) households trade so much? Review of Financial Studies
24, 1630–1666.
Lynch, Anthony W., and David K. Musto, 2003, How investors interpret past fund returns, Journal
of Finance 58, 2033–2058.
Pástor, Ľuboš, and Robert F. Stambaugh, 2012, On the size of the active management industry,
Journal of Political Economy 102, 740–781.
Pástor, Ľuboš, Lucian A. Taylor, and Pietro Veronesi, 2009, Entrepreneurial learning, the IPO
decision, and the post-IPO drop in firm profitability, Review of Financial Studies 22, 3005–
3046.
Seru, Amit, Tyler Shumway, and Noah Stoffman, 2010, Learning by trading, Review of Financial
Studies 23, 705–739.
Williams, David, 1991, Probability with Martingales (Cambridge University Press, Cambridge,
MA).
Supporting Information
Additional Supporting Information may be found in the online version of this
article at the publisher’s web site:
Appendix S1: Internet Appendix