Explaining the The case facts with adaptive agents: of mutual fund

JOURNAL OF
Journal
ELSEVIER
Economic
DpalnicS
& Control
of Economic Dynamics and Control
21 (1997) 1117-1147
Explaining the facts with adaptive agents:
The case of mutual fund flows
Martin Lettau
CentER j6r Economic Research and Department of Finance, Tilburg Universily,
P. 0. Box 90153, 5000 LE Tilburg, Netherlands
Received
1I April 1995; accepted 4 December 1996
Abstract
This paper studies portfolio decisions of boundedly rational agents in a financial market.
Learning is modeled via a genetic algorithm. Learning as modeled in this paper leads
agents to hold too much risk as compared to the optimal portfolio of rational investors.
Moreover, adaptive agents exhibit an asymmetric response after positive and negative
returns where the portfolio adjustment is more pronounced after negative returns. It is
demonstrated that investors in mutual funds show the same investment patterns as the
adaptive agents in the model. A model with entry and exit of agents is able to match the
mutual fund data closely.
Keywords: Genetic algorithm; Mutual fund flows; Learning;
1EL clussiJcution:
D83; Gl I
Bounded rationality
I. Introduction
Standard models of financial markets imply that rational utility maximizing
agents should not trade for speculative reasons (see e.g. Milgrom and Stokey,
I thank John Campbell, David Easley, Dan Friedman,
Kai-Uwe Kiihn, Blake LeBaron, Georg
NGldeke, Ariel Rubinstein, Harald Uhlig, Timothy Van Zandt, and two anonymous referees for
helpful comments. Seminar participants
at Carnegie Mellon, Carlos 111 (Madrid), EUI (Florence),
IAE (Barcelona),
IAS (Vienna), Ohio State, Pompeu Fabra, Princeton, the Santa Fe Institute and
Tilburg provided many constructive remarks. I am also grateful for the hospitality of the Santa Fe
Institute where part of this research was done. The Investment Company Institute kindly provided
the mutual fund flow data. An earlier version of this paper was distributed under the title “Adaptive
Learning in a Financial Market.” All remaining errors are mine.
0165-l 889/97/$17.00 8 1997 Elsevier
PII SO165-1889(97)00046-S
Science B.V. All rights reserved
1118
M. LetraulJournal
of Economic Dynamics and Control 21 (1997)
1117-1147
1982). These results are at odds with high trading volume in many financial
markets. Since these no-trade theorems rely on the strong assumption that it is
common knowledge that all market participants are rational, it remains unclear
which kind of trading patterns emerge once the common knowledge assumption
is relaxed. In this spirit, this paper studies portfolio decision of boundedly rational
agents. Agents are boundedly rational in the sense that they cannot compute the
calculations required for expected utility maximization. Hence, they have to learn
from observed outcomes of their investment decision. This learning process is
modeled via a genetic algorithm as developed by Holland (1989). One feature of
adaptive agents is that they adjust their portfolio composition after observing the
return of past investment decisions. This behavior leads to interesting deviations
from rational decisions. First, the adaptive agents tend to take on too much risk
compared to rational agents. The magnitude of this risk taking bias depends on
the number of market observations that the agents use before they update their
investment portfolio. Under certain conditions this bias does not vanish as their
lifetime increases. Second, the risk taking bias leads to an asymmetric response
after positive and negative returns: the portfolio adjustment after negative returns
is larger in absolute value than after positive returns.
Adaptive behavior that leads to systematic deviations from full rationality might
be able to shed light on some of the puzzles in the finance literature. I use
data on flows into mutual funds to present evidence that investment strategies
of mutual fund investors cannot easily be explained by models with rational
agents. However, the mutual fund flows are consistent with adaptive investment
behavior as exhibited by the adaptive agents as modeled in this paper. The data
set used in this study differs from most other studies on mutual fund flows in
that it uses aggregate flows instead of flows of individual mutual funds. This
data set contains monthly net monetary flows into aggregate groups of funds.
In other words, individual funds with similar investment objectives are aggregated
together into classes such as aggressive growth funds or growth and income funds.
Most previous studies on mutual fund flows focused on the monetary movements
across individual funds, see e.g. Ippolito (1992), Lakonishok et al. (1992) and
Sin-i and Tufano ( 1993). A common result is that flows into individual funds are
positively correlated with its past performance. This might be expected because
the performance of a fund might reveal information about the quality of the fund
manager which in turn will lead rational agents to move money into funds which
performed well in the past. The advantage of a methodology using aggregate fimd
flows is that the information about fund managers will cancel out and therefore
cannot lead to flows across fund classes. Hence, any imperfection in information
about fund managers can be ignored and the focus of this study will be how
investors change their portfolio between mutual funds with different investment
objectives and hence different risk structures. The most important findings are
as follows. Mutual fund flows are highly positively correlated with returns of
the funds. After a positive return, investors move funds into mutual funds while
M. LetrauIJournal
of Economic Dynamics and Control 21 (1997)
1117-1147
1119
they decrease their holdings in mutual funds after a negative market return. The
correlation is higher for riskier funds than for low risk mutual funds. Moreover,
the flows after a negative return are larger than after a positive return. Both
of these features are present in the investment behavior of the learning agents.
Indeed, when confronted with actual stock returns the adaptive agents produce
patterns which are very close to the mutual fund flow data.
How do these results square with conventional financial theory? In the standard
model following Markowitz (1952, 1959) and Sharpe (1964) the optimal portfolio
strategy is a passive one, i.e. investment flows should not follow any pattern
and can only be due to liquidity needs. The no-trade theorems of Milgrom and
Stokey ( 1982) show that rational agents should not trade with each other even
when there is asymmetric information. Wang (1993) presents a model which is
partially consistent with the mutual fund flow patterns. In his dynamic model with
asymmetric information
and noise trading, the uninformed
speculators behave
under certain parameter values like price chaser: buy when prices go up and
sell when they go down. This is just the investment pattern of mutual fund
investors. However, this result obtains only under very specific parameter values;
for example, it requires a very high amount of asymmetric information and strong
mean reversion in the process of noise trader’s demand. Moreover, the correlation
between the payoff and the portfolio adjustment in his model is much smaller than
in the data, even with the most favorable parameter choices. Since the optimal
investment policy is symmetric with respect to positive and negative returns, this
model is also inconsistent with the asymmetry found in the data. While the Wang
model is at least partially consistent with the mutual fund flow data, traditional
models are not able to create a substantial amount of investor’s responsiveness
to
current payoffs. Models with boundedly rational agents are a promising alternative
to explain trading volume and flow data. ’
In this spirit, I study how boundedly rational agents learn about their investment behavior in a very simple financial market. Agents are assumed to be unable
to perform any kind of maximization
as required in standard settings with rational
agents. They learn solely from outcomes of past investment decisions. The model
is kept as simple as possible to isolate the effects of learning compared to rational
behavior. Agents have to decide how much to invest in a single risky asset. Their
investment horizon is one-period so that learning takes place as repeated one-shot
investment decisions. Two version of the model are studied. The first model models a population of agents whose investment portfolio converges to a common
value over the lifetime of the agents. The second model consists of a population
of agents with new agents coming into the market and some of the existing agents
leaving the market. In this way I can compare the behavior of the adaptive agents
to the mutual fund flow data. The second model is able to replicate the behavior
’ Strictly speaking, the Wang (1993)
models since they assume completely
model and many others using noise trading
irrational noise traders to break the no-trade
are not ‘rational’
theorems.
1120
M. Lettoul
Journal
of Economic
Dynamics
and Control
21 (1997)
II 17-l I47
of mutual fund investors more closely than any available model with rational
agents. The adaptive agents produce a reasonably high correlation of returns and
portfolio adjustment and they also exhibit the asymmetric reaction after positive
and negative returns. I therefore conclude that models with adaptive agents provide a viable alternative to conventional rational models in explaining observed
behavior of participants in financial markets. It remains to be seen whether it is
possible to explain a wider array of puzzles with boundedly rational agents. 2
The remainder of the paper is organized as follows. Section 2 defines the
model and describes how agents learn. The results for the single agent model
are presented. Section 3 gives the empirical results of the mutual fund flow data,
presents the multi agents model and compares the results of the adaptive agents
to the behaviour of the mutual fund investors. Section 4 concludes.
2. A financial market with adaptive agents
2.1. The model
As mentioned in the introduction, the model is taken to be a very simple one.
In order to obtain an analytical solution I make assumptions so that optimal portfolio decisions are linear in the expected excess return of the asset. Specifically,
I assume that there is a single asset whose value is normally distributed (with
mean 0 and variance cr,“), and that the agent’s utility of net payoff w has constant
absolute risk aversion with coefficient y:
U(w) = - exp( -yw).
(1)
In order to maximize expected utility the agent has to decide how many units
s of the risky asset she should buy. Let pe be the price of the asset. I assume
that the price of the risky asset is determined exogenously and is not influenced
by the adaptive agents. This is a restrictive assumption but can be defended on
multiple grounds. First, the focus of this paper is on how adaptive agents change
their portfolio over time and not on the patterns in prices generated by boundedly
rational investors. Given the available data set on mutual fund flows, a model with
exogenous prices appears to be a reasonable simplification. Of course, studying
the price formation in this market might also be in interesting extension. Second,
orders to change holdings in mutual funds are carried out only with a fairly
long lag, usually half a day, often only at the end of a trading day. Even if
a fund investor tried to react to current information, such as prices, his investment
decision will not impact prices until some later period. Hence, demand of fund
investors cannot affect current prices of assets. If there is a link from demand
2 Timmerman
returns.
(1993) uses
a learning model to explain excess volatility
and predictability
of stock
M. LettauIJournal
01 Economic Dynamics and Control 21 (1997)
1117-1147
1121
to current prices, it has to be lagged. Hence, it seems a reasonable modeling
approach to assume that prices are given to mutual fund investors and that they
can only react with a lag to observed prices. The real shortcut is that the delayed
reaction is ignored in terms of their impact on prices.3
The agent’s net payoff given s and the realization of u is given by w = ~(a--PO).
The utility of the payoff is evaluated every period so that there is no dynamic
link between periods. The risk free interest rate is assumed to be zero. Under the
full rationality paradigm the optimal solution is relatively easy to calculate. The
optimal portfolio is a linear function of the mean value of the asset E and the
current price PO,
with optimal coefficients CL;and cl;:
1
LX;= CL;= --Yj’
P”
Note that for the agent to calculate her optimal portfolio she must know all
the relevant parameters: the distribution of the asset and her own utility fimction. Moreover, she must be able to perform the necessary calculations to obtain
the above equations. None of these requirements is necessary when we assume
that the agent is learning the portfolio decisions through an inductive learning
process. Of course, there are many ways to model learning in the context of
portfolio decision making. In this paper I model learning in a very simple way.
Agents observe the current market outcome and revise their next periods portfolio
using this observation. This is obviously a very stylized model of learning and
restricts agents severely in the way they can form portfolios. For example, they
are assumed not to use any statistical model and use observations to estimate it.
Despite these restrictions, the learning model captures the way many investors
behave in real markets such as chartists or technical traders.
Mutual fund investors appear to be a logical starting place to model boundedly rational behavior in financial markets. Apart from arguments made by the
popular press that mutual fund investors are the least informed participants in
financial markets, there are other theoretical arguments as well. The decision of
mutual fund investors to consciously put their money into the hands of fund
managers can be viewed as a decision in terms of an optimal allocation of time.
Researching the financial market is time-intensive and hence it might be optimal
3 Of course, this simplification
leaves a number of interesting questions unanswered.
For example,
it should be interesting to study how more rational investors could exploit such behavior. It is not
clear, however, that investors with this type of boundedly rational behavior should be driven out of
the market by more rational investors, see e.g. Palomino (1996). It is also not obvious how to write
down a model including GA investors and rational investors. Because the GA is highly nonlinear
algorithm, no closed-form solution for the behavior of the rational agents exists.
1122
M. Lettaut Journal of Economic Dynamics and Control 21 (1997)
1117-1147
to allocate one’s time into other activities. Instead, mutual fund investors choose
to use simple ways to adjust their portfolio without spending too much time on
specific investment decisions. Often noise-trading models have been motivated by
these arguments. In this paper, I assume a weaker notion of noise trading in that
investors use a simple learning method to improve their investment decisions. The
GA described below allows investors to learn from past experience. However, the
learning mechanism is not optimal in the Bayesian sense. Investors use a mechanistic algorithm to update their portfolio decisions. The GA is a useful tool in
this context, because on the one hand more successful investment policies are
more likely to be used again in the future, and on the other hand, new investment policies are introduced by combining existing ones. These features appear
to be sensible ingredients when one aims at modeling adaptive behavior in financial markets. In that sense, modeling the behavior of mutual fund investors
in this very simple way appears to be a reasonable framework. Of course, the
results depend on the learning model and different learning models will most
1ikeIy produce different results.
2.2. The genetic algorithm
GAS were introduced by Holland (1989) to study learning, adaption, and optimization in complex systems. The basic idea of GAS is taken from processes
of evolutionary dynamics. A GA consists of a set of operations which manipulate a given population which is defined as a finite number of strings. In most
applications a string is a list of binary numbers, i.e. bits. The interpretation of
a string depends on the application. When applied to game theory each string
might represent a strategy of an agent. For the purpose of this paper a string
will be interpreted as a set of coefficients in an agent’s demand function. The
population of binary strings represents a set of economic agents whose behavior
is characterized by the strings. Each string is assigned a measure of performance.
In economic applications this fitness criterion might be a payoff or a utility level.
The task of the GA is to manipulate the strings to improve their performance
according to the fitness criterion. In other words, the population evolves over time
creating new strings which are better adapted to the environment. The key task
of the GA is to produce new populations based on the existing ones.4 Usually
the initial population is generated randomly.
To perform this transformation, the GA first selects copies of strings in the
current population. This is done by random draws from the existing population.
The probability that a given string is copied to the new population is based on
its performance or fitness: a string that did well according to the fitness measure will be more likely to be copied than a string with a lower fitness. The
4 The GA used in this paper is a fairly standard version. Buckles and Petry (I 992) and Goldberg
(1989) describe different implementations of GAS.
M. LetrauIJournal
of’ Economic Dynamics
and Control 21 (1997)
1117-1147
1123
GA then introduces new strings through two genetic operators that alter some
strings of the population. These two operations are called crossover and mutation. Both operations retain important characteristics of the ‘better’ strings in the
existing population. Crossover is the most important genetic manipulation.
In real
organisms, chromosomes rearrange their genetic code through a similar process.
Specifically, two strings are chosen randomly from the parent population. They
are lined up against each other, a point is selected randomly and the portions to
the right of the point are swapped between the two strings. To give an example,
suppose the strings ( 101101) and (011000) are selected. Suppose the crossover
point is chosen to be three. After crossover we obtain (101000) and (011101).
Both new strings replace the parent strings in the new population. The key feature
of the crossover operator is that it supports the development of compact building
blocks in the population. A compact building block is a block of genes that are
located next to each other. These building blocks tend to survive crossover and
in the long run only ‘successful’ blocks will prevail. It turns out that crossover
enables GAS to focus their attention on the most promising parts of the search
space. More precisely, crossover guarantees that all building blocks of the search
space are sampled at a rate proportional to their fitness (see Holland, 1989 for
a proof). The second genetic operator is mutation. Mutation simply flips a 0 to
a 1 and vice versa. Each chromosome undergoes mutation with a given (small)
probability.
The purpose of mutation is to introduce new genetic material and
to avoid the development
of a uniform population which will be incapable of
further evolution over time.
Without getting too deeply into the theory of GAS, I will illustrate briefly
why GAS are a powerful tool for simulating evolutionary
processes, including
economic learning. GAS exhibit implicit parallelism. In effect, each binary string
covers many different regions of the search space. For example, the string (10010)
belongs to the region of all strings whose first chromosome
is a 1. But it also
belongs to the region of strings which end with 10. This feature allows the GA
to cover many different regions with relatively few strings. This property is very
important when the GA is faced with nonlinear problems. Moreover GAS are also
able to balance the trade-off between exploration and exploitation.
Exploitation
means that an algorithm exploits actions which are currently successful. However,
if exploitation
is overemphasized
the system bears the potential cost that the
current action might not be the globally optimal one. To avoid this dilemma, the
system has to explore new parts of the search space. It has to try new things;
most of them will fail, of course, so too much exploration
will hurt overall
performance. Striking a healthy balance between exploration and exploitation is an
important component of adapting and learning systems. It can be shown (see e.g.
Holland, 1989) that GAS exhibit a very efficient trade-off between exploration
and exploitation. 5
’ The proof of this statement
is closely linked to the well-known
multi-armed
bandit theorems
1124
M. Lettaul Journal of Economic Dynamics and Control 21 (1997) 1117-1147
Another positive feature of GAS is that they can be employed in a very broad
class of models. In other words, regardless of the economic model the adaptive
agents endowed with a GA always use the same set of operations. Of course,
one can always find more elaborate rules, e.g. Bayesian learning, but this requires
learning methods that can vary considerably from one situation to another. For
example, a Bayesian learning rule is usually strongly affected by the statistical
distribution of what is unknown. A set of learning rules that can be used in more
general situation seems to be preferable for modeling learning in an economic
context.
Examples of GA applications in economics include Arifovic (1992, 1994, 1995,
1996), Miller (1986) and Miller and Andreoni (1990). Some of these papers
compare the behavior of the GA with experimental evidence and conclude that
the GA tends to produce results which are consistent with experiments. When
compared to other learning models such as Bayesian learning or least squares
learning, the GA tends to do a better job in reproducing the experimental data.
2.3. Decision rules
A strategy or decision rule maps the observable parameters to the demand for
the asset. In this model, the observable parameters are pa and fi and in general the
asset demand decision can be written as s(pe, fi). There might be unobservable
parameters or uncertainty (noise) that is resolved only after the agent makes her
decision. In this model, the unobservables are the realization of the asset value.
A priori the decision rule could be any function of the observables. However, it
is impossible to completely encode this infinite dimensional space. Although it is
possible to get an arbitrarily close approximation of the space (e.g., using polynomials), this is not desirable either because (i) the computational requirements
are larger the longer are the encodings of the strategies, and (ii) although allowing a wider range of functions makes it more likely that the optimal decision
rule can be approximated by an encodable decision rule, this also slows down
learning (convergence). Therefore, I choose a parsimonious encoding of the demand function. I assume that i! and po are constant and known to the agents.
Furthermore, I specify the simplest class of demand functions which includes the
optimal one. Hence, the agent’s problem is reduced to finding a scalar a, in the
linear function 6
s, = a,(6 - PO).
6 An earlier working paper version included results using more general timctional forms and variable
parameters 6 and po, e.g. s,(po,l)=q,
+ al,,po + CQ,,I?.The only important difference compared
to the simple case (4) is a slower convergence speed because agents have to learn more parameters.
All other conclusions are essentially unchanged.
M. LertaulJournal
of Economic Dynamics and Control 21 (1997)
1117-1147
1125
2.4. The GA implementation
I implement the GA algorithm as follows. Let T represent the lifespan of each
agent. In each period t = 1,. . . , T the decisions of an agent are represented by
a binary string. Let J be the (constant) number of agents in each period t. Each
string of length L represents a value for a parameter in the demand function of
one agent. The strings are decoded in the following way. The range of values
for the parameter is the interval [MIN, MAX] 7 and let oi,* E (0, l} denote the
ith bit of a string in period t. Then the value of parameter CQin the demand
function decoded by the string wI = (~1,~. . . a~,,) is given by
a,=MINf(MAX-MM)~i=Z~~r:i-‘.
(5)
Eq. (5) is the binary decoding, scaled appropriately. Of course, this implies that
the parameter value MIN is encoded by the string (0.. . 0), while (1 . . .1) represents MAX. All other strings are between these two values. To give an example,
let MIN = - 1, MAX = 1 and L = 5. The string (11010) encodes the parameter
value a = - 0.290322.
Before the first period, the initial population of strings is chosen randomly.
Each bit is independently set to 0 or 1, each with equal probability. In each
period t the agents face S portfolio decisions, so that each agent makes a total
number of T * S decisions in her lifespan. The population of decision rules is
constant for all S drawings in a period t. After S realizations the population is
updated using crossover and mutation as described in the previous section. The
performance of each rule is measured by the sum of utilities generated by the
S drawings of the asset. It turns out that the number of decisions per period, S,
is crucial for the behavior of the GA agent (see simulation results below). Each
decision rule is assigned a probability of being copied to the next generation
based on its performance. A rule with a high cumulative utility receives a higher
probability than a rule with a low cumulative utility. The drawings from the
pool of rules is done with replacement. Specifically, let K = CT=, Vi(Wj) be the
cumulative utility after S drawings. Then string i is assigned the probability
(6)
of being copied in each drawing. Note that this transformation keeps the relative
strengths in proportion, i.e. a string with cumulative utility of -1 has a four
times higher probability than a string with cumulative utility of -4.
After J rules are drawn, the strings are randomly lined up in pairs of two.
Each pair undergoes crossover with a constant probability CROSS. The mutation
operator is then applied to each bit in the population with probability MUT. MUT
’ MIN and MAX are exogenously
chosen constants.
1126
M. Leiraul Journal of Economic Dynamics and Control 21 (1997)
1117-1147
is exponentially decreasing over time with half-life T* to guarantee a fixed limit
as T grows. This completes the formation of the new population.
The GA parameters used in the simulations are as follows. The number of
strings in the population is J = 30, and each string is of length L = 20. The initial mutation rate is set to MUT = 0.08. MUT decays exponentially with half-life
T’ = T/2, so that the mutation rate in the last period is 2%. Pairs of strings undergo crossover with a constant probability of 40%. The strings decode possible
parameter values between MIN = - 4 and MAX = 4. It should be noted that the
performance of the algorithm is not very sensitive to changes in these parameters.
If the mutation rate falls too rapidly the population might get stuck in a suboptimal bitstring. Recall that the crossover operator does not change two equal
bitstrings so that only mutation is able to alter a completely homogeneous population. Changing the crossover rate has mostly an influence on the convergence
speed.
2.5. Simulation results
Tables 1A and 1B report the simulation results for various values of the parameters. Table 1A reports the results for the simplest case where all observables
are constant. The price of the asset po is set to zero while the values for all other
parameters are given in the table. A ‘*’ denotes the theoretical optimal value for
the parameters. Each simulation is repeated 25 times for a given set of parameter values. Recall that in each period t = 1,. . . , T there are S subperiods. The
population of strings is constant for S periods before the crossover and mutation
operators are applied. To get a good measure of the performance of the GA the
following statistics are reported. First, I compute the average parameter values
of the population in each of the last 25 periods, T - 24,. . . , T, thus allowing the
algorithm to settle down. Then I compute the average and the variance of these
25 values for the 25 runs for each set of parameter values. The tables report
these averages with the variances in parentheses.
The first row shows the results for a benchmark case. The variance of the
asset value and the coefficient of absolute risk aversion are set to unity. Hence
the optimal value for parameter at is 1 as well. The total number of periods
T for the benchmark simulation is 500 with S = 150 portfolio decisions in each
period. The average value of coefficient (rt of the demand function of the GA
agent is 1.0233 which is about 2% higher than the optimal value. The effect that
the GA agent holds ‘too much’ of the risky asset can be observed in almost
all simulations. I will return to this feature in detail later. The low variance of
0.0088 shows that this result is very precise across repetitions. Note that the
average utility generated by the GA agent is extremely close to the utility under
full rationality. The maximal (average) utility is -0.6074 while the utility using
GA decision parameter is -0.6099, this amounts to a loss in utility of less than
0.5%. This is remarkable since mutation of a single bit in one of the last periods
M. LettauIJournal
of Ecunomic Dynamics
and Control 21 (1997)
fff7-1147
1127
Table IA
Simple GA model
T
s
u
Y
u;
a;
I
500
150
I .oo
1.00
1.oo
1.oo
2
500
150
I .oo
2.00
1.00
0.50
3
500
150
1.oo
1.oo
2.00
0.50
4
500
150
1.oo
1.00
4.00
0.25
5
500
100
1.00
1.oo
1.00
1.oo
Simulation
6
500
50
I .oo
1.oo
1.00
1.oo
7
500
25
I .oo
I .oo
1.oo
1.00
8
500
10
1.oo
1.oo
1.oo
I .oo
9
100
100
1.00
1.oo
1.oo
1.oo
10
100
50
1.oo
1.oo
1.oo
1.oo
II
100
25
1.00
1.oo
I .oo
1.oo
12
100
10
1.oo
1.oo
1.oo
1.00
Now
Average
parameter
Table 1B
E[arg max L/J depending
values for 25 runs, variances
1.0233
(0.0088)
0.5230
(0.0038)
0.5 132
(0.0006)
0.2581
(0.0025)
1.0502
(0.0024)
1.1020
(0.0140)
1.1257
(0.1276)
1.3064
(0.0552)
1.0891
(0.0054)
1.1005
(0.0359)
I .3201
(0.0741)
1.4876
(0.0846)
in parentheses.
on S
CG =
1 a’ = 1
Eiarg max U,]
G=l
6 = 0.5
I
2
5
IO
25
50
100
500
3.3704
3.0779
2.4986
2.0177
2.7656
2.4097
1000
5000
S
1.2027
1.0969
1.8879
1.5466
1.2089
1.0897
1.0424
1.0205
1.0113
1.0103
I .0003
1.0058
1.0003
1.4187
1128
M.
LertaulJournal
of Economic Dynamics and Control 21 (1997)
1117-1147
can cause a parameter in the population to change dramatically. Of course, this
string will most likely not survive to the next population but it will affect average
utility negatively.
Simulations 2, 3 and 4 change the parameters of the economy while keeping
the parameters of the algorithm constant. The algorithm produces stable results
for different coefficients of absolute risk aversion and variances of the asset value.
The following simulations reduce the number of total runs T and the number of
drawings S per period. Simulations 5-8 keep the lifespan T at 500 periods, but
reduce the number of drawings per period, S. As might be expected, the portfolio
chosen by the GA agent moves further away from the optimal one. Note that
the GA agent holds more of the risky asset as S decreases. I will discuss this
phenomenon later in more detail. Note, that the variance is increasing as well.
The subsequent simulations use a gradually decreasing lifespan while keeping
the same pattern for S. For a given S a lower T implies in most cases a bigger
deviation from the optimal portfolio. But the effect of decreasing S is much
stronger than of lowering T. If S is high enough, the algorithm produces results
close to the optimum even with a very low T. For example, simulation 10 uses
T = 100 and S = 50. The GA agent holds 1.1005 units of the risky asset which
is about 10% above the optimal holding of one unit. On the other hand, for
T = 500 and S = 10 (simulation 8) the portfolio of the GA agent consists of
1.3064 units of v which is more than 30% above the optimum. Even though both
simulations have 5000 realization, the case with higher S yields a portfolio which
is closer to the optimal one. Thus it is clear that the number of decisions before
the population of strings is updated, S, is much more important for the behavior
of the algorithm than the lifespan of the agent, T.
Why is the performance so much dependent on S? To develop an intuition,
consider the following experiment. Start with a set of portfolio weights, at I . . . <
a~. Recall that the rational agent maximizes expected utility, thus the optimal
portfolio weight is given by a* = arg max, E[ U,(W)]. Now fix S. Suppose that the
agent has to choose an a after observing only S drawings. Naturally, she would
choose the portfolio weight that induced the highest cumulative utility after S
drawings. Hence, she picks a** = arg max,, XT=, Uol,(wj). Note that in general
Era**] # a*. Thus the agent’s portfolio weight will in expectation deviate from
the optimal one, at least for finite S. I was not able to obtain a closed form
for Era**] except for the trivial case S = 1. In this case a** is either al, if the
only realization of u is negative, or aN, if the realization of v is positive. Hence,
E[a’*] = Pr(u ~0) al + Pr(v > 0) aN. This expression might be smaller or bigger
than a* depending on the set of a’s and the distribution of V. For a1 = 0, a,V = 4,
5 = 1, and 0,” = 1, we get E[a”] = 3.3704 >a*. Table 1B shows Monte Carlo
results for different S using the same values. For small S the bias is positive,
as S increases the bias is converging to 0.
Of course, the mechanism of the GA is not that simple. The set of portfolio
weights is not constant as only a part of the old population is carried over to the
M. Lettau I Journal of Economic Dynamics and Control 21 (1997)
1 I 17-1147
1129
new population and new strings are invented. Nevertheless,
the above analysis
is helpful in understanding
the behavior of the GA agent. After S drawings the
GA copies new strings from the old population according to their cumulative
utility. For small S the strings close to the minimal and maximal values of
the parameter space (i.e. weights close to MIN or MAX) tend to generate above
average fitness. Since V is chosen to be positive, this tendency is more pronounced
for weights close to MAX, for negative i! the effect is reversed. In other words,
since there are more ‘good’ states of the world with a positive payoff than ‘bad’
states with negative outcomes, the riskier strategies are more successful than
safer ones in more than 50% of market realizations. This explains why the GA
selects portfolio weights that are too high as long as S is low. As S increases
this effect becomes smaller since more realizations of u are observed. The whole
range of possible realizations will be covered and hence parameters closer to a+
will on average receive a higher utility than parameters close to the endpoints
of the parameter range. The fact that E[argmax U,] + argmax E[U,] as S + 00
illustrates this behavior. In other words, the agent must have a lot of information
about the distribution of the asset in order to learn how to behave optimally. If
she observes only a few realization of the uncertain asset, she selects portfolios
with too much risk since she does not take rare negative events correctly into
account.
Table 1C demonstrates the effect of S and fi in the simulations. The first three
rows show that the bias decreases when ij is lowered. Even for relatively high
values of S this effect is noticeable. Next, consider the extreme case when the
population is updated after each realization of the risky asset (rows 4-6). Recall
that the maximum value of a is set to 4. For fi = 1, the GA converges to a value
very close to MAX. When V is lowered the bias goes down even for S = 1. These
simulations reflect the theoretical intuition developed in the last paragraph. The
last four rows demonstrate that when S is set to a very large number, the bias
towards risky strategies vanishes. This suggests a trade-off between S and 7’.
Suppose that the agent has a fixed total of T t S decisions to make. What is the
optimal choice of S to maximize average utility? Fig. 1 illustrates this trade-off
for S * T = 1000 (other parameter values from simulation
1 in Table 1A). The
average utility for very low S is very small for the same reasons explained above.
The agent holds too much risky asset implying a low average utility. When S
is high, the demand parameter is updated rarely resulting again in lower utility.
The optimal S is about 20 with corresponding
T of 50. In other words, the
intermediate values of S guarantee a good trade-off between improving existing
strategies and producing suboptimal ones when the updating is too fast.
Figs. 2A and B illustrates the evolution of the parameter in the demand function
of the GA agent. Fig. 2A shows the average parameter value while Fig. 2B
presents the variance in each period t. Since the initial population
is chosen
randomly, the mean is close to 0 for the first periods. Of course, the variance is
very large initially. The GA agent is able to recognize quickly that the negative
M. Lettaul Journal of Economic Dynamics and Control 21 (1997)
1130
1117-1147
Table 1C
Simple GA model: effect of S and a
T
Simulation
1
S
100
100
v
Y
2
cu
a;
1.00
1.00
1.00
1.00
aI
1.0671
(0.0080)
100
2
3
100
4
100
5
100
0.50
1.00
1.oo
1.0250
(0.0082)
0.9927
(0.005 1)
0.25
1.oo
1.oo
I .oo
1
1.oo
1.oo
1.oo
1.oo
100
1
0.50
1.oo
1.00
1.00
6
100
1
0.25
I .oo
1.00
I .oo
7
100
1000
1.00
1.oo
1.oo
1.oo
8
100
5000
1.00
1.oo
1.oo
1.00
10
100
10000
1.oo
1.oo
I .oo
I .oo
11
100
Note: Average
parameter
100
1.00
1.oo
25000
I .oo
values for 25 runs, variances
1.00
3.9786
(0.0018)
3.5574
(0.0282
3.2576
(0.0876)
1.0014
(0.0106)
0.9937
(0.0085)
1.0010
(0.0059)
0.9974
(0.0110)
1.oo
in parentheses.
d
I
9
i”
7
h,”
.e
._;
=ul
,
S?
,o
u 9
z
‘:
21
s
I
rl
i
’ 0
100
200
300
400
500
600
700
800
900
1000
s
Fig. 1. Average lifetime utility of the agents
S l T = 1000 for different values of S.
for a fixed number
of lifetime
portfolio
decisions
M.
x
‘0
Letraul
50
Journal
100
of Economic
150
Dynamics
200
and Control
250
JO0
21 (1997)
350
1117-1147
1131
I
4w
450
500
ulo
450
500
period
iaa
Fig. 2. GA population
following parameters:
150
200
250
300
350
average and variance for each period t, respectively,
T = 500, S = 150, B = 1, y = 1, ut = I.
for a simulation
with the
parameter values are not successful and concentrates more on positive parameter
values. The variance is still fairly large indicating that the algorithm has not yet
pinned down a certain value. Nevertheless, the mean is slowly increasing towards
the optimal value while the variance is going down. After about 300 periods
1132
M. LettaulJournal
of Economic Dynamics and Control 21 (1997)
1117-1147
the mean is fairly close to one, the optimal value. It is slightly above one for
the same reasons as explained before. Nevertheless, the crossover and mutation
operators still alter the strings, hence the variance is still fairly large. After about
300 periods, the variance becomes very small, as indicated by the almost solid
line on the horizontal axis. The dots above the horizontal axis indicate that the
mutation operator still alters some strings. Since most of these mutations are not
successful they are discarded in the next period. Hence, the variance is large
for just one period. Since the mutation rate decreases over time, the number
of mutations is falling. The variance is almost always zero, only an occasional
mutation drives it up. The system settles down close to the optimal value. This
pattern of behavior is a very typical one for the GA agent.
To summarize, the behavioral pattern, the adaptive agents tends to hold too
much of the risky asset because there are more positive than negative events. In
other words, the agents are not able to take rare events correctly into account.
This bias tends to zero in the limit as the agents observe many asset realizations
before she updates her set of decision rules. Thus it is by no means trivial that
learning always leads to optimal portfolio decisions even in very simple static
settings as the one considered in this section.
3. Mutual fund flows and GA learning
The last section identified some consistent deviation of the learning agents
compared to how rational agents are behaving. This section is concerned with the
question whether there is any empirical evidence that learning as studied in this
paper is present in real financial data. To this end, I present next some evidence
that investment patterns of mutual fund investors are inconsistent with standard
financial markets theory but are consistent with the learning model on numerous
counts. Moreover, a slightly adapted version of the model in the previous section
is able to match the mutual fund data quite closely.
3.1. Mutual fund POWS: Some empirical evidence
This section contains an empirical investigation of flows into and out of commonly held mutual funds. The data set consists of monthly data starting in
February 1985 and ending in December 1992. 8 Instead of looking at individual mutual funds the data set focuses on aggregate flows of different mutual fund
categories thus summarizing the flows of funds with comparable investment objectives. The four fund groups are (in order of decreasing risk) aggressive growth
funds (AG), growth funds (GR), growth and income funds (GI), and funds with
balanced portfolio (BP). The investment objectives of these fund categories differ
* The data set was provided by the Investment Company Institute.
M. Lertau I Journal of Economic Dynamics and Control 21 f I997)
Table 2
Descriptive
statistics
Number
of individual
Average
Std. dev.
Risk
Average
Average
Average
Std. dev.
return
of return
of mutual fund data (monthly
funds
asset value (in $1000)
flow (in $1000)
flow in % of asset value
of asset flows
Note: Risk measure
is OLS coefficient
1117-l 147
1133
data 1984-1992)
AG
GR
GI
BP
326
547
428
160
0.013
0.052
1.063
37182
271
0.455
0.021
0.011
0.047
0.947
64330
559
0.606
0.012
0.009
0.040
0.859
93030
843
0.858
0.009
0.010
0.036
0.623
11924
202
in regression
on CRSP-VW
1.440
0.020
returns.
considerably. Aggressive growth funds seek to maximize capital gains thus focusing on risky stocks. Growth funds invest in common stocks of well-established
companies. Growth and income funds invest in stocks of companies with a solid
record of paying high dividends. Balanced funds have a portfolio mix of bonds,
preferred stocks, and common stocks. The data set contains the monthly flow into
the mutual funds measured in percentages of total asset value and the monthly
return of the fund category. Table 2 presents some descriptive statistics of returns and fund flows. Row 1 indicates the number of individual funds belonging
to each of the groups as of 12/92. There are 547 individual growth funds in the
data set while the number of funds with balanced portfolio consists of only 160
funds. The other two fund groups contain 326 funds (aggressive growth) and
428 funds (growth and income). Consider next the returns. For comparison, note
that the CRSP value-weighted index had an average monthly return of 1.2% with
a standard deviation of 4.6% during the sample period. The aggressive growth
funds (AG) yield a return of 1.3% with a standard deviation of 5.2%. Compared
with the CRSP-VW index, the return is 0.1 percentage points higher but at cost
of a higher standard deviation of 0.6 percentage points. This higher risk is also
indicated by a coefficient of 1.063 in an OLS regression of AG returns on CRSPVW returns. Growth funds (GR) are slightly less risky than the CRSP-VW index
and also have a lower average return. For the growth and income funds (GI) and
funds with balanced portfolio (BP) it is interesting to note that the BP funds
have a higher average return and a lower risk than the GI funds. Thus GI funds
did perform very poorly over the sample period.
Consider next the flow statistics. The total asset value for the fund categories
ranges from about 93 billion US dollars for growth and income funds to 12
billion dollars for BP funds. Note that the worst performing group (GI) has
the highest total asset value. The average monthly flows into the funds are all
positive thus indicating that mutual funds attracted investment over the sample
period. GI funds had the highest average monthly inflow of 843 million dollars
1134
hf. LettaulJournal
of Economic Dynamics and Control 21 (1997)
1117-1147
which translates to an average inflow of 0.86% of the total asset value. Only BP
funds had a higher relative inflow of 1.44% of the asset value. Aggressive growth
funds along with BP experience the highest variation in flows. GR and GI funds
have somewhat smaller standard deviations in asset flows. The high average flow
and standard deviation number for the BP funds deserve an additional comment.
BP funds experienced very high inflows early in the sample period, 1985 and
1986. After that period the average inflows are much lower. After deleting the
1985 and 1986 data from the sample, the number for the BP funds are in the
range of the GI funds. The subsequent analysis uses the whole sample period,
however. None of the time series except GR exhibit a significant time trend. The
t-value for a linear trend in an OLS regression of the GR data is almost 4. To
correct for this trend all the following regressions for the growth fund time series
include a linear trend.
Next, I study the relationship between mutual find flows and mutual fund
returns. Figs. 3A-D show plots of both time series for each fund category. Consider first the case of aggressive growth funds in Fig. 3A. It is clear from the
picture that both series exhibit an substantial amount of positive contemporaneous
correlation (the correlation coefficient is 0.73). Whenever the return is positive,
flows into aggressive growth funds tend also to be positive. On the other hand,
investors in aggressive growth fund tend to withdraw money in months with negative returns. Take as an example October 1987, the month of the stock market
crash. Aggressive growth funds yielded a return of about -25%. In the same
month AG funds lost about 10% of its assets net of the capital loss. The lower
panel in Fig. 3B illustrates the positive trend in the flows of growth mnds especially between 1988 and 1992. The top panel shows that the GR returns are less
volatile than the AG as is to be expected, of course. The correlation coefficient
of returns and flow of GR funds is 0.54 which is somewhat smaller than for
AG funds. Figs. 3B and C suggest that the correlation of returns and flows are
weaker for GI and BP funds. The visual evidence is supported by corresponding
OLS regressions which are presented in Table 3.
Row 1 of Table 3 shows the correlation coefficient of mutual funds flows S
and returns r for the four fund categories. Funds with higher risk tend to have a
higher correlation. The riskiest fund category, aggressive growth funds, exhibit the
highest correlation with returns with a value of 0.73. All correlation coefficients
are positive. The fairly large correlation coefficients also translate into significant
coefficients in OLS regressions of fund flows on fund return, at least for the
riskier funds. For aggressive growth funds the regression coefficient is 0.30 with
a highly significant t-value of 10.52. Moreover, retums explain more than 50%
of the variation in flows. The numbers for the less risky growth funds are a bit
smaller but the coefficient of returns is still highly significant. The R2 is 0.44. The
evidence for the less risky funds is less clear. The coefficient in the BP regression
is not significant at conventional significance levels while the GI coefficients is
significant at the 1.7% level. The R2’s are low with 0.01 and 0.04, respectively.
M. LeitaulJournal
of Economic Dynamics and Control 21 (1997)
1117-1147
1135
n
FIOWS
Fig. 3A. Top Panel shows the return of aggressive growth funds; bottom panel shows the monetary
flows into the class of aggressive growth funds, monthly data, 2/85-12/92.
II36
M. LelraulJournol
of Economic Dynamics and Control 21 (1997)
1117-1147
:
P IS
I
67
w
89
90
91
B-2
(b)
Fig. 3B. Top panel shows the return of growth !?mds; bottom panel shows the monetary flows into
the class of growth funds, monthly data, 2/85-12/92.
M. LerraulJournd
of Economic Dynamics
and Conrrof 21 (1997) i117-(147
I137
FIOWS
.
8
1
.
I
9
063
6.6
6T
69
6Q
90
91
92
a3
Cc)
Fig. 3C. Top panel shows the return of growth and income funds; bottom panel shows the monetary
flows into the class of growth and income funds, monthly data, Z/85-12/92.
1138
n
:
hf. LerraulJournal
of Economic Dynamics and Control 21 (1997)
1117-1147
_
P
0
‘.
P 95
w
87
WJ
89
90
91
92
(4
Fig. 3D. Top panel shows the return of balanced portfolio timds; bottom panel shows the monetary
flows into the class of balanced portfolio fknds, monthly data, 2/85-12/92.
M. LettaulJournal
of Economic Dynamics and Control 21 (1997)
Table 3
Mutual fund flows f and returns r (OLS regression:
Corr(/,,rz)
P
f-statistic
R2
8’
B-
fi = a + pr,, monthly
1117-1147
data 1985-1992)
AG
GR
Gl
0.73
0.30
10.52
0.54
0.20
0.34
0.54
0.14
1.34
0.44
0.13
0.14
0.24
0.05
2.42
0.04
0.02
0.11
Note: B+ (fl- ) are OLS coefficients
of f after a positive
(negative)
1139
BP
0.10
0.06
0.98
0.01
-0.06
0.19
return
Thus, there is substantial evidence, at least for the riskier funds, that returns are a
major explanatory variable for flows into and out of mutual funds. The evidence
is very strong for the riskier aggressive growth and growth funds and weaker for
the less risky growth and income funds and funds with balanced portfolio.
However, there is another interesting fact in the data. Consider the responses
of mutual fund investors to positive or negative returns. To check whether there
is any asymmetry in investor’s responses, I split up the data set in months with
positive returns and negative returns and perform regressions on both subsets. The
respective regression coefficients are shown in the bottom row of Table 3. Note,
that the regression coefficients for the subset of months with negative returns are
larger than the respective coefficients for positive months in all four cases. The
difference is substantial for all fund categories but growth funds. Thus, it appears
that mutual fund investors are reacting more heavily after negative returns than
after positive ones. To summarize the most important facts of the behavior of
mutual fund investors, (i) flows into mutual funds are positively correlated with
returns, (ii) flows are more sensitive to negative returns than to positive ones,
and (iii) evidence is stronger for riskier mutual funds.
3.2. Financial flows in models with rational agents
These findings pose a substantial challenge for standard financial theory with
rational agents. Recall that the data used here is on aggregate fund classes and not
on individual funds. The positive correlation of returns and fund flows could be
expected with individual fund flows since a high return on a specific fund might
reveal some information about the quality of the fund manager. This in turn might
lead investors to update their prior of the quality of that manager and increase
their investment in this fund. However, this information story does not apply to
aggregate fund flows used here since investors would move money to a fund in
the same class. Since only flows across different fund classes are reflected in the
data, the flows due to new information would show up in this data set.
The standard market model following Markowitz (1952, 1959) and Sharpe
(1964) is also not able to explain why mutual investors are changing the
1140
M.
Leftaul Journal of Economic Dynamics and Control 21 (1997)
1117-1147
portfolio composition after observing the return of their investment portfolio. In
their model with complete information investors should follow a passive investment strategy ignoring any market outcomes. More generally, it seems unlikely
that any model with complete information is able to explain the reactiveness of
investors. Wang (1993) presents a model of incomplete information in which
returns provide information about the state of the world, thus making the optimal
portfolio strategy dependent on observed market outcomes. In his dynamic model,
agents have asymmetric information about the future growth rates of dividends.
Informed and uninformed agents trade against serially correlated noise demand.
Apart from the noise traders, all agents in the model are rational in that they use
Bayes’ rule in updating their guesses about the noise supply and dividend growth
rate. He shows that for certain parameter values, the uninformed agents behave
like trend chasers, that is, they buy when the price goes up and sell when it goes
down. This leads to a positive correlation between changes in the holdings of
the risky asset in the portfolio of the uninformed investors and price changes. In
that sense the uninformed investors in Wang’s model behave like mutual fund
investors as shown in the preceding section. However, this result is only true
for very specific parameter values in the Wang model. A very high degree of
asymmetric information and a substantial amount of serial correlation in the noise
trader demand are needed. Moreover, the degree of positive correlation between
asset holding changes and price changes is rather small even for the most favorite
parameter settings. This makes Wang’s model more consistent with mutual fund
investors in less risky funds than with high risk mutual fund investors. The optimal portfolio policy of Wang’s traders is also symmetric with respect to positive
and negative price changes which is inconsistent with the mutual fund data.
In general, it seems unlikely that models with agents who are using Bayes’ rule
to update their beliefs are able to generate substantial trading volume in steady
state without relying on unreasonably large amounts of noise trading. Intuitively,
the reason is that in dynamic settings, Bayes’ rule puts relatively little weight on
recent information and relies more on the past. Thus, any new information does
not change beliefs too much and thus does not cause substantial portfolio updates.
Another rational explanation of positive correlation between returns and portfolio changes might be portfolio insurance. If the risk tolerance of an investor
is steeply decreasing with wealth it might be optimal for her to sell stock when
the price goes down and increase her holdings in risky asset when the price goes
up (see e.g. Black, 1988; Leland and Rubinstein, 1988). However, it seems unreasonable that all or most investor in mutual funds followed portfolio insurance
strategies.
3.3. A learning model with entry and exit
The model in Section 2 focused on how a fixed set of agents is learning over
time. The agents learn via the crossover and mutation operators. The algorithm
M.
Lettaul
Journal
of Economic
Dynamics
and Control
21 (1997)
II 17-1147
1141
was designed so that the portfolio of the agents converges to a fixed amount
of the risky asset which is the same for each agent in the population.
In the
long run, the agents will not adjust their portfolios after observing the payoff.
The declining mutation rate and the crossover operator are responsible for this
behavior. Thus, it is clear that a model with a constant population of adaptive
agents will not be able to reproduce the behavior of the mutual fund investors as
shown in Section 2. The simplest way to achieve a heterogeneous population even
in the long run is to insert new random strings over time and to delete some of
the existing strings from the population. These new strings introduce new genetic
material which again will be subject to crossover and mutation. However, the
population will not converge to a homogeneous one as it is the case with only
crossover and mutation due to the constant inflow of new strings. In terms of the
economic interpretation of this new model, some of the investors are leaving the
market and some new ones are entering it. The new agents start with random
strings. In the simulations below, the exiting agents are chosen randomly with
equal probability. Other methods such as basing the probabilities
on the fitness
of the strings are also possible. I experimented with different methods but the
results were not affected much.
The following setup was used in the simulations. The population consists of
60 agents. In each period t = 1,. . . , T three agents, chosen randomly, exit the
market and are replaced with new agents who start with a random strategy.
Agents update their decision rule after each market outcome, i.e. S = 1. To make
the simulations
as close to existing markets as possible, I set the distribution
parameters of the asset payoff, V and oV, so that they match the values of the
first two moments of the returns of the four mutual fund groups assuming that
returns are indeed normal. The coefficient of absolute risk aversion is set to unity.
To let the population settle down, I let it evolve for 5000 periods before starting
with the analysis. The figures and tables use periods t = 5001,. . . ,5 100 because
there are also about 100 observation in the mutual fund data set.
Consider Table 4 and the corresponding
Fig. 4A which shows a plot of 100
market realizations and the portfolio adjustments of the GA population for the
case where the asset payoffs are drawn from a normal distribution with the first
two moments matching the aggressive growth fund returns. Just as investors in
mutual fund, the population of adaptive agents adjusts its portfolio after each
market outcome. The agents become more bullish after positive realizations and
hence increase their holdings of the risky asset and vice versa. The correlation
between returns and portfolio adjustments is 0.76, the OLS regression coefficient
is 0.38. While the regression coefficient is larger than the respective number
for the mutual fund investors, the correlation coefficient is virtually the same
as for the flows into the aggressive growth fund. But this aspect is not the
only similarity with mutual fund investors. The GA population exhibits the same
asymmetry
after positive and negative realizations.
Again, separating the data
into reactions after positive and negative returns reveals the same pattern as the
1142
M. LettaulJournal
of Economic Dynamics and Control 21 (1997)
Table 4
Entry and exit GA model (portfolio
adjustment
fi, return rz, OLS regressions:
1117-1147
fi = c( + /&, )
Asset return r
Corr(fr, rr )
B
f-statistic
R2
P+
B-
N(0.013,0.052)
N(O.O1 1,0.047)
N(0.009,0.040)
N(0.010,0.036)
0.76
0.69
0.67
0.59
0.38
0.29
0.23
0.15
11.23
10.83
9.48
7.03
0.64
0.52
0.50
0.38
0.31
0.26
0.20
0.12
0.48
0.34
0.29
0.21
AG
GR
GI
BP
0.70
0.60
0.54
0.45
0.32
0.25
0.21
0.13
10.38
9.72
8.99
5.65
0.63
0.47
0.46
0.34
0.26
0.22
0.19
0.10
0.41
0.30
0.26
0.18
returns
returns
returns
returns
Note: p+ (p- ) are OLS coefficients
after a positive
(negative)
return.
mutual flow data. The downward reaction after a negative return is more extreme
than the corresponding portfolio revision after a positive outcome. The regression
coefficients are p- = 0.3 1 and /I+ = 0.48, respectively. This is also evident from
Fig. 4A. The reason for this asymmetry is the risk-taking bias as discussed in
Section 4.2. With the distribution moments t7= 0.013 and 0” = 0.052, about 65%
of the market outcomes are positive. As seen in Section 4.2 this asymmetry in
the asset distribution leads the adaptive agents to be too bullish on average. In
this model, the mechanism is very similar. Since the adaptive agents hold too
much of the risky asset, the utility after a negative payoff is very small causing
the agents to suddenly become very bearish. After a series of positive outcomes,
the adaptive agents increase their holdings of the risky asset more and more.
Thus, the following downward revision after a negative return is very extreme
since the level of utility will be very low.
The following three rows of Table 4 present the simulation results for returns
matching the first and second moments of GR, GI and BP mutual fund returns. As
might be expected the values for correlation of returns and portfolio adjustment
and regression coefficients are smaller for the returns with a lower risk. Note that
the numbers are all slightly larger than the numbers for actual mutual fund flows
in Table 3. This indicates that the adaptive agents in the model are more reactive
than mutual fund investors. However, the general pattern of investment behavior
is very similar.
Finally, I confront the GA agents with the actual mutual fund returns and
compare the portfolio reaction to the mutual fund flows. Again, the population
evolves 5000 periods using a normal distribution with matched first two moments
as payoff before feeding the mutual fund returns. Fig. 4B plots the change of
the portfolio of the GA population while the lower panel of Table 4 presents
the corresponding correlation and OLS regression results. The results are very
similar to the case with normal distributions. The portfolio adjustment is more
extreme for high risk mutual fund returns. As before, the numbers are larger
M. LettaulJournal
of Economic Dynamics and Control 21 (1997)
1117-1147
1143
period
3020
5060
5040
5060
5100
period
Fig. 4. Top panel shows the normally distributed asset payoff and the change in the holdings of the
risky asset of the GA population. The bottom panel shows the changes of the asset holdings when
the return of aggressive growth funds are used as input.
M. LettaulJournal
i -0.25
-0.20
-0.15
of Economic Dynamics and Control 21 (1997)
-0.10
-0.05
AC
-0.00
0.05
1117-1147
0.10
0.t
return
:!
d
I
-0.25
-0.20
-0.0
-0.10
-0.05
-(
0
0.05
0.10
5
ffi return
Fig. 5. Top panel shows the flows of aggressive growth funds plotted against the return of the AG
funds. Bottom panel shows the changes of the asset holding of the GA agents plotted against the
return of the AG funds.
M. LertaulJournal
of Economic Dynamics and Control 21 (1997)
1117-1147
1145
than for the mutual fund flows in Table 3. The model with GA agents exhibit
the same asymmetry after positive and negative returns while the coefficients p+
and /?- are both bigger than for the AG funds. While the investment patterns of
the GA agents and the mutual fund investors are very similar in many aspects,
the GA population appears to be even more sensitive to market outcomes than
investors in risky growth fimds. The scatter plot in Fig. 5B demonstrates this overreaction of the GA agents when compared to investors into aggressive growth
funds (Fig. 5A) very clearly.
4. Conclusion
The empirical part of this paper investigates data on how investors in mutual
funds move money into and out of four different groups of mutual funds. The
paper shows that mutual fund investors change their portfolio composition after
observing market outcomes. The correlation between flows into mutual funds and
returns is positive and large, at least for riskier funds. OLS regressions show that
returns alone explain up to 54% of mutual fund flows. The regression coefficient is significant for the riskier fimd categories. The second interesting finding
is that mutual fund investors exhibit an asymmetric response after positive and
negative returns. They tend to react more heavily after negative market outcomes than after positive ones. This effect is also more pronounced for riskier
funds.
These findings pose a substantial challenge for conventional theory. Most standard models suggest that rational agents should follow a passive investment strategy. In these models there is usually little trading volume. Wang’s (1993) model
with asymmetric information
is partially consistent with the mutual fund flow
data. For certain parameter values, some of the agents in his model rationally
follow a price-chasing
investment strategy. However, this is only true for very
specific parameter values and the correlation of portfolio adjustment and returns
is lower than in the mutual fund flow data even in the most favorable case. In
general, it seems unlikely that a rational model is capable of explaining the facts
about mutual fund investors in a satisfying way. Instead, this paper suggests an
adaptive learning approach.
The paper presents a simple model in which agents use a genetic algorithm
to update their portfolio decisions. In the first version of the model the adaptive
algorithm is implemented
so that the agent’s investment strategy converges to
a fixed portfolio as time grows. During the learning process the agents look at
market outcomes and revise their portfolio after observing a certain number of
observations.
It turns out that the adaptive agents exhibit a positive risk taking
bias that depends how many market outcomes are observed before the agents
update her portfolio. Thus, they tend to hold a portfolio which is too risky compared to the portfolio of a rational agent who maximizes expected utility. Next,
1146
hf. LettauIJournal
of Economic Dynamics and Control 21 (1997)
1117-1147
a second model with a varying set of agents is introduced. In this model a constant inflow of new agents and outflow of old ones keeps the market going. The
behavior of the population of adaptive agents is consistent with the mutual fund
flow data. They adjust their portfolio after observing the market outcome just
like mutual fund investors. The GA population also reacts asymmetrically
after
positive and negative returns just as the mutual fund investors. The regression
coefficients for the GA agents tend to be larger than the corresponding
numbers in the mutual fund flows. In summary, the learning approach as used in
this paper produces behavioral patterns which are consistent with most aspects of
mutual investor’s strategies in real financial markets and thus provide a simpler
and better fitting theory of observed behavior than standard theories with rational
agents.
References
Andreoni, J. and J. Miller, 1990, Auctions with artificially intelligent agents, Working paper 90-01-004
(Santa Fe Institute).
Arifovic, J., 1992, Genetic algorithm: Experimental evidence, manuscript (McGill University).
Arifovic, J., 1994, Genetic algorithms learning and the cobweb model, Journal of Economic Dynamics
and Control 18, 3-28.
Arifovic, J., 1995, Genetic algorithms and inffationary economies, Journal of Monetary Economics
36, 219-243.
Arifovic, J., 1996, The behavior of the exchange rate in the genetic algorithm and experimental
economies, Journal of Political Economy 104, 510-41.
Black, F., 1988, Noise, Journal of Finance 41, 529-543.
Buckles, B. and F. Petry, 1992, Genetic algorithms (IEEE Computer Society Press, Los Alamitos).
Goldberg, D., 1989, Genetic algorithms in search, optimization,
and machine learning (AddisonWesley, New York).
Holland, J., 1989, Adaption in natural and artificial systems, 2nd. ed. (MIT Press, Cambridge).
Holland, J., K. Holyoak, and R. Nisbet, 1987, Induction: Process of inference, learning, and discovery
(MIT Press, Cambridge).
lppolito, R., 1992, Consumer reaction to measures of poor quality: Evidence from the mutual fund
industry, Journal of Law and Economics 35, 45-70.
KarpotT, J., 1987, The relation between price changes and trading volume, Journal of Financial and
Quantative Analysis 22, 109- 126.
Lakonishok, J., A. Shleifer, and R. Visbny, 1992, The structure and performance of the mutual fund
industry, Brookings Papers on Economic Activity - Microeconomics,
339-379.
Leland, H. and M. Rubinstein,
1987, Comments on the stock market crash, Journal of Economic
Perspectives 2, 45-50.
Markowitz, H., 1952, Portfolio selection, Journal of Finance 7, 77-91.
Markowitz,
H., 1959, Portfolio selection: Efficient diversification
of investments (Yale University
Press, New Haven).
Milgrom, P. and N. Stokey, 1982, Information, trade, and common knowledge, Journal of Economic
Theory 26, 17-27.
Palomino, F., 1996, Noise trading in small markets, Journal of Finance 51, 1537-1550.
Shatpe, W., 1964, Capital asset prices: A theory of market equilibrium under conditions of risk,
Journal of Finance 19. 425-442.
M. LeiraulJournal
Sirri, E. and P. Tufano,
of Economic Dynamics and Control 21 (1997)
1117-1147
1993, Buying and selling mutual funds: Flows, performance,
1147
fees, and services,
Working paper (Harvard Business School).
Timmermann,
A., 1993, How learning
in financial
markets generates
excess volatility
and
predictability
in stock prices, Quarterly Journal of Economics 45, l135- 1145.
Wang, J., 1993, A model of intertemporal asset prices under asymmetric information, The Review of
Economic Studies 60, 249-282.