Incorporating State Dependence in Aggregate Brand

Incorporating State Dependence in Aggregate Brand-level Demand
Models∗
Dan Horsky†
Polykarpos Pavlidis‡
Minjae Song§
July 2012
Abstract
Empirical investigations of household-level data show that state dependence is a significant
determinant of consumers’ choices in frequently purchased product categories. In this paper we
examine how one can incorporate state dependence in an aggregate demand model for brandlevel data which are more often available and frequently used in applied research in marketing
and economics. We define state dependence as consumers receiving higher utility when buying
the same brand in two consecutive periods. The main challenge we face is that we do not observe
what portion of consumers who chose a given brand had chosen the same brand in the previous
period. We overcome this challenge by constructing the brand choice probability using observed
market shares from the previous period and the law of total probability. Through Monte Carlo
simulations we show that our model is a good approximation of household-level models. We
apply our method to estimating demand for salty snacks using a multi-market brand-level data
provided by IRI. We find significant evidence of positive state dependence, without which demand
looks less price elastic and less advertising elastic. The profit-maximizing price that our model
implies is 7 percent lower than that of a static model without state dependence. We also show
that typical advertising expenditure is only profitable when we account for the carry-over effects
generated by state dependence.
∗
We would like to thank Paulo Albuquerque, Paul Ellickson, Mitchell Lovett, Sanjog Misra, and participants at
the 2011 Marketing Science Conference in Houston, TX for their valuable comments and suggestions. We would also
like to thank SymphonyIRI Group, Inc. for making the data available. All estimates and analysis in this paper, based
on data provided by SymphonyIRI Group, Inc., are by the authors and not by SymphonyIRI Group, Inc.
†
Benjamin L. Forman Professor of Marketing at the Simon Graduate School of Business, University of Rochester,
[email protected].
‡
Consultant, Nielsen Marketing Analytics, [email protected].
§
Assistant Professor of Economics and Marketing at the Simon Graduate School of Business, University of
Rochester, [email protected].
1
1
Introduction
Consumer state dependence has been an important research topic in the marketing science literature
almost from its inception. Many researchers have documented persistence in consumer choice data
in which consumers tend to choose brands that they chose in the recent past: Kuehn (1962),
Massy (1966), Morrison (1966), Givon and Horsky (1979), Keane (1997), Seetharaman, Ainslie,
and Chintagunta (1999), Horsky, Misra, and Nelson (2006), and Dube, Hitsch, and Rossi (2009,
2010). Typical data used to document these findings are panel data recording individual purchase
histories of brands over time. By observing the same consumers over repeated purchase occasions
researchers assess whether consumers become loyal to brands they purchased in the past, controlling
for marketing mix variables.
Although individual consumer panel data are much preferred in this type of analysis, they
are often available only for a small number of product categories, stores, and regions. On the
other hand, aggregate brand level data are more readily available and are less costly to acquire
than individual panel data. However, estimating state dependence with the latter type of data
is challenging because consumer purchase histories are not directly observable from the aggregate
sample. Because of this difficulty, purchase dynamics related to state dependence are usually ignored
when analyzing aggregate data.
When state dependence exists in choice behavior, ignoring it is problematic in several levels.
First, from a theoretical perspective if individual consumers exhibit state-dependence type behavior,
then the market shares which are aggregation of such consumers should display this phenomenon too,
an issue highlighted early on by Givon and Horsky (1978). Second, from a managerial perspective
ignoring this phenomenon may mislead managers to set sub-optimal prices. State dependence
renders market shares more persistent over time, and if we do not control for this persistence,
marketing mix variables may appear to have smaller impact on market shares than they actually
do. This may result in, for example, underestimating the price coefficient. Moreover, we miss the
dynamic effects of marketing mix variables when we ignore state dependence. For example, the
price effect is no longer static; a price change today not only affects sales today but also affects next
period’s sales because some of today’s brand switchers are likely to stay with the same brand next
period. This dynamic effect may last for several periods, and prices that are set without accounting
2
for this effect will be sub-optimal. Similar issues arise with respect to advertising.
In this paper we construct and estimate a model that incorporates state dependence at the
aggregate brand level. We define state dependence as consumers receiving higher utility when
buying the same brand in two consecutive periods, and incorporate it in a standard discrete choice
model for aggregate brand-level data. As is standard in such models we assume consumers receive
an idiosyncratic taste shock that is distributed type I extreme value, and use observed brand market
shares as a consistent estimator for the probability of choosing a brand.
The main challenge is that we do not observe what portion of consumers who chose a given
brand at period t had chosen the same brand in the previous period, t − 1. For example, suppose
there are two brands, brand A and brand B, and no outside option in the market. We observe the
number of consumers who choose brands A and B in each period but do not know how many of
them choose the same brand in the previous period. However, we can still construct the market
share function using Bayes rule and the previous period market shares. Using the same example,
the probability of choosing brand A this period, P (At ), is the sum of the probability of choosing
brand A for two consecutive periods and the probability of choosing brand B in the previous period
followed by choosing brand A this period. That is, P (At ) = P (At−1 , At ) + P (Bt−1 , At ). Using
Bayes rule, we write this as
P (At ) = P (At |At−1 ) P (At−1 ) + P (At |Bt−1 ) P (Bt−1 )
Then we replace P (At−1 ) and P (Bt−1 ) with last period’s market shares and the conditional probabilities P (At |At−1 ) and P (At |Bt−1 ) with the conditional multinomial probability function. The
latter probability function is a function of state dependence and brand/market characteristics.
Using this model, we estimate state dependence by exploiting the persistence in market shares
that is not explained by observed and unobserved brand/market characteristics. In our model, we
also allow the price variable to be an endogenous variable and we correct for such endogeneity using
a control function approach proposed by Petrin and Train (2010). It is important to note that we
do not attempt to identify the nature of state dependence. We do not think it is possible to do this
with typical aggregate data available to researchers. Admittedly, our estimates of state dependence
can represent any form of persistence in brand choices. Reasonable possibilities are brand loyalty,
3
addiction, or switching costs. We rely on other individual level studies for the existence and nature
of state dependence. For example, Horsky, et.al. (2006) use individual consumer brand choice
panel data jointly with preference information from the same individuals to disentangle preference
heterogeneity and state dependence, and they find strong evidence for state dependence. Shin,
Misra, and Horsky (2012), using the same data set further, disentangle the choice persistence into
brand preference and learning. Dube, et.al. (2010) document, using individual consumer panel data,
that persistence in brand choice exists and is not due to a mis-specified distribution of heterogeneity
in preferences or due to autocorrelated choice errors.
Our model is useful for researchers and managers who only have access to aggregate data and
want to better understand purchase dynamics generated by state dependence. As far as we know,
our paper is the first paper to estimate state dependence using brand-level data. There are two
related papers that use a Bayesian approach to a similar problem. Chen and Yang (2007) and
Musalem, Bradlow, and Raju (2009) use aggregate data to estimate demand generated by individual
consumers’ dynamic purchases. They treat individual consumers’ purchase history as a latent
variable and use Bayesian data augmentation to simulate a set of latent choices that are consistent
with observed aggregate data. However, their models have no structural parameter that corresponds
to state dependence. Their goal is to estimate static aggregate demand models that are consistent
with individuals’ dynamic purchase behaviors.
For the empirical implementation of our model we analyze brand-level salty snack data on ten
regional markets, provided by IRI. We use a simulated maximum likelihood estimation method
in the spirit of Park and Gupta (2009), with prices in other regions serving as instruments for
the correction of price endogeneity. Our results show that there exists substantial positive state
dependence in salty snack demand and that one would underestimate both the price coefficient and
long-term effects of price by ignoring state dependence. We show that the sales effect of a one time
price perturbation lasts for more than 20 periods, and as a result, the long-run price elasticity is
about four times as high as the short-run price elasticity.
The managerial implications of these results for pricing are very important. We find that profit
maximizing prices that are set using our state dependence model and its estimates are about 7%
lower than profit maximizing prices that a static demand model implies. This result is consistent
with what Dubé, et.al. (2009) find using their state dependence demand model for individual-level
4
data.
We also estimate the effect of national-level monthly advertising expenditures for the brands in
our sample. There are two important findings related to advertising. First, we do not find strong
lagged effects of advertising; this is true for the specifications with or without state dependence.
Second, incorporating state dependence increases the (contemporaneous) advertising coefficient and
generates advertising carry-over effects over multiple periods in a way similar to that of the longrun price effect. This carry-over effect is crucial in justifying advertising expenditures typically
observed in data. Our counterfactual exercise shows that one-period investment in advertising is
only profitable in the long run thanks to the carry-over effects generated by state dependence.
The rest of the paper is organized as follows. Section 2 describes our state dependent demand
model, followed by an estimation procedure in section 3. Section 4 presents Monte Carlo simulation
results and section 5 discusses estimates of salty snack demand using IRI brand-level data. Section
6 provides managerial implications and section 7 concludes.
2
Demand Model
Assume consumers’ current brand choices depend on their choices in the previous period. The
indirect utility for consumer i for brand j at period t is
uijt = δjt + λI [j = kt−1 ] + εijt
(1)
ui0t = εi0t
for j = 1, ..., J where I [j = kt−1 ] is an indicator function that takes 1 if her choice at time t
(i.e. j) is the same as her choice at time t − 1 (i.e., kt−1 ) and 0 otherwise. δjt is brand quality as a
function of price and brand attributes. εijt is an idiosyncratic taste term with Type I Extreme Value
distribution. j = 0 denotes the outside option and its mean utility is set to zero.
λ is the state
dependence parameter and we do not restrict its sign. A positive λ means that consumers obtain
higher utility by making the same choice over time. This may imply brand loyalty. A negative λ
means that consumers are worse off by making the same choice over time, implying variety seeking.1
1
However, we do not think we can successfully estimate negative state dependence with brand-level data because
negative state dependence and the idiosyncratic taste term generate a similar purchase trend over time.
5
Notice that we assume λ is zero for the outside option. As is common in the individual-level logit
based brand choice models, in our specification the state dependence parameter does not vary over
choices and does not depend on brand attributes. The first assumption is a simplification and the
second assumption excludes a case where firms directly affect state dependence by changing brand
attributes.
Let Pt (j) be the unconditional probability of choosing brand j at time t, and Pt (j|k) the
(conditional) probability of choosing brand j at time t conditional on choosing brand k at time
t − 1. Using the law of total probability, the unconditional probability that brand j is chosen at
time t can be written as
Pt (j) =
J
X
Pt−1 (k) Pt (j|k)
(2)
k=0
Due to the distributional assumption for the idiosyncratic error term, conditional probabilities can
be written as
Pt (j|k) =
exp (δjt + λ)
P
, k=j
1 + exp (δjt + λ) + m6=j exp (δmt )
(3)
Pt (j|k) =
exp (δjt )
P
, k 6= j
1 + exp (δkt + λ) + m6=k exp (δmt )
(4)
Pt (j|k) =
exp (δjt )
P
, k=0
1 + m exp (δmt )
(5)
Notice how the state dependence parameter shifts the conditional probability depending on last
period’s choice. Suppose the state dependence parameter is positive. If the consumer makes the
same choice in two consecutive periods, her probability of choosing the same brand in the second
period is higher, (i.e. equation 3). Consequently, the probability of choosing a different brand in
the second period (brand switching) is lower, (i.e. equation 4). However, if the outside option is
chosen in the first period, the brand choice probabilities are not affected by the state dependence
parameter (i.e. equation 5).
While we can use observed market shares as an empirical counterpart of the unconditional
probability, there is no empirical counterpart for the conditional probability in brand-level data.
Let sjt be observed market share for brand j at time t. By replacing the unconditional probabilities
6
with the observed market share, equation (2) can be rewritten as
sjt =
J
X
(sk,t−1 × Pt (j|k))
(6)
k=0
where Pt (j|k) is a function of (δ1,t , ..., δJ,t ) and λ.
This model does not suffer from the independence of irrelevant alternatives (IIA) property.
This well-known property implies that the relative choice probability of two options is not affected by the availability of a third option. The famous red bus and blue bus example demonstrates why this property is problematic. A standard conditional logit model has the IIA property
as Pr (i = 1) / Pr (i = 2) = exp (δ1 ) / exp (δ2 ) where Pr (i = j) is the probability that consumer i
chooses option j. One can easily see that our model is free from this property as the denominator
is not cancelled in the relative probability ratio.
Incorporating consumer heterogeneity in our model is not a trivial task. First of all, it is not
clear whether we can identify consumer heterogeneity with state dependence using brand-level data.
Even when we set aside the identification issue, it is challenging to put heterogeneity in a way that is
consistent with the utility maximization model. Consider the unconditional probability of choosing
brand j in time t. With consumer heterogeneity, equation (2) becomes
ˆ
ˆ
∞
Pt (jt ) =
∞
( J
X
−∞
k=0
Pi,t (jt ) f (β) dβ =
−∞
)
Pi,t−1 (k) Pit (j|k) f (β) dβ
Notice that we cannot use the observed market shares in period t − 1 for Pi,t−1 (k). Thus, the
probability that consumer i who is endowed with ci chooses brand jt 6= 0 in period t is
Pi,t (jt )
=
Jt−1
Jt−1
X
X
Pi,t−1 (kt−1 ) Pi,t (jt |kt−1 ) =
kt−1 =0
Jt−1
=

Jt−2
X

kt−1 =0
kt−2 =0
...
J2
X

J1
X

k2 =0

Jt−2
X

kt−1 =0
X

Pi,t−2 (kt−2 ) Pi,t−1 (jt−1 |kt−2 ) Pi,t (jt |kt−1 )
kt−2 =0

Pi,1 (k1 ) Pi,2 (j2 |k1 ) Pi,3 (j3 |k2 ) ...Pi,t−1 (jt−1 |kt−2 ) Pi,t (jt |kt−1 ) (7)
k1 =0
where
Pi,t (jt |kt−1 ) =

exp (δijt )
P
, kt−1 6= jt
1 + exp (δikt + λ) + m6=k exp (δimt )
7
Pi,t (jt |kt−1 ) =
exp (δijt + λ)
P
, kt−1 = jt
1 + exp (δijt + λ) + m6=j exp (δimt )
In order to construct the likelihood function, we need to track each simulated consumer’s purchase
history from the first period. In doing so, we do not use information given by the previous period
market shares, sj,t−1 , as in equation (6), and the model-fit becomes much worse.
3
Estimation Methodology
We first assume that brand quality is a linear function of brand characteristics including price.
We estimate our model by constructing the likelihood of observing the unconditional brand-level
market shares and finding parameters that maximize the log of that likelihood. Let Xjt be brand j’s
characteristics at t and Gjk (Xjt |θ) be the conditional probability that brand j is chosen when brand
k is chosen at the previous period. The parameter set θ includes the preferences of characteristics
β and the state dependence parameter λ. Gjk (Xjt |θ) becomes either equations (3), (4) or (5)
depending on the previous period’s choice. The likelihood that we observe s = [s0 , s1 , ...sJ ] in
market m at time t is
Lmt (θ) =
J
Y
Ljmt (θ)sj Nmt
(8)
j=0
PJ
× Gjkmt (Xjmt |θ)) and Nmt is the number of consumers in market
Q
Qm=M
m at time t. The log likelihood function we maximize is the log of L (θ) = t=T
t=1
m=1 Lmt (θ).
where Ljmt (θ) =
k=0 (sk,m,t−1
In addition to brand characteristics and price, Xjmt includes time dummy variables for timevarying demand shocks and market dummy variables for unobserved market-level differences. Even
after controlling for these demand factors, the idiosyncratic taste term may still contain brand-level
demand shocks that are correlated with the price variable. We use the control function approach
by Petrin and Train (2010) to correct for the price endogeneity bias. To implement the correction
we first regress observed brand prices on the explanatory variables of our model and instrumental
variables. Then, in the next stage, we use the residuals from the first stage pricing regressions
as proxies for any unobserved factors in each brand’s demand that may be correlated with the
respective brand’s price. We use three instruments: i) the sum of prices of brands produced by
other firms in other markets at the same period, ii) the sum of prices of other brands produced by
the same firm in other markets at the same period and iii) the sum of prices of the same brand in
8
other markets at the same time period.
If there is no state dependence parameter, MLE estimation is straightforward. When demand is
state dependent as in our model, estimation of consistent parameters becomes more involved, especially with brand-level data. With consumer-level data researchers do observe whether consumers
buy the same brand in two consecutive category purchases, and use this observation in estimating
state dependence. With brand-level data, on the other hand, individual purchase histories are not
available and therefore we need to exploit the persistence of brand market share over two consecutive periods that is not explained by changes in brand characteristics and price. Suppose one brand
lowers its price this period and raises it back to the original level in the following period. If demand
exhibits positive state dependence, its market share would go up this period but would not go down
to the original level next period.
4
4.1
Monte Carlo Simulation Results
Homogeneous Consumers
In this section we use Monte Carlo simulations to evaluate how well our brand-level state dependent
model approximates an individual-level state dependent model. For this we generate data using an
individual-level state dependent model, and then aggregate them to the brand level and estimate
the brand-level state dependent model. In both models we do not allow any consumer heterogeneity
other than through the idiosyncratic error terms. The synthetic data has M=6 markets, N=500
households per market, T=60 time periods and J=2 brands (2 brands and the outside option). We
repeat data generation and estimation for 100 times and report the empirical moments of estimates.
The indirect utilities for consumer i in market m and period t are defined as follows:
uijt = γj Dj + γm Dm − αP ricejmt + λ × I(Yij,t−1 = 1) + εijt
ui0t = εi0t
for j = 1, 2 where
P rice1mt ∼ U nif orm(0.5, 1.5)
P rice2mt ∼ U nif orm(0.9, 1.8)
9
The means and standard deviations of estimated parameters are reported in table 1 for three different
levels of state dependence, λ = 1, 3, 5. The table also reports estimates from the individual-level
state dependent model for comparison. Results show that our brand-level model recovers true
parameter values reasonably well. There is no significant difference in the mean between the two
models for all three levels of the state dependence parameter. The standard deviation is slightly
larger in the brand-level model, but the difference is negligible. The only parameter whose variance
is significantly larger is the state dependence parameter. When λ = 1, its standard deviation is
0.091 while it is 0.012 in the individual-level model. For λ = 3 and λ = 5, it is 0.085 and 0.078
respectively while it is less than 0.02 for both cases in the individual-level model. However, these
numbers are still reasonably small.
Using the same data we also estimate a brand-level model that ignores state dependence by
constraining λ to be 0. Table 2 reports the results. The table shows that all the brand attributes
are overestimated, and the inconsistency drastically increases with state dependence. For example,
the mean of brand 1 quality estimates, whose true value is -1.5, is -1.423 when λ is 1 (with the S.D.
0.026), and goes up to -1.084 when λ is 3 (with the S.D. 0.046) and 0.006 when λ is 5 (with the
S.D. 0.058). The price coefficient, whose true value is -1, is also overestimated. The mean value
is -0.967 when λ is 1 and goes up to -0.575 when λ is 3 and -0.195 when λ is 5. These results
show that, when state dependence is ignored, purchase persistence is attributed to brand quality
and price sensitivity, misleadingly implying that consumers’ brand preference is higher and price
sensitivity is lower than they really are.2
4.2
Heterogeneous Consumers
In section 2 we explained why estimating consumer heterogeneity with brand-level data is a challenging task in the state-dependent model. To demonstrate this, we simulate data using the individuallevel state-dependent model with consumer heterogeneity on the price coefficient, and estimate
the brand-level state-dependent model. Table 3 reports results for both the brand-level and the
individual-level models. We estimate the latter model using a straightforward MLE procedure, and
estimate the former model using equation (7) as the likelihood function for each simulated consumer.
The table shows that the price coefficient tends to be overestimated and the degree of hetero2
We repeat this exercise in the individual-level model with λ constrained to be 0 and find a very similar pattern.
10
geneity (the standard deviation of the price coefficient) underestimated in the brand-level model.
The other estimates are either underestimated or overestimated. The only exception is the state
dependence parameter whose mean value is not different from the true value. The standard deviation of the estimates in the brand-level model is much larger (2 to 6 times larger) compared to the
individual-level model.
We also estimate the homogeneous consumer brand-level model to assess the magnitude of inconsistency resulting from ignoring consumer heterogeneity. Table 4 shows that the state dependence
parameter is less affected than the other parameters when we ignore consumer heterogeneity. When
the standard deviation of the price coefficient is 0.25, the mean of the state dependence estimate is
2.99 (the true value is 3) while the mean of the price coefficient estimate is -2.64 (the true value is
-3) and the mean of the brand intercepts are -1.69 (-1.5) and -1.22 (-1.0) respectively. When the
standard deviation of the price coefficient goes up to 0.5, the price coefficient goes up to -2.33 and
the brand intercepts to -1.83 and -1.37, but the state dependence estimate stays at 2.99. However,
when the standard deviation of the price coefficient is 1, the mean of the state dependence estimate
goes down to 2.89 while the other parameters depart further from the true values. Note, however,
that a standard deviation larger than 1 for the heterogeneity in price sensitivity is rarely found in
CPG data. Nevertheless, these results suggest that the brand-level state dependence model may
have some limitations for categories where consumer heterogeneity is important.
4.3
State Dependence with Memory
A very well-known and widely-used model of brand loyalty in the marketing literature is that by
Guadagni and Little (1983). In the Guadagni & Little model (the GL model hereafter) the brand
loyalty that enters the utility of each brand j in time t (GLjt ) is defined as
GLjt = α × GLj,t−1 + (1 − α) × Yj,t−1
where Yj,t−1 is a lagged purchase dummy variable for brand j and α is a weighting parameter that
takes a value between 0 and 1. Our state dependence model is a special case of the GL model
where we set α = 0. In the next Monte Carlo experiment we generate individual-level data using
the GL model and estimate their aggregated counterparts with and without state dependence. By
11
doing so, we evaluate the performance of our state dependent model against the model without
state dependence when data are generated by the GL model.
Table 5 shows that our state dependent model can greatly reduce biases in the model without
state dependence. Brand quality rankings are much closer to their true values when state dependence
is included and the same is true for the price coefficient which goes down from -0.719 to -0.971.
However, the state dependence parameter is smaller than the true parameter (2.06 versus 3). Since
α is set to 0.5 in the simulated data, last period’s choice increases brand loyalty by 1.5 (3 × 0.5) in
the GL model, which means that our model overestimates brand loyalty. This is expected as our
state dependence model is not the data generating model. Nevertheless, we can conclude from this
experiment that when data are believed to exhibit brand loyalty of the GL type and only brand-level
data are available, our state dependent model describes data much better than the model without
state dependence.
5
Application to the Salty Snack Category
5.1
Data
We estimate our state dependent model using the IRI Marketing data set on the salty snack category.
Bronnenberg, Kruger, and Mela (2008) describe this data set in detail. Data are originally at
week/store/UPC level, and we aggregate them at month/brand/region level and define a market as
a region-month pair. For sales we use one serving of 16 oz. as the unit of measurement. Prices are
computed by dividing the total dollar sales for each brand with the total number of servings sold in
each market. We assume that the maximum number of servings that can be sold in any particular
market is equal to two times the maximum number of servings sold in the corresponding market
during the sample period, and use it as the market size for each market. This allows us to reflect
regions’ different consumption levels in market sizes. Then we obtain market shares by dividing the
total number of servings sold with the corresponding market size. The underlying assumption is
that no more than a half of the category buyers buy salty snacks in any given period.
For the category of salty snacks the IRI Marketing data set is distributed along with advertising
data. The source of the advertising data is TNS and the available information includes nationallevel, monthly, advertising expenditures for each product line in the category. The largest part
12
of advertising expenditures for salty snacks is dollars spend on TV. We aggregate this variable
over product lines to construct national-level advertising expenditures for each brand. Because
advertising expenditures are reported for month t on the first day of month t + 1, we use the
reported expenditures for month t + 1 as the advertising expenditures for month t.
We select five major brands, namely Doritos, Lays, Pringles, Tostitos, and UTZ, and ten US
non-adjacent regions such as Charlotte, Houston, Knoxville, Milwaukee, New England, etc. Our
regions are equivalent to IRI-defined markets. Our sample period is 60 months from January of
2001 to December of 2005. Table 6 presents the descriptive statistics of our sample. The table
shows that the average price is almost perfectly negatively correlated with the average share. A
brand with a higher average price has a smaller average market share. Note that UTZ is present in
only five regions and the average market share does not account for zero market share in the other
five regions. It has particularly a high market share (14%) in the Washington D.C. area. The table
also shows that both prices and market shares vary more across markets than across time and that
market shares are more stable over time than prices.
Lastly, the table shows that Pringles spends most heavily on national advertising, followed by
Lays, Doritos and Tostitos, while UTZ does not spend on national advertising. However, table 6 does
not completely describe how firms advertise over time. Figure 1 shows national-level advertising
expenditures for each brand throughout the sample period. The figure shows that all brands’
advertising expenditures follow pulsing patterns. Each firm follows slightly different pulsing patterns
but usually advertises heavily for a few months and then advertises lightly for the next few months.
They spend over $5 million per month during the heavy-advertising periods and spend nothing or
far less than $1 million per month during the light-advertising periods. This pulsing pattern is in
sharp contrast with the stability in market shares over time.
5.2
Estimation Results: Brand Qualities, State Dependence, and Price Effects
We first estimate the brand-level model without state dependence. For brand characteristics we
include four brand intercepts, excluding UTZ as the base, and the price variable. We control for
regional differences by dividing regions into four general groups (i.e. South, North-east, Mid-west
and West) and including three region dummy variables. We also include three season dummy
variables to control for seasonality. The consumer size, Nt in equation (8), is set to 100 per market.
13
The estimation results are reported in table 7. On the left panel we report results without
the price endogeneity correction. The price coefficient is negative and statistically significant. The
brand intercept estimates imply that Pringles has the lowest quality, followed by Tostitos, Doritos,
Lay and UTZ, reflecting the market-share ranking. The other estimates show that demand is lower
in the west region and in the winter season.
The right panel of the table reports results with the control function for the price endogeneity.
Including the control function lowers the price coefficient from -0.40 to -2.44 and it is still statistically
significant. The 1st-stage residual term shows that prices are positively correlated with unobserved
demand factors that the brand intercepts and the regional and seasonal dummy variables do not
capture. The brand quality ranking drastically changes. The highest quality brand is now Pringles
followed by Tostitos, Doritos, Lay and UTZ. This is totally opposite to the previous ranking and
consistent with the price ranking. Also, demand is higher in the summer and fall seasons than in
the winter and spring seasons. The regional effects, however, stay the same.
Next, we estimate the brand-level state-dependence model, using the same specification and
instrumental variables as above. We report estimation results in table 8. The state dependence
parameter is statistically significant and estimated at 4.95 without the control function and 4.93
with the control function. The price coefficient goes down from -0.40 to -0.56 without the control
function and from -2.44 to -3.63 with the control function. The coefficient on the control function
goes up to 3.13 showing a higher degree of correlation. This change in the price coefficient is
consistent with the Monte Carlo simulations in section 4. The model fit improves considerably with
state dependence. The log likelihood increases from -50,742 to -50,161 without the control function
and from -50,694 to -50,132 with the control function. Similarly, the predictive ability of the model
improves with RMSPE decreasing by 28% from 0.025 when state dependence is ignored to 0.018
when it is included. However, the brand quality ranking and the regional/seasonal demand effects
that the estimates imply do not change much with state dependence, although their magnitude
slightly changes.
Using the estimates on the right panel of table 8 we calculate the short-run and the long-run
price elasticity. The short-run price elasticity is straightforward to calculate. It measures the sameperiod demand changes (in %) with respect to one percent price change. For the long-run elasticity
we consider two measures. One is to perturb the price in a given period and track demand changes
14
over time. That is,
∞
X
∂sr
r=t
∂pt
=
∂st
∂st+1 ∂st
∂st+2 ∂st+1 ∂st
+
+
+ ...
∂pt
∂st ∂pt ∂st+1 ∂st ∂pt
(9)
where st is a market share in period t. Although this equation shows we should track demand
changes infinitely, the future demand effect wears out completely in finite periods as ∂st+1 /∂st
quickly approaches zero. In our case 50 periods is sufficient for the demand effect to completely
wear out. The long-run elasticity in this case is defined as
eLR1 =
∞
X
∂sr
r=t
∂pt
!
pt
st
Figure 2 illustrates the typical path of demand changes resulting from one period price decrease.
For this graph we decrease Dorito’s price by 1 percent from the median level in the second period
of the sample and track demand changes over 50 periods. We fix the prices of the other brands
and the 1st-stage residual at their median value, while accounting for the seasonal and the regional
effects using the estimates. For the first period of our prediction we use the average brand share
from the data as the lagged share in each market, and in the following periods we recursively
substitute the model-predicted share of a given period as the lagged share in the next period’s
demand function. The figure shows that the demand effect of one time price perturbation quickly
but smoothly diminishes and becomes negligible after about 20 periods.
An alternative measure is to change the price for all periods and track demand changes over
time. That is,
∞
X
∂st
∂st ∂st−1
∂st ∂st−1 ∂st−2
∂st
=
+
+
+ ...
∂pt−r
∂pt ∂st−1 ∂pt−1 ∂st−1 ∂st−2 ∂pt−2
(10)
r=0
In this case we define the long-run elasticity as
eLR2 =
∞
X
r=0
∂st
(∂pt−r /pt−r )
!
1
st
As in equation (9) the dynamic effect diminishes over time and we only need to track back 50
periods to compute equation (10).
Table 10 shows the brand-level short-run and the two measures of the long-run price elasticities
with respect to one percent price change in all markets. We compute the elasticities for each month
15
and average them over 12 months to account for the seasonal effects. The short-run price elasticities
range from -4.2 (UTZ) to -8.8 (Pringles). The two measures of the long-run elasticity are similar
although the second measure is slightly larger than the first one. The first measure of the longrun price elasticity ranges from -15.7 (UTZ) to -21.3 (Pringles) and the second measure from -16.8
(UTZ) to -22.5 (Pringles). On average, the long-run elasticity is 3∼4 times larger than the short-run
one.
5.3
Estimation Results: State Dependence and Advertising Effects
In this section we estimate the effects of advertising on market shares in the brand-level model.
As mentioned in section 5.1, the advertising expenditure follows a pulsing pattern while market
shares are relatively stable over time. This contrasting pattern may make it hard to establish a link
between advertising and market shares. Controlling for state dependence, therefore, is important as
it explains large part of the stability in market shares. We first estimate the advertising effects using
the model without state dependence and put state dependence in the model to compare results.
To accommodate the dynamic effects of advertising we construct an adstock variable using
the current period and two lagged period expenditures with a geometrically declining carry-over
parameter. That is,
Adstock = At + γa At−1 + γa2 At−2
where At is advertising expenditure in period t and γa is the carry-over parameter.
The left panel of table 9 shows estimation results for the model without state dependence. We
use a one-dimensional grid search for the carry-over parameter and find that the log likelihood is
highest when the parameter value is 0 and strictly decreases as the parameter value increases. The
coefficient for the adstock variable (the current period effect) is 0.135 and statistically significant.
Including the adstock slightly changes the magnitude of the other estimates but the demand implications hardly change. These results show that in the model without state dependence the current
advertising effects are moderate and the lagged advertising effects are absent. The short-run advertising elasticities for Doritos, Lays, Pringles and Tostitos are 0.02, 0.03, 0.04 and 0.02 respectively,
while the long-run elasticity is 0 for all brands.
The right panel of table 9 shows estimation results for the state dependent model. It shows
16
that the adstock coefficient is much larger (0.37) than in the model without state dependence. This
change is intuitive: the advertising expenditure fluctuates a lot over time, while sales are relatively
stable, and by controlling for the persistence of sales with state dependence, the sales become more
sensitive to advertising. The short-run advertising elasticities are now 0.03, 0.04, 0.07 and 0.03 for
Doritos, Lays, Pringles and Tostitos.
The carry-over parameter is still 0 as in the model without state dependence, but the advertising
effects are long lasting because of state dependence. The effects are similar to the dynamic price
effects. Figure 3 shows how a 1 percent increase in Lays’ advertising expenditure affects sales over
time. The advertising effect lasts for 27 periods and the long-run advertising elasticity is 0.14.
These results are in accordance with reported advertising elasticities in Sethuraman, Tellis, and
Briesch (2011) who find in their extensive meta-analysis that the median short-term elasticity of
advertising is 0.05 and the median long-term elasticity is 0.1. Our findings are also consistent with
those reported by Givon and Horsky (1990) who find, in addition to state dependence, current but
not lagged effects of advertising. Their conclusion is that consumers are much more impacted by
their past consumption experience than by their memory of past ads.
6
Managerial Implications
6.1
Comparison of Optimal Prices
To study the managerial implications of our model further we compute the optimal prices that
correspond to a fully forward-looking pricing behavior and compare them with the optimal prices
implied by static per-period profit maximization. In the forward looking model firms account for
customers’ current states in setting prices and know that current-period prices affect future-period
profits through market shares. Thus, they set prices as a function of last period’s market share
and maximize the discounted sum of future profits over infinite horizon. For the static per-period
profit maximization firms use a Bertrand Nash pricing model where demand is static and not state
dependent.
In the static case, each firm (brand) maximizes the following per-period profit.
πjt = s̃jt × M × (pjt − cj ) ∀j
17
where s̃jt corresponds to the standard logit probability of choosing brand j without state dependence, M denotes market size, and cj stands for marginal cost for brand j. The optimal prices in
this case simultaneously satisfy the following J first-order conditions, one for each brand.
s̃jt × M +
∂s̃jt
× M × (pjt − cj ) = 0 ∀j
∂pjt
The computation of optimal prices is much more involved for the forward-looking case. Essentially we solve a dynamic game using our proposed model to find the optimal pricing policies
for each brand. Time is discrete and the state variable is the vector of brands’ market shares last
period. The profit of brand j in period t is a function of the state vector and prices. That is,
πjt (xt , pt ) = sjt (xt , pt ) × M × (pjt − cj )
where now xt = (x0 , x1 , ..., xJ ) denotes the vector of last period shares for all brands and the outside
option and sjt (xt , pt ) is our state-dependent choice probability.
Our demand specification implies a natural transition rule for the state vector. Basically, given
last period’s shares and current prices, firms know current period’s shares and these shares become
firms’ states for next period. This deterministic evolution of the state vector is operationalized with
a Markovian transition matrix.
xt+1 = g(pt , xt ) = Q(pt ) × xt
Qjk (pt ) = Pt (j|k)
Firms maximize their current and discounted future profits conditional on payoff relevant information that is summarized in the state vector. Strategies are functions that map the state space
into the continuum of possible prices, σj : X → R . The current and future payoffs of each firm are
described by the related Bellman equation.
Vj (x) = maxpj ≥0 {πj (x, p) + βVj [g(p, x)]}
∀x ∈ X
(11)
where Vj represents all payoffs given a strategy profile for firm j, σj . We use the notion of Markov
18
Perfect Equilibrium to compute equilibrium prices in the form of pure strategies. At equilibrium
each firm has an optimal strategy that prescribes the best response to all rivals and for all possible
states. Denoting the strategy profiles of competitors with σ−j , the optimal strategy for j, σj∗ , is
described by the following equation.
∗
∗
σj∗ (x) = argmaxpj ≥0 πj x, pj , σ−j
(x) + βVj g(pj , x, σ−j
(x))
∀x ∈ X, ∀j
(12)
This means that the optimal price of each firm should satisfy the following first order condition,
for each possible state vector
sj (x) × M +
∂sj (x)
∂Vj (x) ∂sj (x)
× M × (pj (x) − cj ) + β ×
=0
∂pj (x)
∂sj (x) ∂pj (x)
∀x ∈ X, ∀j ∈ J
Compared to the first-order condition in the static problem, this first-order condition has a different
market share function and a new third term that summarizes all future payoffs accruing from
current period actions. Thus, differences in optimal prices come from (i) a way that the currentperiod market share responds to price changes and (ii) the additional effects of price changes on
future profits. The first effect is expected to push prices upwards as state-dependent demand is
less sensitive to price. The second effect is expected to push prices down because the future value
increases as prices go down.
Our dynamic model is similar to that by Dubé, Hitsch, and Rossi (2009), although some of
details are different. One of them is that we allow state dependence to become zero after the choice
of the outside option. This creates a situation where not everyone in the market is loyal to one of
the brands in the choice set. It turns out that in our implementation a large portion of consumers
is not loyal to any of the brands at any given period and, thus, has a lower purchase probability
than in Dubé, et.al. (2009).
We use a numerical policy-iteration algorithm to compute pure strategy equilibrium (MPE) for
the pricing game3 . To approximate the continuous state space, we discretize market shares such that
a grid is finer along the range of values that we observe in the data. Cost estimates for the pricing
3
Extensive details for our algorithm is available from the authors upon request.
19
game are derived from the first order conditions of the static game and average prices observed
in the data. Although we cannot provide a formal proof that a unique equilibrium exists, all our
numerical computations converge to the same pricing policies and simultaneously satisfy the system
of first-order conditions.
Figures 4 and 5 show the policy functions of “Doritos” (Brand 1) and “Lays” (Brand 2). In these
figures we fix the state variables of the other brands to points on the grid that are closest to the
brands’ average market shares and vary the two focal brands’ state variables. The figures show that
prices are increasing with a decreasing rate in the own-brands’ states. A higher market share last
period is associated with a higher price, everything else constant. The slope is relatively steep up
to about 0.20, which covers the whole range of market shares shown in the data, and levels out
for higher values. This suggests that firms do take advantage of consumer loyalty and charge more
when a larger fraction of the market is “locked-in”.
The figures also show that firms increase prices as a rival’s last-period market share increases.
This is because prices are (i) an increasing function of an own-brand state variable and (ii) strategic
complements. However, this price increase is not monotonic. For low and moderate levels of
own states, up to 0.15 for Doritos’ case, prices increase in a rival’s states, first steadily and then
exponentially. For higher levels of own states, prices increase in a rival’s states with a decreasing
rate and quickly flatten out.
Using the computed policy functions we simulate the market until all firms reach a steady state
where last-period shares are the same as current-period shares. The steady state exists because the
model does not have demand shocks that perturb the market at a positive probability every period.
Figures 6, 7, and 8 show the time-path of the state variable (i.e. last period’s market share) with
a different set of initial market shares in each figure. One may interpret them as the time-path of
each brand going back to its steady state after one-time perturbation.
All figures show that, irrespective of the initial state vector, the state variables of all brands
converge very close to steady state levels within about ten periods and converge completely in about
thirty periods.4 In particular, when a brand’s state is higher than the steady-state level, as Doritos,
Pringles, and Tostitos in figure 8, it charges moves towards a lower state by charging a higher-thansteady-state price. Similarly, when a brand’s state is lower than the steady-state level, as all brands
4
Note that our optimal prices are optimal “regular” prices. We do not consider more sophisticated pricing policies
that include temporary price reductions.
20
in figure 7, it moves towards higher level by charging lower-than-steady-state prices.
Table 11 reports optimal prices in three different models: (i) a static demand model without
state dependence and with myopic firms, (ii) a state dependent demand model with myopic firms,
and (iii) a state dependent demand model with forward-looking firms. The second model uses the
same grid for last period’s market shares as the third model. The demand specifications are identical
in all three compared models except for state dependence.
The optimal prices for each model are computed as follows. For the first model we solve the
system of first-order conditions as a nonlinear system of equations to obtain optimal prices and
corresponding market shares. For the second model, we repeat the same process but use the statedependent demand model where current-period demand depends on last period shares. To report
results succinctly we iterate the process until current-period shares become the same as last-period
shares. For the third model, we use the steady-state prices computed in section 6.2.
Comparing the no state-dependent static model (the first model) with the forward-looking statedependent model (the third model), prices are about 8 percent lower and market shares are 40
percent higher in the forward-looking model. In the forward-looking model firms charge lower
prices today to gain higher market shares in the future. Since all firms adopt the same strategy,
larger market shares come from consumers who choose the outside option in the static pricing model.
The total category market share is 0.25 in the forward-looking model, while it is 0.15 in the first
model. These results imply that if a manager is myopic and sets prices without accounting for state
dependence, she will set higher-than-optimal prices and will lose consumers to competing brands
with forward-looking managers.
The results in the second model show that even if myopic managers account for state dependence
in demand, they will still set much higher prices and gain lower market shares than those of the
forward-looking model. Compared to the first model, prices are lower except for Lays but market
shares are not higher even for Lays. This means that demand is less price elastic in the second model.
Since the price coefficient is more negative in the second model, we may conclude that the effects
of the state dependence, which renders demand less price sensitive, dominates the effects of more
negative price coefficient. This result implies that myopic managers may do worse by accounting
for state dependence.
21
6.2
Return on Advertising Investment
To further demonstrate the implications of our model, we compare return on investment (ROI)
in advertising for the state dependence model with that of the model without state dependence.
Note, however, that our sales data come from a sample of markets in the US while the advertising
expenditure is for the whole US market. Thus, to correctly compute the advertising ROI we need
to rescale our market size to make it relevant to national advertising.5
Using data from Euromonitor International, Agriculture and Agri-Food Canada reports the
annual volumes of snacks sold in 7 categories in the US market (see International Markets Bureau
(2011)). Among them we add the volumes of 4 categories including chips/crisps, extruded snacks,
tortilla/corn, and other sweet/savory snack for 2005 and 2006, which are 2,075,700 and 2,095,110
tones respectively, and take the average of these two year volumes.6 Then we convert this volume
into annual servings by using 16 oz. as one serving. This gives us 4.5 billion servings as an estimate
for the annual US market size.
With this estimate at hand, we compute the advertising ROI for Frito Lays, the market leader
in the salty snack category.7 We use the state dependence model as well as the model without state
dependence to simulate predicted sales for fifty periods from advertising. Recall that we capture all
of the long-term advertising effects in fifty periods. In the base scenario we assume that all brands
do not advertise at all for all periods. In the counterfactual scenario we assume that only Lays
spends five million dollars on advertising in the first month. This is the level of advertising that is
commonly observed in the data for Lays (see Figure 1).
In the model without state dependence where there is no long run effect of advertising, the
advertising ROI is 27.1 percent, meaning that five million dollar advertising expenditure increases
revenue by 1.36 million dollars and Lays makes a loss from the advertising campaign. In the state
dependence model the one period ROI is 32.5 percent and the fifty period ROI is 118 percent. The
one period ROI is higher than that of the model without state dependence mainly because of a
larger coefficient on the advertising variable but still does not cover the advertising expenditure.
However, at the end of fifty periods Lays’ revenue goes up by 5.9 million dollars, resulting in 0.9
million dollar net profit. It is worth pointing out that the advertising investment breaks even within
5
Note that the market size does not play a role in computing the optimal prices.
The excluded categories are nuts, popcorn, and pretzels.
7
Frito Lays has the highest share in the total market, although it does not lead in all regional markets.
6
22
six months. This result shows that the long-term effects generated by state dependence rationalize
Lays’ advertising investment, while the typical advertising expenditure observed in the data would
not be profitable in the standard demand model.
7
Conclusion
We propose a demand model for aggregate brand-level data that can capture choice dynamics due
to state dependence. Through Monte Carlo experiments we show that our model and estimation
method recover true parameter values as well as individual-level demand models do. We also show
that marketing-mix variables are underestimated when state dependence is neglected. Estimation
with real brand-level data on salty snacks from ten regions in the U.S. shows statistically significant
positive state dependence. Our model is strongly supported by the data against the equivalent
brand-level models that ignore state dependence.
The managerial implications of our estimation results are important for both the evaluation
of advertising effects and pricing decisions. When state dependence is included, both current and
carry-over effects of advertising are stronger, but these stronger effects are attributed not to lagged
advertising effects but to market-share persistence. With respect to pricing, we show that the
optimal prices of forward-looking firms based on our dynamic model are much lower than the
corresponding optimal prices implied by the static model.
Incorporation of state dependence in aggregate brand-level models as advocated in this paper
is important in several levels. First, from a theoretical perspective if individual consumers exhibit
state-dependence type behaviors, the market shares which are aggregation of such consumers have
to display this phenomenon too. Second, from a managerial perspective taking account of state
dependence in aggregate models allows managers, as we have just shown, to assess better the
impact of prices and advertising and to set these marketing mix variables at more appropriate
levels. Third, the importance of the existence and modeling of state dependence have been further
highlighted recently by Freimer and Horsky (2008, 2012) who demonstrate that consumers’ state
dependence may well be the underlying force that leads firms to use temporary price promotions
and advertising pulsing; strategies that are indeed used by the managers in the product category
we examined as well as by managers of brands in many other product categories.
23
References
[1] Berry S. (1994), “Estimating Discrete-Choice models of Product Differentiation”, RAND Journal of Economics, 25, 242-262.
[2] Berry S., J. Levinsohn, and A. Pakes (1995), “Automobile Prices in Market Equilibrium”,
Econometrica, 63, 841-890.
[3] Bronnenberg, Bart J., Michael W. Kruger, and Carl F. Mela (2008), “Database Paper—The
IRI Marketing Data Set”, Marketing Science, 27, 745-748.
[4] Chen Y. and S. Yang (2007), “Estimating Disaggregate Model Using Aggregate Data via Augmentation of Individual Choice”, Journal of Marketing Research, 44, 613-621.
[5] Dubé J. P., G. Hitsch, and P. Rossi (2009), “Do Switching Costs Make Markets Less Competitive?”, Journal of Marketing Research, Vol. 46, No. 4, 435-445.
[6] Dubé J. P., G. Hitsch, and P. Rossi (2010), “State Dependence and Alternative Explanations
for Consumer Inertia”, RAND Journal of Economics, Vol. 41, No. 3, 417-445.
[7] Freimer M. and D. Horsky (2008), “Try It You Will Like It - Does Consumer Learning Lead to
Competitive Price Promotions?”, Marketing Science, Vol. 27, No. 5, 796-810.
[8] Freimer M. and D. Horsky (2012) “Periodic Advertising Pulsing in a Competitive Market”,
Marketing Science, forthcoming.
[9] Givon M. and D. Horsky (1978), “Market Share Models As Approximators of Aggregated
Heterogeneous Brand Choice Behavior”, Management Science, Vol. 24, No. 13, 1404-1416.
[10] Givon M. and D. Horsky (1979), “Application of a Composite Stochastic Model of Brand
Choice”, Journal of Marketing Research, Vol. 16, No. 2, 258-266.
[11] Givon M. and D. Horsky (1990), “Untangling the Effects of Purchase Reinforcement and Advertising Carryover”, Marketing Science, Vol. 9, No. 2, 171-187.
[12] Horsky D., S. Misra, and P. Nelson (2006), “Observed and Unobserved Preference Heterogeneity
in Brand-Choice Models”, Marketing Science, Vol. 25, No. 4, 322-335.
[13] International Markets Bureau (2011), “Consumer Trends: Salty snack Food in the United
States”, Agriculture and Agri-Food Canada.
[14] Keane M. (1997), “Modeling Heterogeneity and State Dependence in Consumer Choice Behavior,” Journal of Business and Economic Statistics, 15, 310-327.
[15] Petrin A. and K. Train (2010), “A Control Function Approach to Endogeneity in Consumer
Choice Models”, Journal of Marketing Research, Vol. 47, No. 1, pp. 3-13.
[16] Kuehn A. (1962), “Consumer Brand Choice as a Learning Process”, Journal of Advertising
Research, Vol. 2, 10-17.
[17] Massy W. F. (1966), “Order and Homogeneity of Family Specific Brand Switching Processes”,
Journal of Marketing Research, Vol. 3, 48-54.
24
[18] Morrison D. G. (1966), “Testing Brand-Switching Models”, Journal of Marketing Research, Vol.
3, 401-409.
[19] Musalem A., E. Bradlow, and J. Raju (2009), “Bayesian Estimation of Random-Coefficients
Choice Models using Aggregate Data,” Journal of Applied Econometrics, 24, 490-516.
[20] Park S. and S. Gupta (2009), “Simulated Maximum Likelihood Estimator for the Random Coefficient Logit Model Using Aggregate Data,” Journal of Marketing Research, Vol. 46, 531–542.
[21] Qi S., (2008) “Advertising, Entry Deterrence, and Industry Innovation”, Unpublished
manuscript, Florida State University.
[22] Seetharaman P. B., A. Ainslie, and P. Chintagunta (1999), “Investigating Household State
Dependence Effects across Categories,” Marketing Science, Vol. 36, 488-500.
[23] Sethuraman R., G. Tellis, and R. Briesch (2011), “How Well Does Advertising Work? Generalizations from Meta-Analysis of Brand Advertising Elasticities”, Journal of Marketing Research,
Vol. 48, 457-471.
[24] Shin S., S. Misra, and D. Horsky (2012), “Disentangling Preferences and Learning in Brand
Choice Models”, Marketing Science, Vol. 31, 115-137.
25
Table 1: Monte Carlo Results
MC1.1: N = 500, T = 60, 100 iterations, 6 Markets
Brand 1
Brand 2
Price Coefficient
Lambda
Market Effect 1
Market Effect 2
Market Effect 3
Market Effect 4
Market Effect 5
Aggregate Level Estimation
Individual Level Estimation
TRUE
Mean
Std Dev.
Mean
Std Dev.
-1.5
-1
-1
1
-0.5
-1
1
1.5
0.5
-1.497
-0.997
-0.999
0.985
-0.505
-1.003
1.001
1.502
0.502
0.026
0.032
0.021
0.091
0.023
0.029
0.025
0.028
0.022
-1.502
-1.002
-0.999
1.000
-0.497
-0.997
1.002
1.500
0.500
0.024
0.030
0.020
0.012
0.023
0.025
0.017
0.017
0.018
-1.500
-1.001
-1.000
3.001
-0.500
-1.003
0.999
1.499
0.499
0.026
0.032
0.021
0.012
0.025
0.023
0.019
0.023
0.019
-1.498
-0.997
-1.001
5.001
-0.501
-1.002
1.000
1.499
0.498
0.034
0.042
0.030
0.017
0.024
0.026
0.031
0.033
0.026
MC1.2: N = 500, T = 60, 100 iterations, 6 Markets
Brand 1
Brand 2
Price Coefficient
Lambda
Market Effect 1
Market Effect 2
Market Effect 3
Market Effect 4
Market Effect 5
-1.5
-1
-1
3
-0.5
-1
1
1.5
0.5
-1.502
-1.001
-0.998
2.998
-0.497
-1.002
1.001
1.501
0.502
0.032
0.036
0.030
0.085
0.023
0.027
0.022
0.027
0.022
MC1.3: N = 500, T = 60, 100 iterations, 6 Markets
Brand 1
Brand 2
Price Coefficient
Lambda
Market Effect 1
Market Effect 2
Market Effect 3
Market Effect 4
Market Effect 5
-1.5
-1
-1
5
-0.5
-1
1
1.5
0.5
-1.506
-1.009
-0.992
4.990
-0.496
-0.998
1.005
1.500
0.504
0.039
0.054
0.052
0.078
0.023
0.029
0.036
0.039
0.027
26
Table 2: Monte Carlo Results with Ignored State Dependence
MC2.1: N = 500, T = 60, 100 iterations, 6 Markets
Brand 1
Brand 2
Price Coefficient
Lambda
Market Effect 1
Market Effect 2
Market Effect 3
Market Effect 4
Market Effect 5
Aggregate Level Estimation
Individual Level Estimation
TRUE
Mean
Std Dev.
Mean
Std Dev.
-1.5
-1
-1
1
-0.5
-1
1
1.5
0.5
-1.423
-0.907
-0.967
-0.543
-1.072
1.125
1.697
0.559
0.026
0.031
0.020
0.027
0.033
0.023
0.023
0.023
-1.425
-0.907
-0.967
-0.537
-1.069
1.124
1.698
0.554
0.026
0.031
0.021
0.025
0.028
0.025
0.023
0.021
-1.088
-0.643
-0.569
-0.728
-1.409
1.492
2.221
0.750
0.041
0.052
0.030
0.038
0.041
0.041
0.043
0.040
0.006
0.318
-0.190
-0.860
-1.708
1.612
2.380
0.805
0.065
0.067
0.032
0.071
0.075
0.072
0.082
0.063
MC2.2: N = 500, T = 60, 100 iterations, 6 Markets
Brand 1
Brand 2
Price Coefficient
Lambda
Market Effect 1
Market Effect 2
Market Effect 3
Market Effect 4
Market Effect 5
-1.5
-1
-1
3
-0.5
-1
1
1.5
0.5
-1.084
-0.635
-0.575
-0.725
-1.412
1.492
2.213
0.746
0.046
0.053
0.025
0.046
0.047
0.042
0.045
0.036
MC2.3: N = 500, T = 60, 100 iterations, 6 Markets
Brand 1
Brand 2
Price Coefficient
Lambda
Market Effect 1
Market Effect 2
Market Effect 3
Market Effect 4
Market Effect 5
-1.5
-1
-1
5
-0.5
-1
1
1.5
0.5
0.006
0.312
-0.195
-0.839
-1.681
1.622
2.383
0.823
0.058
0.064
0.030
0.074
0.076
0.077
0.074
0.059
27
Table 3: Monte Carlo Results with Heterogeneity
MC3.1: N = 500, T = 60, 100 iterations, 6 Markets
Brand 1
Brand 2
Price Coefficient
Price Coef. Std Deviation
Lambda
Market Effect 1
Market Effect 2
Market Effect 3
Market Effect 4
Market Effect 5
Aggregate Level Estimation
Individual Level Estimation
TRUE
Mean
Std Dev.
Mean
Std Dev.
-1.5
-1
-3
1
3
-0.5
-1
1
1.5
0.5
-1.626
-1.142
-2.748
0.856
3.006
-0.467
-0.984
1.010
1.443
0.486
0.143
0.152
0.260
0.134
0.126
0.131
0.130
0.101
0.100
0.110
-1.507
-1.000
-2.943
1.078
3.020
-0.500
-1.004
1.003
1.495
0.499
0.058
0.070
0.045
0.023
0.022
0.076
0.086
0.059
0.057
0.061
28
Table 4: Monte Carlo Results with Ignored Heterogeneity
MC4.1: N = 500, T = 60, 100 iterations, 6 Markets
Aggregate Level Estimation
Brand 1
Brand 2
Price Coefficient
Price Coef. Std Deviation
Lambda
Market Effect 1
Market Effect 2
Market Effect 3
Market Effect 4
Market Effect 5
TRUE
Mean
Std Dev.
-1.5
-1
-3
0.25
3
-0.5
-1
1
1.5
0.5
-1.689
-1.219
-2.639
2.993
-0.523
-0.995
0.992
1.466
0.507
0.046
0.062
0.050
0.081
0.050
0.069
0.041
0.041
0.038
MC4.2: N = 500, T = 60, 100 iterations, 6 Markets
Brand 1
Brand 2
Price Coefficient
Price Coef. Std Deviation
Lambda
Market Effect 1
Market Effect 2
Market Effect 3
Market Effect 4
Market Effect 5
-1.5
-1
-3
0.5
3
-0.5
-1
1
1.5
0.5
-1.831
-1.373
-2.330
2.996
-0.515
-1.039
0.962
1.385
0.484
0.044
0.065
0.056
0.094
0.061
0.070
0.038
0.043
0.041
MC4.3: N = 500, T = 60, 100 iterations, 6 Markets
Brand 1
Brand 2
Price Coefficient
Price Coef. Std Deviation
Lambda
Market Effect 1
Market Effect 2
Market Effect 3
Market Effect 4
Market Effect 5
-1.5
-1
-3
1
3
-0.5
-1
1
1.5
0.5
29
-2.029
-1.573
-1.826
2.894
-0.498
-1.025
0.825
1.223
0.461
0.050
0.057
0.047
0.106
0.056
0.059
0.035
0.035
0.037
Table 5: Monte Carlo Results with SD Memory
MC5.1: N = 500, T = 60, 100 iterations, 6 Markets
SD as in Guadagni & Little - Aggregate Level Estimation
Ignoring SD
Brand 1
Brand 2
Price Coefficient
Lambda for GL
Smoothing α
Market Effect 1
Market Effect 2
Market Effect 3
Market Effect 4
Market Effect 5
with SD
TRUE
Mean
Std Dev.
Mean
Std Dev.
-1.5
-1
-1
3
0.5
-0.5
-1
1
1.5
0.5
-1.297
-0.726
-0.719
-0.720
-1.359
1.674
2.504
0.818
0.048
0.051
0.029
0.041
0.044
0.045
0.045
0.047
-1.445
-0.901
-0.971
2.060
-0.578
-1.106
1.254
1.886
0.620
0.045
0.049
0.030
0.095
0.027
0.029
0.020
0.045
0.030
30
Table 6: Summary Statistics
Price
Mean
Brand
Share
Std. Dev.
Markets
Time
Mean
Adv. Expend. (000’s)
Std. Dev.
Markets
Time
Mean
Std. Dev.
Time
Doritos
3.02
0.25
0.16
0.068
0.02
0.01
1840.4
1982.36
Lays
2.89
0.11
0.12
0.084
0.02
0.02
2451.4
3309.60
Pringles
3.61
0.25
0.15
0.021
0.00
0.00
2975.5
1795.89
Tostitos
3.14
0.25
0.09
0.055
0.01
0.01
1758.2
2053.34
UTZ
2.78
0.48
0.18
0.094
0.06
0.04
0
0.00
31
Table 7: Estimation Results without State Dependence
No Endogeneity
Correction
Endogeneity Correction
Estimate
Std Error
Estimate
Std Error
Constant
-0.998
0.089
4.75
0.595
Price
-0.397
0.029
-2.438
0.211
Doritos
-0.273
0.04
0.263
0.068
-0.1
0.039
0.192
0.049
Tostitos
-0.438
0.042
0.357
0.092
Pringles
-1.224
0.053
0.546
0.189
Midwest
-0.06
0.035
-0.143
0.036
Northeast
-0.087
0.033
-0.267
0.038
West
-0.126
0.024
-0.449
0.041
Spring
0.17
0.028
0.004
0.033
Summer
0.142
0.028
0.136
0.028
Fall
0.076
0.029
0.18
0.031
2.079
0.213
Lays
1st Stage Residuals
Log Likelihood
RMSPE
-50742.24
-50694.28
0.0248
0.0240
1st Stage IV F Statistic
15.32
32
Table 8: Estimation Results with State Dependence
No Endogeneity
Correction
Endogeneity Correction
Estimate
Std Error
Estimate
Std Error
Constant
-2.413
0.181
6.255
1.227
Price
-0.56
0.062
-3.633
0.45
Doritos
0.194
0.091
0.995
0.154
Lays
0.254
0.087
0.688
0.113
Tostitos
0.133
0.094
1.323
0.204
Pringles
-0.178
0.115
2.475
0.413
Midwest
-0.024
0.063
-0.141
0.065
Northeast
-0.087
0.061
-0.357
0.072
West
-0.152
0.044
-0.645
0.084
Spring
0.265
0.052
0.021
0.06
Summer
0.112
0.051
0.106
0.05
Fall
0.263
0.055
0.412
0.061
3.133
0.445
4.926
0.176
1st Stage Residuals
State Dependence
Log Likelihood
RMSPE
4.946
0.175
-50161.17
-50132.15
0.0176
0.0174
1st Stage IV F Statistic
15.32
33
Table 9: Estimation Results with Advertising
without State
Dependence
with State Dependence
Estimate
Std Error
Estimate
Std Error
Constant
4.627
0.597
5.957
1.232
Price
-2.397
0.212
-3.542
0.451
Doritos
0.229
0.069
0.911
0.154
Lays
0.15
0.051
0.58
0.114
Tostitos
0.315
0.093
1.222
0.204
Pringles
0.47
0.191
2.291
0.413
Midwest
-0.142
0.036
-0.137
0.065
Northeast
-0.263
0.038
-0.347
0.073
West
-0.443
0.041
-0.63
0.084
Spring
0.016
0.033
0.057
0.061
Summer
0.148
0.028
0.135
0.051
Fall
0.189
0.031
0.438
0.062
1st Stage Resid.
2.04
0.213
3.042
0.447
4.96
0.178
0.373
0.08
State Dependence
Adstock
0.135
0.036
Retention Rate
0
0
Log Likelihood
-50685.9
-50119.2
RMSPE
0.0240
0.0173
IV F Statistic
15.23
15.23
34
Brand
Doritos
Lays
Pringles
Tostitos
UTZ
Table 10: Price Elasticities
Short Run
Long Run 1
Long Run 2
-5.2
-18.1
-19.1
(0.2)
(1.4)
(0.9)
-4.4
-17.5
-18.1
(0.2)
(1.2)
(0.8)
-8.8
-21.3
-22.5
(0.4)
(1.5)
(1.1)
-5.7
-19.0
-20.1
(0.3)
(1.5)
(0.9)
-4.2
-15.7
-16.8
(0.2)
(1.3)
(0.8)
Table 11: Optimal Prices Based on Demand Estimates
Estimation
Profit Maxim.
Brand
Costs
Doritos
without S.D.
with S.D.
with S.D.
Myopic
Myopic
Dynamic
Opt. Price
Share
Opt. Price
Share
Opt. Price
Share
2.78
3.21
0.046
3.21
0.025
2.95
0.089
Lays
2.63
3.07
0.061
3.09
0.031
2.81
0.13
Pringles
3.46
3.87
0.012
3.80
0.009
3.66
0.015
Tostitos
2.95
3.38
0.034
3.35
0.019
3.13
0.051
35
Figure 1: Advertising Expenditures (000s)
10
20
30
40
50
4000
60
0
10
20
30
40
Monthly Periods
Monthly Periods
Lays
Tostitos
50
60
50
60
0
6000
0 2000
4000
Total $’s
8000
12000
10000
0
Total $’s
2000
0
2000
Total $’s
6000
6000
Pringles
0
Total $’s
Doritos
0
10
20
30
40
50
60
0
Monthly Periods
10
20
30
40
Monthly Periods
36
Figure 2: The Dynamic Demand Effects of One Time Price Change
6000
4000
0
2000
Incremental Sales
8000
Doritos
0
10
20
30
40
50
60
Monthly Periods
Figure 3: The Dynamic Demand Effects of One Time Advertising Change
80
60
40
20
0
Incremental Sales
100
120
Lays
0
10
20
30
Monthly Periods
37
40
50
Figure 4: Policy Function: Doritos
3.0
B
Policy for
2.9
2.8
rand 1
2.7
2.6
0.8
0.8
St
ate 0.6
for
Br 0.4
an
d2
0.6
0.4
0.2
0.2
te
r
fo
d
an
Br
1
a
St
Figure 5: Policy Function: Lays
2.9
B
Policy for
2.8
2.7
rand 2
2.6
2.5
2.4
0.8
0.8
St
ate 0.6
for
Br 0.4
an
d1
0.6
0.4
0.2
0.2
38
St
e
at
r
fo
d
an
Br
2
0.10
0.15
0.20
0
0
0.08
0.10
5
5
10
10
15
15
20
20
25
25
25
30
Time Periods
39
0.05
20
0.04
15
0.10
0.12
0.015
0.025
0.035
0.045
State (Last period’s share)
0.08
25
0.03
0.06
State (Last period’s share)
20
0.02
10
15
0.01
State (Last period’s share)
0.06
5
10
0.014
0.04
0
5
0.012
0.02
State (Last period’s share)
0
0.010
State (Last period’s share)
0.05
State (Last period’s share)
0.07
0.08
0.09
0.0500
0.0505
0.0510
0.0515
State (Last period’s share)
0.06
State (Last period’s share)
0.05
Figure 6: State Variable Path 1
Doritos
Pringles
Time Periods
30
0
30
0
30
0
0
5
Time Periods
Time Periods
5
5
5
10
10
Doritos
10
10
15
15
15
20
Lays
Tostitos
15
20
20
Time Periods
Time Periods
Lays
Tostitos
20
Time Periods
25
30
Time Periods
25
30
Figure 7: State Variable Path 2
Pringles
25
30
25
30
0.120
0.130
0
5
10
15
20
20
25
Time Periods
25
30
40
0.10
15
0.08
10
0.06
5
0.04
0
0.02
State (Last period’s share)
0.110
State (Last period’s share)
0.100
0.094
0.098
0.05
0.07
0.09
State (Last period’s share)
0.090
State (Last period’s share)
Figure 8: State Variable Path 3
Doritos
Pringles
Time Periods
30
0
0
5
Time Periods
5
10
10
15
15
20
Lays
Tostitos
Time Periods
20
25
30
25
30