Measuring the Gains from Trade in Differentiated Products

Price, Quality, and Variety: Measuring the Gains
from Trade in Differentiated Products
Gloria Sheu∗
US Department of Justice
September 2011
Abstract
The empirical trade literature has found that the ability to import differentiated products has significant positive welfare effects. These gains have been established using
price indices that measure the value to consumers of changes in the set of goods available over time. In this paper, I explore how these price indices are shaped by their
underlying structural assumptions. I draw parallels with methods used in industrial
organization, where a number of researchers have also studied differentiated products,
albeit in domestic markets. I show that a standard trade model, the nested constant
elasticity of substitution (NCES) framework, produces the same market demand system and price index as a standard industrial organization model, the nested logit (NL).
This finding allows me to connect commonly used NCES and NL variants into one
coherent family. Furthermore, I show how standard NCES empirical techniques, which
rely on data that aggregates over individual products, relate to the product-level data
methods more commonly used in logit settings. I then apply these methods to a data
set on Indian imports of computer printers, highlighting how these approaches differ in
a concrete example. In this application, I find that using aggregated data understates
the gains in the price index by 31 to 43 percent relative to using product-level data.
Furthermore, loosening the assumed substitution structure between goods raises the
price index by over 60 percent.
∗
The views expressed here are not purported to reflect those of the US Department of Justice. I
would like to thank Pol Antràs, Elhanan Helpman, Julie Mortimer, and Ariel Pakes for their guidance
and support on this project. This paper has also benefitted from discussions with Deepa Dhume,
Oleg Itskhoki, Greg Lewis, Marc Melitz, David Mericle, Nathan Miller, Eduardo Morales, and Marc
Remer. I am greatly indebted to George Gibson of the Xerox Corporation for giving me access to
the printer data set and for helping me understand the Indian printer market. Mary Carlin at the
Xerox Corporate Library also provided key assistance in obtaining the printer data. Petia Topalova
graciously made the Indian tariff data available. All errors are my own. First version: November 2009.
Email: [email protected]
1
Introduction
As emphasized in the seminal work of Krugman (1979), an important channel by which
countries can gain from trade is through increased access to differentiated products.
Imported goods widen the choice set available to consumers by providing a different
combination of price, quality, and variety than domestic goods alone. Although the
benefits of cheaper imports have long been recognized, consumers also place value on
the varying qualities (that is, the unique mix of non-price characteristics) provided by
foreign differentiated goods. Furthermore, access to imported products may increase the
number of goods available for a given distribution of price and quality, thus increasing
variety. All three of these forces contribute to the gains from trade.
After establishing these effects theoretically, the next step has been to measure them
empirically. Feenstra (1994) facilitated these efforts by introducing a simple procedure
for computing the price index from a differentiated products demand system. This
price index measures how much prices on one set of products would have to fall (or
rise) in order to give consumers the same welfare as that from a different set. The
Feenstra (1994) method has been widely adopted, and much of the resulting literature
underscores the importance of accounting for differentiated products when measuring
the gains from trade.
Now that these papers have confirmed the existence of welfare gains, what more
can be learned? Can the standard theoretical and empirical techniques be refined in
order to better understand the causes of these welfare effects? In considering these
questions, it is important to examine the empirical industrial organization literature
on differentiated products. Like the aforementioned trade literature, there is a rich
industrial organization tradition studying how consumers are affected by changes in
the price, quality, and variety of differentiated products. The basic research question
is the same, just applied to domestic empirical examples instead of international ones.
Therefore, examining the relationship between the methods used in these literatures
shows how the results in trade are shaped by the techniques used to construct them.
The purpose of this paper is to reach a more fundamental understanding of the
international trade approach towards differentiated products by comparing it to techniques that are usually confined to industrial organization. My analysis has two key
components. First I show that the seemingly disparate theoretical frameworks used
in trade and industrial organization are actually tightly linked. More specifically, the
workhorse models of international trade, the constant elasticity of substitution (CES)
1
and the nested constant elasticity of substitution (NCES) setups, produce the same
market demand functions and price indices as two of the most common models in
industrial organization, the multinomial logit (MNL) and the nested logit (NL), respectively. As a consequence, the differences between trade and industrial organization
empirical findings based on these models are driven more by differences in data and
empirical techniques than by differences in theory. Furthermore, this result means that
the random coefficients framework, another common model in industrial organization,
is also connected to the NCES as an extension of the NL that I call the nested random
coefficients logit (NRCL). Therefore, the extra impact of random coefficients can be
assessed through a comparison between the NL and the NRCL.
Second, I present an empirical application that shows how the trade and industrial
organization approaches can differ in practice. Using a product-level data set on imports of computer printers into India, I estimate several industrial-organization-style
logit models. Furthermore, by aggregating this data, I am also able to implement standard trade methods. This exercise produces three main findings. First, by obscuring
improvements in quality and variety amongst the underlying goods, I find that the
aggregated data commonly used in trade understate the gains in the price index by 31
to 43 percent. Second, the MNL, with its restrictive substitution structure, overstates
gains in the price index by 65 to 68 percent relative to the NL and NRCL. Third, the
addition of random coefficients reveals important heterogeneity across types of consumers. The NRCL price index for certain subgroups of consumers is 53 percent lower
than the market-level NL index.
There are two main distinctions between how trade and industrial organization approach the study of differentiated products. First, there is a difference in terms of data.
The standard in the trade literature is to use data collected from customs authorities.
Customs information is often the only data available that exhaustively covers the imports of an entire nation. As a result, these data are the preferred choice for trade
papers, where the emphasis is on studying the economy-wide effects of international
trade. However, a drawback of this data is that it aggregates individual goods into
“Harmonized System” (HS) codes. A product is defined as an HS code/supplier country pair. When reporting imports of computer printers, for instance, a good could be
defined as broadly as “inkjet printers from the United States.”
In contrast, the industrial organization literature usually relies on detailed productlevel data. Continuing with the printer example, a typical data set would report the
quantities sold and prices of individual models. The “HP Deskjet 630C” and the “HP
2
Deskjet 1220C” would be separate observations, recognizing the fact that the latter
model prints at twice the speed as the former. Not only does this level of analysis
match the level of differentiation relevant to consumers, it also allows the researcher
to supplement the sales data with product characteristics information. For instance,
one could obtain proxies for the quality of each printer model by incorporating data on
features like print speed or paper capacity. As a result, this data provide more direct
information on price, quality, and variety when compared to customs data. However,
the downside is that such detailed data are only available for selected sectors and
countries.
The second difference between the trade and industrial organization approaches is
in terms of structural modeling. In order to assess the effects on consumers of changes
in products, the researcher assumes a demand system. A number of theoretical trade
papers use the CES demand system, which is derived from the utility maximization
problem of a representative consumer.1 Because of its simplicity, the CES model is
both algebraically elegant and easy to implement empirically. However, the CES also
severely restricts the substitution structure between products. In order to address this
problem, many empirical researchers have adopted an NCES specification, which allows
the substitution parameter within a sector to differ from that between sectors.
Meanwhile, the industrial organization literature has largely relied on the MNL
demand system.2 Unlike the CES, this framework assumes a distribution of heterogeneous consumers, each purchasing one type of good. But like the CES, the MNL places
strong restrictions on substitution patterns. In response, researchers have developed a
number of modifications, the most common being nesting (as in the NL) and random
coefficients.3 I combine these two extensions into a unified model, the NRCL.
My results build upon the aforementioned empirical trade and industrial organization literatures studying the welfare effects of changes in differentiated products.
Although the many contributions in these literatures are too numerous to fully summarize here, key papers in trade include Broda and Weinstein (2006), Broda et al. (2006),
and Goldberg et al. (2010), along with the aforementioned work in Feenstra (1994).
On the industrial organization side, significant developments include Berry, Levinsohn,
1
This functional form is sometimes referred to as the Spence-Dixit-Stiglitz demand system after
the work in Spence (1976) and Dixit and Stiglitz (1977).
2
This trend follows from the influential work in McFadden (1974).
3
The latter type of model is sometimes referred to as a mixed logit. Train (2009) defines the mixed
logit model based on the form of the resulting demand system. He then notes that one way of deriving
a mixed logit demand system is through a random coefficients specification.
3
and Pakes (1995) (“BLP”), Goldberg (1995), Berry et al. (1999), Petrin (2002), and
Berry et al. (2004). Khandelwal (2010) is one of the few hybrid papers that uses an
industrial organization model (the NL) with a trade data set. Here I show that the
theory behind this approach is equivalent to the NCES setup.
The closest paper to this one is Blonigen and Soderbery (2010), which uses productlevel data on the US automobile market to study how aggregated HS trade data biases
the NCES price index. They find that using aggregated trade data can greatly understate improvements in the price index. I find a parallel result in my data and then
expand upon this analysis by exploring how CES-style models behave relative to logit
models.
In the next section, I describe the CES-based models and the Feenstra (1994) method
for estimating their price indices. Section 3 lays out the basics of the logit models and
relates them back to the CES-based frameworks, followed by a description of empirical
industrial organization techniques for dealing with product-level data. I present results
from the computer printers example in Section 4, including a discussion of the differences
between the CES-based and logit results. Section 5 concludes.
2
CES-Based Frameworks
The NCES model is popular in empirical trade applications because it yields a price
index that combines changes in price, quality, and variety into one easily interpretable
number. Furthermore, the NCES includes the basic CES framework, the workhorse
model of international trade theory, as a special case.
In order to calculate the NCES price index, Feenstra (1994) provides a simple procedure designed for the typical trade data set. This methodology has become the standard
for measuring the gains from imports of differentiated products. These techniques contrast with those favored in industrial organization, where estimation using product-level
data is more common.
2.1
The Consumer’s Problem
Assume there is a representative consumer that has a utility function given by
Ut =
X
γ−1
γ
γ
! γ−1
,
Mgt
g∈G
4
where γ > 1.
(1)
Here g indexes different groups of products from the set G. The time period is indexed
by t. The quantity consumed of each group is denoted by Mgt , and γ is the elasticity
of substitution between these groups.
Within each product group g the consumer has an inner nested utility function of
the form
σ
 σ−1

X 1 σ−1
, where σ > 1.
(2)
Mgt = 
bjtσ mjtσ 
j∈Jgt
Individual products are indexed by j from the set Jgt . The bjt denotes the quality of
good j, and mjt denotes the quantity consumed of good j. The elasticity of substitution
between products within a group is σ.4 Assuming the nesting structure is reasonable,
one would expect σ > γ, meaning that products within a group are closer substitutes
than those in separate groups.
Each time period the consumer’s problem is to maximize current period utility
subject to a budget constraint. This is an entirely static model, with no borrowing or
saving. The consumer can solve the utility maximization problem in two stages. First,
P
the consumer maximizes Mgt subject to the constraint j∈Jgt pjt mjt = Ygt , where Ygt
is the total money spent on group g. Then the consumer decides on the allocation
of expenditure across groups. This exercise results in the expression for the share of
expenditure allocated to product j within Ygt ,
bjt p1−σ
jt
1−σ .
j∈Jgt bjt pjt
sjt|g = P
(3)
In turn, the share of expenditure devoted to group g out of total expenditure is
1−γ
1−σ 1−σ
b
p
j∈Jgt jt jt
sgt =
1−γ .
P
P
1−σ 1−σ
g∈G
j∈Jgt bjt pjt
P
(4)
Multiplying these two expressions gives the share of expenditure allocated to product
4
An extension of this model is to allow the elasticity of substitution to vary by group, giving σg . I
do not pursue this variant for simplicity, but its mechanics are similar to those discussed in the main
text.
5
j out of the money spent on all product groups,
bjt p1−σ
jt
P
γ−σ
P
1−σ
1−σ
sjt = P
j∈Jgt bjt pjt
j∈Jgt
g∈G
1−σ
bjt pjt
1−γ .
1−σ
(5)
In the special case where σ = γ, the NCES reduces to a simpler framework known
as the CES model. Once the elasticities of substitution in the outer and inner utility
functions are equal, the nesting ceases to have any effect. It is as if all the products are
located in a single group.5 The resulting expenditure shares are given by
sjt = P
1−γ
bjt pjt
1−γ
j∈Jt bjt pjt
.
(6)
The CES is one of the most popular differentiated products demand systems in the
theoretical trade literature.6
2.2
The IIA Problem
When quantifying the gains from changes in differentiated products, it is extremely
important to accurately measure the substitutability between goods. For instance, if
one incorrectly finds that a new product is a poor substitute for existing products,
one will mistakenly conclude that this new product greatly increased welfare. The
CES, although useful in a number of theoretical applications, has a highly restrictive
substitution structure. The industrial organization literature commonly refers to this
issue as the “independence of irrelevant alternatives” problem. It is the struggle to solve
this problem that led to the adoption of the NCES in the empirical trade literature.
In order to illustrate the IIA property, note that according to the CES expenditure
share equation (6) the ratio of the quantity demanded for a pair of products 1 and 2 is
b1t p−γ
m1t
1t
=
,
m2t
b2t p−γ
2t
which does not depend on the other products available. Thus, if a third good is in5
That is, utility reduces to Ut =
P
1
γ
γ−1
γ
j∈Jt bjt mjt
γ
γ−1
.
6
The CES and Cobb-Douglas (which the CES reduces to when γ = 1) models have been featured
in standard trade textbooks such as Helpman and Krugman (1985) and in key theoretical papers such
as Dornbusch et al. (1977), Eaton and Kortum (2002), and Melitz (2003).
6
troduced that is identical to good 1 and very different from good 2, the demand ratio
between 1 and 2 will remain constant. One would expect sales of good 1 to fall relative
to good 2, but the CES model does not allow this to occur.
Another way of expressing this problem is using cross-price elasticities, which in the
case of the CES have the following form:
∂mjt pkt
= (γ − 1)skt
∂pkt mjt
∀j 6= k.7
(7)
∀j, k, l such that j, l 6= k.
(8)
These elasticities have the property that
∂mjt pkt
∂mlt pkt
=
∂pkt mjt
∂pkt mlt
Hausman (1997) argues that because of this property, the CES will overvalue new goods.
If one thinks of a new good being introduced as its price falling from the reservation
level, expression (8) says that demand must flow symmetrically towards the new product
from all other products. This may not be a realistic assumption for many sectors. For
instance, it is unlikely that expenditure will flow equally from an old laser printer and
from an old injet printer to a newly introduced laser printer.
If σ 6= γ, the NCES model can partially alleviate the IIA problem. This effect is
apparent in the NCES cross-price elasticities,
∂mjt pkt
=
∂pkt mjt
(
(γ − 1)skt + (σ − γ)skt|g
(γ − 1)skt
if j and k are in the same group
otherwise
(9)
for all j 6= k. Since it is likely that σ > γ, the (σ − γ)skt|g term should be positive and
thus increase the cross-price elasticity between goods of the same type. This adds a level
of realism compared to the CES. However, the IIA problem remains when comparing
goods within the same group.
2.3
The Price Index
A key reason for the popularity of CES-based models is that they yield a simple price
index for measuring the relative benefits of two sets of goods. In order to understand
this index, imagine that the representative consumer is comparing two possible bundles
7
This elasticity can be derived by noting that mjt =
7
sjt Y
pjt
, where Y denotes income.
of goods, the bundle available in time t and that available in time t + 1. These bundles
may vary in terms of the combination of price, quality, and variety they offer. How
could one derive a metric that reflects the difference in value the consumer assigns to
these bundles?
N CES
One common way to build this metric is to look for a factor, τt+1
, by which the
prices of all goods in period t would have to fall (or rise) in order to give the same
utility as the set of goods available in t + 1. This exercise results in the expression
P
N CES
=
τt+1
g∈G
P
P
g∈G
j∈Jgt+1
P
bjt+1 p1−σ
jt+1
1−σ
j∈Jgt bjt pjt
1
1−γ 1−γ
1−σ
1
1−γ 1−γ
1−σ
,
(10)
which is the standard measure of the gains from imports of differentiated products in
the trade literature.8 In the special case of the CES model, this index reduces to
1
1−γ 1−γ
b
p
j∈Jt+1 jt+1 jt+1
.
1
P
1−γ 1−γ
j∈Jt bjt pjt
P
CES
τt+1
=
(11)
N CES
is actually just a geThe nested structure of the NCES model means that τt+1
ometric average of CES price indices calculated within each group of products.9 That
is,
Y
N CES
CES ωgt+1
τt+1
=
τgt+1
,
(12)
g∈G
where
1
1−σ 1−σ
b
p
j∈Jgt+1 jt+1 jt+1
P
1
1−σ 1−σ
b
p
j∈Jgt jt jt
P
CES
τgt+1
=
8
Y/
This
P
g∈G
9
result
P
and ωgt+1 = P
follows
(sgt+1 − sgt )/(ln(sgt+1 ) − ln(sgt ))
.
g∈G (sgt+1 − sgt )/(ln(sgt+1 ) − ln(sgt ))
from the form of the indirect
1
1−γ 1−γ
1−σ
1−σ
. Here Y denotes income.
j∈Jgt bjt pjt
utility
This result follows from the proof in the appendix of Feenstra (1994).
8
function,
which
is
2.4
Estimation
The next task is to estimate this price index. Most empirical trade papers analyze
welfare effects at an economy-wide level. Given this focus, these authors need a method
that can be applied to customs data and requires as few parameters as possible. The
procedure pioneered by Feenstra (1994) satisfies both of these criteria.
In this section I replace the product subscript j with c for “category.” This is
because the standard method uses trade data, where a good is actually an HS category
imported from a certain country. This aggregates over individual products.
Note that once I switch to defining a product using these categories, the bct parameters no longer have a pure quality interpretation. Feenstra (1994) shows that these
terms not only reflect the quality but also the number of underlying products in each
category. If the number of sub-products in a product category increases, this in turn
increases its associated bct . Thus, bct captures both quality and variety.
Define Cg = (Cgt+1 ∩ Cgt ) as the “common goods” set available for a group in two
different time periods. As shown by Feenstra (1994), so long as one assumes that the
products in Cg have constant bct parameters between t and t + 1, the following holds:
ωgt+1
ωgt+1
Y λgt+1 σ−1
Y pct+1 ωcgt+1


.
=
p
λgt
ct
g∈G
g∈G
c∈C

N CES
τt+1
Y
(13)
g
where
P
c∈Cg
pct mct
c∈Cgt
pct mct
λgt = P
and
(sct+1 (Cg ) − sct (Cg ))/(ln(sct+1 (Cg )) − ln(sct (Cg )))
c∈Cg (sct+1 (Cg ) − sct (Cg ))/(ln(sct+1 (Cg )) − ln(sct (Cg )))
ωcgt+1 = P
Here sct (Cg ) denotes the share of expenditure accounted for by category c out of the
categories in the set Cg .
In the special case of the CES model, this index is calculated without regards to
any groups. There is one common goods set C = (Ct+1 ∩ Ct ), giving
1
CES
τt+1
=
Y pct+1 ωct+1 λt+1 γ−1
c∈C
pct
λt
9
(14)
where
P
pct mct
λt = P c∈C
c∈Ct pct mct
and ωct+1 = P
(sct+1 (C) − sct (C))/(ln(sct+1 (C)) − ln(sct (C)))
.
c∈C (sct+1 (C) − sct (C))/(ln(sct+1 (C)) − ln(sct (C)))
Note that sct (C) denotes the expenditure share of product c in time t amongst the
goods in C.
The advantage of this methodology is that the price index can be computed using
only one parameter, the within-group elasticity of substitution, and widely available
trade data. Feenstra (1994) and Broda and Weinstein (2006) show how to estimate
this elasticity of substitution for different industries. Many other authors assume an
elasticity based on estimates in the literature.
Although this procedure is useful, it depends on the assumption that the common
goods do not experience any change in their bct terms. Therefore, both the quality and
variety of the underlying goods in each product category are assumed to be constant.
If quality or variety improve within the common goods, this will not be reflected in the
price index.
In order to solve this problem, one could switch to product-level data and compute
the price index over short enough time periods so that the common goods set is nonempty. This is the approach studied by Blonigen and Soderbery (2010). Of course, this
method is not feasible for most industries in most countries, where only aggregated trade
data is available. As a result, in practice most researchers assume that HS code/country
pairs appearing in both periods have constant bct parameters.10
2.5
The Price Index Decomposition
A key advantage of the Feenstra
approach
is that it decomposes the price index
ωcgt+1 ωgt+1
Q (1994)
Q
pct+1
(which I call the common goods
into two parts. The g∈G
c∈Cg
pct
gt+1
ωσ−1
Q
λgt+1
term) captures gains from price amongst common products, while the g∈G λgt
(which I call the changing goods term) captures gains from new and disappearing
products. If there has been a large relative increase in spending on new products
in period t + 1, the changing goods term will fall, indicating that the price index has
decreased. If however, these new products are highly substitutable with common goods,
the elasticity of substitution σ will be large, and this term will approach 1.
10
An exception is the original Feenstra (1994) paper, where he reports sensitivity results for several
different common goods sets.
10
There are two differences in the NCES decomposition compared to the CES one.
First, in the NCES case, both the common goods and the changing goods terms are
calculated within each group before being aggregated. This reflects the fact that, assuming the chosen grouping is sensible, a change in a product should affect the competing
products in the same group more than those in other groups.
Second, the NCES decomposition uses σ instead of γ for the elasticity of substitution
in the changing goods term. Note that σ is likely to be greater than γ because it
reflects substitution within groups of similar products instead of substitution across all
products. Therefore, the changing goods term is less likely to find large gains from
new products or large losses from disappearing products. Taken together, these two
differences tend to dampen movements in the NCES index relative to those in the CES
index.
Unfortunately, this method does not decompose the distribution of gains into those
due to price, quality, and variety. The common goods term calculates the gain from
price holding quality and variety constant in an artificial set of goods. Because the
common goods set excludes products in time t that did not appear in time t + 1 and
products in time t + 1 that did not appear in time t, this set is not equal to the actual
choice set in either period.
Furthermore, the changing goods term embeds price, quality, and variety effects,
making it difficult to distinguish them. A new product, for example, may have a large
expenditure share because it is cheap or because it is of high quality.
3
Logit-Based Frameworks
In measuring the gains from trade in differentiated products, the trade literature has
struggled with two issues: (1) how to allow for realistic substitution patterns and (2)
how to estimate price indices without product-level data. The industrial organization
literature on differentiated products has faced these same challenges. In tackling the
substitution problem, industrial organization has turned to two strategies, the first being nesting products, the second being random coefficients. These methods can be combined in a unified framework that I call the “nested random coefficients logit” (NRCL).
In addressing the data aggregation problem, industrial organization has bypassed this
issue by focusing on sectors for which product-level data is available. Because these
trade and industrial organization approaches are aimed at solving the same basic prob-
11
lems, comparing them highlights the costs and benefits of each.
At first glance this comparison appears difficult because of the stark differences in
the standard trade and industrial organization modeling frameworks. The empirical
industrial organization literature favors demand systems based on the MNL discrete
choice setup. Unlike in the CES framework, here there is a population of heterogeneous
consumers, each with unique preferences. Each buyer only purchases a quantity of one
product instead of consuming some of every product.
Nevertheless, Anderson et al. (1992) show that the MNL model produces the same
market demand system and price index as the CES. Building upon this insight, I show
that the NCES and NRCL are also tightly linked. In fact, I find that a special case of the
NRCL model, the NL, generates the exact same price index as the NCES. This result
allows me to separate the comparison based on structural theory differences (which arise
between the MNL, NL, and NRCL) from that based on data aggregation differences
(which arise between the CES and MNL or the NCES and NL).
3.1
The Consumers’ Problem
Assume that there are different types of consumers, with each type indexed by r.
Further assume, as in the NCES model, that goods are separated into groups indexed
r 11
by g ∈ G. In addition, each good within a group g is indexed by j in the set Jgt
. The
utility for consumer i of type r buying good j in group g is
r
urijt = ln(arjt mrijt ) + ζigt
+ rijt .
(15)
Here arjt is a good-specific measure of quality similar to bjt . The mrijt is the quantity of
good j that consumer i chooses to buy.12
r
Meanwhile the ζigt
is a random draw from a logit distribution with scale parameter
r
r
µ1 , and the ijt is a random draw from a logit distribution with scale parameter µr2 .13
11
r
The set of goods available Jgt
may vary by consumer, meaning that some goods can be consumed in
zero quantities by all consumers of a certain type. For example, a home office buyer would not consider
purchasing a large enterprize printer, so it should not be in that consumer’s choice set. Another way
r
to think of this is to assume that the utility from goods outside of Jgt
is zero.
12
In most applications of logit models, a consumer can only buy discrete units (usually one unit) of
a good. However, I change this assumption in order to match the CES model, where consumers can
buy a continuous amount of a good.
13
variable
x is distributed logit if it has a cumulative distribution function of
i
hA random
x
exp − exp − µ + % where µ is the scale parameter and % is Euler’s constant (≈ 0.577). This is
often referred to as a “Type I Extreme Value” distribution.
12
Thus, each consumer has a series of independently and identically distributed (iid)
r
and one for each group g ∈ G.
random draws, one for each product j ∈ Jgt
This utility specification combines two common industrial organization methods for
dealing with the IIA problem. First, products are divided into groups, which allows the
substitution between goods in the same group to differ from that for goods in separate
groups. This method is known as nesting, as in the NCES model. Second, there are
some parameters (arjt , µr1 , and µr2 ) that vary according to a probability distribution
across consumers. As a result, aggregate substitution patterns depend on the mix
of substitution responses found in the population. This method is known as random
coefficients.
Each time period consumer i’s problem is to maximize current period utility subject
to a budget constraint. As with the NCES framework, this is an entirely static problem,
with no borrowing or saving. The budget constraint is given by pjt mrijt = y r where y r
is the consumer’s income. Substituting this constraint into the utility function gives an
indirect utility of
r
r
vijt
= ln(arjt ) − ln(pjt ) + ln(y r ) + ζigt
+ rijt .
(16)
As in the NCES model, the consumers’ problem can be tackled in steps, starting
with the demand for goods conditional on being within a certain product group. When
r
term drops out, reducing the choice problem to
focusing on one group, the ζigt
n
o
max ln(ar1t ) − ln(p1t ) + ri1t , . . . , ln(arJgtr t ) − ln(pJgtr t ) + riJgtr t ,
(17)
r
to refer to both the goods set and
where, in a slight abuse of notation, I have used Jgt
its cardinality. Integrating over the logit random shocks gives,
1
−1
r
µ
r
arjt µ2 pjt2
probrjt|g =
1
P
r
j∈Jgt
−1
r
,
µ
r
arjt µ2 pjt2
which is the conditional probability that any type r consumer will choose good j.14
Turning to the choice of which product group to buy from, the consumer chooses
14
This expression follows from the form of the logit distribution. Specifically, when given the
problem max{d1 + 1 , . . . , dJ + J }, the probability that option j will be the maximum is given by
PJ
exp(dj /µ)/ j=1 exp(dj /µ). Here the j are iid logit random variables with scale parameter µ.
13
the group with the maximum expected indirect utility,
max




µr2 ln 
X
1
−1
µr
2


r
r
arjt µ2 pjt  + ζi1t
, . . . , µr2 ln 
r
j∈J1t
X
1
−1
µr
2

r
r
arjt µ2 pjt  + ζiGt
r
j∈JGt


.
(18)

Again I have used G to refer both to the set and to its cardinality.15 Maximization
results in a group probability of
P
probrgt =
r
j∈Jgt
arjt
1
µr
2
g∈G
P
j∈Jtr
µµr2r
1
pjt
P
−1
µr
2
arjt
−1
µr
2
1
µr
2
µµr2r
pjt
.
1
I now derive the connection between this model and the CES-based frameworks.
This is an extension of the proof in Anderson et al. (1992), which shows the relationship
1
r
= 1−σ r , and µ1r = γ r −1. Then convert
between the CES and MNL. Let arjt µ2 = brjt , −1
µr2
1
probrjt|g and probrgt to (expected) expenditure shares by multiplying and dividing by the
consumer’s income. The resulting expenditure shares are
srjt|g
=P
brjt p1−σ
jt
r
and
P
r
j∈Jgt
srgt =
P
g∈G
(19)
r 1−σ r
r bjt pjt
j∈Jgt
1−σ r
brjt pjt
P
r
j∈Jgt
1−γ r
1−σ
r
1−σ r
brjt pjt
1−γ r .
1−σ
r
(20)
Multiplying these two shares gives
r
srjt = P
r
j∈Jgt
1−σ
brjt pjt
γ r −σrr P
P
1−σ r 1−σ
brjt pjt
g∈G
r
j∈Jgt
1−σ r
brjt pjt
1−γ r .
1−σ
r
(21)
Note that if all types of consumers have identical preferences, meaning that brjt = bjt ,
σ r = σ, and γ r = γ for all r, these formulas collapse down to those in the NCES model.
15
Expression (18) follows from the form of the expected maximum of a series of iid logit random
PJ
variables. That is, E[max{d1 + 1 , . . . , dJ + J }] = µ ln( j=1 exp(dj /µ)), where the j are iid logit
random variables with scale parameter µ.
14
In industrial organization terminology, this case is known as the NL model. Therefore,
the nested method is the same in both trade and industrial organization. If additionally
σ = γ, the model collapses to the basic MNL framework. In that case the expenditure
shares are identical to those in the CES model. Therefore, the difference between the
NCES and the NL or between the CES and the MNL is due to the empirical techniques
usually chosen to estimate them as opposed to differences in the underlying theory.
The market-level share is found by integrating srjt across the distribution of consumer
types. For example, if the distribution is discrete the share is then
sjt =
X
ftr srjt ,
(22)
r∈R
where ftr is the fraction of expenditure accounted for by type r consumers in time t,
and R is the set of all consumer types.16
3.2
Addressing the IIA Problem
The NRCL tackles the IIA problem in two ways. First, this model takes the nesting
approach just as in the NCES framework. Second, this model averages across the
heterogeneous preferences of different types of consumers. The latter method breaks
the IIA property between products in the same nest.
The effect of these two approaches can be seen in the cross-price elasticities, which
have the following form:




∂mjt pkt
=
∂pkt mjt 


1
sjt
P
r∈R
ftr [(γ r − 1)srjt srkt + (σ r − γ r )srjt srkt|g ]
if j and k are
in the same group
−
otherwise
(23)
r
r
for all j 6= k. One would expect σ > γ for all r ∈ R because the nests gather together
similar products. Therefore, just as in the NCES model, the (σ r − γ r ) term should
increase the cross-price elasticity between goods in the same groups.
1
sjt
r
r
r∈R ft (γ
P
1)srjt srkt
16
I focus on the discrete distribution case because this allows me to express the NRCL results as
weighted averages of the NL formulas. This makes the relationship between the NRCL and the other
models particularly transparent. The discrete specification is familiar in the marketing literature (see
Kamakura and Russell (1989) for example), while the specification using a normal distribution has been
popularized by Berry, Levinsohn, and Pakes (1995). The normal distribution does not give closed-form
expressions for market demand (because the gaussian integral must be computed numerically), which
makes it cumbersome for my purposes here.
15
The random coefficients have an added effect. If, for example, type a consumers
have high preferences for laser printers, and both goods j and k are laser printers,
the expenditure shares for type a consumers will be large. In turn, this will place
more weight on type a’s elasticities of substitution, γ a and σ a , in the aggregate crossprice elasticity. Similarly, those types that dislike goods j and k will tend to have less
influence on the cross-price elasticity because their expenditure shares are smaller.
3.3
The Price Index
Although the basic logic of the price index remains the same as in the NCES, in this
model one needs to integrate welfare across consumer types. In the case of a discrete
distribution for consumer types, this aggregation weights each type’s utility by their
share in expenditure in time t + 1. Then imagine taking away the set of goods available
to these consumers in period t + 1 and replacing it with the set from time t. The price
index is the factor by which the prices on the time t goods would have to fall to equalize
the weighted sum of indirect utilities.
The expected indirect utility for a consumer of type r can be calculated by finding
the expected maximum of the choice problem in equation (18). The condition for the
price index is then
1−γ r 
 1−σ
r
X  X r N RCL

1−σ r σ r −1 
bjt (τt+1 pjt )
ln 
y


r
X ft+1
γr − 1
r∈R

g∈G
r
j∈Jgt
1−γ r 
 1−σ
r
r
r

X  X r
1−σ σ −1 
ln 
bjt+1 pjt+1
y
.

=
r
X ft+1
γr − 1
r∈R

g∈G
r
j∈Jgt+1
Solving for the price index itself gives

P
N RCL
τt+1

g∈G
Y

=
 
P
r∈R 
P
g∈G
1−γ r
1−σ
r
r
r
ft+1
1
! 1−γ
r
1−σ
brjt+1 pjt+1
r
j∈Jgt+1
P
1−σ r
r
r bjt pjt
j∈Jgt
1
1−γ r 1−γ r
1−σ
r






.
(24)
Note that this expression is the geometric average of individual NL (or NCES) price
16
rN L
, for each type of consumer, where
indices, τt+1
P
g∈G
rN L
τt+1
=
P
brjt+1 pjt+1
r
j∈Jgt+1
P
1−σ r
g∈G
P
1−σ r
r
r bjt pjt
j∈Jgt
1−γ r
1−σ
r
1
! 1−γ
r
1
1−γ r 1−γ r
1−σ
r
.
(25)
rN L
In turn, as in the NCES model, each τt+1
is a geometric average of individual MNL
(or CES) price indices,
Y
r
rM N L ωgt+1
rN L
)
(26)
=
(τgt+1
τt+1
g∈G
where
1r
1−σ r 1−σ
r
b
p
r
jt+1 jt+1
j∈Jgt+1
1r
P
r 1−σ r 1−σ
r bjt pjt
j∈Jgt
P
rM N L
τgt+1
=
and
r
ωgt+1
3.4
(srgt+1 − srgt /(ln(srgt+1 ) − ln(srgt ))
.
=P
r
r
r
r
g∈G (sgt+1 − sgt )/(ln(sgt+1 ) − ln(sgt ))
Estimation
Given the strong theoretical similarities between the trade and industrial organization methods, much of the difference between these literatures stems from the use of
different empirical techniques. Industrial organization usually focuses on industries for
which detailed product-level data, including information on product characteristics, are
available. As a result, quality parameters for each product can be estimated and the
“common goods assumption” can be dispensed with.
Choose one good that appears in every time period to be the “outside good.” Assign
this product the zero index and assume that br0t = 1 for all consumer types and time
periods.17 This gives
r
p1−γ
0t
r
s0t =
P
1−γ rr .
P
r 1−σ r 1−σ
r bjt pjt
g∈G
j∈Jgt
Then take logs of this equation and subtract it from the log of srjt in equation (21).
17
This can be thought of as re-scaling the qualities of all other goods to be in units relative to the
quality of good 0.
17
After some minor algebraic manipulations following Berry (1994), this results in
ln(srjt ) − ln(sr0t ) =
γr − 1
σr − γ r
r
r
)
−
(γ
−
1)[ln(p
)
−
ln(p
)]
+
ln(b
ln(srjt|g ). (27)
jt
0t
jt
σr − 1
σr − 1
This equation is the basis for estimating the model.
Standard product-level data sets will report expenditure shares and prices. The
only question is how to capture quality in equation (27). This is where product characteristics information is useful. In industries commonly studied in empirical industrial
organization, such as automobiles and electronics, product characteristics are a reasonable proxy for quality. In the case of printers, characteristics such as print speed,
color capability, or paper capacity are easily observed and closely related to quality.
Therefore, the standard practice has been to use data on such features to parameterize
quality in the demand model.
Collect the characteristics for each printer j in a vector denoted by xjt . Then assume
the following:
γr − 1
ln(brjt ) = (xjt − x0t )β r + erjt .
(28)
σr − 1
The β r is a vector of parameters to be estimated and erjt is an error term that allows
for quality unobserved by the econometrician. This gives
ln(srjt ) − ln(sr0t ) = (xjt − x0t )β r − (γ r − 1)[ln(pjt ) − ln(p0t )] +
σr − γ r
ln(srjt|g ) + erjt . (29)
r
σ −1
Equation (29) is an estimating equation that can be run on product-level purchase data
that is categorized by consumer types. Once the parameters have been estimated for
each consumer type, one can calculate the price index by plugging directly into equation
(24), bypassing the common goods assumption entirely. Note that if β r = β, γ r = γ,
and σ r = σ for all r ∈ R, this equation then reduces to the NL model. If in addition
the ln(srjt|g ) term is dropped, this equation then reduces to the MNL model.
Estimation of equation (29) could proceed by ordinary least squares, but this is not
the usual practice. One would expect the coefficient on price to have a positive bias
(making it smaller in absolute value) because printer vendors will tend to set higher
prices for models that have high unobserved quality. In addition, it is likely that ln(sjt|g )
is endogenous, as increased unobserved quality can drive higher within group sales. This
would also induce a positive bias on the ln(sjt|g ) coefficient. Therefore, instruments are
18
found for both the log price and the log share variables.18
One could take a different approach if data broken up by consumer type is unavailable. That is, one could specify a distribution for consumer types, be it discrete
or continuous, and estimate its parameters by matching the market-level expenditure
shares (as in equation (22)) with those observed in the data. Because I have access
to data by consumer type for the computer printers example, I have elected to avoid
wading into these econometric intricacies.19 Introducing this level of complexity into
the model would make the estimation routine less transparent and hamper comparisons
between the NRCL and other logit models.20
3.5
Price Index Decompositions
Another advantage of the product-level estimation approach is that it allows for the
decomposition of changes in the price index into their price, quality, and variety components. That is, here one can step beyond the “common goods” and “changing goods”
breakdown by incorporating the estimated qualities into the Feenstra (1994) methodology. The resulting decomposition is useful in exploring the mechanisms at work behind
movements in the overall price index.
Imagine that there are Jgr goods in consumer type r’s group g choice set in period
t. Order these goods by some metric such as increasing quality, and then assign each
an index 1, . . . , Jgr .21 Next choose a size Jgr group of goods from time period t + 1 that
is representative of the t + 1 distribution of price and quality.22 Order these goods
similarly, and again assign each an index 1, . . . , Jgr .23 Because this set is scaled to be of
size Jgr , it captures the price and quality distribution present in t + 1 but holds variety
18
Treating other product characteristics besides price as endogenous is much less common because
finding an instrument that is uncorrelated with unobserved quality but correlated with observed quality
is difficult.
19
For certain distributions, this procedure can be computationally intensive. In the case of the
normal distribution, integrating up to the market-level shares has to be done numerically, and then
the parameters have to be fitted using a non-linear search. See Knittel and Metaxoglou (2008) and
Dubé et al. (2009) for discussions on the challenges involved in this type of estimation.
20
Readers who are interested in these other estimation methods should consult Berry, Levinsohn,
and Pakes (1995), Nevo (2000), and Train (2009).
21
In theory, the ordering of these goods does not matter, so long as one keeps track of which price
and quality goes with which good. However, the resulting decomposition can be sensitive to which
goods are matched in periods t and t + 1, so it is best to establish a consistent procedure.
22
I discuss one method for choosing such a set in Section 4.
23
This procedure assumes that there are at least as many products available in time t + 1 as in t.
Otherwise, the size of the set Jgr would be defined as the number of goods in time t + 1 and the price
r
r
index would be interpreted as measuring the change relative to the set Jgt+1
instead of relative to Jgt
.
19
constant at the period t level. Repeat this process for each g ∈ G and r ∈ R.
Then the price index decomposition is

N RCL
=
τt+1

Y Y


r∈R
g∈G
Jgr
Y
j=1
pjt+1
pjt
Jgr r
ωjgt+1
Y
j=1
r
brjt
ωjgt+1
σ−1
brjt+1
Jgr
λt+1
Jr
λt g
r
1 ωgt+1
! σ−1
r
ft+1



.
(30)
where
Jr
λt g
PJgr
r
j=1 pjt mjt
,
=P
r
r pjt mjt
j∈Jgt
(srjt+1 (Jgr ) − srjt (Jgr ))/(ln(srjt+1 (Jgr )) − ln(srjt (Jgr )))
r
.
= PJ r
ωjgt+1
g
r
r ) − sr (J r ))/(ln(sr
r )) − ln(sr (J r )))
(s
(J
(J
j=1 jt+1 g
jt g
jt+1 g
jt g
and
r
ωgt+1
=P
(srgt+1 − srgt /(ln(srgt+1 ) − ln(srgt ))
.
r
r
r
r
g∈G (sgt+1 − sgt )/(ln(sgt+1 ) − ln(sgt ))
Here srjt (Jgr ) is the share of expenditure by type r consumers accounted for by good
j out of the goods indexed by 1, . . . , Jgr . Equation (30) has three components. The
first part is a geometric average of price ratios, pjt+1 /pjt , which captures the changes
in price in the set {1, . . . , Jgr }. The second part is a geometric average of quality ratios,
brjt /brjt+1 , which captures changes in quality in the set {1, . . . , Jgr }. The third part is an
expenditure share adjustment, which reflects how much expenditure has shifted to the
greater number of goods that are available outside of the set {1, . . . , Jgr }. This term
captures variety.
It is important to note how this full decomposition compares with the MNL and NL
special cases. In the case of the NL model,
NL
τt+1
=
Y
g∈G
ω
Jg Y
pjt+1 jgt+1
j=1
!ωgt+1
Y
pjt
g∈G
Jg Y
bjt
bjt+1
j=1
ωjgt+1
σ−1
!ωgt+1
Y
g∈G
Jg
λt+1
J
λt g
gt+1
! ωσ−1
.
(31)
where
J
λt g
and
PJg
j=1 pjt mjt
=P
j∈Jgt pjt mjt
(sjt+1 (Jg ) − sjt (Jg ))/(ln(sjt+1 (Jg )) − ln(sjt (Jg )))
ωjgt+1 = PJg
.
(s
(J
)
−
s
(J
))/(ln(s
(J
))
−
ln(s
(J
)))
jt+1
g
jt
g
jt+1
g
jt
g
j=1
Note that sjt (Jg ) is the share of expenditure accounted for by good j amongst expen20
diture on the goods 1, . . . , Jg . In the case of the MNL model,
MNL
τt+1
ω
J J Y
pjt+1 jt+1 Y bjt
=
pjt
bjt+1
j=1
j=1
ωjt+1
γ−1
λJt+1
λJt
1
γ−1
(32)
where
λJt
PJ
j=1 pjt mjt
=P
j∈Jt pjt mjt
(sjt+1 (J) − sjt (J))/(ln(sjt+1 (J)) − ln(sjt (J)))
and ωjt+1 = PJ
.
j=1 (sjt+1 (J) − sjt (J))/(ln(sjt+1 (J)) − ln(sjt (J)))
Here sjt (J) denotes the expenditure share of good j out of the goods 1, . . . , J, in time
t.
The comparison between the NL decomposition and the MNL analog is similar
to the comparison between the CES and NCES decompositions. The price, quality,
and variety terms are calculated first within each group instead of immediately across
all products. This reflects the fact that according to the NL model, changes in all
three forces should have the strongest effect amongst goods that are in the same nest.
Furthermore, the NL uses σ instead of γ in the variety term. Since we expect that
σ > γ, this will tend to lower the variety term relative to that in the MNL.24
In moving from the NL to the NRCL, the random coefficients mean that there are
multiple consumer types that must be averaged over. Therefore the NRCL index allows
the effects of a change in price, quality, or variety to be asymmetric across consumers.
Whether this index is larger or smaller than the NL one depends on how the preferences
of individual consumer types compare to the average preferences of all consumers. The
NRCL may pick up gains (or losses) to minority groups of consumers that wash out in
the market-level data.
4
Empirical Example: Computer Printers
Given the similarities and differences between the models discussed above, it is important to see how they compare in practice. To this end, I apply these methods to the
Indian import market for computer printers over the period 1996 to 2005.
24
The quality term also includes σ, but this difference actually does not have an effect on the
relationship between the NL and MNL indices. This is because the definition of bjt in the estimation
equation causes the σ to cancel out.
21
4.1
Market Background
The information technology sector is one of the fastest growing import markets in India.
The quantity imported of computers and associated peripherals as classified in HS 8471
increased from 0.48 percent of the value of imports in 1996 to 1.40 percent in 2005.
Growth was extraordinarily strong in computer peripherals, HS 847160, rising from
0.08 percent of value in 1996 to 0.32 percent in 2005, a roughly 4 times increase in
share.25
Some of this growth has been spurred on by a liberalization of India’s import policies.
In 1997, India signed the WTO’s Information Technology Agreement. By doing so,
India agreed to lower tariffs on printers from 20 percent ad valorem to 0 percent by
2005. This goal was successfully achieved in the middle of 2005.
There are few local printer producers in India, and those firms that do operate almost
exclusively make dot matrix machines. The Department of Scientific and Industrial
Research, a government agency tasked with promoting technology development in India,
released a report in 1996 on the state of the Indian printer sector. They concluded that
India was unlikely to expand into laser printer manufacturing, even with the help of
foreign direct investment (DSIR (1996)). The report pointed to small local demand
and a poor technology infrastructure as major hurdles. This situation has only slightly
improved today. The firm WeP Peripherals announced the first Indian laser printer
factory in 2003, but their line remains small. No other Indian firm has established a
plant.
Instead of sourcing printers locally, most of the market is served by imports from a
number of multinational brands (such as Canon, Epson, HP, or Xerox). These foreign
companies prefer to do their manufacturing in China and Southeast Asia and then ship
into India. I do not know of any multinational brand or electronics outsourcing firm
which has a printer manufacturing facility in India.26 India, although a growing market,
is still too small to warrant major horizontal foreign direct investment.
Therefore, the Indian computer printer sector is a dynamic differentiated products
market that has seen a lot of growth fueled by imports. This makes it an ideal candidate
in which to study the effects on consumers of changes in differentiated goods.
25
These numbers are computed using the UN Comtrade database.
HP has had plants in India since the late 1990s, but they mostly produce computers. Xerox has a
facility in Rampur, but it makes single-function copiers. There are some plants that make components.
26
22
4.2
Data Description
My data is based upon an extract from the IDC Hardcopy Peripherals Database (IDC
(2008)). This data set tracks sales of individual printer models, listing their name,
quantity sold, and average price for every quarter from the beginning of 1996 to the
second quarter of 2006.27 Average price includes the purchase price and shipping costs,
but not taxes. I have converted these prices into real figures using the Indian consumer
price index.28 Only new sales are reported, not sales of used or refurbished models.
IDC, a market research firm that focuses on technology products, collects this information from retailers, distributors, and online vendors. They claim to track all models
of A2 through A4 size laser and inkjet printers.29 Some observations aggregate a base
configuration and other optional configurations into one model. However, this is not
a major occurrence amongst the machines offered in India because they are mostly
low-end models with few extra options.
I define a printer as a device that can print output from a computer, excluding
portable machines meant for travel. I focus on two technologies: inkjet and laser. These
machines may perform other functions, such as copying or scanning (“multi-function
peripherals” or MFPs). I exclude printers that use impact technologies, which are based
on older typewriter-like designs.30 The IDC data has limited coverage of impact models
and does not report purchases of these categorized by consumer types for most of the
models that are included. Furthermore, I am not able to collect characteristics data for
all of these printers because many were sold by small local firms that do not have their
back catalogs available in print or online. Regardless, laser and inkjet models account
for the vast majority of sales, particularly amongst imported models.31 It is also in
the laser and inkjet categories where most of the improvements in terms of quality and
variety have appeared, as impact is a dying technology. Note also that I limit my data
to non-Indian brands.
27
I also have data through the first quarter of 2008, but these observations have to be dropped
because there is no product that appears in all of these years to serve as the outside good in the logit
models.
28
The CPI is from http://labourbureau.nic.in/ and was downloaded in October 2009.
29
IDC does aggregate some models into an “other” category, but this never accounts for more than
2.4 percent of sales revenues in any quarter. I discard this category because it is not clear exactly
how it is constructed. Prior to 2001, MFPs are not recorded in the database. However, when MFP
tracking begins in 2001 Q1, these machines only comprise 4.5 percent of sales revenue, so it is unlikely
that they formed a large part of the market in the prior period.
30
The main impact technology is dot matrix.
31
When the Indian customs authority began reporting printer import quantities by inkjet and laser
versus dot matrix in the spring of 2003, inkjet and laser accounted for over 90 percent.
23
Importantly for my purposes, IDC also separates the units sold of each inkjet and
laser model into those purchased by different subsets of consumers. These divisions
are home office, 1 to 9 employee establishments, 10 to 99 employee establishments,
100 to 499 employee establishments, and 500 or more employee establishments. Note
that home office buyers may include family businesses. In order to form the set R
of consumer types for the NRCL model, I aggregate this data into two categories:
home office or 1 to 9 employee establishments (which I call “small” consumers) and
10 or more employee establishments (which I call “large” consumers). I choose to
use only two consumer types in order to keep the comparisons between the random
and non-random coefficients specifications simple. One would expect that this group
of home office and small firms would exhibit distinct buying patterns relative to the
average behavior in the data. Although larger establishments would consider using a
department-sized laser printer, for example, most small buyers would not because of
the set-up costs, technological expertise, and physical space required. Those costs are
not justified for a small establishment that will not print high volumes.
In order to accurately measure the quality of these printers, I need to know something about their characteristics. IDC provides some basic information, categorizing
models into different bins based on technology (laser, inkjet), function (single, MFP),
color versus monotone printing, and page per minute (PPM) speed (1-10 PPM, 11-20
PPM, etc.). In order to enrich my analysis, I supplement this data with characteristics
collected from manufacturer’s websites and from printer specification sheets published
by the firm Buyers Laboratory.32
I can not find data for all the models reported in the IDC dataset, meaning that
some observations are dropped. After these exclusions, I am left with data for about
96 percent of sales revenue and 97 percent of laser and inkjet units sold in the original
IDC data set. This data cover 1198 unique models. Summary statistics are presented
in Table 1.
There is an important omitted variable from the characteristics listed in Table 1. In
particular, I do not have information on the maintenance costs of each printer model.
These are largely due to printer cartridges (though they also include other factors
like paper and electricity). However, upon examining the industry literature, I have
32
If one or two characteristics for a model can not be found, they are imputed from similar models
of the same brand, or, if those are not available, from similar models across all brands in a given
quarter. This affects 10 percent of observations in the final sample. I convert printing speeds listed in
characters per second to PPM by assuming a rate of 4000 characters per page.
24
found that few estimates of these costs are available. The statistics that do exist
indicate that running costs vary strongly with the technology of the printer (laser or
inkjet) and with whether or not the machine prints in color. Therefore, in estimating
the demand models, I use dummies for these characteristics to proxy for this omitted
variable. Unfortunately, this means that I cannot separately identify tastes for these
technologies from preferences for their maintenance costs.33
I supplement this printer data with information on Indian printer tariffs and exchange rates. Tariff data covering 1996 to 2001 are from Khandelwal and Topalova
(2010) for HS category 847160. I extend this data to 2006 using announcements of
changes to the tariff schedule published by the Indian Central Board of Excise and
Customs.34 I obtain information on the quarterly exchange rate between India and
the US, Japan, South Korean, China, and the European Union from Global Financial
Data.
4.3
Identification
In order to estimate the logit-based models, I need instruments for the log price and
log group share variables. When faced with this situation, industrial organization researchers often struggle to find plausibly exogenous instruments that exhibit enough
time-series and cross-sectional variation to be useful. In my empirical application, I can
leverage the international trade aspect of my data to solve this problem.
As discussed above, the vast majority of sales in the Indian printer market (and all
sales in my data) are accounted for by foreign brands. These brands’ parent companies
are located in the US (such as Xerox and HP), Japan (such as Canon and Ricoh), South
Korea (Samsung), China (Lenovo), and the European Union (Océ). Therefore, any revenues these corporations make by selling printers in India must ultimately be converted
from Indian rupees into their home currency in order to become part of their bottom
lines. As such, the exchange rate between their home currency and the Indian rupee
should affect the prices that are set and in turn affect expenditure shares. However,
given that the buyers of printers in this market are largely small Indian firms that only
33
A related concern is that upfront printer prices may be uninformative if printer vendors are pursuing a strategy of lowering the prices of printers in order to make money on printer cartridges. Although
such a strategy has been used in the US, it is much less prevalent in India because of the high penetration of third party and counterfeit cartridges. IDC estimates that over 50 percent of the cartridges
sold in India are made by third parties.
34
These announcements are available at http://www.cbec.gov.in/ and were accessed in July 2010.
25
operate domestically, the exchange rate should not affect demand independently.35
Thus, I use these exchange rates to build two instrumental variables. First, I take
the exchange rate for the headquarters currency of each brand. Second, I form the
average exchange rate of each model’s rival products in the same IDC product type.36
For example, imagine that there are three models in a certain category, one from Japan
and two from the US. Then the instrument for the Japanese model would be the US
exchange rate, while the instrument for each of the US models would be the average of
the US exchange rate and the Japanese exchange rate. This second instrument helps
me capture the variation in pricing competition across types of printers.
I also construct a third instrumental variable based on tariffs. As mentioned above,
India pursued a dramatic liberalization in the computer printer sector over the time
period I study, zeroing out most printer tariffs. This fall in taxes was mandated by a
WTO agreement covering a number of information technology products, and hence is
probably unrelated to unobservables in printer demand. Furthermore, tariffs are likely
to be correlated with printer prices and in turn with within group expenditure shares,
while not entering into demand separate from their effect on price.
4.4
Overview of Results
I begin by discussing how I estimate the logit-based models. Recall that I have three
logit variants: the MNL, the NL, and the NRCL. In the interest of simplicity, I choose
to define just two product groups in the nested models, one for inkjet and the other for
laser.
The estimating equations for all three models are combined in equation (29). Simply drop the consumer type distinction in order to reduce the NRCL to the NL and
further drop the log group share term in order to reduce the NL to the MNL model.
Because there are two types of consumers in the NRCL model, I have three equations to
estimate: one equation without consumer types, one equation for small consumers, and
one equation for large consumers. I merge all of these equations into one by stacking
35
There are some drawbacks to this approach. Most prices for printers are probably set initially in
Indian rupees, not set in foreign currencies and converted. If there are menu costs, this may mean that
prices are sticky in Indian rupees and hence less sensitive to exchange rate movements. In addition,
the exchange rate may be affected by domestic policy controls or general equilibrium effects that are in
turn related to local demand factors. Nevertheless, the exchange rate is one of the few variables that
exhibits strong variation across time periods while also having a reasonable probability of satisfying
the exogeneity requirement.
36
See the first column of Table 2 for a list of these categories.
26
the observations for all three and interacting the independent variables with a constant
and with dummies for whether an observation is for small buyers and for whether an
observation is for large buyers. I estimate this equation using both least squares and
instrumental variables methods.
I need to difference the data with respect to one printer model, the outside good.
The natural choice is a dot matrix printer, since that is the most common alternative
to the laser and inkjet models included in my main dataset. As previously mentioned, I
only have data on selected dot matrix models, but there is one candidate that appears
in all the years from 1996 to 2005, the Panasonic KX-P1150. This is the product that
I take as my outside good.37
The parameter estimates appear in Table 3. Although I estimate all the models
stacked into one equation, I separate the results into their three component equations
(MNL/NL, NRCL small, and NRCL large) to make the numbers easier to interpret.
In all regressions, instrumenting appears to remove a positive bias on the log price
and log group share coefficients. The price estimate becomes more negative and the
share coefficient becomes less positive. This result accords with the hypothesis that
the log price and the log share variables are positively correlated with unobserved
quality. Note also that instrumenting tends to raise the resulting estimates of σ and
γ, the elasticities of substitution. The first-stage F statistics are all above 50, and the
overidentification test statistics are not significant at conventional levels for any of the
models.38 In what follows, I use the IV results as my preferred specification.
The coefficients indicate that nearly all characteristics increase quality relative to
that for the Panasonic KX-P1150, which has relatively low characteristics. The only
exceptions is resolution, which may result from the fact that a unless they print photos
regularly, many buyers have little use for ultra-high resolution machines. The latter
point reveals a potential heterogeneity between consumer types that I return to in
discussing random coefficients.
The coefficient on the log group share is always highly significant, indicating that
a nested model is appropriate for this data set. This result is to be expected, because
37
I do not have consumer-level data for this model, so I assume that 5 percent of reported sales
are made up of small consumers. This estimate is based on IDC sales data for two other dot matrix
models sold from 1998 to 2003.
38
I use both the homoskedastic and heteroskedasticity robust overidentification tests suggested by
Wooldridge (2002). These F test and overidentification test results hold regardless of whether the three
equations (no consumer types, small consumers, and large consumers) are estimated stacked together
or separately.
27
there is a natural dichotomy in printers between laser and inkjet models.
As for the random coefficients, F tests on the interactions between dummies for
consumer type and the independent variables indicate that these are jointly significant.
Therefore, the NRCL model is the best fit. Focusing on individual coefficients, I find
that small consumers’ coefficients on the color dummy, laser dummy, and network
interface dummy are significantly lower and the coefficient on resolution is significantly
higher compared to those in the non-random coefficient models. Meanwhile, the large
consumers’ coefficient on resolution is significantly lower and the coefficient on the laser
dummy is significantly higher.39 These results indicate that there is some variation in
preferences that the MNL and NL models mask.
A selection of own- and cross-price elasticities appear in Table 4. Each entry is
the percentage change in quantity sold of the row good in response to a percentage
change in price of the column good. In the top panel, which contains the results for the
MNL model, I find that the cross-price elasticity is identical down each column, which
occurs because of the IIA property. Once I shift to the NL model, these elasticities vary
depending on which group (inkjet or laser) the row good is in. Models that are in the
same group as the column good have much larger elasticities compared to those that
are in different groups. Finally, the NRCL elasticities also vary within groups, due to
the random coefficients.
The results for the 1996 versus 2005 price indices are presented in Table 5. These
annual figures average over the results for the first quarter of 1996 versus the first
quarter of 2005, the second quarter of 1996 versus the second quarter of 2005, and so on.
Following Broda and Weinstein (2006), I construct bootstrapped 95 percent confidence
intervals by sampling 100 times from the estimated joint normal distribution of the
regression parameters in Table 3 and calculating the price index for each sample.
In order to facilitate comparisons between the CES-based and logit-based models,
I set the elasticities of substitution in the CES and NCES calculations to the numbers
estimated for the NL and MNL models (Table 3, second column). Remember that
these parameters are the only ones needed in the CES and NCES calculations. The
other input is the aggregated price and expenditure share data. I form this data by
taking the share-weighted average price and the total expenditure for each product
category/country combination reported in Table 2.
In interpreting the numbers in Table 5, note that the index is calculated between
39
These results are based on separate 5 percent t tests for each coefficient.
28
1996 and 2005. Therefore, the index is the factor by which one would have to multiply
the prices of all goods in 1996 in order to give the same welfare as the goods available in
2005. For instance, the 0.061 CES index means that 1996 prices would have to fall by
93.9 percent in order to make consumers as well off as in 2005. All five indices estimate
dramatic falls in the 1996 prices, ranging from about 84 percent to 96 percent. However,
there are some subtle differences. The aggregated indices tend to be higher than their
logit counterparts (compare the CES and MNL or the NCES and NL). Nesting tends to
raise the index (compare the CES and NCES or the MNL and NL). Including random
coefficients appears to slightly lower the index (compare the NL and NRCL).
Figure 1 shows how the 1996 versus 2005 price index developed over the intervening
years. For each year from 1997 to 2005, I calculate the price index with respect to
1996, thus showing how much of the 1996 to 2005 change had occurred by that year.
Broadly speaking, all of the indices for the five models move together. The descent in
the indices over the years is reasonably smooth, except for a large drop between 1997
and 1998. This occurs because 1997 was a year where the price indices exhibited losses
(particularly in the second quarter) when some firms withdrew products.
The decompositions into common and changing goods effects (for the CES models)
or price, quality, and variety effects (for the logit models) appear in Table 6.40 In
constructing the logit-based decompositions, I need to choose a subset of goods from
the 2005 set of products available that is the same size as the set available in 1996.
Keeping with the spirit of the price, quality, and variety breakdown, I build a subset
that approximates the joint distribution of price and quality available in 2005, but
scales it to have the same variety (number of products) as in 1996. In this way, the
price and quality terms in the decomposition will give a good approximation of the
relative changes in price and quality between 1996 and 2005.
For example, suppose that there are X products available in 1996 and Y > X
products available in 2005. I split the 2005 ranges of price and quality into 5 different
percentile bands, giving 25 price/quality bins. I then randomly choose models at a rate
of X/Y from each bin. This procedure gives me a set of 2005 goods that is the same
size as the set of 1996 goods, and then I calculate the decomposition using these sets.
In order to ensure that my findings are not driven by a particular sample, I repeat this
procedure 100 times for different samples, and average across the resulting terms to get
my final numbers.
40
The common goods effects do not have confidence intervals because those terms do not use estimated parameters. They are constructed using only average price data.
29
In comparing the CES and NCES decompositions, the largest difference comes in
the changing goods term, which rises noticeably in the NCES case. As for the logit
decompositions, the price and quality terms are broadly similar across models, while
the variety term rises markedly in the NL and NRCL models. These differences hint at
the importance of the nested elasticity of substitution, σ.
Therefore, an initial scan of the results suggests some important differences between
the five price indices. I discuss these trends in more detail in the following sections.
4.5
Disaggregated Data: CES/NCES vs. MNL/NL
I begin by examining the effect that using product-level data has on the resulting price
indices by comparing the CES with the MNL and the NCES with the NL. Because the
price index formulas in these models are identical, any actual differences in the results
are due entirely to divergent empirical techniques. I find that the CES and NCES
methods cannot distinguish developments within product categories because of these
models’ reliance on aggregated data. As a consequence, the CES and NCES indices
tend to understate gains (or losses) from changes in products.
This pattern is already somewhat apparent in Figure 1. The CES-based indices
fall steadily while the logit indices bounce around. This difference occurs because the
product-level data allow the logit indices to capture subtle changes in the products
offered that the aggregated data miss.
On net, the MNL and NL indices find a larger gain over 1996 to 2005 when compared
to the CES and NCES indices, respectively. This distinction is statistically significant,
as bootstrapped 95 percent confidence intervals for the difference between the 1996/2005
MNL and CES indices and the 1996/2005 NL and NCES indices do not include zero.41
In terms of economic significance, the MNL index is about 42 percent lower than the
CES index, and the NL index is about 31 percent lower than the NCES index. This result is similar to that in Blonigen and Soderbery (2010), who find that using aggregated
trade data understates the improvements in the price index from increased variety in
the US automobile market.
Thus, it appears that using aggregated data tends to understate improvements in
the price index. One way to explore this point is to take a closer look at the common
41
In order to build these confidence intervals, I sample from the estimated asymptotic distribution
of the regression parameters in Table 3 to form 100 simulated parameter sets. Then I calculate the
indices for each sample and take the difference between the MNL and CES indices and the NL and
NCES indices.
30
goods terms. These terms are the price indices calculated using only goods categories
that appear in both time periods. By assumption, these categories are supposed to have
constant quality and variety. However, because printer firms are constantly tweaking
their offerings, this assumption is unlikely to hold. I can assess the effect this has on
the common goods term by calculating the price index for the common goods using
the product-level data underlying these categories. That is, I apply equation (11)
and equation (10) to the individual printer models that are within the common goods
categories.
I graph the resulting “common goods terms” in Figures 2 and 3. The terms constructed using product-level data are lower than those using aggregated data in all years
except 1997. Hence, the product-level data reveal that there were significant increases
in quality and variety (or decreases in the second quarter of 1997) within the common
goods. By assuming these changes away, the standard CES and NCES indices have
missed these movements in the price index.
4.6
Nesting Products: CES/MNL vs. NCES/NL
Another important point of comparison is between the nested and non-nested models.
Nesting allows the elasticity of substitution to be larger within groups (where goods
tend to be more similar) than between groups. As a result, I find that improvements
in the price index tend to be dampened in the nested frameworks.
Comparing the overall 1996/2005 indices in Table 5, the CES is about 62 percent
lower than the NCES and the MNL is about 68 percent lower than the NL. Based on
bootstrapped 95 percent confidence intervals, these differences are statistically significant.
Nesting also has a noticable effect on the price index decompositions in Table 6. As
the elasticity of substitution within groups rises between the CES and NCES models, the
changing goods term becomes larger. Indeed, the CES term is about 44 percent lower
than the NCES one, and the bootstrapped 95 percent confidence interval indicates that
the difference is statistically significant. This reflects the fact that new and disappearing
goods have a smaller effect on utility when they are more substitutable with some
existing common goods.
Similarly, the variety terms in Table 6 also rise when nesting is implemented. The
MNL variety term is about 67 percent lower than that for the NL. The bootstrapped
31
95 percent confidence interval indicates that this difference is significant.42 The change
occurs because increasing variety has less of an effect on utility when the elasticity of
substitution between some goods increases due to nesting. Therefore, it is the variety
channel that is most affected by nesting.
Some of the most dramatic results from nesting appear in the cross-price elasticities.
As Table 4 shows, the NL cross-price elasticities between printers in the same product
group are on the order of 10 times larger than those in different groups. Meanwhile
the elasticities are exactly constant across printers in the MNL case. As a result,
improvements in the printers available are not as highly valued in the NL model, since
this framework recognizes that there are some good substitutes already on the market.
4.7
Random Coefficients: NL vs. NRCL
The final comparison is between the NL and the NRCL, which highlights the effect of
random coefficients. These coefficients allow the model to better reflect the spread of
tastes across consumers, which can be obscured in market-level data. Here I find that
small consumers experience larger gains than average due to improvements in printers.
Although the overall NRCL index is similar to the NL, this similarity masks heterogeneity between consumer types. See Figure 4, which graphs the indices for small and
large types (according to equation (25)) alongside the overall NRCL and NL indices.
The price index for small consumers is always lower than that for large consumers. However, large consumers account for about 80 percent of expenditures. As a result, the
NL model tracks the large index more closely, whereas the NRCL is pulled downwards
because it incorporates small buyer preferences.
In turn, the NRCL price index for 1996 versus 2005 is lower than the NL index,
although the effect is statistically insignificant. The statistically significant distinction
comes in examining the price index for small buyers, which is 53 percent lower than the
NL index. The NL only distinguishes average sales patterns, so it does not recognize
that certain new goods may have a greater effect on some buyers compared to others.
In this case the NL is influenced mainly by the behavior of large consumers and fails
to pick up improvements for small consumers. This difference has a small effect on the
overall price indices, but it is important when assessing the distributional consequences
of improvements in differentiated products.
42
The differences across these models for the price and quality terms are also significant, though
they are much smaller in economic terms.
32
When compared to the results from disaggregating data and nesting products, the
effects of introducing heterogeneity in preference coefficients are less pronounced in the
market-level index. This is related to the nature of the computer printer example,
where products clearly vary a great deal at the model level and where there is an
obvious nesting structure (inkjet versus laser). Heterogeneity in coefficients is a lessobvious concern. Indeed, Table 4 shows that nesting plays a far more important role
in addressing the IIA problem and as a result has the most dramatic effect on the
price indices. That being said, I do find that using random coefficients allows the
resulting price indices to better reflect the preferences of minority consumer types.
An NRCL specification with a richer distribution of consumer preferences (such as a
normal) may produce even more realistic results, although these would come at a higher
computational cost.
5
Conclusion
Although the approaches that international trade and industrial organization take with
regards to differentiated products appear quite different, I find that the underlying
theories are actually closely related. The CES and the MNL models yield the same
demand system and price index, and this fact greatly facilities the comparison between
a number of common variations on these frameworks. In effect, the NCES, NL, and
NRCL are all just ways of addressing the IIA problem. The only differences come in
how these models are used empirically (using aggregated versus product-level data) and
in how they tackle IIA (using nesting versus using random coefficients).
In the computer printers empirical application, I find that aggregated data methods understate the gains from differentiated products in the price index. This occurs
because these data mask product-level improvements in the goods available on the
market. Meanwhile, non-nested models exaggerate improvements in the price index because these frameworks underestimate the substitutability between products. Finally,
incorporating random coefficients improves the model’s ability to match the spread of
preferences in the population.
Although these results apply to this one empirical example, they also highlight two
general lessons that apply to all studies on the effects of changes in differentiated products. First, it is the nature of aggregated data to obscure some movements in quality
and variety at the underlying product level. This caveat must be kept in mind when
33
using the common goods assumption to calculate price indices. Second, the CES and
MNL models fall prey to the IIA problem, which greatly limits the substitution structure between goods and can distort the gains from differentiated products. It is the
constant search for ways in which to alleviate this problem that has produced innovations in demand modeling, such as nesting products and using random coefficients. One
should be sure to choose a flexible option, given the strictures imposed by the current
research question and the data available.
An interesting avenue for future research would be to examine the ramifications
of other demand models on the price index. Several authors have proposed different
frameworks in order to address issues with the CES and MNL beyond just IIA. Feenstra (2009), for example, suggests using a translog specification in order to avoid the
constant markups that obtain in the CES under monopolistic competition. Ackerberg
and Rysman (2005) modify the MNL in order to address the overvaluing of variety
caused by each good having its own logit shock. Gowrisankaran and Rysman (2009)
extend the random coefficients logit so as to deal with durable goods. Exploring advancements such as these would complement the findings in this paper and provide
further insight into the manner in which both trade and industrial organization should
approach differentiated products.
34
References
Ackerberg, Daniel A. and Marc Rysman, “Unobserved Product Differentiation in
Discrete Choice Models: Estimating Price Elasticities and Welfare Effects,” RAND
Journal of Economics, 2005, 36 (4), 771–778. [34]
Anderson, Simon P., André de Palma, and Jacques-François Thisse, Discrete
Choice Theory of Product Differentiation, MIT Press, 1992. [12, 14]
Berry, Steven, “Estimating Discrete-Choice Models of Product Differentiation,”
RAND Journal of Economics, 1994, 25 (2), 242–262. [18]
Berry, Steven, James Levinsohn, and Ariel Pakes, “Automobile Prices in Market
Equilibrium,” Econometrica, 1995, 63 (4), 841–890. [3, 15, 19]
, “Voluntary Export Restraints on Automobiles: Evaluating a Trade Policy,” American Economic Review, 1999, 89 (3), 400–430. [4]
, “Differentiated Products Demand Systems from a Combination of Micro and Macro
Data: The New Car Market,” Journal of Political Economy, 2004, 112 (1), 68–105.
[4]
Blonigen, Bruce A. and Anson Soderbery, “Measuring the Benefits of Foreign
Product Variety with an Accurate Variety Set,” Journal of International Economics,
2010, 82 (2), 168–180. [4, 10, 30]
Broda, Christian and David E. Weinstein, “Globalization and the Gains from
Variety,” Quarterly Journal of Economics, 2006, 121 (2), 541–585. [3, 10, 28]
Broda, Christian, Joshua Greenfield, and David E. Weinstein, “From Groundnuts to Globalization: A Structural Estimate of Trade and Growth,” NBER Working
Paper, September 2006, No. 12512. [3]
Dixit, Avinash K. and Joseph E. Stiglitz, “Monopolistic Competition and Optimum Product Diversity,” American Economic Review, 1977, 67 (3), 297–308. [3]
Dornbusch, Rudiger, Stanley Fischer, and Paul A. Samuelson, “Comparative
Advantage, Trade, and Payments in a Ricardian Model with a Continuum of Goods,”
American Economic Review, 1977, 67 (5), 823–839. [6]
DSIR, “Technology in Indian Laser Printers Industry,” Technology Status Report, August 1996. [22]
Dubé, Jean-Pierre, Jeremy T. Fox, and Che-Lin Su, “Improving the Numerical Performance of BLP Static and Dynamic Discrete Choice Random Coefficients
Demand Estimation,” mimeo, May 2009. [19]
35
Eaton, Jonathan and Samuel Kortum, “Technology, Geography, and Trade,”
Econometrica, 2002, 70 (5), 1741–1779. [6]
Feenstra, Robert C., “New Product Varieties and the Measurement of International
Prices,” American Economic Review, 1994, 84 (1), 157–177. [1, 3, 4, 8, 9, 10, 19]
, “Measuring the Gains from Trade under Monopolistic Competition,” mimeo, June
2009. [34]
Goldberg, Pinelopi Koujianou, “Product Differentiation and Oligopoly in International Markets: The Case of the US Automobile Industry,” Econometrica, 1995, 63
(4), 891–951. [4]
Goldberg, Pinelopi Koujianou, Amit Khandelwal, Nina Pavcnik, and Petia
Topalova, “Imported Intermediate Inputs and Domestic Product Growth: Evidence
from India,” Quarterly Journal of Economics, 2010, 125 (4), 1727–1767. [3]
Gowrisankaran, Gautam and Marc Rysman, “Dynamics of Consumer Demand
for New Durable Goods,” mimeo, February 2009. [34]
Hausman, Jerry A., “Valuation of New Goods Under Perfect and Imperfect Competition,” in Timothy F. Bresnahan and Robert J. Gordon, eds., The Economics of
New Goods, Chicago: University of Chicago Press, 1997. [7]
Helpman, Elhanan and Paul R. Krugman, Market Structure and Foreign Trade,
MIT Press, 1985. [6]
IDC, “Worldwide Quarterly Hardcopy Peripherals Tracker,” electronic resource, accessed April 2008. [23]
Kamakura, Wagner A. and Gary J. Russell, “A Probabilistic Choice Model for
Market Segmentation and Elasticity Structure,” Journal of Marketing Research, 1989,
26 (4), 379–390. [15]
Khandelwal, Amit, “The Long and Short (of) Quality Ladders,” Review of Economic
Studies, 2010, 77 (4), 1450–1476. [4]
Khandelwal, Amit and Petia Topalova, “Trade Liberalization and Firm Productivity: The Case of India,” forthcoming in The Review of Economics and Statistics,
January 2010. [25]
Knittel, Christopher R. and Konstantinos Metaxoglou, “Estimation of Random Coefficient Demand Models: Challenges, Difficulties, and Warnings,” mimeo,
October 2008. [19]
Krugman, Paul R., “Increasing Returns, Monopolistic Competition, and International Trade,” Journal of International Economics, 1979, 9 (4), 469–479. [1]
36
McFadden, Daniel, “Conditional Logit Analysis of Qualitative Choice Behavior,” in
Paul Zarembka, ed., Frontiers in Econometrics, New York: Academic Press, 1974.
[3]
Melitz, Marc J., “The Impact of Trade on Intra-Industry Reallocations and Aggregate
Industry Productivity,” Econometrica, 2003, 71 (6), 1695–1725. [6]
Nevo, Aviv, “A Practioner’s Guide to Estimation of Random-Coefficients Logit Models of Demand,” Journal of Economics and Management Strategy, 2000, 9 (4), 513–
548. [19]
Petrin, Amil, “Quantifying the Benefits of New Products: The Case of the Minivan,”
Journal of Political Economy, 2002, 110 (4), 705–729. [4]
Spence, Michael, “Product Differentiation and Welfare,” American Economic Review: Papers and Proceedings, 1976, 66 (2), 407–414. [3]
Train, Kenneth, Discrete Choice Methods with Simulation, Cambridge University
Press, 2009. [3, 19]
Wooldridge, Jeffrey M., Econometric Analysis of Cross Section and Panel Data,
MIT Press, 2002. [27]
37
Figure 1: All Price Indices, 1996-2005
Notes: These indices are calculated at the quarterly level (comparing the first quarter of 2005 to the first quarter of
1996, for example), and then averaged over all four quarters. The index for each year takes 1996 as the base year.
38
Figure 2: CES Common Goods Terms, 1996-2005
Notes: These indices are calculated at the quarterly level (comparing the first quarter of 2005 to the first quarter of
1996, for example), and then averaged over all four quarters. The index for each year takes 1996 as the base year.
Figure 3: NCES Common Goods Terms, 1996-2005
Notes: These indices are calculated at the quarterly level (comparing the first quarter of 2005 to the first quarter of
1996, for example), and then averaged over all four quarters. The index for each year takes 1996 as the base year.
39
Figure 4: NL and NRCL Indices, 1996-2005
Notes: These indices are calculated at the quarterly level (comparing the first quarter of 2005 to the first quarter of
1996, for example), and then averaged over all four quarters. The index for each year takes 1996 as the base year.
40
Table 1: Summary Statistics
Variable
Price (USD)
Units Sold
Color Dummy
BW PPM Speed
RAM (MB)
Resolution (DPI)
A3 Capable Dummy
Footprint (in2 )
Ethernet Interface Dummy
MFP Dummy
Laser Dummy
Number of Model-Quarters
Number of Unique Models
Mean
604.996
1059.538
0.468
20.806
49.764
1336.798
0.355
416.175
0.343
0.356
0.663
6413
1189
Standard Deviation
1084.060
4763.041
0.499
13.430
98.272
771.611
0.478
381.911
0.475
0.479
0.473
Notes: Data sources are in the text, Section 4. Price is in real 2001 Indian
Rs, then converted to USD at 1 Rs=47.12 USD. “BW PPM Speed” is the
maximum number of pages per minute that can be printed in black and white.
41
Table 2: Product Categories
Product Type
MFP Color Inkjet 1-10 PPM
MFP Color Inkjet 11-20 PPM
MFP Color Inkjet 21 PPM or more
MFP Color Laser 1-10 PPM
MFP Color Laser 11-20 PPM
MFP Color Laser 21-30 PPM
MFP Color Laser 31-44 PPM
MFP Mono Inkjet All Speeds
MFP Mono Laser 1-20 PPM
MFP Mono Laser 21-30 PPM
MFP Mono Laser 31-44 PPM
MFP Mono Laser 45-69 PPM
MFP Mono Laser 70-90 PPM
Printer Color Inkjet 1-10 PPM
Printer Color Inkjet 11-20 PPM
Printer Color Inkjet 21 PPM or more
Printer Color Laser 1-10 PPM
Printer Color Laser 11-20 PPM
Printer Color Laser 21-30 PPM
Printer Color Laser 31-44 PPM
Printer Mono Inkjet All Speeds
Printer Mono Laser 1-20 PPM
Printer Mono Laser 21-30 PPM
Printer Mono Laser 31-44 PPM
Printer Mono Laser 45-69 PPM
Printer Mono Laser 70-90 PPM
Japan
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
US
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
X
Korea
X
X
EU
X
X
X
X
X
X
X
X
Notes: Product types are from the IDC taxonomy. “PPM” stands for pages
per minute.
42
43
Small
OLS Coefficient IV Coefficient
-0.655***
-1.420***
(0.030)
(0.085)
0.854***
0.738***
(0.009)
(0.048)
0.653***
1.153***
(0.097)
(0.131)
0.422***
0.454***
(0.041)
(0.051)
0.162***
0.280***
(0.030)
(0.036)
2.840***
0.469
(0.346)
(0.425)
0.684***
1.410***
(0.056)
(0.129)
0.119
0.717***
(0.091)
(0.250)
-0.094
0.316***
(0.061)
(0.083)
0.687***
0.853***
(0.042)
(0.056)
0.253**
1.414***
(0.102)
(0.152)
1.655***
2.420***
(0.030)
(0.085)
5.487***
6.419***
(0.299)
1.174
2852
2852
Large
OLS Coefficient IV Coefficient
-0.541***
-1.480***
(0.019)
(0.072)
0.887***
0.830***
(0.006)
(0.038)
0.446***
1.413***
(0.044)
(0.095)
0.276***
0.506***
(0.014)
(0.033)
0.177***
0.237***
(0.016)
(0.019)
1.345***
-1.310***
(0.184)
(0.256)
0.430***
1.414***
(0.034)
(0.076)
0.078*
0.436***
(0.040)
(0.077)
0.075**
0.466***
(0.033)
(0.043)
0.639***
0.807***
(0.028)
(0.045)
2.152***
3.783***
(0.054)
(0.156)
1.541***
2.480***
(0.019)
(0.072)
5.799***
9.702***
(0.325)
(2.294)
5944
5944
Notes: * indicates 10% significance, ** indicates 5% significance, and *** indicates 1% significance. Heteroskedasticity robust standard errors are in
parentheses. All regressions include a constant. All variables are differenced with respect to the Panasonic KX-P1150. Small consumers are home
office buyers or 1 to 9 employee establishments. Large consumers are 10 or more employee establishments. “BW PPM Speed” is the maximum number
of pages per minute that can be printed in black and white.
Number of Observations
Implied σ
Implied γ
Laser Dummy
MFP Dummy
Ethernet Dummy
Footprint
A3 Dummy
Resolution
RAM
BW PPM Speed
Color Dummy
Ln(Group Share)
Variable
Ln(Price)
Table 3: Logit Regression Results
No Types
OLS Coefficient IV Coefficient
-0.594***
-1.521***
(0.019)
(0.068)
0.879***
0.810***
(0.006)
(0.035)
0.465***
1.413***
(0.044)
(0.088)
0.286***
0.502***
(0.014)
(0.031)
0.170***
0.232***
(0.015)
(0.018)
2.025***
-0.584**
(0.177)
(0.239)
0.506***
1.462***
(0.033)
(0.075)
0.0890**
0.452***
(0.040)
(0.076)
0.0898***
0.484***
(0.031)
(0.043)
0.658***
0.852***
(0.026)
(0.039)
1.563***
3.169***
(0.053)
(0.136)
1.594***
2.521***
(0.019)
(0.068)
5.927***
8.991***
(0.286)
(1.754)
6413
6413
44
Inkjet
Laser
Inkjet
Laser
Inkjet
Laser
Inkjet
Laser
Inkjet
Laser
Inkjet
Laser
Inkjet
Laser
Inkjet
Laser
Inkjet
Laser
Apple Color StyleWriter 2400
Brother HL-631
Canon BJ-200ex
Epson EPL-5500
HP Deskjet 1600C
Xerox 4505
Apple Color StyleWriter 2400
Brother HL-631
Canon BJ-200ex
Epson EPL-5500
HP Deskjet 1600C
Xerox 4505
Apple Color StyleWriter 2400
Brother HL-631
Canon BJ-200ex
Epson EPL-5500
HP Deskjet 1600C
Xerox 4505
-7.4814
0.0176
0.1915
0.0154
0.1995
0.0163
-8.7649
0.0205
0.2259
0.0205
0.2259
0.0205
-2.5009
0.0205
0.0205
0.0205
0.0205
0.0205
Apple Color
StyleWriter 2400
0.0123
-8.4220
0.0113
0.1098
0.0118
0.1163
0.0143
-8.8709
0.0143
0.1199
0.0143
0.1199
0.0143
-2.5071
0.0143
0.0143
0.0143
0.0143
Brother
HL-631
Canon
Epson
BJ-200ex EPL-5500
MNL Model
0.0332
0.0237
0.0332
0.0237
-2.4882
0.0237
0.0332
-2.4977
0.0332
0.0237
0.0332
0.0237
NL Model
0.3665
0.0237
0.0332
0.1984
-8.6243
0.0237
0.0332
-8.7924
0.3665
0.0237
0.0332
0.1984
NRCL Model
0.3108
0.0179
0.0263
0.1825
-6.8084
0.0165
0.0231
-7.3321
0.2988
0.0172
0.0244
0.1694
0.0618
0.0052
0.0570
0.0046
-7.3328
0.0049
0.0699
0.0063
0.0699
0.0063
-8.9208
0.0063
0.0063
0.0063
0.0063
0.0063
-2.5150
0.0063
HP Deskjet
1600C
0.0251
0.2558
0.0232
0.2242
0.0241
-7.6992
0.0314
0.2627
0.0314
0.2627
0.0314
-8.7281
0.0314
0.0314
0.0314
0.0314
0.0314
-2.4900
Xerox
4505
Notes: These cross-price elasticities are calculated using the formulas discussed in Section 3. Each entry is the percentage change in
quantity sold of the row good in response to a percentage change in price of the column good.
Group
Table 4: A Sample of Own- and Cross-Price Elasticities
Product
Table 5: Price Index Results, 1996 vs. 2005
Model
CES
NCES
MNL
NL
NRCL
NRCL Small
NRCL Large
Index
0.061
[0.056, 0.065]
0.160
[0.148, 0.178]
0.035
[0.028, 0.040]
0.110
[0.094, 0.123]
0.100
[0.090, 0.111]
0.051
[0.038, 0.062]
0.122
[0.105, 0.140]
Notes: These price indices are calculated at the quarterly level (comparing the first quarter of 2005 to the
first quarter of 1996, for example), and then averaged
over all four quarters. Bootstrapped 95 percent confidence intervals are in brackets.
Table 6: Price Index Decompositions, 1996 vs. 2005
Model
CES
Common Goods
0.137
NCES
0.201
Model
MNL
Price
0.386
[0.376, 0.387]
0.411
[0.398, 0.413]
0.374
[0.356, 0.380]
NL
NRCL
Changing Goods
0.446
[0.408, 0.475]
0.800
[0.743, 0.888]
Quality
Variety
0.376
0.247
[0.347, 0.413]
[0.210, 0.275]
0.352
0.757
[0.326, 0.388]
[0.646, 0.863]
0.352
0.783
[0.319, 0.366]
[0.714, 0.852]
Notes: These decompositions are calculated at the quarterly level
(comparing the first quarter of 2005 to the first quarter of 1996, for
example), and then averaged over all four quarters. Bootstrapped 95
percent confidence intervals are in brackets.
45