Paper

JOURNAL OF APPLIED ECONOMETRICS
J. Appl. Econ. (2010)
Published online in Wiley Online Library
(wileyonlinelibrary.com) DOI: 10.1002/jae.1231
‘DUAL’ GRAVITY: USING SPATIAL ECONOMETRICS
TO CONTROL FOR MULTILATERAL RESISTANCE
KRISTIAN BEHRENS,a – c CEM ERTURd AND WILFRIED KOCHa,b *
a
Département des Sciences Economiques, Université du Québec à Montréal (UQAM), Canada
b CIRPÉE ESG-UQAM, Montréal, Canada
c CEPR, London, UK
d LEO, Faculté de Droit d’Economie et de Gestion, Université d’Orléans, France
SUMMARY
We derive a quantity-based structural gravity equation system in which both trade flows and error terms
are cross-sectionally correlated. This system can be estimated using techniques borrowed from the spatial
econometrics literature. To illustrate our methodology, we apply it to a well-known Canada–US trade dataset.
We find that border effects between the USA and Canada are smaller than suggested by previous studies:
about 7.5 for Canadian provinces and about 1.3 for US states. Hence controlling directly for cross-sectional
interdependence among both trade flows and error terms reduces measured border effects by capturing
‘multilateral resistance’. Copyright  2010 John Wiley & Sons, Ltd.
Supporting information may be found in the online version of this article.
1. INTRODUCTION
The gravity equation provides a remarkable empirical framework for analyzing bilateral trade flows.
Having been derived from various trade models under a wide range of modeling assumptions
(e.g., Anderson, 1979; Eaton and Kortum, 2002; Anderson and van Wincoop, 2003; Feenstra,
2004; Helpman et al., 2008; Chaney, 2008; Melitz and Ottaviano, 2008; Behrens et al., 2009), it
is nowadays firmly rooted in mainstream economic theory and has, as such, become an essential
part of every applied trade theorist’s toolbox. Despite its wide applicability and excellent fit, the
gravity equation suffers from several well-known and from several less well-known shortcomings.
The former category comprises mainly empirical issues, such as the treatment of zero trade flows,
the construction of own absorption, the measurement of internal distances, and concerns about
the plausibility of various parameter estimates. These problems have been extensively discussed
elsewhere in the literature (see Anderson and van Wincoop, 2004, pp. 729–733, for a recent
overview). This paper focuses instead on the less well-known problem of how to take into account
the interdependence between trade flows when estimating what is inherently a general equilibrium
system.
Anderson and van Wincoop (2003) have argued that dealing with the regional interaction
structure is important when estimating gravity equation systems. They show that the inclusion of
multilateral resistance terms, i.e., terms which capture the fact that bilateral trade flows do not only
depend on bilateral trade barriers but also on trade barriers across all trading partners, is crucial for
the results one obtains. In other words, bilateral predictions do not readily extend to a multilateral
world because of complex interactions linking all the trading partners. Although such a finding
is hardly surprising in a general equilibrium setting, it has been largely neglected until now in
applied work. Interdependence has, however, to be somehow controlled for in the gravity equation
Ł Correspondence to: Wilfried Koch, Département des Sciences Economiques, Université du Québec à Montréal, Case
postale 8888, Succursale Centre-Ville Montréal, Québec H3C 3P8, Canada. E-mail: [email protected]
Copyright  2010 John Wiley & Sons, Ltd.
K. BEHRENS, C. ERTUR AND W. KOCH
to obtain consistent estimates. Some previous studies aim at doing so by including ad hoc regional
remoteness indices, even though those indices have no theoretical foundation. Other studies try
to capture interdependence across trade flows by including origin- and destination-specific fixed
effects. The somewhat worrisome feature of both of these approaches is the underlying implicit
assumption that trade flows between two trading partners are independent of what happens to the
rest of the trading world. This is clearly a strong assumption that is unlikely to hold and therefore
may bias estimates.
Our paper is a methodological contribution to the rapidly expanding literature on the theorybased estimation of gravity equation systems. Building upon the observation that the structural
estimation of such systems usually hinges on the correct treatment of unobservable prices, our
modeling strategy consists in taking a ‘dual’ approach that relies on observable trade flows only.
More concretely, we derive a gravity equation from the quantity-based version of the constant
elasticity of substitution (CES) model by exploiting the property that the price indices in that
model are themselves implicit functions of trade flows. The resulting equilibrium system allows
us to recover, quite naturally, an econometric specification in which bilateral trade flows between
two regions depend on trade flows involving all the other trading partners. Put differently, our
econometric specification includes cross-sectional correlations among trade flows as structurally
implied by our theoretical model. This interdependence, which reflects complex interactions
between all trading partners, can be dealt with in a manner similar to that used in the spatial
econometrics literature to deal with simultaneous spatial autoregressive structures.1 Controlling
for cross-section correlations with the help of spatial econometric techniques then amounts to
controlling for ‘multilateral resistance’ and yields improved estimates of the gravity equation.2
Unlike in time series analysis, there is no natural ordering of cross-sectional observations. The
fundamental tool used in spatial econometrics to impose a ‘relevant ordering’ is the so-called spatial
weight matrix, which connects each observation to a set of ‘neighboring observations’ by means
of an exogenous pattern. Various spatial weight matrices based on some form of geographical
connectivity (contiguity, nearest neighbors, geographical distance) have been used to study a wide
range of topics including growth and convergence (Ertur and Koch, 2007), spatial patterns of
foreign direct investment (Blonigen et al., 2007), retail price competition (Pinkse et al., 2002),
and strategic interactions between local governments (Case et al., 1993; Brueckner, 1998). To
the best of our knowledge there have been, until now, no applications to trade and the gravity
equation, which may be due to the fact that origin–destination based models have not yet been
much developed in spatial econometrics with the exception of LeSage and Pace (2008), who model
spatial correlations among migration flows between nearby origin–destination dyads. Connectivity
across observations has been usually interpreted in a narrow sense as geographical proximity,
although Anselin (2006, p. 910) acknowledges the interest to consider more abstract spaces such
as those arising in sociology or in political science. What really matters, in the end, is to identify
the appropriate space and the most relevant associated interaction measure. Akerlof (1997), for
instance, points out that countries may be viewed as localized in some general socio-economic and
institutional or political space defined by a range of factors. Determining how countries relate to
each other then requires the choosing of a suitable metric. Conley and Ligon (2002), for example,
1 See Anselin and Bera (1998) for an overview and the recent book by LeSage and Pace (2009) for an up-to-date and
comprehensive presentation of the state of the art in spatial econometrics. The asymptotic properties of some estimators
have been derived by Kelejian and Prucha (1998) and Lee (2004).
2 More recently, Pesaran and Tosetti (2010) and Chudik et al. (2010) propose an alternative approach to modeling crosssection dependence in a panel setting. They show that under some specific assumptions like granularity, some spatial
econometric models can be interpreted as particular cases of weak cross-sectionally dependent (CWD) processes. They
can thus be represented by factor processes with an infinite number of weak factors and no idiosyncratic error.
Copyright  2010 John Wiley & Sons, Ltd.
J. Appl. Econ. (2010)
DOI: 10.1002/jae
DUAL GRAVITY
define an economic distance based on transport costs, whereas Conley and Topa (2002) define a
socio-economic distance based on social networks.
In this paper, we pursue this latter approach which aims at extending the methodology to deal
with any kind of interactions between observations in any network structure on the basis of a
similarity measure. To this end, we adopt a much broader definition of the spatial weight matrix
that captures those complex interaction structures, and propose to call it the interaction matrix.
We use techniques borrowed from the spatial econometrics literature to estimate our theory-based
gravity equation system characterized by cross-sectional correlation of trade flows. The interaction
coefficients we obtain, by analogy to the spatial autocorrelation coefficients, capture the strength
of cross-sectional correlation resulting from the interaction structure imbedded in the interaction
matrix. Furthermore, in contrast to the spatial econometrics literature, the interaction structure
arises from our theoretical model and is not chosen in an ad hoc way. We believe that this is an
important step forward since in the absence of a theoretical model the lack of a natural ordering
in cross-sections implies that many different interaction matrices may be used to study the same
issue. It is then difficult to identify the most ‘relevant’ one, thereby requiring extensive sensitivity
analysis and leaving the door open to arbitrariness.
On top of controlling directly for cross-sectional interdependence, our approach has several
additional advantages. First, it does not enforce a strict pattern of interdependence as, for example,
the model by Anderson and van Wincoop (2003) does. Our approach is therefore more robust to
potential misspecification concerning the form of the interdependence. Second, it reveals that all
coefficients, including the interaction coefficients for both trade flows and the error terms, are
generally region specific. Hence the proper estimation of the model requires the use of local
techniques that can deal with parameter heterogeneity across observations. Last, we model more
carefully the error structure, thereby also controlling for cross-sectional correlations that may arise
in the error terms.
To illustrate the usefulness of our approach, we apply it to the well-known Canada–US dataset
used by, among others, Anderson and van Wincoop (2003) and Feenstra (2004) to assess the
magnitude of Canada–US border effects. We first estimate a model where all coefficients are
constrained to be identical across regions (‘homogeneous coefficient case’). Doing so simplifies
the econometric implementation and yields results that are comparable to those in the literature.
We then provide estimates for a model with region-specific coefficients, except for the interaction
coefficients which are country specific (‘heterogeneous coefficient case’). Our key results may
be summarized as follows. First, we show that there remains cross-sectional correlation in the
ordinary least squares (OLS) residuals of the gravity equation, given our theoretical interaction
structure, even when including origin- and destination-specific fixed effects. Put differently, OLS
estimates are at best inefficient and at worst inefficient and biased, because the fixed effects fail
to capture the cross-sectional interdependence among trade flows. This finding vindicates the use
of techniques borrowed from spatial econometrics, as well as a more careful modeling of the
error structure. Second, we estimate the homogeneous coefficient specification of the model and
show that, as predicted by theory, there is significant negative cross-sectional correlation between
trade flows. Controlling for this correlation reduces measured border effects between the USA
and Canada, which drop to about 7.5 for Canadian provinces and 1.3 for US states. Last, we
provide results for the heterogeneous coefficient specification of the model under the restriction of
country-specific interaction parameters. Our estimates reveal significant variations in both distance
elasticities and border effects across provinces and states. While border effects for almost all US
states are small and statistically insignificant, those for Canadian provinces are generally larger
and almost all statistically significant.
The remainder of the paper is organized as follows. Section 2 presents the model and derives
our gravity equation system. Section 3 derives the estimating equation from the theoretical model.
Copyright  2010 John Wiley & Sons, Ltd.
J. Appl. Econ. (2010)
DOI: 10.1002/jae
K. BEHRENS, C. ERTUR AND W. KOCH
Section 4 presents the data, summarizes briefly previous estimation methods, and discusses our
empirical results for both the homogeneous and the heterogeneous coefficient cases. Section
5 concludes. All technical developments are relegated to a web appendix available online as
supporting information.
2. A ‘DUAL’ GRAVITY MODEL
We begin by presenting a novel way of deriving a system of gravity equations that does not
depend on unobservable price indices, yet encapsulates the general equilibrium interdependencies
of the full trading system. Contrary to Anderson and van Wincoop (2003), who derive gravity
equations from a CES expenditure system with goods that are differentiated by region of origin
and the supply of which is fixed, we rely on a CES monopolistic competition model with free
entry. Using the inverse demand functions and exploiting the fact that price indices depend on
trade flows, we derive an implicit equation system that depends on observables only and that can
be estimated using techniques borrowed from the spatial econometrics literature. In a nutshell,
whereas previous research either derives a gravity equation system with nonlinear constraints in
unobservable price indices (Anderson and van Wincoop, 2003) or uses a linearization with respect
to prices (Baier and Bergstrand, 2009), we derive an ‘unconstrained’ linearized gravity equation
with correlated trade flows.
2.1. Preferences
Consider an economy with n regions. Each region i is endowed with Li workers/consumers, who
each supply inelastically one unit of labor. Labor is the only production factor, so that Li stands
for both the size of, and the aggregate labor supply in, region i. All consumers have identical CES
preferences over a continuum of horizontally differentiated product varieties. A representative
consumer in region j solves the following problem:
1
max Uj qij v
dv subject to
qij vpij vdv D yj
i
i
i
i
where > 1 denotes the constant elasticity of substitution between any two varieties; yj stands
for individual income in region j; pij v and qij v denote the consumer (i.e., the delivered) price
and the per capita consumption of variety v produced in region i and sold in region j; and where
i denotes the set of varieties produced in region i. Varieties produced in the same region are
assumed to be symmetric, which allows us to alleviate notation by dropping the variety index v.
Let mk stand for the measure of k (i.e., the mass of varieties produced in region k). It is readily
verified that the aggregate inverse demand functions for each variety are then given by
1/
pij D Qij
11/
mk Qkj
Yj
1
k
where Qij Lj qij and Yj Lj yj denote the aggregate demand in region j for a variety produced
in region i and the aggregate income in region j, respectively.
2.2. Technology
Each firm produces a single product variety. Thus there is a one-to-one correspondence between
varieties and firms, i.e., mk also stands for the mass of firms operating in region k. To produce q
Copyright  2010 John Wiley & Sons, Ltd.
J. Appl. Econ. (2010)
DOI: 10.1002/jae
DUAL GRAVITY
units of output requires cq C F units of labor, where c is the marginal and F is the fixed input
requirement.3 Shipping varieties both within and across regions is costly. More precisely, shipping
one unit of any variety between regions j and k requires dispatching jk > 1 units from the region
of origin, while the rest ‘melts away’ in transportation (the so-called ‘iceberg’ assumption). It is
worth pointing out that we need not make a priori any assumption on either the value of trade
costs ii within regions, nor on the symmetry of trade costs ij across regions.
A firm located in region j maximizes its profit, given by
j D
k
pjk cwj jk Qjk Fwj
with respect to the quantities Qjk and subject to the inverse demand schedule (1). Because price
and quantity competition are equivalent when there is a continuum of firms, the profit-maximizing
prices display a constant markup over marginal cost: pjk D jk pj , where pj cwj / 1
stands for the producer (i.e., the mill) price in region j. Free entry and exit drive profits to zero,
which implies that each firm must produce the break-even quantity
k
jk Qjk D
F 1
Q
c
2
irrespective of the region j it is located in.4
2.3. Equilibrium
To derive the gravity equation system requires determination of the value of trade flows from i to
j. The latter is given by Xij mi pij Qij which, using (1), can be expressed as follows:
11/
Qij
Y
Xij D mi 11/ j
mk Qkj
3
k
Aggregate income
constraints, the equilibrium prices, and the zero profit condition (2) then
imply that Yi D k mi pik Qik D mi pi Q. Solving for mi D Yi /pi Q and substituting the result
into (3) allows us to eliminate the unobservable mass of firms to obtain
11/
Qij
Xij D Yi Yj pi
11/
Yk Qkj
pk
k
4
By definition of the trade flows Xij and the mass of firms mi , it must be that
Qij D
Xij
Xij Q
D
mi pij
Yi ij
5
3 All firms are identical in our model. For gravity equations with heterogeneous firms see Chaney (2008), Helpman et al.
(2008), Melitz and Ottaviano (2008) and Behrens et al. (2009).
4 Strictly speaking, this condition only holds when all regions have at least some firms (interior equilibrium). In what
follows, we focus exclusively on such interior equilibria as they are the empirically relevant ones for our subsequent
analysis.
Copyright  2010 John Wiley & Sons, Ltd.
J. Appl. Econ. (2010)
DOI: 10.1002/jae
K. BEHRENS, C. ERTUR AND W. KOCH
Plugging (5) into (4) and simplifying then yields
11/
11/
Xij Q
1/1 Xij
ij
Yi ij
Yi
Xij D Yi Yj
11/ D Yj Lk 1/1 Xkj 11/
pi
Xkj Q
Yk
Li kj
Yk
pk
Yk kj
k
k
6
where we have used the equilibrium relationship pi /pk D wi /wk and the aggregate income
constraint wi D Yi /Li . Expression (6) can be rewritten as follows:
Lk kj Yk 1/1 11/
Xij D Yj
Xkj
8i, j
7
k Li
ij Yi
which is a system of equations capturing the interdependence of all trade flows towards region j.
To close the general equilibrium system, we impose the aggregate income constraints
Yi Xik D 0, 8i
8
k
As can be seen from expressions (7) and (8), all trade flows Xij (including own absorption
Xii ) are linked in equilibrium, both directly (since varieties are substitutes) and indirectly (via the
aggregate income constraints). Formally, we can think about such a system as being a directed
graph, where the Xij are the flows between regions (the ‘nodes’) along trading routes (the ‘edges’),
and where the Yi play the role of flow conservation constraints.
3. ECONOMETRIC SPECIFICATION
We now propose an econometric method for estimating the gravity equation system (7). This
method builds on the foregoing observation that trade flows are interdependent and that this
interdependence needs to be somehow taken into account. Our approach draws quite naturally
on spatial econometric techniques, which are precisely designed to deal with cross-sectional
interdependence in both variables and error terms. When compared to other estimation methods,
we believe that ours offers four advantages:
1. It directly deals with cross-sectional interdependence among trade flows, as implied by our
theoretical model.
2. It uses a more careful modeling of the error structure, thereby controlling for possible crosssectional correlations in the error terms.
3. It reveals that all coefficients, including the distance elasticities and border effects, are generally
region-specific (see Anderson and Smith, 1999; Helpman et al., 2008) and allows for statistical
inference on estimated border effects and distance elasticities.
4. it allows us to sidestep the thorny issue of how to choose by working with a linearized version
of the equilibrium system. As is known in the literature, estimation results for depend both
on the level of aggregation and the estimation method and vary widely from about 1.3 to over
22 (Hummels, 2001; Broda and Weinstein, 2006).
We now linearize the model of Section 2, derive an econometric specification and discuss in
more detail the error structure.
Copyright  2010 John Wiley & Sons, Ltd.
J. Appl. Econ. (2010)
DOI: 10.1002/jae
DUAL GRAVITY
3.1. Linearization and Matrix Form
Taking the logarithm of (7), we readily obtain:


1
Lk kj Yk 1 1 1
Xkj  f
ln Xij D ln Yj ln 
k Li
ij Yi
9
Clearly, there is interdependence across trade flows as Xij depends negatively on the nominal
sales of the other regions in market j. To obtain a specification that is amenable to estimation with
spatial econometric techniques, we linearize f around D 1. As shown in supporting information
Appendix A, doing so yields the following equation:
Lk
Lk
ln Zij D ln L 1 ln ij ln kj ln wi 1
ln Zkj
k L
k L
10
where Zij Xij /Yi Yj is a GDP-standardized
trade
flow
(but
which
we
will
refer
to
as
trade
flow for short); and where L k Lk denotes the total population. Expression (10) reveals the
essence of interdependence in the gravity equation system: the trade flow Xij from region i to
region j also depends on all the trade flows from the other regions k to region j. Several comments
are in order. First, trade flows from i to j are affected by relative trade barriers, as measured by the
deviation of bilateral trade barriers ij from the population weighted average (second term). Put
differently, relative accessibility matters. Second, trade flows from i to j are negatively affected by
wages wi in the origin region (third term). Higher wages raise production costs and make region
i’s firms less competitive in market j, thereby reducing trade flows. Last, trade flows from i to j
decrease with trade flows Zkj from any third region k into the destination market, because varieties
are substitutes. This effect is stronger the closer substitutes the varieties are (i.e., the larger the
value of ).5 In our estimations, interdependence will be captured by an autoregressive interaction
coefficient, and this coefficient can be seen as a measure of ‘spatial competition’ encapsulating
both aspects related to market power and consumer preference for diversity (via the parameter ).
To make notation more compact, we recast (10) into matrix form as follows:
Z D ln L1I 1I W w 1WZ
11
In expression (11), we have defined Z lnXij /Yi Yj as the n2 ð 1 vector of the logarithms
of trade flows; 1I as the n2 ð 1 vector whose components are all equal to 1; I as the n2 ð n2 identity
matrix; W as the n2 ð n2 matrix whose contents will be made precise below; t ln ij as the
n2 ð 1 vector of the logarithms of trade costs, whose contents will also be made precise below;
and w ln wi as the n2 ð 1 vector of the logarithms of origin wages. Some simple algebraic
manipulations show that W D [S diagL] In , where S is the n ð n matrix whose elements are
all equal to 1; denotes the Kronecker (tensor) product; diag(L) is defined as the n ð n diagonal
matrix of the Lk /L terms; and In is the n ð n identity matrix.
Turning to the functional form of trade costs, we follow standard practice by assuming that ij
is a log-linear function of distance and border effects that can be expressed as follows:
ij dij ebij
12
5 When approaches 1, the linear approximation of the model improves but the autoregressive interaction term disappears,
whereas the approximation gets worse despite more interdependence when is large. When gets very large, trade flows
fall to zero, which corresponds to an extreme form of interdependence where trade frictions almost completely inhibit
interregional trade.
Copyright  2010 John Wiley & Sons, Ltd.
J. Appl. Econ. (2010)
DOI: 10.1002/jae
K. BEHRENS, C. ERTUR AND W. KOCH
where dij denotes the distance between regions i and j; and where bij is a dummy variable
taking the value 1 if the flow Xij crosses an international border (the Canada–US border in our
application below), and 0 otherwise. Taking logarithms of (12), we can rewrite the trade cost
specification in matrix form as follows:
D d C b
13
where d ln dij is the n2 ð 1 vector of the logarithms of distance; and where b is the n2 ð 1
vector of dummy variables indicating cross-border flows. We then have
C
D I W D I Wd C b D d
b
14
where variables superscripted with a tilde are measured as deviations from their population
weighted averages. We stick to this notation in the remainder of the paper to ease the exposition.
Substituting (14) into (11) then yields the following estimating equation:
C ˇ2 w C b
C WZ
Z D ˇ0 1I C ˇ1 d
15
where ˇ0 ln L < 0 is the constant term; ˇ1 1 < 0 is the distance coefficient of
deviations from population weighted average distances (which, because of the implicit structure
of the model, differs from the true distance elasticity); and where ˇ2 < 0 is the coefficient
for the wage in the origin region. Turning to the impacts of crossing an international border, i.e.,
the border effects, their coefficient is given by 1 < 0. How to precisely compute and
decompose the border effects into intra- and international components is explained in more detail
in Section 5. Finally, the autoregressive interaction coefficient 1 < 0, which captures
the interdependence across trade flows, is smaller the closer substitutes the varieties are. Hence provides an intuitive measure of ‘spatial competition’.
3.2. Econometric Specification and Error Structure
To obtain a specification that can be estimated by techniques borrowed from the spatial econometrics literature requires rewriting the implicit equation (15) in explicit form, i.e., to move all
the ln Xij terms to the left-hand side which then only depends on the other flows. Furthermore,
we have to introduce the error structure into the model.
Let Wdiag diagL In denote the matrix containing only the diagonal elements of W, each
repeated n times by block. Equation (15) can then be rewritten as follows:
C ˇ2 w C b
C W Wdiag Z
I Wdiag Z D ˇ0 1I C ˇ1 d
Because I Wdiag is an invertible diagonal matrix provided that 1 Li /L 6D 0 for all i, we
can pre-multiply it by its inverse to obtain the following expression:
C ˇ w C b
C Wd Z
Z D ˇ0 1I C ˇ1 d
2
16
where Wd D W Wdiag is our theory-based interaction matrix. It is worth pointing out that Wd
is not row-normalized (i.e., the rows do not sum to one). Note also that Wd does not contain any
form of geographic connectivity, in contrast to the traditional practice in the spatial econometrics
literature. The similarity measure that flows from our theoretical model is based on the relative
size of regions as reflected by population shares.
Copyright  2010 John Wiley & Sons, Ltd.
J. Appl. Econ. (2010)
DOI: 10.1002/jae
DUAL GRAVITY
The n elements between positions ni 1 C 1 and ni for i D 1, . . . , n of I Wdiag 1 ,
1
which are given by 1 C 1 LLi
, depend on the origin index i only, which is fixed and
identical for all destinations. Observe that in expression (16) the components of the transformed
(overlined) matrices of coefficients are given by
ˇ1i D [1 Li /L]1 < 0,
i D [1 Li /L]1 < 0,
ˇ2i D [1 Li /L]1 < 0,
i D [1 Li /L]1 < 0
17
We have a specification with a distinct set of parameters for each region. The full model therefore
has a ‘club’ structure since all parameters (including the autoregressive ones) should ideally be
estimated separately (‘locally’) for each region. Quite naturally, we refer to the model (16) as the
heterogeneous coefficients model. Since it is econometrically complex to handle, we first estimate
a simpler benchmark in which we constrain all coefficients to be identical across regions, which
we refer to as the homogeneous coefficients model. Formally, constraining the coefficients to be
identical amounts to assuming that the diagonal of W is equal to zero in equation (15), i.e., that
W D Wd . In that case, the model simplifies substantially and can be written as
C ˇ2 w C b
C Wd Z
Z D ˇ0 1I C ˇ1 d
18
where the different coefficients are now identical across regions.
To estimate equation (16) or (18), we need to spell out the error structure underlying the model.
Though fundamental to the analysis, this modeling aspect has received only little attention. This
is quite surprising because when the error terms are introduced into the econometric specification
via the trade costs ij or the trade flows Zij , as usually done in the literature, one must take
into account the fact that ‘the multilateral resistance variables also depend on these error terms’
(Anderson and van Wincoop, 2004, p. 713). The same holds true for the border effects, since
these effects in any region depend in a complex way on a spatially weighted average of the effects
in all the other regions. Consequently, the error terms will exhibit some form of cross-sectional
correlation that has to be dealt with. To the best of our knowledge, this point has largely gone
unnoticed until now in the gravity literature. Although ‘errors can enter the model in many [. . .]
ways of course, about which the theory has little to say’ (Anderson and van Wincoop, 2003,
p. 180), it is likely that the exact way the error terms are introduced into the model is crucial to
the estimates one obtains.
There are several ways to introduce the error terms into the model. First, we could assume that
they arise from the imperfect measurement of trade costs, i.e., ij D dij ebij eεij , where εij is an
i.i.d. normal error term. Second, the error terms could arise in the trade flows Zij themselves.
obs εij
stand for the unobserved ‘real’ trade flow, where Zobs
Let Zreal
ij Zij e
ij denotes the observed
trade flow and where εij is again an i.i.d. normal error term. The resulting error structure that
would, for example, appear in expression (18) for the homogeneous coefficient model under both
assumptions can then be expressed as follows:
u D ε C Wd ε
19
where the coefficient must be estimated from the model. Hence the error terms uij are
autocorrelated under the form of a first-order moving average across interacting regions. Our
econometric specification corresponds therefore to the so-called SARMA model in the spatial
econometric literature (Huang, 1984). Although moving averages are quite common in structural
Copyright  2010 John Wiley & Sons, Ltd.
J. Appl. Econ. (2010)
DOI: 10.1002/jae
K. BEHRENS, C. ERTUR AND W. KOCH
econometric models, especially in time series, they are less so in spatial econometrics. An
explanation for this fact could be the clear lack of structural models.
4. EMPIRICAL IMPLEMENTATION
We apply our methodology to the well-known Canada–US dataset used, among others, by
Anderson and van Wincoop (2003) and Feenstra (2004). We begin by briefly reviewing the
data, previous estimation methods and results. We then estimate the OLS benchmark, both
with and without importer–exporter fixed effects and show that the OLS residuals remain
autocorrelated. This finding vindicates the use of techniques similar to those used in spatial
econometrics for estimating such equations because OLS estimators are at best inefficient and at
worst inefficient and biased. Second, we estimate, using maximum likelihood, our preferred theorybased specification, namely the SARMA model, under the assumption of homogeneous coefficients
(given by equations (18) and (19)). We also run a series of robustness checks by estimating the
model using alternative error structures which are commonly used in spatial econometrics. As will
become clear, the empirical results clearly back the theoretical specification. Last, we estimate,
again using maximum likelihood, the SARMA model in its more general form with heterogeneous
coefficients (given by equations (16) and (19)). Since estimating the fully heterogeneous model
requires estimating as many autoregressive interaction coefficients for the endogenous variable
and the error terms as there are regions, we restrict ourselves to the case where we estimate only
country-specific autoregressive interaction coefficients, whereas all other coefficients are allowed
to vary across regions.6
4.1. Data and Controls
A description of the dataset and the data sources is provided in supporting information Appendix
B. Our dataset features aggregate manufacturing exports Xij between regions, regional GDPs Yi ,
internal absorption Xii (all measured in million US dollars for the year 1993), and great circle
distances dij (measured in kilometers) between regional and provincial capitals for 30 US states
and 10 Canadian provinces. The full list of regions is provided in Table IV.
Estimating the model raises three different sets of issues. First, unlike other gravity models,
which disregard own absorption Xii in the estimation, we require a measure of internal trade costs
because we have to take into account the full interdependence
structure. Following Redding and
p
Venables (2004), we measure internal trade costs as ii surfacei /, where surfacei denotes the
region’s surface (measured in square kilometers). As estimation results are known to be somewhat
sensitive to the measure of internal distance (see, for example, Head and Mayer, 2002) we use
both values of 1/3 and 2/3 for as robustness checks.
Second, we require data on regional wages wi , which we measure using average hourly
manufacturing wages, and regional populations Li . These two regressors may be potentially
endogenous. For example, some unobserved shock to production in a region i may lead to an
increase in trade flows from region i to region j, triggering at the same time an inflow of population
and wage increases as labor demand rises. To control for the potential endogeneity of wages and
populations, we use 5-year lagged values for these two variables.7
6 To the best of our knowledge, ours are the first estimates of SARMA models with heterogeneous autoregressive
coefficients. Technical details, the Matlab codes, and the data in both .xls and .dta format are available online as supporting
information.
7 Both 1988 wages and populations are highly correlated with 1993 wages and populations. We experimented with different
lags up to 10 years, and our qualitative results (including the magnitudes of the border effects) are unaffected. Results
are also robust when using the 1993 values for wages and populations.
Copyright  2010 John Wiley & Sons, Ltd.
J. Appl. Econ. (2010)
DOI: 10.1002/jae
DUAL GRAVITY
Last, since our estimation method requires all bilateral trade flows across all pairs of regions,
to account for their interdependence, we further have to deal with the well-known problem of
zero trade flows. Indeed, there are 49 zero observations out of 1600 in our dataset, which requires
an appropriate treatment. Since there is no generally agreed upon method for doing so (see, for
example, Anderson and van Wincoop, 2004; Disdier and Head, 2008), we proceed as follows.
First, we augment the trade flows by adding 1, such that that their log is equal to zero. Second,
we control for this adjustment by including a zero-flow dummy variable in the regression, which
takes value 1 if the log-flow is zero and which takes value 0 otherwise. Although this is a crude
way of controlling for zero trade flows, truncating the sample would not perform better or be
theoretically more sound. There are two justifications for our procedure. First, there are only few
zeros in the sample (about 3%, against more than 50% for studies using international trade data).
Hence the data are not dominated by the occurence of zero values and the need for a corrective
procedures is thus less stringent. Second, our zeros are unlikely to be ‘true zeros’ as this would
entail no aggregate manufacturing trade between several US states. While the occurrence of zero
flows is indeed more likely for regions that are further away from each other or for which the
partner GDP is smaller, it seems highly unlikely that there is no aggregate trade between, for
example, Ohio and Maine or Arizona and Texas, as reported in our data.8 We may thus consider
that some flows are just not reported, and the zero-flow dummy controls for these outliers.
4.2. Previous Estimation Methods
We now briefly review previous estimation methods. The first one is based upon the strong
assumption that trade flows are independent: estimating the determinants of Xij can then be done
without taking into account any information provided by Xkl . McCallum (1995), among others,
makes this assumption to estimate by OLS the following empirical gravity equation for Canada–US
interregional trade:
ln Xij D ˛1 C ˛2 ln Yi C ˛3 ln Yj C ˛4 ln dij C ˛5 bij C εij
20
Doing so yields paradoxically large values for the border coefficient ˛5 , ranging from 3.07 to
3.30 (McCallum, 1995). Consequently, Canadian provinces seem to trade 21.5 to 27 times more
with themselves than with US states of equal size and distance, an unrealistically large value for
two well-integrated and culturally similar countries.
In columns 1–3 of Table I, we replicate McCallum-type OLS regressions of the form (20) as
our benchmark. To stay as close as possible to the original analysis, we define the border effects as
in Anderson and van Wincoop (2003). Hence we introduce two sets of dummy variables, bordCAij
and bordUSij , for Canada–US and US–Canada flows, respectively. The implied border effects
can be retrieved as the exponential of minus the coefficient of bordCAij and bordUSij . As can be
seen from Table I, all coefficients have the correct sign, reasonable magnitudes, and are precisely
estimated. Results for the distance elasticity are somewhat sensitive to the definition of internal
distance. As can be further seen from Table I, the magnitude of the border effects for Canadian
provinces ranges from about 14.5 to 15.4, depending on the definition of internal distance. These
estimates are in line with the McCallum-type regressions of Anderson and van Wincoop (2003,
Table I, p. 173), which obtain border effects in the range of about 15.7.9
8 In the presence of a large share of true zeros, such as in international trade data, the use of a Tobit estimator would be
required (Felbermayr and Kohler, 2006; Helpman et al., 2008).
9 The slight difference between our OLS estimates and those of Anderson and van Wincoop (2003), despite using the same
dataset, is due to: (i) inclusion of own trade flows Xii ; (ii) accounting for intra-regional distances; and (iii) controlling for
zero trade flows.
Copyright  2010 John Wiley & Sons, Ltd.
J. Appl. Econ. (2010)
DOI: 10.1002/jae
K. BEHRENS, C. ERTUR AND W. KOCH
Table I. OLS regressions
Model
OLS–McCallum
OLS–Feenstra
ln Xij
ln Zij
Dependent variable
Constant
ln Yi
ln Yj
ln dij
bordCAij
bordUSij
bordij
Fixed effects
Internal distance
Adjusted R2
Border effects
Canada
USA
‘Average border’
Moran’s I stat.
8.830a
(0.526)
1.045a
(0.028)
0.920a
(0.024)
1.172a
(0.031)
2.674a
(0.122)
0.393a
(0.056)
—
8.387a
(0.561)
1.044a
(0.028)
0.919a
(0.024)
1.228a
(0.036)
2.734a
(0.129)
0.397a
(0.057)
—
8.888a
(0.548)
1.038a
(0.028)
0.913a
(0.024)
1.144a
(0.033)
2.655a
(0.125)
0.412a
(0.056)
—
No
1 surfi
3
0.937
No
2 surfi
3
0.935
No
1 min fd g
j ij
4
0.936
14.496a
(1.773)
1.482a
(0.083)
4.634a
(0.349)
0.039a
(0.007)
15.401a
(1.979)
1.487a
(0.084)
4.786a
(0.371)
0.044a
(0.007)
14.219a
(1.782)
1.510a
(0.085)
4.633a
(0.355)
0.036a
(0.007)
7.594a
(0.250)
—
6.906a
(0.292)
—
7.852a
(0.282)
—
—
—
—
1.289a
(0.032)
—
1.385a
(0.038)
—
1.248a
(0.037)
—
—
—
—
1.482a
(0.069)
Yes
1 surfi
3
0.921
1.504a
(0.071)
Yes
2 surfi
3
0.919
1.487a
(0.071)
Yes
1
4 minj fdij g
0.920
—
—
—
—
—
—
4.404a
(0.073)
0.015a
(0.001)
4.501a
(0.075)
0.015a
(0.001)
4.422a
(0.075)
0.015a
(0.001)
Note: The first three columns correspond to the estimation of equation (20), whereas the last three columns correspond
to the estimation of equation (23). Standard errors are given in parentheses (with a D 0.01, b D 0.05, c D 0.1); those
for border effect coefficients are computed using the Delta method. The number of observations is 1600. OLS–Feenstra
include importer–exporter fixed effects. Following Feenstra (2004), average border effects are computed as the geometric
mean of the individual border effects. Moran’s I statistic is computed using our theory-based interaction matrix Wd .
However, as can also be seen from the last line of Table I, not a single OLS specification à
la McCallum passes Moran’s I test for the absence of cross-sectional correlation of the residuals
given our theory-based interaction structure (Cliff and Ord, 1981). Stated differently, there remains
a significant amount of cross-sectional correlation in the OLS residuals, which leads at best to
inefficient and at worst to both inefficient and biased estimates (with omitted variable bias because
of the missing spatially lagged variable). The presence of cross-sectional correlation suggests that
the use of appropriate econometric techniques dealing with this problem is required.
To deal with interdependence and with McCallum’s ‘border effect puzzle’, Anderson and van
Wincoop (2003) build on the ‘price version’ of the CES model presented in Section 2. Assuming
equal wages and symmetric trade costs that are a log-linear function of bilateral distance and
the existence of an international border between i and j, they derive the following instance of a
gravity equation system:
1 ln 1 C εij
ln Zij D k C a1 ln dij C a2 1 bij ln i
j
21
1 and 1 are the
where Zij are the GDP-standardized trade flows; k is a constant; and i
j
multilateral resistance terms of regions i and j, which, apart from unitary income elasticities,
Copyright  2010 John Wiley & Sons, Ltd.
J. Appl. Econ. (2010)
DOI: 10.1002/jae
DUAL GRAVITY
represent the key difference with equation (20) as estimated by McCallum. The multilateral
resistance terms are linked by a system of nonlinear equations involving all regions’ expenditure
shares k and the whole trade cost distribution:
1 D
1 ea1 ln dik Ca2 1bik 8i
k 22
i
k
k
Equations (21) and (22) reveal that the determinants of Zij cannot be consistently estimated
without taking into account the conditions prevailing in the origin and destination markets i and
j, as captured here by a simple transformation of the CES price indices. These price indices
themselves depend on the inverse demands and therefore on the different trade flows. Hence the
independence assumption underlying the McCallum-type estimates is not valid, which is likely to
bias the estimation results.
Anderson and van Wincoop (2003) estimate equation (21) using nonlinear least squares, where
the multilateral resistance terms are solved for in a first step using (22). While this procedure
accounts for interdependence, it has at least three drawbacks. First, it makes strong structural
assumptions on the form of the cross-sectional interdependence between the multilateral resistance
terms that stem directly from the CES specification. Our approach does not impose a priori such a
structure and is therefore more robust to misspecification. Second, Anderson and van Wincoop’s
method does not allow for parameter heterogeneity across regions, whereas ours does. As will be
shown later, parameter estimates widely vary across regions, and imposing common coefficients
may thus be unwarranted. Last, as the multilateral resistance terms are solved for numerically,
their statistical significance is usually not tested (though this could be done). We will provide such
tests and show that US border effects are mostly insignificant, whereas Canadian border effects
are not.
A third estimation method has been suggested by Anderson and van Wincoop (2003) and
Feenstra (2004). This method replaces the multilateral resistance terms with region-specific
importer–exporter fixed effects. In this case, (21) can be written as
j j
ln Zij D k C a1 ln dij C a2 1 bij C ˇ1i υi1 C ˇ2 υ2 C εij
23
where υi1 denotes an indicator variable that equals one if region i is the exporter, and zero otherwise;
j
and where υ2 denotes an indicator variable that equals one if region j is the importer, and zero
i and ˇj D 1 ln j then provide estimates of
otherwise. The coefficients ˇ1i D 1 ln 2
the multilateral resistance terms.
Although the fixed-effects procedure yields theoretically consistent estimates of the average
border effect in a CES model à la Anderson and van Wincoop with uncorrelated error terms (see
Feenstra, 2004), it disregards some part of the interdependence across trade flows. Hence, while
the fixed effects method has the advantage of being simple to implement, as OLS can be used
under the traditional assumptions on the error terms, its main drawback is that it does not capture
all the interactions of the model. Columns 4–6 of Table I show results for OLS fixed effects
regression. As can be seen from the last line in Table I, even after controlling for multilateral
resistance by using region-specific importer–exporter fixed effects, there remains a significant
amount of cross-sectional correlation in the OLS residuals. This finding suffices to show that fixed
effects capture heterogeneity but do not capture interdependence. In other words, although fixed
effects allow one to partly control for interdependence, they are by no means sufficient from both
a theoretical and an econometric point of view.10
10 Anderson and van Wincoop (2003, p. 180) point out that the fixed-effects estimator could be less efficient than the
nonlinear least squares estimator, which uses information on the full structure of the model. We furthermore show in this
Copyright  2010 John Wiley & Sons, Ltd.
J. Appl. Econ. (2010)
DOI: 10.1002/jae
K. BEHRENS, C. ERTUR AND W. KOCH
Table II. Homogeneous coefficients, SARMA regressions
Model
Dependent variable
18.799a
(1.004)
0.944a
ln wi
(0.176)
ij
d
1.223a
(0.033)
bij
1.127a
(0.064)
0.208a
(0.042)
0.001a
(0.0001)
1 surfi
Internal distance
3
AIC
2.342
BIC
2.325
Border effects (total) (intra)/(inter)
CA
7.353a
(1.280)
CA–CA/CA–US
2.712a /0.369a
(0.056)/(0.056)
US
1.296a
(0.019)
US–US/US–CA
1.138a /0.879a
(0.007)/(0.007)
‘Average border’
3.086a
(0.246)
Constant
SARMA
ln Zij
ln Zij (US–US flows ð 1.3)
18.495a
(1.015)
1.026a
(0.179)
1.298a
(0.037)
1.146a
(0.065)
0.202a
(0.043)
0.001a
(0.0001)
2 surfi
3
2.364
2.347
18.892a
(1.012)
0.945a
(0.177)
1.188a
(0.033)
1.133a
(0.064)
0.214a
(0.043)
0.001a
(0.0001)
1
4 minj fdij g
2.350
2.333
19.322a
(1.000)
0.735a
(0.176)
1.212a
(0.033)
1.277a
(0.064)
0.218a
(0.042)
0.001a
(0.0001)
1 surfi
3
2.357
2.341
19.053a
(1.012)
0.815a
(0.179)
1.286a
(0.036)
1.296a
(0.065)
0.214a
(0.043)
0.001a
(0.0001)
2 surfi
3
2.280
2.363
19.450a
(1.007)
0.733a
(0.177)
1.177a
(0.033)
1.282a
(0.064)
0.226a
(0.043)
0.001a
(0.0001)
1
4 minj fdij g
2.366
2.349
7.607a
(1.360)
2.758a /0.363a
(0.057)/(0.057)
1.301a
(0.020)
1.141a /0.877a
(0.007)/(0.007)
3.146a
(0.258)
7.425a
(1.304)
2.725a /0.367a
(0.057)/(0.057)
1.297a
(0.019)
1.139a /0.878a
(0.007)/(0.007)
3.104a
(0.249)
9.590a
(1.848)
3.097a /0.323a
(0.056)/(0.056)
1.341a
(0.020)
1.158a /0.864a
(0.007)/(0.007)
3.586a
(0.319)
9.912a
(1.963)
3.148a /0.318a
(0.057)/(0.057)
1.347a
(0.020)
1.161a /0.862a
(0.007)/(0.007)
3.654a
(0.334)
9.677a
(1.881)
3.111a /0.322a
(0.057)/(0.057)
1.343a
(0.020)
1.159a /0.863a
(0.007)/(0.007)
3.604a
(0.324)
Note: All six columns correspond to the estimation of the model given by equations (18) and (19). Standard errors are
given in parentheses (with a D 0.01, b D 0.05, c D 0.1); those for border effect coefficients are computed using the Delta
method. See supporting information Appendix C.1 for an explanation of how to compute the border effects. Following
Feenstra (2004), average border effects are computed as the geometric mean of the individual border effects. The number
of observations is 1600. The interaction matrix is Wd in all specifications, it is normalized by its spectral radius (see, for
example, Kelejian and Prucha, 2009) so that the normalized autoregressive and moving average coefficients are both in
the interval (1,1). AIC and BIC stand for the Akaike and the Schwarz information criteria, respectively.
4.3. Econometric Estimation: Homogeneous Coefficients
We now estimate the gravity equation system using techniques similar to those used in spatial
econometrics. First, we impose homogeneous coefficients to estimate the theory-based SARMA
specification (18) and (19). Columns 1–3 in Table II summarize our results. As can be seen, all
coefficients, including the autoregressive ones, have the correct signs, plausible magnitudes, and are
ij , measured as a deviation from the population
precisely estimated. The distance coefficient for d
weighted average, increases slightly in absolute value when compared to the OLS specification
but remains overall fairly stable. Turning to the wage term, it is worth noting that it is negative
and highly significant. Put differently, higher-origin wages reduce trade flows because of increased
production costs. Although one might a priori suspect that interregional wage differentials should
not significantly affect interregional trade flows in an integrated economic environment like North
America, where interregional wages differentials are relatively small, our results show that this
is not the case: interregional wage differentials are significant enough across North American
paper that it could also be biased. Fixed effects allow to control for heterogeneity but generally not for interdependence,
even when more complex fixed-effects specifications are used (e.g., Baltagi et al., 2003).
Copyright  2010 John Wiley & Sons, Ltd.
J. Appl. Econ. (2010)
DOI: 10.1002/jae
DUAL GRAVITY
regions to affect trade flows. One of the key empirical results in the SARMA specification is that
< 0), as
there is a significant amount of negative cross-sectional correlation among trade flows (
predicted by our model. There is also significant positive cross-sectional correlation among error
> 0), which suggests that controlling for that correlation is important.
terms (
As can further be seen from Table II, capturing the interdependence of the equilibrium system
in the SARMA model reduces the border effects with respect to the OLS estimates, but also with
respect to Anderson and van Wincoop (2003). Supporting information Appendix C.1 provides a
detailed explanation of how to compute and to decompose the border effects. In our theory-based
specification, the border effects for Canadian provinces range from about 7.3 to 7.6, whereas
those for US states cluster around 1.3. These values must be compared with Anderson and van
Wincoop’s measured border effects of 10.5 for Canada and 2.6 for the USA.
Balistreri and Hillberry (2007) argue that the construction of the state–state trade flows by
Anderson and van Wincoop (2003), using shipping data from the Commodity Flow Survey
(CFS), relies on a scalar numerical scaling to make the data ‘comparable’ to the Canada–US
trade data. However, it is likely that Anderson and van Wincoop’s pre-treatment of the data
has overcorrected the CFS shipping data and therefore understates US state–state trade flows.
Consequently, measured Canadian border effects would appear artificially lower since the intraCanadian trade intensity seems significantly larger when compared to the US one. As a robustness
check with respect to the magnitude of US flows, columns 4–6 of Table II replicate our estimates
when all US state–state trade flows are inflated by a factor of 1.3 (i.e., instead of adjusting the
CFS data by a factor of 0.53, we adjust it by a factor of only 0.69). As one can see, measured
Canadian border effects indeed increase to reflect the larger US trade flows, which is consistent
with the findings by Balistreri and Hillberry (2007).
As stated in the foregoing, there are many ways of modeling the error structure about which
theory has little to say. To see how sensitive the results are to the precise type of error structure,
we now run two robustness checks. First, we approximate the moving average by a more general
autoregressive error structure, which leads to a model similar to the so-called general spatial model
(Anselin, 1988) or SARAR model (Kelejian and Prucha, 2009). Consider a vector of error terms
u that is cross-sectionally correlated according to the autoregressive structure:
u D Wd u C ε
24
where ε is i.i.d. and normally distributed with zero mean and variance 2 I. Provided that jj < 1/,
where is the spectral radius of Wd (see for example, Kelejian and Prucha, 2009), we can write:
u D [In2 Wd ]1 ε D
1
jD1
j
j Wd ε C ε
When the successive powers in the sum converge to 0 sufficiently quickly, the autoregressive
structure approximates appropriately the first-order moving average, i.e., u ³ ε C Wd ε as in (19).
Observe that this approximation is reasonably accurate provided that is small enough and that
the elements of the successive powers of Wd converge to zero sufficiently quickly.
We estimate the SARAR specification with homogeneous coefficients (given by equations (18)
and (24)). The results are summarized in columns 1–3 of Table III. Observe that, as in the
SARMA model, the autoregressive interaction coefficient is negative and highly significant in
all estimations, which is in accord with the underlying theory stipulating that trade flows towards
the same market are substitutes. The magnitude of is slightly larger but reasonably close to the
one obtained in the SARMA model. All remaining coefficients are precisely estimated and the
signs are identical to those obtained under the SARMA specification. The distance coefficients
Copyright  2010 John Wiley & Sons, Ltd.
J. Appl. Econ. (2010)
DOI: 10.1002/jae
K. BEHRENS, C. ERTUR AND W. KOCH
Table III. Homogeneous coefficients, SARAR and SAR regressions
Model
Dependent variable
17.220a
(1.028)
1.023a
ln wi
(0.177)
ij
d
1.246a
(0.034)
bij
1.102a
(0.064)
0.141a
(0.047)
0.504a
(0.054)
1 surfi
Internal distance
3
AIC
8.508
BIC
8.491
Border effects (total) (intra)/(inter)
CA
7.035a
(1.208)
CA–CA/CA–US
2.652a /0.377a
(0.057)/(0.057)
US
1.288a
(0.019)
US–US/US–CA
1.135a /0.881a
(0.007)/(0.007)
‘Average border’
3.010a
(0.236)
Constant
SARAR
SAR
ln Zij
ln Zij
17.049a
(1.040)
1.099a
(0.179)
1.321a
(0.037)
1.123a
(0.065)
0.142a
(0.047)
0.497a
(0.055)
2 surfi
3
8.537
8.520
17.260a
(1.033)
1.031a
(0.178)
1.213a
(0.033)
1.108a
(0.064)
0.145a
(0.047)
0.503a
(0.055)
1
4 minj fdij g
8.518
8.501
13.869a
(0.709)
1.163a
(0.176)
1.255a
(0.035)
1.046a
(0.066)
0.033
(0.029)
—
1 surfi
3
2.299
2.283
13.740a
(0.720)
1.238a
(0.179)
1.331a
(0.038)
1.067a
(0.067)
0.030
(0.030)
—
2 surfi
3
2.327
2.310
13.890a
(0.712)
1.173a
(0.177)
1.223a
(0.034)
1.052a
(0.066)
0.030
(0.030)
—
1
4 minj fdij g
2.309
2.292
7.296a
(1.288)
2.701a /0.370a
(0.058)/(0.058)
1.294a
(0.020)
1.138a /0.879a
(0.008)/(0.008)
3.073a
(0.248)
7.115a
(1.233)
2.667a /0.375a
(0.057)/(0.057)
1.290a
(0.019)
1.136a /0.880a
(0.007)/(0.007)
3.030a
(0.240)
6.366a
(1.080)
2.523a /0.396a
(0.058)/(0.058)
1.272a
(0.019)
1.128a /0.887a
(0.008)/(0.008)
2.845a
(0.220)
6.611a
(1.151)
2.571a /0.389a
(0.059)/(0.059)
1.278a
(0.020)
1.130a /0.885a
(0.008)/(0.008)
2.907a
(0.231)
6.441a
(1.102)
2.538a /0.394a
(0.058)/(0.058)
1.274a
(0.019)
1.129a /0.886a
(0.008)/(0.008)
2.864a
(0.223)
Note: The first three columns correspond to the estimation of the model given by equations (18) and (24), whereas
the last three columns correspond to the estimation of the model given by equation (25). Standard errors are given in
parentheses (with a D 0.01, b D 0.05, c D 0.1); those for border effect coefficients are computed using the Delta method.
See supporting information Appendix C.1 for an explanation of how to compute the border effects. Following Feenstra
(2004), average border effects are computed as the geometric mean of the individual border effects. The number of
observations is 1600. The interaction matrix is Wd in all specifications; it is normalized by its spectral radius (see, for
example, Kelejian and Prucha, 2009) so that the normalized autoregressive coefficients are in the interval (1,1). AIC
and BIC stand for the Akaike and the Schwarz information criteria, respectively.
and border effects remain fairly similar, and the wage coefficient is again negative and highly
significant.
As a second robustness check, we re-estimate the model by introducing the error terms in an
ad hoc way. The easiest way of doing so is to rewrite (18) as follows:
C ˇ2 w C b
C Wd Z C ε
Z D ˇ0 1I C ˇ1 d
25
which simply amounts to adding the i.i.d. normal error term ε to the estimating equation. The
resulting specification (25) is similar to a standard spatial autoregressive model (SAR model)
(Lee, 2004). Columns 4–6 in Table III summarize estimation results for the SAR model with
homogeneous coefficients. Observe that, although the other coefficients remain fairly stable, the
autoregressive interaction coefficient is not significantly different from zero (with even positive
point estimates). This result runs plainly against the underlying theory which predicts a negative
cross-sectional correlation across trade flows. Hence the ad hoc introduction of the error term is
not backed by the data in the sense that it is incompatible with the qualitative predictions of the
Copyright  2010 John Wiley & Sons, Ltd.
J. Appl. Econ. (2010)
DOI: 10.1002/jae
DUAL GRAVITY
theory presented in Section 2. As can be further seen, both the wage and border coefficients are
biased.
4.4. Econometric Estimation: Heterogeneous Coefficients
All previous estimates are based on the assumption that coefficients are identical across regions.
Yet, as one can see from equation (16), theory predicts that coefficients are region specific. This is
in accord with recent findings by Helpman et al. (2008, p. 474), who note in an international
context that ‘the elasticities vary widely across different country pairs’. We now re-estimate
the model by allowing for region-specific coefficients as implied by the underlying theoretical
specification. In so doing, we restrict ourselves to the SARMA model (given by (16) and (19))
because: (i) OLS have no theoretical foundation; (ii) the SARAR approximation yields very similar
results; and (iii) the SAR specification yields results that run against the theory.
As already mentioned in the foregoing, we estimate a simpler heterogeneous coefficients model
in which only the non-autoregressive parameters ˇi are allowed to vary across regions, whereas
the autoregressive interaction coefficients and vary by country only (
CA and CA for Canada,
and US and US for the USA). The estimation results for this ˇ-heterogeneous SARMA model are
summarized in Table IV, where we give the estimated coefficients for the distances and the borders,
the true regional distance elasticities εdij , as well as the border effects and their decomposition (see
supporting information Appendix C.2 for additional details). We also report the country-specific
autoregressive coefficients.
Several comments are in order. First, it is worth noting that there is a substantial amount of
heterogeneity in the estimated coefficients, both for distance elasticities, border effects, and autoregressive coefficients. As can be seen from Table IV, the autoregressive interaction coefficients
for both Canada and the USA are negative and significant. Furthermore, as O US < O CA , the US
market appears ‘more competitive’ than the Canadian market. The estimated distance coefficients
(which differ from the true elasticities εdij because of the cross-sectional interdependence and
the implicit form of the estimating equation) range from 0.639 for California to 2.615 for
Newfoundland and Labrador.11 The real elasticities εdij are very similar, with values in the same
range.
The regional structure of distance elasticities is depicted in Figure 1. The Canadian core regions
(Ontario and Québec), as well as the northeast of the USA (Maryland, New York, Pennsylvania)
form a cluster of regions with small distance elasticities (in absolute value) of trade flows. The
same holds true for the western states and provinces (British Columbia, Washington, California),
whereas the Great Plains and the remote Canadian provinces have relatively larger distance
elasticities (in absolute value). Table IV and Figure 2 reveal that, as expected, the Canadian
provinces face larger border effects than the US states. The reason for this is as in Anderson
and van Wincoop (2003) and explained in detail in supporting information Appendix C.2. As
further shown by Table IV, the intranational trade-boosting effect of the border is much larger
for Canadian provinces than for US states. Put differently, ‘trade barriers raise size adjusted trade
within small countries more than within large countries’ (Anderson and van Wincoop, 2003,
p. 176). Furthermore, the trade-reducing international effect of the border for Canadian exports
is larger than that for US exports, which illustrates again that the border has a stronger effect
on Canadian firms than on US firms. The reason is that the internal market of the USA is much
11 As the estimating equation is given in implicit form, the true distance elasticities differ from the estimated coefficients.
Starting from (16), they can be computed as d [I Wd ]1 ˇ1 , which is derived from the explicit solution to the
estimating equation. Note that d is generally an n2 ð n2 matrix of distance elasticities. Yet, since all distance elasticities
with the same origin index are identical, there are only n distinct elasticities.
Copyright  2010 John Wiley & Sons, Ltd.
J. Appl. Econ. (2010)
DOI: 10.1002/jae
K. BEHRENS, C. ERTUR AND W. KOCH
Table IV. ˇ-heterogeneous SARMA regressions
Model
SARMA
Code
Region name
AB
BC
MB
NB
NL
NS
ON
PE
QC
SK
AL
AZ
CA
FL
GA
ID
IL
IN
KY
LA
ME
MA
MI
MN
MO
MT
MD
NC
ND
NH
NJ
NY
OH
PA
TN
TX
VT
VA
WA
WI
Alberta
British Columbia
Manitoba
New Brunswick
Newfoundland and Labrador
Nova Scotia
Ontario
Prince Edward Island
Quebec
Saskatchewan
Alabama
Arizona
California
Florida
Georgia
Idaho
Illinois
Indiana
Kentucky
Louisiana
Maine
Massachusetts
Michigan
Minnesota
Missouri
Montana
Maryland
North Carolina
North Dakota
New Hampshire
New Jersey
New York
Ohio
Pennsylvania
Tennessee
Texas
Vermont
Virginia
Washington
Wisconsin
ij
d
1.446a
0.985a
1.883a
1.754a
2.615a
1.460a
1.155a
1.729a
1.152a
1.727a
1.354a
1.158a
0.639a
1.018a
1.359a
1.180a
1.001a
1.287a
1.340a
1.472a
1.333a
0.787a
1.269a
1.424a
1.488a
1.701a
0.933a
1.112a
2.026a
1.025a
0.758a
0.674a
1.095a
0.743a
1.331a
0.958a
1.161a
1.013a
1.167a
1.298a
bij
1.268a
0.976a
1.294a
1.615a
0.061
1.351a
1.635a
1.092a
1.389a
1.326a
2.842
3.700
2.305
0.001
2.787
2.563
0.400
3.141
1.723
2.429
7.115a
2.481
0.867
1.881
0.549
0.959
0.433
2.750
4.393c
1.151
3.697
0.562
1.036
1.282
1.292
1.527
4.264
5.369b
3.264
2.164
CA D 0.001a
US D 0.003a
CA D 0.003a
US D 0.016a
εdij
bij intra
bij inter
Total border
1.449
0.987
1.884
1.755
2.616
1.460
1.162
1.729
1.157
1.729
1.362
1.164
0.659
1.035
1.371
1.182
1.016
1.297
1.346
1.481
1.335
0.793
1.285
1.432
1.498
1.702
0.939
1.122
2.027
1.027
0.765
0.689
1.111
0.754
1.340
0.979
1.161
1.021
1.174
1.307
3.139
2.423
3.190
4.232
1.064
3.349
4.605
2.653
3.594
3.279
1.383
1.528
1.355
0.990
1.380
1.332
1.040
1.437
1.212
1.318
2.256
1.330
0.892
0.794
0.928
0.887
0.941
1.374
1.644
1.132
1.544
1.063
0.872
1.159
1.154
1.201
1.619
1.875
1.456
1.279
0.319
0.413
0.314
0.236
0.940
0.299
0.217
0.377
0.278
0.305
0.723
0.654
0.738
1.010
0.725
0.751
0.962
0.696
0.825
0.759
0.443
0.752
1.122
1.260
1.077
1.128
1.062
0.728
0.608
0.884
0.648
0.940
1.146
0.863
0.867
0.833
0.618
0.533
0.687
0.782
9.850a
5.871a
10.174a
17.906a
1.133
11.216a
21.207a
7.037a
12.918a
10.752a
1.914
2.336
1.835
0.980
1.905
1.775
1.081
2.065
1.470
1.738
5.089a
1.769
0.795
0.630
0.862
0.787
0.886
1.889
2.703c
1.281
2.382
1.131
0.761
1.344
1.331
1.442
2.622
3.516b
2.119
1.635
Note: Results correspond to the estimation of the model given by equations (16) and (19), where we impose countryspecific autoregressive parameters. Significance is displayed as a D 0.01, b D 0.05, and c D 0.1. The interaction matrix
is Wd in all specifications; it is normalized by its spectral radius (see, for example, Kelejian and Prucha, 2009) so that
the normalized autoregressive and moving average coefficients are both in the interval (1,1). See supporting information
Appendix C.2 for an explanation of how to compute the border effects. Distance elasticities εdij are computed from the
explicit form of the model. See supporting information Appendix D for details on the maximum likelihood estimation of
the heterogeneous SARMA model.
larger, so that borders affect only a much smaller part of sales from US firms than from Canadian
firms.
Last, as can be seen from Table IV, almost all border coefficients for US states are statistically
insignificant. There are some statistically significant and relatively large positive measured border
Copyright  2010 John Wiley & Sons, Ltd.
J. Appl. Econ. (2010)
DOI: 10.1002/jae
DUAL GRAVITY
-1.88
-1.73
-1.45
-0.98
-2.62
-1.15
-1.16
-1.17
-2.03
-1.70
-1.73
-1.75
-1.16-1.33
-1.46
-0.67
-1.02
-1.27
-0.79
-1.00-1.29-1.09 -0.74
-0.76
-1.49
-1.34
-1.01 -0.93
-1.33
-1.1
1
-1.42
-1.30
-1.18
-0.64
-1.16
-0.96
-1.35-1.36
-1.47
-1.02
Figure 1. Regional structure of distance elasticities
10.75
9.85
5.87
10.17
1.13
21.21
2.12
0.79
1.77
2.70
0.63
17.917.04
2.62 5.09 11.22
1.13 1.28
1.34
1.77
1.082.06 0.76
2.38
0.89
0.86
1.47
3.52
1.33
1.89
1.64
-0.46
1.83
2.34
1.44
12.92
0.79
1.91 1.91
1.74
0.98
Figure 2. Regional structure of total border effects
effects for states such as Maine, North Dakota and Virginia, but these are rather exceptions.12 On
the contrary, the border effects for Canadian provinces are almost all highly significant and there
is a lot of variation. Magnitudes for the border effects range from insignificant in Newfoundland
to about 21.2 in Ontario, yet most values are clustered around 10–12. On the contrary, border
12 Although we obtain some ‘negative border effects’ (i.e., border effects smaller than 1) for several US states, which
runs against theory, these are always statistically insignificant.
Copyright  2010 John Wiley & Sons, Ltd.
J. Appl. Econ. (2010)
DOI: 10.1002/jae
–.5
K. BEHRENS, C. ERTUR AND W. KOCH
CA
NY
–1
BC
VT
ID
–1.5
ME
NS AB
MD
AZ WA
IL FL
OH
ON
VA
NC
QC
WIIN
KY
ALTN GA
MN
LAMO
TX
MI
MT
SK
PENB
MB
–2
distance elasticity
NH
PA
NJ
MA
–2.5
ND
NL
0
.02
.04
.06
.08
.1
.12
population share
Figure 3. Distance elasticities and regional size
effects for the US states are uniformly smaller, ranging from insignificant to a high of about
5. To sum up, border effects are generally small for US exports to Canada, whereas they do
exist for Canadian exports to the USA. Their magnitude, however, seems smaller than previously
thought once ‘multilateral resistance’ is captured via the cross-sectional interdependence of trade
flows.
Last, as shown by expression (17), the model predicts the existence of a positive and concave
relationship between a region’s relative size Li /L and its distance coefficient ˇ1i . Figure 3, which
summarizes our estimation results from Table IV, reveals the existence of this pattern relating
regional sizes to distance elasticities: larger regions face lower distance elasticities than smaller
regions do. One possible explanation for this finding is that smaller markets are less competitive,
so that less productive firms are selected into those markets (Melitz, 2003; Melitz and Ottaviano,
2008). Provided there is a positive correlation between a firm’s productivity and its shipping costs,
these firms then have a greater handicap in serving foreign markets, thus facing higher distance
elasticities than more productive firms in larger and more competitive markets.
5. CONCLUSIONS
Building on a ‘dual’ version of the gravity equation, we have shown how techniques adapted
from the spatial econometrics literature can be used to control for cross-sectional interdependence
among trade flows. Handling such interdependence is a major issue for consistent estimation but
has been rather elusive until now. Our results suggest that, as in Anderson and van Wincoop
(2003), theory-based estimation of the gravity equation leads to smaller border effects than those
obtained with ad hoc specifications or fixed-effects methods. Put differently, there seems to be
less of a ‘border puzzle’ once the cross-sectional correlations in the data have been controlled for.
Besides reducing the magnitudes of the border effects, our methodology offers a number of
additional advantages when compared to previous approaches: (i) it accounts for cross-sectional
interdependence among trade flows, as implied by our model, and thus directly controls for
multilateral resistance; (ii) it uses a more careful modeling of the error structure, thereby controlling
for possible cross-sectional interdependence in the error terms; (iii) it does not impose a rigid
pattern on the interaction structure and is therefore more robust to potential misspecification;
Copyright  2010 John Wiley & Sons, Ltd.
J. Appl. Econ. (2010)
DOI: 10.1002/jae
DUAL GRAVITY
furthermore, our interaction structure is theory-based; and (iv) it allows for estimation of regionspecific coefficients.
While we have tried to stay as close as possible to the structural model, we have not yet
estimated the fully heterogeneous model with region-specific autoregressive coefficients. Doing so
is a daunting task that is left for future work.
ACKNOWLEDGEMENTS
We thank the editor M. Hashem Pesaran, two anonymous referees, as well as Peter Egger, Giordano
Mion, Yasusada Murata, Jan Mutl, Giovanni Peri, Michael Pfaffermayr, seminar participants in
Louvain-la-Neuve, Innsbruck, and conference participants in Kiel, Toronto, Louvain-la-Neuve,
Cambridge, Tokyo, Paris, and the 62nd European Meetings of the Econometric Society for very
helpful comments and suggestions. Behrens is holder of the Canada Research Chair in Regional
Impacts of Globalization. Financial support from the CRC Program of the Social Sciences and
Humanities Research Council (SSHRC) of Canada is gratefully acknowledged. Behrens also
acknowledges financial support from the European Commission (Grant MEIF-CT-2005-024266)
and from FQRSC Québec (Grant NP-127178). Behrens and Koch gratefully acknowledge financial
support from CNRS-GIP-ANR (Grant JC-05-439904). Part of this research was undertaken while
Wilfried Koch was visiting CORE, Université catholique de Louvain, and UQAM. The usual
disclaimer applies.
REFERENCES
Akerlof GA. 1997. Social distance and social decision. Econometrica 65: 1005–1027.
Anderson JE. 1979. A theoretical foundation for the gravity equation. American Economic Review 69:
106–116.
Anderson JE, van Wincoop E. 2003. Gravity with gravitas: a solution to the border puzzle. American
Economic Review 93: 170–192.
Anderson JE, van Wincoop E. 2004. Trade costs. Journal of Economic Literature 42: 691–751.
Anderson MA, Smith SLS. 1999. Canadian provinces in world trade: engagement and detachment. Canadian
Journal of Economics 32: 22–38.
Anselin L. 1988. Spatial Econometrics: Methods and Models. Kluwer: Dordrecht.
Anselin L. 2006. Spatial econometrics. In Palgrave Handbook of Econometrics, Vol. I: Econometric Theory,
Mills TC, Patterson K (eds). Palgrave Macmillan: New York; 901–969.
Anselin L, Bera A. 1998. Spatial dependence in linear regression models with an introduction to spatial
econometrics. In Handbook of Applied Economic Statistics, Ullah A, Giles D. (eds). Marcel Dekker: New
York; 237–289.
Baier SL, Bergstrand JH. 2009. Bonus vetus OLS: a simple method for approximating international trade-cost
effects using the gravity equation. Journal of International Economics 77: 77–85.
Balistreri EJ, Hillberry R. 2007. Structural estimation and the border puzzle. Journal of International
Economics 72: 451–463.
Baltagi BH, Egger P, Pfaffermayr M. 2003. A generalized design for bilateral trade flow models. Economics
Letters 80: 391–397.
Behrens K, Mion G, Murata Y, Südekum J. 2009. Trade, wages, and productivity. CEPR Discussion Paper
#7369, Center for Economic Policy Research, London.
Blonigen BA, Davies RB, Naughton HT, Waddell GR. 2007. FDI in space: spatial autoregressive relationships in foreign direct investment. European Economic Review 51: 1303–1325.
Broda C, Weinstein D. 2006. Globalization and the gains from variety. Quarterly Journal of Economics 121:
541–585.
Brueckner JK. 1998. Testing for strategic interaction among local governments: the case of growth controls.
Journal of Urban Economics 44: 438–467.
Case AC, Rosen HS, Hines JR. 1993. Budget spillovers and fiscal policy interdependence: Evidence from
the states. Journal of Public Economics 52: 285–307.
Copyright  2010 John Wiley & Sons, Ltd.
J. Appl. Econ. (2010)
DOI: 10.1002/jae
K. BEHRENS, C. ERTUR AND W. KOCH
Chaney T. 2008. Distorted gravity: the intensive and extensive margins of international trade. American
Economic Review 98: 1707–1721.
Chudik A, Pesaran H, Tosetti E. 2010. Weak and strong cross section dependence and estimation of large
panels. Econometrics Journal (forthcoming)..
Cliff AD, Ord GK. 1981. Spatial Processes: Models and Applications. Pion: London.
Conley TG, Ligon E. 2002. Economic distance and cross-country spillovers. Journal of Economic Growth
7: 157–187.
Conley TG, Topa G. 2002. Socio-economic distance and spatial patterns in unemployment. Journal of Applied
Econometrics 17: 303–327.
Disdier A-C, Head K. 2008. The puzzling persistence of the distance effect on bilateral trade. Review of
Economics and Statistics 90: 37–48.
Eaton J, Kortum S. 2002. Technology, geography and trade, Econometrica 70: 1741–1779.
Ertur C, Koch W. 2007. Growth, technological interdependence and spatial externalities: theory and evidence.
Journal of Applied Econometrics 22: 1033–1062.
Feenstra RC. 2004. Advanced International Trade. Princeton University Press: Princeton, NJ.
Felbermayr GJ, Kohler W. 2006. Exploring the intensive and extensive margins of world trade. Review of
World Economics 142: 642–674.
Head K, Mayer T. 2002. Illusory border effects: distance mismeasurement inflates estimates of home bias in
trade. CEPII Discussion Paper #2002/1.
Helpman E, Melitz MJ, Rubinstein Y. 2008. Estimating trade flows: trading partners and trading volumes.
Quarterly Journal of Economics 123: 441–487.
Huang SJ. 1984. The autoregressive moving average model for spatial analysis. Australian Journal of
Statistics 26: 169–178.
Hummels D. 2001. Towards a geography of trade costs. Mimeo, Krannert School of Management, Purdue
University, West Lafayette, IN.
Kelejian HH, Prucha IR. 1998. A generalized spatial two-stage least square procedure for estimating a spatial
autoregressive model with autoregressive disturbances. Journal of Real Estate Finance and Economics 17:
99–121.
Kelejian HH, Prucha IR. 2009. Specification and estimation of spatial autoregressive models with autoregressive and heteroskedastic disturbances. Journal of Econometrics 157: 53–67.
Lee L-F. 2004. Asymptotic distributions of quasi-maximum likelihood estimators for spatial autoregressive
models. Econometrica 72: 1899–1925.
LeSage JP, Pace RK. 2008. Spatial econometric modeling of origin–destination flows. Journal of Regional
Science 48: 941–967.
LeSage JP, Pace RK. 2009. Introduction to Spatial Econometrics. Chapman & Hall/CRC: New York.
McCallum JM. 1995. National borders matter: Canada–U.S. regional trade patterns. American Economic
Review 85: 615–623.
Melitz MJ. 2003. The impacts of trade on intra-industry reallocations and aggregate industry productivity.
Econometrica 71: 1695–1725.
Melitz MJ, Ottaviano GIP. 2008. Market size, trade, and productivity. Review of Economic Studies 75:
295–316.
Pesaran H, Tosetti E. 2010. Large panels with spatial correlation and common factors. CESifo Working Paper
No. 2103, September 2007, revised January 2010.
Pinkse J, Slade M, Brett C. 2002. Spatial price competition: a semiparametric approach. Econometrica 70:
1111–1153.
Redding S, Venables AJ. 2004. Economic geography and international inequality. Journal of International
Economics 62: 53–82.
Copyright  2010 John Wiley & Sons, Ltd.
J. Appl. Econ. (2010)
DOI: 10.1002/jae