Space Is more than Geography

Space Is more than Geography1
Nathaniel Beck and Kristian Gleditsch
Department of Political Science
University of California, San Diego
La Jolla, CA 92093
USA
Draft of March 22, 2003. Prepared for the Joint Meetings of the European
Consortium for Political Research, Edinburgh, Mar.-Apr., 2003
1 We
thank James Morrow and Randy Siverson for the use of their data. Kyle Beardsley was most
helpful in updating the data and otherwise making it useful for us. James P. LeSage and Kelly Pace
have generously made available their code for spatial analysis in Matlab and were extraordinarily
helpful in answering our numerous questions. Finally, we thank Michael Ward for impressing on
us the importance of space.
Spatial econometric models have begun to make inroads into the study of political science,
and, in particular, the study of international relations.1 Spatial econometrics has its roots in
the study of geography, so, not unnaturally, these applications have typically used geographic
notions in their spatial econometrics. But there is no inherent reason for this. In this paper,
we discuss two very different applications to IR data that, while using spatial econometrics,
do not use geographic notions of space.
Our first application is technical. Much IR analysis is based on the dyad year design.
Much attention has been paid to the interdependence of the dyads, with most attention
having been paid to whether the successive annual observations on the same dyad are independent (Beck and Katz, 1996). More generally, time-series–cross-section analysts have
worried about whether observations on different units at the same time point are independent (Beck and Katz, 1995). But while the latter endeavor proved somewhat deleterious,
the notion that different observations are dependent cannot be ignored. Here we consider
one type of interdependence that arises in dyadic data; two dyads that contain a common
member are unlikely to be independent. Perhaps even more seriously, analysts often study
the directed dyad AB and the directed dyad BA; these two dyads are particularly unlikely
to be independent. We use some spatial econometrics to help with this issue. But rather
than use a geographic notion of closeness, we posit that two dyads are close if they share a
common member, and are especially close if they are the reverse of each other. We apply
this approach to the study of politics and trade of Morrow, Siverson and Tabaras (1999).
A more interesting substantive issue is that spatial econometric methods allow us to use
Deutsch’s (1954) notion of community based on flows of people, communication, trade in our
models. Most econometric models assume that each unit (a nation or dyad) is independent of
all others (with all dependencies being in the same unit at different points in time). Spatial
econometricians have begun to break this down, allowing for attributes such as democracy or
war to be a function of those attributes in neighboring units, where neighboring is used in the
usual geographic sense. This alone was a major breakthrough, but we can do better, allowing
for units to be near each other insofar as they are members of a Deutschian community.
We can also use other political notions of neighborliness. Thus two nations may be
neighbors if they share a common IGO membership or are in a joint alliance. Thus, for
example, nations may become more democratic if they are in common market or tariff
agreement with other democratic nations; this is one of the arguments as to why it was
important that China join the WTO. (Or nations may become more democratic in an attempt
to join a common market of democratic states, as the example of Turkey may perhaps show.)
This also can be modeled with spatial methods.
In the next section of the paper we lay out the basic spatial econometric model. Since the
model is well described elsewhere (Anselin, 1988), and we use only well known estimation
techniques, we do not discuss the estimation of spatial models here. But in the following
sections, we show how spatial methods, with non-geographic notions of “distance.” can be
used in several models of interest to IR scholars.
1
See, for example, the various articles in the special issue of Political Analysis (10:3), as well as Gleditsch
(2002) and Gleditsch and Ward (2000).
1
1
The spatial econometric model/
While much of our interest is in time-series—cross-section (TSCS) data, where units (nations
or dyads) are observed annually over a long time period, we begin with a simple crosssectional setup.
Let yi represent some dependent variable of interest, and, as usual, we assume it is a
linear function of covariates, xi and some unmeasured variables, “the error,” εi , so
yi = xi β + εi .
(1)
Linearity here is purely for ease of notation, and any non-linear extensions available in typical
econometric models are available in the spatial framework.
The most critical assumption, other than that the specification is correct, is that the
covariates are independent of the error process. We begin with that assumption also. However, we weaken the assumption that the error process (and hence the yi |xi ) are independent
across observations. The basic spatial insight is that “errors” (best though of as omitted or
unmeasured variables) in unit i are related to the “errors” in nearby units. While nearby
is typically used in a geographic sense, there is no reason why we cannot use any notion of
nearness that we prefer, so long as this is specified by the analysts.
Spatially lagged errors
Letting wi be a vector for how “close” the other observations are to unit i, and setting
wi = 0, and letting ε be the vector of all errors, we get the “spatially lagged error” model
yi = xi β + εi + λwi0 ε.
(2)
If λ = 0, this reduces to the standard non-spatial model.
If λ 6= 0, OLS is still consistent, but the reported standard errors will be wrong and the
estimated β̂ will be inefficient. This can be fixed by typical GLS reasoning, though complications require a full maximum likelihood estimation.2 Our interest here is on alternative
weighting schemes that are more relevant to political scientists than the typical geographic
schemes used by geographers (where the elements of W are either zero or one based on
contiguity, or a measure of the physical distance between unit i and the various other units).
The spatially lagged error model corresponds to the model in time series analysis where
the errors show some temporal correlation process. The analogy to the time series serially
correlated error model is useful. This analogy tells us that the only way that observations are
interdependent is through unmeasured variables that are correlated, in this case across space.
The model is odd, in that space matters in the “error process” but not in the substantive
portion of the model. Moreover, if we add a new variable to the model, so we move it from
the “error” to the substantive portion of the model, the spatially lagged error model assumes
that this variable no longer has a spatial impact of nearby observations. This assumption
2
For this and all other econometric details, the reader should consult Anselin’s (1988) text.
2
seems to us hard to defend. It is the case, however, that the spatially lagged error model
does seem appropriate for our problem of dealing with interconnected dyads. We return to
this issue in the next section.
If we have TSCS data, with observations indexed both by unit i, and time, t, (and
assuming without loss of generality a rectangular data structure), the spatially lagged error
model becomes
yi,t = xi,t β + εi,t + λwi0 εt
(3)
where εt is the vector of errors for all units i at time t. Implicit in this notation is that
all spatial impacts occur instantaneously, so that only contemporaneous errors show spatial autocorrelation. We will typically deal with temporal dynamics by including a lagged
dependent variable in the model, yielding
yi,t = φyi,t−1 + xi,t β + εi,t + λwi0 εt .
(4)
This equation is only easy to estimate if the error process shows no temporal correlations,
so the lagged y is independent of the error process. This assumption is often reasonable in
practice, and can be tested via a Lagrange multiplier test (Beck and Katz, 1996).
Spatial Lags
The spatially lagged error model corresponds to the time-series serially correlated errors
model. The “spatial lag” model corresponds to the time-series lagged dependent model. In
this model, the dependent variable is impacted by the values of the dependent variable in
nearby units, with nearby suitably defined. It differs from the spatially lagged errors model,
in that both the error term and the covariates in nearby units impact the current unit. Thus,
for example, let the dependent variable be the level of democracy in a country. Is it likely
that this is partly a function of democracy in nearby countries, rather than just being related
to common unmeasured variables in nearby units?
Again, we start with the simple cross-sectional model. Using the same notation as above,
and letting y be the vector of values for y, the spatial lag model has
yi = xi β + κwi0 y + εi .
(5)
This is a difficult model to estimate, but it can be done by complicated maximum likelihood.
As above, we are interested in weighting vectors that are more politically inspired than the
distance or proximity weights used by geographers.
This model can be generalized for TSCS data by
yi,t = xi,t β + κwi0 yt + εi,t
(6)
where the notation implies that the weighting vector is time invariant (although it is easy to
allow it to vary over time, so long as this is specified ex ante). Here yt refers to the vector
of observations on the dependent variable for all units at time t.
3
Since TSCS models normally show temporal dynamics, we can add a temporal lag of y
to the model, yielding
yi,t = xi,t β + φyi,t−1 + κwi0 yt + εi,t .
(7)
As above, estimation is much simplified if the remaining errors are temporally independent,
which can be assessed via a Lagrange multiplier test.
TSCS data allow us for an interesting alternative to the spatial lagged model. Suppose
that we continue to maintain that yi,t is related to the neighboring y’s, but we believe that
this impact occurs with a one period lag. This yields the model
yi,t = xi,t β + φyi,t−1 + κwi0 yt−1 + εi,t .
(8)
If the errors are temporally independent, this model is easy to estimate via OLS. In our
applications in Section 3 we compare the results of the spatial lag model with the simpler to
estimate temporally lagged spatial lag model.
2
Allowing for dyadic interdependence
Much recent work in TSCS analysis has attempted to allow for interdependent observations.
But one form of interdependence that has not been well studied is that dyads which contain
a common member may be interdependent, and, in data sets with directed dyads, any dyad
and its reverse should be strongly related.3 .
We can easily define a weighting vector that says that one dyad is near another if both
share a common member; if the two dyads are the reverse of each other, the weight is some
value v > 1; for all other dyads, the relevant weighting entry is zero. While in principal one
could estimate the value of v, here we just try several values and choose the one yielding the
best fit.
Because y must be continuous,4 we chose a data set dealing with the political determinants
of trade (viewed as exports directed from A to B). While in principal we can deal with very
large data sets, in practice we have not been able to analyze our largest data set, which
consists of yearly dyadic trade for all dyads in the political system from 1950–2000. We
therefore decided to reanalyze the basic model of Morrow, Siverson and Tabaras (1999),
which analyzes the political determinants of trade amongst the seven major powers for
the period 1905–90. This study uses the exports of one country to another (in constant
dollars), yielding 42 directed dyads. Their estimation method assumed that all 42 dyads
were independent of each other. Here we compare the spatially lagged error model (Eq. 4 to
a more typical OLS estimation.
3
The only research on this that we know of is by Mansfield and Bronson (1997) which adjoins to each
model two dummy variables, one to represent each dyad. With much IR data, these fixed effects are not
ideal (Beck and Katz, 2001).
4
If y is discrete, the spatial setup is much more difficult and we do not pursue it here. See citeward:gled:2002 for one approach to this problem.
4
While the reader should see the original article for full details, Morrow et al. use both
political and non-political determinants of trade in a gravity model. Thus they regress the
log of exports on the GDP and population of both exporter and importer and the distance
between the two (all values logged).5 ’ Their political variable are whether the dyad is in
a Militarized Interstate Dispute (MIDL), the similarity of their alliance portfolios (TAUL),
whether the dyad is democracies (DEMDL), and whether the two nations are allied multiplied
by a dummy to mark the multipolar era (pre-1948) or the bipolar era (post-1948). Data from
both World Wars I and II were omitted.6
Morrow et al. use a complicated serially corrected error scheme to deal with temporal dependence, and spent much attention on how to model this given the gaps in the observations.
We choose the econometrically simpler, and theoretically more compelling, error correction
model. This model, due originally to Davidson, Hendry, Srba and Yeo (1978), assumes that
trade and the independent variables are in a long run equilibrium relationship; when the
system is out of equilibrium, it adjust back to the long run equilibrium, with the speed of
re-equilibration empirically determined. The model also allows for short run effects. Thus
the first difference of (logged) trade is a function of the first difference of the independent
variables, and the lags of all variables (including the lagged level of (logged) trade). The lag
of the trade variable gives the speed of re-equilibration; the coefficients on the other lagged
variables yield the long run equilibrium relationship (up to a constant). We drop the first
difference of variables such as distance, which are not meaningful (thus, distance only affects
the long run equilibrium relationship).
Observations with missing lagged values are simply dropped.7 This ensures that there is
no mistakes in using a model with one year dynamics. Since our methods are different from
those of Morrow et al., we present only our own results. (To make the output easier to read,
both trade and lagged trade were multiplied by 100; this of course has no impact other than
making the coefficients reported 100 times larger, except for lagged trade.)
The first columns in Table 1 are the results assuming dyadic interdependence (both with
common OLS standard errors and “panel correct standard errors” or PCSE’s.) In this case,
the PCSE’s are similar to the OLS standard errors.8
OLS with a lagged dependent variable is only consistent if the remaining errors are
serially independent. A Lagrange multiplier test for this unfortunately allows us to reject
the null hypothesis of no serial correlation of the remaining errors. Thus our estimates are
inconsistent. However, the estimated serial correlation of the errors is -.10 (with a standard
error of .02), indicating that the errors show little remaining serial correlation. The harm
5
All values were logged because of the nature of the gravity model of trade used by Morrow, et al. This
leads to non-standard coding for dummy variables and such, but since our interest is only on the changes in
coefficients and standard errors, we need not go into this here.
6
Some other missing data was also omitted. Thus measurements on Germany do not start until 1950,
and so there are no German dyadic measures for 1948 or 1949.
7
By using other data sources and filling in some missing data by interpolation where that made sense, we
were able to limit the number of dropped observations to those immediately following the two World Wars.
8
This means that PCSE’s alone are not capable of correcting for the dyadic dependence we find. This is
not surprising, since PCSE’s were not meant to be used in this way.
5
Table 1: The Political Determinants of Exports Amongst the Major Powers 1907–1990
OLS
Dyadic Interdependence
Variable
Coefficient Std. Err. PCSE Coefficient Std. Err.
POLITICAL VARIABLES
FIRST DIFFERENCES
DEMOC
26.14
12.21
8.58
16.36
13.99
MIDL
−14.28
5.22
6.52
−14.02
5.53
TAUL
−11.54
13.65
12.78
−12.50
15.62
ALLIANCE (pre-1940)
−24.92
8.44
7.82
−16.92
9.13
ALLIANCE (post-1940)
−.61
10.68
8.79
−2.59
11.77
LEVEL (LAGGED)
DEMOC
8.80
3.10
2.86
8.84
2.71
MIDL
−42.14
7.00
8.82
−41.83
7.35
TAUL
11.42
7.79
7.06
9.85
8.38
ALLIANCE (pre-1940)
−26.34
5.43
5.10
−18.42
5.93
ALLIANCE (post-1940)
−2.59
4.86
4.39
−1.38
5.27
CONTROLS
FIRST DIFFERENCES
GNP Exporter
POP Exporter
GNP Importer
POP Importer
LEVELS (LAGGED)
GNP Exporter
POL Exporter
GNP Importer
POP Importer
DISTANCE
Trade
Intercept
λ
R2
N=2446
34.72
5.37
13.61
9.20
3.71
5.15
3.66
5.81
4.13
4.96
4.82
4.53
33.99
25.58
12.00
54.32
4.24
54.67
3.70
55.55
3.98
1.48
4.19
0.82
−3.47
−.08
−33.94
1.10
2.19
1.05
2.19
0.90
0.01
11.67
1.22
2.07
1.28
2.44
0.80
.01
14.23
4.73
1.55
4.66
0.91
−4.20
−0.09
−37.35
.38
0.67
0.46
0.65
0.39
0.96
0.01
14.50
0.01
.13
6
.19
induced by the small amount of serial correlation in the errors is small, and the known
corrections for this harm are difficult to use, impose their own harm, and the degree of this
harm is not proportion to the harm caused by the small amount of serial correlation of the
errors. Thus we work with the OLS results using the lagged dependent variable, simply
noting they are not as perfect as we would prefer.
Since we have an error correction model, we must interpret both the long run equilibrium
values and the short run features of the model. First, we note that trade is very slow to
return to equilibrium, with only about 8% of the disequilibrium value offset in any given
year. Trade moves very slowly (or appears very smooth). However, eventually trade will
return to its equilibrium value.
Starting with the political variables we see that democratic dyads have a higher equilibrium level of trade, and that dyads which are frequently engaged in a militarized dispute
have a lower equilibrium level of trade. While dyads with similar alliance portfolios tend
to trade more, this result does not attain standard levels of significance. Dyads which were
allied in the multipolar (pre-World War II) period tend to trade less (in equilibrium); in the
post World War II bipolar period there is essentially no relationship between being allied
and the level of trade. While this finding puzzles us, Morrow, Siverson and Tabares (1990)
have similar findings, and make a security argument for the reasonableness of this finding.
Looking at first differences, the outbreak of a MID depresses trade. While changes in
alliance patterns are rare, entering an alliance in the multipolar period depresses trade;
similarly, change2 in dyadic democracy scores are rare, but when a dyad does become more
democratic its trade does appear to increase.
The economic control variables behave as expected, though neither the population of the
exporter or importer is statistically significant. There is a strong equilibrium relationship
between trade and the distance of the sender and receiver and the GNPs of both parties;
these are not surprising. Increases in GNP also increase trade in the short run.
The OLS model was re-estimated allowing for spatially lagged errors. The weighting
matrix used had an entry of one for dyads that contain a common dyadic partner (e.g. AB
and AC), three for the dyad and its reverse (AB and BA) and zero otherwise. The value
three was chosen for reverse dyads to note that they are more related than dyads which
simply contain a common member. We estimated models allowing the weights for the dyad
and its reverse to be every integer from two to ten, with three yielding the best fit based
statistics. (It should be noted that the results were almost identical for all weight values
under five, and one could have chosen any of these.) This model is presented in the two right
hand columns of Table 1.
We had expected the introduction of spatially lagged errors, which takes account of
the interconnectedness of the dyads, to have more effect than it actually did. While the
coefficient on the spatial error lag, λ, is large (.38), with a tiny standard error, the substantive
coefficients are little affected by the introduction of spatially lagged errors.
Looking at levels, for the political variables only the puzzling multipolar alliance coefficient was affected; while still significantly negative, it is reduced in magnitude by about one
third. On the first difference of the political variables, the impact of a change in whether
7
a dyad is democratic (again, a rare event) is decreased by about one third, and the impact
of becoming allied in the multipolar period declines similar (and is now marginally significant at best). The only changes in the control variables are in the impact of a change in
population, which is usually of little interest. While the coefficients on population change
dramatically, the standard errors signal to us that the model simply has trouble with the
changes in population.
While we think that the spatially lagged error results are more accurate, it is not clear
that it is worth the effort to produce the changes we have seen. This does not mean that in
other cases we would not see more dramatic changes, but, at least for this exercise, the gains
from introducing spatially lagged errors to deal with dyadic interdependence do not appear
great. While we should not sneer at even small gains, the problem with the use of spatial
econometrics is that it requires specialized software, which often means we cannot use many
techniques that are available for the non-spatial model. Thus, for example, we cannot use
any kind of semi-parametric method if we use a spatial model (this is purely a function of
software, but such a constraint is non-trivial). Thus we are left with a choice of what types
of mistakes we are willing to tolerate.
Why does taking account of dyadic interdependence seem to have such a small effect on
our results. One reason may be that with TSCS data, and a lagged dependent variable, many
of the dyadic interdependencies are already included in the lagged value of the dependent
variable. Thus the greatest gains from allowing for spatially correlated errors may be in
purely cross-sectional data. But at present this is purely speculation. We now move from
models with spatially correlated errors to models which include a spatial lag of the dependent
variable, where space is given a broad definition.
3
Spatial econometrics and Deutschian communities
Following Lipset’s social requisites hypothesis, an extensive literature has examined how
social and economic attributes influence the likelihood that countries will be democratic.
However, there are many reasons to suspects that the level of democracy in one country
could be influenced by the level of democracy in other states.9 Previous analyses have considered diffusion in the context of relations between countries that are geographic neighbors.
However, there is no particular reason why connections between states must be limited to
geographic distance per se.
Defining connectivity and space
We use three different plausible definitions of connectivity between states. Clearly, diffusion
between countries may be likely to occur between nearby countries in a geographical sense.
This is the basis for out first connectivity criterion, using the geographical distance between
9
See Gleditsch (2002) for a more extended discussion.
8
states. We use the minimum distance data from Gleditsch and Ward (2001) to define countries as connected if they are within 500 km of one another. This yields a binary connectivity
matrix where each entry wij is 1 if state i and state j are within 500 km from each other.
Each neighboring country is given equal weight in the row for country i. Here, and elsewhere
in this section, we normalize the matrix so that each row sums to 1.
In many cases, the primary reference countries are not necessarily the closest countries,
but countries that are seen as somehow “similar”. A second possible specification of closeness
could thus be connectivity defined by shared cultural attributes. Exactly what is meant by
shared culture can be somewhat difficult to pin down. Possible candidates include language,
religion, and broad cultural groupings akin to the “civilizations” suggested by Huntington
(1996). Each of these alternatives have their advantages and disadvantages. First, most
available cross-national language data, generally codes the primary languages as proper
names in a very restrictive manner. This could lead to overly narrow definitions of shared
culture that fail to take into account that many identified national languages such as Finnish
and Estonian may be closely related and mutually intelligible to their speakers.10 By contrast,
defining shared culture based on religion would lump together countries such as Switzerland,
Korea, and Paraguay, which though all predominantly Christian states, have few similarities
or connections.
In this application, we opt for a middle ground based on cultural clusters. We use a modified version of Henderson and Tucker’s (2001) operationalization of civilizations, and classify
countries as being either Buddhist, Caribbean, Islamic, Japanese, Jewish, Latin-American,
Orthodox, Pacific Islanders, Sinic or Western.11 Each country is coded as connected to other
countries in the same cultural cluster. This allows for very distant countries with high levels
of contact such as Great Britain and New Zealand to be considered cultural “neighbors”.
Our final specification of connectivity is based on the size of trade flows. A country is
considered connected to all other countries that it has some trade with. However, countries
tend be more dependent or influenced by its major trading partners, where the bilateral trade
flows are large relative to the country’s total trade. The trade connectivity matrix differs from
the previous distance and culture matrices in two notable ways. First, whereas the distance
and culture matrices assign equal weights to any geographical neighbor or country in the
cultural cluster, the trade matrix consists of weights where the importance of another state
is given by its proportion of a country’s total trade. This weights large trading partners
much more heavily than smaller trading partners. Moreover, in the distance and culture
matrix, any neighbor of A must always have A as a non-trivial neighbor. In the trade
matrix, however, it will often be the case that one country, say El Salvador, has another
country, say the United States, as its major trading partner, yet in tern is a relatively small
and trivial trading partner to the other country.
Our measure of democracy is taken from the Polity data. We use the full 21 point institu10
For many Estonians, Finnish television provided a window to the West during the Soviet era.
We modify many of the classifications suggested by Henderson and Tucker. We suggest three new
categories (Caribbean, Jewish, and Pacific Islanders), and reclassify all countries labeled as “other” in their
data.
11
9
tionalized democracy scale suggested by Jaggers and Gurr (1995). Given some specification
of connectivity or dependence we can define the “spatial lag ” or average of y in a state i0 s
connected entities:
(9)
yiw̄ = wi0 y.
Clearly, the bases for each of these three definitions of connectivity matrices are quite
different, and as a result, they may yield very different spatial lag measures for democracy.
However, since culture and trading patterns also are geographically clustered, it could also
be the case that the three different definitions of connectivity will link many of the same
countries. The bivariate correlations in Table 2 indicate that both of these conjectures to
some extent are true. It is clearly the case that all of these measures are positively correlated
with democracy and that they are positively correlated with each other. However, the spatial
lag defined from the trade matrix tends to have much smaller bivariate correlations with the
other variables than is the case for the bivariate correlation between the distance and culture
based spatial lags.
One interesting feature of the trade matrix is that most countries have the bulk of their
trade with large, wealthier countries. These countries more often tend to be democratic. As
a result, for many developing countries, the “spatial lag” or the average democracy in its
trading partners is generally much higher than the country’s own value on the democracy
scale. Whereas the spatial lag of democracy defined over distance and culture has a bimodal
density function, much like the density of the democracy variable itself, the spatial lag of
democracy defined over the trade flow matrix has a single peak, with a mean much higher
than the median (or even mean) of the democracy variable. As such, different measures
clearly stipulate quite different linkages between states.
Variable
Table 2: Bivariate correlations, spatial democracy lags
Democracy L.Democracy(distance) L.Democracy(culture)
L.Democracy(distance)
0.567
L.Democracy(culture)
0.550
0.684
L.Democracy(trade)
0.385
0.248
0.214
Cross-sectional analyses
To explore the importance of spatial linkages between observation in the distribution, we
start by a cross sectional analysis of the most recently available year. For the distance and
culture data, this is 1998. For the trade data measure, we lack data on trade beyond 1996.
The results are shown in Table 3. As can be seen, the OLS results in the second column
suggest a strong positive relationship between GDP per capita and level of democracy.
However, the OLS coefficient estimate of the log of GDP per capita might be biased if
the observations on democracy are spatially clustered. The third column of Table 3 contains
10
Table 3: Democracy and Social Requisites, 1998/1996
Variable
Constant
Ln(GDPPC)
κ (distance)
κ (culture)
κ (trade)
N
R2
OLS,1998
Spatial autoregressive estimates
-21.03 (3.84) -13.01 (3.59) -10.00 (3.35) -20.20 (3.75)
2.81 (0.45)
1.71 (0.43)
1.31 (0.40)
2.29 (0.45)
0.53 (0.08)
0.64 (0.07)
0.53 (0.13)
161
161
161
160
0.19
0.21
0.25
0.14
estimates of a spatially autoregressive model with a lagged term defined by distances.
The first thing to note is that the results in the second column of Table 3 display clear
evidence of spatial clustering. The estimate of κ indicates positive spatial clustering, and the
clustering is statistically significant. Moreover, these results also indicate that the coefficient
estimate and standard error of the OLS models might display substantial bias. The coefficient
estimate is reduced to about 60% of its original size.
The results in the third column of Table 3 pertaining to cultural connections between
states also display clear evidence of clustering. Adjusting for dependence between observations reduces the size of the coefficient estimates for logged GDP per capita to less than half
of its original size.
The results for trade connectivities in the fourth column of Table 3 look substantially
different from the previous two connectivity matrices. Although the estimated κ indicates
significant clustering, the coefficient estimate for ln GDP per capita changes much less relative to the OLS coefficient estimate than was the case for the previous two specifications. The
standard error for κ is much larger than in the previous two specifications. One possibility
might be that the information in the trade defined spatial lag measure is substantially different than the information in the distance/culture based measure. Distance/culture might
reflect the regional dynamics in regional waves of transitions. The trade based measure,
however, may conceivably pick up some of the transitions in developing states that have a
high amount of trade openness, primarily with large and wealthy democracies.
Cross-sectional time-series results
Previous research has shown that factors that seem to be highly associated with differences
democracy in a cross-sectional comparison are not necessarily strong predictors of differences
over time. Political institutions are highly persistent and display little change over short time
periods. In this section, we estimate a model for democracy with pooled annual observations
for the period 1950–1998.
Pooling observations raises questions about whether observations can be assumed to
be independent over time and are likely to display serially correlated residuals. Unlike in
the previous section, where we used the error correction model, here we simply adjoin the
11
lagged dependent variable to the specification. The basic idea is that lagged values of the
dependent variable will incorporate prior influences on the dependent variable and usually
eliminate serial correlation in the residuals. Including the first lag of the dependent variable
also changes the interpretation of the model. In particular, β is the short run impact of a
change in an independent variable on y; βφ is the long run impact, where φ is the coefficient
of the lagged dependent variable. Since φ > .9 is common, this means that the long run
impact may be ten times the short run impact.
The second column of Table 4 contains the results of a standard OLS regression of
democracy on the log of lagged GDP per capita plus its own lag. As can be seen, the
autoregressive parameter is φ = 0.94, which is very close to 1. While this indicates that we
might have to worry about non-stationarity, the small standard error on φ̂ allows us to reject
the hypothesis φ = 1.12
As expected, the estimate of the short run impact of logged GDP per capita on democracy
is greatly reduced from the simple cross-sectional estimate. One way to think about this
is that much of the seeming ability of the log of lagged GDP per capita to account for
differences in democracy between countries is incorporated in the past values of democracy
or the autoregressive term. Alternatively, we note that the long run impact of a change in
logged GDP per capita is almost five, which is larger than the cross-sectional estimate.
Table 4: Democracy
Variable
OLS
Constant
-1.91 (0.27)
Democracyt−1 0.94 (0.00)
Ln(GDPPC)
0.24 (0.03)
κ (distance)
κ (culture)
κ (trade)
N
6242
2
R
0.92
and Social Requisites, 1950–98
Spatial autoregressive estimates
-1.26 (0.28) -1.07 (0.29) -2.11 (0.29)
0.93 (0.00) 0.93 (0.00) 0.94 (0.01)
0.16 (0.03) 0.14 (0.04) 0.23 (0.03)
0.06 (0.01)
0.05 (0.01)
0.05 (0.02)
6242
6242
5873
0.92
0.92
0.92
Columns three to five of Table 4 confirm that the two general results with respect to
spatial associations that we have previously observed also apply for the spatial autoregressive
term in the TSCS model with a lagged dependent variable. The first thing to note is that
the estimate of κ remains statistically significant. Comparing columns three to five with
the OLS estimates in column two reveal that accounting for the spatial connections between
12
The standard error is about .004, so a Dickey-Fuller statistic on the null of non-stationarity is about
25. While this statistic has fatter tails than the usual normal distribution, 25 is well above any critical
value for rejecting the null of stationarity at any level of significance that is as close to zero as one might
like. We also must ensure that the remaining errors show no serial correlation. A Lagrange multiplier test
showed an estimated serial correlation of the errors of -.10 (with a standard error of .01) indicating little
albeit significant remaining serial correlation. Again, we conclude that the the degree of this harm is not
proportion to the harm caused by the small amount of serial correlation of the errors.
12
observations yield a large decrease in the coefficient estimate for the log of lagged GDP per
capita; this decrease is comparable to what we saw in the simple cross-sectional analysis of
Table 3.
Our conjecture is that κ is much smaller for the TSCS data with a lagged dependent
variable than for the simple cross-sectional model because the lagged dependent variable
incorporates all the connectivities in the model, and thus reflects the spatial as well as the
temporal interconnectivities in the data. However, unlike the previous section, accounting
for even the small degree of remaining spatial connectivity in the data yields large changes
in the coefficient estimates.
We have previously noted the technical difficulties in estimating a model with the simultaneous spatial lag. The problem is due to the expected value of the product of the spatial
lag term and the error term being non-zero.13 The consistency of OLS depends on this, and
all simple GLS fix ups require us to start with a consistent estimator.
Imagine instead, however, that we use the spatial lag of the one year lag of the dependent
variable wi0 yt−1 (where yt−1 is the vector of all observations at time t − 1).14 This would
be associated with the errors at t − 1, but not the contemporaneous errors at t. Hence, we
can use OLS to estimate the model if the errors are serially uncorrelated. We report the
results of these estimates in Table 5. Interestingly, these results are relatively similar to
the considerably more complicated spatial autoregressive model estimates in Table 4. These
results suggest that serial correlation in a time-series cross-section can easily be dealt with
if the spatial correlation is not contemporaneous. Little seems to be lost in this application.
This is perhaps not too surprising, given the sticky nature of institutions and the lack of
change in dependent variable. Obviously this is only one example, but we plan to continue
to explore using the temporal lag of the spatial lag in place of the contemporaneous spatial
lag. If this is a good strategy, it allows for much simpler estimation (and hence the ability
to use other tools that are easily used with OLS).
4
Conclusions
Spatial econometric techniques are now starting to be used by political scientists. Because
of the geographic heritage of these models, their primary application has been to incorporate
physical notions of space (distance) into political models, and, particularly, to argue that
13
0 The consistency of OLS depends on E (Wyt ) ε = 0. However,
0 −1
plimN −1 (Wyt ) ε = plimN −1 ε0 W (I − κW) ε.
(10)
(W is the matrix built up from the spatial weighting vectors, wi .) Unless κ = 0, this probability limit will
not equal zero.
14
This is not simply done for econometric simplicity. It seems likely that while democracy in “nearby”
units may have an impact, this is likely to take place with some lag. Political economists, such as Garrett
(1998) and Franzese (2002) have informally used these ideas, including the lagged trade weighted average
of growth in one’s trading partners in a model of economic growth, noting that growth can be imported in
open economics.
13
Table 5: Democracy and Social Requisites, 1951–98,
Variable
OLS estimates
Constant
-1.32 (0.29) -1.03 (0.30) -2.12 (0.29)
Democracyt−1 0.93 (0.005) 0.93 (0.005) 0.94 (0.00)
Ln(GDPPC)
0.17 (0.03) 0.13 (0.04) 0.24 (0.03)
κ (distance)t−1 0.05 (0.01)
κ (culture)t−1
0.06 (0.01)
κ (trade)t−1
0.04 (0.01)
N
6077
6077
5709
2
R
0.92
0.92
0.91
All spatial lags entered with one year temporal lag
geographically nearby units are linked together (or, less usefully, that their error terms are
linked together).
While this approach is highly promising, we think it can be made much more fruitful
if we allow for interconnectivities that go beyond geography. Here we have shown that
spatial econometrics can help deal with dyadic interdependent observations and with notions
of community that flow from the work of Karl Deutsch and related scholars. These are
technically feasible and capable of enriching our econometric models, which all too often
assume that all units are independent of each other.
The results on dyadic interdependence are somewhat negative, in that, at least in our
example, taking dyadic interdependence into account does not have much of an effect on
substantive results. It is, however, useful to know this, since we might have suspected that
the results based on assuming dyadic independence are wrong.
We get larger effects in the Deutschian analysis of democracy; the spatial models used
there also seem more theoretically compelling. Thus at least a third of the effect of being rich
on being democratic appears to be a spatial effect, that is, it is an effect of one’s Deutschian
“neighbors” being democratic.
We are particularly interested in time-series—cross-section data, since so many IR data
sets are of that form. Here, the use of a lagged dependent variable may pick up many of
the features of spatial interdependence, since the lagged dependent variable already contains
such interdependence. But, in our model of democracy, we found that explicitly taking
“space” into account still mattered, even in the presence of a lagged dependent variable.
Finally, we note that for time-series–cross-sectional data, it may make sense to use the
temporal lag of the spatial lag, instead of the contemporaneous spatial lag in the specification.
This makes theoretical sense; nearby units may have an effect, but it is not likely to be
instantaneous. This makes estimation of models with the spatial lag easy, since OLS is
appropriative for such model if the errors are serially uncorrelated. This is easy to test for,
and in practice the errors will be nearly serially uncorrelated if there is a lagged dependent
variable in the model. The advantage of such an approach is that analysts can do many things
in an OLS context that are hard to do with the specialized software of spatial econometrics.
14
But in the end, our message is theoretical, not technical. In international relations and
comparative politics, we would expect units to be affected by the what takes place in other
units. We would expect the connectivity of units to be a function of political and social, as
well as geographic, variables. While the heritage of spatial econometrics is geographic, there
is no reason to limit spatial econometric models to geographic modes of thinking.
15
References
Anselin, Luc. 1988. Spatial Econometrics: Methods and Models. Boston: Kluwer Academic.
Beck, Nathaniel and Jonathan N. Katz. 1995. “What To Do (and Not To Do) with TimeSeries Cross-Section Data.” American Political Science Review 89:634–47.
Beck, Nathaniel and Jonathan N. Katz. 1996. “Nuisance vs. Substance: Specifying and
Estimating Time-Series–Cross-Section Models.” Political Analysis 6:1–36.
Beck, Nathaniel and Jonathan N. Katz. 2001. “Throwing Out the Baby With the Bath
Water: A Comment on Green, Kim and Yoon.” International Organizations 55:487–95.
Davidson, J., D. Hendry, F. Srba and S. Yeo. 1978. “Econometric Modelling of the Aggregate
Time-Series Relationship Between Consumers’ Expenditures and Income in the United
Kingdom.” Economic Journal 88:661–92.
Deutsch, Karl W. 1954. Political Community at the International Level: Problems of Definition and Measurement. Garden City, NY: Doubleday.
Franzese, R. J. 2002. The Political Economy of Macroeconomic Policy in Developed Democracies. New York: Cambridge University Press.
Garrett, Geoffrey. 1998. Partisan Politics in the Global Economy. New York: Cambridge
University Press.
Gleditsch, Kristian S. 2002. All Politics is Local: The Diffusion of Conflict, Integration, and
Democratization. University of Michigan Press.
Gleditsch, Kristian S. and Michael D. Ward. 2000. “Peace and War in Time and Space: The
Role of Democratization.” International Studies Quarterly.
Gleditsch, Kristian S. and Michael D. Ward. 2001. “Measuring Space: A Minimum Distance
Database and Applications to International Studies.” Journal of Peace Research 38:749–
768.
Henderson, Errol and Richars Tucker. 2001. “Clear and Present Strangers: The Clash of
Civilizations and International Conflict.” International Studies Quarterly 45:317–339.
Huntington, Samuel. 1996. The Clash of Civilizations. New York: Simon and Schuster.
Jaggers, Keith and Ted Robert Gurr. 1995. “Tracking Democracy’s Third Wave with the
Polity III Data.” Journal of Peace Research 31:469–82.
Mansfield, Edward D. and Rachel Bronson. 1997. “Alliances, Preferential Trading Arrangements, and International Trade.” American Political Science Review 91:94–107.
16
Morrow, James D., Randolph M. Siverson and Tressa E. Tabaras. 1999. “The Political
Determinants of International Trade: The Major Powers, 1907-90.” American Political
Science Review 92:649–661.
Morrow, James D., Randolph M. Siverson and Tressa E. Tabares. 1990. “Correction to
”The Political Determinants of International Trade”.” American Political Science Review
93:931–3.
17