Socially Optimal Districting: An Empirical Investigation

Preliminary Draft September 2005
Socially Optimal Districting: An Empirical Investigation∗
Abstract
This paper provides an empirical exploration of the potential gains from socially optimal districting.
Districtings, or the allocation of voters of differing ideologies across legislative districts, matter for welfare
because they influence the partisan composition of the legislature and hence policy outcomes. Under a
socially optimal districting, the ideological composition of the legislature is welfare maximizing for any
realization of voter preferences. We develop a methodology for analyzing how actual districting plans
differ from optimal districting plans and for measuring the potential welfare gains that would emerge
from implementing optimal districting. We apply this method to U.S. state legislative elections using
data from the 1990s. We find that the districtings used by the states in our data set lead to electoral
systems that are overly responsive to changes in voters’ preferences. While there is significant variation
across states, the potential welfare gains from optimal districting are on average small relative to the overall
surplus generated by legislatures. This appears to be because state districting plans are reasonably close
to optimal rather than because there are little aggregate gains to be had from varying districting plans.
Interestingly, implementing proportional representation would lead to welfare gains in all states and would
produce welfare levels quite close to those achieved with socially optimal districting.
Stephen Coate
Department of Economics
Cornell University
Ithaca NY 14853
[email protected]
Brian Knight
Department of Economics
Brown University
Providence RI 02906
brian [email protected]
∗ We
are greatly indebted to Jim Snyder for providing the data on state legislative elections used in this study.
1
Introduction
The decennial redrawing of district lines used in electing candidates to federal, state, and local
legislatures in the United States has generated intense interest among voters, politicians, and
parties alike. The intensity of this interest in redistricting, as witnessed through both political
and legal battles, should come as no surprise. Democrats and Republicans are often sharply
divided in their preferences over the choice of public policies. These policies are determined in
part by the representation of political parties in legislatures, which in turn depend upon the
allocation of voters across legislative districts.
This public interest in redistricting has helped to spurn an academic literature analyzing redistricting plans and the redistricting process. In this literature, redistricting plans are typically
characterized by the implied relationship between seats in the legislature and support for parties
among voters. In particular, the seat-vote curve relates the fraction of seats won by the Democratic
party to its support among voters across all districts. Commonly-studied properties of seat-vote
curves include responsiveness, defined as the increase in seats for the Democrats associated with
an increase in voter support across all districts for the Democrats, and partisan bias, defined as
the advantage or disadvantage afforded the Democratic party in terms of seats, relative to the
Republican party, when voters are equally divided between the two parties. Researchers have
developed statistical methods to estimate seat-vote curves and to measure the associated responsiveness and partisan bias parameters. Key questions are then how responsiveness and bias are
altered following the redrawing of district lines and also how different institutional arrangements
for redistricting influence these changes.
While this literature is certainly interesting from a positive perspective, the normative lessons
to be drawn from its findings are unclear. How should voters be allocated across districts? Is
partisan bias necessarily a bad thing from the perspective of voters? What is the optimal degree of
responsiveness? In Coate and Knight (2005), a companion paper, we develop a welfare economics
approach to address these questions from a theoretical perspective. Following the literature, in our
framework districting matters because it determines the equilibrium seat-vote curve. The fraction
of seats held by Democrats determines the ideological make up of the legislature and thereby policy
outcomes and citizen welfare. To understand optimal districting, we first derive the optimal seatvote curve, which describes the welfare-maximizing relationship between Democratic seats and
1
votes. This optimal seat-vote curve is linear with a slope, or responsiveness, that depends on the
degree of variation in the preferences of Independent voters. Furthermore, it is biased in favor
of the party with the largest base. We then show that, under seemingly reasonable conditions,
there exists districtings under which the equilibrium and optimal seat-vote curves coincide. These
districtings are socially optimal districtings because they generate the ideal relationship between
seats and votes.
These theoretical findings raise several natural empirical questions. How close are the equilibrium seat-vote curves associated with real world legislatures to the optimal seat-vote curves
identified via this social welfare approach? Are the conditions under which there exist districtings
that make equilibrium and optimal seat-vote curves coincide satisfied in practice? If so, how large
are the potential welfare gains associated with implementing optimal seat-vote curves? What are
the sources of these welfare gains?
In this companion empirical paper, we begin the development of a methodology that can be
used to answer these and related questions. We first develop an empirical methodology for using
district-specific electoral returns in order to estimate equilibrium seat-vote curves. This approach,
while allowing for a general functional form for the equilibrium seat-vote curve, has micro foundations, and the parameters of the equilibrium seat-vote curve can be related to the parameters
of the theoretical model. Turning to the normative questions, we then develop expressions for the
optimal seat-vote curve and its associated optimal bias and responsiveness parameters; this optimal seat-vote curve can also be identified using data on electoral returns along with information
regarding the fraction of Independent voters across all districts. We then address whether or not
the optimal seat-vote curve can be implemented in practice by developing a sufficient condition
that depends upon the fraction of Independent voters and showing that this condition can be
verified empirically. Finally, we develop expressions for the welfare gains from socially optimal
districting and show that these depend on squared distance between the equilibrium and optimal
seat-vote curves.
We apply this method to U.S. state legislative elections using data from the 1990s. One key
finding is that equilibrium seat-vote curves tend to be overly responsive, relative to optimal seatvote curves, suggesting that, if anything, elections in the U.S. are overly competitive and that there
are too few safe seats from a social welfare perspective. In addition, we find that optimal seatvote curve can be implemented in practice in the sense that there exists districtings that make the
2
equilibrium and optimal seat-vote curves coincide. In general, while there is significant variation
across states, our measures of the potential welfare gains from socially optimal districting are quite
small, at least relative to the overall surplus generated by state legislatures. We argue that this
is because state districting plans are reasonably close to optimal rather than because there are
little aggregate gains to be had from varying districting plans. We also find that implementing
proportional representation would lead to welfare gains in all states and would produce welfare
levels quite close to those achieved with socially optimal districting.
The paper proceeds as follows. Section 2 describes the formal model underlying the analysis
and reviews the key theoretical results. Section 3 presents our empirical methodology for comparing equilibrium and optimal seat-vote curves and evaluating the potential gains from socially
optimal districting. In Section 4, this empirical methodology is applied to data from U.S. state legislative elections during the 1990s. Section 5 compares our approach and results with the existing
districting literature, and Section 6 concludes.
2
Theoretical background1
We consider a state divided into n equally sized districts indexed by i ∈ {1/n, 2/n, ...., 1}. Policymaking is done by a legislature consisting of a representative from each district. Each district
chooses its representative in an election. The policy outcomes chosen by the legislature depend
upon the average ideology of the elected representatives, where ideology is measured on a 0 to 1
scale. In terms of ideologies, citizens can be divided into three groups - Democrats, Republicans,
and Independents. Democrats and Republicans have ideologies 0 and 1, respectively. Independents
have ideologies that are uniformly distributed on the interval [m − τ, m + τ ]. Reflecting the fluid
nature of these voters’ attitudes, the ideology of the median Independent is ex-ante uncertain.
Specifically, m is the realization of a random variable uniformly distributed on [1/2 − ε, 1/2 + ε],
where ε < τ and ε + τ ≤ 1/2. The former assumption guarantees that both candidates receive
support from Independents, while the latter assumption guarantees that the ideologies of the
Independents are always between zero and one. The fraction of voters in district i who are
i
i
, πR
and πIi . Let πD , πR and πI
Democrats, Republicans, and Independents are, respectively, πD
1 This section will be brief, and readers are referred to Coate and Knight (2005) for additional interpretation
and discussion of alternative modeling assumptions.
3
denote, respectively, the fraction of voters in the whole state who are Democrats, Republicans,
and Independents.
Each district must elect a representative. Candidates are put forward by two political parties:
the Democrats and Republicans. Following the citizen-candidate approach, candidates are citizens
and are characterized by their ideologies. Each party must select from the ranks of its membership,
so that the Democrat Party always selects a Democrat and the Republican Party a Republican.
Elections are held simultaneously in each of the n districts and the candidate with the most votes
wins.
The surplus a citizen with ideology x obtains from having a legislature with average ideology
x0 is given by β − γ(x − x0 )2 . The parameter β is the surplus a citizen would obtain from having a
legislature that is perfectly congruent with his own ideology. The parameter γ measures the rate at
which this surplus is dissipated as the ideology of the legislature diverges from the ideology of the
citizen. The ratio γ/β will play an important role in the welfare analysis and can be interpreted
as the fraction of the surplus a partisan (i.e., a Democrat or Republican) obtains from having
a perfectly congruent legislature that is dissipated by having a legislature composed entirely of
the opposition party. We assume throughout that this ratio is bounded between zero and one
(γ/β ∈ [0, 1]).
In each district, every citizen votes sincerely for the representative whose ideology is closest to
his own. If the median Independent has ideology m, the fraction of voters in district i voting for
the Democrat candidate is given by:
i
+ πIi [
Vi (m) = πD
1/2 − (m − τ )
].
2τ
(1)
This consists of all the Democrats and the Independents whose ideologies are less than 1/2.
Similarly, the statewide fraction of voters voting for Democrats is
V (m) = πD + πI [
1/2 − (m − τ )
].
2τ
(2)
Let V and V denote, respectively, the average maximum and minimum Democratic vote shares;
i.e., V = V (1/2 − ε) and V = V (1/2 + ε).
4
2.1
The equilibrium seat-vote curve
We can use the model to derive the equilibrium relationship between seats and average votes.
First, for all V ∈ [V , V ], let m(V ) denote the ideology of the median Independent that would
generate the vote share V ; i.e., m(V ) = V −1 (V ). It is straightforward to verify that
m(V ) =
1
πI + 2πD − 2V
].
+ τ[
2
πI
(3)
Substituting this into the district-specific support, we obtain
i
+ πIi [
Vi (m(V )) = πD
V − πD
].
πI
(4)
Thus, district i elects a Democrat if and only if Vi (m(V )) ≥ 1/2, or, equivalently, if and only if:
V ≥ Vi∗ = πD + πI [
i
1/2 − πD
].
πIi
(5)
Here Vi∗ is the average vote threshold above which district i elects a Democrat candidate. District
i is a safe Democrat (safe Republican) seat if Vi∗ ≤ V (Vi∗ ≥ V ). A seat which is neither a safe
Democrat or Republican seat is competitive.
∗
∗
≤ V2/n
≤ .... ≤ V1∗ . Then, the fraction
Relabelling as necessary, order the districts so that V1/n
of seats the Democrats receive when they have aggregate vote share V is:
S(V ) = max{i : Vi∗ ≤ V }.
(6)
This is the equilibrium seat-vote curve. Its range is the interval [V , V ] and its shape is determined
by the pattern of threshold vote levels (Vi∗ ). These threshold vote levels are in turn determined
by the districting; that is, the allocation of citizen types across districts.
2.2
The optimal seat-vote curve
Consider now the problem of a planner deciding on the number of seats S that should be allocated
to the Democrats when their vote share is V . Aggregate surplus when the median Independent
has ideology m and the Democrats have seat share S is given by:
W (S; m) = β − γ[πD (1 − S)2 + πR S 2 + πI
Z
m+τ
m−τ
(1 − S − x)2
dx
].
2τ
(7)
To avoid tedious integer concerns, assume that the number of districts is very large, so that
we can interpret S as the fraction of seats held by the Democrats and treat the choice set in
5
the optimization problem as the unit interval [0, 1]. It is then straightforward to establish the
following result:
Proposition 1: The optimal seat-vote curve S o : [V , V ] → [0, 1] is given by
S o (V ) = 1/2 + (πD − πR )(1/2 − τ ) + 2τ (V − 1/2).
(8)
Proof: See Appendix.
Thus, the optimal seat-vote curve is linear. Its partisan bias - defined as the advantage or
disadvantage accorded to the Democrats when the two parties split the vote - is (πD −πR )(1/2−τ ).
The system is therefore optimally biased towards the party with the largest partisan base. This
optimal bias reflects the fact that when the legislature is equally divided between Democrats
and Republicans, a marginal change in the composition of the legislature has a greater impact
on partisans’ surplus than on Independents. This in turn reflects the assumption that the loss
experienced by citizens from having a legislature that diverges from their own ideology is a convex
(indeed, quadratic) function of the extent of this divergence.
The responsiveness of the optimal seat—vote curve - defined as the rate at which Democrat
seats increase as their vote share increases - is 2τ . Thus, as the policy preferences of Independent
voters become more diverse, the Democrats’ seat share becomes more responsive to changes in
their aggregate vote. Notice, however, that since τ < 1/2, optimal responsiveness is always less
than 1 meaning that if the Democratic vote share goes up by ∆V , the Democrats share of seats
in the legislature should optimally go up by less than ∆V . To understand this result, note first
that the Democratic seat share will optimally be such as to make average legislator ideology equal
to average ideology in the population at large. Then observe that changes in vote share always
over-estimate changes in average population ideology; that is, if the Democratic vote share goes up
by ∆V , the decrease in average population ideology will be less than ∆V . This is because changes
in vote share are driven by changes in the voting behavior of Independents and such changes can
arise from fairly small shifts in preferences. To illustrate, consider the case in which τ is very
small so that Independents all have basically the same ideology - namely m. Now suppose that m
changes from 0.51 to 0.49 so that all the Independents switch from voting Republican to voting
Democrat. Then there is a massive change in the Democratic vote share but virtually no change
in the average ideology of the population and hence virtually no change in the optimal share of
Democratic seats in the legislature.
6
2.3
The conditions for implementation
The optimal seat-vote curve describes the ideal relationship between seats and votes. It is by no
means obvious that there exist districtings that would actually make the equilibrium and optimal
seat-vote curves coincide. When there do, we say that the optimal seat-vote curve is implementable
and refer to the underlying districtings as socially optimal districtings.
Whether the optimal seat-vote curve is implementable depends upon the fraction of Independents. Two special cases provide intuition for this. First, if there were no Independents, then the
optimal seat-vote curve would be a single point [S o (V ) = πD ] and could be implemented by creating a fraction πR of majority Republican districts and a fraction πD of majority Democrat districts.
On the other hand, if the entire population were Independents, then all districts would necessarily
be identical and the equilibrium seat-vote curve would jump from zero to one at V = 1/2. The
optimal seat-vote curve, by contrast, is linear in this case [S o (V ) = 1/2 + 2τ (V − 1/2)] and is thus
clearly not implementable. In the companion theoretical paper, we establish:
Proposition 2: The optimal seat-vote curve is implementable if and only if
πI [
ε
ε
+ ε − (τ + ε) ln(1 + )] ≤ min(πD , πR ).
2τ
τ
(9)
Proof: See Coate and Knight (2005).
Thus, the fraction of independents must be sufficiently small. The condition does not appear
particularly restrictive since it is necessarily satisfied whenever πI ≤ 2 min(πD , πR ). Moreover,
as shown in Coate and Knight (2005), when the condition is satisfied, there will typically exist a
large class of districtings under which the equilibrium coincides with the optimal seat-vote curve.
This class of optimal districtings typically includes districtings that look quite “straightforward”
in the sense that they do not require patterns of concentration of voter types that would seem
likely to infeasible when account is taken of the geographic constraints that real world districters
face. This nurtures the hope that the optimal seat-vote curve may be an attainable benchmark.
2.4
The welfare gains to socially optimal districting
Assuming that the optimal seat-vote curve is implementable, what would be the welfare gains
from moving to a socially optimal districting? We address this question next. While aggregate
surplus has previously been expressed conditional on the aggregate vote share V, which can be
7
interpreted as welfare at a single point on the seat-vote curve, we require a measure of expected
welfare over the entire seat-vote curve in order to rank alternative curves from a social welfare
perspective. This expected welfare is defined by:
E[W (S(V ))] = β−γ
Z
V
[πD (1−S(V ))2 +πR S(V )2 +πI
V
Z
m(V )+τ
(1−S(V )−x)2
m(V )−τ
dV
dx
](
). (10)
2τ V − V
The welfare gain from socially optimal districting is therefore given by
G = E[W (S o (V ))] − E[W (S(V ))].
(11)
This welfare gain turns out to be proportional to the squared distance between the optimal and
equilibrium seat-vote curves. To see this, note first that expected citizen welfare can be expressed
as a function of the equilibrium and optimal seat-vote curves:
Lemma: It is the case that
E[W (S(V ))] = β − γ{c + E[S(V )2 ] − 2E[S(V )S o (V )]}
(12)
where c is a constant given by c = πD + πI [1/4 + ε2 /3 + τ 2 /3].
Proof: See Appendix.
Using this formula to compute the welfare gain immediately establishes:
Proposition 3: The welfare gain from socially optimal districting can be written as
G = γE[(S o (V ) − S(V ))2 ].
(13)
Intuitively, the smaller the degree of similarity between the optimal and equilibrium seat-vote
curves, the larger are the welfare gains associated with socially optimal districting.
3
Empirical methodology
We can now describe our methodology for comparing actual and optimal districting plans and
evaluating the welfare gains from socially optimal districting. We develop empirical counterparts to
each of the four theoretical concepts described above: the equilibrium seat-vote curve, the optimal
seat-vote curve, the conditions for implementing the optimal seat-vote curve, and the welfare gains
to socially optimal districting. In developing this methodology, we consider a researcher with
8
access to data on district-specific electoral returns, as well as the state-wide fraction of voters who
identify as Independents.2
3.1
Estimating the equilibrium seat-vote curve
As explained in Section 2.1, the equilibrium seat-vote curve is determined by the range of possible
aggregate Democratic vote shares [V , V ] and the pattern of district-specific threshold vote levels
(Vi∗ ). We will show that both the range of vote levels and the vote thresholds can be expressed
as a function of the first two moments of the district-specific and statewide voting distributions.
This then permits the estimation of the equilibrium seat-vote curve by simply estimating these
moments.
The first step in establishing this is to provide expressions for the moments of the voting
distribution. Beginning with the district-specific moments, recall from the theoretical model that
the fraction of votes received by the Democrat candidate in district i is given by:
i
+ πIi [
Vi = πD
1/2 − (m − τ )
].
2τ
(14)
Given that m is uniformly distributed on [1/2 − ε, 1/2 + ε], the mean and standard deviation of
votes in district i are
i
i
i
+ 12 πIi = 12 [1 + πD
− πR
]
µi = E(Vi ) = πD
q
p
πIi2 var(m)
πi ε
σi = V ar(Vi ) =
= 2√I3τ
4τ 2
where the second expression uses the fact that var(m) = ε2 /3.3
(15)
Observe that the district
expected vote share (µi ) reflects a partisan advantage for the Democratic candidate (µi > 1/2) if
i
i
i
i
> πR
but for the Republican candidate (µi < 1/2) if πD
< πR
. The standard deviation of vote
πD
shares is proportional to the fraction of Independents. It is increasing in the degree to which the
median Independent shifts support between the two candidates from election to election (ε) but
is decreasing in the diversity of preferences among Independents (τ ). This latter property reflects
the fact that a small swing in preferences of the median Independent towards the Democrats
2 Data on voter identification are typically available from a variety of sources, such as survey data or, in some
states, voter registration rolls.
3 More generally, a random variable X distributed uniformly on the interval [a, b] has a mean equal to (a + b)/2
and a variance equal to (b − a)2 /12.
9
translates into a large shift in support for the candidate if the distribution of the preferences of
Independents is tight but a small shift if this distribution is disperse.
Turning to the statewide moments, we take cross-district averages of the district-specific means
and standard deviations to obtain:
µ = E(V ) = πD + 12 πI = 12 [1 + πD − πR ]
.
q 2
p
πI var(m)
π
Iε
√
σ = V ar(V ) =
=
4τ 2
2 3τ
(16)
The statewide expected Democratic vote is independent of the fraction of Independents, reflecting
the fact that these voters favor neither party ex-ante. The variance is again proportional to the
fraction of Independents, reflecting the fact that there is no uncertainty in how partisans vote.
Using the state-wide moments and equation (2), we can now write the maximum and minimum
aggregate Democratic vote shares as
V = V (1/2 + ε) = µ −
√
3σ
(17)
√
3σ.
(18)
and
V = V (1/2 − ε) = µ +
Moreover, using the district-specific and statewide moments and the definition of Vi∗ in equation
(5), we can write the vote threshold for electing a Democratic candidate in district i as follows:
Vi∗ = µ +
σ
(1/2 − µi ).
σi
(19)
These results permit the researcher to estimate the equilibrium seat-vote curve by estimating
the means and standard deviations of the Democratic vote share in each district and the associated
bi )1i=1/n
statewide mean and standard deviation. Suppose that these estimates are given by (b
µi , σ
and (b
µ, σ
b). Relabelling the districts so that
σ
b
σ
b
σ
b
(1/2 − µ
b1/n ) ≤ µ
b+
(1/2 − µ
b2/n ) ≤ .... ≤ µ
b + (1/2 − µ
b1 ),
σ
b1/n
σ
b2/n
σ
b1
√
√
the estimated equilibrium seat-vote curve is the function defined on the interval [b
µ − 3b
σ, µ
b + 3b
σ]
µ
b+
given by:
S(V ) = max{i : µ
b+
σ
b
(1/2 − µ
bi ) ≤ V }.
σ
bi
10
(20)
3.2
Estimating the optimal seat-vote curve
Turning to the normative component of the empirical analysis, we next develop measures of the
optimal seat-vote curve, thereby allowing for comparison with the equilibrium seat-vote curve.
Using the fact that πD − πR = 2µ − 1, we can re-write the optimal seat-vote curve described in
Proposition 1 as follows:
S o (V ) = 1/2 + (2µ − 1)(1/2 − τ ) + 2τ (V − 1/2).
(21)
Observe that the optimal seat-vote curve and, in particular, its responsiveness and partisan
bias parameters depend critically upon the diversity of preferences among Independents (τ ). Unfortunately, this preference parameter is not directly identified in the voting data. To see this,
recall that the statewide standard deviation of votes for the Democratic candidate can be written
as follows:
πI ε
σ= √ .
2 3τ
(22)
Thus, with only information on σ, we cannot separately identify the key underlying parameters
πI , ε, and τ . Intuitively, large swings in the Democratic vote share within a state could be due to a
large fraction of Independents (πI ), large swings in the preferences of the median Independent (ε),
or a tight distribution of ideology among Independents (τ ), in which case relatively small swings in
the preferences of the median Independent translate into relatively large swings in voting outcomes.
While we cannot separately identify these preference parameters, it is possible to identify their
ratio (α = ε/τ ), with data on the statewide fraction of Independents (πI ) and the statewide
standard deviation of the voting distribution (σ) as follows:
α=
√
2 3σ
ε
.
=
τ
πI
(23)
Further, using the theoretical restriction on the sum of these preference parameters + τ ≤ 1/2,
we can place an upper bound on the dispersion in the preferences of Independents:
τ≤
1
,
2(1 + α)
(24)
and, we can in turn place an upper bound on the optimal degree of responsiveness [ro = 2τ <
1/(1 + α)]. In the baseline analysis of the empirical application to follow, we assume that the
11
optimal degree of responsiveness equals this upper bound. In addition, as a robustness check, we
allow the optimal responsiveness to fall in a range below this upper bound.
3.3
Verifying the conditions for implementation
The necessary and sufficient condition for implementability presented in Proposition 2 can not
be verified directly without information on the underlying preference parameters (ε, τ ). As just
noted however, with outside information on the fraction of Independents, we can identify the
ratio α = ε/τ . We can use information on this ratio to place an upper bound on the coefficient
associated with the implementability of the optimal seat-vote curve. In particular, we can show
that
(
ε
ε
+ ε − (τ + ε) ln(1 + )) ≤ α/2,
2τ
τ
(25)
and this implies that a sufficient condition for implementability is:
πI ≤
2 min(πD , πR )
.
α
(26)
Clearly as the ratio α approaches unity, this sufficient condition converges to πI ≤ 2 min(πD , πR ),
which is just that given in Section 2.3. For any given α < 1, however, this ratio provides additional
information to the researcher attempting to verify the conditions for implementation. Substituting
in the expression for α from equation (23) and using the fact that µ = πD + πI /2, this sufficient
condition can be re-written as
πI ≤ 2 min(µ −
√
√
3σ, 1 − µ − 3σ).
(27)
Thus, the fraction of Independents must be below a critical value, the calculation of which only
requires information on the statewide mean and standard deviation of the Democratic vote share.
3.4
Estimating the welfare gains from socially optimal districting
Given that we only observe voting outcomes, which do not reveal the intensity of voter preferences
for one party over another, the parameters of the surplus expression (β, γ) in equation (12) are
clearly not identified in the empirical analysis. We can, however, use the theoretical restriction on
the ratio of these parameters in order to calculate the range of proportionate welfare gains.
To this end, first note that the percentage increase in aggregate welfare from socially optimal
districting can be written as follows:
12
∆G =
E[W (S o (V ))] − E[W (S(V ))]
E[W (S(V ))]
(28)
Then, using the expressions from equations (12) and (13) and dividing through by β, we have
that:
γ
o
2
γ
β E[(S (V ) − S(V )) ]
∆G( ) =
β
1 − βγ {c + E[S(V )2 ] − 2E[S(V )S o (V )]}
(29)
Recall from the theoretical section that the ratio γ/β is the fraction of the surplus a partisan
obtains from having a perfectly congruent legislature that is dissipated by having a legislature
composed entirely of the opposition party. When parties are not that polarized in terms of their
underlying ideologies or when the legislature is responsible for choosing only policies on which
there is little disagreement across ideologies (for example, spending on public safety and highway
maintenance) this ratio may be close to zero. When parties are polarized and are choosing policies
on which there is strong ideological disagreement (such as the level of transfer payments for the
poor or the regulation of abortion), this ratio may be close to one. In the former case, districting
is not very important, while in the latter case it is crucial to citizen welfare. Using the restriction
that γ/β ∈ [0, 1], we can thus bound these proportionate welfare gains as follows:
γ
0 ≤ ∆G( ) ≤ ∆G(1).
β
(30)
Because the upper bound will only be relevant for legislatures in states in which parties are
polarized and which choose policies on which there is strong disagreement, we will provide welfare
calculations for different values of this key ratio (γ/β) in the empirical application to follow.
We next turn to the measurement of this upper bound. Inserting equation (21) into equation
(29), we have that
( βγ ){(ξ o )2 + 2ξ o ro µ + (ro )2 (µ2 + σ 2 ) − 2E[(ξ o + ro V )S(V )] + E[S(V )2 ]}
γ
∆G( ) =
β
1 − ( βγ ){c + E[S(V )2 ] − 2ξ o E[S(V )] − 2ro E[V S(V )]}
(31)
where ro = 2τ represents optimal responsiveness and ξ o = µ(1 − 2τ ) represents the vertical
intercept of the optimal seat-vote curve.
Notice that we can express the constant as c = µ +
√
2 3τ σ 2
[ε /3
ε
+ τ 2 /3 − 1/4], and hence,
given a particular value of the ratio γ/β and the parameter τ , this expression can be evaluated by
computing three moments of the equilibrium seat-vote curve: E[S(V )], E[S(V )2 ], and E[V S(V )].
13
We next show that these three expectations can be expressed as functions of the district-specific
vote thresholds. To this end, let the associated range of average votes be [V , V ] and the districtspecific vote thresholds be (Vi∗ ), and let i be the smallest district such that Vi∗ ≥ V and i be the
largest district such that Vi∗ ≤ V .4 Then, using straightforward area calculations, we can convert
the three key expectations into functions of the district-specific vote threshholds as follows:
i+1/n
E[S(V )] =
X
i=i
(i − 1/n)
∆Vi∗
,
V −V
i+1/n
E[S(V )2 ] =
X
(32)
∆Vi∗
,
V −V
(33)
X 1
∆Vi∗2
(i − 1/n)
2
V −V
i=i
(34)
i=i
(i − 1/n)2
and
i+1/n
E[V S(V )] =
∗
∗
= V − Vi∗ , and ∆Vi∗ = Vi+1/n
− Vi∗ for i = i + 1/n, ...., i.
where ∆Vi∗ = Vi∗ − V , ∆Vi+1/n
4
Application to U.S. state legislatures
To illustrate the approach, we study redistricting plans for elections to U.S. state legislatures. We
prefer this setting over the U.S. House because, in federal redistricting, state officials control the
redistricting process and each redistricting plan thus only partially contributes to the resulting
allocation of national seats across parties in Congress. Redistricting plans for state legislatures are
also controlled by state officials, and redistricting plans thus perfectly correspond to changes in
seats in state legislatures.
In the empirical application, we examine state districting plans put in place during the early
1990s. For consistency with the theoretical framework, we focus on states with single-member
districts; many states elect multiple members from each district to state legislatures. In addition,
given the bicameral nature of state legislatures, we follow the existing empirical literature on
redistricting and focus on the elections to the lower house. As shown in Table 1, we have complete
data for 28 states that elect representatives from single-member districts, and most of these states
adopted redistricting plans in 1992 and then again in 2002; for these states, we thus examine state
4
As pointed out in Coate and Knight (2005), any seat-vote curve S(V ) may be equivalently described by a triple
{i, i, (Vi∗ )ii=i } - the so-called inverse seat-vote curve. The interpretation is that districts i = 1/n, .., i − 1/n are safe
Democrat; i = i + 1/n, .., 1 are safe Republican and Vi∗ is the vote threshold for competitive district i ∈ {i, .., i}.
14
legislative elections held in 1992, 1994, 1996, 1998, and 2000. States deviating from this pattern
of elections include Virginia, which has elections in odd years and adopted redistricting plans in
1991 and 2001, and Colorado, whose district lines were redrawn in 1998 following litigation over
the representation of minority groups in the state legislature.
4.1
Data sources
Recall from the empirical methodology section that our approach requires data on district-specific
voting returns and the statewide fraction of voters identifying as Independents. We use data from
Ansolabehere and Snyder (2002) on state legislative election returns, focusing only on returns
for the two major party candidates and computing support for the Democratic candidate as the
fraction of the two-party vote. We also use estimates of the fraction of Independent voters in each
state. These data are derived from annual New York Times surveys in which voters are asked to
self-identify as a Republican, Democrat, or Independent.5 In order to compute the time-invariant
fraction of Independents for each state, we take averages across the years listed in Table 1.
4.2
Estimating the moments of the voting distributions
Consider some legislative district i in a particular state. Suppose that the state has legislative
elections in 1992, 1994, 1996, 1998, and 2000. Assume that all these elections are contested in
district i and let the vote share received by the Democratic candidate in year t be Vit . Then, we
form our estimates of µi and σi by just taking the mean and standard deviation of these vote
5
These data were downloaded from the website http://php.indiana.edu/˜wright1/cbs7603 pct.zip.
15
shares; that is,6
P
µ
bi = t V5it
.
q
P (Vit −b
µi )2
σ
bi =
t
5
(35)
A problem that arises with this procedure is that many elections are in fact uncontested. In
our data, roughly 30 percent of state legislative elections are uncontested, and the support for the
Democratic candidate is thus censored for these elections. One potential solution to this problem
would be to simply assume that uncontested Democratic victories correspond to safe Democratic
seats and that uncontested Republican victories correspond to safe Republican seats. After all,
it is only the vote thresholds for competitive districts that matter for the seat-vote curve. The
primary drawback of this approach is that both the equilibrium and optimal seat-vote curves
depend upon the statewide moments (µ, σ), which require explicit measures of the district-specific
moments (µi , σi ) for both contested and uncontested districts.
To get around this problem, for districts with at least two contested races, we simply estimate
these two moments using voting data from these contested elections; that is, we use the formulas
in equation (35), but with a smaller number of elections. For the 25 percent of districts that have
either zero or only one contested race, we predict the district-specific moments by first regressing
the mean and standard deviation of vote shares in contested districts on voter characteristics taken
from Census data. These data, which are published in Barone et al (1998) are available for state
legislative districts and include the fraction of residents living in urban areas, the fraction living
in suburban areas, household income, percent of residents with a college degree, percent over age
65, percent African American, and percent Hispanic.
To implement this prediction approach, we assume that the mean Democratic vote share in
6 An obvious limitation of this approach is that because of the limited time dimension of the panel, our estimates
of the district-specific moments will be noisy. As an alternative approach, which we leave for future work, one
could estimate statistical models in which the moments of the voting distribution are parameterized in terms of
observed and unobserved district characteristics; the number of parameters to be estimated is smaller and hence
estimates of district-specific moments should entail less noise in this case. For example, using the definitions
of
√
the district-specific moments, we can re-write the district-specific vote as Vi = µi + σi w,where w = 3 (1/2 − m)
and is distributed uniform with mean 0 and variance 1. Adding a time dimension (t) to the model, we have that
Vit = µi + σi wt . Then, we parameterize the mean (µi = Xi0 β + ηi ) and variance (σi2 = exp(Xi0 γ + υi )) of the
voting distribution, where (ηi ,υi ) are district-specific random effects, which are normally distributed with variances
equal to ση2 and σν2 , respectively. Substituting in these parameterizations, we have that Vit = Xi0 β + ηi + uit ,
p
exp(Xi0 γ + υi )wt . Using standard panel data estimation techniques, the researcher could then
where uit =
estimate the parameters of the model (β, γ,ση2 and σν2 ). In this framework, the district-specific moments, and
hence the equilibrium seat-vote curve, are random from the econometrician’s point of view. Simulating the model
by drawing random effects from the normal distribution, the researcher could then make statements about features
of the distribution of the seat-vote curve and the associated welfare gains from socially optimal districting.
16
district i takes the following form, which restricts the value to be between zero and one:
µi =
exp(Xi0 β)
1 + exp(Xi0 β)
(36)
where Xi are voter characteristics and β represents parameters to be estimated. For estimation
purposes, this equation can be written in log-odds form:
ln(µi /1 − µi ) = Xi0 β
(37)
Similarly, we parameterize the standard deviation of the voting distribution as follows:
ln(σi ) = Xi0 γ
(38)
In order to increase the power associated with this prediction approach, we run this regression
across all contested districts in the United States, implicitly assuming that the mapping from
observed characteristics into partisan affiliation is homogeneous across states. While this assumption is somewhat restrictive, we do include a set of state dummy variables, thereby allowing two
districts with identical observable characteristics but in different states to have different fractions
of voter types.
Table 2 provides the results from these two prediction equations. As shown in column 1,
the partisan advantage for the Democratic party (µi ) in contested districts is increasing in the
percent urban and suburban (both of these are relative to the omitted category - percent rural),
percent with a college degree, percent over age 65, percent African American, and percent Hispanic
but is decreasing household income. Interestingly, the coefficients from the regression of these
covariates on the standard deviation of the voting distribution, which is proportional to fraction
of Independent votes (µi ), are mostly reversed. As shown in column 2, the likelihood of being an
Independent voter is decreasing in all of the observable measures.
These coefficients and state-specific constant terms, which are not reported in Table 2, are
then used in order to predict these two moments in uncontested districts. With our measured
district-specific moments in contested districts and predicted moments in uncontested districts,
we can now estimate the statewide moments in a straightforward manner:
n
µ
b =
1X
µ
bi
n i=1
17
(39)
n
σ
b
=
1X
σ
bi
n i=1
With estimates of the district-specific and statewide voting moments in hand, we proceed to
estimation of the equilibrium seat-vote curve.
4.3
Equilibrium seat-vote curves
Figure 1 plots the equilibrium seat-vote curve as estimated for each state in the sample. As shown,
the range of possible statewide support for the Democratic Party seems reasonable. For example,
in New York, typically considered a heavily Democratic state, the party receives support on the
range [0.57,0.72], while in Utah, typically considered a heavily Republican state, the Democrats
receive support on the range [0.34,0.51]. Notice also that the equilibrium seat-vote curve is close
to linear in some states, such as Mississippi, New York, and Ohio, while it has important nonlinearities in other states, such as Delaware, Florida, and Montana. These differences in the degree
of non-linearity across states highlight the importance of allowing for a general functional form in
estimating equilibrium seat-vote curves.
Column 1 of Table 3 reports a measure of the responsiveness of each of these equilibrium seatvote curves. This measure is obtained by computing the slope of the linear seat-vote curve that
best approximates the equilibrium seat-vote curve.7
Notice that in every state, responsiveness
exceeds 1, and, in the vast majority of states, it exceeds 2.
Column 3 of Table 3 reports the partisan bias associated with the equilibrium seat-vote curves.
This is measured by S(1/2) − 1/2 and is the difference between the fraction of seats that the
Democrats hold when they get half the statewide vote and 1/2. Notice that in the four states
for which V = 1/2 does not lie in the range of possible Democratic vote shares, this measure
cannot be computed. The interesting thing to note is that in all but three states, the equilibrium
seat-vote curve is biased towards the Republicans. The average bias of 4.27 percent implies that,
when voters are equally split, Republicans would secure 54.27 of the seats on average, relative to
45.73 percent for Democrats, and would thus hold a significant advantage of 8.54 percent in the
legislature.
7 To be more precise, define the linearized equilibrium seat-vote curve to be S l (V ) = 1/2 + b + r(V − 1/2), where
b and r are chosen in order to minimize the expected square distance between the equilibrium and the linearized
equilibrium seat-vote curves. That is: r =cov[V, S(V )]/var(V ) and b = E[S(V )] − 1/2 − r[E(V ) − 1/2]. Then our
measure of responsiveness is r.
18
4.4
Optimal seat-vote curves
For comparison purposes, Figure 1 also includes the optimal seat-vote curves. These are plotted
under the assumption that responsiveness, 2τ , is at its maximal level 1/(1 + α) in each state
(see equation (24)). It is apparent from the Figures, that the equilibrium seat-vote curve is more
responsiveness than the optimal seat-vote curve in all cases, suggesting that, if anything, U.S.
state legislative elections are too competitive and that there are too few safe seats from a social
welfare perspective! Column 2 of Table 3 reports the responsiveness of the optimal seat-vote
curve. As noted in the theoretical section, optimal responsiveness is always below one, while the
responsiveness associated with the equilibrium seat-vote curves substantially exceeds one in all
cases.
Column 4 of Table 3 reports the partisan bias associated with the optimal seat-vote curve,
which is defined as S o (1/2) − 1/2. It is notable that in all but two states the bias associated with
the equilibrium seat-vote curve is less than that associated with the optimal seat-vote curve. Thus,
the partisan bias towards the Republicans exhibited in the equilibrium seat-vote curves, cannot
be justified as “optimal partisan bias”. Indeed, in twelve states, the equilibrium seat-vote curve
is biased towards the Republicans, when it should optimally be biased towards the Democrats!
A reasonable objection to this comparison of partisan bias under the optimal and equilibrium
seat-vote curves is that it just tells us about the properties of the curves at the vote share V =
1/2. For a more global comparison, we computed the expected Democratic seat share (E(S(V )))
under the equilibrium and optimal seat-vote curves (E(S(V )) and E(S o (V ))). Column 6 of Table
3 reports the difference in this expected seat-share under the equilibrium and optimal curves.
When this difference is positive, the expected Democratic seat share is higher than optimal. The
interesting thing to note is that, in this expected seat sense, there appears to be no obvious bias in
one or the other direction. In almost half the states, the expected Democratic seat share is higher
than optimal and the average difference is very close to zero. This makes us hesitant to draw any
strong conclusions concerning the general direction of bias in U.S. state legislative elections.
4.5
Verifying the conditions for implementation
As shown in Table 4, the conditions for implementation, the restriction that the fraction of Independents must be below a maximal level is satisfied in every state. The state closest to not
satisfying the requirements is Rhode Island, which is reported to have 51% Independents, just
19
slightly below the maximal level of 55%. The next closest state is Connecticut, which is reported
to have 42% Independents, relative to the maximal level of 65%. In summary, these results
demonstrate that the conditions for implementability of the optimal seat-vote curve are indeed
permissive, being satisfied in all of 28 states included in our analysis and by a large margin in all
cases except for the state of Rhode Island.8
4.6
Welfare gains
Given that the optimal seat-vote curve is implementable in all states, we next turn to measuring
the welfare gains associated with socially optimal districting. To begin with, we compute the
percentage welfare gains under the assumptions that the ratio γ/β is at its maximal level (i.e., 1)
and that optimal responsiveness is at its maximal level in each state (i.e., 1/(1 + α)). As shown
in Table 5, the percentage welfare gains to socially optimal districting vary from just 0.31% in
Pennsylvania to 6.5% in Rhode Island. A visual comparison of the equilibrium and optimal seatvote curves in Figure 1 supports these welfare calculations as the optimal and equilibrium seat-vote
curves are similar in states with low potential welfare gains but quite different in states with large
potential welfare gains.9
On average, states could increase welfare by roughly 2%, suggesting
that the gains from socially optimal districting are typically small relative to the surplus that is
generated by state legislatures. Of course, if this surplus is itself very large, then these gains could
be quite large in monetary terms. But without further assumptions on the underlying welfare
parameters β and γ, these percentage gains in welfare cannot be converted into monetary terms.
For a sense of how these results depend on the specific assumptions about the parameters
γ/β and 2τ , Table 6 reports the average welfare gains associated with socially optimal districting
8 Note that in evaluating the conditions for implementation, we have not used external information on the
statewide fraction of Democrats and Republican. It is comforting to note, however, that the implied fraction of
partisans from voting behavior are highly correlated with the fraction of voters self-reporting as partisans. Using
the fact that the implied statewide fraction of Democrats and Republicans are given by πD = µ − πI /2 and
πR = 1 − µ − πI /2, we calculate correlation between the implied and reported fraction of voters of roughly 0.8 for
both Republicans and Democrats.
9 While a welfare comparison of the optimal seat-vote curve, which is continuous, and the measured seat-vote
curve, which is a discrete step-function, is somewhat artificial, we do so in order to apply the results from the
companion theoretical paper, which assumed a continuous seat-vote curve, in order to verify that the optimal seatvote curve is implementable. To provide a sense of the error associated with this approximation, we have derived a
discrete optimal seat-vote curve, which is a step-function approximation of the continuous optimal seat-vote curve.
Welfare associated with this discrete optimal seat-vote curve is very similar to welfare under the continuous optimal
seat-vote curve, and the approximation error associated with the use of a continuous optimal seat-vote curve is
thus small in practice. The small size of this error should not be surprising given that the discrete and continuous
optimal seat-vote curve converge as the number of districts grows large. As shown in table 1, states have a large
number of districts, averaging over 100 districts and ranging from 41 in Delaware to 203 in Pennsylvania.
20
arising under different parameter values. We allow the ratio γ/β to vary from 0.25 to 1 and allow
2τ to vary from 0.25/(1 + α) to 1/(1 + α) in each state. The notation η refers to the numerator
in the ratio η/(1 + α), so that the case in which 2τ is set equal to 0.25/(1 + α) in each state
corresponds to η = 0.25; the case in which 2τ is set equal to 0.5/(1 + α) in each state corresponds
to η = 0.5; etc. As shown, holding the ratio γ/β constant, the welfare gains to socially optimal
districting, averaged across all states, are uniformly increasing as the parameter η is reduced.
This pattern reflects the fact that reductions in the parameter η are associated with reductions
in optimal responsiveness, which, as shown in Table 3, was already below the responsiveness
associated with the equilibrium seat-vote curve in the baseline analysis. Holding the parameter η
constant and reducing the ratio γ/β, we have that welfare is uniformly decreasing. As explained
above, districting matters less for welfare as this ratio decreases, reflecting the fact that the conflict
between parties over the available policy choices is less severe.
Perhaps the most noteworthy lesson to be drawn from Tables 5 and 6 is that the welfare
gains from socially optimal districting are relatively small as a proportion of the total surplus
generated by state legislatures. In principle, there may be two reasons for this. The first is that
the districting plans that states actually implement are relatively close to optimal plans. The
second is that, because of the diverse ideological make up of the U.S. states, the aggregate gains
that can be obtained by tinkering with districting plans are relatively small. To get a feel for
which of these views is correct, we have computed the proportionate welfare gains that would
arise from implementing the optimal seat-vote curve over the welfare minimizing seat-vote curve.
We take the latter to be the seat-vote curve that makes the seat share of the party with the
smallest partisan base equal to 1.10
The idea is that if these gains are small, then the second
view is correct. Table 7 reports the results under the same parametric assumptions that underlie
Table 5. They strongly suggest that it is the first view that is correct: the states seem to be doing
districting in a way that is generating seat-vote curves that are relatively close to optimal.
A further interesting benchmark of comparison is the seat-vote curve that would be generated
by a proportional representation electoral system. Under such a system, seats are allocated in
strict proportion to statewide votes and hence the implied PR seat-vote curve is S(V ) = V .
As shown in Column 1 of Table 8, all states would experience a welfare gain by implementing
10 An important objection to this, is that such a seat-vote curve may not be implementable. We agree with this
objection, but time constraints have prevented the computation of the welfare minimizing implementable seat-vote
curve.
21
the PR seat-vote curve, relative to the welfare level experienced under the equilibrium seat-vote
curve. This result in part reflects the fact that the PR seat-vote curve has responsiveness of 1,
while responsiveness associated with the equilibrium seat-vote exceeds 1 in all cases. As noted
previously, optimal responsiveness is less than 1. Moreover, the PR seat-vote curve is linear, a
feature shared by the optimal seat-vote curve. Perhaps more surprisingly, however, the additional
gains from implementing the optimal seat-vote curve, relative to PR, are very small. As shown
in Column 2 of Table 8, the welfare gains to optimal districting from PR vary between 0.01 and
0.11 percent.
5
Relation to the literature
As noted in the introduction, there is a long history of estimating seat-vote curves, the key
construct in our empirical analysis, in the existing literature. Early studies used the uniform
partisan swing model, first developed by Butler (1951); this method uses district-specific voting
returns and then uncovers a seat-vote curve by simulating how the distribution of seats changes
as national support for the Democrats increases and decreases, implicitly assuming that districtspecific votes move uniformly. For example, if national support for Democratic candidates was
recorded at 55 percent, consider an increase in national support to 60 percent. The approach
assumes that support for Democratic candidates in each district would increase by 5 percentage
points so that seats won by Republicans with between 45 and 50 percent of the recorded vote
would be predicted to flip to the Democrats. This approach was later critiqued for both ignoring
district-specific factors and for not restricting district vote shares to be between zero and one
(King, 1989).
Other early studies, such as Tufte (1973), posit a linear seat-vote curve and estimate the
intercept and slope of this curve by regressing the number of seats for Democrats in a state on
their share of the vote across all districts. One critique of this approach is that the regression line
is not constrained to pass through the points (0,0) and (1,1). To address this concern, King and
Browning (1987) develop a non-linear bilogit model:
S(V ) =
1
.
V
1 + exp(−b − r ln( 1−V
))
(40)
This curve is characterized by the bias parameter (b) and responsiveness parameter (r) and, in
22
contrast to the linear seat-vote curve, is constrained to pass through the points (0,0) and (1,1).
Gelman and King (1990, 1994) advance the literature by developing an approach that does
not rely on particular functional form assumptions. They instead begin with an assumption over
the statistical distribution of the vote-generating process and then estimate the parameters of this
distribution. Using these parameter estimates, they simulate how seats change given hypothetical
changes in support for parties. In their initial paper, they posit a normal distribution for underlying
support for the Democratic party (ui ∼ N (αi , σ 2 ) in district i, where this underlying support
is related to votes (Vi ) according to Vi = exp(ui )/(1 + exp(ui )). Importantly, the variance of
underlying support is assumed to be constant across districts. The authors then make assumptions
over the distribution over the district effect αi and estimate the parameters of that distribution.
In a follow-up paper, Gelman and King (1994) incorporate observable candidate characteristics
(Xi ), such as incumbency status and party control of the district, and estimate the parameters of
the following regression equation:
Vi = Xi β + ui .
(41)
where ui ∼ N (0, σ 2 ). Again, the variance in support for the Democratic party is assumed to be
constant across districts.
Given our goal of evaluating districting plans from a welfare perspective, our approach begins
with underlying citizen preferences over policies, and hence candidates, rather than a functional
form for the seat-vote curve or an assumed vote-generating process. These preferences and associated voting behavior in turn lead to a vote-generating process and ultimately a seat-vote curve.
Similarly to Butler, our approach does not incorporate a district-specific preference shock. It is
important to note, however, that our approach assumes that only preferences, and not necessarily
votes, swing in a uniform manner. As shown in equation (1), these preference shocks translate
into vote shares for the Democrat candidate heterogeneously and in proportion to the fraction of
Independent voters. Moreover, our approach, unlike Butler’s, restricts district-specific vote shares
to be between 0 and 1, as can also be seen in equation (1). Relative to the work of Gelman and
King (1990, 1994), there are two key differences in our approach. First, while our approach allows
for the variance of the vote-generating process to vary across districts, their approach assumes
that this variance is constant across districts; in the context of our model, this assumption would
require that all districts have an equal fraction of Independent voters. Second, in explaining differences in voting patterns across districts, Gelman and King (1994) rely on observable candidate
23
characteristics; this approach is inappropriate for our framework, in which candidates within a
party are homogenous across districts and thus differences in voting outcomes across districts can
be generated only by differences in voter preferences. Developing an approach that incorporates
both candidate and voter characteristics would be desirable, and while beyond the scope of the
current analysis, we hope to incorporate such heterogeneity in candidate characteristics, such as
incumbency and race, in future extensions of this methodology.
The only other work that we are aware of that explores the micro-foundations of seat-vote
curves and their significance for welfare is the independent work of Besley and Preston (2005).
They develop an alternative micro-founded model that generates an equilibrium relationship between seats and votes. They use their model to solve for what the distribution of voter types
must be across districts if the equilibrium seat-vote curve is to be of the bilogit form. Their main
theoretical point is to show that this distribution, and hence the shape of the seat-vote curve, is
a key determinant of parties’ electoral incentives to put in effort on the part of their constituents.
They provide empirical evidence in favor of their theory by showing that local government performance in the U.K. is related to the parameters of the local seat-vote curve in the way the theory
suggests. Their work therefore suggests a novel theoretical mechanism why the form of the seatvote curve (and hence districting) matters for citizens’ welfare and provides evidence for this. By
contrast, our model reflects the conventional view that districting matters because it determines
which party gets the most seats and hence the ideological composition of the legislature. Clearly,
it would be desirable to develop an analysis in which both forces are present.
While our methods differ somewhat from those in the existing literature, it is important to note
that our substantive findings are in many respects quite similar. Our finding that responsiveness
associated with the equilibrium seat-vote curve averages 2.36 is similar to that found in the existing
literature. Although using data from a slightly earlier period, Gelman and King (1994) examine
legislative elections during the 1970s and 1980s for a similar set of U.S. states and report an average
responsiveness of just below two. Both of these estimates appear to be at the low end of the range
of estimated responsiveness parameters in the literature, which has found parameters between two
and four, with three being a focal point, a finding that has become known as the “cube law” (King
and Browning, 1987). Regarding partisan bias, our results are again qualitatively similar to those
in the existing literature: Gelman and King (1994) report bias parameters that average roughly
3 percent in absolute value across all states and years, while we find that bias associated with
24
the equilibrium seat-vote curve averages roughly 5 percent across states. However, while Gelman
and King find that on average the direction of partisan bias favors the Democrats, we find that
it favors the Republicans. This discrepancy may be explained by the fact that we are using data
from the 1990s districting period. In summary, the general similarity in our findings to those in
the existing literature suggest that the social welfare implications of our results may be robust to
alternative datasets and approaches to estimating equilibrium seat-vote curves and are not being
driven by anomalous estimated bias and responsiveness parameters.
6
Conclusion
This paper has explored the development of a methodology for comparing actual and optimal
districting plans and evaluating the welfare gains from socially optimal districting. We start by
showing that the equilibrium seat-vote curve, which is generated by actual districting plans, can
be expressed as a function of the district-specific and statewide moments, namely the mean and
variance, of the voting distribution, suggesting an empirical approach that only requires estimation
of these first two moments. We then develop empirical expressions for the optimal seat-vote curve,
which can be uncovered using the statewide moments of the voting distribution along with data on
the fraction of Independent voters across all districts. In addition, the conditions for implementing
the optimal seat-vote curve can be expressed as an upper bound on the fraction of Independent
voters and can thus be verified using similar data on these statewide moments and the fraction
of Independent voters. Finally, we develop expressions for the welfare gains from socially optimal
districting.
To illustrate the approach, this methodology is then applied to voting returns from U.S. state
legislatures during the 1990s. We find that responsiveness tends to be too large, relative to optimal
responsiveness, suggesting that, if anything, U.S. state legislative elections are too competitive and
that there are too few safe seats from a social welfare perspective. Encouragingly, the conditions
for implementing the optimal seat-vote curve are permissive and are satisfied in all states and by
a large margin in most states. While there is significant variation across states, the welfare gains
to socially optimal districting, relative to the overall surplus generated by state legislatures, are
on average quite small. This appears to be because state districting plans are reasonably close
to optimal rather than because there are little aggregate gains to be had from varying districting
plans. We also find that implementing proportional representation would lead to welfare gains
25
in all states and would produce welfare levels quite close to those achieved with socially optimal
districting.
While we feel that this approach provides a significant advance forward, we recognize its
limitations and hope to make advances both in terms of statistical techniques as well as in the
incorporation of several important real-world features of U.S. legislatures in future extensions
of the methodology. Regarding alternative statistical techniques, one step forward would be
the incorporation of statistical models for estimating the district-specific moments of the voting
distribution. This approach would be similar to that used Gelman and King (1990,1994) and
would require simulation of both the positive features, such as the equilibrium seat-vote curve,
and normative features, such as the welfare gains from socially optimal districting, of the analysis.
Second, in order to better fit the voting data, it would be useful to incorporate a district-specific
shock in the preferences of Independent voters into the empirical approach. With a district-specific
shock, district-specific support for parties is no longer a deterministic function of statewide support
for parties, as expressed in equation (4), and the positive and normative analyses will thus need
to modified accordingly. Regarding institutional features of legislatures, it would be interesting
to incorporate incumbency into the framework; this extension is challenging as it would obviously
require a dynamic political economy model. A second extension would involve race, a key focus
of current policy debates over representation and the redistricting process. One could consider
a model in which voters and candidates are characterized by both their ideology and their race.
Achieving the optimal racial representation and the optimal ideological representation in the
legislature may not be feasible and thus there may be a trade-off between race and ideology.
Finally, this model could be used to analyze partisan gerrymandering, under which districting
officials aim to further partisan, rather than social, objectives. This positive approach may provide
further insights into the differences between optimal and equilibrium seat-vote curves documented
in this paper as well as the sources of the measured welfare gains from socially optimal districting.
26
References
Ansolabehere, Stephen and James Snyder, [2002], “The Incumbency Advantage in U.S.
Elections: An Analysis of State and Federal Offices: 1942-200,” Election Law Journal, 1,
315-338.
Barone, Michael, William Lilley III, and Laurance DeFranco, [1998], State Legislative Elections: Voting Patterns and Demographics, Washington D.C: Congressional Quarterly.
Besley, Timothy and Ian Preston, [2005], “Electoral Bias and Policy Choices: Theory and
Evidence,” mimeo, London School of Economics.
Coate, Stephen and Brian Knight, [2005], “Socially Optimal Districting,” NBER working
paper 11462.
Gelman, Andrew and Gary King, [1990], “Estimating the Electoral Consequences of Legislative Redistricting,” Journal of the American Statistical Association, 85, 274-282.
Gelman, Andrew and Gary King, [1994], “Enhancing Democracy through Legislative Redistricting,” American Political Science Review, 88(3), 541-559.
King, Gary, [1989], “Representation through Legislative Redistricting: A Stochastic Model,”
American Journal of Political Science, 33(4), 787-824.
King, Gary and Robert Browning, [1987], “Democratic Representation and Partisan Bias in
Congressional Elections,” American Political Science Review, 81(4), 1251-1273.
Tufte, Edward, [1973], “The Relationship Between Seats and Votes in Two-Party Systems,”
American Political Science Review, 67(2), 540-554.
27
7
Appendix
Proof of Proposition 1: Note first that ∂ 2 W (S; m)/∂S 2 < 0, so that W (·; m) is strictly concave.
Thus, the first order condition is sufficient for S o to be optimal. Differentiating equation (7) yields
∂W (S o ; m(V ))/∂S = 2γ{πD + πI (1 − m) − S}.
Clearly, ∂W (S; m)/∂S = 0 if and only if
S = πD + πI (1 − m).
Substituting in the expression for m(V ) from (3), we obtain
S o (V ) = 1/2 + (πD − πR )(1/2 − τ ) + 2τ (V − 1/2)
as required. QED
Proof of Lemma: From (7), aggregate welfare when the median Independent has ideology m
and the Democrats have seat share S is given by:
W (S; m) = β − γ[πD (1 − S)2 + πR S 2 + πI
Using the fact that
R m+τ
m−τ
Z
m+τ
m−τ
(1 − S − x)2
dx
].
2τ
2
2
(1 − S − x)2 dx
2τ = (1 − S − m) + τ /3, we can re-write welfare as follows:
W (S; m) = β − γ{πD (1 − S)2 + πR S 2 + πI [(1 − S − m)2 + τ 2 /3]}
Reorganizing terms, we have that:
W (S; m) = β − γ{S 2 − 2S[πD + πI (1 − m)] + πD + πI [(1 − m)2 + τ 2 /3]}
Using the fact that πD + πI (1 − m) = S o , we have:
W (S; m) = β − γ{S 2 − 2SS o + πD + πI [(1 − m)2 + τ 2 /3]}
Finally, taking expected values with respect to V and using the fact that E[(1 − m)2 ] = E(m2 ) =
1/4 + ε2 /3, we have that:
E[W (S; m)] = β − γ{c + E(S 2 ) − 2E(SS o )}
where the constant is given by c = πD + πI [1/4 + ε2 /3 + τ 2 /3]. QED
28
.8
DE
.6
.7
.45
.55 .6 .65
.4
.5
.55
.2
.4
.5
.6
.7
.6
.8
.5
.6
KY
.6
.4
.6
.4 5
.5
.55
.6
.4
.4 5 .5 .55 .6
.2
.4
.2
.2
.2
.4
.4
.4
.4
KS
.6
.6
.45
IL
.8
IA
.8
FL
.5
1
.65
.8
.6
.8
.5 5
.2
.4
.5
.3
.4
.4
.5
.4
.6
.5
.7
.6
.6
.6
.7
.9
.8
CT
.8
CO
.7
CA
.8
AL
.45 .5 .55 .6 .65
.35 .4 .45 .5 .55
Figure 1a: Seat-Vote Curves
S( V)
Graphs by state
Optimal S(V)
.45
.5
.55
.6
MT
.6
.8
.6
.7
.2
.6
.3
.2
.4
.4
.4
.4
.6
.5
.8
.6
MS
.8
MO
.9
MI
.8
ME
.5
.6
.7
.45
.5 5
.6
.45 .5 .5 5 .6 .65
NV
1
.6
.65
.7
.35 .4 .45 .5 .55
OH
OK
.7
.6
.45
.5
.55
.6
.4
.6
.55
.6
.65
.7
.2
.3
.6
.2
.4
.5
.4
.4
.4
.6
.6
.5
.7
.8
.8
.8
.55
NY
.8
NM
.5
.6
.4
.4
.5
Figure 1b: Seat-Vote Curves
S( V)
Graphs by state
Optimal S(V)
.6
.45 .5 .55 .6 .65
1
.6
.4
.45
.5
.5
.4
.4
.6
Figure 1c: Seat-Vote Curves
S( V)
Graphs by state
.55
.6
.3 .4 .5 .6 .7
.8
.6
.4
.2
.35
.5
WI
.3
.1
.2
.4 .5 .6
.45 .5 .55 .6 .65
.4 5
VA
.4
.7 .8
.55 .6 .65 .7 .75
UT
.5
TN
.5
.3
.4
.3
.4
.6
.6 .7 .8 .9
.5
.6
.4
.2
.4 .45 .5 .55 .6
SC
.7
RI
.7
PA
.6
.8
OR
Optimal S(V)
.4 .45 .5 .55 .6
Table 1: States and Years Included in Analysis
state
AL
CA
CO
CT
DE
FL
IA
IL
KS
KY
ME
MI
MO
MS
MT
NM
NV
NY
OH
OK
OR
PA
RI
SC
TN
UT
VA
WI
first redistricting
1994
1992
1992
1992
1992
1994
1992
1992
1992
1996
1994
1992
1992
1995
1994
1992
1992
1992
1992
1992
1992
1992
1992
1992
1994
1992
1991
1992
subsequent redistricting
2002
2002
1998
2002
2002
2002
2002
2002
2002
2002
2002
2002
2002
2002
2002
2002
2002
2002
2002
2002
2002
2002
2002
1998
2002
2002
2001
2002
number districts
105
80
65
151
41
120
100
118
125
100
151
110
163
122
100
70
42
150
99
101
60
203
100
124
99
75
100
99
Table 2: Predicting Moments of Voting Distribution for Uncontested Districts
(all regressions include state-specific constant terms, standard errors in parentheses)
Dependent Variable
percent urban
percent suburban
household income (thousands)
percent with college degree
percent over age 65
percent African American
percent Hispanic
R-squared
Number of observations
ln(µ/1−µ)
0.2744
(0.0357)
0.1533
(0.0376)
-0.0152
(0.0016)
0.7805
(0.1624)
1.4364
(0.1960)
2.7355
(0.0876)
1.6590
(0.1441)
0.5689
2287
ln(σ)
-0.1313
(0.0539)
-0.1107
0.0569
-0.0020
(0.0024)
-0.5399
(0.2457)
-0.2967
(0.2965)
-1.2195
(0.1325)
-0.3654
(0.2180)
0.0971
2287
Table 3: Properties of Equilibrium and Optimal Seat-Vote Curves
state
AL
CA
CO
CT
DE
FL
IA
IL
KS
KY
ME
MI
MO
MS
MT
NM
NV
NY
OH
OK
OR
PA
RI
SC
TN
UT
VA
WI
Average
r
3.3018
1.5723
2.3034
2.0138
2.0933
2.9840
3.2076
1.6746
2.2100
4.9837
2.9785
1.4308
2.4998
2.5952
2.9718
2.1481
2.4760
1.2115
1.4504
3.4141
2.4147
1.3501
1.5070
2.3961
2.7522
1.8733
2.1171
2.0200
2.3554
optimal r
0.6947
0.6055
0.6823
0.6190
0.6007
0.6227
0.6867
0.6352
0.5938
0.5906
0.6557
0.6760
0.6570
0.7206
0.6890
0.6223
0.6027
0.6708
0.6199
0.5066
0.6109
0.5489
0.7329
0.6790
0.6903
0.6602
0.6397
0.6739
0.6424
b
n/a
-1.25%
-3.85%
-2.32%
-10.98%
-10.83%
-4.00%
-5.08%
-7.60%
-8.00%
0.33%
-8.18%
-7.06%
n/a
-6.00%
4.29%
11.90%
n/a
-7.58%
-5.45%
-1.67%
-2.71%
n/a
-9.68%
-2.53%
-4.67%
-2.00%
-7.58%
-4.27%
optimal b difference in expected seats
n/a
10.37%
1.24%
2.37%
-1.45%
-5.26%
1.68%
3.01%
-0.54%
-8.32%
0.66%
-3.63%
-0.24%
-3.51%
1.94%
-1.98%
-1.72%
-6.90%
1.04%
5.57%
0.55%
2.08%
0.11%
-4.06%
1.03%
1.52%
n/a
13.62%
-1.79%
-15.49%
0.74%
6.24%
0.77%
8.39%
n/a
6.95%
0.10%
-5.94%
1.67%
6.47%
-0.03%
-4.40%
0.77%
-1.18%
n/a
22.55%
-0.01%
-3.55%
1.32%
3.51%
-3.05%
-12.58%
0.44%
-1.24%
-0.23%
-4.32%
0.21%
0.37%
Table 4: Conditions for Implementability
state
AL
CA
CO
CT
DE
FL
IA
IL
KS
KY
ME
MI
MO
MS
MT
NM
NV
NY
OH
OK
OR
PA
RI
SC
TN
UT
VA
WI
% independents
29.44%
26.01%
39.12%
41.89%
36.20%
27.59%
40.46%
33.64%
31.18%
22.55%
45.71%
35.16%
37.15%
24.94%
36.36%
29.43%
29.78%
30.92%
32.41%
18.79%
30.99%
24.59%
51.19%
33.02%
33.94%
33.91%
34.42%
36.43%
maximum % independents
65.13%
71.89%
76.70%
64.83%
74.87%
79.51%
81.01%
69.07%
70.90%
78.80%
72.72%
75.95%
72.87%
64.07%
73.05%
77.12%
74.46%
56.84%
78.63%
74.59%
80.07%
76.03%
54.72%
81.60%
75.27%
67.75%
77.76%
80.55%
Table 5: Welfare Gains to Optimal Districting
state
AL
CA
CO
CT
DE
FL
IA
IL
KS
KY
ME
MI
MO
MS
MT
NM
NV
NY
OH
OK
OR
PA
RI
SC
TN
UT
VA
WI
Average
% welfare gains
2.59%
0.37%
1.28%
1.47%
2.40%
2.02%
2.40%
0.54%
2.03%
5.82%
3.33%
0.40%
1.37%
2.69%
4.76%
1.28%
2.48%
0.66%
0.73%
3.70%
1.60%
0.31%
6.50%
0.97%
1.21%
2.47%
0.95%
0.83%
2.04%
Table 6: Welfare Gains Under Alternative Parameter Values
η=1
η=0.75
η=0.50
η=0.25
γ/β=1
2.04%
2.22%
2.43%
2.66%
γ/β=0.75
1.44%
1.57%
1.72%
1.88%
γ/β=0.50
0.91%
0.99%
1.09%
1.19%
γ/β=0.25
0.43%
0.47%
0.52%
0.56%
Table 7: Welfare Gains From Worst Seat-Vote Curve
state
AL
CA
CO
CT
DE
FL
IA
IL
KS
KY
ME
MI
MO
MS
MT
NM
NV
NY
OH
OK
OR
PA
RI
SC
TN
UT
VA
WI
Average
% welfare gains (relative to worst S(V))
82.79%
62.22%
50.08%
55.86%
45.15%
50.22%
43.85%
61.18%
55.50%
53.84%
46.58%
53.85%
54.27%
95.31%
58.84%
51.68%
52.93%
97.02%
46.27%
57.09%
44.79%
50.76%
85.82%
47.92%
57.70%
66.74%
47.77%
46.09%
57.93%
Table 8: Welfare Gains to PR
state
AL
CA
CO
CT
DE
FL
IA
IL
KS
KY
ME
MI
MO
MS
MT
NM
NV
NY
OH
OK
OR
PA
RI
SC
TN
UT
VA
WI
Average
% welfare gains from implementing PR
2.57%
0.32%
1.24%
1.37%
2.28%
1.98%
2.37%
0.49%
1.95%
5.78%
3.25%
0.37%
1.33%
2.68%
4.73%
1.23%
2.40%
0.63%
0.66%
3.61%
1.54%
0.22%
6.48%
0.94%
1.18%
2.43%
0.90%
0.80%
1.99%
% welfare gains to optimal districting from PR
0.02%
0.05%
0.04%
0.10%
0.11%
0.04%
0.04%
0.05%
0.08%
0.04%
0.07%
0.03%
0.05%
0.01%
0.03%
0.05%
0.08%
0.03%
0.06%
0.09%
0.06%
0.09%
0.03%
0.03%
0.03%
0.04%
0.05%
0.04%
0.05%