Structural Equilibrium Analysis of Political Advertising

Advertising Competition in Presidential Elections∗
Brett R. Gordon
Wesley R. Hartmann
Columbia University
Stanford University
August 19, 2013
Abstract
Presidential candidates choose advertising strategically across markets based on
each state’s potential to tip the election. The winner-take-all rules in the Electoral
College concentrate advertising in battleground states, ignoring most voters. We
estimate an equilibrium model of competition between candidates to evaluate
advertising and voting outcomes. In a direct vote counterfactual, all states receive
positive advertising and both expenditures and turnout increase. Although states’
political preferences drive competition in the Electoral College, candidates focus
on cheap advertising targets in a direct vote. Simulations removing advertising
price variation suggest a direct vote spreads political attention uniformly across
markets with diverse preferences.
Keywords: Political advertising, presidential election, electoral college, direct
vote, resource allocation, voter choice, contests.
JEL: D72, L10, M37.
∗
We thank Jean-Pierre Dubé, Ron Goettler, Mitch Lovett, Sridhar Moorthy, Michael Peress, Stephan
Seiler, Ron Shachar, Ali Yurukoglu, and seminar participants at Chicago Booth, Columbia, Erasmus, Helsinki
(HECER), Kellogg, Leuven, MIT Sloan, NYU Stern, Stanford GSB, Toronto, University of Pennsylvania,
USC, Yale, Zürich, NBER IO, QME, SICS, and SITE for providing valuable feedback. All remaining errors
are our own. Gordon: Columbia University, Columbia Business School, 3022 Broadway, Uris Hall 511, New
York, NY, 10027, [email protected]. Hartmann: Stanford University, Graduate School of Business, 655
Knight Way, Stanford, CA, 94305, [email protected].
1
Introduction
Advertising is a critical instrument of competition between political candidates. In the 2012
elections, three billion dollars was spent on television advertising, representing the largest
component of candidates’ media expenditures (TVB, 2013). In the general election for
president, candidates strategically allocate advertising to media markets most likely to tip
the election’s outcome. Most advertising focuses on swaying a minority of voters who reside
in battleground states where significant uncertainty exists over the eventual winner of the
state’s contest. In contrast, candidates pay scant attention to the majority of the electorate
who reside in more polarized states where the outcome is predictable. Furthermore, evidence
suggests that candidates’ focus on battleground states extends beyond campaigns to both
policies and federal resources targeted at pivotal voters.1
The battleground state distinction arises under the Electoral College because most states
allocate all of their electoral votes to the candidate who receives a plurality of the state’s
popular vote.2 Marginal votes are only valuable in states where voters’ preferences are closely
divided between candidates. A direct (popular) vote is an oft-proposed alternative that
would eliminate the state-level contests and count each vote equally in a national contest. By
equalizing the relevance of marginal votes for the election outcome, a popular vote may more
equitably balance voters’ influence on candidates’ campaigns and policies.
To evaluate the distribution of candidate attention under a direct vote, we estimate an
1
Garrett and Sobel (2003) and Reeves (2011) find that presidents direct a disproportionate share of FEMA
disaster declarations and expenditures to battleground states. Berry, Burden, and Howell (2010) show that
swing states receive more federal funds after accounting for differences in electoral votes quantities. Outside
the US, Bloom et al. (2013) document that governing parties in the UK are less likely to close hospitals in
areas with narrow expected vote margins. For anecdotal evidence, see Page (2002), Pittman (2004), Solomon
et al. (2007), and Baker and Lipton (2012). One illuminating example is the case of Pennsylvania, which in
2011 contemplated changing its allocation rule to one that might have threatened the state’s battleground
status. Long-time Senator Arlen Specter opposed the move, arguing that “it’d be very bad for Pennsylvania
because we wouldn’t attract attention from Washington on important funding projects for the state. It’s
undesirable to change the system so presidents won’t be asking us always for what we need,” (Liptak, 2012).
The Senator’s explanation highlights the potential link between campaign incentives under the Electoral
College and policy formation preferences.
2
The two exceptions are Maine and Nebraska, which have used a congressional district method since 1972
and 1996, respectively. In practice, however, both states have always allocated all of their votes to a single
candidate.
1
equilibrium model of multimarket advertising competition between presidential candidates.
A direct vote counterfactual reveals that the new distribution of advertising would be more
uniform across markets. However, advertising price variation across markets and funding
disparities between candidates distort the advertising distribution, producing an allocation
that remains correlated with voters’ political leanings. Recomputing the equilibrium in a
counterfactual that equalizes advertising prices and candidates’ funding, we find candidates’
attention would be nearly uniformly distributed across markets despite extensive differences
in voters’ political preferences. This counterfactual likely represents how voters’ influence on
candidates’ policy formulation would change under a direct vote.
Our modeling approach is inspired by recent work on empirical games in industrial
organization (IO). Vote shares arise from an aggregate discrete-choice model in the style of
Berry (1994), where local demand shocks induce advertising endogeneity. Unlike standard IO
applications, however, we assume candidates do not perfectly observe the demand shocks,
which are realized when voters vote on Election Day but after candidates set advertising.
Candidates form rational expectations over these shocks, and set advertising levels to equate
their local marginal benefits of advertising to local advertising prices. A candidate’s local
marginal benefit of advertising is the product of (i) the change in the probability of winning
the election, which depends on the electoral mechanism, voters’ preferences, and candidates’
demand uncertainty, and (ii) the dollar value of such a change, which is a candidate-election
structural parameter that translates the marginal probability of winning into dollar terms
and allows us to endogenize spending levels. We refer to this parameter as the candidate’s
financial strength because of its ability to guide budget changes across election regimes.3
We apply the model to detailed data on votes and advertising from the 2000 and 2004
presidential elections. After estimating voter preferences, we form moments in a GMM
specification around the first-order conditions (FOCs) for candidates’ advertising decisions.
A candidate’s financial strength is identified based on a candidate’s average spending level.
Identification of candidates’ demand uncertainty stems from joint variation in advertising
3
Section 2.2 discusses this interpretation in more detail and describes how our model abstracts away from
a formal fundraising and budget process.
2
and vote shares: a candidate’s willingness to spend money in states with large ex-post voting
margins indicates a degree of uncertainty over voting outcomes.
In a counterfactual experiment with a direct vote in 2000, we find that all states receive
positive advertising. Aggregate spending on advertising increases by 25 percent due to the
expanded set of competitive markets. Advertising in former battleground states declines
by 42 percent and advertising increases in former non-battleground states by 130 percent.
The new distribution of advertising is more equitable, with the standard deviation across
markets in exposures per person dropping by 63 percent. However, advertising exposures
are “tilted” such that markets with left-leaning voter preferences receive 20% fewer exposures
per person. This discrepancy is due to systematically greater advertising costs per voter
in left-leaning markets. Although Gore loses about 50,000 popular votes under this new
advertising equilibrium, he wins the election with about a 450,000 popular vote margin.
Interestingly, we find a direct vote in the 2004 election produces an equilibrium without
advertising by either candidate. This outcome is due to Bush’s 2.5 percent national popular
vote advantage. Whereas advertising competition could overcome the 0.5 percent vote margin
in 2000, advertising’s effect on voters’ choices—relative to advertising prices—is not strong
enough to justify candidate spending in 2004. Of course, a direct vote would probably lead
candidates to alter other dimensions of competition (e.g., policy choices), and these changes
could make the election close enough to warrant positive advertising levels. However, we
must hold these factors fixed in our counterfactual analysis. Without such policy adjustments
to make the election more contestable, the endogenous spending component of our model
predicts that donors would be unwilling to fund advertising in the election. Extant models,
such as Shachar (2009) and Strömberg (2008), which hold budgets fixed, are unable to make
such an assessment and would potentially predict candidate spending even in effectively
uncontested elections.
These results highlight an important distinction between the two election mechanisms:
the collection of state-level contests in the Electoral College is more likely to produce a
set of contests with narrow absolute voting margins compared to the national contest in a
3
direct vote. Each presidential election between 1972 and 2008 had at least five states with
vote margins below the national popular vote margin (2012 had four). Advertising, or other
campaign activities, can be geographically targeted towards a narrow group of pivotal voters
in attempts to tip these contested states in a candidate’s favor. In contrast, in a direct vote,
voters’ influence on candidates is more diffuse, thereby requiring campaign resources and
policy allocations directed toward large groups of voters.
Our work contributes to several literatures. Early game-theoretic analyses in political
science primarily sought to explain observed resource allocations in the Electoral College
(Friedman, 1958; Brams and Davis, 1974; Owen, 1975). Snyder (1989) extends this work to
study two-party competition under more realistic modeling assumptions. Recent research,
such as Grofman and Feld (2005) and Strömberg (2008), examine how states’ size and
competitiveness affect presidential candidates’ strategies.
In contrast to the political science literature, we pursue a structural econometric approach
to study resource allocation in the Electoral College and a direct vote. The aggregate model of
voter choice accommodates instrumental variables to identify the causal effect of candidates’
controls, in contrast to earlier work’s use of probabilistic-voting models (e.g., Lindbeck and
Weibull, 1987; Strömberg, 2008). On the candidate side, we impose the model’s equilibrium
on the data to recover the primitives governing candidates’ decisions. Uncertainty is therefore
inferred from a comparison of candidates’ ex-ante advertising allocations and ex-post demand
realizations.
The analysis of presidential elections extends the structural econometric literature on
advertising. The relevant empirical literature on advertising considers a variety of issues:
the informational effects of advertising (e.g., Ackerberg, 2003; Goeree, 2008), measuring
dynamic effects (e.g., Sahni, 2012), and firms’ inter-temporal advertising decisions (e.g., Dubé
et al., 2005; Doganoglu and Klapper, 2006). Although these issues are likely relevant in
presidential campaigns, the primary strategic dimension for candidates is, however, geographic.
Advertising decisions in our model are interdependent across markets, yielding a multimarket
equilibrium in advertising allocations. Thus, we extend the econometrics of advertising to the
4
context of presidential elections where the stakes of advertising are arguably more important
than in many other applications.
Finally, this paper is relevant to the literature on contests (e.g., Tullock, 1980; Dixit, 1987).
The Electoral College is a nested contest where the final contest outcome depends on the
weighted sum of binary outcomes across state-level contests. In this literature’s terminology,
the contest success function determines a player’s probability of success as a function of
all players’ efforts (Skaperdas, 1996). We construct an empirical contest success function
using our estimates of voters’ preferences and the structure of the candidates’ game. To
econometrically analyze such a contest, we present a computational approach to calculate the
marginal effect of an agent’s effort (e.g., funds directed towards advertising, R&D, lobbying,
etc.). To the best of our knowledge, our work is among the first to estimate an empirical
contest model.
2
Model
We develop a model of strategic interaction between presidential candidates in the general
election. The game has two stages. First, candidates campaign in election t through advertising
to influence voters’ preferences. This campaign activity occurs in a single period as candidates
j = 1, . . . , J form rational expectations about voter preferences and simultaneously choose
advertising levels Atmj across M media markets. A market is composed of a collection
of counties c ∈ Cm , with each county populated by Nc voters. Advertising decisions are
made at the market level, such that advertising is the same across counties within a market,
Atcj = Atmj , ∀c ∈ Cm .
Second, Election Day signifies the conclusion of campaigning. Voters perfectly observe
the demand shocks ξtcj and candidates’ advertising choices. A voter chooses the candidate
who yields the highest utility or opts not to vote. Voting outcomes across all counties are
realized and one candidate is deemed the winner. In our application, we set J = 2 and ignore
minor party candidates for simplicity.
The model differs in an important way from standard equilibrium models of differentiated
5
products in the industrial organization literature (e.g., Berry et al., 1995). Typically firms
set prices to clear the market while observing the demand shocks ξ that influence consumers’
decisions. In our setting, candidates must choose advertising levels before votes are cast and
without perfect knowledge of ξ. As a result, candidates form beliefs over the demand shocks
at the time of their advertising decisions.
2.1
Voters
A voter who resides in county c ∈ Cm receives the following indirect utility for party j:
uitcj = βtj + g (Atmj ; α) + φ0 Xtc + γmj + ξtcj + εitcj
(1)
= δtcj + εitcj
where δtcj is the mean utility of the candidate. βtj is the average national preference for a
party in election t. We set g(Atmj ; α) = α log (1 + Atmj ) to permit advertising to exhibit
diminishing marginal effects. Xtc is a vector of observables at the county or market level that
shift voters’ decisions to turnout or their decisions to vote for a particular candidate. γmj
is a market-party fixed effect that captures the mean time-invariant preference for a party
in a given market (e.g., Democrats consistently attract votes in Boston and Republicans
consistently do well in Dallas). ξtcj is an election-county-party demand shock that voters
observe on Election Day but which is unobserved by candidates when they choose advertising.
εitcj captures idiosyncratic variation in utility. If a voter does not turnout for the election,
she selects the outside good and receives a utility of uitc0 = εitc0 .
Assuming that {εitcj }j are i.i.d. extreme-valued shocks, integrating over them implies
county-level vote shares of the form:
stcj (Atm , ξ; θv ) =
1+
exp (βtj + g (Atmj ; α) + φ0 Xtc + γmj + ξtcj )
P
exp (βtk + g (Atmk ; α) + φ0 Xtc + γmk + ξtck )
(2)
k∈{1,...,J}
where Atm = [Atm1 , . . . , AtmJ ]0 and θv is the vector of voter parameters. Gordon and
Hartmann (2013) explore a number of flexible specifications for voter demand, including
one with heterogeneous advertising preferences. However, since they do not find significant
6
heterogeneity, we focus on the model above with homogeneous preferences.
Our model assumes that voters are not strategic, such that a voter’s beliefs about the
likelihood their vote will be pivotal do not affect their voting decision. This assumption seems
reasonable given the extremely small probability that any given voter’s vote is pivotal in the
presidential election. However, it is possible that a voter’s (potentially biased) perception of
her pivotal probability could change under an alternative election mechanism, whereas our
model assumes that voters do not respond to such a change. Devising a method to incorporate
some notion of strategic voting into our advertising game is an important extension we leave
to future research.4
2.2
Candidates
We begin by discussing how candidates form beliefs over voting outcomes, and then present
a candidate’s objective function for advertising. We suppress the t subscript when possible
since our model treats elections as independent observations.
2.2.1
Candidate Belief Formation
Candidates set advertising without perfectly observing the demand shocks ξcj . Prior to
making their decisions, candidates gather information through campaign research and other
sources about the nature of potential demand shocks in each county. This information
provides the candidate with an expectation ξ¯cj of each shock’s realized value ξcj . Candidates
set advertising levels while forming beliefs according to
ξcj = ξ¯cj + ηmj ,
ηmj ∼ N (0, σ 2 )
(3)
where ηmj is common to all counties c ∈ m and σ is candidates’ ex-ante uncertainty over
voting outcomes. The shock ηmj enters at the market level, as opposed to the county level,
because candidates choose advertising at the market level.5
4
See Shachar and Nalebuff (1999) and Kawai and Watanabe (2013) for different approaches to inferring
strategic voting in large elections.
5
The specification of the shocks could be extended to allow for correlation across markets. This would,
however, increase the computational burden of estimation.
7
The ηmj represent all unknown factors at the time candidates set advertising that will
become known to voters on Election Day. Uncertainty over voting outcomes is an inherent
feature of political contests: unexpected gaffes, surprising news stories, and the weather all
contribute to candidates’ uncertainty over voters’ preferences on Election Day. For example,
Gomez, Hansford, and Krause (2007) find that rain differentially suppresses the turnout
of one party. Since the market-party fixed effects absorb time-invariant unobservables, the
realized value of ξcj captures election-candidate-specific deviations, such as whether it actually
rained in a county on Election Day.
2.2.2
Candidates’ Advertising Decisions
Candidates choose advertising levels Aj = [A1j , . . . , Amj , . . . , AM j ]0 based on the local marginal
costs and benefits of advertising. The marginal cost of advertising is simply the local price of
advertising faced by the candidate, ωmj . To characterize the marginal benefit of advertising,
let dj (A, ξ; θv ) indicate whether candidate j wins the election and let E [dj (A, ξ; θv )], taken
over ηmj , be the probability candidate j wins the election. The marginal benefit of advertising
in market m, expressed in probability units, is the derivative of E [dj (A, ξ; θ)] with respect to
advertising—the increase in the probability of winning the election given a small increase in
candidate j’s ads in market m. A candidate’s first-order conditions (FOCs) for advertising
must satisfy
Rj
∂E [dj (A, ξ; θv )]
≤ ωmj , for m = 1, . . . , M ,
∂Amj
(4)
where Rtj is an unknown structural parameter that translates the probability of winning into
dollar terms, placing both sides of the condition in equivalent units. Thus, the FOCs balance
the value of an increase in the candidate’s probability of winning the election relative to the
marginal cost of one unit of advertising.
Below we show that Rj has a natural interpretation when a candidate’s objective function
is specified in a form consistent with work on either candidate resource allocation or contest
theory. Both approaches yield advertising FOCs consistent with equation (4). We are agnostic
8
as to the precise interpretation of Rj beyond its ability to reasonably guide advertising in
different election regimes, as manifested in the particular form of dj (·).
Optimization with a Budget Constraint. Suppose each candidate sets advertising to
maximize his probability of winning subject to a budget constraint. As in Strömberg (2008)
and Shachar (2009), we do not specify an explicit model of budget formation. Unlike these
papers, however, we do not assume the budget is exogenous. Instead the observed budgets in
our model arise from the optimal allocation of resources among a pool of potential donors.
First, consider a candidate’s constrained objective function:
max E [dj (A, ξ; θv )]
Aj
s.t.
M
X
(5)
ωmj Amj ≤ Bj
m=1
where Bj is the budget. Both the Lagrangian Lj (B) and its multiplier λj (B) depend on all
the budget levels B = [B1 , . . . , BJ ] in the election due to the strategic interaction between
candidates. At a solution, the associated FOC is
∂Lj (B)
∂E [dj (A, ξ; θv )]
:
= λj (B) ωmj for m = 1, . . . , M .
∂Amj
∂Amj
(6)
Inspecting this FOC makes evident its equivalence with the FOC in equation (4). Second,
suppose there exists a pool of representative donors each faced with the decision of whether
to allocate funds to the candidate’s campaign or to an outside opportunity. Our assumed
optimal allocation of donor resources implies that in equilibrium,
λj (B ∗ ) =
M UI
, for j = 1, . . . , J.
Rj
Normalizing the marginal utility of income M UI to one, 1/Rj represents donors’ opportunity
cost of investing in the campaign relative to the utility they expect from the candidate
winning. In equilibrium, donors contribute funds until the shadow price of an additional
dollar, λj , is equal to 1/Rj , yielding the set of optimal budgets B ∗ = [B1∗ , . . . , BJ∗ ].
Thus, Rj is a policy-invariant parameter, independent of campaign fundraising, that
9
we refer to as a candidate’s financial strength. This specification provides a simple way to
conceptualize the endogenous formation of the budget. For example, consider a policy change
that alters dj to some d˜j . The previously optimal budgets B ∗ may imply that λj (B ∗ ; d) is
greater or less than 1/Rj because the left-hand side of equation (6) changes (i.e., moving to
d˜j changes the marginal benefit of advertising). Such an imbalance will result in a new set of
∗
∗ ˜
efficient budgets B̃ that equate each λj B̃ ; d = 1/Rj . We assume that political campaign
contributions are small relative to the broader fundraising market such that the return to
donors’ outside opportunities is invariant to the election mechanism and outcome. This
specification also ignores public good problems in funding candidates who would generate
non-exclusive benefits through their policies as president.6
Optimization in a Contest. Another formulation for a candidate’s objective function
that is consistent with equation (4) draws on contest theory (e.g., Tullock, 1980). Such
models consider the following unconstrained problem where the candidate balances the value
of winning against the total cost of advertising:
πj (A, ξ; θ) = Rj E [dj (A, ξ; θv )] −
M
X
ωmj Amj ,
(7)
m=1
where Rj is the value associated with candidate j winning.
Since the second term above is candidate j’s total spending, scaling the first term by Rj
converts the probability of winning into monetary terms. The key distinction relative to
the objective in equation (5) is the lack of a budget constraint imposed on the candidate.
The contest literature commonly refers to Rj as the “prize” of winning, e.g., Baron (1989)
interprets Rj as the candidate’s expected stream of benefits associated with winning office
6
We do not consider a donor’s expected utility from an election outcome and assume that Rj is invariant
to the actual amount of money donated. This implies two features of donor behavior. First, the marginal
utility of donor income must be independent of the amount they donate to the campaign. This is reasonable
unless policy changes significantly alter the proportion of a donor’s lifetime income that is offered to the
campaign. Second, the expected utility from the candidate winning cannot be contingent on the amount
donated. This assumption may be stronger because of common speculation that large donations earn political
favors. Nevertheless, such issues are beyond the scope of the paper. We merely hope that these assumptions
are reasonable for small local changes in donation amounts.
10
and any future election opportunities if he is successful.7 In contests, such as elections,
lobbying activities, and R&D races, participants expend resources no matter if they win or
lose. An important component of such models is the contest success function which determines
the probability of winning the prize given each participant’s (sunk) effort. In our model,
E [dj (A, ξ; θv )] plays the equivalent role.
2.2.3
Election Mechanism in the Electoral College
The function dj (·) encapsulates the rules of the election system that determine how votes
are tallied to select the winner. We focus here on the observed Electoral College mechanism
and discuss the direct vote mechanism in section 5. The relevance of states in our model
arises through the Electoral College, which aggregates popular votes at the state level to
electoral votes at the national level. The distinction between markets and states requires
some additional notation because multiple markets may exist in a state, m ∈ Ms . Counties
lie within an intersection of a state and market, c ∈ Cms . Finally, let dsj indicate whether a
candidate receives the plurality of a state’s votes,
(
)
X X
X X
dsj (As , ξ; θv ) = 1·
Nc scj (Am , ξ; θv ) >
Nc sck (Am , ξ; θv ), ∀k 6= j (8)
m∈Ms c∈Cms
m∈Ms c∈Cms
where As is the vector of advertising levels across candidates and markets in state s. Under a
winner-take-all rule, a candidate wins all of a state’s Electoral College votes if dsj (As , ξ; θv ) = 1.
Thus, a candidate wins the general election by obtaining a majority of the Electoral College
votes,
( S
)
X
dj (A, ξ; θv ) = 1·
dsj (As , ξ; θv ) · Es ≥ Ē
(9)
s=1
where Es is the state’s electoral votes and Ē = 270 is the minimum number required for a
majority.
7
These benefits could include the perceived monetary value of winning the election, the ability to implement
policies consistent with the candidate’s preferences, or simply the candidate’s “hunger” for the office.
11
3
Data
Four sources of data are combined for the analysis. First, we use advertising spending by
candidate within each of the top 75 designated media markets (DMAs). These markets
account for 78% of the national population. Second, to instrument for advertising levels in
the voter model, we obtain data on the price of advertising across markets. Third, voting
outcomes are measured at the county level. Fourth, we include a collection of control variables,
drawn from a variety of sources, based on local demographics, economic conditions, and
weather conditions on election day. Below we describe the key features of the data and refer
the reader to Gordon and Hartmann (2013) for further details.
3.1
Advertising
We measure advertising as the average number of exposures a voter observes. The advertising
industry commonly refers to this advertising measure as Gross Rating Points (GRPs), which
is equal to the percent of the population exposed (reached) multiplied by the number of times
each person was exposed (frequency). For example, 1,000 GRPs indicates that, on average,
each member of the relevant population was exposed 10 times.
Our advertising data come from the Campaign Media Analysis Group (CMAG) and
contains detailed information about each political advertisement.8 Although the data do not
directly contain GRPs, we derive them using the expenditure for an advertisement divided
by the cost-per-point (CPP) for the appropriate daypart (i.e., one of eight timeslots during
the day) of the ad in a given market.9 The CPPs by daypart come from SQAD, a market
research firm, and we aggregate exposures across dayparts into one advertising quantity, Amj ,
measured in thousands of GRPs. Ads are sponsored by a candidate, a national party, a
8
Freedman and Goldstein (1999) describe the creation of the CMAG data set in more detail.
The price of political advertising in the 60-days prior to the general election is subject to laws which
require the station to offer the purchaser the lowest unit rate (LUR). However, advertising slots purchased
with the LUR may be “preempted” by the TV station and replaced with a higher paying advertiser. The
TV station is only required to deliver the contracted amount of GRPs within a specific time frame, allowing
them to substitute less desirable slots for the original slot. According to the former president of CMAG,
well-financed candidates in competitive races rarely pay the LUR because they want to avoid the possibility
that their ads will be preempted by another advertiser (such as another candidate).
9
12
Table 1: Descriptive Statistics for Candidate Advertising
Obs
Mean Std. Dev.
Min
Max
2000 Election
GRPs Bush
GRPs Gore
Expenditures Bush ($ K)
Expenditures Gore ($ K)
CPMs Bush
CPMs Gore
75
75
75
75
75
75
5.82
4.78
879.84
681.53
8.07
8.11
5.59
0
15.89
5.67
0
17.94
1,218.73
0 6,185.45
1,072.94
0 5,941.61
2.07 3.47
16.73
2.12 3.47
16.87
2004 Election
GRPs Bush (K)
GRPs Kerry (K)
Expenditures Bush ($ K)
Expenditures Kerry ($ K)
CPMs Bush
CPMs Kerry
75
7.81
75
9.73
75 1,123.76
75 1,349.32
75
7.78
75
7.73
10.44
0
35.98
12.81
0
46.22
1,863.43
0 8,386.41
2,207.18
0 9,856.52
2.21 4.24
15.60
2.08 4.15
15.60
hybrid candidate-party group, or an independent interest group. We aggregate advertising
across sponsors when forming Amj for our estimation.
Table 1 reports the GRPs, expenditures, and average cost-per-thousand impressions
(CPMs) for each candidate. CPMs convert the CPPs to a per impression measure that
facilitates comparison across markets of different sizes. The CPMs in table 1 are a weighted
average across dayparts, with weights equal to a candidates’ share of advertising in that
market-daypart (see Appendix A for more details on the construction of advertising levels
and prices). Average CPMs paid decreased from 2000 to 2004 reflecting both decreases in the
actual CPMs in the data and some shifts in the distribution of candidates’ spending across
dayparts.
Figure 1, which plots GRPs by state against a state’s vote margin, illustrates the strategic
allocation of advertising across markets. The winner-take-all rule in the Electoral College
creates sharp incentives for candidates to concentrate their advertising in battleground states,
which contain only 41% of voters but receive 81% of advertising spending. Candidates do not
advertise at all in some markets with large vote margins.
A key point is that the breadth of advertising across vote margins in Figure 1 reveals the
13
Figure 1: GRPs by State-level Voting Margin in 2000 Election: Horizontal axis is
the state-level Republican vote share minus the Democrat vote share. Vertical axis is in hundreds of
GRPs, such that one unit indicates one exposure per voter, on average. Bubbles are proportional to
the state’s voting-age population.
180.0
Republican
Democrats
160.0
Gross Rating Points (00s)
140.0
120.0
100.0
80.0
60.0
40.0
20.0
0.0
-0.5
-20.0
-0.4
-0.3
Democat Stronghold
-0.2
-0.1
0.0
0.1
State Voting Margin (%)
0.2
0.3
0.4
0.5
Republican Stronghold
degree of candidate uncertainty about eventual vote outcomes. Advertising observed in a
state with a significant vote margin suggests a candidate might have been better off moving
those funds to a state at the (ex-post) margin of zero to potentially shift the state’s outcome.
Thus, ex-ante uncertainty about outcomes allows our model to rationalize candidate spending
in states with large realized vote margins.
3.2
Votes, Instruments, and Controls
Table 2 summarizes the data used to estimate the voter model. Voting outcomes at the county
level in 2000 and 2004 come from www.polidata.org and www.electiondataservices.com.
14
We restrict the analysis to the 1,607 counties contained in the 75 DMAs for which we observe
advertising. When estimating the candidate model and analyzing counterfactual policies,
voting behavior is held fixed in counties representing the remaining 22% of the population. In
each county, we observe the total number of votes cast for all candidates and the voting-age
population (VAP). The VAP serves as our market size for the county, which we use to
calculate voter turnout (i.e., the percentage of voters who choose the inside option to vote for
any candidate).10
As described in the model section, advertising is endogenous because candidates possess
knowledge about local demand shocks ξ¯cj . A variable naturally excluded from the demand
side that enters a candidate’s decision problem is the price paid for advertising. However,
particular realizations of the unobservable could induce a candidate to purchase enough
advertising to alter its market clearing price, violating the independence assumption required
for a valid instrument. Stories in the popular press confirm this suspicion (e.g., Associated
Press 2010). To avoid this concern, we use the prior year’s advertising price (i.e., 1999 for
2000 and 2003 for 2004) because there are no presidential, gubernatorial, or congressional
elections in odd-numbered years. Specifically, the instruments are the CPMs by daypart from
SQAD, interacted them with candidate dummies because candidates select different mixes of
dayparts across markets.
The DMA-party fixed effects, γmj , absorb time-invariant geographic variation in the
mean preferences for a given political party. These help establish the orthogonality of the
instruments and unobservables because political preferences and advertising prices tend to
be spuriously correlated across markets. We also include three categories of variables that
serve to explain election outcomes as well as address other potential correlations between
unobservables and instruments: (1) variables that measure local political preferences, (2)
variables that affect voter turnout but not candidate choice, and (3) demographic and
economic variables.
First, data from the National Annenberg Election Surveys (NAES) measures the percentage
10
A more accurate measure of turnout is the voting-eligible population (VEP) because it removes non-citizens
and criminals. However, data on the VEP is only available at the state level.
15
Table 2: Descriptive Statistics: Variables used in the voter model. County-level variables have
1,607 observations and market-level variables have 75 observations per year and party. All averages
are unweighted. LagCPM refers to the one-year lagged cost-per-thousand (CPM) impressions for
a particular daypart, reported in dollars. % identification variables come from the pooled NAES
surveys.
2000
Mean Std. Dev.
2004
Mean Std. Dev.
County-Level Variables
Republican Votes
Democrat Votes
Rain (in.)
Snow (in.)
% Aged 25 to 44
% Aged 45 to 64
% Aged 65 and up
Unemployment Rate
Average Salary (thous.)
Distance*100 (miles)
23,721
25,639
0.20
0.08
0.38
0.32
0.19
4.09
24.99
9.49
51,787
77,929
0.27
0.40
0.05
0.03
0.05
1.56
6.63
4.11
29,349
29,785
0.28
0.02
0.34
0.33
0.19
5.61
27.92
9.49
63,178
89,030
0.51
0.12
0.05
0.03
0.05
1.66
6.93
4.11
Market-Level Variables
LagCPM in Early Morning
LagCPM in Day Time
LagCPM in Early Fringe
LagCPM in Early News
LagCPM in Prime Access
LagCPM in Prime Time
LagCPM in Late News
LagCPM in Late Fringe
% Identifying Republican
% Identifying Democrat
4.18
4.85
5.88
6.26
7.06
12.37
8.50
7.18
0.29
0.30
0.95
1.11
1.45
1.63
1.91
3.47
2.10
1.81
0.06
0.05
5.47
4.89
6.67
7.93
10.41
16.17
12.26
8.30
0.33
0.31
1.55
1.08
1.34
1.73
2.54
4.66
2.56
1.94
0.06
0.06
16
of voters in a market who identify as Democrat or Republican. These data capture variation
in preferences across parties, and hence candidates, within a market. We include interactions
between the Democrat and Republican choice intercepts and the two party identification
variables to allow for asymmetric effects across parties. We also include an indicator for
whether the incumbent governor’s party is the same as the presidential candidate.
Second, we include two types of variables that should solely affect voters’ decisions to
turnout. Gomez, Hansford, and Krause (2007) show that weather can affect turnout in
presidential elections, and so we include county-level estimates of rain and snowfall on
Election Day. The other is a dummy if a Senate election occurs in the same market and
year, since strongly contested Senate races could spur additional turnout for the presidential
election.
Third, we add a set of demographic and economic variables to control for unobserved
changes in these conditions which could be correlated with within-market changes in voter
preferences and the advertising instruments. We use the county-level percentage of the
population in three age-range bins (e.g., 25 to 44) from the Census, the county-level unemployment from the Bureau of Labor Statistics, and the county-level average salary from the
County Business Patterns. Interactions between each candidate’s choice intercepts and the
demographic and economic variables capture differences across parties in voters’ responses to
these conditions.
4
Empirical Application
This section applies our model of advertising competition to data from two elections. We
begin with a discussion of the identification of our parameters, followed by our estimation
strategy, and the parameter estimates.
4.1
Identification
The voter parameters are θv = (β, α, φ, γ) and the candidate parameters are θc = (R, σ).
Identification of θv follows from standard arguments when estimating aggregate market share
17
models. In our particular setting, we use a combination of market-party fixed effects and
advertising prices as instruments to identify advertising effects. We first estimate the voter
model, then take those parameters as given when estimating the candidate model. As the
voter model is a standard differentiated-products logit, the rest of this section focuses on the
candidate model.
For the candidate parameters, inspection of the FOC in equation (4) reveals that the
Rj parameters are identified based on a candidate’s average advertising level in an election.
Identification of candidates’ uncertainty σ relies on systematic variation in advertising levels
and vote margins across markets. Figure 1 shows candidates purchased advertising in states
far from the zero voting margin. To justify such spending, candidates must have believed the
variance of the demand shocks was large enough to give these states a reasonable probability
of being on the margin, such that advertising might tip them in the candidate’s favor. Thus,
candidate uncertainty σ is identified by the degree to which candidates advertise in ex-post
uncontested markets.
To understand this intuition, recall that a candidate forms beliefs concerning the probability
of winning the election, E [dj (A, ξ; θv )], where this expectation depends on σ through equation
(3). A component of this election-level probability is the belief that a candidate wins the
state-level popular vote in s, E dsj (As , ξ; θv ) , where dsj (·) is defined in equation (8). If one
candidate in state s is strongly favored, both candidates’ incentives to advertise in markets in
s will be close to zero because dsj (·) is far from the indifference condition for both candidates.
Small changes in advertising will have little effect on E dsj (As , ξ; θv ) and, consequently, little
effect on E [dj (A, ξ; θv )]. Larger values of σ increase the returns to advertising by increasing
the probability that a demand shock could put dsj (·) close to the indifference point.
An alternative way of understanding the identification is to consider extreme values of σ.
If observed advertising levels were uncorrelated with differences in vote margins across states,
candidates would view outcomes as purely random, such that σ ≈ ∞. Otherwise, if σ ≈ 0, it
is hard to rationalize spending levels in many states with significant realized voting margins.
It is useful to contrast our identification strategy in two ways with the approach in
18
Strömberg (2008). First, he does not estimate the effect of candidates’ decisions (state-level
visits) on votes because visits vary little across candidates within states and the outcome
variable only measures the relative vote share of the two candidates. Second, he does not allow
candidates’ decisions to inform their uncertainty about election outcomes. The empirical
analysis recovers candidate uncertainty as the variances from a time random-effects regression
of state-level Democrat vote shares against various observables (e.g., polls, GDP growth,
etc.). Uncertainty is therefore identified solely through variation in vote shares. Although
this variation may be related to a candidate’s uncertainty, such variation would exist even if
candidates had perfect foresight regarding voting outcomes.
4.2
Estimation
To make the candidate model estimable, we add a stochastic component. Decompose the
true marginal cost of advertising into
ωmj = wmj + vmj ,
(10)
where wmj is an observed estimate of the marginal cost and vmj is a structural error observed
by the candidate but unobserved to the econometrician. The error term vmj forms the basis
of our estimation strategy.11
Given that advertising is a continuous choice variable, we form moments based on the
FOCs of the candidate’s decision problem. Our approach assumes the collection of advertising
choices we observe constitute a (pure-strategy) equilibrium of the advertising competition
game. Note that our model under the Electoral College may possess multiple equilibria but
our estimation strategy does not require us to solve the equilibrium. The primary complication
is that we observe some advertising choices on the boundary.
To explain our approach, first consider observations with positive advertising, where a
11
The unobserved cost shock therefore absorbs any other differences between the observed choices and the
model, such as differences across markets in the degree of uncertainty about outcomes. Note that including
the structural error vmj on the demand side, such as entering through the marginal effect of advertising, is
difficult because the non-linearities in dj (·) prevent inversion of an additively linear error term.
19
solution to the candidate’s FOCs exists and we can recover the econometric unobservable,
Rj
∂E [dj (A, ξ; θv )]
− wmj = vmj .
∂Amj
Given a suitable set of instruments z that satisfy E [v|z] = 0, we could form an estimator
around this moment. However, the equality above does not hold when A∗mj = 0. One
solution is to drop these observations. Although this action would reduce the efficiency of the
estimator, the more serious concern is that it might invalidate the moment condition because
possibly E [v|z, A > 0] 6= 0.
We argue that the specific context of a candidate’s decision problem should minimize
the concern that focusing on positive advertising markets results in a selection problem on
vmj . Whether this selection problem is an issue for estimation hinges on our beliefs about
the potential importance of vmj . In our data the observed marginal costs wmj are SQAD’s
forecasts for the election season. The vmj represent deviations between these forecasts and
candidates’ actual advertising costs. If a sufficiently large vmj draw would lead a candidate
to not advertise in a market, then dropping observations with zero advertising will lead to a
selection problem, such that E [v|z, A > 0] 6= 0.
Such an explanation seems unlikely because the zero advertising outcomes are primarily
due to demand-side shocks which, through the structure of the Electoral College, reduce the
incentives of candidates to advertise in non-battleground states. The more likely situation
is that a candidate only observes realizations of vmj after the candidate commits to some
positive amount of advertising in the market. Since we recover the demand-side shocks
ξcj using revealed preferences in the voter model, this reduces the concern of selection on
unobservable candidate-side shocks. We therefore assume that E [v|z, A > 0] = 0 because
candidates select markets in which to advertise based on wmj and the candidates’ beliefs
about ξ¯cj . We observe the former in our cost data and recover the latter from the demand
estimation.
Under these assumptions, estimation relies on the following moment condition:
∂E [dj (A, ξ; θv )]
E Rj
− wmj |zmj , Amj > 0 = 0 .
∂Amj
20
Let Mj+ denote the set of markets in which candidate j has A∗mj > 0 and M + =
P
j
Mj+ .
The relevant sample moment is
M+
j J
1 XX
∂E [dj (A∗ , ξ; θv )]
m(θ) =
Rj
− wmj ⊗ g(zmj ) ,
JM + j=1 m=1
∂A∗mj
where g(·) is any function and ⊗ is the Kronecker product. For instruments, we use DMA
voting margins differenced across two prior elections (e.g., for 2000, we difference the voting
margin for 1996 and 1992). Differencing removes any location-specific unobservable and
retains information that should relate to a candidate’s uncertainty about voting outcomes in
that market. We interact these variables with party-year dummies to form z.
4.3
Computing the Marginal Effect of Advertising
Calculating the marginal effect of advertising,
∂E[dj (A,ξ;θv )]
,
∂Amj
is challenging because the expec-
tation involves an M × J dimensional integral in the market-candidate specific shocks ηmj
and dj (·) is non-differentiable in Amj . The derivative is therefore only non-zero when a small
amount of advertising can alter the election outcome (i.e., dj (·) changes). We develop an
approach to calculate the derivative that takes advantage of this tipping point and reduces
the effective dimensionality of the integration.12
For a particular candidate j and market m in state s, two conditions define a tipping point
for dj (·). First, the number of electoral votes the candidate can gain (lose) in s, Es , must
surpass the candidate’s national electoral vote deficit (surplus) outside state s. Otherwise,
the election’s outcome is invariant to state s’s outcome. Second, within s, the candidate’s
vote advantage within market m must equal the candidate’s vote deficit outside the market
(i.e across other markets within the state). We state this condition in terms of a candidate’s
internal and external margin, IMmj (Am , ηm ) = EMmj (A−m , η−m ).13
12
Calculating the derivative by simulating E [dj (·)] in conjunction with finite differences will be imprecise
because small deviations from a given Amj are unlikely to shift the election outcome in dj (·). Larger steps
around a given Amj could be used, but this increases the bias of the finite difference estimate of the derivative.
13
It is also possible that there are not enough attainable votes within the market for advertising to place
the internal margin equal to the external margin. We address this and other details, such as when media
markets intersect multiple states, in Appendix B.
21
r
r
Our integration strategy simulates r = 1, . . . , N S draws over ηmk
and EMmj
, and given
r∗
the rth draw, solves for the critical value ηmj
that equates the internal and external margins,
r
r∗
r
r
r
IMmj
= EMmj
. Letting the PDF of η be f (·), we show more
Am , ηmj
, ηmk
A−m , η−m
formally in Appendix B that the marginal effect of advertising is equal to:
r
r
X
,
η
−∂η
A
,
EM
1 X
∂E [d\
(A, ξ; θv )]
m
mj
mk
r∗
=
f ηmj
if Es >
E`k (A` , η`r )−E`j (A` , η`r )
∂Amj
NS r
∂Amj
`6=s
r∗
where ∂η/∂Amj is evaluated at ηmj
. Intuitively, the derivative we seek equals the probability
∗
of drawing a critical value ηmj
times the derivative of this critical value with respect to
advertising. The condition on the right requires that the state be pivotal in the election’s
outcome: the number of electoral votes at stake, Es , must be larger than the candidate’s
electoral deficit outside that state, otherwise the derivative at the rth draw is zero. A
benefit of this approach is that the Monte Carlo integration is effectively over ηmk and EMmj ,
∗
combined with an analytic expression for ηmj
, instead of the original M × J dimensional
integral. Appendix B provides a derivation that generalizes to other contests or goals involving
multidimensional effort and provides more detail on the computation of the derivative in the
Electoral College and the direct vote.
4.4
Voter Model Estimates
Table 3 presents parameter estimates from the voter model. The advertising coefficient is
positive and highly significant. To help interpret the demand estimates, consider that the
average own advertising elasticity is about 0.03. This estimate is smaller than the median
advertising elasticity of 0.05 reported in a meta-analysis of consumer goods in Sethuraman et
al. (2011). We refer the reader to Gordon and Hartmann (2013) for more discussion of the
estimation results from the voter model.
We use the voter estimates to calculate two quantities. First, we measure the political
leaning of media markets and states by removing advertising’s effects while holding all other
factors fixed. Let s0jm be the vote share of candidate j ∈ {R, D} in market m when all
advertising is set to zero. We define the political leaning of a market as the Republican share
22
of the two-party vote in the absence of advertising:
Lm =
s0Rm
.
s0Rm + s0Dm
This provides a summary measure of voters’ party preferences without the potential contaminating effects of advertising.
Second, we calculate the cost per marginal vote at the observed advertising levels in each
media market:
CP Vmj = where smj (Am , ξ; θv ) =
P
c∈Cm
CP Pm
∂smj (Am ,ξ;θv )
∂Atmj
,
(11)
Nm
Nc scj (Am , ξ; θv ) and CP Pm is calculated to be the common
cost per point across candidates.14
Figure 2 plots the cost per marginal vote in the 2000 election against the political leaning
of the media market. The majority of markets with substantial advertising tend to have
a cost per marginal vote around $75. The highest cost, $204 per vote for Republicans in
the Miami-Ft. Lauderdale market, is over twice that faced by the Democrats in the same
market. Inspecting equation (11) indicates that the disparity in costs must arise solely
through asymmetries in the marginal effect of advertising, ∂smj /∂Amj . The Republican’s
marginal effect of advertising is lower in Miami-Ft. Lauderdale for two reasons. First, in the
absence of advertising, this market leans substantially to the left with a political leaning
of Lm = 0.41, such that it is generally more difficult for Republicans to generate votes in
the market. Second, the Republicans purchased 64% more GRPs in Miami-Ft. Lauderdale
than the Democrats, leading to greater diminishing marginal effects of advertising for the
Republicans. Figure 2 also depicts many markets with low costs per marginal vote, yet the
marginal benefit of these votes is generally small because they are in polarized states that are
unlikely to tip.
14
The CP Pm in the equation (11) differs slightly from the wmj used to estimate the candidate model. To
facilitate comparison, we calculate CP Pm as the common cost across candidates, weighting each daypartspecific CP P by the fraction of total exposures purchased in the entire election in that daypart, as opposed
to using candidate-specific weights.
23
Table 3: Voter Model Estimates. Parameter estimates from the two-candidate voter model
with 6,428 observations. Robust standard errors clustered by DMA-Party are in parentheses. F-stat
of excluded instruments is 88.2. ‘*’ significance at α = 0.05 and ‘**’ significance at α = 0.01. Some
coefficients omitted due to space.
Candidate’s Advertising
Senate Election
Gov. Incumbent Same Party
Rain (in.)
Rain × 2004
Snow (in.)
Snow × 2004
Distance*100 (miles)
% 25 ≤ Age < 44
% 25 ≤ Age < 44 × 2004
% 25 ≤ Age < 44 × Republican
% 45 ≤ Age < 64
% 45 ≤ Age < 64 × 2004
% 45 ≤ Age < 64 × Republican
% 65 ≤ Age
% 65 ≤ Age × 2004
% 65 ≤ Age × Republican
% Unemployment
% Unemployment × 2004
% Unemployment × Republican
Average Salary
Average Salary × 2004
Average Salary × Republican
Fixed Effects
Party
Year-Party
DMA-Party
24
Coefficient
Std. Err.
0.0693**
0.0134
0.0090
0.0300
-0.0201
-0.0108
-0.2210**
0.0036
-0.7317**
-1.2942**
0.9047*
3.7490**
0.3468
1.6595**
0.1538
-1.6048**
1.6091**
0.0019
0.0101*
-0.1229**
0.0161**
0.0038**
-0.0195**
0.0159
0.0098
0.0106
0.0293
0.0285
0.0072
0.0606
0.0025
0.2186
0.1987
0.3535
0.3345
0.2247
0.5279
0.3771
0.1700
0.5254
0.0108
0.0050
0.0123
0.0020
0.0007
0.0024
Y
Y
Y
Figure 2: Cost Per Marginal Vote in the 2000 Election. Horizontal axis is a market’s
political leaning Lm . Bubbles are proportional to a candidate’s GRPs in the market.
$250
Republican
Democrat
Cost per Marginal Vote
$200
$150
$100
$50
$0
0.25
0.30
0.35
0.40
0.45
0.50
0.55
0.60
0.65
0.70
0.75
Republican Share of 2-Party Vote in DMA (Net of Ad Effects)
4.5
Candidate Model Estimates
Table 4 presents the parameter estimates from the candidate model. In 2000, Bush outspent
Gore by 29%, and the corresponding Rj estimates imply that Republicans had a 45% greater
financial strength. In 2004, overall spending increased and Kerry outspent Bush, yet Bush’s
Rj is slightly higher than Kerry’s. Bush’s lower spending level may reflect greater confidence
in securing this less contested election rather than weaker financial support.
Uncertainty in 2004 is almost half of that in 2000. To explore the degree of candidate
uncertainty, we consider the implications for both state and national outcomes in the 2000
election. Figure 3 presents candidates’ beliefs about the likelihood of a Republican victory in
25
Table 4: Candidate Model Estimates: Standard errors in parentheses.
Parameters
σ
Rj ($M)
Observed Spending ($M)
2000 Bush
2000 Gore
0.157 (0.010)
146.4 (2.743)
100.8 (2.233)
66.0
51.1
2004 Bush
2004 Kerry
0.079 (0.007)
284.8 (5.999)
282.1 (5.637)
84.2
101.7
each state. The shaded bars represent those states in which each candidate has at least a
ten percent chance of winning. Among these, we see well-known battleground states such
as Florida, New Mexico, and Pennsylvania. The variation in state outcomes translates into
a distribution of electoral vote margins depicted in Figure 4. Although the distribution is
centered around zero because of the closeness of this election, the standard deviation is about
55 electoral votes, indicating a reasonable degree of uncertainty over the election outcome.
Most of this variation is due to the fact that several battleground states, such as Florida,
Ohio, and Pennsylvania, possess a large number of electoral votes.
5
Counterfactuals
We consider a counterfactual election system with a direct (popular) vote, in which the
candidate with the most popular votes is deemed the winner. Although other Electoral
College reforms have been considered, such as the proportional allocation of Electoral College
votes, a direct popular vote has come the closest to being passed (Congressional Research
Service, 2009). Conducting such a counterfactual allows us to understand how candidates
reallocate their resources (e.g., advertising dollars) under a new electoral process and how
voters subsequently respond.
In conducting this counterfactual, we assume other factors remain fixed. Such factors
0
are presently absorbed into a candidates’ local mean utility net of advertising, δcj
= δcj −
α log (1 + Amj ), and include candidates’ other campaign activities (e.g., get-out-the-vote
efforts) and voters’ preferences over their policy positions. The extent to which these
26
Figure 3: Candidates’ Beliefs by State: Each bar represents the probability that the
Republican candidate will win a majority of a state’s popular vote. Shaded bars indicate those
states in which each candidate has at least a ten percent chance of winning.
1
0.9
Probability of Republican Victory
0.8
0.7
0.6
0.5
0.4
0.3
0.2
0
DC
Hawaii
New York
Vermont
Massachusetts
Rhode Island
Maryland
Connecticut
California
New Jersey
Illinois
Delaware
Michigan
Pennsylvania
Maine
Washington
Minnesota
Iowa
Oregon
Wisconsin
New Mexico
Florida
New Hampshire
Nevada
Missouri
Ohio
Tennessee
Arizona
Arkansas
Colorado
West Virginia
Virginia
Georgia
Louisiana
North Carolina
Kentucky
Oklahoma
Indiana
Alabama
Kansas
Alaska
Idaho
Mississippi
Montana
Nebraska
North Dakota
South Carolina
South Dakota
Texas
Utah
Wyoming
0.1
factors could impact our counterfactual results depends on their relationship with advertising
incentives.
5.1
Direct Vote Model
To implement the direct popular vote, we modify the election mechanism represented by dj (·).
The total number of popular votes a candidate receives is
Ṽj (A, ξ; θv ) =
X X
m∈M c∈Cm
27
Nc scj (A, ξ; θv ) .
Figure 4: Candidates’ Beliefs about the Distribution of Electoral Vote Margins.
Simulated distribution of electoral vote margins in 2000 given candidates’ estimated beliefs.
-200 -180 -160 -140 -120 -100
-80
-60
-40
-20
0
20
40
60
80
100
120
140
160
180
200
Republican Electoral Votes - Democrat Electoral Votes
With J = 2, candidate j wins the election if his vote count exceeds the opposing candidate’s
votes,
n
o
d˜j (A, ξ; θv ) = 1· Ṽj (A, ξ; θv ) > Ṽk (A, ξ; θv ) .
In what follows, we express dj (·) as a function of η because we set ξ¯ equal to the recovered
value, ξ, while the model solution requires explicit integration over η. The direct vote objective
function is
M
h
i X
π̃j (A; θ) = Rj E d˜j (A, η; θv ) −
ωmj Amj ,
m=1
with new FOC for advertising of
28
(12)
h
i
∂E d˜j (A, η; θv )
Rj
∂Amj
≤ ωmj , for m = 1, . . . , M .
(13)
Existence of a pure-strategy equilibrium follows from basic results because a candidate’s
objective is continuous and quasi-concave in his advertising variables. To compute the
equilibrium, we solve the 150 advertising choices that simultaneously set the FOC to zero for
each candidate.15 Discussed more fully in Appendix B, the marginal effect of advertising on
the probability of winning is:
r
r
∂ηmj Am , EMmj
, ηmk
1 X
∂E [d\
(A, η; θv )]
r∗
=
−fηmj ηmj
∂Amj
NS r
∂Amj
1 X
α
r∗
=
fηmj ηmj
NS r
1 + Amj
!
.
r
r
r
r∗
where EMmj
≡ EMmj
A−m , η−m
is the external margin in market m and ηmj
=
r
r
ηmj Am , EMmj
, ηmk
is the critical value of the shock that equates the external and internal margins. The derivative of η(·) with respect to Amj in our application is
−α
1+Amj
because Amj and ηmj are perfectly substitutable within the utility function as follows:
δcj = δ̃cj + α log (1 + Amj ) + ηmj . We have suppressed an indicator function that sets the
r
derivative for the rth draw equal to zero when Nm < EMm
. Thus, we compute the equilibrium
by solving the system below for the advertising levels:
"
#
NS
1 X
α
r∗
R`
fη (ηm`
)
− ω`m ≤ 0, for m = 1, . . . , M and ` = j, k
N S r=1
1 + Am`
r∗
r
r
s.t. ηm`
= ηmj Am , EMmj
, ηmk
, for r = 1, . . . , N S
Am` ≥ 0 for m = 1, . . . , M and ` = j, k
15
We set the residuals to zero in all counterfactuals, such that ωmj = wmj . Relatively large residuals
in some markets tend to be correlated within a state (e.g., we recover large unobservables for Bush in all
California DMAs in 2000). These instances likely represent a forecast error in the candidate’s belief about
the probability tipping a given state. However, state boundaries are meaningless in a direct vote. Given
this, we do not expect such residuals to carry over from the Electoral College to the counterfactual. Note
that we did not explicitly model candidate-side unobervables in demand because they are inseparable from
unobserved costs and are not additively separable in the candidate’s FOCs. Our residuals therefore include a
combination of candidates’ demand-side and cost unobservables.
29
subject to a set of complementarity conditions between the FOCs and the advertising choice
variables to allow for zero advertising outcomes.
5.2
Outcomes under a Direct Vote
Table 5 summarizes the counterfactual equilibrium and compares outcomes to those in the
Electoral College. In the direct vote, all markets receive positive advertising and total spending
increases by 25.2%. The advertising levels in the direct vote provide an assessment of the
extent to which battleground states attract disproportionate attention from candidates in the
Electoral College. Advertising intensity, in terms of average exposures per vote, decreases by
about 42% in battleground states and increases by roughly 130% elsewhere. This reallocation
of advertising away from battleground states to many large markets results in a significant
decrease in the overall variation in campaign attention across the county. Despite the fact that
average exposures increases nationally by 26 percent from 91 to 115, the standard deviation
in exposures across markets declines from 110 to 41, or 63 percent.
The new distribution of advertising in the counterfactual flips the popular vote in four
states (Iowa, New Mexico, Oregon, and Wisconsin) from favoring Gore to Bush. Although
Gore loses about 50,000 votes, the net result is that Gore wins with a popular vote margin of
about 450,000 votes. California plays a important role in the counterfactual, as illustrated in
Figure 5, which shows how voting outcomes change between the Electoral College and the
direct vote. Under the direct vote, Democrats lose votes in most states, even those where
they enjoy voters’ support (i.e., Lm < 0.5). Turnout increases in former non-battleground
states because of increased advertising, and turnout declines in former battleground states
due to the reduction in advertising. However, the votes Gore gains in California (bubble in
upper portion of Figure 5) compensate for the votes lost in the smaller states.16
Figure 6 plots the new advertising levels against each market’s political leaning. The
figure makes clear that Bush advertises more than Gore in nearly all markets due to his
higher value for Rj . Although significant variation remains, the distribution of advertising
is generally flatter in the direct vote compared to the Electoral College (recall Figure 1).
16
See Appendix C for a comparison of voter turnout under each election mechanism.
30
Figure 5: Outcomes under the Direct Vote: The vertical axis presents the change in
the Democrat’s state-level voting margin against a measure of a state’s political leaning net of
advertising effects. The bubbles are proportional to the absolute change in voter turnout within
each state, with empty bubbles indicating a reduction in voter turnout. The large bubble in the
upper part of the figure is California.
6%
Democrat Change in Vote Margin
5%
4%
3%
2%
1%
0%
-1%
-2%
-3%
0.2
0.3
0.4
0.5
0.6
0.7
Republican Share of Two-Party Vote (Net of Ad Effects)
31
0.8
Figure 6: Advertising under the Direct Vote. Baseline result from the counterfactual
under a direct vote. The horizontal axis is the political leaning of the market Lm , as defined by
the Republican share of the two-party vote with advertising set to zero. Each bubble’s size is
proportional to the population in the market.
140
Republican
120
Democrat
GRPs (00)
100
80
60
40
20
0
0.25
0.3
0.35
0.4
0.45
0.5
0.55
0.6
0.65
Republican Share of Two-Party Vote in DMA (Net of Ad Effects)
32
0.7
0.75
Table 5: Comparison of Observed and Counterfactual Results in 2000: List of battleground
states taken from Cook’s Political Report, September 2000 classification of state competitiveness.
Voting
Votes (m)
Popular Vote Margin (m)
% Voter Turnout
Observed
Bush
Gore
Direct Vote
Bush Gore
50.46 51.00
-0.54
61.89
51.52 52.02
-0.49
63.17
Advertising
Total Spending ($ m)
117.1
Spending ($ m)
66.0
51.1
Avg. Exposures
90.68
Std. Dev. of. Exposures
109.78
Avg. Exposures (K)
50.53 40.15
Avg. Exposures in
Battleground states
115.14 108.59
Non-battleground states 29.89 18.29
146.7
87.1 59.6
114.66
40.81
68.23 46.43
79.30 51.87
64.69 44.69
Table 6 summarizes the population-weighted total GRPs separately for left-, center- and
right-leaning markets, where center markets are defined as those with a political leaning of
Lm ∈ [.45, .55]. The winner-take-all nature of state-level competition in the Electoral College
effectively excluded campaigns from advertising in the most left- and right-leaning markets.
In the counterfactual, voters in left-leaning states receive 21% fewer advertising exposures
compared to voters in centrist states.17 In the next section, we show this discrepancy in
advertising allocations is not simply due to the asymmetry in candidates’ Rj ’s, but depends
on a combination of factors, most notably variation in advertising prices across markets.
5.3
Decomposing the Geographic Distribution of Advertising
Figure 6 reveals significant variation in advertising levels despite the switch to the direct
popular vote. Variation in advertising levels arises through three sources: asymmetries in the
candidates (Rj ), market-level variation in advertising costs (ωmj ), and county-level variation
17
This result is in contrast to the belief in Grofman and Feld (2005) that moving to a direct vote would
lead candidates to focus entirely on the largest media markets.
33
Table 6: GRPs By Political Leaning. GRPs in units of 100 with population-weighted
averages across DMAs. DMAs in the center are those defined to have between a political leaning of
between 45% and 55%, as measured using the Republican share of the two-party vote excluding
advertising. Left-leaning markets are those with less than a 45% political leaning and right-leaning
markets have greater than a 55% political leaning.
Political Leaning of State (Net of Advertising)
Left
Center
Right
GRPs % from
GRPs
% from
GRPs
Center
Center
Electoral College
26.34
Direct Vote
Baseline
97.34
Symmetric Candidates 61.45
Constant CPMs
119.94
Symmetric Candidates 62.07
& Constant CPMs
-85%
177.12
-98%
3.42
-21%
-20%
7%
-9%
122.59
77.25
111.80
68.33
0%
5%
-15%
-3%
122.74
80.79
101.80
66.55
0
in voters’ political preferences (δcj
). To gain a better understanding of the relative importance
of each factor in determining the equilibrium vote and advertising outcomes, we consider a
sequence of simulations that remove the asymmetries in Rj and the variation in ωmj .
Symmetric Rj . The upper panel of Figure 7 presents the equilibrium advertising levels
after setting each candidate’s Rj equal to the average of their estimated values. Both
the Republican and the Democratic advertising levels decline. Advertising levels are more
symmetric across candidate and less dispersed, with the majority of markets receiving six
thousand GRPs or fewer. However, Table 6 documents that even when candidates have
symmetric financial abilities, right-leaning markets receive 5 percent more GRPs than the
center and 31 percent more than the left.
To understand the variation in advertising levels in this symmetric model, it is important
to consider the role of geographic variation in advertising costs. Figure 8 plots the 2000 CPM
against each market’s political leaning. The cheapest media markets are in the center, which
likely explains the greater ad levels in the center in the upper panel of Figure 7. The next
cheapest CPM markets are to the right while more of the expensive markets are to the left.
34
Figure 7: Equilibrium Advertising in a Direct Popular Vote: Upper panel sets makes
each candidates’ Rj symmetric by setting the values equal to the average of the estimates. Middle
panel equalizes the advertising price per exposure across markets. Bottom panel makes symmetric
the Rj values and equalizes advertising prices.
140
Republican
Democrat
120
GRPs (00)
100
80
60
40
20
0
0.25
0.3
0.35
0.4
0.45
0.5
0.55
0.6
0.65
0.7
0.75
0.25
0.3
0.35
0.4
0.45
0.5
0.55
0.6
0.65
0.7
0.75
0.25
0.3
0.35
0.4
0.45
0.5
0.55
0.6
0.65
0.7
0.75
140
120
GRPs (00)
100
80
60
40
20
0
140
120
GRPs (00)
100
80
60
40
20
0
35
Republican Share of Two-Party Vote in DMA (Net of Ad Effects)
Figure 8: CPM by Political Leaning of the DMA in 2000: Vertical axis is the costper-thousand impressions (CPM). Bubble size is proportional to market population.
$18
$16
$14
$12
CPM
$10
$8
$6
$4
$2
$0
0.25
0.3
0.35
0.4
0.45
0.5
0.55
0.6
0.65
0.7
0.75
Republican Share of 2-Party Votes in DMA (Net of Ad Effects)
Constant CPM. To remove the geographic variation in advertising prices, we solve the
equilibrium after equalizing the CPMs across markets, setting it to the population-weighted
average price. This change makes the cost of reaching voters constant across markets. The
middle panel of Figure 7 presents the advertising levels under constant CPMs and with
the (asymmetric) estimated Rj values. Two features stand out in this panel. First, each
candidate’s advertising distribution is flatter across markets of different political leanings. In
fact, the advertising tilts slightly to the left, with left-leaning markets receiving 7 percent more
GRPs than the center and 17 percent more than the right. Second, Republican advertising is
greater than Democrat’s in all but one market. This arises from Bush’s greater Rj .
36
Symmetric Rj and constant CPM. The bottom panel of Figure 7 presents the advertising levels under both symmetric Rj and constant CPMs.18 Advertising exposures are
nearly symmetric across markets. This model with symmetric Rj and equal ad costs across
markets therefore approaches the theoretical ideal of nearly uniform candidate attention
across the country. The standard deviation of exposures is reduced to 10 exposures per
person, whereas the top and middle panels involved standard deviations of 27.5 and 14.4.
Markets in the center still receive slightly more exposures, but the left now has less than 10%
fewer exposures than the center. This remaining variation likely derives from differences in
advertising elasticities across political leanings.
The lack of advertising price and funding variation suggests that this counterfactual may
be more representative of how other dimensions of candidates strategies might be adjusted in
a direct vote. The winner-take-all rule in the Electoral College leads politicians to choose
policies aimed at narrow constituencies in battleground states, rather than developing broad
programs and investment in general public goods.19 Just as a direct vote would lead to a
nearly uniform distribution of advertising attention in the absence of price variation, it may
similarly eliminate candidates’ incentives to cater their policies toward battleground states.
6
Conclusion
States’ winner-take-all rules for electoral votes concentrate advertising in closely contested
states. This concentration of candidate resources minimizes the political role of roughly
two-thirds of voters who reside in more polarized states. This paper develops a tractable
empirical model of advertising competition between presidential candidates to understand
how candidates’ attention changes under different election mechanisms. We recover voter
18
0
Note that making the δcj
constant across markets would effectively make the entire country one large
undifferentiated market, such that advertising would be constant across all markets.
19
Recent theoretical work highlights the economic policy implications of electoral rules (e.g., Lizzeri and
Persico, 2001; Persson and Tabellini, 2009). A wealth of evidence supports the view that incumbent presidents
enact policies to favor battleground states, e.g., see footnote 1 in the introduction. One journalist, commenting
on the 2012 election, wrote that if a presidential candidate needs to carry Florida, “it elevates the already
ever-present need candidates feel to pander to elderly voters, Cuban-Americans, orange-growers and any
other group that can deliver a bloc of Floridians. The same thing with Iowa and ethanol subsidies and other
agriculture-friendly policies,” (Black, 2012)
37
preferences using an aggregate discrete choice model and address the endogeneity of advertising
levels. Estimating the candidate side allows us to identify the primitives that guide candidates’
advertising allocations. With only three candidate structural parameters, the model is
parsimonious yet flexible. Under a direct vote in the 2000 election, we find total advertising
spending increases by 25 percent, with significant shifts across markets: advertising intensity
in former battleground states drops by almost half and more than doubles in other states.
Perhaps the greatest contrast between the election mechanisms is the role of political
preferences in candidates’ strategies. In the Electoral College, the political preferences of
voters in contested states dominate candidates’ strategies. Our countefactuals in the 2000
election illustrate that local political preferences would play a minimal role under a direct vote
and, in the case of advertising, are overwhelmed by local variation in the cost of reaching voters.
Although our focus has been on advertising, the structural change in candidates’ incentives
(through our contest success function) generalizes to the consideration of other candidate
strategies, such as policy platforms during elections and policy decisions in anticipation of
future contests. It is therefore reasonable to expect that a direct vote would similarly diminish
the role that local political preferences currently play in the formulation of policy.
References
[1] Ackerberg, D. (2003), “Advertising, learning, and consumer choice in experience good markets: an
empirical examination,” International Economic Review, 44(3), 1007-1040.
[2] Associated Press (2010), “Sick of Campaign Ad Avalanche? TV Stations Aren’t,” October 30.
[3] Baker, P. and E. Lipton (2012), “In a Tight Race, Obama Draws on the Levers of His Power,” The New
York Times, September 19.
[4] Baron, D. P. (1989), “Service-Induced Campaign Contributions and the Electoral Equilibrium,” The
Quarterly Journal of Economics, 104(1), 45-72.
[5] Berry, C. R., Burden, B. C., Howell, W. G. (2010), “The President and the Distribution of Federal
Spending,” American Political Science Review, 104(4), 783-799.
[6] Berry, S. (1994), “Estimating Discrete-Choice Models of Product Differentiation,” RAND Journal of
Economics, 25(2), 242-262.
[7] Berry, S., J. Levinsohn, and A. Pakes (1995), “Automobile Prices in Market Equilibrium,” Econometrica,
63(4), 841-890.
[8] Black, E. (2012), “10 reasons why the Electoral College is a problem,” Minneapolis Post, October 16.
38
[9] Bloom, N., C. Propper, S. Seiler, J. V. Renenen (2013), “The Impact of Competition on Management
Quality: Evidence from Public Hospitals,” working paper, Stanford University.
[10] Brams, S. J. and M. D. Davis (1974), “The 3/2’s Rule in Presidential Campaigning,” American Political
Science Review, 68(1), 113-134.
[11] Congressional Research Service (2009), “Electoral College Reform: 111th Congress Proposals and Other
Current Developments,” by T. H. Neale, 7-5700, R40895, www.crs.gov.
[12] Dubé, J.-P., G. Hitsch, and P. Manchanda (2005), “An Empirical Model of Advertising Dynamics,”
Quantitative Marketing and Economics, 3, 107-144.
[13] Doganoglu, T. and D. Klapper (2006), “Goodwill and Dynamic Advertising Strategies,” Quantitative
Marketing and Economics, 4(1), 5-29.
[14] Dixit, A. (1987), “Strategic Behavior in Contests,” American Economic Review, 77(5), 891-898.
[15] FairVote.org (2012), “How the Electoral College Became Winner-Take-All,” by D. McCarthy.
[16] Freedman, P. and K. Goldstein (1999), “Measuring Media Exposure and the Effects of Negative Campaign
Ads,” American Journal of Political Science, 43(4), 1189-1208.
[17] Friedman, L. (1958), “Game-Theory Models in the Allocation of Advertising Expenditures,” Operations
Research, 6, 699-709.
[18] Garrett, T. A., and R. S. Sobel (2003), “The political economy of FEMA disaster payments,” Economic
Inquiry, 41, 496-509.
[19] Garrett, R. S. and L. P. Whitaker (2007), “Coordinated Party Expenditures in Federal Elections: An
Overview,” Congressional Research Service, The Library of Congress, Order Code RS22644.
[20] Goeree, M. S. (2008), “Limited Information and Advertising in the U.S. Personal Computer Industry,”
Econometrica, 76(5), 1017-1074.
[21] Gomez, B. T., T. G. Hansford, and G. A. Krause (2007), “The Republicans Should Pray for Rain:
Weather, Turnout, and Voting in U.S. Presidential Elections,” Journal of Politics, 69(3), 649-663.
[22] Gordon, B. R. and W. R. Hartmann (2013), “Advertising Effects in Presidential Elections,” Marketing
Science, 32(1), 19-35.
[23] Grofman, B. and S. L. Feld (2005), “Thinking About the Political Impacts of the Electoral College,”
Public Choice, 123, 1-18.
[24] Gelman, A., J. N. Katz, and F. Tuerlinckx (2002), “The Mathematics and Statistics of Voting Power,”
Statistical Science, 17(4), 420-435.
[25] Kawai, K. and Y. Watanabe (2013), “Inferring Strategic Voting,” American Economic Review, 103(2),
624-662.
[26] Lindbeck, A. and J. W. Weibull (1987), “Balanced-budget redistribution as the outcome of political
competition,” Public Choice, 52, 273-297.
[27] Liptak, A. (2012), “The Vanishing Battleground,” The New York Times, November 3.
[28] Lizzeri, A. and N. Persico (2001), “The Provision of Public Goods under Alternative Electoral Incentives,”
American Economic Review, 91(1), 225-239.
39
[29] Owen, G. (1975), “Evaluation of a Presidential Election Game,” American Political Science Review,
69(3), 947-953.
[30] Page, S. (2002), “Bush policies follow politics of states needed in 2004,” USA Today, June 16.
[31] Persson, T. and G. Tabellini (2009), “Electoral Systems and Economic Policy,” in D. A. Ritchie and
B. R. Weingast (Eds.), The Oxford Handbook of Political Economy, Oxford University Press.
[32] Pittman, C. (2004), “Everglades Plan Leads to Campaign Battle in Swamp,” St. Petersburg Times,
October 15.
[33] Reeves, A. (2011), “Political Disaster: Unilateral Powers, Electoral Incentives, and Presidential Disaster
Declarations,” Journal of Politics, 73, 1142-1151.
[34] Sahni, N. S. (2012), “Effect of Temporal Spacing between Advertising Exposures: Evidence from an
Online Field Experiment,” working paper, Stanford GSB.
[35] Sethuraman, R., G. J. Tellis, R. Briesch (2011), “How well does advertising work? Generalizations from
a meta-analysis of brand advertising elasticity,” Journal of Marketing Research, 48(3), 457-471.
[36] Shachar, R. (2009), “The Political Participation Puzzle and Marketing,” J. of Mkting Res., 46(6), 798-815.
[37] Shachar, R. and B. Nalebuff (1999), “Follow the Leader: Theory and Evidence on Political Participation,”
American Economic Review, 89(3), 525-547.
[38] Skaperdas, S. (1996), “Contest Success Functions,” Economic Theory, 7(2), 283-290.
[39] Snyder, J. (1989), “Election Goals and the Allocation of Campaign Resources,” Econometrica, 57(3),
637-660.
[40] Solomon, J., A. MacGillis, and S. Cohen (2007), “How Rove directed federal assets for GOP gains,”
Washington Post, August 19, p. A1.
[41] Strömberg, D. (2008), “How the Electoral College Influences Campaigns and Policy: The Probability of
Being Florida,” American Economic Review, 98(3), 769-807.
[42] Tullock, G. (1980), “Efficient Rent Seeking.” In J. M. Buchanan, R. D. Tollison, G. Tullock, (Eds.),
Toward a Theory of the Rent-Seeking Society. College Station, TX: Texas A&M University Press, 97-112.
[43] TVB (2013), “2012 Presidential Election Analysis,” accessed at http://www.tvb.org/media/file/
Drum-Roll-Political-2013.pdf.
40
Appendices for Online Publication
Appendix A: Advertising and Advertising Prices
We construct a market-candidate observed aggregate advertising level and advertising price
(Amj and wmj ) based on two observed variables. Expendituremjad is CMAG’s estimate of
the dollars spent by candidate j in market m on an advertisement a in daypart d. CP Pmd
is SQAD’s reported advertising price for the 18 and over demographic in market m during
daypart d. We use the CPP from the 3rd quarter of the election year.20
Let the daypart level of advertising by candidate j in market m be:
P
a∈Amjd Expendituremjad
GRPmjd =
P8
d=1 CP Pmd
where Atmjd is the set of advertisements for a candidate in a market and daypart. Then total
advertising by candidate j in market m is:
Amj =
8
X
GRPmjd .
d=1
The market-specific advertising price for candidate j is defined as follows:


CP Pmd GRPmjd
if Amj > 0
Amj
wmj =

CP Pm
if Amj = 0
where
CP Pm =
8
X
d=1
"
PM
m=1 GRPmjd
j=1
CP Pmd PJ PM P8
j=1
m=1
d=1 GRPmjd
PJ
#
.
In other words, we use a weighted average across the dayparts in which candidate j advertised
in market m if the candidate did in fact advertise there, or a weighted average based on both
candidates advertising in all markets within each daypart if the candidate did not advertise
in the market.
20
While the advertising primarily spans both September (3rd quarter) and October (4th quarter), it is
problematic using a separate cost for each quarter because a discontinuity in costs would be artificially be
generated on October 1. Furthermore, 4th quarter ad costs are likely not a good estimate of the true cost of
the ad because they include the holiday season.
41
The advertising price in our candidate-side estimation is ωmj = wmj + vmj where vmj is the
candidate’s market-specific unobservable component of advertising. (Recall that the SQAD
prices are forecasts) When we analyze the cost per marginal vote, we use CP Pm in all markets
to highlight the role of diminishing marginal effectiveness and political leaning in the costs of
acquiring an additional vote. Finally, when we solve the direct vote counterfactual, we use wmj
as the price of advertising. This avoids odd implications from large local residuals that likely
do not relate to costs, but retains a source of local variation in advertising. We remove both the
candidate and local market ad price variation in the final simulation by setting an equal price
P P
2
M
1
per thousand people (CPM) such that w̃mj =
CP
M
×
P
op
/100.
mj
j=1
m=1
2M
Appendix B: Computing the Marginal Effect of Advertising
We compute the marginal effect of advertising,
∂E[dj (A,ξ;θv )]
∂Amj
using a hybrid approach that
simulates all but one of the unobservables. The method is applicable to any problem where an
agent (or agents) exerts observed effort to pass a threshold while facing some unobservables.
The general objective function under a threshold goal for agent j ∈ J ≥ 1 choosing effort
allocations Aj ∈ RM
+ is
max πj (A; θ) = Rj · E [dj (A, η; θ)] − ω 0 A ,
Aj
where Rj ∈ R≥0 is the agent’s prize for success, A ∈ RJM
≥0 is the collection of effort choices
k
across agents, ω ∈ RM
≥0 is the cost of effort, and θ ∈ Θ is a set of structural parameters.
dj ∈ {0, 1} is the success function that determines whether agent j wins the prize, such that
E [dj ] is the probability of success that integrates over the unobservable η ∈ RJM with known
CDF Fη . An important component of dj is the scoring function sjm (A, η; θ) ∈ R≥0 , which
determines the agent’s score given all effort choices and the unobservables. For all m, we
assume sjm (A, η; θ) is strictly increasing in Ajm and ηjm and strictly decreasing in Akm and
ηkm , for k 6= j. These conditions ensure that s (A, η) is invertible in η, which is necessary for
the computational approach. We suppress θ for the rest of the exposition.
42
Single agent with scalar uncertainty. The simplest application involves a single agent
(J = 1) who chooses a single effort level A ≥ 0 (M = 1) with a scalar unobservable η in an
attempt to exceed threshold V . The success function is
d (A, η) = 1· s (A, η) > V .
The scoring function produces a value v = s(A, η), which is a random variable due to the
presence of η. Assume the scoring function can produce a value high enough to exceed the
threshold, such that s(A, η) ≥ V for some combination of A and η. Otherwise, the agent
is guaranteed to fail and she will not exert any effort. The probability of success can be
expressed a
E [d (A, η)] = Pr s (A, η) > V
= 1 − Fv s (A, η) ≤ V
,
where Fv (·) is the unknown CDF of the score v. Let s−1 (A, v) be the inverse of s(·, ·) through
the η argument. Given the known distribution function for η, we use a change-of-variables to
re-express the cumulative distribution Fv in terms of η,
Fv s (A, η) ≤ V
= Fη s−1 (A, v) ≤ s−1 A, V
= Fη η ≤ η ∗ A, V ,
,
(14)
(15)
where η ∗ ≡ η ∗ A, V = s−1 A, V is the critical value of the shock that equates the score
and the threshold, s (A, η ∗ ) = V . Thus, given a particular value of A , the marginal effect of
A on the probability of success is:
∂ 1 − Fη η ≤ η ∗ A, V
∂E [d (A, η)]
=
∂A
∂A
−1
∂s
A,
V
= −fη η ∗ A, V
.
∂A
If s−1 and ∂s−1 /∂A have closed forms, then knowledge of fη yields an analytic expression for
the marginal effect of effort.
43
Multiple markets and a competing agent. Our presidential election application involves an extension to a competing agent and multiple markets, where each market requires
an action and has an agent-market specific unobservable. The success function in such a
contest is:
dj (A, η) = 1·{sj (A, η) > sk (A, η)} .
The scoring function is
sj (A, η) =
X
smj (Am , ηm ) ,
m
where Am and ηm are those values relevant to market m across all agents. For a given market,
we can rewrite the success function in the following way:
(
)
dj (A, η) = 1· smj (Am , ηm ) − smk (Am , ηm ) >
X
snk (An , ηn ) − snj (An , ηn )
n6=m
= 1·{IMmj (Am , ηm ) > EMmj (A−m , η−m )}
Intuitively, an agent’s score relative to his opponent’s in market m must be greater than his
deficit across all other markets. We refer to these terms as the internal margin IMmj and
external margin EMmj . The inversion to obtain a critical η ∗ is now in terms of the internal
margin instead of a single agent’s score. Note that an inverse of IMmj through ηmj exists due
to our shape assumptions on s (A, η). The probability of success can now be expressed as
E [dj (A, η)] = Pr (sj (A, η) > sk (A, η))
= 1 − FIM (IMmj (Am , ηm ) ≤ EMmj (A−m , η−m ))
−1
−1
= 1 − Fη IMmj
(Am , smj − smk , ηmk ) ≤ IMmj
(Am , EMmj (A−m , η−m ) , ηmk )
−1
∗
= 1 − Fη ηmj ≤ IMmj
Am , ηmj
, ηmk
Given values for A−m and η−m , EMmj is just a scalar representing agent j’s deficit or
surplus outside market m. Thus the critical value of agent j’s shock in market m is the
∗
∗
∗
ηmj
≡ ηmj
(Am , EMmj , ηmk ) that sets IMmj Am , ηmj
, ηmk = EMmj (A−m , η−m ).
44
We combine this result with Monte Carlo simulation over the opponent’s shock inside
market m and the collection of shocks outside market m to produce an estimate of the
marginal effect of Ajm . The key steps are:
r
r
r
r
1. Simulate r = 1, . . . , N S draws over ηmk
and η−m
= η−nj
, η−nk
using the known
n6=m
distributions fηmj . In our application, these are N (0, σ 2 ), which can be generalized at
higher computational cost.
r
r
r
r
2. Use η−m
to calculate the JM values of EMmj
≡ EMmj
, given some particA−m , η−m
ular value A−m .
r∗
r∗
r
r
r
r∗
r
≡ ηmj
Am , EMmj
, ηmk
that set IMmj
Am , ηmj
, ηmk
=
3. Solve for the critical values ηmj
r
EMmj
for all m and j.
4. The estimate of the marginal effect of Amj is:
r
r
,
η
∂η
A
,
EM
1 X
∂E \
[d (A, η)]
m
mj
mk
r∗
=
− fηmj ηmj
.
∂Amj
NS r
∂Amj
(16)
−1
r∗
where ∂η(·)/∂Amj ≡ ∂IMmj
(·)/∂Amj and evaluated at ηmj = ηmj
.
The next two sections discuss the details of implementing this general form of the derivative
in the direct vote and Electoral College, respectively.
Application to a Direct Vote. The realities of an election place certain restrictions on
the empirical analogues of the score functions. Under a direct vote, equation (16) describes
the derivative if it is always feasible for the internal margin of votes to exceed the external
margin of votes in all markets. In practice, bounds on the internal margin are implied by
the number of voters in the market, giving us the following expression for the derivative in a
direct vote:
r
r
∂η
A
,
EM
,
η
∂E \
[d (A, η)]
1 X
m
mj
mk
r∗
r
=
− fηmj ηmj
1· Nm > EMmj
∂Amj
NS r
∂Amj
45
where Nm =
P
c∈Cm
r
Nc is the total number of voters in the market. If Nm < EMmj
, then
r∗
there does not exist a ηmj
to equate the internal and external margins and the derivative at
the rth draw is zero.
Application to the Electoral College. The Electoral College introduces an added layer
of complexity because of the state-level contests. We need to introduce additional notation
corresponding to multiple markets intersecting a single state and markets intersecting multiple
states. Let Ms(m) be the set of all markets intersecting state s (m), where m is the focal
market for the derivative. Let Cms denote the set of counties intersecting both state s and
market m. Finally, let S̃ (m) denote the set of states that market m intersects.
The Electoral College involves two relevant thresholds for winning an election: (i) the
electoral vote margin and (ii) the state-level popular vote margin. For a given r, the derivative
is non-zero only if the electoral votes attainable through advertising in market m are greater
than the electoral vote deficit implied by the shocks in all other markets:
X
Es >
X
Es0 k (ηsr0 ) − Es0 j (ηsr0 ) .
s0 ∈
/ S̃(m)
s∈S̃(m)
If a market is “in play” based on the electoral votes, then the state-level margin can be
separated into an internal and external margin as above. If a market intersects multiple
states there is a potentially relevant internal and external margin for each:
rs
IMmj
=
X
Nc (scj (Am , ηmj , ηmk ) − sck (Am , ηmj , ηmk ))
c∈Cms
rs
EMmj
=
X
X
Nc (sck (Am , ηn ) − scj (Am , ηn )) .
n∈Ms(m) \m c∈Cns
∗rs
We therefore calculate an ηmj
, as described above, for each state s ∈ S̃ (m). Next, we define
∗r
the relevant critical value, ηmj
, to be the smallest of these shocks that yields enough electoral
votes to offset the electoral vote deficit implied by the rth set of draws.
46
The derivative for the rth draw is therefore:

r
r

r∗ ∂η (Am ,EMmj ,ηmk )


−fηmj ηmj

∂A
mj


∂E [d (A, η)] r 
=

∂Amj





0
r
> EMmj
P
P
and
Es >
Es0 k (ηsr0 ) − Es0 j (ηsr0 ) .
if
P
c∈Cms Nc
s∈S̃(m)
s0 ∈
/ S̃(m)
otherwise
and the overall derivative is
∂E \
[d (A, η)]
1 X ∂E [d (A, η)] r
=
.
∂Amj
NS r
∂Amj
Note that the above characterizes the marginal effect of effort in a contest with a general
contest success function. One primary focus of the theoretical contest literature has been
on the derivation of analytically tractable success functions (Skaperdas, 1996). In practice,
contests such as elections have their own specific success functions implying CDFs for the
probability of success that may inherently not be analytically tractable. We show that the
above approach is beneficial by compressing a large multidimensional integration problem
into unidimensional external and internal margins.
Appendix C: Comparison of Voter Turnout in the Electoral College and the Direct Vote
Turnout in the direct vote increases by 1.3%, or about 2 million voters. The popular vote in
four states—Iowa, New Mexico, Oregon, and Wisconsin, all with thin margins—flips from
Gore to Bush. Gore, however, gains enough votes in the Democratic stronghold of California
to win the election even though his national vote margin shrinks from about 543,000 to
494,000.
An important distinction between the Electoral College and a direct vote is a state’s
relative influence in the election outcome. Under the Electoral College, a state’s influence
is fixed and proportional to its fraction of the total electoral votes.21 The Electoral College
21
The Constitution specifies the number of a state’s electoral votes as equal to its number of Senators (two)
plus its number of Representatives (proportional to its Census population). This allocation implies that each
47
essentially protects states from political losses if a state implements policies that make
it more difficult or disqualifies certain voters from casting their votes. Furthermore, the
winner-take-all rule gives partisan members of a state’s government strong motivation to
influence voter turnout to favor their own political party (as witnessed recently in the form
of voter identification and anti-voter fraud laws proposed in many states).
In contrast, in a direct vote, a state’s relative influence in the election outcome is
endogenous—it is proportional to the percent of its population that turns out to vote
relative to national voter turnout. Figure 9 depicts the difference in representation of a state
between each electoral mechanism and the representation that their population constitutes
as percentage of the US population over age 18. States are ordered on the left axis by
increasing size of their voting age population. On the top, the series of positive bars reflect the
electoral college’s protection of small states. On the bottom, large states such as California,
Texas and Florida are under-represented in both the electoral college and a direct vote.
Under-representation in the direct vote arises from a smaller fraction of the state’s voting
age population actually voting. Other states such as Georgia, Arizona and Nevada also are
under-represented in a direct vote. Minnesota, Wisconsin, Michigan and Ohio are however
over-represented in a direct vote. A direct vote therefore eliminates both the electoral college’s
protection of small states and the tie in to state population size, as a state is now represented
only by its voters turning out for the election.
elector in a small state represents fewer voters compared to larger states: as of 2008, each of Wyoming’s three
electoral votes represented about 177,000 voters, compared to 715,000 for each of the 32 electors in Texas.
48
Figure 9: States’ Election Influence under the Electoral College and Direct Vote:
The horizontal axis reports the difference between a state’s relative influence in the election outcome
under a particular electoral system relative to the state’s voting-age population. Under the Electoral
College, a state’s influence is its number of electoral votes divided by the total number of electoral
votes in the country. Under a direct vote, a state’s influence is its voter turnout divided by the total
voter turnout in the country. Bars to the left of zero indicate that a state has less influence under
that system relative to its share of the total voting-age population. States are sorted from top to
bottom in order of ascending population.
Electoral College
Direct Vote
Wyoming
Alaska
District of Col
Vermont
North Dakota
South Dakota
Delaware
Montana
Rhode Island
Hawaii
Idaho
New Hampshire
Maine
Nebraska
New Mexico
West Virginia
Nevada
Utah
Kansas
Arkansas
Mississippi
Iowa
Oklahoma
Connecticut
Oregon
South Carolina
Kentucky
Colorado
Louisiana
Alabama
Minnesota
Arizona
Maryland
Wisconsin
Missouri
Tennessee
Washington
Indiana
Massachusetts
Virginia
Georgia
North Carolina
New Jersey
Michigan
Ohio
Illinois
Pennsylvania
Florida
New York
Texas
California
-0.02
-0.015
-0.01
49
-0.005
0
0.005
Difference Between Representation and Share of Population over 18
0.01