The Marginal Cost of Public Funds is One at the Optimal Tax System

The Marginal Cost of Public Funds is One
at the Optimal Tax System∗
Bas Jacobs∗∗
August 22, 2016
Abstract
This paper develops a Mirrlees (1971) framework with skill and preference heterogeneity to
analyze optimal linear and non-linear redistributive taxes, optimal provision of public goods
and the marginal cost of public funds (MCF). It is shown that MCF equals one at the optimal tax system, for both lump-sum and distortionary taxes, for linear and non-linear taxes,
and for both income and consumption taxes. By allowing for redistributional concerns, the
marginal excess burden of distortionary taxes is shown to be equal to the marginal distributional gain at the optimal tax system. Consequently, the modified Samuelson rule should
not be corrected for the marginal cost of public funds. Outside the optimum, the marginal
cost of public funds for both non-distortionary and distortionary taxes can be either smaller
or larger than one. The findings of this paper have potentially important implications for
applied tax policy and social-cost benefit analysis.
Keywords: marginal cost of funds, marginal excess burden, optimal taxation, optimal redistribution, optimal provision of public goods, Samuelson rule
JEL-codes: H20, H40, H50.
∗
A previous version of this paper circulated under a slightly different title (Jacobs, 2010). I thank the Dutch
Ministry of Economic Affairs and the Netherlands Organisation for Scientific Research (Vidi-grant No. 452-07013) for financial support. I am very grateful to Ruud de Mooij, whose contributions to this paper have been
invaluable. Parts of this paper have been written when I was a guest researcher at the Center for Fiscal Studies, and I thank Uppsala University for its hospitality. Numerous discussions over the years with many people
have contributed to the ideas that are developed in this paper. I would like to thank Geir Bjertnaes, Sören
Blomquist, Robin Boadway, Lans Bovenberg, Vidar Christiansen, Josse Delfgaauw, Casper van Ewijk, Aart Gerritsen, James Heckman, Martin Hellwig, Laurence Jacquet, Carl Koopmans, Etienne Lehmann, Joana Pereira,
Rick van der Ploeg, Agnar Sandmo, Erik Schokkaert, Dirk Schindler, Paul Tang and Floris Zoutman in particular. Furthermore, I thank seminar participants at the Dutch Ministry of Economic Affairs, CPB Netherlands
Bureau for Economic Policy Analysis, NHH-Bergen, Uppsala University, University of St. Gallen, Tinbergen
Institute, Erasmus University Rotterdam, Max Planck Institute Bonn, and Statistics Norway for their comments
and suggestions. In addition, valuable comments were made by participants at the following congresses: ‘Taxation, Public Goods Provision and the Future of the Nordic Welfare Model’, Helsinki, June 15, 2009; Congress of
the European Economic Association, Barcelona, August 24, 2009; the 6th CESifo Norwegian-German Seminar
on Public Economics, Munich, November 13-14, 2009; and the 67th Congress of the International Institute for
Public Finance, Ann Arbor, August 7–11, 2011. Naturally, any remaining errors are my own.
∗∗
Bas Jacobs, Erasmus University Rotterdam, Tinbergen Institute, and CESifo. Address: Erasmus
School of Economics, Erasmus University Rotterdam, PO Box 1738, 3000 DR Rotterdam, The Netherlands. Phone: +31–10–4081452/1441. Fax: +31–10–4089161. E-mail: [email protected]. Homepage:
http://people.few.eur.nl/bjacobs.
1
Pigou (1947, p.34): “Where there is indirect damage, it ought to be added to
the direct loss of satisfaction involved in the withdrawal of the marginal unit of
resources by taxation, before this is balanced against the satisfaction yielded by
the marginal expenditure. It follows that, in general, expenditure ought not to be
carried out so far as to make the real yield of the marginal unit of resources expended
by the government equal to the real yields of the last unit left in the hands of the
representative citizen.”
1
Introduction
The marginal cost of public funds is the ratio of the marginal value of a unit of resources raised
by the government and the marginal value of that unit of resources in the private sector. The
marginal cost of public funds is therefore a measure indicating the scarcity of public resources.
Ever since Pigou (1947) many scholars and policymakers are convinced that the marginal cost
of public funds must be larger than one, since the government relies on distortionary taxes
to finance its outlays. If the marginal cost of public funds is indeed larger than one, this has
important normative consequences for the determination of optimal public policy in many fields.
In early theoretical contributions, Stiglitz and Dasgupta (1971) and Atkinson and Stern
(1974) demonstrated that the Samuelson (1954) rule for the optimum provision of public goods
needs to be modified to account for tax distortions.1 The optimal level of public goods provision
should be lower, and the optimal size of the government should thus be smaller, if the marginal
cost of public funds is higher. Many applied cost-benefit analyses multiply the cost of public
projects with a measure for the marginal cost of public funds that is larger than one. As a result,
public projects are less likely to pass a cost-benefit test. For example, Heckman et al. (2010)
evaluate the Perry Preschool Program and add 50 cents per dollar spent to account for the
deadweight costs of taxation. Sandmo (1975) and Bovenberg and de Mooij (1994) demonstrate
that the optimal corrective tax is generally set below the Pigouvian tax, since a high marginal
cost of public funds raises the cost of internalizing of externalities. Hence, governments should
pursue less ambitious environmental policies if taxation is more costly. Many other examples
can be given, but the message is clear: the marginal cost of public funds has a tremendous
impact on how governments should evaluate the desirability of public policies.2
This paper challenges the conventional wisdom that the marginal cost of public funds is
necessarily larger than one. It does so by explicitly introducing distributional concerns to
motivate tax distortions. Most of the literature, on which the analysis of the marginal cost
of public funds is based, has focused on Ramsey (1927) frameworks with homogeneous agents
where redistributional concerns are absent by assumption see, e.g., Browning (1976), Browning
(1987), Wildasin (1984), and Ballard and Fullerton (1992).3 However, in this context Sandmo
1
Ballard and Fullerton (1992), Dahlby (2008), and Jacobs (2009) provide extensive reviews of the large literature that subsequently emerged.
2
Other examples include Laffont and Tirole (1993), who put the marginal cost of public funds at the center
of their theories on optimal procurement and regulation. A larger cost of public funds renders rent extraction
more valuable compared to provision of cost-reducing incentives. . Similarly, Barro (1979) and Lucas and Stokey
(1983) derive that the optimal path of public debt ensures that tax rates are smoothed over time. Consequently,
debt policy is aimed at equalizing the marginal cost of public funds over time.
3
Some authors have allowed for heterogeneous agents, see, for example, the contributions by Browning and
2
(1998, pp. 378-379) notes the following:
“The observation that many discussions of the marginal cost of public funds are
based on theoretical models where distributional issues have been assumed away is
somewhat paradoxical, since the distortionary effects of taxation, on which thinking
around the MCF is based, can only be justified from a welfare economics point of
view by their positive effects on the distribution of income.”
A whole literature has thus analyzed the marginal cost of public funds for distortionary taxes,
without paying attention to the ultimate reasons why there are distortionary taxes in the first
place.
This paper contributes to the literature on the marginal cost of public funds in a number of
ways. First, it follows the literature on optimal income taxation – pioneered by Mirrlees (1971)
– so as to provide a proper microeconomic foundation for analyzing tax distortions. Individuals
are heterogeneous with respect to their earning ability (or skill level). The government cannot achieve any desired distribution of welfare due to informational constraints, since earning
abilities are private information. Therefore, individualized lump-sum taxes are not incentivecompatible, and the government needs to rely on taxing observable income (or consumption),
which results in labor supply distortions. Distributional concerns in combination with informational constraints thus provide the fundamental reason why distortionary taxation is optimal
in second-best settings. Second, non-distortionary, non-individualized lump-sum taxes are not
ruled out to obtain a non-trivial second-best policy problem. Non-individualized lump-sum
taxes (or transfers) are incentive compatible, and, therefore, part of the instrument set of the
government. It is shown that non-distortionary transfers (‘demogrant’) are always part of the
optimal tax schedule. Moreover, a non-distortionary general tax exemption or general tax credit
is part of virtually all tax systems in the real world.
The main finding of this paper is that, at the optimal tax system, the marginal cost of
public funds is equal to one for all tax instruments. The marginal cost of public funds for
the non-distortionary non-individualized lump-sum tax equals one at the optimal tax system.
This comes at no surprise, since non-individualized lump-sum taxes neither feature distortions
nor distributional benefits. Moreover, the marginal cost of public funds for distortionary taxes
should be one as well, since it should be equal to the marginal cost of public funds for nondistortionary taxes. Intuitively, the excess burden of distortionary taxation should be exactly
compensated by the social marginal redistributional benefits of taxation.
To demonstrate how allowing for heterogeneous agents and non-individualized lump-sum
taxes drive our main findings, this paper also analyzes the special case in which the government
cannot optimize lump-sum transfers. Even then it is not correct to conclude that the marginal
costs of public funds is necessarily larger than one. The marginal cost of public funds for distortionary taxes can either be larger or smaller than one depending on whether the distributional
Johnson (1984) and Allgood and Snow (1998), but these studies focus mainly on the efficiency costs of taxation.
Others have explicitly introduced distributional aspects of public goods and taxes, see, e.g., Christiansen (1981),
Wilson (1991), Boadway and Keen (1993), Kaplow (1996), Sandmo (1998), Slemrod and Yitzhaki (2001), and
Dahlby (2008).
3
gains are smaller or larger than the excess burden of the distortionary tax. The representativeagent models are nested as a special case of the model without non-individualized lump-sum
taxes, where all distributional benefits of taxation are assumed to be zero. Only in that special
case one can unambiguously conclude that the marginal cost of public funds is larger than one,
since all available tax instruments only cause distortions, but yield no redistributional gains.
This case, however, is of no practical relevance, since individuals are not identical in the real
world.
Ramsey (1927) frameworks with representative agents thus produce misleading results on
the marginal cost of public funds by excluding the reasons for distortionary taxation from the
analysis and eliminating the non-individualized lump-sum tax from the government instrument
set. In particular, the government is forced to use distortionary taxes in a setting whereas all
individuals would prefer the government to levy non-distortionary (non-individualized) lumpsum taxes instead. If tax system is optimized, marginal distributional benefits of distortionary
taxes are equal to marginal deadweight losses of distortionary taxes. The marginal cost of
public funds is therefore not a measure for the social cost of taxation, since it also includes the
distributional gains of taxation. Indeed, the social cost of distortionary taxation is properly
measured by standard measures for the marginal excess burden of taxation.
The findings of this paper have important consequences for applied public policy. In particular, tax distortions as such should not lower public good provision. The marginal excess
burden of distortionary taxation is the price of redistribution, and not the marginal cost of
public funds, i.e., the price of government activities that exclude redistributional tasks. In
particular, if the government would not be interested in redistribution at all, then all distortionary taxes would be zero, and all public goods would be financed with non-distortionary
non-individualized lump-sum taxes. Moreover, there should be no corrections for distortionary
taxes in social cost-benefit analysis of public projects if taxes are optimized. Alternatively,
there should be corrections for both the marginal excess burden and the marginal distributional
benefit of distortionary taxation if tax systems are not optimized. This policy prescription goes
against practices in social cost-benefit analysis, where deadweight costs of taxation are included
at the cost side, but the distributional benefits of taxation are ignored at the benefit side. These
practices are bound to yield policy errors, because many public projects can now fail the social
cost-benefit test, whereas they are in fact socially desirable. Jacobs and de Mooij (2015) apply the concepts from this paper to demonstrate that by allowing for distributional concerns,
the optimal corrective tax to internalize consumption externalities should no longer set below
the Pigouvian tax. The distributional benefits of distortionary taxes cancel out against their
deadweight losses. Hence, it is not desirable to pursue less aggressive environmental policies
even in the presence of distortionary taxation, as Sandmo (1975) and Bovenberg and de Mooij
(1994) suggested. Finally, Werning (2007) finds that the optimal path of public debt becomes
indeterminate and Ricardian equivalence is restored under distortionary taxation. Intuitively,
tax distortions are compensated by distributional gains. As a result, the optimal path of public
debt can no longer be determined by smoothing out tax distortions over time.
The literature on the marginal cost of public funds has generated substantial confusion,
see also the reviews in Ballard and Fullerton (1992), Dahlby (2008), and Jacobs (2009). In
4
particular, earlier literature has not settled down on a commonly agreed definition for the
marginal cost of public funds.4 Moreover, for the most regularly used definition, e.g. in Atkinson
and Stern (1974), Ballard and Fullerton (1992), and Sandmo (1998), the marginal cost of
public funds for lump-sum taxes is generally not equal to one, whereas there is no theoretical
presumption that it should differ from one, given that lump-sum taxes are non-distortionary
and produce no distributional benefits.5 It remains unclear how measures for the marginal cost
of public funds relate to the marginal excess burden of taxation, even though this relationship
is often suggested by, e.g., Pigou (1947), Harberger (1964), and Browning (1976). Indeed, in
the absence of distributional concerns, one would expect a direct correspondence between the
marginal cost of public funds and the deadweight loss of taxation. Finally, standard measures
for the marginal cost of public funds are highly sensitive to the choice of the untaxed numéraire
good – even if allocations are identical under each normalization. Many authors have tried to
remedy these issues, but as of today applied policy analysts are still confronted with a host of
unsettled definitional and conceptual problems in the literature on the marginal costs of public
funds, which seriously limits its practical applicability.
Clearing up the morass of conceptual problems in the literature is a prerequisite before a
meaningful analysis of the marginal cost of public funds can be made. To that end, this paper
defines the marginal cost of public funds (M CF ) as the ratio of the social marginal value of
public income and (the average of) the social marginal value of private income, i.e., M CF ≡
∆ social welfare 1 unit public resources
∆ social welfare 1 unit private resources , where Diamond (1975)’s definition of the
of private income is used.6 This paper demonstrates that most conceptual
social marginal value
problems in the liter-
ature originate from employing the private marginal value of private income as a measure for the
social marginal value of private income, i.e., M CF ≡
∆ social welfare 1 unit public resources 7
∆ private welfare 1 unit private resources .
How-
ever, the private marginal social value of private income does not measure the social marginal
value of private income due to policy-induced income effects on taxed bases, which the Diamondbased measure takes into account. The Diamond-based measure for the marginal cost of public
funds resolves most problems in the literature. First, the marginal cost of public funds is equal
to one if lump-sum taxes are optimized. Moreover, a direct correspondence is obtained between
4
On the one hand, the so-called Pigou-Harberger-Browning approach (also called ‘differential analysis’) relates
the marginal cost of public funds to the marginal excess burden of taxation, which is determined by the compensated tax elasticity of earnings, see also Pigou (1947), Harberger (1964), Browning (1976), and Browning (1987).
On the other hand, the Atkinson-Stern-Ballard-Fullerton approach (also called ‘balanced-budget approach’) bases
the marginal cost of public funds on the ratio of the marginal value of income of the public and the private sector
(i.e., the ratio of Lagrange multipliers on the government budget constraint and private budget constraint). In
that case, the uncompensated tax elasticity of earnings supply determines the marginal cost of funds, see also
Atkinson and Stern (1974) and Ballard and Fullerton (1992). By using the latter approach to the marginal cost
of funds Sandmo (1998) and Slemrod and Yitzhaki (2001) include distributional aspects of distortionary linear
taxation, while Gahvari (2006) extends it to non-linear taxation. Using models with representative agents, Triest
(1990), Håkonsen (1998) and Dahlby (1998) develop yet another MCF concept relying on correcting the standard
MCF measures for the shadow value of public resources before and after the introduction of distortionary taxes.
Up to this date, it remains unclear which MCF measure should be used in applied policy analysis.
5
Lump-sum taxes have no distributional benefits in optimal tax models, because the covariance between social
welfare weights and lump-sum taxes is zero.
6
This paper is related to an unpublished paper by Lundholm (2005) that also analyzes optimal second-best
public goods provision under optimal linear income taxation using the Diamond approach to the social marginal
value of income.
7
The conceptual problems remain if the private marginal value of private income is multiplied by a social
welfare weight to capture a social preference for redistribution.
5
the marginal cost of public funds and the marginal excess burden of taxation if distributional
concerns are absent. In particular, the marginal cost of public funds of a distortionary tax then
equals the inverse of one minus the excess burden of the tax. Finally, the marginal cost of public
funds measures have become insensitive to the normalization of the tax code.
This paper also contributes to the literature on optimal non-linear taxation and public
goods provision, as in Christiansen (1981), Kaplow (1996) and Gahvari (2006), that assumed
identical utility functions for all agents. This paper allows for fully general preference structures,
which can be dependent on skill levels.8 The optimal non-linear tax structure and modified
Samuelson rule for public goods provision are derived under non-linear income taxation in
terms of sufficient statistics as in Saez (2001): earnings, willingness to pay for public goods,
behavioral elasticities, and social welfare weights. The modified Samuelson rules under optimal
linear and non-linear income taxation are shown to be nearly identical, and do not contain
corrections for the marginal cost of public funds. Moreover, when preference heterogeneity is
allowed for, public goods provision under optimal non-linear income taxation is also determined
by distributional concerns, which is not the case with identical preferences. In particular, if the
marginal willingness to pay for the public good declines with earning ability, the government
will provide more public goods, compared to the first-best Samuelson rule for public goods
provision. The reverse reasoning holds if the willingness to pay for public goods increases with
earning ability.9
The contributions of this paper are, furthermore, related to Kaplow (1996), Kaplow (2004),
Laroque (2005) and Gauthier and Laroque (2009). These authors show that, if preferences are
identical across individuals and weakly separable between labor supply and other commodities,
the government can design benefit-absorbing changes in the non-linear tax schedule that perfectly offset the distributional impact of public good provision without changing labor supply
incentives. Consequently, the benefit-absorbing tax change perfectly imitates a pure benefit
tax. Standard first-best policy rules can be applied in second-best, and no corrections for the
marginal cost of public funds need to be made. In contrast, this paper shows that the marginal
cost of public funds should never play a role in second-best policy rules, as long as the income
tax is optimized.10 This result holds true without changing the entire tax schedule to neutralize
the distributional impact of the public good, without imposing weakly separable or identical
utility functions, and without assuming that the tax system is non-linear.
The remainder of this paper is structured as follows. Section 2 introduces the model. Section
3 is devoted to optimal taxation, optimal provision of public goods, and the marginal cost of
8
This paper is related to Boadway and Keen (1993) who developed a 2-type Stiglitz (1982) model with
preference heterogeneity. Most of their applications, however, assume identical preferences. This paper fully
traces the implications of preference heterogeneity on optimal non-linear taxes and public goods provision in the
continuous-type Mirrlees (1971) model. Moreover, we express all tax rules in terms of sufficient statistics. Similarly, Kreiner and Verdelin (2012) explore tax reforms and public good provision with heterogeneous preferences
under sub-optimal non-linear income taxation, whereas the current paper analyzes optimal taxation and optimal
public-good provision.
9
These findings extend the earlier results of Mirrlees (1976) and Saez (2002) on optimal commodity taxation
with preference heterogeneity to optimal public goods provision with preference heterogeneity.
10
Naturally, second-best interactions of public good provision with labor supply can enter the Samuelson rule,
see also Atkinson and Stern (1974). Moreover, the fact that the policy rule is not affected by the marginal cost
of public funds does not imply that the policy is invariant to taxation. As the second-best allocation will be
different from the first-best allocation, the various elements of the policy rule will be different as well.
6
public funds under linear tax instruments. Section 4 discusses optimal taxation and public
goods provision under non-linear instruments. Section 5 discusses the results and Section 6
concludes. An Appendix contains a long proof of a Proposition.
2
Model
The model consists of heterogeneous individuals optimally supplying labor on the intensive
margin and a benevolent government optimally setting taxes and public goods.11 Without loss
of generality, a partial-equilibrium setting is assumed in which prices are fixed, so that firms
can be ignored.12 Attention is initially focused on the optimal setting of linear tax instruments.
Later, optimal non-linear taxation is explored. Real-world tax systems are neither completely
flat, nor completely non-linear. Hence, it is useful to explore both cases.
2.1
Individuals
There is a mass N of individuals that differ by a single-dimensional parameter n ∈ N = [n, n],
where the upper bound n could be infinite. n is the individual’s earning ability (‘skill level’),
which equals the productivity per hour worked. The density of individuals of type n is denoted
R
by f (n) and the cumulative distribution function by F (n). Note that N N f (n)dn = N .
Each individual n derives utility u(n) from consumption c(n) and pure public goods G.
Furthermore, it derives disutility from supplying labor l(n). Each individual has an endowment
of time, which is allocated between leisure and working. Consumption and leisure are both
assumed to be normal goods. This paper generalizes the analysis of optimal taxation and
public goods provision by allowing for preference heterogeneity: the utility function depends
on the skill level n. The utility function is strictly (quasi-)concave, and is twice continuously
differentiable:
u(n) ≡ u(c(n), l(n), G, n),
uc , −ul , uG > 0,
∀n.
(1)
Subscripts denote partial derivatives. The Spence-Mirrlees conditions are imposed on the utility
function so as to guarantee the implementability of the non-linear policies derived in the next
section (see also Lemma 2 below).
The government employs a tax schedule consisting of a linear tax rate t on gross labor
earnings z(n) ≡ nl(n), a linear tax rate τ on consumption goods c(n), and a non-individualized
lump-sum transfer g. The informational requirements for employing linear taxes are that the
government should be able to verify aggregate labor income or consumption. Note that in
Mirrlees (1971)-analyses non-individualized lump-sum transfers are always part of the instrument set of the government, since the government can always provide each individual with an
11
It is based on earlier contributions by Ramsey (1927), Mirrlees (1971), Diamond and Mirrlees (1971), Sheshinski (1972), Diamond (1975), Christiansen (1981), Boadway and Keen (1993), Kaplow (1996) and Sandmo (1998).
12
Jacobs (2010) analyzes the model in general-equilibrium settings allowing for a representative firm operating
a constant-returns-to-scale technology and none of the results change. Almost all of the papers in the literature fix
the marginal rates of transformation between all commodities at one. Hence, all prices are constant, and allowing
for general equilibrium provides no additional insights, see previous footnote. Moreover, our partial-equilibrium
results fully generalize to general-equilibrium settings with non-constant prices, since optimal second-best tax
rules in general equilibrium are identical to the ones in partial equilibrium as long as there are constant returns
to scale in production and all labor types are perfect substitutes, see also Diamond and Mirrlees (1971).
7
equal amount of resources. The individual budget constraint states that net expenditures on
consumption are equal to net labor earnings:
(1 + τ )c(n) = (1 − t)z(n) + g,
∀n.
(2)
One tax instrument is redundant, since the consumption tax is equivalent to the income tax.
Thus, without loss of generality, one tax instrument can always be normalized to zero.
The individual maximizes utility (1) subject to its budget constraint (2). This yields the
standard first-order condition for labor supply:13
−ul (c(n), l(n), G, n)
(1 − t)n
=
,
uc (c(n), l(n), G, n)
1+τ
∀n.
(3)
Taxation is distortionary as it drives a wedge between the marginal social benefits (n) and the
marginal private benefits ((1 − t)n/(1 + τ )) of an increase in labor supply. Higher levels of
taxation (income or consumption taxes) reduce the price of leisure in terms of consumption and
individuals substitute leisure for consumption.
Indirect utility of individual n can be written as: v(n) ≡ v(t, τ, g, G, n) ≡ u(ĉ(n), ˆl(n), G, n),
where hats denote the optimized values for consumption and labor supply. Straightforward
application of Roy’s identity yields the following properties of v(·):
−λ(n)c(n),
∂v(n)
∂g
= λ(n) and
∂v(n)
∂G
=
λ(n) uuGc ,
∂v(n)
∂t
= −λ(n)z(n),
∂v(n)
∂τ
=
where λ(n) is the private marginal utility of
income of individual n. Given that preferences depend on the skill level n, the private marginal
utility of income is not necessarily non-increasing in n. To ensure that a well-defined redistribution problem is obtained, it is assumed that
∂λ(n)
∂n
≤ 0. This inequality always holds if
preferences are identical for all individuals due to the assumptions on the derivatives of the
utility function.
Some additional notation is introduced to express the optimal policy rules in terms of
well-known elasticity concepts. In particular, the compensated, uncompensated and income
elasticities of labor supply and consumption demand with respect to the tax rates and the
∂lu (n) 1−t
∂lc (n) 1−t
∂l(n) G
c
u
c
∂t l(n) , εlt ≡
∂t l(n) < 0, εlG ≡ ∂G l(n) , εlG ≡
u
∂cc (n) 1+τ
∂cu (n) G
c
u
0, εucτ ≡ ∂c∂τ(n) 1+τ
∂τ c(n) , εcG ≡
∂G c(n) ,
c(n) , εcτ ≡
∂c(n)
τ ) ∂g > 0, where the superscript u (c) denotes an un-
public good are denoted by: εult ≡
∂lc (n) G
∂G l(n) , εlg ≡
c
G
εccG ≡ ∂c∂G(n) c(n)
,
(1 − t)n ∂l(n)
<
∂g
and εcg ≡ (1 +
compensated (compensated) change. In addition, the Slutsky equations will be employed to
c
c
∂lu (n)
∂cu (n)
= ∂l∂t(n) − z(n) ∂l(n)
= ∂c∂τ(n) − c(n) ∂c(n)
∂t
∂g ,
∂τ
∂g ,
∂lu (n)
∂lc (n)
∂cc (n)
uG ∂l(n) ∂cu (n)
uG ∂c(n)
∂G = ∂G + λ(n) ∂g ,
∂G = ∂G + λ(n) ∂g . Rewriting the Slutsky equations yields
u
c
u
c
εlt = εlt − εlg , εcτ = εcτ − εcg , εclG = εulG − ωωGl εlg and εclG = εulG − ωωGc εlg , where ωG ≡ uGuG ,
ωl ≡ −ulul(n) and ωc ≡ uc c(n)
denote the shares of public goods, labor and consumption in utilu
simplify the optimal tax expressions:
ity. In the remainder of the paper, a bar is used to indicate an income-weighted elasticity, e.g.,
R
R
−1
ε̄clt ≡ N εclt z(n)dF (N ) N z(n)dF (n) .
13
Under linear instruments, first-order conditions are both necessary and sufficient given the restrictions on
the derivatives of the utility function. Under non-linear policies, however, budget constraints are non-linear and
additional assumptions, i.e., monotonicity and Spence-Mirrlees conditions, are required to ensure that first-order
conditions are both necessary and sufficient, see also section 4.3.
8
2.2
Government
The social objectives is a Samuelson-Bergson social welfare function, which is a sum of concave
individual utilities:
Z
Ψ(u(n))dF (n),
N
Ψ0 (u(n)) > 0,
Ψ00 (u(n)) ≤ 0.
(4)
N
Maximizing social welfare under a Utilitarian criterion (Ψ0 = 1) implies a social preference for
redistribution if the private marginal utility of income λ(n) declines with skill n at the individual
level. However, due to the concavity of Ψ(·) the government may exhibit a stronger preference
for redistribution than individuals do.14
The government budget constraint states that total tax revenues equal spending on transfers
and public goods:
Z
(tz(n) + τ c(n))dF (n) = N g + pG.
N
(5)
N
where p denotes the constant marginal rate of transformation (i.e., the price) of public goods
in terms of private goods.
3
Optimal linear taxation and public goods provision
The government maximizes social welfare subject to its budget constraint by choosing the nonindividualized lump-sum transfer g, the tax rate on income t or consumption τ , and the level
of public goods provision G. Optimal policies are derived under both tax normalizations (i.e.,
either consumption or income remains untaxed). The social marginal value of one unit of public
resources is denoted by η. The Lagrangian for maximizing social welfare is given by:
Z
L≡N
Z
Ψ(v(n))dF (n) + η N
(tnl(n) + τ c(n))dF (n) − N g − pG .
N
The first-order conditions for a maximum are:
Z ∂L
∂l(n)
∂c(n)
0
=N
Ψ (v(n))λ(n) − η + ηtn
+ ητ
dF (n) = 0,
∂g
∂g
∂g
N
Z ∂l(n)
∂c(n)
−Ψ (v(n))λ(n)nl(n) + ηnl(n) + ηtn
+ ητ
dF (n) = 0,
∂t
∂t
N
Z ∂L
∂l(n)
∂c(n)
0
=N
−Ψ (v(n))λ(n)c(n) + ηc(n) + ηtn
+ ητ
dF (n) = 0,
∂τ
∂τ
∂τ
N
Z ∂L
∂l(n)
∂c(n)
0
=N
Ψ (v(n))uG + ηtn
+ ητ
dF (n) − pη = 0,
∂G
∂G
∂G
N
∂L
=N
∂t
(6)
N
0
(7)
(8)
(9)
(10)
where the derivatives of indirect utility have been used in each first-order condition.15
14
Alternatively, exogenously given Pareto weights δ(n) can be used, i.e., Ψ(u(n)) ≡ δ(n)u(n), where Ψ0 = δ(n)
and Ψ00 = 0. None of the results below would be affected by using this social welfare function instead.
15
The first-order conditions are necessary, but may not be sufficient to characterize the optimum (Diamond
and Mirrlees, 1971). In the remainder it’s assumed that the second-order conditions for maximization of social
welfare are always respected.
9
3.1
Marginal cost of public funds – The standard approach
This section first derives optimal policies using the standard measure for the marginal cost
of public funds, see, e.g., Wilson (1991), Ballard and Fullerton (1992), Sandmo (1998), and
Gahvari (2006). To that end, the following definitions are introduced: marginal social value
of private income, the marginal cost of public funds, and the Feldstein (1972) distributional
characteristics of the tax bases and public goods.
Definition 1 The social marginal value of one unit of private income accruing to individual n
equals
αs (n) ≡ Ψ0 (u(n))λ(n),
∀n.
(11)
According to the traditional definition, the social marginal value of private income is captured
by the mechanical increase in social welfare of transferring one unit of resources to individual
n. Utility rises with λ(n), and this increase in utility is valued by the government by Ψ0 (u(n)).
In the remainder of this section the superscript s is used to denote variables are derived under
the standard approach.
Definition 2 The marginal cost of public funds based on the standard measure of the social
marginal value of private income is given by
η
M CF s ≡ R
N
αs (n)dF (n)
.
(12)
The traditional marginal cost of public funds M CF s is the ratio of the social marginal value of
public income η and the average of the social marginal value of private income αs (n).
Definition 3 The distributional characteristics ξys of tax bases y(n) = {z(n), c(n)} are given
by
ξys ≡ − R
N
cov[αs (n), y(n)]
R
> 0.
N y(n)dF (n)
αs (n)dF (n)
(13)
ξys is a normalized covariance between a tax base and the welfare weights. It represents the
gain in social welfare (expressed in monetary equivalents and then divided by the taxed base) of
redistributing a marginal unit of resources through base y(n) = {z(n), c(n)}. The distributional
characteristics of the tax bases are positive, since the covariance between tax base y(n) and
the social welfare weights αs (n) is negative. Individuals with higher incomes or consumption
levels feature lower welfare weights because the social marginal utility of income is diminishing
in income or consumption due to diminishing private marginal utility of income or concavity
of the social welfare function. The positive distributional characteristic ξys therefore implies
that redistribution through taxing income or consumption yields distributional benefits. A
stronger social desire for redistribution or greater inequality in the skill distribution raise the
distributional characteristic. Since ξys is a positive normalized covariance, it ranges between
zero and one. ξys = 0 is obtained either if the government is not interested in redistribution
because it attaches the same social welfare weights αs (n) to all individuals or if the base y(n)
is the same for all n so that there is no inequality.
10
Definition 4 The distributional characteristic of the public good is
i
h
(·)
cov αs (n), uuGc (·)
s
.
ξG
≡ −R
R uG (·)
s (n)dF (n)
α
dF
(n)
N
N uc (·)
(14)
s is the negative normalized covariance
The distributional characteristic for the public good ξG
between the social marginal valuation of income αs (n) and the marginal willingness to pay for
the public good
uG
uc .
s > 0 if the public good mainly benefits the rich, who feature the lowest
ξG
s = 0 if the government is not interested in
social welfare weights αs (n), and vice versa. ξG
redistribution and attaches the same welfare weights αs (n) to all individuals or if all individuals
benefit equally from the public good, i.e.,
uG
uc
is equal for all n.
The next Lemma derives the marginal excess burden of the income or consumption tax. The
excess burden measures the reduction in social welfare, measured in monetary units, expressed
as a fraction of the tax base, of raising the distortionary income or consumption tax.
Lemma 1 The marginal excess burdens for the income tax and the consumption tax are given
by
M EBt = −
t c
ε̄ ,
1 − t lt
M EBτ = −
τ
ε̄c .
1 + τ cτ
(15)
Proof. The welfare effect of a rise in the tax rate is evaluated, while public goods provision
remains constant. A rise in the tax rate is considered, while each individual n receives a
individual-specific lump-sum transfer s(n) so as to keep its utility constant.16 The excess
burden is equal to the resulting loss in tax revenue, which is summed over all individuals.
To determine the excess burden of the income tax, assume that τ = 0. The change in taxes
dt and lump-sum income ds(n) for each individual n that leaves utility unaffected: dv(n) =
λ(n)ds(n) − λ(n)nl(n)dt = 0. Hence, ds(n) = nl(n)dt for all n. Public revenue changes
c
c
according to dRn ≡ −ds(n) + nl(n)dt + tn ∂l∂t(n) dt = tn ∂l∂t(n) dt for all n. Since utility remains
constant, changes in labor supply are compensated changes. Rewriting yields a revenue loss
of dRn =
∂lc (n) 1−t
t
1−t nl(n) ∂t l(n) dt Rfor
dividing by taxable income N
of taxed income:
N
each individual n. Summing
dRn
dt
over all individuals, and
nl(n)dF (n), yields the marginal excess burden as a fraction
R
M EBt ≡ − R N
N
dRn
dt dF (n)
nl(n)dF (n)
=−
t c
ε̄ .
1 − t lt
(16)
Recall that the bar indicates an income-weighted elasticity. Using similar steps and setting t = 0 gives the marginal excess burden of the consumption tax as a fraction of taxed
consumption:
R
M EBτ ≡ − R N
N
dRn
dτ dF (n)
c(n)dF (n)
=−
τ
ε̄c .
1 + τ cτ
(17)
Proposition 1 replicates Sandmo (1998) by characterizing optimal tax policies and public
goods provision under the standard measure for the marginal cost of public funds, and assuming
that the consumption tax is normalized to zero. Each tax instrument has its own marginal cost
16
Of course this instrument does not exist, since it boils down to an individualized lump-sum tax. However,
this thought-exercise allows us to calculate the excess burden of the tax.
11
of public funds. Therefore, M CFgs and M CFts are used to denote the marginal cost of public
funds of the lump-sum tax and the tax rate, respectively, see also Sandmo (1998).
Proposition 1 With the standard MCF definition, and with the consumption tax normalized
to zero, the optimal rules for public goods provision and the linear income tax are given by
s
(1 − ξG
)N
Z
N
uG (·)
dF (n) = (1 − γt ε̄ulG ) · M CF s · p,
uc (·)
Z
1
M CFgs =
<
1,
ε̃
≡
εlg dF (n) < 0,
lg
t
1 − 1−t
ε̃lg
N
1 − ξzs
1 − ξzs
M CFts =
,
t u =
t
1 + 1−t ε̄lt
1 − M EBt − 1−t
ε̄lg
M CF s = M CFgs = M CFts < 1.
(18)
(19)
(20)
(21)
Proof. Equation (7) can be simplified by setting τ = 0 and substituting equation (12) to find
equation (19). Equation (8) is simplified by using equation (13) and setting τ = 0 to find the
first part of equation (20). The second part of equation (20) follows upon substitution of the
Slutsky equation ε̄ult = ε̄clt − ε̄lg and using equation (15). Equation (10) is simplified by using
R
equation (14), setting τ = 0, and using γt ≡ N t N nl(n)dF (n)/pG to find equation (18).
Equation (18) is the modified Samuelson rule for the optimal provision of public goods.
The modified Samuelson rule equates the sum of the marginal social benefits with the marginal
social costs of providing the public good. The benefits – the sum of the marginal rates of
R
substitution N N uuGc dF (n) – are deflated by the distributional characteristic of the public
s . If poor individuals value the public good more (less) than rich individuals do, then
good ξG
s < 0 (ξ s > 0), and the level of public goods provision increases (decreases) – ceteris paribus.
ξG
G
The right-hand side gives the marginal cost of providing the public good. p equals the marginal
rate of transformation between the public good and the reference private good. The cost
side of the Samuelson rule is corrected for tax distortions via the marginal cost of public funds
M CF s . A higher cost of public funds lowers the level of public goods provision – ceteris paribus.
Providing the public good may exacerbate (reduce) pre-existing labor-tax distortions if the
public good reduces (boosts) labor supply. This correction is governed by the uncompensated
cross-elasticity of labor with the public good (if ε̄ulG 6= 0), see also Atkinson and Stern (1974).
R
γt ≡ N t N z(n)dF (n)/pG denotes the share of public goods that is financed with distortionary
taxes. If γt = 1, then public goods are financed exclusively with distortionary taxes. If the
public good and labor supply are complementary (ε̄ulG > 0), the provision of the public good
boosts labor supply and this raises tax revenue in the presence of a positive labor tax (γt > 0).
These positive revenue effects decrease the cost of public goods provision. If the public good
reduces labor supply (ε̄ulG < 0), the cost of the public good rises, since lower labor supply erodes
the labor tax base.
Equation (19) gives the marginal cost of public funds for the lump-sum tax. If there is a
positive income tax (t > 0), the marginal cost of public funds for the lump-sum tax is always
smaller than one (M CFgs < 1) as long as leisure is a normal good (ε̃lg < 0 ). Intuitively,
transferring an extra unit of funds from individuals to the government via a larger lump-sum
tax generates an income effect in labor supply, and this raises revenues from the income tax if
12
it is positive (t > 0). Consequently, one can raise the lump-sum tax by less than one unit to
raise one unit of public funds. However, one would expect the lump-sum tax to have a marginal
cost of public funds equal to one, since the lump-sum tax neither causes distortions nor features
distributional effects (i.e., the normalized covariance between the social welfare weights αs (n)
and the lump-sum tax g is zero). The result that the lump-sum tax does not have a marginal
cost of public funds of unity is the first problem that is identified in the literature.
Equation (20) gives the marginal cost of public funds for the tax rate (M CFts ). M CFts
depends on the income-weighted uncompensated tax elasticity of labor supply ε̄ult . The sign
of this elasticity is ambiguous due to offsetting income and substitution effects.17 Clearly, by
allowing for distributional concerns (ξzs > 0), distortionary taxes have not only costs, but also
distributional benefits. These distributional gains lower the marginal cost of public funds for
the distortionary income tax.
In Ramsey (1927) frameworks the non-individualized lump-sum tax is ruled out and distributional concerns are absent. Hence, M CFgs is not applicable and ξzs = 0 in M CFts , see also
Atkinson and Stern (1974). M CFts > 1 is then obtained if the labor supply curve is upwardsloping (ε̄ult < 0) and M CFts < 1 if there is a backward-bending labor supply curve (ε̄ult > 0).
The results that (in the absence of redistributional concerns) the M CFts can be smaller than
one and that it cannot be directly related to the marginal excess burden of the tax led to a
large literature trying to resolve these counterintuitive findings, see for example Triest (1990),
Ballard and Fullerton (1992) and Dahlby (2008).
In the absence of distributional concerns (i.e., ξzs = 0), equation (20) demonstrates that
it is not possible to directly relate the marginal cost of public funds of the distortionary income tax to the marginal excess burden of the income tax, as suggested by the analyses in
Pigou (1947), Harberger (1964) and Browning (1976). The often-used approximation that the
marginal cost of public funds equals one plus the marginal excess burden of the tax will only
apply as long as income effects are absent in labor supply (ε̄lg = 0), since then ε̄clt = ε̄ult and
M CFts = [1 − M EBt ]−1 ≈ 1 + M EBt at low levels of t. However, assuming away income
effects is theoretically not very appealing and is not empirically warranted, see also Blundell
and MaCurdy (1999) and Meghir and Phillips (2010). Thus, in the absence of distributional
benefits of taxes, the marginal cost of public funds of the distortionary tax is not directly related
to the excess burden of the tax instrument. This is the second problem in the literature on the
marginal cost of public funds.
Equation (21) states that, in a second-best optimum, the marginal cost of public funds for all
tax instruments should be equalized (i.e., M CF s = M CFgs = M CFts ). Consequently, Sandmo
(1998) concluded that the marginal cost of public funds at the optimal tax system is smaller
than one in a second-best setting with redistributional concerns. However, note that this result
holds as long as there is a positive labor tax, and irrespective of whether there are distributional
concerns or not, see also Wilson (1991).
The next Proposition demonstrates that the standard definition for the marginal cost of
public funds is highly sensitive to the normalization of the tax system. In particular, it gives
17
Most empirical studies conducted in recent decades suggest a positive value for the uncompensated wage
elasticity of labor supply, implying that M CFts will exceed one, see, for example, Blundell and MaCurdy (1999)
and Meghir and Phillips (2010).
13
the optimality conditions for the tax system and public goods provision when the income tax is
normalized to zero and consumption is taxed instead. In this case, M CFgs and M CFτs denote
the marginal cost of public funds of the lump-sum tax and the consumption tax.
Proposition 2 With the standard MCF definition, and with the income tax normalized to zero,
the optimal rules for public goods provision and the linear consumption tax are given by
(1 −
s
ξG
)N
Z
N
uG (·)
dF (n) = (1 − γτ ε̄ucG ) · M CF s · p,
uc (·)
Z
1
s
M CFg =
> 1, ε̃cg ≡
εcg dF (n) > 0,
τ
1 − 1+τ
ε̃cg
N
1 − ξcs
1 − ξcs
M CFτs =
=
,
τ
τ
1 + 1+τ ε̄ucτ
1 − M EBτ − 1+τ
ε̄cg
M CF s = M CFgs = M CFτs > 1.
(22)
(23)
(24)
(25)
Proof. Equation (7) can be simplified by setting t = 0 and substituting equation (12) to find
equation (23). Equation (8) is simplified by using equation (13), and setting t = 0 to find the
first part of equation (24). The second part of equation (24) follows upon substitution of the
Slutsky equation ε̄ucτ = ε̄ccτ − ε̄cg and using equation (15). Equation (10) is simplified by using
R
equation (14), setting t = 0, and using γτ ≡ N τ N c(n)dF (n)/pG to find equation (22).
The interpretations of equations (22) and (24) are identical to those of (18) and (20) that
appear below Proposition 1. Hence, these will not be repeated.
Equation (23) reveals that the marginal cost of public funds for the lump-sum tax is always
higher than (or equal to) one, rather than always smaller than (or equal to) one when income is
taxed. The reason is that the income elasticity of consumption is positive (if consumption is a
normal good): ε̃cg > 0. If the lump-sum tax is increased (transfer is reduced), there is a negative
income effect in consumption demand, which reduces the revenue from the consumption tax.
In order to raise one unit of public revenue by raising the lump-sum tax, the lump-sum tax
must therefore be increased by more than one unit to compensate for the revenue loss from the
consumption tax. Hence, as the income effect on the taxed base switches in sign, M CFgs switches
from a number below one under income taxation to a number above one under consumption
taxation.
In Ramsey (1927) frameworks with a representative agent (ξcs = 0) and no lump-sum taxes,
equation (24) gives the marginal cost of public funds of the consumption tax. It reveals that
M CFτs > 1, since substitution effects and income effects in consumption demand are reinforcing
rather than offsetting. Consequently, the marginal cost of public funds for the consumption
tax is always larger than one, whereas it could be smaller than one for the income tax. In
the presence of distributional concerns (ξcs > 0), equation (25) shows that at the optimal tax
system the marginal cost of public funds for all tax instruments are equalized. As a result,
M CF s is larger than one if consumption is taxed, and M CF s is smaller than one if income
is taxed. A different normalization of the tax system therefore produces completely different
marginal cost of public funds measures for both lump-sum and distortionary taxes. However,
the normalization of the tax system does not influence the optimal second-best allocation. This
14
is the third problem with the standard marginal cost of public funds measures.18
The normalization explains the findings of Wilson (1991) and Sandmo (1998). They both
suggest that distributional concerns are the reason why the marginal cost of public funds is
smaller than one. However, their findings stem from the normalization of the tax system,
and would be reversed if consumption would be taken as the tax base instead of income, as
Proposition 2 demonstrates.
The standard M CF measure thus suffers from three shortcomings. First, the marginal cost
of public funds for lump-sum taxes are not equal to one – irrespective of the normalization of
the tax system. Second, in the absence of distributional concerns, the marginal cost of public
funds for a distortionary tax instrument cannot be directly related to the excess burden of
the tax instrument – irrespective of the normalization of the tax system. Hence, the M CF
measure does not properly capture the welfare costs of taxation in the absence of distributional
concerns. Third, the M CF measures for both distortionary and lump-sum taxes are shown to
be critically sensitive to the normalization of the tax system. All these problems are suggestive
of a conceptually problematic M CF measure. From a practical point of view, these problems
render the applicability of standard M CF measures in policy analysis doubtful.
3.2
Marginal cost of public funds – The Diamond approach
This section argues that the three problems with the standard M CF measures can be resolved
by appropriately specifying the social marginal value of private income, which is used in the
definition of the marginal cost of public funds. In particular, it is argued that the social marginal
value of private income should also include the income effects on the taxed bases, as in Diamond
(1975). If the government transfers one unit of funds to the private sector, it should not only
take into account the direct utility gains to individuals – as the standard measure does –, but
also include the indirect effect of a higher transfer on revenues from other taxes as social losses
(under income taxation) or gains (under consumption taxation).
According to the traditional approach, the income effects on the taxed bases are not included
in the social marginal value of private income, see also Definition 1. Consequently, to calculate
the M CF , the social value of a unit of funds to the public sector is compared with the private
value of private funds – up to a transformation of the latter through the social welfare function
if non-utilitarian social preferences are adopted.19 However, the marginal cost of public funds
should measure the ratio between social marginal value of a unit of public funds and the social
marginal value of a unit of private funds, and not the private marginal value of private funds. By
defining the marginal cost of public funds based on Diamond (1975), it will be demonstrated
that the three problems identified above are nullified. The social marginal value of private
income is thus defined as follows.
18
The sensitivity of M CF s to the normalization of the tax system was already pointed out by Atkinson and
Stern (1974) without using the M CF terminology, however. In particular, M CF s can be shown to be bigger
(smaller) than one if consumption goods (factor supplies) are taxed, and factor supplies (consumption goods) are
not taxed.
19
Naturally, this does not mean that one cannot define the marginal cost of public funds to be as such, since
a definition cannot be wrong in and of itself. However, from an economic point of view it would be incorrect
to interpret the traditional measure as the relative social marginal value of public funds in terms of the social
marginal value of private funds.
15
Definition 5 The Diamond definition for the social marginal value of one unit of private income accruing to individual n equals
α(n) ≡ Ψ0 (u(n))λ(n) + ηtn
∂l(n)
∂c(n)
+ ητ
,
∂g
∂g
∀n.
(26)
The direct utility effect is still measured by Ψ0 (u(n))λ(n). In addition, transferring one unit of
∂c(n)
funds to the individual reduces labor supply ( ∂l(n)
∂g < 0) and raises consumption demand ( ∂g >
0), since leisure and consumption are assumed to be normal goods. Hence, the government loses
∂c(n)
−tn ∂l(n)
∂g in tax revenues from the income tax (if t > 0) or gains τ ∂g in revenues from the
consumption tax (if τ > 0) if the individual receives an additional unit of funds. The welfare
effects are obtained by multiplication of the revenue changes with η, the social marginal value
of one unit of public resources. Adopting α(n) as the social marginal value of private income
makes it possible to define the marginal cost of public funds.
Definition 6 The marginal cost of public funds based on the Diamond measure of the social
marginal value of private income is given by
M CF ≡ R
N
η
.
α(n)dF (n)
(27)
Analogously to the traditional measure for the marginal cost of public funds, the Diamondbased measure for the marginal cost of public funds M CF is defined as the ratio of the social
marginal value of public resources η and the average social marginal value of private income
α(n). Similarly, the definitions for the distributional characteristics can be adjusted by using
α(n) instead of αs (n) as the social welfare weights.
Definition 7 The distributional characteristics ξy of tax bases y(n) = {z(n), c(n)} based on
the Diamond measure of the social marginal value of private income are given by
ξy ≡ − R
N
cov[α(n), y(n)]
R
> 0.
α(n)dF (n) N y(n)dF (n)
(28)
Definition 8 The distributional characteristic of the public good based on the Diamond measure
of the social marginal value of private income is
h
i
(·)
cov α(n), uuGc (·)
ξG ≡ − R
.
R uG (·)
α(n)dF
(n)
dF
(n)
N
N uc (·)
(29)
Using the new definitions, the optimal policies under linear income taxation are expressed
as follows.
Proposition 3 Under the Diamond based MCF definition, and the consumption tax normalized
16
to zero, the optimal rules for public goods provision and the linear income tax are given by
Z
(1 − ξG )N
N
uG (·)
dF (n) = (1 − γt ε̄clG ) · p,
uc (·)
M CFg = 1,
M CFt =
1 − ξz
1 − ξz
t c = 1 − M EB ,
1 + 1−t ε̄lt
t
M CF = M CFg = M CFt = 1.
(30)
(31)
(32)
(33)
Proof. Equation (7) can be simplified by setting τ = 0 and substituting equation (27) to find
equation (31). Equation (8) is simplified by using equation (28), the Slutsky equation
∂lc (n)
∂t
−
nl(n) ∂l(n)
∂g ,
∂lu (n)
∂t
=
and setting τ = 0 to find the first part of equation (32). The second part
follows from substituting equation (15). Equation (10) is simplified by using equation (29), the
R
u
c (n)
uG ∂l(n)
Slutsky equation ∂l∂G(n) = ∂l∂G
+ λ(n)
∂g , setting τ = 0, and using γt ≡ N t N nl(n)dF (n)/pG
to find equation (30).
Equation (30) reveals that the modified Samuelson rule is almost the same as before, see
equation (18), except that the compensated cross-elasticity of labor supply with respect to public
goods ε̄clG enters the expression (rather than the uncompensated cross-elasticity).20 However,
the most important difference with Proposition 1 is that there should be no correction for the
marginal cost of public funds on the cost side of the modified Samuelson rule. Consequently, tax
distortions do not affect the decision rule for the optimal supply of public goods. Although the
decision rule does not contain a correction for the marginal cost of public funds in second-best,
the level of public goods provision is generally different, since the allocations are not identical
in first- and second-best. Therefore, one cannot conclude that tax distortions do not affect the
second-best level of public goods provision.
Equation (31) demonstrates that, at the optimal tax system, the marginal cost of public
funds for the lump-sum tax (M CFg ) is always equal to one. The reason is that the lump-sum
tax does neither cause deadweight losses nor have distributional gains (losses). Indeed, there is
a zero covariance between the lump-sum tax g and the social welfare weights α(n), so that the
distributional characteristic ξ for the lump-sum tax is zero. Hence, the result that the marginal
cost of public funds is equal to one is, therefore, merely a statement that the tax system is
optimal. Intuitively, at the optimal tax system, one unit of resources should be equally valuable
in the private as in the public sector. Thus, the government should be indifferent between
transferring funds from the public to the private sector. The first problem that the lump-sum
tax does not have a marginal cost of public funds equal to one is thereby resolved.21
Equation (32) shows that in Ramsey (1927) analyses without lump-sum taxes and redistributional concerns (ξz = 0), the marginal cost of public funds for the distortionary tax would be
20
The compensated cross-elasticity is a different measure than the uncompensated cross-elasticity. Whereas
the uncompensated elasticity is zero with utility functions exhibiting (weak) separability between public goods
and labor (leisure), the compensated cross-elasticity is generally different from zero and positive for a wide class
of utility functions including the separable ones, see Jacobs (2009).
21
Gahvari (2014) suggested that M CF = 1 is a definition rather than a result. This is not correct, since
M CF = 1 is an optimality condition, not a definition. Outside the tax optimum, the marginal cost of public
funds forRthe lump-sum tax does not equal one (i.e., M CFT 6= 1) if the average social marginal value of private
income ( N α(n)dF (n)) is unequal to the social marginal value of public income (η).
17
directly related to the marginal excess burden of the tax. Indeed, without distributional concerns it is exactly equal to the inverse of 1 − M EBt . This finding resolves the second problem
that there is no link between the marginal cost of public funds and the excess burden of taxation.
The standard definition of the marginal cost of public funds does not include the income effects
on taxed bases in the social marginal value of private income. Hence, these income effects pop
up in the expression for the marginal cost of public funds, see equation (20).
In Mirrlees (1971) analyses, however, the government only introduces distortionary taxes
if doing so contributes to equality (i.e., if taxing labor income yields distributional benefits).
With distributional concerns, ξz > 0, and M CFt is lowered, as equation (32) reveals. At the
optimal tax system, the marginal cost of public funds for the income tax should be equal to the
marginal cost of public funds for the lump-sum tax, as equation (33) demonstrates. Therefore,
the marginal deadweight losses of income taxes should be exactly equal to the marginal distributional gains of income taxes: M EBt =
t c
1−t ε̄lt
= ξz . At the optimum, the marginal excess
burden of a distorting tax rate (expressed in monetary units, as a fraction of taxed income)
exactly equals the marginal benefits of redistribution (expressed in monetary units, as a fraction of taxed income). Tax distortions are therefore only introduced insofar as these distortions
contribute to a more equal distribution of utility. The more society cares about distribution,
the larger is ξz , and the higher is the optimal income tax. The more elastically labor supply
responds to taxes, the larger is −ε̄clt , and the lower is the optimal income tax. This is the
standard trade-off between equity and efficiency.
Naturally, there should be no correction of the modified Samuelson rule in equation (30) if the
public good is financed at the margin with the lump-sum tax, since there is no deadweight loss
involved (and no distributional gains either). However, neither should it contain a correction if
the marginal source of finance for the public good is the distortionary tax. This is an application
of the envelope theorem: the deadweight loss of a marginally higher tax exactly cancels against
the distributional gain of the tax if the tax is optimally set.
Equation (30) is mathematically identical to the modified Samuelson rule in equation (18),
only it is expressed in terms of compensated elasticities by using the Slutsky equations, cf.
Wildasin (1984). Although both equations provide equally valid characterizations of the modified Samuelson rule for the optimal provision of public goods, the policy conclusions derived
from both equations may seem to be different (but are not). Indeed, only by correcting the
definition for the marginal cost of public funds, it becomes clear that it should not play a role
in the modified Samuelson rule.
Tax distortions are introduced only for redistributional reasons, not for public goods provision. If the government would not be interested in income redistribution (i.e., ξz = ξG = 0),
distortionary income taxes would be optimally zero (t = 0), see Proposition 3. Thus, in the absence of a preference for redistribution, all public goods would be financed with non-distortionary
non-individualized taxes. The marginal excess burden of the income tax is therefore the price
of equality and not the price of public goods provision.
The next Proposition demonstrates that all results remain valid also using the different tax
normalization, where consumption rather than income is taxed.
Proposition 4 Under the Diamond based MCF definition, and with the income tax normalized
18
to zero, the optimal rules for public goods provision and the linear consumption tax are given by
Z
(1 − ξG )N
N
uG (·)
dF (n) = (1 − γτ ε̄ccG ) · p,
uc (·)
M CFg = 1,
M CFτ =
1 − ξc
1 − ξc
,
=
τ
1 + 1+τ
1 − M EBτ
ε̄ccτ
M CF = M CFg = M CFτ = 1.
(34)
(35)
(36)
(37)
Proof. Equation (7) can be simplified by setting t = 0 and substituting equation (27) to find
equation (35). Equation (8) is simplified by using equation (28), the Slutsky equation
∂cc (n)
∂τ
−
c(n) ∂c(n)
∂g ,
∂cu (n)
∂τ
=
and setting t = 0 to find the first part of equation (36). The second part
follows from substituting equation (15). Equation (10) is simplified by using equation (29), the
R
c
u
uG ∂c(n)
Slutsky equation ∂c∂G(n) = ∂c∂G(n) + λ(n)
∂g , setting t = 0, and using γτ ≡ N τ N c(n)dF (n)/pG
to find equation (34).
Proposition 4 demonstrates that the marginal cost of public funds measures have become
independent from the particular normalization of the tax system. At the optimal tax system,
the marginal cost of public funds remains equal to one for all tax instruments, as shown in
equation (35) and (37). Equation (36) derives that the excess burden of the consumption tax
τ
equals its distributional gain: M EBτ = − 1+τ
ε̄ccτ = ξc . Moreover, the characterization of the
optimal policy rule for public good provision in equation (34) has become independent from the
particular tax normalization.
The marginal excess burdens of consumption and income taxes – as defined in Lemma 1 – are
not quantitatively identical under different tax normalizations in the absence of distributional
concerns (ξz = ξc = 0), even though allocations are, see also Håkonsen (1998). The contributions
by Triest (1990) and Dahlby (2008) are related. If there are no distributional concerns, and
lump-sum taxes are ruled out, then M CFt = (1 − M EBt )−1 6= M CFτ = (1 − M EBτ )−1 ,
even if the Diamond concept of the social marginal value of private income is adopted. To
resolve this apparent inconsistency, Triest (1990), Håkonsen (1998) and Dahlby (2008) develop
alternative M CF concepts relying on correcting the M CF measures for the shadow values of
public resources (η), both before and after the introduction of distortionary taxes. However,
the explanation for the difference in the M CF measures is that both tax instruments have
equal marginal excess burdens in absolute terms, but these excess burdens are expressed as
fractions of different tax bases. Hence, the M EB measures differ quantitatively. However, once
distributional concerns are included, this issue has become moot, since marginal distortionary
costs and marginal distributional gains are equal, and the normalization of the costs or benefits
of the tax instrument with a particular tax base has become immaterial.
The next Proposition derives a special case in which the first-best Samuelson rule for public
goods provision is obtained in second-best settings with distortionary taxation.22
Proposition 5 If utility is given by u(n) ≡ c(n) − v(l(n)) + Γ(G), v 0 , Γ0 > 0, v 00 > 0, Γ00 ≤ 0,
then the optimal provision of public goods follows the first-best Samuelson rule in second-best
22
Note that in this particular case it does not matter how one defines the marginal cost of public funds as the
traditional and the Diamond measures of M CF are identical in the absence of income effects.
19
settings with optimal distortionary taxation:
Z
N
N
uG (·)
dF (n) = p.
uc (·)
(38)
Proof. The first-order condition for labor supply is given by v 0 (l(n)) =
εclG
εclG
= 0. Furthermore,
uG
uc
=
Γ0 (G)
(1−t)n
(1+τ ) , ∀n.
Therefore,
is independent from skill n, hence, ξG = 0. Substitution of
= 0 and ξG = 0 in equation (30) or equation (34) yields the result.
This simple, special case demonstrates why it is misleading to ignore distributional concerns
in the analysis of the marginal cost of public funds. When distributional concerns are ignored,
the cost side of the Samuelson rule would be multiplied with M CF , whereas the distributional
benefits of the distortionary tax would be left out of the picture. The explanation why the
first-best Samuelson rule is found in a second-best setting is twofold. First, the public good
does not increase or decrease compensated labor supply. Therefore, the public good cannot be
used to alleviate the tax distortions on labor supply. Second, the public good does not affect
the welfare distribution, since every individual benefits to an equal extent from the public good.
Indeed, the public good is a perfect substitute for the lump-sum cash transfer g. Since the
provision of the public good neither affects efficiency nor equity, provision of the public good
should follow the first-best policy rule.
3.3
Sub-optimal taxation
To conclude the discussion on linear taxation, this subsection explores to what extent the results
are driven by allowing for heterogeneous agents and non-individualized lump-sum transfers.
To that end, suppose that the government cannot optimize the lump-sum tax. Then, the
government has to resort to distortionary taxation as the marginal source of finance for public
goods. For brevity, this section only discusses income taxation. The following Proposition
derives optimal policy for any level of lump-sum taxation.23
Proposition 6 Under the Diamond based MCF definition, the lump-sum tax exogenously given,
and the consumption tax normalized to zero, the optimal rules for public goods provision and
the marginal cost of public funds are given by
Z
(1 − ξG )N
N
uG (·)
dF (n) = (1 − γt ε̄clG ) · M CFt · p,
uc (·)
1 − ξz
1 − ξz
M CFt =
t c = 1 − M EB R 1.
1 + 1−t ε̄lt
t
(39)
(40)
Proof. This follows from Proposition 3.
The modified Samuelson rule in equation (39) now features a correction (M CFt ) for the
marginal cost of public funds of the income tax. Since M CFt =
1−ξz
1−M EBt ,
a correction needs to
be made for both the excess burden of the income tax M EBt and the distributional benefits ξz
23
If the marginal source of public finance is the lump-sum tax, and the government jointly optimizes the lumpsum tax and public goods provision, for a given level of the income tax, then the marginal cost of public funds is
one. In that case, there should be no correction of the the modified Samuelson rule. This follows trivially from
Proposition 3.
20
of the income tax. These distributional benefits are omitted in models adopting a representative
agent. Clearly, when the lump-sum tax cannot be optimized, the marginal cost of public funds
ceases to be equal to one. However, even when lump-sum taxes are unavailable, one can not
conclude that the marginal cost of public funds for distortionary taxation is necessarily larger
than one. This depends on the distributional benefits of the distortionary tax. If the income tax
is sub-optimally low (high) from a distributional perspective (i.e., ξz > (<)M EBt ), then the
marginal cost of public funds is smaller (bigger) than one, i.e., M CFt < 1 (> 1), and optimal
public goods provision is larger (smaller), everything else equal. Intuitively, if M CFt < 1,
the government over-provides public goods – relative to the second-best rule with optimized
transfers – to compensate for the sub-optimal income redistribution by the income tax (and
vice versa if M CFt > 1).
The representative-agent models are nested as a special case of the model where lumpsum taxes are excluded and the distributional effects of taxation or public goods are absent
(ξz = ξG = 0). In that particular case, the marginal cost of public funds are unambiguously
larger than one, since M CFt =
1
1−M EBt
> 1 if ξz = 0. Indeed, the modified Samuelson rule
from Proposition 6 then collapses to the expression derived by Atkinson and Stern (1974). This
case, however, is of no practical relevance, since individuals in the real world are heterogeneous.
Hence, it would be incorrect to ignore the distributional effects of taxes and public goods
provision, and only count for the distortions of taxation and the interaction of the public good
with labor supply in social cost-benefit analysis.
If tax systems are not optimized because lump-sum taxes are not available, the standard
Samuelson rule in equation (39) should thus be adjusted for four factors in second-best settings
with distortionary taxation: i) interactions of the public good with distorted labor supply (ε̄clG ),
ii) distributional benefits of the public good (ξG ), iii) distortions of the financing of the public
good, i.e., the marginal excess burden of distortionary taxation (M EBt ), and iv) distributional
benefits of the financing of the public good (ξz ). If tax systems are optimized, the distributional
gains equal the deadweight losses of financing the public good, so that M EBt = ξz , and effects
iii) and iv) cancel from the modified Samuelson rule.
4
Optimal non-linear taxation and public goods provision
The analysis has so far been confined to linear policy instruments. In the real world, however,
most tax systems are non-linear. This section extends the model to allow for non-linear income
taxation, as in Mirrlees (1971). It is demonstrated that all major findings derived under linear
policies carry over to non-linear policies. Moreover, the formulae for optimal non-linear taxation
and optimal provision of public goods are expressed in terms of sufficient statistics so as to obtain
the closest correspondence with the corresponding expressions derived under linear instruments.
Finally, this section extends previous literature by analyzing optimal taxation and public goods
provision with preference heterogeneity.
21
4.1
Individuals
Let the non-linear tax schedule be denoted by T (z(n)), where T 0 (z(n)) ≡ dT (z(n))/dz(n)
denotes the marginal income tax rate at income z(n). The tax function is assumed to be
continuous and differentiable. The government can only verify total income of an individual
(z(n) = nl(n)), not its labor supply l(n) or ability n, which rules out individualized lump-sum
taxes. The individual budget constraint is modified:
c(n) = z(n) − T (z(n)),
∀n.
(41)
The first-order condition for labor supply of each individual reads as
−ul (c(n), l(n), G, n)
= (1 − T 0 (z(n)))n,
uc (c(n), l(n), G, n)
∀n.
(42)
This first-order condition is the same as before, see equation (3), except that the non-linear
marginal tax rate replaces the linear one.
4.2
Government
The social welfare function remains the same as in equation (4). In the Mirrlees setup it is more
convenient to work with the economy’s resource constraint rather than the government budget
constraint.24 The economy’s resource constraint is:
Z
(nl(n) − c(n))dF (n) = pG.
N
(43)
N
4.3
Incentive compatibility
To determine the non-linear tax schedule T (·) and optimal public goods provision G, a standard mechanism-design approach is employed. Any second-best allocation must satisfy the
incentive-compatibility constraints. Since n is not observable by the government, every bundle {c(n), z(n)} for individual n must be such that each individual self-selects into this bundle
and does not prefer another bundle {c(m), z(m)} intended for individual m 6= n. By adopting
the first-order approach, as in Mirrlees (1971), the incentive-compatibility constraints can be
summarized by a differential equation on utility:
du(c(n), l(n), G, n)
l(n)
= un (c(n), l(n), G, n) −
ul (c(n), l(n), G, n),
dn
n
∀n.
(44)
The first-order approach yields an implementable allocation only if the second-order conditions for utility maximization are fulfilled at the optimum allocation. Lemma 2 states the
conditions under which the first-order approach is both necessary and sufficient to describe the
optimum.25
24
If the government maximizes social welfare subject to the resource constraint, and all households respect
their budget constraints, then the government budget constraint is automatically satisfied by Walras’ law.
25
Lemma 2 implies that the second-order conditions are locally satisfied, which is a standard result in the
Mirrlees model without public goods provision, see also Hellwig (2010).
22
Lemma 2 The optimal allocation derived under the first-order approach is implementable with
the non-linear income tax function if i) utility U (c(n), z(n), G, n) ≡ u(c(n), z(n)/n, G, n) satisfies the Spence-Mirrlees condition, i.e., d(−Uz /Uc )/dn < 0, and ii) gross incomes are nondecreasing with skill n, i.e., dz(n)/dn ≥ 0.
Proof. See Jacobs (2010).
Since the public good is identical for all agents, incentive compatibility constraints are not
affected by introducing public goods, even if different individuals exhibit a different valuation
for the public good, see also Kreiner and Verdelin (2012). Intuitively, one individual cannot
mimic another individual’s ability to benefit from the public good, since his utility function
remains the same. In the remainder of this section it’s assumed that Lemma 2 holds.
4.4
Optimal taxation and public goods provision
To characterize the optimal second-best policy, Proposition 7 provides the conditions for the
optimal non-linear income tax, the marginal cost of public funds, and the optimal provision of
public goods. The optimal non-linear tax and modified Samuelson rule are expressed in terms
of sufficient statistics, i.e., the earnings distribution, the willingness to pay for public goods,
behavioral elasticities, and social welfare weights. The earnings distribution equals the skill
distribution, i.e., F̃ (z(n)) ≡ F (n), in view of the one-to-one mapping between earnings and
skills (Saez, 2001). The corresponding density of earnings equals f˜(z(n)).
Proposition 7 The optimal non-linear income tax, the marginal cost of public funds, and
optimal public goods provision are given by
T 0 (z(n))
1 − T 0 (z(n))
=
1
−εclT 0
R z(n) z(n)
1−
(1 − ξG )N
N
f˜(z(m))dz(m) 1 − F̃ (z(n))
1 − F̃ (z(n))
z(n)f˜(z(n))
,
η
= 1,
˜(z(m))dz(m)
α(m)
f
z(n)
Z
uG (·)
N T 0 (nl(n))nl(n)
c
dF (n) = p 1 −
γn εlG dF (n) , γn ≡
.
uc (·)
pG
N
M CF ≡ R z(n)
Z
α(m)
η
∀n,
(45)
(46)
(47)
Proof. See Appendix.
The expression for the optimal non-linear income tax in equation (45) is identical to Saez
(2001). This expression is well known in the literature and the reader is referred to Mirrlees
(1971), Seade (1977), Tuomala (1984), Diamond (1998) and Saez (2001) for more elaborate
discussions of the optimal non-linear tax.
At the optimum, the marginal cost of public funds in equation (46) is once again equal to
one. The formulation of optimal tax policy in terms of sufficient statistics formally demonstrates
that the marginal cost of public funds equals one also in the Mirrlees (1971) model. Intuitively,
the government has acces to a non-distortionary marginal source of public finance: the intercept
of the tax function T (0). The intercept −T (0) is equivalent to the non-individualized lump-sum
transfer g under linear taxation. The government lowers the intercept of the tax function T (0)
until the marginal utility gains plus income effects (α(n)) are on average equal to its marginal
23
cost in terms of revenue (η). When the tax system is optimized, the government is indifferent
to a marginal redistribution of income from the public to the private sector.
The marginal cost of public funds for non-distortionary taxes should be equal to the marginal
cost of public funds for all distortionary taxes. Thus, tax distortions should be equal to distributional gains for marginal tax rates at each point in the income distribution. This can be seen
by rewriting the expression for the optimal non-linear income tax in equation (45):
M EB(n) · z(n)f˜(z(n)) =
z̄
α(m) ˜
1−
f (z(m))dz(m),
η
z(n)
Z
M EB(n) ≡ −
(48)
T 0 (z(n)) c
ε 0.
1 − T 0 (z(n)) lT
Clearly, the total marginal excess burden of a higher marginal tax rate at earnings level z(n)
equals the marginal excess burden per unit of tax base M EB(n) multiplied by the tax base
f˜(z(n))z(n). The distributional gain of a higher marginal tax rate at income z(n) equals the
social value of extracting a marginal unit of revenue from all individuals above z(n). The latter
equals 1 – the unit of revenue – minus the Diamond (1975) social marginal value of income
α(n)/η – the utility cost in money equivalents of paying a unit more tax.
If marginal tax rates are not optimized, equation (48) does not hold with equality. In that
case, the marginal excess burden of a marginal tax rate is not equal to the marginal distributional
benefit of the marginal tax rate. Moreover, if equation (48) does not hold with equality at all
points in the income distribution, the marginal cost of public funds is no longer equal to one,
hR
i−1
z(n)
i.e., M CF ≡ η z(n) α(m)f˜(z(m))dz(m)
6= 1.
The expression for the optimal provision of public goods in equation (47) is identical to
expression (30) for optimal public goods provision under linear taxation. The only (minor)
difference is that the share parameters γn take into account the non-linearity of the tax schedule.
The interpretation of the modified Samuelson rule otherwise remains completely the same, and
the reader is referred to the previous discussion. Clearly, there does not need to be a correction
in the modified Samuelson rule for the marginal cost of public funds under optimal non-linear
income taxation, since M CF = 1 at the optimal tax system. The non-linear case therefore
confirms the linear case.
To gain more intuition for the reasons why the modified Samuelson rule differs from the
standard Samuelson rule, it can be expressed as well in terms of the model’s primitives (see
Appendix):
Z
N
(1 + ∆(n))
N
where
∆(n) ≡
T 0 (z(n)) εclT 0
1 − T 0 (z(n)) εuzn
uG (·)
dF (n) = p,
uc (·)
∂ ln(uG /uc ) ∂ ln(uG /uc )
−
∂ ln l(n)
∂ ln n
(49)
,
∀n.
(50)
The modified Samuelson rules in equations (47) and (49) are the same, except that equation
(47) is expressed in terms of sufficient statistics rather than the model’s primitives, i.e. the skill
distribution and the utility function (recall that 1 − T 0 (z(n)) = −ul (·)/uc (·)). ∆(n) indicates
the extent to which public goods are overprovided relative to the first-best Samuelson rule, cf.
equation (38). ∆(n) can be interpreted as an implicit subsidy on public goods provision at skill
24
level n. ∆(n) is different from zero if the marginal willingness to pay for the public good
uG (·)
uc (·)
varies with labor supply or with ability.
Whether public goods are under- or overprovided compared to the first-best rule is determined by the presence of the distortionary income tax (T 0 (z(n)) > 0). The more distortionary
is income taxation, as indicated by a larger tax rate T 0 (z(n)) or a larger compensated elasticity
εclT 0 , the more public goods provision deviates from the first-best policy rule. In the absence
of redistributional concerns, marginal tax rates are zero (T 0 (z(n)) = 0), and so is the implicit
subsidy on public goods provision (∆(n) = 0). In that case, public goods provision satisfies the
first-best Samuelson rule (38).
The public good is more (less) complementary to work than private consumption, if the
marginal willingness to pay for the public good rises (falls) with labor effort (i.e.,
∂ ln(uG /uc )
∂ ln l
>
0 (< 0)).26 The provision of public goods then increases. The intuition is the same as in
Atkinson and Stiglitz (1976). The government should provide more public goods if they are
more complementary to work than private goods are, and provide fewer public goods if they are
more complementary to leisure than private goods are. In doing so, the government alleviates
the distortions of the labor income tax on work effort. To put it differently, if
∂ ln(uG /uc )
∂ ln l
>0
(< 0), a higher (lower) level of public goods provision relaxes the incentive constraints associated
with the redistribution of income, as individuals with a high ability are less tempted to mimic
individuals with a lower ability. The Atkinson and Stern (1974) term capturing the interaction
of the public goods with labor supply is therefore also present under non-linear taxation. See
also Christiansen (1981) and Boadway and Keen (1993). The term
is therefore associated with the compensated cross-elasticity
εclG
∂ ln(uG /uc )
∂ ln l
in equation (49)
of labor supply with respect to
the public good in the modified Samuelson rule under linear taxation in equation (47).
If the utility function differs across individuals, public goods provision should also be employed for redistribution. Public goods provision can extract additional information on the skill
level n. Intuitively, the willingness to pay for public goods varies with ability even if all individuals would have the same labor earnings. Therefore, the willingness to pay for public goods
reveals information about earnings ability that is independent from labor income. This makes
over- or underprovision of public goods attractive for redistribution. Over- or under-provision
of public goods – relative to the first-best Samuelson rule – results in more redistribution than
the government can achieve with the non-linear income tax alone. Naturally, under- or overprovision causes inefficiencies, and these need to be traded off against the distributional gains. If
the marginal willingness to pay for the public good rises (falls) with ability (i.e.,
∂ ln(uG /uc )
∂ ln n
>0
(< 0)), then the public good benefits the individuals with higher skill levels relatively more
(less).27 Consequently, the optimal level of public goods provision falls (increases) for redistriG /uc )
l
The term ∂ ln(u
can be rewritten as ωl (ρlG − ρlc ), where ωl ≡ −lu
> 0 is the utility share of labor,
∂ ln l
u
ulG u
ρlG ≡ (−ul )uG is the Hicksian partial elasticity of complementarity between public goods and labor, and ρlc ≡
ulc u
is the Hicksian partial elasticity of complementarity between private consumption and labor. Thus, the
(−ul )uc
willingness to pay for the public good increases with labor supply if ρlG > ρlc . That is, if the public good is a
stronger Hicksian complement to labor than private consumption is.
27
G /uc )
can be rewritten as ωn (ρnG − ρnc ), where ωn ≡ nuun > 0 is the utility share of
The term ∂ ln(u
∂ ln n
u
ability, ρnG ≡ uunnG
is
the
Hicksian partial elasticity of complementarity between public goods and ability, and
uG
ρnc ≡ uunncuuc is the Hicksian partial elasticity of complementarity between private consumption and ability. The
willingness to pay for the public good increases with ability if ρnG > ρnc . That is, if public goods are stronger
Hicksian complements to ability than private goods are.
26
25
butional reasons. The term
∂ ln(uG /uc )
∂ ln n
in equation (49) is thus associated with the distributional
characteristic ξG of the public good in the modified Samuelson rule under linear taxation in
equation (47).
Alternatively, one can understand this result by looking at the incentive constraints. If
∂ ln(uG /uc )
∂ ln n
> 0 (< 0), then high-skill types are less (more) tempted to mimic low-skill types
if public goods provision expands, since they have a stronger (weaker) preference for public
goods. Consequently, incentive-compatibility constraints are relaxed (tightened), and public
goods provision reduces (increases) the distortions of redistributing income.
If the utility function is identical for all individuals n, the distributional term drops out (i.e.,
∂ ln(uG /uc )
∂ ln n
= 0). Intuitively, if differences in earning ability are the only source of heterogeneity,
and everyone has the same utility function, the willingness to pay for public good is the same if
labor earnings are the same. Therefore, differences in the willingness to pay for public goods only
originate from differences in labor earnings. In that case, under- or or overprovision of the public
good – relative to the first-best Samuelson rule – does not result in more income redistribution
than can be achieved with the non-linear income tax alone. However, under- or overprovision
of public goods – relative to the first-best rule – causes inefficiencies. These distortions can
be avoided by organizing all redistribution via the non-linear income tax and providing public
goods according to the Samuelson rule. See also Christiansen (1981) and Boadway and Keen
(1993) for the case of homogeneous preferences. Hence, when the utility function is the same
for all individuals, public goods are not provided for redistributional reasons.
The finding that public goods are not used for redistribution if utilities are identical contrasts
with linear case. Under linear income taxation, optimal provision of the public good is always
found to be dependent on the distributional impact of the public good, except for the (trivial)
cases in which the marginal willingness to pay is equal or is linear in income for all individuals.
With linear taxation, the government uses an informationally inferior instrument to redistribute
income by ignoring the information on individual earnings. Hence, the government optimally
complements the income tax by using indirect instruments for redistribution, such as public
goods provision.
Christiansen (1981), Boadway and Keen (1993) and Kaplow (1996) discuss the case in
which the utility function is identical for all individuals and is (weakly) separable between private/public goods and leisure. They find that the optimal provision of public goods then follows
the first-best Samuelson rule. The next Proposition demonstrates this formally. Naturally, the
allocations will differ between first- and second-best.
Proposition 8 If utility is given by u(n) ≡ u(h(c(n), G), l(n)), ∀n, then the optimal provision of public goods follows the first-best Samuelson rule in second-best settings with optimal
distortionary taxation:
Z
N
N
uG (·)
dF (n) = p.
uc (·)
Proof. If u(n) ≡ u(h(c, G), l), it is immediately established that
(51)
∂ ln(uG /uc )
∂ ln l
=
∂ ln(uG /uc )
∂ ln n
= 0.
Substitution in (49) yields (51).
Proposition 8 is the non-linear counterpart of Proposition 5. It demonstrates that, once
second-best interactions of the public good with labor supply are absent, or when redistribution
26
via public goods has no value added over direct income redistribution, the optimal rule for public
good provision follows the first-best Samuelson rule. Hence, there should be no correction for
the marginal cost of public funds as distortions of taxation cancel against the distributional
gains of taxation.
5
Discussion
5.1
Instrument set
This paper has demonstrated that ruling out non-individualized lump-sum taxes – as a shortcut for an unspecified distributional problem – has large and policy-relevant consequences for
the analysis of the marginal cost of public funds. However, some may question whether in the
real world governments have in fact access to non-distortionary lump-sum taxes as a marginal
source of public finance. One may argue that ruling out non-individualized lump-sum taxes is
in fact a realistic feature due to practical or even legal problems in implementing such taxes.
However, from a theoretical point of view, the lump-sum part of the tax system generally
consists of subsidies (i.e., transfers) to individuals, since the intercept of the tax function is
optimally negative. Therefore, even if one likes to rule out non-individualized lump-sum taxes
as a marginal source of public finance, it is certainly feasible to marginally reduce lump-sum
transfers to the private sector to finance public goods. Moreover, virtually all real-world tax
systems entail a general tax credit or a general tax exemption, which works as a lump-sum
subsidy and can be (and often is) changed by policy makers without running into all kinds
of practical or legal obstacles. Hence, this paper’s assumption that non-individualized lumpsum transfers are a marginal source of public finance can be defended on both theoretical and
empirical grounds.
5.2
Kaplow and sub-optimal taxation
Kaplow (1996), Kaplow (2004) and Kaplow (2008) argue that even if the tax system is not
optimal, neither distributional concerns nor incentive effects of the financing of public goods
with distortionary taxes should be included in the discussion of second-best policy analysis.
Laroque (2005) and Gauthier and Laroque (2009) contain similar ideas and provide rigorous
proofs of this claim. In this argument, the government simultaneously does two things: i) it
changes the provision of the public good, and ii) it applies a benefit-absorbing change in the
non-linear tax schedule that fully extracts each individual’s willingness to pay for the public
good. Given that the utility function is identical across agents and weakly separable, this tax
change does not affect labor supply incentives. Hence, the tax adjustment perfectly imitates a
pure benefit tax. Consequently, the traditional Samuelson rule can be used to judge whether
public goods provision should increase or not, without making corrections for the marginal cost
of public funds.
Kaplow’s approach has a number of disadvantages. First, preferences must be weakly separable and identical, otherwise the benefit-absorbing change in the non-linear tax schedule is
generally not incentive compatible, and cannot be implemented as a result, see also Laroque
27
(2005) and Gahvari (2006). Consequently, Kaplow’s approach cannot be generalized to nonseparable or heterogenous preferences. Second, while the tax system is not assumed to be
optimal, the non-linear income tax schedule is flexible enough to perfectly off-set any distributional effect of public good provision at each income level. However, if there are binding
constraints that make non-linear tax schedule sub-optimal, the very same constraints may prevent the implementation of the benefit-absorbing tax changes. Hence, it may not be possible
to implement the benefit-absorbing tax change in the non-linear tax schedule. Third, Kaplow’s
analysis cannot be readily generalized to linear tax schedules, because linear taxes generally do
not neutralize all distributional effects of public goods – except in the knife-edge case where
the benefits of public goods are proportional in income. Fourth, in real-world policy making,
changes in the provision of public goods generally occur without neutralizing the distributional
effects with benefit-absorbing changes in the tax system (Gahvari, 2006).
This paper avoided these disadvantages by analyzing public good provision and income
taxation with non-separable and heterogeneous preferences, allowing for both linear and nonlinear tax schedules, and without adjusting the (linear or non-linear) tax schedules to fully
neutralize the distributional impact of public goods. It demonstrated that the marginal cost of
public funds should not be present in the modified Samuelson rule for public goods provision.
Intuitively, if the tax system is optimized, a marginal change in distortionary taxes to finance
public goods produces exactly offsetting distortions and distributional gains, which cancel out in
the modified Samuelson rule. This is an application of the envelope theorem for optimal taxes,
which also explains why this result generalizes to non-seperable and heterogeneous preferences,
linear and non-linear tax systems, and without the need to implement benefit-absorbing tax
changes. Consequently, the marginal cost of public funds should not play a role in the Samuelson
rule for public goods provision if taxes are optimized.
5.3
Implications for other public policies
The results of this paper have larger relevance to other public policies than public goods provision. First, Jacobs and Boadway (2014) analyze optimal linear or non-linear commodity taxes
jointly with optimal non-linear income taxes in the frameworks of Atkinson and Stiglitz (1976)
and Mirrlees (1976). They show that the marginal cost of public funds – based on Diamond’s
social value of income – remains equal to one in the full tax optimum. Hence, distortions from
commodity taxation do not drive up the marginal cost of public funds above one when income
and commodity taxes are optimized.
Second, Kleven and Kreiner (2006) extend the literature on the marginal cost of public
funds – based on the traditional definition – to account for extensive-margin distortions in
labor supply, besides intensive-margin distortions. They argue that participation distortions
tend to raise the marginal cost of public funds. Jacquet, Lehmann, and Van der Linden (2013)
merge the Mirrlees model of optimal income taxation with labor supply on the intensive margin
with Diamond (1980)’s model of optimal income taxation with labor supply on the extensive
margin. Zoutman, Jacobs, and Jongen (2016) follow their analysis and demonstrate that in
this model the marginal cost of public funds is once more equal to one in the tax optimum if
the Diamond-based measure for the marginal costs of public funds is employed. Participation
28
distortions are therefore not a reason why the marginal cost of public funds would rise above
one in the tax optimum.
Third, Sandmo (1975), Bovenberg and de Mooij (1994) and Bovenberg and van der Ploeg
(1994) show that the optimal corrective tax is driven below the Pigouvian level if the marginal
cost of public funds is larger than one. Intuitively, providing environmental quality directly
competes with provision of ordinary public goods. Jacobs and de Mooij (2015) analyze optimal
corrective taxes alongside optimal linear and non-linear income taxes in the models of Atkinson
and Stiglitz (1976) and Mirrlees (1976), which are extended with externalities. They demonstrate that the marginal cost of public funds is once more equal to one in the tax optimum with
optimal corrective taxation. They derive that it is generally incorrect to set optimal corrective
taxes below Pigouvian levels, because the marginal cost of public funds is one. Consequently,
governments should not pursue less ambitious environmental policies if tax rates are high.
Fourth, Barro (1979) and Lucas and Stokey (1983) show that under distortionary taxation
Ricardian equivalence breaks down, and debt-financing and tax-financing of public spending
cease to be equivalent. Since tax distortions are convex in tax rates, it is better to smooth
tax rates over time, rather than having time-varying tax rates. Therefore, public debt should
optimally be used to smooth tax rates over time. Werning (2007), however, demonstrates that
ignoring distributional concerns to motivate tax distortions has fundamental consequences for
the theory of optimal debt management. The reason is that, at the margin, the government
always has access to non-distortionary sources of finance (i.e., non-individualized lump-sum
transfers). Werning (2007)’s analysis implies that the marginal cost of public funds is equal
to one at all times if governments optimize tax systems, see also Jacobs (2009). As a result,
debt and (lump-sum) tax financing are equivalent, Ricardian equivalence is restored, and the
optimal path of public debt becomes indeterminate.
Fifth, if the marginal cost of public funds is equal to one, revenue-raising instruments (such
as taxes) are not superior to revenue-neutral instruments (such as regulation), or to revenuereducing instruments (such as subsidies). The reason is that the fundamental informational
constraint in second-best analysis – ability is private information – cannot be relaxed by simply adopting a different policy implementation. Many indirect mechanisms may exist that can
sustain the second-best optimal allocation derived from an incentive-compatible direct mechanism. Quantity controls, regulation or auctions may achieve the second-best optimal allocation
equally as well as price instruments. Hence, one needs to include additional, instrument-specific
constraints into the analysis in order to assess the desirability of revenue-raising over revenueneutral or revenue-reducing policy instruments.
5.4
Social cost-benefit analysis
In social cost-benefit analysis it is probably best not to make corrections for the marginal cost
of public funds. Tax distortions cancel out against distributional gains if taxes are optimally
set, which renders the marginal cost of public funds equal to one.28 Alternatively, if one does
not want to assume that taxes are optimally set, then not only the deadweight costs of taxation,
28
This does not mean that tax distortions do not affect the allocations, and thereby affect both the costs and
the benefits of a project, however.
29
but also the distributional benefits of taxation should be included in social cost-benefit analysis.
A common practice to add, say, 50 cents for every dollar spent on a public project to account
for tax distortions, but completely ignoring the distributional benefits of tax instruments at the
same time, is incorrect. Indeed, too few public projects would then pass the social cost-benefit
test.
To justify setting the marginal cost of public funds equal to one in social cost-benefit analysis, policy makers may invoke Becker (1983)’s efficient redistribution hypothesis, which argues
that the political system should achieve an outcome in which all opportunities for political gains
to redistribute incomes are exhausted. Alternatively, one may invoke the principle of insufficient reason, since the calculation of deadweight costs and distributional benefits of taxation is
fraught with difficulties. In particular, if tax systems are not considered to be optimal, policy
analysts should estimate both the marginal distortions and the marginal distributional benefits of taxation. It is in principle possible to estimate the deadweight losses of taxation, even
though uncertainties in deadweight loss estimates can be substantial. The estimation of the
distributional benefits of taxation, however, is a hazardous task for any policy analyst, since
this inevitably involves political judgements about the desirability of income redistribution from
which policy analysts should preferably abstain. Indeed, any policy analyst claiming that the
marginal cost of public funds is necessarily larger than one is merely revealing his/her political
preferences for income redistribution: apparently the deadweight losses of taxation are always
considered to be larger than his/her perceived benefits of redistribution.
6
Conclusions
This paper analyzed the simultaneous setting of optimal taxes and the provision of public goods
in standard optimal tax models with heterogeneous agents having heterogeneous preferences.
Both optimal linear and non-linear tax schedules are analyzed. It has been demonstrated that
tax distortions are the ultimate result of redistributional concerns. Using Diamond’s definition
for the social marginal value of private income, which includes policy-induced income effects on
tax bases, the marginal cost of public funds at the optimal tax system is shown to be one, for
optimal linear and non-linear taxes, and for income and consumption taxes. At the optimal tax
system, the marginal cost of public funds for all tax instruments should be equalized. Hence, the
marginal cost of public funds for distortionary taxation equals the marginal cost of public funds
for non-distortionary taxation, which equals one. The distributional benefits of distortionary
tax rates are therefore equal to the marginal excess burden of tax rates. The modified Samuelson
rule is derived using general preference structures. It is demonstrated that the marginal cost
of public funds does not determine optimal public goods provision under optimal taxation.
Modified Samuelson rules for public good provision under linear and non-linear taxation are
identical.
Applied policy economists should use the marginal excess burden as the social cost of income
redistribution, and the marginal cost of public funds as the social cost of financing public activities that are unrelated to income redistribution. It should be remembered that the marginal
cost of public funds of a distortionary tax does not only capture the deadweight loss of a dis-
30
tortionary tax, but also its distributional benefits. When tax systems are not optimal, the
marginal cost of public funds could either be smaller or larger than one, depending on whether
the distributional benefits are smaller than the marginal excess burden of distortionary taxes.
This paper followed common practice in the second-best literature by assuming that the
government is a benevolent social planner and markets are Pareto-efficient in the absence of
government intervention. Government failure could explain why taxes and public goods are not
set at second-best optimal levels. One may therefore be tempted to conclude that government
failure raises the marginal cost of public funds above one. Moreover, not only governments, but
also markets can fail. The main lesson of this paper is that without explicitly incorporating
the fundamental reasons why governments or markets fail into the analysis, it is premature to
conclude that the marginal cost of public funds lies above or below one. In future research, the
marginal cost of public funds should be derived in settings where market or government failure
is explicitly modeled from first principles and not derived from ad hoc constraints.
One could also extend the model to allow for migration of individuals, resulting in participation constraints as in, for example, Hellwig (2004) or Lehmann, Simula, and Trannoy (2014).
This is a large deviation from the assumptions used in the standard literature on the marginal
cost of public funds. Nevertheless, even lump-sum taxes/transfers are probably no longer neutral from an efficiency point of view. In particular, a marginal reduction in the transfer induces
some individuals to migrate. It can be conjectured that the marginal cost of public funds is no
longer one at the optimal tax system if no non-distortionary government instrument exists at
all. Future research could explore this further.
A
Proof Proposition 7
A.1
Elasticities
First, the behavioral elasticities of model are derived. These elasticities will be used later in
deriving the optimal tax expressions. As in Jacquet, Lehmann, and Van der Linden (2013),
define the following shift functionm – omitting the indices n:
L(l, G, n, τ, ρ) ≡ n(1 − T 0 (nl) − τ )uc (nl − T (nl) − τ (nl − nl(n)) + ρ, l, G, n)
+ ul (nl − T (nl) − τ (nl − nl(n)) + ρ, l, G, n).
(52)
L(l, n, τ, ρ, G) measures a shift in the first-order condition for labor supply when one of the
variables l, G, n, τ , or ρ changes. τ captures an exogenous increase in the marginal tax rate
(i.e., for any level of earnings). ρ is introduced to retrieve the income effect when the individual
receives an exogenous amount of income ρ. The first-order condition for labor supply of the
individual n is thus equivalent to L(l, G, n, 0, 0) = 0.
The following partial derivatives of the shift function L are found, using the first-order
31
condition −ul = n(1 − T 0 )uc :
2
ul
T 00
Ll (l, G, n, 0, 0) = ull +
ucc − 2
ucl + nul
,
(53)
uc
(1 − T 0 )
2
!
T 00
ul
ul
l
uln ucn
−ul
+ nul
+
ucc −
ulc
+
ul , (54)
Ln (l, G, n, 0, 0) =
−
l
(1 − T 0 )
uc
uc
n
ul
uc
ul
uc
Lτ (l, G, n, 0, 0) = −nuc ,
ulc uc − ul ucc
Lρ (l, G, n, 0, 0) =
.
uc
(55)
(56)
The envelope theorem gives the following formula for the partial derivatives:
∂l
∂q
L
= − Lql , q =
n, τ, ρ.
The uncompensated wage elasticity of labor supply εuln is equal to:
εuln
∂lu n
Ln n
≡
=−
=−
∂n l
Ll l
−ul
l
2
ul
ul
nuln
T 00
+
u
−
+ nul (1−T
0)
cc
uc
uc ulc +
ul −
2
T 00
ull + uucl ucc − 2 uucl ucl + nul (1−T
0)
nucn
uc
ul
l
.
(57)
The income elasticity of labor supply εlρ is defined as:
ul ucc
u
−
lc
uc
Lρ
∂l
εlρ ≡ (1 − T 0 )n
= −(1 − T 0 )n
=
.
2
∂ρ
Ll
T 00
ull + uucl ucc − 2 uucl ucl + nul (1−T
0)
ul
uc
(58)
The compensated wage elasticity of labor supply εcln is defined residually by the Slutsky
equation (εcln ≡ εuln − εlρ ):
εcln
nuln
nucn ul
T 00
−
nu
−
−
l (1−T 0 )
ul
uc
l
n
≡
= εuln − εlρ =
.
2
∂n l
T 00
ull + uucl ucc − 2 uucl ucl + nul (1−T
0)
ul
l
∂lc
(59)
The compensated tax elasticity of labor supply εclT 0 is:
εclT 0 ≡ −
∂lc (1 − T 0 )
Lτ (1 − T 0 )
=
=
∂τ
l
Ll
l
ull +
ul
uc
2
ul /l
.
T 00
ucc − 2 uucl ucl + nul (1−T
0)
(60)
Note that the compensated tax elasticity of earnings supply εczT 0 equals the compensated tax
elasticity of labor supply (since the wage rate n is not affected by an increase in marginal taxes,
only labor supply).
The uncompensated wage elasticity of earnings supply εuzn is equal to:
εuzn ≡
∂z n
= 1 + εuln
∂n z
ul
1 + luulll − luuccl − nuulln − nuuccn
l
=
.
2
T 00
ull + uucl ucc − 2 uucl ucl + nul (1−T
0)
32
(61)
A.2
Optimal taxation and public goods provision
Consumption is defined as a function of the allocation: c(n) ≡ c(l(n), u(n), G, n), which is
obtained by inverting the utility function u(c(n), l(n), G, n). The derivatives of c(·) are found
by applying the implicit function theorem: cu = 1/uc , cl = −ul /uc , cG = −uG /uc , cn = −un /uc .
For notational compactness the incentive compatibility constraint can be rewritten as:
du(n)
l(n)
= un (c(l(n), u(n), G, n), l(n), G, n) −
ul (c(l(n), u(n), G, n), l(n), G, n)
dn
n
≡ ϕ(l(n), u(n), G, n),
(62)
where the function ϕ(l(n), u(n), G, n) has partial derivatives – omitting the n-indices:
ul
lull lulc
nunl nunc
ϕl = −
1+
−
,
−
−
n
ul
uc
ul
uc
1 nunc lulc
ϕu =
−
,
n
uc
uc
uG
l
uG
ϕG = unG − unc
−
ulG − ulc
.
uc
n
uc
(63)
(64)
(65)
The Hamiltonian for maximizing social welfare is:
H = Ψ(u(n))f (n) + η [nl(n) − c(l(n), u(n), G, n) − pG/N ] f (n) − θ(n)ϕ(l(n), u(n), G, n), (66)
where θ(n) is the co-state variable associated with state variable u(n), which has to satisfy
incentive compatibility constraint (62). Furthermore, η denotes the Lagrange multiplier on the
economy’s resource constraint in equation (43). θ(n) is multiplied with a minus sign to obtain
a positive multiplier θ(n).
The first-order conditions for an optimal allocation are – omitting the n-indices:
∂H
ul
lull lulc
θul
nunl nunc
=η n+
f+
1+
−
−
−
= 0,
∂l
uc
n
ul
uc
ul
uc
Z
N
∀n,
(67)
∂H
η
θ nunc lulc
dθ
= Ψ0 −
, ∀n,
(68)
f−
−
=
∂u
uc
n
uc
uc
dn
Z ∂H
uG
p
θuG
nunG nunc
lulG lulc
dn =
−
−
−
η
f−
−
dn = 0, (69)
∂G
uc
N
n
uG
uc
uG
uc
N
where the derivatives of the c(·) function are used in each line. The transversality conditions
for this optimal control problem are given by limn→n θ(n) = 0, and limn→n θ(n) = 0.
A.2.1
Optimal non-linear taxes
The first-order conditions for l in equation (67) can be rewritten using
−ul
uc
= (1 − T 0 (z))n:
T0
uc θ/η
lull lulc
nunl nunc
=
1+
−
−
−
.
1 − T0
nf (n)
ul
uc
ul
uc
33
(70)
Next, employ the definitions for the elasticities for εclT 0 in equation (60) and εuzn in equation
(61) to find:
1+
lull lulc
−
ul
uc
−
nunl nunc
−
ul
uc
=
εuzn
.
εclT 0
(71)
Hence, the first-order condition for optimal taxes can be written as:
T 0 (z(n))
uc (·)θ/η εuzn
.
=
1 − T 0 (z(n))
nf (n) εclT 0
(72)
The first-order condition for u in equation (68) can be rewritten as:
Ψ0 uc
θ
θ lulc
uc dθ
− 1 f (n) − ucn +
=
.
η
η
η n
η dn
(73)
Introduce the composite multiplier Θ(n):
Θ(n) ≡
θ(n)uc (c(n), l(n), G, n)
θ(n)uc (c(l(n), u(n), G, n), l(n), G, n)
=
,
η
η
(74)
which has total derivative (note that G does not vary with n):
dθ uc θucc
dΘ
=
+
dn
dn η
η
∂c dl
∂c du
∂c
+
+
∂l dn ∂u dn ∂n
θ
dl
θ
+ ucl
+ ucn .
η dn η
(75)
Simplify the last expression using the derivatives of the c-function and substituting equation
(62) for
du
dn :
dθ uc
θ
dΘ
=
−
dn
dn η
η
l ul ucc
− ucn
n uc
θ
−
η
ul ucc
− ucl
uc
dl
.
dn
(76)
Use the elasticity εlρ in equation (58) and the elasticity εclT 0 in equation (60) to find an expression
for
ucc ul
uc
− ucl :
εlρ uc
ul ucc
− ulc = − c
.
uc
εlT 0 l
(77)
Substitute equation (77) into equation (76), and use εuln from equation (57) to derive:
dΘ
dθ uc
θ l ucc ul
θ
θ εlρ uc u
=
−
+ ucn +
ε .
dn
dn η
η n uc
η
η εclT 0 n ln
(78)
Substitute the first-order condition for u in equation (73) into equation (78) to find:
dΘ θ εlρ uc u
−
ε =
dn
η εclT 0 n ln
Ψ0 uc
θ ucc ul
l
− 1 f (n) −
− ulc
.
η
η
uc
n
(79)
Substitute equation (77) in equation (79) and note that 1 + εuln = εuzn from equations (57) and
(61) to find:
dΘ uc θ εuzn ul
−
εlρ =
dn
ul η εclT 0 n
Ψ0 uc
− 1 f (n).
η
(80)
Rewrite first-order condition for zn in equation (72):
T 0 nf (n) = −
34
θ ul εuzn
.
η n εclT 0
(81)
Substitute equation (81) in (80) to find:
dΘ
=
dn
Ψ0 uc
T0
+
ε
−
1
f (n).
lρ
η
1 − T0
(82)
Equation (82) can be integrated – using a transversality condition – to obtain:
θ(n)uc (c(n), l(n), G, n)
Θ(n) =
=
η
Ψ0 (u(m))uc (m)
T 0 (z(m))
1−
−
εlρ f (m)dm.
η
1 − T 0 (z(m))
(83)
n
Z
n
The ABC-formula for the optimal non-linear tax results upon substituting equation (83) in
equation (72):
T 0 (z(n))
1 − T 0 (z(n))
=
εuzn
εclT 0
Rn
n
1−
Ψ0 (u(m))uc (m)
η
−
T 0 (z(m))
1−T 0 (z(m)) εlρ
1 − F (n)
f (m)dm 1 − F (n)
nf (n)
.
(84)
Finally, the optimal tax formula can be written in terms of the earnings density as in Saez
(2001) and Jacquet, Lehmann, and Van der Linden (2013). Define the earnings distribution as
F̃ (z(n)) ≡ F (n), with corresponding density function f˜(z(n)). Then, it can be derived that
εuzn f˜(z(n))z(n) = f (n)n. By noting that there is a perfect mapping between earnings z(n) and
ability n the expression for the optimal income tax is written as:
T 0 (z(n))
1 − T 0 (z(n))
=
1
R z(n) z(n)
1 − g(m) −
εclT 0
T 0 (z(m))
1−T 0 (z(m)) εlρ
f˜(z(m))dz(m) 1 − F̃ (z(n))
1 − F̃ (z(n))
z(n)f˜(z(n))
.
(85)
0
T (z(n))
∂l(n)
0
0
Using the definition of α(n) ≡ Ψ0 (u(n))λ(n)+η 1−T
0 (z(n)) εlρ = Ψ (u(n))λ(n)+ηnT (z(n)) ∂(−T (0)) ,
where the second step follows from εlρ = εl(−T (0)) , this can be rewritten as the expression in
the main text.
A.2.2
Marginal cost of public funds
0
T (z(n))
Use α(n) ≡ Ψ0 (u(n))λ(n) + η 1−T
0 (z(n)) εlρ and the transversality condition θ(n) = 0 in equation
(83) to derive:
Z
n
(1 − α(m)) f (m)dm.
0=
(86)
n
Hence, switching the bound of the integral, the marginal cost of public funds equals unity in
the policy optimum:
M CF ≡ R z(n)
z(n)
A.2.3
η
= 1.
α(m)f˜(z(m))dm
(87)
Optimal public goods provision
In order to express the optimal formula for public goods provision in terms of sufficient statistics, the optimal level of public goods is evaluated along an optimized non-linear tax system
T ∗ (nl(n)). The Lagrangian for maximizing social welfare with respect to the public good can
35
then be formulated as:
Z
L≡N
Z
∗
T (nl(n))dF (n) − pG ,
Ψ(v(n))dF (n) + η N
N
(88)
N
where v(n) is indirect utility of the individual. The first-order condition for G is given by:
∂L
=N
∂G
Z N
∂l(n)
Ψ (v(n))uG + ηnT (nl(n))
∂G
0
0
dF (n) − pη = 0,
(89)
where the derivatives of indirect utility have been substituted. Substituting the Slutsky equation
∂lu (n)
∂G
∂lc (n)
∂G
=
+
uG ∂l(n)
λ(n) ∂ρ
Z
N
N
yields:
α(n) uG
dF (n) + N
η uc
Z N
∂lc (n)
nT (nl(n))
∂G
0
dF (n) = p,
(90)
Use the Feldstein characteristic for the public good ξG to derive:
Z
uG (·)
dF (n) =
uc (·)
(1 − ξG )
N
Z
N
α(n) uG (·)
dF (n).
η uc (·)
(91)
Hence, optimal provision of the public good follows from:
Z
uG (·)
dF (n) = p 1 −
γn εclG dF (n) ,
uc (·)
N
Z
(1 − ξG )N
N
where γn ≡
(92)
N T 0 (nl(n))nl(n)
.
pG
Alternatively, one can derive the optimal rule for public goods provision in terms of the
model’s primitives, as in Jacobs (2010). Derive that
∂ ln(uG /uc )
.
∂ ln l
Z
N
N
nunG
uG
− nuucnc =
∂ ln(uG /uc )
∂ ln n
lulG
uG
− luuclc =
dF (n).
(93)
and
Substitute this in equation (69), and rearrange to obtain:
uG (·)
dF (n) = p + N
uc (·)
Z
N
θ(n)uG (·)/η
nf (n)
∂ ln(uG /uc ) ∂ ln(uG /uc )
−
∂ ln n
∂ ln l(n)
Substitute equation (72) for the optimal income tax in equation (93) to find:
Z
N
N
where
uG (·)
(1 + ∆(n))dF (n) = p,
uc (·)
T 0 (z(n)) εclT 0
∆(n) ≡
1 − T 0 (z(n)) εuzn
∂ ln(uG /uc ) ∂ ln(uG /uc )
−
∂ ln l(n)
∂ ln n
(94)
.
(95)
References
Allgood, Sam and Arthur Snow. 1998. “The Marginal Cost of Raising Tax Revenue and Redistributing Income.” Journal of Political Economy 106 (6):1264–1273.
Atkinson, Anthony B. and Nicholas H. Stern. 1974. “Pigou, Taxation and Public Goods.”
Review of Economic Studies 41 (1):119–128.
36
Atkinson, Anthony B. and Joseph E. Stiglitz. 1976. “The Design of Tax Structure: Direct
versus Indirect Taxation.” Journal of Public Economics 6 (1-2):55–75.
Ballard, Charles L. and Don Fullerton. 1992. “Distortionary Taxes and the Provision of Public
Goods.” Journal of Economic Perspectives 6 (3):117–131.
Barro, Robert J. 1979. “On the Determination of the Public Debt.” Journal of Political
Economy 87 (5):940–971.
Becker, Gary S. 1983. “A Theory of Competition Among Pressure Groups for Political Influence.” Quarterly Journal of Economics 98 (3):371–400.
Blundell, Richard and Thomas MaCurdy. 1999. “Labor Supply: A Review of Alternative
Approaches.” In Handbook of Labor Economics, vol. 3A, edited by Orley Ashenfelter and
David Card, chap. 27. Elsevier, 1559–1695.
Boadway, Robin and Michael Keen. 1993. “Public Goods, Self-Selection and Optimal Income
Taxation.” International Economic Review 34 (3):463–478.
Bovenberg, A. Lans and Ruud A. de Mooij. 1994. “Environmental Levies and Distortionary
Taxation.” American Economic Review 84 (4):1085–1085.
Bovenberg, A. Lans and Frederick van der Ploeg. 1994. “Environmental Policy, Public Finance
and the Labour Market in a Second-Best World.” Journal of Public Economics 55 (3):349–
390.
Browning, Edgar K. 1976. “The Marginal Cost of Public Funds.” Journal of Political Economy
84 (2):283–298.
———. 1987. “On the Marginal Welfare Cost of Taxation.” American Economic Review
77 (1):11–23.
Browning, Edgar K. and William R. Johnson. 1984. “The Trade-Off Between Equity and
Efficiency.” Journal of Political Economy 92 (2):175–203.
Christiansen, Vidar. 1981. “Evaluation of Public Projects under Optimal Taxation.” Review of
Economic Studies 48 (3):447–457.
Dahlby, Bev. 1998. “Progressive Taxation and the Social Marginal Cost of Public Funds.”
Journal of Public Economics 67 (1):105–122.
———. 2008. The Marginal Cost of Public Funds: Theory and Applications. Cambridge-MA:
MIT Press.
Diamond, Peter A. 1975. “A Many-Person Ramsey Tax Rule.” Journal of Public Economics
4 (4):335–342.
———. 1980. “Income Taxation with Fixed Hours of Work.” Journal of Public Economics
13 (1):101–110.
37
———. 1998. “Optimal Income Taxation: An Example with a U-Shaped Pattern of Optimal
Marginal Tax Rates.” American Economic Review 88 (1):83–95.
Diamond, Peter A. and James A. Mirrlees. 1971. “Optimal Taxation and Public Production I:
Production Efficiency.” American Economic Review 61 (1):8–27.
Feldstein, Martin. 1972. “Distributional Equity and the Optimal Structure of Public Prices.”
American Economic Review 62 (1):32–36.
Gahvari, Firouz. 2006. “On the Marginal Cost of Public Funds and the Optimal Provision of
Public Goods.” Journal of Public Economics 90 (6-7):1251–1262.
———. 2014. “Second-Best Pigouvian Taxation: A Clarification.” Environmental and Resource
Economics 59 (4):525535.
Gauthier, Stephane and Guy R. Laroque. 2009. “Separability and Public Finance.” Journal of
Public Economics 93 (11-12):1168–1174.
Håkonsen, Lars. 1998. “An Investigation into Alternative Representations of the Marginal Cost
of Public Funds.” International Tax and Public Finance 5 (3):329–343.
Harberger, Arnold C. 1964. “Taxation, Resource Allocation, and Welfare.” In The Role of Direct
and Indirect Taxes in the Federal Revenue System. Princeton University Press for NBER and
the Brookings Institution, 25–70.
Heckman, James J., Seong Hyeok Moon, Rodrigo Pinto, Peter A. Savelyev, and Adam Yavitz.
2010. “The Rate of Return to the High Scope Perry Preschool Program.” Journal of Public
Economics 94 (1-2):114–128.
Hellwig, Martin. 2004. “Optimal Income Taxation, Public Goods Provision, and Public Sector
Pricing. A Contribution to the Foundations of Public Economics.” MPI Discussion Paper
2004/14, Bonn: Max Planck Institute for Research on Collective Goods.
———. 2010. “Incentive Problems with Unidimensional Hidden Characteristics: A Unified
Approach.” Econometrica 78 (4):1201–1237.
Jacobs, Bas. 2009. “The Marginal Cost of Public Funds and Optimal Second-Best Policy Rules.”
Erasmus University Rotterdam.
———. 2010. “The Marginal Cost of Public Funds is One.” CESifo Working Paper No. 3250,
Munich: CESifo.
Jacobs, Bas and Robin Boadway. 2014. “Optimal Linear Commodity Taxation under Optimal
Non-Linear Income Taxation.” Journal of Public Economics 117 (1):201–210.
Jacobs, Bas and Ruud A. de Mooij. 2015. “Pigou Meets Mirrlees: On the Irrelevance of Tax
Distortions for the Optimal Corrective Tax.” Journal of Environmental Economics and Management 71:90–108.
38
Jacquet, Laurence, Etienne Lehmann, and Bruno Van der Linden. 2013. “The Optimal Marginal
Tax Rates with both Intensive and Extensive Responses.” Journal of Economic Theory
148 (5):1770–1805.
Kaplow, Louis. 1996. “The Optimal Supply of Public Goods and the Distortionary Cost of
Taxation.” National Tax Journal 49 (4):513–533.
———. 2004. “On the (Ir)relevance of Distribution and Labor Supply Distortion to Public
Goods Provision and Regulation.” Journal of Economic Perspectives 18 (4):159–175.
———. 2008. The Theory of Taxation and Public Economics. Princeton and Oxford: Princeton
University Press.
Kleven, Henrik Jacobsen and Claus Thustrup Kreiner. 2006. “The Marginal Cost of Public
Funds: Hours of Work versus Labor Force Participation.” Journal of Public Economics
90 (10-11):1955–1973.
Kreiner, Klaus Thustrup and Nicolai Verdelin. 2012. “Optimal Provision of Public Goods: A
Synthesis.” Scandinavian Journal of Economics 114 (2):384–408.
Laffont, Jean-Jacques and Jean Tirole. 1993. A Theory of Incentives in Procurement and
Regulation. Cambridge-MA: MIT-Press.
Laroque, Guy R. 2005. “Indirect Taxation is Superfluous under Separability and Taste Homogeneity: A Simple Proof.” Economics Letters 87 (1):141–144.
Lehmann, Etienne, Laurent Simula, and Alain Trannoy. 2014. “Tax Me If You Can! Optimal
Nonlinear Income Tax Between Competing Governments.” Quarterly Journal of Economics
129 (4):1995–2030.
Lucas, Robert E. jr. and Nancy Stokey. 1983. “Optimal Fiscal and Monetary Policy in an
Economy without Capital.” Journal of Monetary Economics 12 (1):55–93.
Lundholm, Michael. 2005. “Cost-Benefit Analysis and Marginal Cost of Public Funds.” Mimeo
Stockholm University.
Meghir, Costas and David Phillips. 2010. “Labour Supply and Taxes.” In The Mirrlees Review
– Dimensions of Tax Design, edited by James A. Mirrlees, Stuart Adam, Timothy J. Besley,
Richard Blundell, Steven Bond, Robert Chote, Malcolm Gammie, Paul Johnson, Gareth D.
Myles, and James M. Poterba, chap. 3. Oxford: Oxford University Press, 202–274.
Mirrlees, James A. 1971. “An Exploration in the Theory of Optimum Income Taxation.” Review
of Economic Studies 38 (2):175–208.
———. 1976. “Optimal Tax Theory: A Synthesis.” Journal of Public Economics 6 (4):327–358.
Pigou, Arthur C. 1947. A Study in Public Finance - 3rd Edition. London: Macmillan.
Ramsey, Frank P. 1927. “A Contribution to the Theory of Taxation.” Economic Journal
37 (145):47–61.
39
Saez, Emmanuel. 2001. “Using Elasticities to Derive Optimal Income Tax Rates.” Review of
Economic Studies 68 (1):205–229.
———. 2002. “The Desirability of Commodity Taxation Under Non-Linear Income Taxation
and Heterogeneous Tastes.” Journal of Public Economics 83 (2):217–230.
Samuelson, Paul A. 1954. “The Pure Theory of Public Expenditure.” Review of Economics
and Statistics 36 (4):387–389.
Sandmo, Agnar. 1975. “Optimal Taxation in the Presence of Externalities.” Scandinavian
Journal of Economics 77 (1):86–98.
———. 1998. “Redistribution and the Marginal Cost of Public Funds.” Journal of Public
Economics 70:365–382.
Seade, Jesus K. 1977. “On the Shape of Optimal Tax Schedules.” Journal of Public Economics
7 (2):203–235.
Sheshinski, Eytan. 1972. “The Optimal Linear Income Tax.” Review of Economic Studies
39 (3):297–302.
Slemrod, Joel B. and Schlomo Yitzhaki. 2001. “Integrating Expenditure and Tax Decisions:
The Marginal Cost of Funds and the Marginal Benefit of Projects.” National Tax Journal
52 (2):189–201.
Stiglitz, Joseph E. 1982. “Self-Selection and Pareto Efficient Taxation.” Journal of Public
Economics 17 (2):213–240.
Stiglitz, Joseph E. and Partha Dasgupta. 1971. “Differential Taxation, Public Goods and
Economic Efficiency.” Review of Economic Studies 38 (2):151–174.
Triest, Robert K. 1990. “The Relationship between the Marginal Cost of Public Funds and
Marginal Excess Burden.” American Economic Review 80 (3):557–566.
Tuomala, Matti. 1984. “On the Optimal Income Taxation: Some Further Numerical Results.”
Journal of Public Economics 23 (3):351–366.
Werning, Ivan. 2007. “Optimal Fiscal Policy with Redistribution.” Quarterly Journal of Economics 122 (3):925–967.
Wildasin, David E. 1984. “On Public Good Provision with Distortionary Taxation.” Economic
Inquiry 22 (2):227–243.
Wilson, John D. 1991. “Optimal Public Good Provision with Limited Lump-sum Taxation.”
American Economic Review 81 (1):153–166.
Zoutman, Floris T., Bas Jacobs, and Egbert L.W. Jongen. 2016. “Redistributive Politics and
the Tyranny of the Middle Class.” CESifo Working Paper No. 5881, Munich: CESifo.
40