On reducing the windfall profits in environmental subsidy programs∗ Carmen Arguedas+ and Daan P. van Soest++ + Departamento de Teoría e Historia Económica, Universidad Autónoma de Madrid ++ Department of Economics and CentER, Tilburg University October 30, 2008 Abstract Investment subsidies are widely used to induce adoption of new technologies that are able to reduce emissions at lower (marginal) cost. To economize on the amount of subsidies provided, governments would like to distinguish between firms that need to receive a subsidy to adopt a new technology, and firms that would adopt that technology even without subsidies. We show that policies consisting of a menu of emission taxes and investment subsidies can induce firms to self-select. But we also find that such separating policies do not necessarily result in lower social costs. JEL Codes: D82, H23, Q52. Key words: Investment subsidies, environmental policy, abatement technology adoption, asymmetric information. ∗ Please send all correspondence to: Daan van Soest, Tilburg University, Department of Economics, P.O. Box 90153, 5000 LE Tilburg, The Netherlands; email: [email protected], tel: +31—13—466 2072; fax: +31 13 466 3042. Carmen Arguedas gratefully acknowledges the hospitality of CentER, Tilburg University, where part of this paper was written, and financial support from the Spanish Ministry of Education under research project SEJ2005-05206/ECON. Daan van Soest is grateful to the Netherlands Organization for Scientific Research (NWO) for financial support as part of the NWO/Novem research program. The authors thank Jan Boone, Bouwe Dijkstra, Larry Kranich, Till Requate, Marta Risueño, Herman Vollebergh, Cees Withagen and two anonymous referees for valuable comments on an earlier version of this paper. 1 1 Introduction Subsidies rank high among the instruments most frequently used in environmental policy making (Parry [37]). In some instances, they are simply the most efficient instrument to achieve certain environmental objectives. Examples include the use of subsidies to stimulate investment in abatement technologies if there are positive adoption spillovers (Schneider and Goulder [45]), to foster the use of clean inputs if pollution itself is unobservable (Stranlund [49], Fullerton and Mohr [14], Fullerton and Wolverton [15]), and to complement pollution taxes in case of imperfect competition (Loeb and Magat [28], Kim and Chang [24]). But subsidies may also be the preferred instrument because of ‘political economy’ considerations such as concerns about the potential negative impact of other environmental policy instruments on an industry’s international competitiveness (Baumol and Oates [4]), distributional fairness considerations (Johnson et al. [19]), political self—interest (Fredriksson [12], but also Aidt [1]), etc.1 Independent of the reason why subsidies are being used, society always has a stake in reducing the amount spent, especially if subsidies generate windfall profits for the agents receiving them. If subsidies were straightforward transfers between the government and the recipients, there would be no welfare loss associated with giving agents more money than strictly necessary to compensate them for the costs incurred when undertaking the desired action. However, in most instances raising funds for subsidies requires exacerbating tax distortions elsewhere in the economy, imposing a loss on society in the range of $0.20—$0.30 for every $1 raised (Ruggeri [44]; see also Laffont and Tirole [25] and [26]). 1 For a detailed overview of (the political economy) arguments regarding instrument choice, see Dietz and Vollebergh [11]. Of course it would be easy for the regulator to eliminate all windfall profits if she knew the ‘reservation subsidy level’ for each individual agent, but obtaining such agent—specific information is in many instances prohibitively expensive (cf. Kim and Chang [24] and OECD [35]). Therefore, the question is whether a mechanism can be designed that induces agents to self—select into different groups, ranked according to the minimum amount of money needed to compensate them for providing the desired action. In this paper we take the first step in answering this question by solving the problem for straightforward (lump sum) investment subsidies.2 These subsidies are used very frequently in practice (Jenkins and Lamech [18], OECD [34]) and can be highly effective in stimulating diffusion of new technologies. The regulator can induce any number of firms to adopt by offering a sufficiently large subsidy such that the private adoption benefits3 of all targeted firms are (weakly) higher than their investment cost net of this subsidy. But unless all targeted firms face identical adoption benefits, this policy gives rise to windfall profits in the industry. Whereas the subsidy may only be just sufficient to induce adoption by some firms, others actually obtain a net gain —some may even collect the subsidy while they would have adopted the new technology anyway. Indeed, empirical studies suggest 2 Despite its societal importance, mechanisms to reduce windfall profits have received surprisingly little attention in the environmental economics literature. Indeed, as is evidenced by recent surveys of the literature on technology adoption (see for example Jaffe et al. [16], Nelissen and Requate [33], Requate [42], and Stavins [50]), most attention has been paid to the effectiveness of taxes, quotas, performance standards, and permits (e.g., Jung et al. [21], Milliman and Prince [32], Requate [40]—[41], and Requate and Unold [43]). Also, and probably as a consequence of the difference in focus on policy instruments, the issue of the social costs of raising funds has not received much attention. Indeed, the bulk of the literature either ignores the issue or rather focuses on the (existence of a) ‘double dividend’ of environmental policies —i.e., pollution taxes fostering abatement while at the same time raising funds to reduce distortive taxes elsewhere; see De Mooij [10] for an overview. An exception is Parry [37], who analyzes the efficiency cost of financing subsidies by distortionary taxation, as well as the interaction of these taxes with pre—existing distortions in the economy. 3 These include, among others, savings on the amount of physical inputs used (e.g., in case of energysaving technologies) and savings on the firm’s environmental tax bill. 3 that the incidence of firms receiving windfall profits in environmental investment subsidy programs may be as high as 50—60% (e.g., Joskow and Maron [20], Malm [30], and Wirl and Orasch [54]). Therefore, studying whether and how the windfall profits in technology adoption subsidy programs can be mitigated is relevant in itself. But the analysis may provide useful insights into the feasibility of reducing transfers in other environmental subsidy programs as well. In this paper, we consider an industry composed of two types of firms that differ with respect to their abatement costs associated with the use of both the current (or old) abatement technology, and a new one that is on the market. As a result, the two firm types also differ with respect to their net benefits associated with adopting the new technology. Each firm knows its type, but the regulator only knows the number of firms of each type in the industry. The use of subsidies is motivated in our model by assuming that there are political economy costs that are an increasing function of the environmental tax level. Whereas such political economy costs are likely to be a convex function of the pollution tax rate, we take the simplified approach of assuming a step function: costs are zero below a certain tax level, and infinitely high above that level. We assume that the regulator wants all firms to adopt, but that the maximum tax level is too low to induce industry—wide adoption. Hence, subsidies are needed to complement the tax instrument.4 Given the two instruments available (the constrained pollution tax, and lump—sum 4 We assume that there are just two types of firms in the industry, and that both should be induced to adopt. However, our model can be generalized to the case of an industry consisting of M > 2 firm types, where the regulator wishes to induce K types to adopt (M ≥ K ≥ 2). K ≥ 2 implies that there are potential gains from separating the K types. The same analytical tools developed for analyzing the case M = K = 2 can be used for M > 2, independent of whether full adoption (M = K > 2) or partial adoption (M > K > 2) is socially optimal. We focus on M = K = 2 for expositional clarity but we check the robustness of our results in Section 5. 4 subsidies), the regulator may try to construct a menu of tax/subsidy combinations to induce firms to self—select (a separating policy), or just impose a uniform policy on all firms within the industry (i.e., where a specific pollution tax level and a specific subsidy level apply to all regulated firms).5 Intuitively, one may think that constructing such a self— selecting policy scheme is a formidable task, but also that if successful, such a separating policy must be unambiguously preferred to even the best uniform policy. Interestingly, we find the exact opposite. It is always feasible to construct a set of tax/subsidy combinations such that firms do self—select, but we also find that separating policies do not necessarily result in lower social costs than the best uniform policy. The intuition behind these results is as follows. Separating requires offering two tax/subsidy combinations, with the tax rate and the subsidy level being lower in one combination than in the other (if not, all firms would strictly prefer the one with the lower taxes and higher subsidies). Because of cost—effectiveness considerations6 , the government wants the type that needs the weakest incentives to adopt (i.e., the one that benefits most from adoption) to opt for the policy combination with the low subsidy level (and also the low tax rate), and the type that gains least to opt for the other combination. Offering a lower tax rate is socially costly because of the reduced incentives to abate, but if the associated subsidy level can be decreased sufficiently, net social costs may fall. But what determines whether a type actually chooses one policy combination or the other? In other words, under what circumstances is the government’s preferred policy 5 Note that because all firms are free to choose which tax/subsidy policy they wish to be regulated by, political feasibility may not be an issue here. As long as there is free choice and equal access, there is no unfair preferential treatment or discrimination involved. For examples of policy menus used in practice, see the Danish and Spanish cases discussed below. 6 That is, trying to induce all firms to adopt the new technology at minimum social cost. 5 incentive compatible? The key determinants here are the marginal abatement costs of the two types when using the new technology. The lower the marginal abatement costs, the more pollution is abated and hence the smaller is the pollution tax base. And the smaller the tax base, the larger the increase in the pollution tax rate firms are willing to accept for an extra dollar of subsidies. That means that the more efficient firms tend to prefer the higher tax/subsidy combination. So what does this tell us about the feasibility and desirability of separation? Because the low marginal abatement cost firms are more prone to choosing the high tax/subsidy combination, it is always possible to separate the two firm types by making the difference between the two tax/subsidy combinations sufficiently large. But if the low marginal abatement cost firms happen to be the ones that gain most from adoption (i.e., they are the ones that need the smallest incentives to adopt), separation implies paying the largest subsidies to the type that needs them least. Hence, a necessary condition for separation resulting in lower social costs is that the firm type with the lower marginal abatement costs is also the type that gains least from adopting. We focus on combinations of taxes and subsidies because of two reasons. The first is that the joint use of these two instruments has received a substantial amount of attention in the economics literature as a means to implement environmental policies in second— best situations due to the presence of asymmetric information, market power, etc. (see for example Carraro and Siniscalco [8], Dasgupta et al. [9], Kim and Chang [24], Lewis [27], Segerson [46], Sheriff [47], and Spulber [48]; see Alberini and Segerson [2] and Khanna [23] for useful overviews of this literature). The second reason is that many countries use combinations of taxes and subsidies to implement their environmental policies regarding 6 for example CO2 emissions reductions and habitat conservation, including, among others, Australia, Denmark, Finland, Italy, Japan, the Netherlands and the United States (see OECD/IEA 2008). Many of the policies in place are just uniform policies (in the sense of imposing the same tax rate on and offering the same subsidy level to all firms in the regulated industry). But given that the two instruments are used jointly already, these countries are likely to be willing to consider the advantages of offering a menu of tax/subsidy combinations. And indeed a couple of examples are available of policies that offer such menus in practice. Since 2000, the Danish Energy Agency offers firms in energy—intensive sectors (such as paper, food and beverages, and the steel industry) combinations of differentiated CO2 tax rates and differentiated subsidies (direct transfers, but also in the form of tax rebates) depending on the energy—saving technologies adopted (OECD [35]). And the Spanish government offers a menu of subsidies and feed—in tariffs to firms to stimulate renewable—resource—based electricity generation (UNEP [51]). Obviously, our proposal to use taxes and subsidies as instruments to reduce windfall profits in environmental subsidy programs does not mean that this is the only way in which the problem can be addressed. One alternative approach is to undertake extensive firm audits, trying to identify all possible investment projects that could be undertaken by each firm to reduce its emissions and then offering firms to compensate them for all costs involved. Indeed, in the period 1995—2000 the Danish Energy Agency tried to reduce Denmark’s emissions of CO2 by undertaking detailed audits of firms in several energy—intensive industries; see OECD [35]. All audited firms were compensated for their investments, and the level of detail of the information obtained was such that the windfall 7 profits obtained were very small — but at the cost of very high transaction costs indeed.7 This paper suggests that the Danish Energy Agency could also have decided to send auditors to a representative sample of the industry, asking them to identify types of firms, and then use combinations of subsidies and taxes to induce the socially optimal level of adoption. A second alternative approach is to use investment subsidies in combination with instruments other than taxes. However, the results we obtain using tax/subsidy combinations also hold when using quota/subsidy combinations instead. By focusing on investment subsidies and assuming that the regulator is constrained in the use of the tax rate, our model’s results are of direct relevance to all the programs where the purchase of environmentally friendly technologies is subsidized even if there are neither (sizable) positive spillovers from adoption nor information asymmetries regarding polluting input use (as is the case for many of the subsidized technologies in the US Energy Policy Act of 2005 (Public Law 109–58–Aug. 8 2005)). In these instances policy makers are effectively unwilling to charge the Pigovian tax to induce the socially optimal level of adoption (cf. Requate and Unold [43]), and this is captured in our model by assuming a maximum level for the tax rate below the Pigovian level. But our model is also likely to provide useful insights in other instances in which environmental subsidies are being used. Indeed, our characterization of the key factors determining desirability of separation is very general and is likely to be applicable to many other situations too. Two direct applications are the Utility Conservation programs as analyzed by Wirl [52], [53], and Wirl and Orasch [54], and the Payment—for—Ecosystem Services programs as analyzed by 7 This is illustrated by the fact that the Danish Energy Agency discontinued undertaking detailed audits from 2000 onwards (OECD [35]). 8 Wu and Babcock [55]. In both programs there is a conflict between cost effectiveness and incentive compatibility very similar to the problem we analyze. The remainder of the paper is organized as follows. In Section 2, we present the model. In Section 3, we explore the possible conflict between cost effectiveness and incentive compatibility. We do so by determining how behavior of the two firm types is affected by different tax/subsidy schemes, and how this difference in behavior can be used to induce separation. In Section 4, we identify the conditions under which separating can be welfare enhancing. In Section 5, we analyze the robustness of our results under more general assumptions, and our conclusions ensue in Section 6. For the reader’s convenience, a table with all the notation is presented in Appendix 1. The proof of the main result of the paper (Proposition 1) is in Appendix 2. 2 2.1 The model Technology description We consider an industry composed of two firm types (with at least one firm of each type). Firms of the two types emit a homogeneous pollutant but differ with respect to the efficiency with which they can abate emissions. The two firm types are indexed i = 1, 2, and emissions of firms of type i are denoted by ei > 0. Independent of its type, we assume that each firm produces 1 unit of emissions unless it engages in abatement (i.e, 0 < ei ≤ 1). Using qi to denote the number of firms of type i (qi ≥ 1 for i = 1, 2), the industry’s total amount of emissions is E = q1 e1 + q2 e2 ≤ q1 + q2 . Environmental damage D is assumed to be increasing in emissions (D(E)). Using prime notation to denote first and second derivatives, we assume D > 0 and D ≥ 0. 9 The costs of emission abatement depend on the type of technology used and on the amount of emissions abated. Both firm types currently use an abatement technology that results in type—specific abatement costs equal to Ci (ri ), where ri = 1 − ei is the amount of emissions abated when using this technology. We assume Ci (ri ) > 0, Ci (ri ) > 0 for ri > 0, and Ci (0) = 0, Ci (0) = 0. Without loss of generality, ‘firms of type 1’ have the lowest marginal abatement costs for each level of abatement using the current technology: C2 (r̄) > C1 (r̄) > 0 for all r̄ ∈ (0, 1]. There is also a new abatement technology which enables firms to abate emissions at lower marginal costs. For simplicity, the costs of adoption are assumed to be the same for the two firm types, and are equal to I > 0.8 The associated variable costs are firm—type specific and are denoted by Vi (Ri ), where Ri denotes the amount of emissions abated by firms of type i when using the new technology. Lower marginal cost implies 0 < Vi (r̄) < Ci (r̄) for all r̄ ∈ (0, 1]. Furthermore, we assume Vi (0) = 0, Vi (0) = 0, and Vi (Ri ) > 0 for all Ri ∈ (0, 1]. Regarding the higher order derivatives of Ci (ri ) and Vi (Ri ), we are silent for the moment on the relative values of the second derivatives — they play a key role in the analysis, and hence receive due attention later on — but assume that all third derivatives are negligible. 8 As will become clear below, assuming that firms face different investment costs complicates the analysis without adding new insights. All that matters is that firms (i) have different marginal abatement costs when using the new technology, and (ii) have different net private benefits of adoption (= gross benefits of adoption minus the investment costs). We come back to the consequences of relaxing the assumption of identical investment costs in Section 5. 10 2.2 The agents’ objectives There are two types of actors in this model: the firms and the regulator. Firms minimize private costs, choosing the technology type as well as the optimal abatement level. The regulator minimizes social costs, consisting of the sum of the costs incurred by each firm type, the environmental damage, and the deadweight losses associated with her policies. The regulator has two instruments to influence firm behavior: per—unit emission tax rates (τ ≥ 0) and lump—sum investment subsidies (S ≥ 0). Each firm takes these levels as given, and the associated compliance costs (Ki (S, τ )) are the result of its decisions with respect to investment and abatement: Ki (S, τ ) = min {(1 − φi ) [Ci (ri ) + τ (1 − ri )] + φi [Vi (Ri ) + τ (1 − Ri ) + I − S]} , (1) ri ,Ri ,φi where φi is an indicator function which equals 1 if firm type i adopts the new technology, and zero otherwise. The firm solves this minimization problem as follows. First, when using the old or new abatement technology, the optimal abatement levels are determined implicitly by Ci (ri ) = τ or Vi (Ri ) = τ , (2) respectively. Using ri (τ ) and Ri (τ ) to denote the solutions of (2), the firm then calculates the net benefits it obtains when using the new technology: Bi (S, τ ) ≡ Zi (τ ) + S − I, (3) where Zi (τ ) ≡ Ci (ri (τ )) − Vi (Ri (τ )) + τ (Ri (τ ) − ri (τ )) are the variable cost savings (abatement cost savings plus tax savings) associated with adoption (also referred to as 11 the private adoption benefits of firms of type i), and I − S are the net investment costs. Firms of type i set φi = 1 if Bi (S, τ ) ≥ 0, and zero otherwise. Regarding the information asymmetries between the regulator and the firms, we assume that each firm knows its type, but the regulator only knows the number of firms of each type in the industry (q1 and q2 ). The regulator can use taxes and subsidies (potentially differentiated between firm types) to influence the adoption and abatement behavior of firms, and aims to minimize the following social cost function: SC (S1 , S2 , τ 1 , τ 2 ) = 2 qi [(1 − φi ) Ci (ri ) + φi Vi (Ri ) + φi I] + D(E) + F (ST ) , (4) i=1 where ST = 2 i=1 qi φi Si are the total subsidies provided (conditional on adoption; φi = 1), and F (ST ) denotes the social costs of raising subsidies (with F (0) = 0, F > 0 and F ≥ 0).9 Given our two firm type setup, the interesting case for our study is the one in which full adoption of the new technology is socially optimal (see footnote 4). We therefore implicitly assume that marginal damage is sufficiently high or that investment costs are sufficiently low such that full adoption is optimal. That means that if the regulator sets the tax rate equal to the Pigovian level τ E = D (E), where D (E) = Vi (Ri ), all firms find it in their interest to adopt — even with zero subsidies (cf. Requate and Unold [43]). As stated in the introduction, we motivate the use of subsidies by assuming that there are ‘political economy costs’ which are zero for tax levels below a certain threshold τ 9 Note that we treat the tax revenues as a pure transfer between the firms and the government, and hence interpret the F function as the extra costs associated with raising funds. Alternatively, it can be argued that it is not the gross subsidies that should be taken into consideration, but the subsidies net of pollution taxes, F ST − 2i=1 qi τ i ei . We postpone discussing the consequences of this alternative specification to Section 5. 12 (where τ < τ E ), and infinitely high above it. That means that there is effectively a constraint on the tax rate: τ ≤ τ . A uniform policy that results in full adoption (with τ ∗ = τ and S ∗ = S̄ H ≡ I − min{Z1 (τ̄ ) , Z2 (τ̄ )}) gives rise to windfall profits for one type. But we assume that uniform policies that induce partial adoption or complete non—adoption are more costly: SC (0, τ̄ ) ≥ SC (S̄ L , τ̄ ) > SC(S̄ H , τ̄ ), (5) where S̄ L ≡ max{0, I − max{Z1 (τ̄ ) , Z2 (τ̄ )}} is the minimum subsidy needed to induce adoption by the type that gains most from investing. Since subsidies are socially costly, it may be in the interest of the regulator to give smaller subsidies to the firms that have the largest gross benefits of adoption. Of course, such separating policies need to be incentive compatible, that is they have to meet the following condition: Ki (Si , τ i ) ≤ Ki (Sj , τ j ) . (6) In words, a pair of tax/subsidy combinations is incentive compatible if firms (weakly) prefer the tax/subsidy scheme designed for its type.10 The timing of the model is as follows. First the regulator announces the terms of the policy. Second, firms respond by choosing the policy combination and decide whether or not they adopt. Next, they collect their subsidies (if any, and if they are entitled to receive them), and then they choose their abatement levels. 10 The regulator needs a mechanism to elicit firms’ private information. By the revelation principle (Fudenberg and Tirole [13]: 255), we can concentrate on direct mechanisms where the regulator asks the firms to report their type, i, and then sets a policy contingent on the report, τ i , S i , that induces each firm to reveal its true type, i. For simplicity, we assume that if a firm isindifferent between announcing any of the two types, then it announces the true one. We denote τ i = τ i , Si = S i . 13 3 Cost effectiveness and incentive compatibility For our purposes, the most important information required is (i) which firm type needs the largest subsidies to adopt (cost effectiveness) and (ii) which type prefers the policy combination with the largest subsidies and taxes (incentive compatibility). Interestingly, both are determined by the locations of the marginal abatement cost functions firms have when using the old and the new technology. Regarding cost effectiveness, let us first calculate the benefits of adoption using the standard graphical representation (cf. Milliman and Prince [32], Palmer et al. [38], and Requate and Unold [43]); see Figure 1. Suppose the regulator imposes a tax rate τ . When using the old (new) technology, firm i abates up to the point where the relevant marginal abatement costs (Ci (ri ) or Vi (Ri )) are equal to this tax rate; see (2). That means that given tax rate τ and when using the old (new) technology, the abatement costs of firm i are abc (ab c ) and its tax bill is bdd c (b dd c ). That means that the variable cost savings associated with adoption, Zi (τ ) , are equal to the area abb . And type i adopts if Zi (τ ) ≥ I − S. So which firm type’s private benefits of adopting are largest, those of type 1 or type 2? From Figure 1 we immediately see that the size of Zi (τ ) depends on the horizontal distance between type i’s marginal abatement cost functions when using the old and the new abatement technology, as reflected by the difference in abatement levels using the new and the old technology (Ri (τ ) − ri (τ )). Given that we assume that the third derivatives of the abatement cost functions are negligible, the horizontal distance at any tax level (or the difference in abatement activity, Ri (τ ) − ri (τ )) is larger the larger the difference in 14 $ Ci’ Vi’ Zi (τ) b τ a b’ d c c’ ri(τ) Ri(τ) d’ 1 1-ei Figure 1: Technological progress and the benefits of adoption. the slopes of the old and new marginal abatement cost functions. The formal condition for Z1 (τ ) being larger or smaller than Z2 (τ ) is given in Lemma 1. Lemma 1 Zi (τ ) > Zj (τ ) for all τ > 0 if and only if (Ci − Vi ) Cj Vj > Cj − Vj Ci Vi . Proof. Differentiating Zi (τ ) ≡ Ci (ri (τ )) − Vi (Ri (τ )) + τ (Ri (τ ) − ri (τ )) with respect to τ and using (2), we have Zi (τ ) = Ri (τ ) − ri (τ ) > 0. Then, Zi (τ ) = Ri (τ ) − ri (τ ) = 1 Vi − C1 = i Ci −Vi , Ci Vi which is a constant. Our assumptions Ci (0) = Vi (0) = 0, 0 < Vi (r̄) < Ci (r̄) for all r̄ ∈ (0, 1] and negligible third order derivatives imply Ci − Vi > 0, and hence Zi > 0. Since Zi > 0 and Zi (τ ) > 0 for all τ > 0, straightforward integration yields Zi (τ ) > Zj (τ ) for all τ > 0 if and only if Ci −Vi Ci Vi > Cj −Vj . Cj Vj Therefore, cost effectiveness dictates that the regulator gives the largest subsidies to type j if and only if Cj − Vj Ci Vi < (Ci − Vi ) Cj Vj . 15 To be able to determine whether cost effective policies are also incentive compatible, we describe firm behavior using two (sets of) functions for each firm type: its adoption barrier and its (set of) isocost curve(s). The adoption barrier is the combination of all policies (S, τ ) such that a firm of type i is indifferent between adopting and not adopting; Bi (S, τ ) = 0 (cf. (3)). An isocost curve consists of all (S, τ ) under which the firm incurs the same costs, dKi (S, τ ) = 0. The set of isocost curves represents the firm’s costs associated with each policy combination (S, τ ). The relevant characteristics of these functions are described in the following lemmas. Lemma 2 Each firm type’s adoption barrier Bi (S, τ ) = 0 is downward sloping in (S, τ ) dτ space ( dS < 0). Also, for τ > 0, Bi (S, τ ) > Bj (S, τ ) if and only if Zi (τ ) > Zj (τ ). Bi =0 Proof. Because 0 < Vi (r̄) < Ci (r̄) for all r̄ ∈ (0, 1], equation (2) yields Ri (τ ) > ri (τ ) for all τ > 0. Total differentiation of the adoption barrier Bi (S, τ ) = 0 (cf. (3)) yields: dτ −1 = < 0. dS Bi =0 Ri (τ ) − ri (τ ) (7) The second part of the Lemma is trivial, by (3). Lemma 3 The set of isocost curves of a firm of type i consists of weakly increasing functions in (S, τ ) space, differentiable in every point except on the adoption barrier Bi = 0, and such that costs (weakly) fall with lower taxes and higher subsidies. Also, at any (S, τ ) such that Bi > 0 for all i, the corresponding isocost curve for a firm of type i is strictly steeper than that of a firm of type j if Vi R < Vj R for all R ∈ (0, 1] . Proof. Using (1) and (2), we have ∂Ki /∂τ = (1 − φi ) (1 − ri ) + φi (1 − Ri ) > 0 and 16 ∂Ki /∂S = −φi ≤ 0. Setting dKi = 0, we have 0 dτ = 1 >0 dS dKi =0 1−Ri (τ ) if φi = 0 . if φi = 1 (8) To prove the second part of the lemma, note that Vi R < Vj R for all R ∈ (0, 1] implies Ri (τ ) > Rj (τ ) for all τ , by (2) . Therefore, we have φi = φj = 1. dτ dS dKi =0 > dτ dS dKj =0 > 0 if In Figure 2 we depict an example of the adoption barrier and one isocost curve for each firm type for the case (a) (C1 − V1 ) C2 V2 < (C2 − V2 ) C1 V1 and (b) V1 R < V2 R for all R ∈ (0, 1].11 For type i, barrier Bi = 0 divides the set of policies into two regions. To its right, type i’s net benefits of adoption are positive (Bi ≥ 0) and the firm decides to adopt (φi = 1); to its left, the net adoption benefits are negative and the firm decides not to adopt (Bi < 0; φi = 0). Increasing either the tax rate (along the vertical axis) or the subsidy level (along the horizontal axis) makes adoption more attractive. And (a) implies that type 1’s adoption boundary is located to the right of that of type 2 (by Lemmas 1 and 2).12 The isocost curves are strictly upward sloping in each type’s adoption region, and horizontal in its non—adoption region (as the firms are not entitled to receive any subsidies when they do not adopt); see Lemma 3. The further to the South—East an isocost curve is located, the lower the associated total costs (lower taxes, or larger subsidies). And (b) implies that for any tax/subsidy combination in the full adoption region (B1 (S, τ ) ≥ 0, 11 Our assumptions regarding the curvature of the abatement cost functions are such that the iso-cost curves and adoption barriers are, in fact, convex functions in (S, τ ) space. We have simplified this figure by drawing them as straight lines, but this does not affect the interpretation of the results in any way. 12 The only point of intersection of the two firm types’ adoption barriers is at τ = 0, since Z1 (0) = Z2 (0) = 0. Equation (3) then implies that Si = I for i = 1, 2. 17 τ K1 K2 Decrease in firms’ costs (S1, τ1) (0, τ2) B2=0 B1=0 I S Figure 2: The adoption boundaries and isocost curves, and a separating policy (for the case where V1 (R̄) > V2 (R̄) and (C1 − V1 ) C2 V2 < (C2 − V2 ) C1 V1 ). i.e. where φ1 = φ2 = 1), the corresponding isocost curve of type 1 is strictly steeper than that of type 2 (see Lemma 3). The economic interpretation is the following. Consider a one dollar increase in the amount of subsidies offered. Given this one dollar increase, the type that is ultimately most efficient (that is, when using the new technology) can incur a larger increase in the tax level and still remain at the same isocost curve. Emission taxes are paid over the amount of unabated emissions (1 − Ri ), and these are smallest (all else equal) for the type that is most efficient in abatement upon adoption. Figure 2 can be used to identify uniform policies as well as sets of separating policies that induce full adoption and are incentive compatible. Any uniform tax/subsidy combination on or to the northeast of the adoption barrier B1 = 0 will induce both firms to adopt. An example of a separating policy is the menu (S1 , τ 1 ) and (0, τ 2 ). Here, both firm 18 types are indifferent between the two tax/subsidy combinations and hence, by assumption, each firm chooses the combination targeted at its type.13 So, (S1 , τ 1 ) and (0, τ 2 ) are an example of a separating policy menu, where the type that needs largest subsidies to adopt (type 1) also ends up receiving the largest amount of subsidies. 4 Separating or uniform policies? The regulator has to find the combination of taxes and subsidies that minimizes the social cost function (4) , ensuring that both types adopt the new technology (φ1 = φ2 = 1), and that the incentive compatibility constraint (6) is met. The complete information policy is trivial. Since τ < τ E , the sum of abatement costs and external damages decreases if the tax is increased towards the Pigovian level, and hence τ c1 = τ c2 = τ (where superscript c denotes the optimal solution under complete information). And to ensure full adoption at minimum subsidies, Sic = max{0, I − Zi (τ )}. But clearly, this policy is not incentive compatible in case of asymmetric information: all firms would prefer the contract with the largest subsidy, since the tax rate τ is the same in both contracts. Under asymmetric information, the incentive compatibility constraint is trivially met when imposing a uniform policy where τ 1 = τ 2 = τ ∗ and S1 = S2 = S ∗ . Taxes should be set such that they give the maximum incentive to abate and adopt, and hence τ ∗ = τ . To ensure full adoption, we need S ∗ = I − min{Zi (τ ) , Zj (τ )}, so one firm type ends up obtaining windfall profits.14 13 The assumption of ‘in case of indifference, firms choose the policy combination targeted at their type’ is made for expositional simplicity only. To make firms strictly prefer the policy aimed at their type, we only need to move (S1 , τ 1 ) infinitesimally to the North-East into the region between K1 and K2 . 14 For example, in Figure 2 B2 = 0 is located to the South—West of B1 = 0. For any given tax rate, uniform subsidies need to be such that type 1 firms are indifferent between adopting and not adopting. 19 We now ask whether it is possible to reduce social costs (and therefore, total windfall profits) by designing a separating policy, such that each firm actually prefers the tax/subsidy combination targeted at its type. That is, we ask whether an incentive— compatible separating policy menu can be socially preferred to the optimal uniform policy. As shown in Section 3, the two crucial characteristics for separation are (i) which type is the most efficient upon adoption (as identified in Lemma 3), and (ii) which type gains most upon adoption (see Lemma 1). Because we arbitrarily assumed that type 1 has the lowest marginal abatement costs when using the old technology, we are left with three cases: (a) type 1 gains least when adopting but remains most efficient upon adoption, (b) type 1 gains most from adoption and also remains most efficient upon adoption, and (c) type 2 gains most when adopting and also becomes most efficient when adopting. These three cases can be labelled convergence, divergence and overtaking, respectively, for reasons that become obvious momentarily. The case of convergence (a) is presented in Section 4.1, and divergence (b) and overtaking (c) are discussed in Section 4.2. Practical issues regarding the implementation of separating policies are discussed in Section 4.3. 4.1 The case of convergence Under convergence, the initially most efficient firm type (type 1) remains the most efficient one upon adopting (that is, it has the lowest marginal abatement costs when using the new technology, V1 R̄ < V2 R̄ ), but its private benefits of adoption are smaller than type 2’s (that is, (C1 − V1 ) C2 V2 < (C2 − V2 ) C1 V1 ). This is illustrated in Figure 3. Hence, uniform policies need to be located on type 1’s adoption barrier (B1 = 0), which implies that type 2 firms receive a windfall profit. 20 $ C 2’ C1’ V2’ V1’ t r2(t ) r1(t ) R2(t ) R1(t ) 1 1-e Figure 3: The case of convergence. Here, when both types adopt, type 2 is still less efficient in abatement than type 1 but the difference in efficiency is smaller than before adoption; the two types’ abatement levels converge (that is, r1 − r2 > R1 − R2 ). And the adoption benefits of type 1 firms (Z1 ) equal the sum of the black and dark—grey areas, whereas the adoption benefits of type 2 firms (Z2 ) equal the sum of the dark—grey and light—grey areas (cf. Figure 1). Differences in abatement levels tend to fall when a technology matures, and hence convergence is likely to reflect the type of technological progress if there is a technical maximum to emission abatement. The closer one operates to this maximum, the smaller the gains that can be obtained from adoption (as is the case with incremental technological progress, as opposed to technological breakthroughs; cf. Antonelli [3]). An example of 21 convergence is provided by Jaffe and Stavins [17]. They estimate how the level of insulation implemented in housing by building firms in the 48 lower States of the US in the period 1979—1988 varies with a series of control variables, including lagged levels of insulation. The coefficient on the lagged insulation level is positive but less than 1 (it ranges between 0.3 and 0.4), implying that if the difference in insulation levels installed between two states is 1 unit in a specific year, the difference is smaller in the next year — all else equal. So, Jaffe and Stavins find that the building firms that initially install higher levels of insulation in new homes and buildings than other firms, still install higher insulation levels ten years later, but that the gap with other firms is reduced. Within the convergence case, two situations are possible. “Leapfrogging” implies that it is possible that firms of type 2 abate more in absolute terms than do firms of type 1, but only if they have adopted the new technology and firms of type 1 have not (R2 > r1 ). This is in fact the case presented in Figure 3. Alternatively, “closing the gap” implies that firms of type 1 always abate more, irrespective of the technology used by each type (r1 > R2 ). In terms of Figure 3, this would be the case where C1 and V2 swap location. Regarding the locations and shapes of the adoption barriers and the isocost functions of the two types in case of convergence, Lemmas 1—3 imply that B1 (S, τ ) ≥ B2 (S, τ ) and dτ dS dK1 =0 > dτ dS dK2 =0 > 0 for all (S, τ ) where B1 (S, τ ) > 0; see Figure 4.15 Here, (S ∗ , τ ) is the optimal uniform policy, and the first thing to note is that it cannot be part of a separating menu. A separating policy including (S ∗ , τ ) would have (0, τ 2 ) as its second policy combination, where (S ∗ , τ ) is intended for type 1 firms and (0, τ 2 ) for type 2 firms. 15 Note that when explaining the adoption barriers and iso—cost functions in Section 3, we actually used the case of convergence, cf. Figure 2. 22 τ K1’ K1 K2 (S*,τ) τ K2’ (S1, τ) (0,τ2’) (S2, τ2) B2=0 B1=0 I S Figure 4: Incentive—compatible separating policies around the least—cost uniform policy in case of convergence. Type 2 firms are indifferent between the two (as both policy points are located on K2 ) and hence are assumed to choose the latter. But type 1 firms strictly prefer not to adopt and choose (0, τ 2 ) because (S ∗ , τ ) is located strictly above K1 , the isocost function through (0, τ 2 ). Because (S ∗ , τ ) cannot be part of a separating policy, any attempt to decrease social costs by decreasing the amount of subsidies offered to type 2 firms necessarily goes at the expense of paying larger subsidies to type 1 firms. In other words, the policy targeted at type 1 firms should be located in their strict adoption region (B1 > 0) in order to be able to locate the policy targeted at type 2 firms in the non—adoption region of type 1 firms (B1 < 0 ≤ B2 ). Thus, starting from the uniform policy (S ∗ , τ ), the policy targeted at type 1 firms should include a (possibly infinitesimal) increase in the subsidy only; the 23 tax rate remains at τ . The tax cannot be increased, and it is not socially optimal to reduce it because of the associated increase in the sum of abatement costs and external damages. Since both types prefer this new policy (S1 , τ ) to the uniform one, the policy for type 2 firms should be adjusted to satisfy the incentive compatibility constraint (6). For policy (S1 , τ ), an incentive—compatible tax/subsidy combination targeted at type 2 firms is (S2 , τ 2 ). Both types are indifferent between (S1 , τ ) and (S2 , τ 2 ), and hence they are assumed to choose the policy targeted at their type.16 Note that increasing the subsidy for type 1 firms and decreasing the tax for type 2 firms increase social costs. However, decreasing the subsidy for type 2 firms decreases social costs. Therefore, separation is socially desirable if the latter effect dominates the former. The following proposition gives a sufficient condition for separating to be preferred to the uniform policy. Proposition 1 Under costly subsidies (F > 0) and convergence of the types’ abatement technologies, separating reduces social costs if the following condition holds: q1 1 (τ − D ) R2 < (r1 − R2 ) + q2 1 − R1 F (9) evaluated at the uniform policy (S ∗ , τ ) . Proof: see Appendix 2. So, the relevant factors include the number of firms of each type (q1 and q2 ), their abatement levels before (ri ) and after adoption (Ri ), the marginal social costs of raising funds (F (•)), and the difference between the maximum tax rate (τ ) and the actual 16 For any policy point in the region B1 (S, τ ) > 0, a second policy point can be found such that the two form a separating policy. That means that there are infinitely many separating policies; (S1 , τ̄ ) and (S2 , τ 2 ) is just one of them. 24 marginal damage, D . The first observation we can make from Proposition 1 is that because R1 < 1, τ < D , R2 > 0 and F > 0, r1 − R2 should be positive and sufficiently large in order for condition (9) to hold. That means that separation can only be socially preferred in case of “closing the gap” (see above). But this is a necessary condition, and not a sufficient one. If “closing the gap” is met, separation is more likely to be preferred (i) the larger the number of type 2 firms and the smaller the number of type 1 firms (resulting in q1 /q2 being smaller), (ii) the smaller the environmental distortion (D − τ ) and the larger the marginal social costs of raising subsidies (F ), and (iii) the larger the difference in efficiency in using the new technology (such that R1 >> R2 or, alternatively, R1 close to 1). Regarding (i), clearly the proportion of firms of each type matters. Separation reduces the amount of windfall profits received by the type 2 firms, but at the cost of giving sufficiently generous subsidies to firms of type 1 to bring them strictly into their adoption region. If there are relatively many firms of the former type, separating can be socially preferred to the least—cost uniform policy. Regarding (ii), the smaller D − τ , the less subsidies are required as higher taxes provide stronger incentives to adopt.17 But if the marginal social costs of raising subsidies are negligible (F close to zero), subsidies are virtually transfers and hence there is not much use in trying to reduce the amount of subsidies provided. Finally, concerning (iii), the relevance of a larger R1 and a smaller R2 is clear from Figure 4. R1 (R2 ) being large (small) implies K1 (K2 ) being steep (flat) —see Lemma 3— which allows τ 2 to be close to τ and S1 to be close to S ∗ . 17 Alternatively, the smaller τ , the less likely it is that separation is optimal. In the limiting case of τ = 0, both types need S = I to adopt and there is neither means nor reason to induce separation. 25 4.2 The cases of divergence and overtaking Let us now analyze the cases of divergence and overtaking. Divergence occurs if the initially most efficient firm type (type 1) remains the most efficient one upon adopting (that is, V1 R̄ < V2 R̄ ) and its private benefits of adoption are larger than type 2’s (that is, (C1 − V1 ) C2 V2 > (C2 − V2 ) C1 V1 , see Lemma 1). This is the case where the old (new) marginal abatement cost function of type 1 in Figure 3 is shifted up (down) sufficiently so that the horizontal distance between the two firm types’ new marginal abatement cost functions is larger than the horizontal distance between their old functions (i.e., r1 − r2 < R1 − R2 ). In other words, the difference in efficiency increases upon adoption. This case may arise, for example, if the labor force that uses the old technology most efficiently have skills that better match the requirements associated with using the new technology, so that they are able to make better use of the new technology than their peers. Divergence may therefore occur if existing know—how plays an important role in technology use. In contrast, overtaking occurs if the initially least efficient firm type becomes most efficient upon adoption (V1 R̄ > V2 R̄ ) — by definition it is then also the type that gains most from adoption. In terms of Figure 3 this case can be represented by swapping the labels of V1 and V2 (so that r1 − r2 > 0 > R1 − R2 ). This situation may occur if firms need to invest in learning how to use the new technology. Whereas the expected benefits for the “technological leader” (type 1) are too small to invest in developing this knowledge, they are likely to be much larger for the “lagging firm” (type 2), and hence the latter ends up running the new technology (much) more efficiently than the former 26 (for a similar argument see for example Brezis et al. [7]). Or it may be the case that abatement technologies are run on the basis of rules of thumb rather than that operating procedures are continuously re—optimized, with the shock of using a new technology being sufficiently large for the type 2 firms’ managers to thoroughly reconsider their modes of operation, but not for those of type 1 firms (cf. Porter and van der Linde [39]). Examples of both divergence and overtaking can be found in the literature. Kerr and Newell [22] analyze how the US lead phasedown policy induced adoption of cleaner technologies (specifically the adoption of the so—called pentane—hexane isomerization technology). Using panel data on US refineries in the period 1971—1995, they find that initially low—cost abatement firms gain more from adopting cleaner technologies than the initially high—cost abatement firms, which they attribute to marginal abatement cost functions shifting out in a fashion we have labeled divergence (see also their Figure 1 on page 319, which is a specific case of our Figure 1). They find that refineries using relatively complex technologies beforehand (more specifically, those that had catalytic reforming capacity in 1971) are more prone to adopting the new technology because of the in—house availability of specialized knowledge. An example of overtaking is the case of Flue Gas Desulfurization (FGD) as presented by Bellas [5]. Tabulating the data of sulfur content in units with and without FGD, Bellas finds that the propensity to adopt FGD technology is higher for the firms with the higher initial levels of average sulfur content (see his Table 1 on page 330). Assuming continuous marginal abatement cost functions, we can infer that the firms with the lower initial marginal abatement costs (resulting in lower emission levels) are less prone to adopt the new abatement technology, resulting in the least efficient firms becoming most efficient 27 upon adopting. Let us now analyze how our results change under these two alternative cases. They have in common that the ultimately most efficient type also has the largest benefits of adopting. Because it faces the lowest marginal abatement costs when using the new technology it is more prone to choosing the high tax/subsidy combination, whereas it is actually the type that needs least subsidies to adopt. As a result, we find that separating policies are always more costly than the best uniform policy in case of either divergence, or overtaking. We show this graphically in Figure 5. In this figure, j identifies the firm type that uses the new technology most efficiently and also gains most when adopting (and i the other firm type). In case of overtaking i = 1 and j = 2; under divergence, i = 2 and j = 1. Separation can be induced by increasing the subsidy offered to the ultimately most efficient type from S ∗ to Sj and by offering a second tax/subsidy combination along Ki (Sj , τ ) to the South—West of (Sj , τ ). Obviously this second tax/subsidy combination cannot be in the region Bi < 0 because then type i would not adopt. The least cost approach is then to locate (Si , τ i ) on Bi = 0 (to the South—East of (S ∗ , τ )). As a result, in case of divergence and overtaking separation always requires an increase in the total amount of subsidies provided (Sj > Si > S ∗ ) and with lower incentives to actually abate for the ultimately least efficient type (τ i < τ ), thus exacerbating the environmental distortion. 4.3 Separating policies in practice Having derived the conditions for separating policies to be socially preferred, the next question is how the regulator can obtain the required information to construct these 28 τ Kj Ki (S*,τ) τ (Sj, τ) (Si, τi) Bj=0 Bi=0 I S Figure 5: The cases of divergence (i=2, j=1) and overtaking (i=1, j=2). policies. The analysis in this subsection gives insight into the steps the regulator needs to take to obtain such information. First of all, the regulator should determine what case of technological progress is relevant. This is not hard to determine; engineers will be able to give an assessment what firms are likely to gain most when adopting the new technology — the ones that are currently most efficient in abatement, or the ones that are least. For example, if the new technology is very complicated to operate requiring a high level of specific knowledge, it is likely that either divergence or overtaking is the relevant case. Divergence because the firms currently using fairly advanced technologies have the in—house knowledge to successfully run the new technology (as is the case with the pentane—hexane isomerization technology as analyzed by Kerr and Newell [22]), or overtaking because the firms using 29 fairly advanced technologies stand to gain fairly little from using the new technology so that they decide not to invest in acquiring the knowledge needed to run the new technology (as in the case of the FGD technology analyzed by Bellas [5]). The analysis in Section 4.2 shows that in the case of pentane—hexane isomerization and FGD technologies, uniform policies are preferred to separating policies. This is relevant information for regulators in various countries where governments use (or have used) subsidies (oftentimes in combination with taxes) to phase out lead in gasoline (cf. Lovei [29]: 23) and/or to reduce emissions of SO2 (Bohi and Burtraw [6]: 74). Separating by means of tax/subsidy menus may be socially preferred, however, in case of convergence (as shown in Section 4.1), and this may well be the case that occurs with a higher frequency than either divergence or overtaking. An example of convergence was provided by Jaffe and Stavins [17] as their regression results suggest that insulation levels installed by construction firms converge over time. Again, many countries use subsidies to increase insulation levels in new residences and buildings (OECD/IEA [36]), and for these governments the relevant next question is whether technological progress is ‘closing the gap’ or of the leapfrogging type. An example in point is Spain that tries to stimulate residential insulation by offering a combination of minimum standards regarding energy savings (insulation, the use of solar power) and investment subsidies to building corporations (OECD/IEA [36]). Whereas engineers are likely to readily assess (or even infer from the literature as in the cases described above) whether a specific technology is expected to give rise to divergence, convergence or overtaking, determining whether convergence results in leapfrogging or closing the gap is much more information intensive. To assess this, information is needed 30 about the pre— and post—adoption abatement levels of the various types. Indeed, a necessary condition for separating to be preferred to uniform policies is that r1 > R2 . Again, regulators may be able to learn from other countries’ experience. For example, tradable permit systems implemented in other jurisdictions can reveal which types of firms tend to adopt the new technologies and whether they become net suppliers or buyers of permits (see also Malueg [31]). But in some instances site visits may be the only way to obtain such information.18 If regulators cannot draw on other countries’ experiences, the analysis in Section 4.1 suggests that the least—cost approach is then to next assess how likely it is that separating will provide savings — by assessing whether condition (9) is likely to hold. That means that the regulator needs to assess (i) how large the marginal costs of raising funds are, (ii) how large the marginal damages are (or, more specifically, how large the difference is between marginal damages and the maximum tax rate that she can impose), and (iii) how large the share is of the currently most efficient firms. If it is likely that condition (9) holds, it is expected to be worthwhile to send engineers to firms for extensive site visits in order to determine whether technological progress is likely to result in ‘closing the gap’ or ‘leapfrogging’, and also what the expected abatement levels are of the various firms types when using the new technology (i.e., R1 , R2 and R2 ; see condition (9)). 18 From all the possible policy instruments implemented by other countries, tradable permits convey the most information to other regulators because they reveal both adoption behavior as well as the relative positions of the marginal abatement cost functions across types (because some firms will become net buyers and others net sellers of permits). But there are also other sources of information. For example, Jaffe and Stavins [17] include the price of energy in their regression analysis explaining insulation levels, and thus information is available about the relationship between energy prices changes (caused by international price developments, or by energy taxes) on the level of insulation installed — information which can be used to trace the firms’ marginal abatement cost functions. To the extent that the application of insulation techniques is sufficiently similar across jurisdictions, studies that analyze the determinants of technology adoption also contain useful information for regulators. 31 Clearly, auditing a sample of firms in the regulated industry is costly. However, the savings in the social costs of raising funds may well be such that the information collection effort is a profitable investment. 5 Robustness of the results In the previous section we have identified the necessary condition for separation to be the least—cost solution; technological progress should “close the gap” but even then the uniform policy may still be socially preferable. We derive these conclusions using a simple model; most of the assumptions serve the purpose of making the model easier but are not crucial to our results. In this section we show the robustness of our results against relaxing some of them. First, we consider what happens if we relax the assumption that all firms face the same investment costs. The direct consequence of relaxing this assumption is that the horizontal intercepts for the adoption barriers in (S, τ ) space no longer coincide (see Figure 2, 4 or 5), resulting either in (i) the one barrier being located strictly to the North—East of the other for all τ ≥ 0 (rather than weakly), or (ii) the two adoption barriers intersecting in (S, τ ) space. Clearly, case (i) leaves our analysis unaffected; the resulting situation can still be classified as convergence, divergence, or overtaking. Case (ii) is a little more subtle, because the resulting situations can then no longer be globally referred to with labels such as convergence, divergence and overtaking. But these labels are still relevant locally: for a given set of adoption barriers and isocost functions, the relevant case may be convergence for a specific level of τ̄ but divergence if the tax threshold is slightly higher or lower. And hence all our analyses remain valid; all that matters is whether or not the 32 type that needs the largest adoption subsidies (when confronted with tax rate τ̄ ) is also the one that is more prone to choosing the high tax/subsidy combination. Also, making the costs of raising funds dependent on the net subsidies provided (that is, the total subsidies paid minus the tax revenues raised) does not fundamentally affect our results. Our analysis in Sections 3 and 4 uses the gross subsidy specification, and hence reducing the tax rate (i.e., below τ̄ ) for the type that uses the new technology less efficiently is costly only because it gives lower incentives to abate at the margin. If we use the net subsidy specification there is the additional cost in the form of raising less funds to cover the cost of subsidization, and hence separation is even less likely to be socially preferred. That means that assuming that the costs of raising funds are a function of net subsidies instead of gross subsidies reduces the probability that separating policies prove to be welfare enhancing. Our conclusion that separation is always feasible but may not be desirable is again reinforced when allowing for three firm types — or more. Let us assume that type 3 firms are the ones that gain least from adoption. Considering the consequences of having three types in the regulated industry, two cases are relevant here: (a) the third type should not be induced to adopt (i.e., partial adoption is optimal), or (b) it should. In (a), the presence of type 3 firms does not affect the location of the uniform policy (S ∗ , τ̄ ). At this policy point type 3 firms strictly prefer not to adopt. So they are charged tax rate τ̄ but do not collect subsidy S ∗ . Because the uniform policy is the least costly option in case of divergence and overtaking, the presence of a third type does not affect any of the results in these two cases. But it will have an impact in case of convergence, because overinvestment needs to be 33 prevented. The separating policy should be such that type 3 firms do not want to adopt and hence also do not claim a subsidy. If (S1 , τ̄ ) in Figure 4 is located in the B3 < 0 region (not shown), type 3 firms prefer the other policy point (S2 , τ 2 ) and hence offering an additional tax/subsidy combination (0, τ 2 ) results in complete separation of the three types. If (S1 , τ̄ ) is located in the B3 ≥ 0 region, type 3 firms need to be induced not to select this policy, and this may even require offering a third policy combination (0, τ 3 ), with τ 3 < τ 2 — if K3 is sufficiently steep and/or B3 = 0 is sufficiently close to B1 = 0. In either case there is an additional welfare cost as compared to the uniform policy because type 3 firms are charged a tax rate below τ̄ . And hence the likelihood of separating being the least—cost option is smaller if partial adoption is optimal.19 If the third type should be induced to adopt (case (b)), separation is again always feasible, but may or may not be desirable. Here, the uniform policy is affected, obviously, because of the presence of firms of type 3 that need larger subsidies to adopt. In principle, this raises the stakes of being able to separate the three types. And separation is still always feasible. But it would require offering even larger subsidies to the type that is most efficient upon adopting and also reducing the tax rates faced by the other two types. If the type that is ultimately most efficient in using the new technology is not the one that needs the largest subsidies to adopt (cf. divergence and overtaking), separating is again never the least—cost solution. If this ultimately most efficient type does need the largest subsidies, however, separation may be welfare enhancing, and similar considerations play 19 Note that partial adoption in the two-types model is trivial. By offering policies (0, τ̄ ) and (S, τ̄) —where S = I − max{Z1 (τ̄), Z2 (τ̄)}— the two types are separated efficiently. 34 a role as in Proposition 1. 6 Conclusions We have analyzed the problem that (lump—sum) investment subsidies can be highly effective in inducing adoption of new technologies —e.g., emission abatement equipment— but that such subsidies may result in some firms earning windfall profits (or informational rents). Firms differ with respect to the private benefits they reap from switching to a new technology, and hence a subsidy that renders adoption weakly profitable for some firms results in positive net gains for others. Given that providing subsidies is socially costly, the question arises whether the regulator can design a menu of policies consisting of taxes and subsidies that induces firms to self—select according to the minimum amount of money they need to adopt the new technology. Using a stylized model of two types of firms, we show that the ultimately most efficient firms are (weakly) more likely to prefer higher tax/subsidy combinations than the ultimately least efficient ones. That means that for policy menus to be incentive compatible, the regulator should offer larger subsidies to (but also impose higher tax rates on) those firms that are more efficient in emission abatement upon adoption —irrespective of whether these firms are indeed the ones that need the largest subsidy to adopt. But only when these ultimately most efficient types are also the ones that gain least upon adopting, separating policy schemes may be socially preferred to uniform policies. This incentive compatibility condition enables us to identify the situations in which a separating policy scheme results in lower social costs than the best uniform scheme. Here, the differences in marginal abatement costs within and between firms play a key 35 role. In fact, we find that separating policies are socially preferred neither if full adoption results in a divergence of the firms’ levels of abatement efficiency, nor in case of overtaking. In those cases, incentive compatibility requires giving a larger subsidy to the type that gains most when adopting, which is not cost—effective. Then, uniform policies are always cheaper. Separating policies can be socially preferred in case of convergence of abatement efficiencies. Here, the initially most efficient type is also most efficient upon adopting, but its private adoption benefits are smaller than those for the initially least efficient type. We find that a necessary condition for separation to be least cost is that the initially most efficient firm type always has lower marginal abatement costs than the other type, even when the latter invests and the former does not (i.e., under “closing the gap”). If this necessary condition is met, we find that separation is more likely to be least cost (i) the smaller the share of ultimately most efficient type in the industry, (ii) the closer the emission tax rate can be set to actual marginal damage, and (iii) the larger the difference in abatement efficiencies upon adopting. Whereas we find that uniform policies are better than separating policies in two out of three cases, it may well be the case that convergence occurs much more frequently than divergence or overtaking. That means that regulators need to assess, on a technology— to—technology basis, whether it is worthwhile to invest in collecting the data needed to be able to construct separating policy menus. Our analysis suggests that in some instances information is readily available in the environmental economics literature obviating the necessity to collect the data because the technology under consideration gives rise to divergence or overtaking (e.g., Kerr and Newell [22] and Bellas [5]). If not, a sample of 36 firms may need to be audited to be able to identify firm types and to infer the slopes and locations of the various types’ marginal abatement cost functions (before and after adoption). Clearly, the costs of auditing a sample of firms in the regulated industry can be considerable, and this paper identifies the steps that need to be taken to ensure information is collected cost—effectively. Ultimately, the savings in the social costs of raising funds may well be such that the information collection effort is a profitable investment. 7 Appendix 1 Table of symbols. Bi (S, τ ) Ci (ri ) D (E) ei 2 E= qi ei Firm type i’s net benefits of adoption Abatement costs of a firm of type i under the old technology External damages associated with total emissions in the industry Emission level of a firm of type i Total emissions in the industry i=1 φi F (ST ) I Ki (S, τ ) qi ri Ri Si 2 ST = qi φi Si Variable indicating whether type i adopts (φi = 1), or not (φi = 0) Social costs of raising subsidies Costs of adopting the new technology The minimum total abatement costs Number of firms of type i Emission abatement level of a firm of type i under the old technology Emission abatement level of a firm of type i under the new technology Subsidy level intended for a firm of type i Total amount of subsidies provided in the industry i=1 τi τE τ Vi (Ri ) Zi (τ i ) Tax rate intended for a firm of type i Pigovian tax rate Tax rate below (above) which the political economy costs of taxation are zero (infinite), τ < τ E Abatement costs of a firm of type i under the new technology Variable cost savings of adoption for a firm of type i 37 τ K1 τ dτ2 τ2 dS1 1 dS1’ (S1, τ) (S*,τ) (S2, τ2) K2 (S*+1, τ2) dS2 B2=0 S* B1=0 I S*+1 S Figure 6: Graphical representation of the approach taken for the proof of Proposition 1. 8 Appendix 2 Proof of Proposition 1. We present the proof with the help of Figure 6. Starting from the uniform policy (S ∗ , τ ), we identify a separating menu of policies such as (S1 , τ ) and (S2 , τ 2 ) intended for type 1 and 2 firms respectively. The social cost consequences of moving from (S ∗ , τ ) to (S1 , τ ) and (S2 , τ 2 ) depends on dτ 2 and dS1 (which increase social costs) as well as on dS2 (which reduces social costs). To induce separation, we start by increasing the subsidy intended for type 1 firms by a very small amount; say one dollar (S ∗ + 1). The corresponding decrease in the tax (dτ 2 ) that still induces adoption by type 1 firms is given by (7): dτ 2 = − 1 < 0. R1 − r1 38 (10) The extra subsidy that type 1 firms (dS1 = 1 + dS1 ) require to be indifferent between the policies (S ∗ + 1, τ 2 ) and (S1 , τ ) can be obtained by using (8) and (10): dS1 = 1 + dS1 = 1 + 1 − R1 1 − r1 = > 0. R1 − r1 R1 − r1 (11) We can now construct (S2 , τ 2 ). First note that type 2 firms’ costs in (S1 , τ ) are lower than under the uniform policy; dK2 = −dS1 . To find S2 , type 2 firms must be indifferent between policies (S1 , τ ) and (S2 , τ 2 ). Therefore, the cost saving associated with moving to (S2 , τ 2 ) should also be −dS1 : ∂K2 ∂K2 dτ 2 + dS2 = −dS1 . ∂τ ∂S (12) By (1), ∂K2 /∂S = −1 and ∂K2 /∂τ = 1 − R2 . Substituting these expressions in (12), and using (10) and (11), we obtain: dS2 = R2 − r1 , R1 − r1 (13) which is negative if and only if R2 < r1 . We now consider the corresponding change in social costs due to the changes (dτ 2 , dS1 , dS2 ), evaluated at the uniform policy (S ∗ , τ ) : dSC = q2 (τ − D ) R2 dτ 2 + F [q1 dS1 + q2 dS2 ] . (14) Substituting (10) , (11) and (13) in (14), we have: dSC = 1 {−q2 (τ − D ) R2 + F [q1 (1 − R1 ) + q2 (R2 − r1 )]} , R1 − r1 which is negative if: F [q1 (1 − R1 ) + q2 (R2 − r1 )] < q2 (τ − D ) R2 . Rearranging terms, we obtain the desired result. 39 (15) References [1] Aidt, T.S., 1998. Political Internalization of Economic Externalities and Environmental Policy. Journal of Public Economics 69, 1—16. [2] Alberini, A., Segerson, K., 2002. Assessing Voluntary Programs to Improve Environmental Quality. Environment and Resource Economics 22, 157—184. [3] Antonelli, C., 2003. The Economics of Innovation, New Technologies and Structural Change. New York: Routledge. [4] Baumol, W.J., Oates, W.E., 1988. The Theory of Environmental Policy. Cambridge: Cambridge University Press. [5] Bellas, A.S., 1998. Empirical Evidence of Advances in Scrubber Technology. Resource and Energy Economics 20, 327—343. [6] Bohi, D.R., Burtraw, D., 1997. SO2 Allowance Trading: How do expectations and experience measure up? Electricity Journal 10(7), 67—75. [7] Brezis, E.S., Krugman, P.R., Tsiddon, D., 1993. Leapfrogging in International Competition: A Theory of Cycles in National Technological Leadership. American Economic Review 83 (5), 1211—1219. [8] Carraro, C., Siniscalco, D. 1996. Voluntary Agreements in Environmental Policy: A Theoretical Appraisal. in A. Xepapadeas (ed), Economic Policy for the Environment and Natural Resources: Techniques for the Management and Control of Pollution, Cheltenham: Edward Elgar, 80—94. 40 [9] Dasgupta, P., Hammond, P., Maskin, E. 1980. On Imperfect Information and Optimal Pollution Control. Review of Economic Studies 47, 857—860. [10] De Mooij, R., 2002. The Double Dividend of an Environmental Tax Reform. In van den Bergh, J.C.J.M. (ed.). Handbook of Environmental and Resource Economics. Cheltenham: Edward Elgar, 293—306. [11] Dietz, F.J., Vollebergh, H.R.J., 1999. Explaining Instrument Choice in Environmental Policies. In van den Bergh, J.C.J.M. (ed.). Handbook of Environmental and Resource Economics. Cheltenham: Edward Elgar, 339—351. [12] Fredriksson, P.G., 1997. Environmental Policy Choice: Pollution Abatement Subsidies. Resource and Energy Economics 20, 51—63. [13] Fudenberg, D., Tirole, J., 1992. Game Theory. Cambridge (MA): MIT Press. [14] Fullerton, D., Mohr, R., 2003. Suggested Subsidies are Suboptimal Unless Combined with an Output Tax. BE Journal of Economic Analysis and Policy 2(1), Article 1. [15] Fullerton, D., Wolverton, A., 1999. The Case for a Two—Part Instrument: Presumptive Tax and Environmental Subsidy. In Panagariya, A., Portney, P.R., Schwab, R.M., eds. Environmental and Public Economics: Essays in Honour of Wallace E. Oates. Cheltenham: Edward Elgar. [16] Jaffe, A.B., Newell, R.G., Stavins, R.N., 2002. Environmental Policy and Technological Change. Environmental and Resource Economics 22, 41—69. 41 [17] Jaffe, A.B., Stavins, R.N., 1995. Dynamic Incentives of Environmental Regulations: The Effects of Alternative Policy Instruments on Technology Diffusion. Journal of Environmental Economics and Management 29(3), S43—S63. [18] Jenkins, G.P., Lamech, R., 1992. Fiscal Policies to Control Pollution: International Experience. Bulletin for International Fiscal Documentation 46, 483—502. [19] Johnson, P., McKay, S., Smith, S., 1990. The Distributional Consequences of Environmental Taxes. IFS Commentaries C23, London: Institute for Fiscal Studies. [20] Joskow, P.L., Maron, D.B., 1992. What does a Negawatt Really Cost? Evidence from Utility Conservation Programs. The Energy Journal 13(4), 46—55. [21] Jung, Ch., Krutilla, K., Boyd, R., 1996. Incentives for Advanced Pollution Abatement Technology at the Industry Level: An Evaluation of Policy Alternatives. Journal of Environmental Economics and Management 30, 95—11. [22] Kerr, S., Newell, R.G., 2003. Policy—Induced Technology Adoption: Evidence from the U.S. Lead Phasedown. Journal of Industrial Economics 51(3), 317—343. [23] Khanna, M., 2001. Non—Mandatory Approaches to Environmental Protection. Journal of Economic Surveys 15(3), 291—324. [24] Kim, J.C., Chang, K.B., 1993. An Optimal Tax/Subsidy for Output and Pollution Control under Asymmetric Information in Oligopoly Markets. Journal of Regulatory Economics 5, 183—197. 42 [25] Laffont, J.J., Tirole, J., 1996a. Pollution Permits and Compliance Strategies. Journal of Public Economics 62, 85—125. [26] Laffont, J.J., Tirole, J., 1996b. Pollution Permits and Environmental Innovation. Journal of Public Economics 62, 127—140. [27] Lewis, T.R., 1996. Protecting the Environment when Costs and Benefits are Privately Known. RAND Journal of Economics 27(4), 819—847. [28] Loeb, M., Magat, W.A., 1979. A Decentralized Method for Utility Regulation. Journal of Law and Economics 22, 399—404. [29] Lovei, M., 1998. Phasing Out Lead from Gasoline: Worldwide Experience and Policy Implications. World Bank Technical Paper 397, Washington DC: The World Bank. [30] Malm, E., 1996. An Actions—Based Estimate of the Free Rider Fraction in Electric Utility DSM Programs. The Energy Journal 17(3), 41—48. [31] Malueg, D.A., 1989. Emission Credit Trading and the Incentive to Adopt New Pollution Abatement Technology. Journal of Environmental Economics and Management 16(1), 52—57. [32] Milliman, S.R., Prince, R., 1989. Firm Incentives to Promote Technological Change in Pollution Control. Journal of Environmental Economics and Management 17, 247— 265. 43 [33] Nelissen, D., Requate, T., 2007. Pollution—Reducing and Resource—Saving Technological Progress. International Journal of Agricultural Resources, Governance and Ecology 6, 5—44. [34] OECD, 1994. Applying Economic Instruments to Environmental Policies in OECD and Dynamic Non—Member Countries. Paris: Organization for Economic Cooperation and Development. [35] OECD, 2003. Voluntary Approaches: Two Danish Cases. Paris: Organization for Economic Co—operation and Development. [36] OECD/IEA, 2008. Policies and Measures Data Base: Climate Change. Paris: Organization for Economic Co—operation and Development / International Energy Agency. [37] Parry, I.W.H., 1998. A Second—Best Analysis of Environmental Subsidies, International Tax and Public Finance 5(2), 153—170. [38] Palmer, K., Oates, W.E., Portney, P., 1995. Tightening Environmental Standards: The Benefit Costs or the No—Cost Paradigm? Journal of Economic Perspectives 9(4), 119—32. [39] Porter, M.E., van der Linde, C., 1995. Towards a New Concept of the Environmental— Competitiveness Issue. Journal of Economic Perspectives 9(4), 97—118. [40] Requate, T., 1995. Incentives to Adopt New Technologies under Different Pollution— Control Policies. International Tax and Public Finance 2, 295—317. 44 [41] Requate, T., 1998. Incentives to Innovate under Emission Taxes and Tradable Permits. European Journal of Political Economy 14, 139—165. [42] Requate, T., 2005. Dynamic Incentives by Environmental Policy Instruments: A Survey. Ecological Economics 54, 175—195. [43] Requate, T., Unold, W., 2003. Environmental Policy Incentives to Adopt Advanced Abatement Technology: Will the True Ranking Please Stand Up? European Economic Review 47(1), 125—146. [44] Ruggeri, G., 1999. The Marginal Cost of Public Funds in Closed and Small Open Economies. Fiscal Studies 20(1), 41—60. [45] Schneider, S.H., Goulder, L.H., 1997. Achieving Low—Cost Emissions Targets. Nature 389(4 September), 13—14. [46] Segerson K., 1999. Do Voluntary Approaches lead to Efficient Environmental Protection? European Research Network on Voluntary Approaches, CAVA, Working Paper No. 99:10. [47] Sheriff, G., 2008. Optimal Environmental Regulation of Politically Influential Sectors with Asymmetric Information. Journal of Environmental Economics and Management 55, 72—89. [48] Spulber, D.F., 1988. Optimal Environmental Regulation under Asymmetric Information. Journal of Public Economics 35, 163—181. 45 [49] Stranlund, J., 1997. Public Technological Aid to support Compliance to Environmental Standards. Journal of Environmental Economics and Management 34, 228—239. [50] Stavins, R.N., 2003. Experience with Market—Based Environmental Policy Instruments. In Mäler, K.G., Vincent, J. (eds.). The Handbook of Environmental Economics. North—Holland: Elsevier Science. [51] UNEP, 2008. Reforming Energy Subsidies. New York: United Nations Environment Programme. [52] Wirl, F., 1999. Conservation Incentives for Consumers. Journal of Regulatory Economics 15(1), 23—40. [53] Wirl, F., 2000. Lessons from Utility Conservation Programs. Energy Journal 21(1), 87—108. [54] Wirl, F., Orasch, W., 1998. Analysis of United States’ Utility Conservation Programs. Review of Industrial Organization 13(4), 467—486. [55] Wu, J.J., Babcock, B.A., 1996. Contract Design for the Purchase of Environmental Goods from Agriculture. American Journal of Agricultural Economics 78, 935—945. 46
© Copyright 2026 Paperzz