Journal of Public Economics 74 (1999) 97–139 On the determinants of fiscal centralization: Theory and evidence Ugo Panizza* Office of the Chief Economist, Inter-American Development Bank, Stop W-0436, 1300 New York Avenue NW, Washington DC 20577, USA Received 1 April 1998; received in revised form 1 September 1998; accepted 1 December 1998 Abstract This paper presents a simple model that unifies most of the results of the literature on fiscal federalism. The model describes an economy characterized by two levels of government, one public good, and a private good. The predictions of the model are tested by using a new set of measures of fiscal centralization. The main findings are that country size, income per capita, ethnic fractionalization, and level of democracy are negatively correlated with the degree of fiscal centralization. The model is tested using OLS, Tobit, and semi-parametric estimators. The paper also shows that the variables included in the regression are helpful in the prediction of changes in the level of centralization. 1999 Elsevier Science S.A. All rights reserved. Keywords: Political economy; Median voter; Fiscal federalism; Decentralization JEL classification: D720; H110; H710; H770 1. Introduction Across countries we observe very different institutional arrangements and very different levels of fiscal centralization. The recent years have witnessed strong movements toward decentralization and secession (Argentina, Colombia, Ethiopia, the republics of the former Soviet Union, Yugoslavia, India, Czechoslovakia, *Tel.: 11-202-623-1427; fax: 11-202-623-2481. E-mail address: [email protected] (U. Panizza) 0047-2727 / 99 / $ – see front matter 1999 Elsevier Science S.A. All rights reserved. PII: S0047-2727( 99 )00020-1 98 U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 Canada, Belgium, Italy and Spain). At the same time, a divided country has reunified (Germany), large free-trade zones have been created, and there have been important steps towards the economic and monetary union of some European countries. It is therefore natural that decentralization has been at the center of the political and economic debate in many countries. This paper attempts to identify empirical regularities explaining cross-country differences in the level of fiscal centralization. To set the stage for the empirical work, Section 2 presents a stylized model describing an economy with two levels of government, one public good, and one private good. The central government obtains utility from the size of its budget and the equilibrium level of centralization is determined by a sequential game where the central government moves first. The key findings of the theoretical model are that the level of fiscal centralization is inversely correlated with: (i) country size; (ii) income per capita; (iii) tastes differentiation; and (iv) level of democracy. The last point is particularly interesting because it may account for the wave of decentralization that has followed the end of the Cold War. The existing literature on fiscal centralization can be divided into three main branches. The first branch studies the optimal division of powers between the central and local governments (Musgrave, 1959; Oates, 1972). One of the main results of this branch of the literature is the Decentralization Theorem (Oates, 1972) that identifies the conditions under which it is more efficient for local governments to provide the Pareto-efficient levels of output for their respective jurisdictions than for the central government to provide an uniform level of output across all jurisdictions. One of the corollaries of the decentralization theorem is that the benefits of decentralization are positively correlated with the variance in demands for publicly provided goods. This is also one of the key results of the model presented in Section 2. The second branch of the literature concentrates on the role of organization costs (Breton and Scott, 1978). A decentralized system can reduce mobility and signaling costs, but it is likely to increase administrative and coordination costs. The optimal level of decentralization is the one that minimizes the sum of these costs. The third branch of the literature emphasizes the benefits of competition among jurisdictions. Tiebout (1956) studies how, in a system with many jurisdictions, the agents can ‘vote with their feet’ and locate in the jurisdiction that has policies that are closer to their preferences. While Tiebout concentrates on horizontal competition, Breton (1996) studies the benefits of vertical competition. According to this notion, different levels of government, in an effort to increase their ‘market share,’ provide the citizens with the optimal type and quantity of public goods. Brennan and Buchanan (1980) claim that horizontal and vertical competition among different levels of government can be very important in containing the size of their budgets. In addition to the fiscal federalism literature, the scope of this paper extends to U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 99 several issues examined in the recent strand of political economy literature that studies the optimal number and size of nations (Alesina and Spolaore, 1997) and the optimal amount of public goods in countries with heterogeneous preferences (Alesina et al., 1996a,b). The model presented in this paper constitutes a step in the direction indicated by Alesina and Spolaore (1997) who suggest that it would be interesting to extend their analysis to a framework with multiple levels of government. The presence of two levels of government is the key difference between the model presented in this paper and the one studied by Alesina and Spolaore (1997); Alesina et al. (1996a). The second part of the paper builds a new data set of measures of fiscal centralization and uses it to test the predictions of the model. The theory is tested using both standard techniques (OLS and Tobit) and a semi-parametric estimator that does not require normally distributed residuals. To the best of my knowledge, Oates (1972); Wallis and Oates (1988) are the only two attempts to use a large set of cross-section data to study the existence of empirical regularities explaining differences in centralization across countries. Oates (1972) finds that only size and income per capita are significantly correlated with fiscal centralization. Wallis and Oates (1988) study the fiscal structure of the American states. They build a panel of state-level measures of fiscal centralization and find a correlation between a few socio-economic variables (the percentage of urban population, the percentage of whites, and the percentage of farmers) and fiscal centralization. When Wallis and Oates test the model using a simple cross section of American states they find that only size is robustly associated with fiscal centralization. In the light of the existing literature, the results of this paper seem particularly interesting. In fact, the empirical analysis identifies a correlation between fiscal centralization and two variables measuring the degree of democracy and the degree of ethnic fractionalization. These results are consistent with the model of Section 2 and contradict the idea that it is impossible to find a unique set of variables to explain cross-country differences in fiscal centralization (Oates, 1972). 2. A model of fiscal centralization The model presented in this section extends the framework developed by Alesina and Spolaore (1997); Alesina et al. (1996a), to an economy with two levels of government. The model studies a linear country with area S, population N, and divided into J jurisdictions. S, N and J are assumed to be exogenous.1 To simplify the analysis, J is assumed to be an odd number. Since it is not possible to capture in a single model the richness of the vast literature on fiscal centralization, the focus of the model presented in this section is 1 Alesina and Spolaore (1997) present a model where the size and number of countries is endogenous but the size of the public good is exogenous. 100 U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 simplification and unification. Having a unified framework will be useful to organize the ideas and guide the empirical analysis presented in Section 4. The government produces one public good: G (for government). The general (local plus central) government budget constraint is given by T 5 G, where T stands for tax receipts. This can be written in per capita levels as: t 5 g. Per capita consumption of the private good is defined by c. The prices of all goods are normalized to 1. All individuals have the same income y, on which they pay a lump sum tax t, and similar preferences, but they differ in their tastes for the type of public good.2 Education is an example of publicly provided good on which preferences are often polarized: some citizens may prefer religious as opposed to secular schools or may favor the use of a specific language. It is assumed that the individuals are uniformly distributed over the territory and that the individuals are stratified and sorted according to their preferences for the public good. The latter implies that there is a one-to-one correspondence between the physical and the ‘ideological’ distance from the center of the country. The theoretical basis for this assumption is provided by the literature pioneered by Tiebout (1956). According to this literature, individuals with similar preferences tend to locate in the same community and stratification is an equilibrium condition. The model of this paper assumes a situation where Tiebout-style stratification has already taken place and, since individuals are already sorted according to their preferences, there is no role for mobility. Although there is no general consensus supporting the Tiebout hypothesis, many empirical studies have found a strong support for neighborhood stratification in American cities (Hamilton et al., 1975; Borjas, 1995). The idea that individuals are sorted according to their preferences is also central in Alesina and Spolaore’s work. These authors claim that: ‘‘If there were no relationship between location and preferences, then there would be no presumption that a country would be geographically connected’’ (Alesina and Spolaore, 1997, p. 1030). The preferences of the ith citizen are described by a distance-sensitive utility function (Jurion, 1983): Ui 5 g i12 a (u l im 1(12u )l ij ) c ib (1) where g is the per capita amount of government expenditure, l im represents individual i’s distance from the center of her country, l ij the distance from the center of the jurisdiction, and u the level of centralization (i.e. the share of the public good that is provided by the central government). The difference in tastes across individuals is captured by a [[0,1]: a 50 indicates a country with a very homogeneous population, while a 51 characterizes a country with a population 2 The assumption of homogeneous income allows us to abstract from all the issues linked to income redistribution. Wildasin (1991) discusses in great detail the cost and benefits of local versus central redistribution. U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 101 with diversified tastes. Hence, a (u l im 1(12u )l ij ) is the distance between individual i’s preferred type of government and the actual type of government provided in equilibrium. Assuming that S [(0,1) guarantees that the exponents in Eq. (1) are non-negative for all individuals. It is easy to show that the above utility function generates the prediction that the closer an individual’s preferences are to the actual government the higher is the ratio between public and private goods that the individual will demand. This intuitive result justifies the use of a utility function like the one in Eq. (1). Not all the utility functions would generate this prediction. A simpler Cobb-Douglas utility function of the kind: Ui 5[gi (12 a (u l im 1(12u )l ij ))] g c i b would generate the non-realistic prediction that all individuals demand the same mix of public and private goods, no matter what their preferences for the type of public good are. All the results obtained in this paper can be reproduced using the additive utility function studied by Alesina et al. (1996a). By maximizing the utility function of Eq. (1) under the constraint y5c1g, it is possible to derive the following demand functions for the public and private goods: di y gi 5 ]] di 1 b (2) by c i 5 ]] di 1 b (3) with di 512 a (u l im 1(12u )l ij ). Note that ≠gi / ≠di .0, ≠c i / ≠di ,0. As already pointed out, each citizen will have different preferences for the quantity of g and c. The actual level of g provided in equilibrium will then be determined by voting. Note that the idea behind Eq. (1) is that utility is decreasing in the distance from the type of public good provided in equilibrium (i.e. that Udi . 0). Since Udi 5 U idi ln( g), this condition will be satisfied only if g.1. It is then necessary to assume that g.1. Since di .0, this assumption only requires an appropriate definition of the units in which y is measured. To derive the equilibrium level of centralization I assume that the central government is the first mover and decides the level of centralization. This assumption may seem at odds with democratic voting over the type and amount of public good. Its theoretical background relates to the large political science literature that shows how the agenda setter can, by controlling the order of voting, manipulate the final outcome of an election.3 The agenda setter has always some power and, in the model presented in this paper, the level of democracy measures how much of this power the government is willing to use to achieve its own goals. 3 One of the most important results in this literature is McKelvey (1976) Chaos Theorem. The latter shows that multidimensional voting is almost always characterized by a situation with no Condorcet winner. 102 U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 After observing u, the citizens vote on the amount of the public good, and then on the type of the public good. The assumption of sequential decision making (similar to the one used by Alesina et al., 1996a) reflects the budget process adopted in many countries (Alesina and Perotti, 1996). In a democratic system the type of public good provided is the one preferred by the median voter. Two types of median voters will be considered: the ‘national’ median voter, indicated by med, and the median voter of jurisdiction j, indicated by mej. Given the assumption on the spatial distribution of individuals, the median voters will be located at the center of the country and at the center of their jurisdictions, respectively.4 See Fig. 1. At this point it is necessary to investigate who will assume the role of ‘central government.’ On principle, anybody who promises to supply the type of public goods preferred by the median voter could play the role of central government, but only one individual can credibly commit to provide such type of public good: the median voter herself. Any other individual will, once elected, have an incentive to deviate and implement a policy closer to her own preferences. It is therefore natural to assume that the central government will have preferences for the public good identical to those of the national median voter. Besides sharing the preferences of the national median voter, the central government derives additional utility from staying in power. Following Brennan and Buchanan (1980) view of the central government as a budget-maximizing Leviathan I will assume that the utility that the government obtains from staying in power is a function of the budget it controls. The central government chooses u in order to maximize the following utility function: Vgov 5 f U med 1 (1 2 f )u g (4) Where f [(0,1) indicates the level of democracy: f 50 indicates a dictatorship, while f 51 indicates a perfectly democratic country. The utility function of Eq. (4) has a straightforward interpretation: dictators only care about their budget while democratic governments identify their preferences with the preferences of Fig. 1. A country with five jurisdictions. 4 The assumption that j is an odd number guarantees that the national median voter is also the median voter of the central jurisdiction. U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 103 the median voter. Given the discretional power of the agenda setter, the central government will always be able to extract some rent. In the utility function of Eq. (4) the level of democracy measures how much of this rent the central government is willing to extract.5 Notice that the probability of being re-elected does not appear in the government’s utility function. This is because, given the structure of preferences, the median voter is the only individual for which the Condorcet winning policy is incentive compatible. Therefore she will be re-elected with probability 1. For the central government the distances from both the center of the jurisdiction and the center of the country are 0. Eq. (2) can then be rewritten as: V gov 5 f gc b 1 (12 f )u g. The government maximizes Eq. (4) by solving the model backward. The last decision (and therefore the first to analyze) is on the type of the public good. The type of public good chosen in equilibrium is the one preferred by the median voter (and therefore the central government). The next step is to determine the amount of the public good to be provided in equilibrium. By applying the median voter theorem and using the demand function of Eq. (2) it is possible to derive the following: Proposition 1. The amount of public good provided in equilibrium is given by: my g* 5 ]] m 1b F (5) G S S with m 5 1 2 a u ] 1 (1 2 u ) ] . 4 4J Proof. See Appendix A. In a model with only one level of government, Alesina et al. (1996a) show that the optimal quantity of public good depends on the ‘median distance from the median.’ In the framework presented here, it is possible to interpret (u (S / 4)1 (12u )S / 4J) as a weighted average of the median distance from the national median and the jurisdiction median and (12 m ) as the ‘ideological’ distance from the median. The latter will be close to 0 if a country is very small or if its preferences are homogeneous, and close to 1 for large countries with heterogeneous preferences. By substituting Eq. (5) into the central government’s utility function Eq. (4), it is easy to show that the central government will choose the value of u that maximizes the indirect utility function: 5 This could also be interpreted as: ‘Given the institutional structure, how much rent is the central government capable of extracting.’ This would involve modeling the institutional structure and goes beyond the scope of this paper. 104 U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 my Vgov 5 f ]] m 1b S by DS]] m 1 bD b my 1 (1 2 f )u ]] m 1b (6) From the maximization of Eq. (6) it is possible to derive the following: Proposition 2. The amount of fiscal centralization u is decreasing in: ( i) the level of taste differentiation a; ( ii) democracy f; ( iii) income per capita y; and ( iv) country size S. Proof. See Appendix B. The proof of Proposition 2 is simple but long and tedious. Its interpretation however is straightforward. Since (i) the central government obtains direct utility from the size of its budget, and (ii) dgov 51. m, the amount of public good demanded by the government is greater than g*. In other words, voters force the central government to decentralize by choosing an amount of publicly provided good lower than the amount preferred by the central government. To increase the amount of public good produced in equilibrium the central government is then required to reduce the level of centralization. Eq. (14) in Appendix B shows that the central government chooses the value of u that sets the marginal cost of centralization (due to the fact that gu ,0) equal to the marginal benefit of centralization (due to the fact that, for a given level of g, an increase in u increases the government budget). Since the ideological distance from the center (12 m ) depends on the size of the country and on the level of heterogeneity of preferences, it is easy to show that ga ,0 and gS ,0. Therefore, an increase in the level of tastes differentiation or in country size will decrease the amount of public good and the marginal benefits from centralization. A decrease in g by increasing Ug gov /Uc gov will also increases the costs of centralization. Both effects will induce the central government to reduce the level of u. One of the results of Proposition 2 is in line with Alesina and Spolaore (1997) finding that democratization should be positively correlated with the equilibrium number of countries and it proves the claim that their analysis can be applied to the division of a country into jurisdictions. A corollary to Proposition 2 is that a perfectly democratic government will set u 50 (see the proof in Appendix B).6 Proposition 2 does not provide any result for the effect on centralization of the number of jurisdictions J. A change in the number of jurisdictions will have two opposite effects. On the one hand, an increase (decrease) in the number of jurisdictions will affect m and cause an increase (decrease) of g and therefore of the benefits (from the central government’s point of view) of centralization. On the 6 This result allows to compare the model of this paper with the normative analysis. In fact, it is possible to show that, given the preferences illustrated in Eq. (1), a perfectly democratic government will behave like a Benthamite social planner (this is proved in Appendix B). U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 105 other hand, since gu J ,0, an increase in J will increase the centralization-elasticity of the demand for g and therefore the cost of centralization. The final effect on the equilibrium level of centralization will depend on which of these factors dominates the other. 3. Taking the model to the data Proposition 2 generates four predictions. The result that ≠u / ≠a ,0 suggests that countries with polarized preferences for the type of public good should be more decentralized than countries with homogeneous preferences. In countries with homogeneous tastes the central government can induce the citizens to vote for a high level of expenditure in publicly provided good even when the latter are provided in a highly centralized fashion.7 Hence, we should find a negative correlation between the level of centralization and heterogeneity in the demand of public goods. Economic theory indicates that the key factors in determining demand are tastes and income. Since the model assumes constant income, I will concentrate on the role of taste heterogeneity. The problem in testing the possible presence of a negative correlation between centralization and taste heterogeneity is how to measure the latter. In Section 3.1 I provide some justifications for the use of ethnic fractionalization as a proxy for taste heterogeneity. The result that ≠u / ≠f is based on the idea that a democratic government will not try to exploit its agenda setter power. Hence, we should find a negative correlation between the level of democracy and the degree of centralization. This results, besides being in line with Alesina and Spolaore (1997) who find that a world of dictatorships would lead to larger countries, is also consistent with Ades and Glaeser’s (1995) finding that dictatorships tend to have very large capital cities. The central government’s utility function suggests that perfect democracies should set u 50 and very repressive dictatorships should set u 51. The majority of countries included in the data set used in this paper fall between these two extremes. Most of the real-world governments are neither perfect democracies (because they are run by self-interested politicians with some agenda-setting power) nor perfect dictatorships (even dictators need to rely on the support of the group of people who put them in power). It should also be noted that, since some public goods cannot be efficiently produced by the local governments (these are goods with large spillover; defense is an example of such a good), even perfect democracies will have levels of centralization greater than 0. 7 U gov g Remember that, in equilibrium, ] . 1 and therefore the Central Government would like to U gov c increase public good expenditure above its equilibrium level. 106 U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 The result that ≠u / ≠y,0 is based on the idea that decentralization is a normal good. Hence, we should find a negative correlation between centralization and income per capita. The existing literature does not agree on the relationship between income per capita and fiscal centralization. Some authors claim that decentralization is a luxury and therefore should be positively correlated with income per capita (Wheare, 1964). In support of this idea Oates (1972) finds that the level of fiscal centralization is inversely correlated with a country’s level of development. Other authors have observed that this relationship tends to disappear when only rich countries are examined. Wallis and Oates (1988), for instance, show that in developed countries centralization is positively correlated with income per capita. A possible explanation for this finding is that richer countries tend to have more generous redistributive policies, usually implemented by the central government. Casual evidence indicates that some rich countries are extremely decentralized (Switzerland, United States, Canada are examples) while others are highly centralized (France for instance). The regression analysis of Section 4 indicates a strong correlation between income per capita and decentralization. The result that ≠u / ≠S ,0 depends on the fact that, other things equal, the bigger the country, the smaller m (leading to a large ideological distance from the center (12 m )), and hence the quantity of g provided in equilibrium. The central gov government will then be forced to decentralize in order to reduce the U gov g /U c ratio. Hence, we should find a negative correlation between centralization and country size. While casual evidence shows that there are some countries that contradict this prediction (Switzerland is more decentralized than France for example), the regression analysis provides strong support for a negative correlation between size and centralization. The empirical analysis uses Area as the relevant measure of size. Support for using this variable can also be found in the previous fiscal federalism literature. The latter suggests that the benefits of decentralization can be offset by market failures. The most important of these market failures relates to the presence of spillovers across jurisdictions. These externalities are likely to be inversely correlated with the size of the jurisdictions. So, to the extent that larger countries have larger jurisdictions, land area (Area) will be an appropriate proxy for size.8 Note that the utility function of the central government (Eq. (4)) depends on the per capita level of expenditure in g. By changing Eq. (4) into Vgov 5 f U med 1(12 f )u G (i.e. considering total government expenditure rather than per capita expenditure), it would be possible to show that centralization is increasing in population size (≠u / ≠N .0).9 8 Switzerland is a notable exception. The small size of its jurisdictions is compensated by characteristics of the territory that limit the mobility of its citizens, and therefore the spillovers among jurisdictions. 9 Tables 13 and 14 show a positive (but not statistically significant) correlation between population and centralization. U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 107 Summarizing, the simple framework presented in this section predicts a negative correlation between fiscal centralization and: (i) taste heterogeneity; (ii) democracy; (iii) income per capita; and (iv) country size. Note that the predictions of Proposition 2 derive from the reduced form of the model of Section 2. Estimating the structural form of the model would require an equation for the behavior of the agents and one equation for the behavior of the central government. This could be done by estimating the following simultaneous model: gi 5 a1 1 a2 y i 1 a3 Fract i 1 a4 Area i 1 a5ui 1 u i (7) ui 5 b1 1 b2 Demi 1 b3 y i 1 b4 gi 1 vi (8) Eq. (7) estimates the quantity of public good voted by the economic agents. The equation is identified by the exclusion of the democracy variable (Democracy does not appear in the demand functions of Eqs. (2) and (3)). Eq. (8) estimates the level of centralization chosen by the central government. The equation is identified by the exclusion of Area and Fract, variables that do not enter in the direct utility function of the central government. The theoretical model predicts a positive sign for a2 and b4 and a negative sign for all the other coefficients. The above model is fully identified and could be estimated using two-stage least squares. Unfortunately, TSLS produce consistent but inefficient estimates and, with approximately 55 observations, this could be a serious problem.10 The empirical part of the paper will then concentrate on estimating a reduced form equation of the kind u 5f( y, Area, Fract, Dem). One last problem concerns the technique used in estimating the model. By definition, the level of fiscal centralization cannot be higher than 100%. Therefore, I am dealing with a censored dependent variable and, assuming well-behaved residuals, the appropriate estimation technique is the Tobit model. It should be pointed out that Eq. (4) shows that the central government will set u 5100 only if f 50. Hence, only very repressive dictatorships should be fully centralized. The data seem to support this implication. If we exclude very small countries where the scope for decentralization may be extremely limited (Malta and Singapore, for instance), we observe that full centralization is found almost only in extremely autocratic regimes like Myanmar and Zaire (Egypt is an exception). 3.1. Data What follows discusses the variables used in the empirical analysis. A description of the data sources can be found in Appendix C. 10 The parameters obtained by estimating Eqs. (7) and (8) with TSLS have all the expected signs but they are not statistically significant. The estimation results are available upon request. 108 U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 3.1.1. Fiscal centralization To test the predictions of the model studied in Section 2 it is necessary to build a data set of measures of fiscal centralization. Identifying such measures is not an easy task. The main issue is finding a method to quantify the activity of local governments that results from independent decision making. Oates (1972) discusses the conceptual problems involved in the choice of the right measure of fiscal centralization. These problems can be summarized as follows: (i) Different levels of local governments should be weighted in different ways. For instance, large regional governments should be considered more centralized than smaller, city-wide, governments. (ii) Sometimes the local governments collect revenues or make expenditure but have no autonomy in deciding the tax amount to be collected or the type of expenditure to be made. This problem is also related to the identification of the relevant definition of jurisdiction. (iii) The role of intergovernmental grants. The available data do not allow to address the problems listed above. They divide between the central government and local governments as a group. Information on the appropriate decision units and on the use of the intergovernmental grants is not available. It is therefore impossible to apply a weighting scheme to different levels of local governments or identifying the number of relevant jurisdictions. Following Pryor (1968), I define centralization ratios as the percentage of revenues (or expenditure) of the central government out of the total revenues (or expenditure) of the public sector. Two measures of fiscal centralization (Total Revenues and Total Expenditure) for 1975, 1980 and 1985 are built using data from the IMF (1975, 1980 and 1985) Government Finance Statistics Yearbook.11 The centralization ratios are illustrated in Table 8 of Appendix C. While the data for 1980 and 1985 are very similar (the correlation between the 1980 and 1985 values is 0.95 for total revenues and 0.98 for expenditure) the centralization ratios for 1975 are less correlated with the 1980 and 1985 values (the correlation is between 0.6 and 0.75). This may be an indication of either a change in the level of decentralization or a change in the methods of data collection. Oates (1972) tested for the determinants of fiscal centralization using the data reported in the World Tables (The World Bank, 1969). These data, following the United Nations accounting conventions, include social security programs in total public sector revenues and expenditures, but do not include social security programs in the revenues or expenditure of the central government. Given the fact that in some countries social security programs represent a big share of the government budget (sometimes more than 30%), not including social security 11 Until 1988 the IMF provided information on central governments expenditure and revenues and on general governments (defined as central plus local governments) expenditures and revenues. The centralization ratios are computed by dividing the former by the latter. A detailed description of the method used to build the data is provided in Appendix C. U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 109 programs makes a big difference. Therefore, the centralization ratios reported by the World Tables are much lower than the centralization ratios used in this paper. Since the central government plays a very important role in deciding the size and the scope of social security programs, the measure of fiscal centralization used in this paper should be better suited for a study of the determinants of fiscal centralization than the centralization ratios of the World Tables.12 For most measures Yugoslavia is the most decentralized country. Among the industrialized countries Switzerland, Canada, and the United States are the most decentralized. The values for Yugoslavia are suspiciously low. The shares of the federal government in total revenues and expenditures are always below 30%. Although Yugoslavia was a well known case of extreme federalism the above values may be the outcome of an accounting procedure that did not reflect the actual division of powers between the central and local governments. 3.1.2. Heterogeneity in the preferences for public goods Since tastes are not directly observable, it is necessary to find a proxy for this variable. It is not unlikely that different ethnic groups may diverge in their tastes for publicly provided goods (education is an important example). Therefore, differences in tastes may be proxied by a measure of ethnic fractionalization (Fract). The possibility of a relationship between ethnic fractionalization and tastes was first explored by Oates (1972) who used a dummy variable to differentiate homogeneous from heterogeneous countries.13 One problem with Oates’ analysis is that a dummy variable dividing the world between homogeneous and heterogeneous countries is a very coarse measure of tastes differentiation. A continuous measure of heterogeneity would be more instructive. To address this problem, I measure ethnic fractionalization using the data collected by the Department of Geodesy and Cartography of the State Geological Committee of the Soviet Union, originally published in the Atlas Narodov Mira (1964) and then reported by Taylor and Hudson (1972). This variable ranges from 0 (Korea) to 0.9 (Zaire) and measures the probability that two randomly selected individuals will belong to different ethno-linguistic groups.14 The same measure has been used in cross-country studies of growth by Canning and Fay (1993); Mauro (1995); Castilla (1996); Easterly and Levine (1997). Easterly and Levine study the relationship between this measure of ethnic diversity and similar indices proposed by various authors and find that ethnic fractionalization is a robust 12 Ideally, one should use data for the activities that are decentralizable and therefore exclude defense and social security. 13 He used three measures of differentiation: linguistic, racial, and religious. These dummies assumed a value of 1 for countries that Oates deemed to be homogeneous and 0 for countries deemed to be heterogeneous. 14 The value for Korea was not included in the original data set of the Atlas Narodov Mira but reported (using a different source) by Taylor and Hudson (1972). 110 U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 predictor of potential ethnic conflicts. Alesina et al. (1996a) use ethnic fractionalization to capture conflicts among groups in the decision of the quantity and type of public goods. To support this idea, they quote a vast sociological literature that finds that preferences and conflicts over public policies are more strongly correlated with ethnic as opposed to income differences. Most African countries are highly ethnically fractionalized and nine out of the ten most fractionalized countries are in Africa (the tenth one is India). Zaire, Cameroon, South Africa, Nigeria, Central African Republic, Kenya and Zambia have measures of ethnic fractionalization greater than 0.8. Among the industrialized countries, Canada has the highest degree of ethnic fractionalization (0.75), followed by Belgium (0.55), Switzerland, and the USA (0.5). Yugoslavia is the European country with the highest degree of fractionalization (0.75). Of course, ethnic fractionalization is appropriate to test the model of Section 2 only if one assumes that the different ethnic groups are spatially separated. Support for this assumption comes from the empirical literature aimed at testing Tiebout (1956) model. It is also often observed that different ethnic groups are located in different regions of a country and, in some cases, these regions have large autonomy from the central government.15 3.1.3. Democracy To test for the fourth hypothesis I use the data on political rights assembled by Gastil (1990).16 Gastil’s classification assigns the value 1 to perfect democracies and 7 to countries where most citizens have no political rights. Following Barro (1996), I transform Gastil’s 1–7 ranking into a 0–1 ranking, where 0 corresponds to Gastil’s 7 and 1 corresponds to Gastil’s 1. An alternative source of measures of democracy is provided by the Polity II data set (Gurr et al., 1989). The results obtained using the Polity II indices of democracy are essentially identical to the results obtained using Gastil’s definition. 4. Estimations of the determinants of fiscal centralization This section discusses the methods used to test for the empirical predictions described in Section 3.1. The results of the various tests are presented using the centralization ratios for both government revenues and expenditures. As a first 15 A possible interpretation is that the central government ‘bribes’ ethnically diverse regions by giving them more autonomy. For instance, in a highly centralized country like Italy there are five special regions that enjoy large fiscal autonomy and transfers from the central government. Two of these regions are islands (Sardinia and Sicily). The other three are border regions characterized by large ethnic minorities: French in Valle d’Aosta, German in Trentino Alto Adige, and Slavic in Friuli Venezia Giulia. Similar examples can be found in Canada, Belgium, India, Spain, Switzerland and the UK. 16 Gastil uses the following definition: ‘‘Political rights are rights to participate meaningfully in the political process. In a democracy this means the right of all adults to vote and compete for public office.’’ U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 111 step, the model is estimated using ordinary least squares. Then, more appropriate Tobit and semi-parametric estimation methods are used. Previous work found evidence for the fact that size and income per capita have some weight in explaining fiscal centralization, but most attempts of finding any correlation between fiscal centralization and socio-political variables have not been successful. I start the analysis by estimating two regressions where measures of ethnic fractionalization and democracy are added, one at a time, to a basic specification that includes income per capita and area (these variables are always included given the existing evidence of their correlation with centralization). Formally, I estimate linear forms of the following two equations: CENTR5f(Size, y, Fract), and CENTR5f(Size, y, Dem). The rationale for the above specifications is to compare the effects on centralization of ethnic fractionalization and democracy. Next, I test the following equation: CENTR i 5 a0 1 a1 Size i 1 a2 y i 1 a3 Fract i 1 a4 Dem i 1 u i (9) Instead of levels, the estimations use the logs of Area and GDP per capita. The log specification improves the fit of the regression and indicates the presence of a non-linear relationship between these two variables and fiscal centralization. Oates (1972) uses ordinary least squares to estimate an equation similar to CENTR5f(Area, y, Fract). Since the centralization ratios cannot be higher than 100, the independent variables are censored from above and OLS is not the appropriate technique to estimate Eq. (9). It is well known that, with censored data, least squares estimates are biased. Greene (1993) finds that the bias is proportional to the percentage of limit observations. In the data set used in this paper the non-limit observations range between 84 and 90% of the total. Therefore the OLS estimates should have a bias that ranges between 10 and 20%. Despite its fundamental flaw, I run OLS regressions to compare my results with Oates’ work. The results, reported in Appendix D, confirm Oates’ finding that size and income per capita are negatively correlated with fiscal centralization. I also find that, as predicted by the model of Section 2, democracy and ethnic fractionalization are negatively correlated with fiscal centralization. These preliminary results, although not very strong (because the coefficients are not always statistically significant), are encouraging. Oates (1972) found that only size and income had a role in explaining centralization and that the proxies for demand differentiation always had the wrong signs.17 4.1. Tobit estimations This section uses the Tobit model to test the predictions of Section 3.1. The Tobit model is the standard technique used to estimate equations with censored 17 In his regressions ethnic fractionalization was never statistically significant and always positively correlated with centralization. 112 U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 dependent variables. The assumption of normally distributed residual is crucial for the consistency of the Tobit estimates. Table 1 presents the results for the Tobit estimates when the dependent variable is total revenues. For all regressions, Area and y are negatively correlated with Table 1 Tobit estimates: Revenues centralization ratios 1975 Area y Fract. 22.761 (23.913)*** 27.925 (25.206)*** 28.779 (21.519) Dem. Const. No. obs. x2 Area y Fract. 189.271 (12.097)*** 60 34.52 1980 22.623 (23.039)*** 29.75 (24.681)*** 215.347 (21.968)* Dem. Const. No. obs. x2 Area y Fract. 206.03 (9.508)*** 56 25.55 1985 23.202 (23.865)*** 210.961 (25.638)*** 218.438 (22.724)*** Dem. Const. No. obs. x2 225.481 (10.69)*** 56 37 23.052 (24.892)*** 24.528 (22.238)** 210.592 (22.375)** 166.674 (10.384)*** 59 37.92 23.049 (23.700)*** 27.861 (23.279)*** 1.258 0.199 189.264 (8.463)*** 58 21.35 23.85 (25.129)*** 27.736 (23.098)*** 26.876 (21.008) 204.559 (9.17)*** 60 39.2 23.045 (24.333)*** 24.621 (22.133)** 26.405 (21.081) 212.545 (22.299)** 172.579 (10.000)*** 57 37.57 22.639 (23.048)*** 29.306 (23.638)*** 215.522 (21.983)* 21.919 (20.299) 203.75 (8.873)*** 56 25.64 23.253 (23.949)*** 28.937 (23.488)*** 218.435 (22.732)*** 28.247 (21.192) 214.361 (9.417)*** 56 38.43 t-statistics in parentheses. Indicated parameters are statistically significant at: * 5%; ** 2.5%; *** 1%. U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 113 revenues centralization and have high and statistically significant coefficients. The coefficients attached to ethnic fractionalization are, as predicted by the theory, always negative but they are not statistically significant in the 1975 regression. However, they are marginally significant in the regression for 1980 and highly statistically significant in 1985. Conversely, democracy has an important role in explaining fiscal centralization in 1975 but loses its explanatory power in 1980 and 1985. Table 2 presents the results for the Tobit estimates when the dependent variable is total expenditure. Now, Fract has high t-statistics for both 1980 and 1985. As for revenues, democracy plays a key role in explaining centralization in 1975. Qualitatively, the results of the Tobit estimations are not very different from the results of the least squares estimates. As expected, the absolute values of the coefficients are higher and, as predicted by Greene (1993), the differences range from 10 to 25%. To summarize, the Tobit estimations support the idea that size, income, and ethnic fractionalization are negatively correlated with the degree of fiscal centralization. The coefficients attached to democracy are always negative (supporting the prediction that democratic countries tend to have more decentralized fiscal systems), but statistically significant only for 1975. Tables 13 and 14 in Appendix E show that these results are robust to the inclusion of population or total GDP as a measure of size. When more than one measure of size is included in the regression, only Area shows a robust correlation with fiscal centralization. The specifications used in Tables 1 and 2 are very parsimonious. To test the robustness of the results, I augmented the regressions with some variables that are likely to be correlated with fiscal centralization. One key issue is that most of these variables are likely to be endogenous with respect to the measure of fiscal centralization. A variable that might play an important role in determining the degree of fiscal centralization is defense expenditure. Augmenting Eq. (9) with defense expenditure does not modify the basic results of Tables 1 and 2. The coefficients attached to defense expenditure have the expected (positive) sign but they do not enter in the regression in a statistically significant way. Four regional dummies (Africa, Latin America, Asia and OECD countries) were included in the regression to test for possible regional effects. The introduction of these dummies does not change the results of Tables 1 and 2. Since different regions may have different slopes, Eq. (9) was also augmented with four interaction dummies. When sub-Saharan Africa is used, some of the coefficients attached to the interaction dummies become marginally significant, but the basic results are similar to the ones of the regressions that do not include the interaction dummies (see Table 10 in Appendix E). The coefficients attached to ethnic fractionalization increase and become statistically significant in all the regressions (including 1975) when interaction dummies for Asia are included in the model (Table 11). Asian countries seem to be characterized by a positive relationship 114 U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 Table 2 Tobit estimates: Expenditure centralization ratios 1975 Area y Fract. 22.422 (23.572)*** 27.208 (24.593)*** 22.129 (20.365) Dem. Const. No. obs. x2 Area y Fract. 176.650 (11.179)*** 58 26.91 1980 22.761 (23.215)*** 29.474 (24.940)*** 218.351 (22.561)*** Dem. Const. No. obs. x2 Area y Fract. 208.162 (10.143)*** 56 29.41 1985 23.076 (23.658)*** 210.446 (24.881)*** 215.052 (22.127)** Dem. Const. No. obs. x2 218.608 (9.681)*** 54 29.93 22.700 (24.492)*** 24.241 (22.108)** 212.117 (22.416)*** 159.098 (10.099)*** 57 32.04 23.457 (24.147)*** 26.873 (22.937)*** 21.68 (20.268) 189.795 (8.61)*** 58 23.81 23.648 (24.812)*** 29.086 (23.403)*** 21.06 (20.158) 210.177 (8.872)*** 59 34.29 22.917 (24.206)*** 23.337 (21.520) 20.197 (20.328) 213.934 (22.448)*** 158.259 (9.214)*** 55 30.39 22.789 (23.241)*** 28.687 (23.582)*** 218.536 (22.580)*** 23.328 (20.529) 204.015 (9.308)*** 56 29.69 23.088 (23.667)*** 29.929 (23.504)*** 214.953 (22.108)** 21.928 (20.279) 215.558 (8.6)*** 54 30.01 t-statistics in parentheses. Indicated parameters are statistically significant at: * 5%; ** 2.5%; *** 1%. between fractionalization and centralization and by a negative relationship between democracy and centralization. In interpreting these results one must be careful because all the coefficients have very large standard errors and are unstable over time. U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 115 5. Outliers and robustness This section identifies some important outliers and study their role in determining the results discussed in Section 4. The next section will use a semi-parametric estimator and bootstrapping techniques to investigate in a more systematic way the role of outliers and the consequences of the violation of the hypothesis of normally distributed residual. There are two important outliers in the data: Yugoslavia and Zaire. Both countries have high levels of ethnic fragmentation and low levels of democracy, but Yugoslavia has the most decentralized fiscal structure and Zaire one of the most centralized. To analyze the role of these two countries, I dropped Zaire and Yugoslavia from my data set. Table 3 illustrates the results of this experiment when revenues centralization for 1985 is used as a dependent variable to estimate Eq. (9). Column 1 shows that, when only Yugoslavia is excluded from the regression, the coefficient attached to ethnic fractionalization loses most of its explanatory power and the coefficients and the t-statistics attached to Dem increase noticeably. It is interesting to note that the results are similar to the 1975 estimates (that do not include Yugoslavia). Column 2 shows that the opposite happens when only Zaire is excluded from the regression. The most interesting case is the one illustrated in Column 3 where both Yugoslavia and Zaire are dropped from the regression. In this case, both the coefficients attached to Fract and Dem are statistically significant, but the effect of democracy dominates the effect of ethnic fractionalization. The coefficients of Column 3 are very similar to the coefficients of the semi-parametric estimation that will be presented in the next section. Table 3 Tobit estimations excluding Yugoslavia and Zaire (dependent variable: 1985 revenues centralization ratio) Area y Fract. Dem. Const. No. obs. x2 1 (Yugoslavia excluded) 2 (Zaire excluded) 3 (Yugoslavia and Zaire excluded) 23.35 (25.54)*** 25.63 (22.88)*** 29.65 (21.89)* 215.86 (22.97)*** 190.40 (11.25)*** 55 53.38 23.28 (24.01)*** 28.32 (23.23)*** 219.54 (22.90)*** 28.34 (21.21) 209.66 (9.21)*** 55 37.88 23.37 (25.49)*** 25.23 (22.66)*** 210.49 (22.06)** 215.85 (22.99)*** 187.26 (11.06)*** 54 52.48 t-statistics in parentheses. Indicated parameters are statistically significant at: * 5%; ** 2.5%; *** 1%. 116 U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 Economists and statisticians do not agree on the way one should treat outliers. Some think that outliers distort the estimations and they should be excluded from the model. Others claim that outliers provide interesting information. Paradoxically, Yugoslavia and Zaire seem to be good real-world examples of the situation described in the model of Section 2. In both countries, we observe two opposing forces: low levels of democracy that tend to make the country more centralized and highly ethnically fractionalized populations that push for more decentralization. The outcome probably depends on the cost of repression (a very repressive central government may face international sanctions). It is reasonable to think that the latter will depend on the ‘visibility’ of a country. So, high levels of repression may have been very costly for a country like Yugoslavia (located at the center of Europe), while the costs of repression may have been lower in Zaire. The recent history indicates that both democracy and ethnic fractionalization played an important role in Yugoslavia. This country was kept unified and fairly stable by a powerful and charismatic leader: Marshall Josip Tito. Tito’s death was soon followed by the break-up of Yugoslavia. As pointed out in Section 3.1, the values for fiscal centralization in Yugoslavia are suspiciously low and might not reflect the true division of powers between central and local governments. In this case, Yugoslavia would not be a true outlier, but only an observation not accurately measured. Since Yugoslavia has such an important weight in determining the role of ethnic fractionalization, it is interesting to do a sensitivity analysis and explore the effects of different values of its centralization ratio. To this purpose, I re-estimated the equation for revenue centralization in 1985 using different values for Yugoslavia. When revenues centralization for Yugoslavia is between 0.5 and 0.85,18 both ethnic fractionalization and democracy are statistically significant. While Area, GDP per capita, and ethnic fractionalization are very likely to be exogenous with respect to fiscal centralization, reverse causation from centralization to democracy may be a problem. The promotion and preservation of democracy was one of the main reasons why the American founding fathers chose a highly decentralized system. De Tocqueville in La Democratie en Amerique praises the highly decentralized American system and claims that such a system is the best protection for democracy. Although the events of the recent years (Spain, Czechoslovakia, Poland and the Soviet Union) seem to indicate a clear causation from democracy to decentralization, it is interesting to explore the reverse causation issue by using an instrument for the democracy variable. The problem is, as usual, the identification of a good instrument. An attempt was made by using the lagged value of democracy. This variable is highly correlated with democracy and therefore has all the statistical characteristics to be a good instrument.19 The 18 These seem to be reasonable values given the fact that only two countries have centralization ratios lower than 0.5 (Yugoslavia and Canada). 19 However, even if the use of this variable rules out direct reverse causation, there may still exist a systematic relationship between democracy and centralization. U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 117 instrumental variables estimates (available upon request) are almost identical to the results of the Tobit estimations with no instruments. 5.1. Semi-parametric estimations Similarly to OLS, the Tobit model yields consistent parameter estimates if the residuals are homoskedastic and normally distributed. While ordinary least squares are very robust, and a violation of these hypotheses leads to inefficient but still consistent estimates, this is not the case for the Tobit model. Maximum likelihood methods can only be used to estimate censored regression models if the distribution of the residuals is known. Greene (1993) discusses the consequences of a mis-specification of the distribution of the residuals and shows that the violation of the normality and homoskedasticity hypotheses leads to inconsistent estimates of the parameters. In recent years many semi-parametric estimators that do not require any assumption on the distribution of the residuals have been proposed. The symmetrically trimmed least squares estimator (STLS) and the censored least absolute deviation estimator (CLAD) proposed by Powell (1984), (1986) do not require i.i.d. errors but they do not perform well in small samples. The pairwise difference estimator (PD) proposed by Honore´ and Powell (1994) requires i.i.d. residuals but performs relatively well in small samples. Monte Carlo simulations have shown that, with a sample size of 50 observations, the PD estimator is often close to the best estimator. Since the data set used in this paper contains less than 70 observations, I decided to use Honore´ and Powell’s PD to estimate Eq. (9). (Appendix F contains a description of the Honore´ and Powell pairwise difference estimator.) Table 4 presents the results obtained by using this estimator. The number in parentheses indicate the probability of a sign switch. These pseudo p-values were obtained by bootstrapping the model 1000 times. The semi-parametric estimations of the coefficients of Area and y are similar to the ones obtained in the OLS and Tobit estimations. The semi-parametric estimator instead produces very different results for the coefficients attached to ethnic fractionalization and democracy. The coefficients attached to Fract are always lower than the Tobit estimates and the coefficients attached to Dem always higher. Ethnic fractionalization is statistically significant at the 5% level when the dependent variable is revenues in 1985 and statistically significant at the 10% level when the dependent variable is revenues in 1980. In the expenditure regressions, the coefficient attached to ethnic fractionalization is statistically significant (with a p-value of 0.051) only in 1980. Democracy enters in the regression with a statistically significant coefficient for both expenditure and revenues in 1975 and for revenues in 1985. The results of the semi-parametric estimations are much closer (in the magnitude of both the parameters and their p-values) to the results of Column 3 of Table 3 than to the Tobit results for the whole sample. In the Tobit estimations for 118 U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 Table 4 Semi-parametric estimations Revenues centralization ratios 1975 Area y Fract. Dem. Const. No. obs. Area y Fract. Dem. Const. No. obs. 1980 1985 23.33 22.82 23.53 (0.000) (0.002) (0.000) 24.18 26.68 26.27 (0.026) (0.002) (0.002) 24.47 28.45 210.19 (0.275) (0.095) (0.044) 212.64 25.46 212.46 (0.025) (0.19) (0.030) 170.8 182.38 193.42 (0.000) (0.000) (0.000) 57 56 56 Expenditure centralization ratios 1975 1980 22.99 (0.002) 23.98 (0.082) 0.05 (0.492) 213.54 (0.007) 163.96 (20.000) 55 23.04 (0.002) 26.03 (0.001) 210.45 (0.051) 26.78 (0.189) 182.49 (0.000) 56 1985 23.25 (0.002) 26.92 (0.001) 25.71 (0.200) 26.05 (0.240) 190.17 (0.000) 54 Pseudo p-values in parentheses. the whole sample, Fract has a coefficient of 218.43 (and a t-statistic of 2.73), which is 80% higher (in absolute value) than the coefficient obtained with the PD estimator (210.19). At the same time, the coefficient of the Tobit estimation that excludes Yugoslavia and Zaire is almost identical to the coefficient of the PD estimation for the whole sample. This is an indication that the Honore´ and Powell method and the bootstrapping downweigh outliers and implicitly correct for some of the problems in the data. To conclude, the PD estimator reduces the explanatory power of ethnic fractionalization and increases the explanatory power of the democracy variable, but still finds a negative correlation between these two variables and the level of fiscal centralization. After exploring the existence of a statistically significant relationship between centralization and a set of socio-economic variables, it is worth asking if this relationship is also economically significant. The results of Table 4 indicate that, for the 1985 sample, a 1 standard deviation increase in ethnic fractionalization is U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 119 associated to a 2.8% decrease in revenues centralization. At the same time, a 1 standard deviation increase of democracy in 1985 is associated to a 4.3% decrease in centralization. The effects on centralization associated to changes in income per capita and area size are also very large (ranging from 0.5 to 6%). This allows to conclude that the relationship between centralization and the set of socio-economic variables used in this paper is not only statistically significant but also economically significant. 6. The role of history History is the main reason why many scholars claim that it is not possible to use economic theory to explain cross-country differences in the division of powers among different levels of government. Many political scientists have pointed out that, in each country, intergovernmental fiscal relations are the outcome of a bargaining process that is generally unpredictable (Oates, 1972). In some cases, this bargaining process generated a structure of intergovernmental fiscal relations that, although optimal at the time the process took place, may not reflect the current preferences of the citizens. In other cases, the level of fiscal centralization may have been imposed by a dictator (or monarch) or by international pressure without any consideration for the citizens’ preferences. Since the process of adjusting to the optimal fiscal structure requires time, many countries may still be far away from their optimal level of centralization. Although the model presented in Section 2 is static, it is interesting to study if some of the countries included in the sample are out of the equilibrium and slowly adjusting towards it. An ideal way to control for the role of history would be to include in the regression fiscal centralization measured at the time a given country achieved its independence, but this variable is not available.20 An alternative method is to augment the regression with the lagged value of fiscal centralization. 20 Attempts to find proxies for the distance from the optimal fiscal structure were not successful. To this purpose I explored variables like social unrest, presence of independentist movement, or number of years since a given country has started its adjustment process. All these variables present conceptual problems. The first set of variables (social unrest and presence of independentist movement) is likely to be the effect and not the cause of a sub-optimal fiscal structure. The latter variable has the advantage of being exogenous but it is not easily measurable. The problem with this variable is to identify a starting point. While this is not very difficult for countries like the UK or the US that have had a very stable political system and the same form of government for more than two centuries, other countries pose very serious problems. How does one deal with cases like France, where there have been several changes in the constitution in the last 40 years? Or Spain? Should one start counting after the death of Franco? What about Italy where there is a large consensus for a more decentralized system but nothing has changed so far? Exogenous shocks, like the end of the Cold War, seriously reshaped the political equilibrium in Europe and modified the status quo. Unfortunately, the available data stop in 1985 and it is impossible to control for these events. U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 120 To this purpose, I included fiscal centralization in 1975 among the explanatory variables for fiscal centralization in 1985.21 The results (reported in Table 5) confirm the fact that history is very important. The lagged value of fiscal centralization absorbs most of the variance of the regression (but its coefficient is significantly lower than 1) and reduces the explanatory power of the other variables. Although the explanatory power of Area and y decrease noticeably, the correlation between Fract and fiscal centralization is robust to the inclusion of the lagged dependent variable. It is not surprising that fiscal centralization is highly correlated with its lagged value. Most of the explanatory variables are either almost constant (like ethnic fractionalization and area) or highly correlated (like GDP and democracy) over time. The fact that, even when the lagged value is included, the other explanatory variables still have some power in explaining fiscal centralization suggests the idea that, while some countries have reached their equilibrium level of fiscal centralization, others are still adjusting towards it. The very recent history shows that for some countries even the overall borders were not optimal from the point of view of their citizens. It is not surprising that the borders of these countries had not been fixed by a national movement but imposed by outside forces (this is the case for the separation of West and East Germany) or were the output of a highly non-democratic process (Yugoslavia and Czechoslovakia, for instance). The borders of other countries, especially in Africa, Table 5 Tobit estimations (dependent variables: 1985 revenues and expenditure centralization ratios) Area y Fract. Dem. Centr. 1975 Const. No. obs. x2 Revenues centralization Expenditure centralization 20.82 (22.19)** 21.32 (21.20) 25.93 (22.08)** 23.50 (21.25) 0.87 (16.48)*** 37.54 (2.80)*** 53 125.37 20.89 (22.00)* 23.27 (22.19)** 27.91 (22.26)** 21.64 (20.44) 0.80 (12.19)*** 60.95 (3.66)*** 51 93.90 t-statistics in parentheses. Indicated parameters are statistically significant at: * 5%; ** 2.5%; *** 1%. 21 When the 1975 values were missing, the 1980 values were used as proxy. U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 121 were determined by their former colonial authorities. Even countries with a strong and long democratic history can be far away from the equilibrium level of centralization. The regression therefore pools together countries that may be very close to their optimal fiscal structure with countries that are far away from the optimum and are slowly adjusting to it. If we accept the idea that some countries are out of the equilibrium, the residuals of the regression should be correlated with changes in fiscal centralization over time. Countries where the actual level of centralization is higher than the predicted value (i.e., u i .0) should be moving towards a more decentralized system and vice versa. Table 6 illustrates some anecdotal evidence in support of this idea: Belgium, France, Italy, Spain, and the United Kingdom are all countries where, in 1985, u i .0; since then, all these countries have moved or are moving towards a more decentralized system. The very large negative residual for Yugoslavia seems to indicate that this country was too decentralized in 1985; this is contradicted by the events that followed Tito’s death. The situation is more complex for Canada; the strong movement for the independence of Quebec seems to indicate that the country is too centralized (and therefore contrasts with the negative residual). Another possible interpretation is that, in order to prevent secession, the central government is ‘bribing’ Quebec with ‘too much’ autonomy. This idea finds some support in the results of the referendum in which the residents of Quebec voted against secession from the rest of Canada. Although Table 6 provides some support for the idea that the residuals are negatively correlated with future decentralization, relying on anecdotal evidence to try to identify which countries are moving towards a more or less centralized system may sound suspicious. A more systematic method to investigate the problem is to look at the correlation between the residuals of the 1975 regression and the annual rate of change in fiscal centralization for the 1975–1985 period (defined as Dc5(Centr1985 /Centr1975)1 / 10 21).22 The results of this experiment Table 6 Revenues centralization (1985) (actual and predicted values for selected countries) Country Actual Predicted u Belgium France Italy Spain United Kingdom Canada Yugoslavia 92.71 89.38 92.76 85.15 86.52 47.37 27.13 78.99 74.22 82.83 76.20 76.49 53.58 81.09 13.71 15.16 10.73 8.95 10.03 26.21 254.41 22 The 1975–1980 rate of change was used when data for 1985 were not available. U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 122 Table 7 Correlation between the residuals for the 1975 regression and rate of change in fiscal centralization in the following decade Correlation p value No. obs. Revenues centralization Expenditure centralization 20.40 0.0045 48 20.51 0.0003 45 strongly support the idea that the 1975 residuals are negatively correlated with the subsequent changes in fiscal centralization. Table 7 provides evidence for the fact that countries that are out of the equilibrium are slowly moving towards it and that Eq. (9) is a good specification to analyze the empirical determinants of fiscal centralization. Some evidence that countries are adjusting toward an equilibrium can also be found in the fact that the 1985 equation fits the data much better than the 1975 equation. When an OLS regression is performed on the same set of countries R¯ 2 increases from 0.47 to 0.56. At the same time the sum of squared errors of the Tobit regression decreases from 4670 to 3842. One should be careful in interpreting the results of this section. In particular, they do not provide any additional support for the model of Section 2. The latter is set in a purely static framework, while the issues discussed in this section are dynamic. In fact, this section shows that there are changes in the level of centralization that cannot be explained by the set of regressors used in this paper, and calls for further research on the transitional dynamics of the decentralization process. 7. Conclusions In the real world, democratization has often been followed by decentralization. This happened in Spain, where the death of Francisco Franco and the return to a democratic system was soon followed by a massive process of decentralization. Other examples are Poland, Czechoslovakia, Russia and the Ukraine. The end of the Cold War also favored the rise of secessionist (or pro-decentralization) political movements in countries that, although democratic, used to have an extremely rigid political situation. In Italy, for instance, the end of the ‘Cold War equilibrium’ was soon followed by the rise of a separatist political party. The model and empirical analysis presented in this paper try to shed some light on these recent events. Many authors think that it is not possible to find a single set of variables explaining the cross-country differences in the degree of fiscal centralization. U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 123 Oates (1972) attempt seems to support this view. This paper is more optimist on the possibility of finding empirical regularities explaining decentralization. I find that country size, income per capita, ethnic fractionalization, and the level of democracy are negatively correlated with fiscal centralization. The main problem with the results described above is their sensitivity to outliers and to the use of different samples. While Area and GDP per capita are robustly correlated with centralization in almost all specifications, ethnic fractionalization and democracy have unstable coefficients, highly dependent on the sample used. This is due to the fact that outliers play a very important role in determining these coefficients. Yugoslavia is a particularly troublesome outlier. Its very low level of fiscal centralization plays an enormous weight in determining the coefficients (and t-statistics) attached to democracy and ethnic fractionalization. When Yugoslavia is dropped from the sample, democracy becomes strongly correlated with decentralization. This is also true when more realistic values are used for fiscal centralization in Yugoslavia. To conclude, this paper shows that there are some regularities that can be used to explain the cross-country variations in the degree of fiscal centralization. In particular, if one believes that outliers are important and that they should not be eliminated from the analysis, then it is possible to conclude that land area, GDP per capita, and ethnic fractionalization are associated with fiscal decentralization. If, however, one believes that outliers should be eliminated, the analysis shows a strong correlation between fiscal centralization and each of land area, GDP per capita, and Democracy. This paper indicates that there are open avenues for further research both on the empirical and theoretical sides. On the empirical side, it would be interesting to identify a homogeneous definition of jurisdiction and augment Eq. (9) with a variable measuring the number of jurisdictions. On the theoretical side there are many possible extensions. The first relates to extending the results to a case of a circular country and relaxing the assumption of perfect sorting of the citizens. It would be interesting to show whether the results of this paper are robust to a case where preferences are highly, but not perfectly, correlated with location. Introducing income heterogeneity would be another interesting extension. Section 2 shows that the equilibrium level centralization is decreasing in income. Therefore, a model where individuals are sorted according to their income should predict that the policies of the central government are closer to the preferences of the rich. This could be used to explain the fact that upper classes tend to have higher political participation and influence than poorer classes. The results of Section 6 seem to indicate that the adjustment toward the optimal fiscal structure takes time. This cannot be captured by the static model of Section 2. It would then be an interesting exercise to extend the model to a dynamic framework and try to obtain some information on why some countries reached the equilibrium faster than others or explain the overshooting that characterized other countries (for instance Poland). 124 U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 Acknowledgements This paper, based on the second chapter of my Ph.D. dissertation at the Johns Hopkins University, was written while I was at the Department of Economics of the University of Turin. I would like to thank Nada Choueiri, Ernesto Stein, two anonymous referees, seminar participants at the JHU, University of Adelaide, University of South Florida, Inter-American Development Bank, University of Cagliari, and University of Turin for helpful comments. I am also grateful to Laurence Ball and Louis Maccini for useful comments and constant support, and Marisa Apa Domino for suggesting the use of the Honore´ and Powell estimator and sharing her program with me. The usual caveats apply. The opinions expressed in this paper are my own and do not necessarily reflect the views of the Inter-American Development Bank. Appendix A Proof of Proposition 1 We want to show that g*5( m y /( m 1 b )) (with m 512 a [u (S / 4)1(12u )(S / 4J)]) is the amount of public good preferred by the median voter. This is equivalent to showing that half of the population would like to consume an amount of g larger than g* and the other half of the population would like to consume and amount of g smaller than g*. In Eq. (1) it was shown that gi 5(di y) /(di 1 b ), with di 512 a (u l im 1(12u )l ij ). Note that, since we assume a continuum of individuals, for half of the population l im ,(S / 4) and for the other half l im .(S / 4). Also, for half of the population l ij ,(S / 4J) and for the other half l ij .(S / 4J). Therefore, when u 51 or u 50, Proposition 1 is trivially true. Since (≠gi / ≠di )5( b y /(di 1 b )2 ).0, individuals with larger d (and therefore smaller l im and l ij ) prefer a larger amount of public good. It is therefore possible to divide the population into four groups as shown in Fig. 2. For a fraction F of Fig. 2. Preferences for the size of the public good. U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 125 individuals (all located in the first quadrant), l im ,(S / 4) and l ij ,(S / 4J), therefore di , m and gi ,g*. For a fraction F of individuals (all located in the fourth quadrant) l im ,(S / 4) and l ij ,(S / 4J), therefore di . m and gi .g*.23 Next, we show that each individual in the second quadrant can be mapped into an individual in the third quadrant with opposite preferences. Let us start by defining pl the individuals in the second quadrant and lp the individuals in the lp third quadrant. Define l lp m and l j the distance from the center of the country and the center of the jurisdiction of an individual i belonging to the third quadrant, lp then there is an individual z in the second quadrant with l pl m 5(S / 2)2l m and lp pl pl lp l j 5(S / 2J)2l j . At this point, we are ready to show that (d z 2 m )5 2(d i 2 m ) i.e., for each individual in the second quadrant there is an individual in the third quadrant with opposite preferences who will vote in the opposite way. F S S lp lp d lp 1 (1 2 u )] z 2 m 5 1 2 a [u l m 1 (1 2 u )l j ] 2 1 1 a u ] 4 4J S S 5 a u ] 2 l lp 1 (1 2 u ) ] 2 l jpl m 4 4J FS D S G DG (10) pl Using the fact that l lp j 5(S / 2J)2l j it is possible to rewrite Eq. (10) as: FS D S S S d lp 2 l mlp 1 (1 2 u ) l jpl 2 ] z 2m 5a u ] 4 4J DG (11) Let us now look at d pl j 2m F S S pl pl d pl 1 (1 2 u )] i 2 m 5 1 2 a [u l m 1 (1 2 u )l j ] 2 1 1 a u ] 4 4J S S 5 a u ] 2 l pl 1 (1 2 u ) ] 2 l pl m j 4 4J FS D S G DG (12) lp pl Using the fact that l pl m 5S / 22l m it is possible to rewrite d j 2 m as: FS D S S S lp d pl 1 (1 2 u ) ] 2 l pl j 2 m 5 a u lm 2 ] j 4 4J DG 5 2 (d lp z 2 m) (13) This proves the fact that for each voter in the second quadrant there is a voter in the third quadrant with opposite preferences. Therefore, given the distribution of the voters and the fact that the preferences are single peaked g*, is a unique Condorcet winner. Q.E.D. 23 It is possible to show that F 5((J21)S / 4J) when ((J21) / 2) is odd and F 5((J 11)S / 4J) when ((J21) / 2) is even. But the only thing that really matters for the proof is that the first and the fourth quadrants have the same number of individuals. U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 126 Appendix B Proof of Proposition 2 The proof of this proposition requires some simple, but long and tedious, algebra. I start by establishing some basic results: a S(J 2 1) mu 5 2 ]]] , 0; muu 5 0; 4J F G S S 2 S(J 2 1) ma 5 2 u ] 1 (1 2 u ) ] , 0; mau 5 mua 5 ]]]] , 0; 4 4J 4J F G 1 1 2 a (J 2 1) mS 5 2 a u ] 1 (1 2 u ) ] , 0; mu S 5 mSu 5 ]]]] , 0; 4 4J 4J a (1 2 u )S 2 aS mJ 5 ]]] . 0; mJu 5 mu J 5 ]] , 0; 2 4J 4J 2 b ymu 2 2b y( mu )2 ≠g ≠c ] 5 2 ] 5 ]]] ]]]] , 0; g 5 , 0; uu ≠u ≠u ( b 1 m )2 ( b 1 m )3 b yma mau ( b 1 m ) 2 2ma mu ≠g ≠c ] 5 2 ] 5 ]]] , 0; gua 5 gau 5 b y ]]]]]] , 0; ≠a ≠a ( b 1 m )2 ( b 1 m )3 b ymS mSu ( b 1 m ) 2 2mS mu ≠g ≠c ] 5 2 ] 5 ]]] , 0; gu S 5 gSu 5 b y ]]]]]] , 0; 2 ≠S ≠S ( b 1 m ) ( b 1 m )3 b ymJ mJu ( b 1 m ) 2 2mJ mu , ≠g ≠c ] 5 2 ] 5 ]]] , 0; gu J 5 gJu 5 b y ]]]]]] 50 ≠J ≠J ( b 1 m )2 . ( b 1 m )3 Using the fact that (≠g / ≠u )5 2(≠c / ≠u ), it is possible to write the government’s F.O.C. as: S D ≠Vgov 12f ]] 5 f gu U med 2 U cmed 1 ]] u 1 (1 2 f )g 5 0 g ≠u f (14) For the median voter of the central jurisdiction (and therefore the government), Ug 2Uc 5U med ((1 /g)2( b /c))5(U med ( b 1 m ) /y)((12 m ) /m ),0 (because m ,1). Therefore the first term of Eq. (14) is negative and the second is positive. By deriving implicitly Eq. (14), it is possible to study the effect of the exogenous variables on the level of fiscal centralization. U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 S f S 12f Vuu 5 f guu U med 2 U cmed 1 ]]u g f 127 D D 12f med med 1 gu gu (U med 1 (1 2 f )gu gg 1 U cc 2 2U cg ) 1 ]] f (15) med med Since guu ,0 and (U med gg 1U cc 22U cg ),0, Vuu ,0. This result, besides being necessary to compute the effect of the independent variables on the level of centralization, also shows that the government’s utility function is concave and therefore the F.O.C. of Eq. (14) is necessary and sufficient for a maximum. S D 12f med med Vua 5 f gua U med 2 U med 1 ]]u 1 f gu ( ga (U med g c gg 1 U cc 2 2U cg )) f 1 (1 2 f )ga , 0 (16) S D 12f med med Vu S 5 f gu S U med 2 U med 1 ]]u 1 f gu ( gS (U med g c gg 1 U cc 2 2U cg )) f 1 (1 2 f )gS , 0 (17) Vuf 5 gu (U med 2 U cmed ) 2 ( guu 1 g) , 0 g (18) Note that Vuf ,0 because Eq. (14) requires that ( guu 1g).0. S D 12f med med Vu J 5 f gu J U med 2 U med 1 ]]u 1 f gu ( gJ (U med g c gg 1 U cc 2 2U cg )) f , 1 (1 2 f )gJ 50 (19) . To investigate the role of a change in income let us start by noting that gu 5g( bmu /m ( m 1 b )), therefore Eq. (14) can be rewritten as: F ≠Vgov ]] 5 g (1 2 f ) ≠u S S bmu 12f U med ( b 1 m ) 1 1 f ]]] ]] u 1 ]]]] ] 2 1 f y m m( m 1 b ) DDG Since g50 is a minimum, it must be true that F S bmu U med ( b 1 m ) 1 2 m (1 2 f ) 1 f ]]] u 1 ]]]] ]] y m m( m 1 b ) It is now possible to show that: S D b Vu y 5 fmu (1 2 m ) ] my 2 gU med , 0 DG 5 0. 50 (20) U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 128 Therefore: Vuf Vu y Vua Vu S ≠u ≠u ≠u ≠u ]5 2] , 0; ] 5 2 ] , 0; ] 5 2 ] , 0, ] 5 2 ] ≠a Vuu ≠S Vuu ≠f Vuu ≠y Vuu Vu J , ≠u , 0, and ] 5 2 ] 50 ≠J Vuu . A corollary to Proposition 3 is that f 51 will lead the central government to set u 50. This is easily shown by observing that, when f 51, Eq. (14) becomes (≠Vgov / ≠u )5gu (U med 2U cmed ),0. Q.E.D. g To prove that a Benthamite social planner will set u 50 it is sufficient to show that the social planner’s utility function is strictly decreasing in u. Let us start by writing the social planner’s problem: N max U sp u N E E g c di 0 0 di b 5 Ui di 5 ≠U sp ]] 5 ln( g) ≠u N E (21) N ≠di gu Ui ] di 1 ] ≠u g 0 E U [d 2 m ]di i i (22) 0 N Now note that the ln ( g)e Ui (≠di / ≠u ) di , 0 (because g.1). Proposition 1 shows 0 that m is the median (and, given the assumption of uniform distribution of N individuals, also the mean) value of di . Therefore e [di 2 m ]di 5 0, but since in0 dividuals for which di . m have levels of utility higher than those of individuals for N which di , m, then e Ui [di 2 m ]di . 0, the second term of Eq. (22) will then be nega0 tive and (≠U sp / ≠u ),0. Q.E.D. Appendix C Data sources This appendix describes the source of the data used. U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 129 Table 8 Centralization ratios Revenues 1975 Argentina Australia Austria Barbados Belgium Bolivia Brazil Canada Cent. Afr. Rep. Chile Colombia Congo Costa Rica Cyprus Denmark Dominican Rep. Egypt Ethiopia Finland France Gambia Germany Greece Guatemala Honduras Hungary Iceland India Indonesia Iran Iraq Ireland Israel Italy Jordan Kenya Korea Kuwait Lesotho Luxembourg Malawi Malaysia Malta Mauritius Mexico Expenditure 1980 1985 1975 1980 1985 84.36 76.96 72.37 100.00 92.70 86.28 78.20 47.37 70.46 85.04 73.48 100.00 72.72 79.01 75.66 100.00 91.75 69.08 53.97 77.49 50.55 74.50 78.00 75.25 100.00 95.03 95.82 70.59 52.16 94.25 80.03 96.96 90.01 96.28 83.55 94.89 81.90 95.80 96.46 97.75 97.47 67.76 98.29 69.77 98.49 100.00 97.52 69.35 89.38 96.38 98.53 56.62 98.45 100.00 69.30 98.84 100.00 98.56 72.11 90.71 97.33 61.85 71.03 76.21 78.62 72.81 100.00 60.86 79.29 74.71 99.96 94.68 71.27 53.18 79.18 47.42 100.00 97.53 82.68 98.24 85.00 79.77 96.93 97.43 66.60 98.02 100.00 70.26 89.82 93.29 62.20 97.38 97.71 69.97 90.32 94.60 62.94 63.95 73.07 88.11 93.54 62.12 97.43 95.57 85.18 95.51 94.65 92.00 91.41 85.54 70.80 89.96 63.88 96.00 93.76 79.79 67.34 96.55 98.49 100.00 95.69 87.87 74.83 73.79 97.63 91.70 87.58 74.06 70.36 97.45 95.16 92.22 93.26 94.66 92.76 93.74 94.33 94.23 100.00 100.00 99.82 100.00 88.96 86.48 100.00 91.29 82.77 84.11 99.98 92.23 82.51 100.00 91.73 94.11 90.43 100.00 81.71 94.33 84.67 100.00 81.57 80.35 87.95 81.61 74.26 97.23 98.69 87.73 95.59 91.11 94.35 98.88 89.38 73.69 98.58 97.47 95.54 96.70 86.90 76.08 77.94 98.95 95.72 74.92 96.84 95.52 77.31 95.45 96.35 100.00 100.00 90.14 96.51 85.85 100.00 86.88 94.45 86.51 100.00 99.08 87.52 81.41 130 U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 Table 8. Continued Revenues 1975 Myanmar Netherlands New Zealand Norway Pakistan Panama Paraguay Peru Poland Portugal Romania Senegal Singapore South Africa Spain Sri Lanka Sudan Sweden Switzerland Thailand Trinidad Tunisia United Kingdom United States Uruguay Venezuela Yemen Yugoslavia Zaire Zimbabwe No. obs. Mean S.D. 100.00 97.14 Expenditure 1980 96.35 96.78 95.63 88.80 82.13 72.75 97.69 95.51 96.48 88.73 90.14 66.95 100.00 95.09 94.31 71.24 52.15 92.87 97.42 84.41 60.43 89.68 99.28 100.00 72.75 57.00 87.31 13.15 1985 1975 100.00 95.90 66.66 96.67 83.11 69.16 83.75 97.20 96.86 97.68 95.57 84.28 89.56 1980 1985 100.00 94.91 89.61 79.40 78.82 98.20 95.14 87.37 100.00 95.79 90.85 90.55 100.00 84.66 90.37 98.26 100.00 86.28 87.37 97.70 74.02 55.46 96.40 74.33 47.94 96.05 85.99 68.18 91.59 89.93 68.63 92.50 100.00 29.62 100.00 81.87 55.00 86.90 14.49 100.00 28.36 82.16 98.09 95.01 86.15 91.26 92.57 91.03 83.43 100.00 100.00 82.20 88.87 96.51 100.00 85.45 85.16 94.54 69.31 53.25 91.61 99.76 73.17 53.28 94.14 94.20 96.48 93.13 69.61 53.81 94.44 96.88 86.53 60.11 90.50 84.44 65.36 92.52 86.86 63.35 90.93 98.55 100.00 29.17 100.00 77.64 56.00 85.49 15.12 100.00 27.14 100.00 82.05 56.00 86.19 14.88 100.00 55.00 86.96 13.21 85.10 54.00 85.84 14.74 1. Data on fiscal centralization. The data on fiscal centralization were computed as the percentage of central government revenues (expenditure) on the total (central plus local) government revenues (expenditure). The data on central and total government (general government, in the IMF terminology) revenues and expenditures were obtained from the IMF (1975, 1980 and 1985) Government Finance Statistics Yearbook. When the full set of data was not available for the years considered in this study (i.e., 1975, 1980 and 1985) but it was available U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 2. 3. 4. 5. 131 for the previous or following year the available data were used. The data for 1975 were not available for Zimbabwe and Sudan (1976 was used instead of 1975). The data for 1980 were not available for the Central African Republic, Hungary, Pakistan, Venezuela and Egypt. For the first two countries 1981 data were used and for the last three countries 1979 data were used. Data for 1984 instead of 1985 were used for Bolivia, Colombia, Kenya, Malawi, Switzerland and Uruguay. Data on ethnic fractionalization. The data on ethnic fractionalization, originally published in the Atlas Narodov Mira, were obtained from Taylor and Hudson (1972). Data on democracy. The data on democracy are from Gastil (1990); Barro (1996). The data were available for 1975, 1978 and 1988 only. The 1978 data were used to measure democracy in 1980 and the average between 1978 and 1988 was used to measure democracy in 1985. Data on area and population. World Development Indicator (World Bank, 1990). Data on GDP. The data on GDP are from Summers and Heston (1991). Appendix D OLS estimations Given the fact that the dependent variable is censored at 100, OLS is not the appropriate technique to estimate the equations described in Section 4. This appendix presents the results of least squares regressions in order to compare them with Oates (1972) results. The OLS estimates are presented in Table 9. When the dependent variable is total revenue, Area and GDP per capita have negative and statistically significant coefficients.24 Fract has always a negative coefficient, not statistically significant for 1975 and statistically significant at 2.5% and 1% level for 1980 and 1985. Democracy seems to play an important role in explaining fiscal centralization in 1975, but this effect disappears in 1980 and 1985. The expenditure regressions do not fit the data as well as the revenue regressions, but the results of the two sets of regressions are similar. As for the revenue regressions, Democracy has an important weight in explaining centraliza- 24 I tried to introduce the square of GDP per capita to control for non-linearities, but its coefficient turned out not to be statistically significant. 132 U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 Table 9 Least squares estimates Revenues centralization ratios 1975 Area y Fract. Dem. Const. No. obs. 2 R̄ F Area y Fract. Dem. Const. No. obs. R̄ 2 F 1980 1985 22.51 22.177 (24.082)*** (22.800)*** 23.723 28.834 (21.868)* (23.633)*** 25.480 214.982 (20.994) (22.054)** 211.947 20.087 (22.592)*** (20.014) 157.066 191.66 (9.980)*** (8.964)*** 56 56 0.40 0.31 13.32 12.05 Expenditure centralization ratios 22.474 (23.532)*** 27.643 (23.326)*** 214.613 (22.409)*** 24.862 (20.774) 188.645 (9.488)*** 56 0.39 19.32 1975 1980 1985 22.498 (23.901)*** 23.363 (21.592) 20.850 (20.147) 211.987 (22.183)** 151.744 (9.095)*** 55 0.35 10.55 2.183 (23.016)*** 27.691 (23.549)*** 213.853 (22.195)** 21.179 (20.206) 183.642 (9.531)*** 56 0.33 14.46 22.396 (23.267)*** 28.93 (23.430)*** 2 10.993 (21.682) 20.003 (0.000) 194.645 (8.602)*** 54 0.34 15.81 t-statistics in parentheses. Parameters indicated are statistically significant at: * 5%; ** 2.5%; *** 1%. tion in 1975, but no importance in 1980 and 1985 (the coefficients attached to democracy in 1980 and 1985 are higher in the revenues regressions). Fract is statistically significant at the 2.5% level in the 1980 regression and statistically significant at the 5% level in the 1985 regression. Even if fundamentally flawed, the OLS analysis is instructive. Using the OLS estimate it is possible to compare the results of this paper with Oates (1972) and conclude that the differences are not only due to different estimation techniques, but also to the use of different data sets. U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 133 Table 10 Tobit estimates: Interaction dummies for sub-Saharan Africa Revenues Area y Fract. Dem. Af *Area Af *y Af *Frac Af *Dem Const. No. obs. x2 Expenditure 1975 1985 1975 1985 23.33 (24.78)*** 25.82 (22.62)*** 23.25 (20.51) 212.73 (22.34)** 13.12 (1.71)* 223.31 (21.92)* 242.84 (21.04) 63.08 (1.55) 186.15 (10.21)*** 57 43.95 23.31 (24.09)*** 29.58 (23.38)*** 220.57 (22.91)*** 27.27 (21.07) 223.02 (21.63) 48.29 (1.50) 121.99 (1.77)* 2494.06 (21.45) 220.55 (8.79)*** 56 42.41 23.08 (24.37)*** 23.22 (21.39) 2.05 (0.31) 213.38 (22.29)** 6.30 (1.31) 29.27 (21.10) 232.20 (20.91) 25.11 (0.75) 158.69 (8.54)*** 55 32.50 23.24 (23.89)*** 211.91 (23.87)*** 215.41 (22.09)** 20.71 (20.10) 25.35 (20.45) 22.59 (20.11) 44.61 (0.68) 195.93 (0.83) 234.22 (8.46)*** 54 33.09 t-statistics in parentheses. Indicated parameters are statistically significant at: * 5%; ** 2.5%; *** 1%. Appendix E Sensitivity analysis Tables 10–14 show the result of the sensitivity analysis. Appendix F Description of the Honore´ and Powell pairwise difference estimator Maximum likelihood estimators can be used to estimate the censored regression model U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 134 Table 11 Tobit estimates: Interaction dummies for Asia Revenues Area y Fract. Dem. As*Area As*y As*Frac As*Dem Const. No. obs. x2 Expenditure 1975 1985 1975 1985 22.80 (23.71)*** 26.67 (22.49)*** 214.02 (22.06)** 28.39 (21.26) 22.94 (21.42) 4.74 (1.42) 33.61 (2.02)* 221.51 (21.44) 185.32 (9.69)*** 57 43.07 22.40 (22.98)*** 213.77 (24.97)*** 222.34 (23.27)*** 9.42 (1.20) 21.81 (20.39) 7.99 (1.82)* 20.03 (20.001) 256.98 (22.89)*** 232.22 (10.71)*** 56 54.41 21.99 (23.03)*** 28.88 (23.55)*** 211.21 (21.78)* 28.26 (21.35) 27.72 (24.19)*** 10.38 (3.43)*** 55.10 (3.76)*** 217.03 (21.27) 193.36 (11.02)*** 55 49.71 22.20 (22.53)*** 215.29 (24.40)*** 219.76 (22.60)*** 15.90 (1.76)* 23.24 (21.44) 7.25 (2.19)** 17.14 (0.79) 234.58 (21.95)* 237.77 (9.03)*** 54 40.70 t-statistics in parentheses. Indicated parameters are statistically significant at: * 5%; ** 2.5%; *** 1%. y i 5 maxh0, x i9 b 1 ´i j (23) only if the distribution of ´i is known. In particular, if ´i |N(0, s 2 ), the Tobit model produces consistent estimates of b. Maximum likelihood estimators are in general biased if the distribution of ´i is mis-specified. In the recent years, a large number of semi-parametric estimators that do not require assumptions on the form of the distribution of ´i have been proposed. Among these estimators, the pairwise difference (PD) estimator studied by Honore´ and Powell (1994) seems to perform particularly well in small samples. This appendix briefly describes the general idea of PD. The censoring in Eq. (23) has the effect that y i 2x i9 b 5maxh2x i9 b, ´i j and y j 2x 9j b 5maxh2x j9 b, ´j j are not identically distributed even if ´i and ´j are i.i.d. Honore´ and Powell show that it is possible to obtain i.i.d. residuals by censoring y i 2x 9i b and y j 2x 9j b from below at x 9j b and x 9i b, respectively. In particular, let e ij ( b );maxh´i , 2x j9 b, 2x i9 b j and e ji ( b );maxh´j , 2x i9 b, 2x j9 b j, then e ij ( b ) and e ji ( b ) have the same conditional distribution. Furthermore, since i and j are randomly sampled, e ij ( b ) and e ji ( b ) are conditionally independent and therefore (e ij ( b )2e ji ( b )) is symmetrically distributed around 0 conditional on x i and x j . U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 135 Table 12 Tobit estimates: Sensitivity analysis using different definitions of size (1985) Revenues (1985) 23.97 (23.33)*** 1.34 (0.84) Area Pop Inc y Frac Dem Const. No. obs. x2 1.35 (0.85) 28.74 210.09 (23.42)*** (23.48)*** 219.38 219.38 (22.84)*** (22.84)*** 28.83 28.83 (21.28) (21.28) 200.29 200.29 (7.14)*** (7.14)*** 56 56 39.13 39.13 Expenditure (1985) 23.81 (23.11)*** 1.36 (0.82) Area Pop Inc y Frac Dem Const. No. obs. x2 23.97 (23.33)*** 29.73 (23.44)*** 215.95 (22.22)** 22.45 (20.35) 201.30 (6.37)*** 54 30.67 23.81 (23.11)*** 1.36 (0.82) 211.09 (23.51)*** 215.95 (22.22)** 22.45 (20.35) 201.30 (6.37)*** 54 30.67 2 3.97 (23.33)*** 1.31 (0.75) 0.00 (0.04) 28.77 (23.31)*** 219.39 (22.84)*** 28.87 (21.27) 201.06 (6.06)*** 56 39.13 23.83 (23.12)*** 1.04 (0.58) 0.00 (0.44) 210.06 (23.43)*** 216.14 (22.24)** 22.83 (20.41) 209.39 (5.90)*** 54 30.87 t-statistics in parentheses. Indicated parameters are statistically significant at: * 5%; ** 2.5%; *** 1%. Area, Pop, Inc and y are in logs, except for the last column, where Inc is in levels. This allows Honore´ and Powell to use the moment condition of Eq. (24) to define an estimator for b. E[ j (e ij ( b ) 2 e ji ( b ))ux i , x j ] (24) where j (?) is an odd function. Since it is not always easy to show that a set of U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 136 Table 13 Tobit estimates: Sensitivity analysis using different definitions of size (1975) Revenues (1975) 23.71 (23.47)*** 1.08 (0.41) Area Pop Inc y Frac Dem Const. No. obs. x2 23.71 (23.47)*** 1.08 (0.41) 24.43 25.50 (22.04)** (22.28)** 26.57 26.57 (21.11) (21.11)*** 213.38 213.38 (22.41)** (22.41)** 162.37 162.37 (7.72)*** (7.72)*** 57 57 38.26 38.26 Expenditure (1975) 22.89 (22.66)*** 20.50 (20.04) Area Pop 22.89 (22.66)*** 20.50 (20.04) 23.30 (21.34) 1.99 (0.33)** 213.89 (22.40)** 158.74 (7.40)*** 55 30.40 Inc y Frac Dem Const. No. obs. x2 23.35 (21.52) 1.99 (0.33) 213.89 (22.40)** 158.74 (7.40)*** 55 30.40 23.67 (23.47)*** 1.39 (1.02) 0.00 (0.67) 24.08 (21.84)* 26.09 (21.03) 213.00 (22.35)** 154.30 (6.41)*** 57 38.70 22.88 (22.70)*** 0.03 (0.02) 0.00 (0.17) 23.25 (21.43) 22.08 (20.34) 213.78 (22.36)** 156.61 (6.26)*** 55 30.42 t-statistics in parentheses. Indicated parameters are statistically significant at: * 5%; ** 2.5%; *** 1%. Area, Pop, Inc and y are in logs, except for the last column, where Inc is in levels. moment conditions are satisfied at a given point, Honore´ and Powell define the estimator as the minimand of the following random function: SD n Sn (b) 5 2 21 O s( y , y , d ) i i ,j where d 5(x i 2x j )9b and: j (25) U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 137 Table 14 Tobit estimates: Sensitivity analysis using different definitions of size (1975) Revenues (1975) 23.71 (23.47)*** 1.08 (0.41) Area Pop Inc y Frac Dem Const. No. obs. x2 1.08 (0.41) 24.43 25.50 (22.04)** (22.28)** 26.57 26.57 (21.11) (21.11)*** 213.38 213.38 (22.41)** (22.41)** 162.37 162.37 (7.72)*** (7.72)*** 57 57 38.26 38.26 Expenditure (1975) 22.89 (22.66)*** 20.50 (20.04) Area Pop Inc y Frac Dem Const. No. obs. x2 23.71 (23.47)*** 23.35 (21.52) 1.99 (0.33) 213.89 (22.40)** 158.74 (7.40)*** 55 30.40 22.89 (22.66)*** 20.50 (20.04) 23.30 (21.34) 1.99 (0.33)** 213.89 (22.40)** 158.74 (7.40)*** 55 30.40 23.67 (23.47)*** 1.39 (1.02) 0.00 (0.67) 24.08 (21.84)* 26.09 (21.03) 213.00 (22.35)** 154.30 (6.41)*** 57 38.70 22.88 (22.70)*** 0.03 (0.02) 0.00 (0.17) 23.25 (21.43) 22.08 (20.34) 213.78 (22.36)** 156.61 (6.26)*** 55 30.42 t-statistics in parentheses. Indicated parameters are statistically significant at: * 5%; ** 2.5%; *** 1%. Area, Pop, Inc and y are in logs, except for the last column, where Inc is in levels. J ( yi ) 2 ( y j 1 d )j ( yi ) J ( yi 2 y j 2 d ) s( y i , y j , d ) 5 J ( y j ) 2 (d 2 yi )j (2y j ) 5 if d # 2 y j if 2 y i , d , 2 y i if y i # d where J (d): R→R 1 is an even (i.e., J (2d)5 J (d)) loss function that must meet the following criteria: J (d) is convex; J (d) is strictly increasing for d .0; and 138 U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 J (0)50 is differentiable everywhere except at 0. j (d)5 J 9(d) is the right derivative of J (d) at 0. In my estimations I use J (d)5udu. A possible alternative is J (d)5d 2 .25 The following steps have been followed in the estimation reported in Table 4. 1. Since fiscal centralization cannot be higher than 100, the censored regression model to be estimated can be described as: y i 5minh100, x i9 b 1 ´i j. To transform this model into one similar to Eq. (23), it is necessary to add 100 to both sides of the equation and multiply all the variables by 21.26 n 2. Generate 2 pairs by matching all the observations. For each pair of observations the dependent variables are y i and y j and the independent variables (x i 2x j ). 3. Estimate b by minimizing Eq. (25). 4. Use bootstrap techniques to build confidence intervals. The pseudo p-values were generated by bootstrapping the data with 1000 replications. SD References Ades, A., Glaeser, E., 1995. Trade and circuses: Explaining urban giants. Quarterly Journal of Economics 110, 195–227. Alesina, A., Baqir, R., Easterly, W., 1996. Public goods and ethnic division, Mimeo, The World Bank. Alesina, A., Perotti, R., 1996. Budget deficits and budget institutions, Mimeo, International Monetary Fund. Alesina, A., Perotti, R., Spolaore, E., 1996. Together or separately? Assessing the costs and benefits of political and fiscal unions. European Economic Review 39, 751–758. Alesina, A., Spolaore, E., 1997. On the number and size of nations. Quarterly Journal of Economics 112, 1027–1056. Barro, R., 1996. Determinants of democracy, Mimeo, Harvard University. Borjas, G., 1995. Ethnicity, neighborhoods, and human capital externalities, The American Economic Review 85, 365–390. Brennan, G., Buchanan, J., 1980. The Power to Tax: Analytical Foundations of a Fiscal Constitution, Cambridge University Press, Cambridge, UK. Breton, A., 1996. Competitive Governments, Cambridge University Press, Cambridge, UK. Breton, A., Scott, A., 1978. The Economic Constitution of Federal States, University of Toronto Press, Toronto. Canning, D., Fay, M., 1993. The effect of transportation networks on economic growth, Discussion Paper, Department Of Economics, Columbia University. Castilla, M., 1996. Political instability, investment, and economic growth: An empirical investigation, Mimeo, The Johns Hopkins University. Easterly, W., Levine, R., 1997. Africa growth tragedy: Policies and ethnic divisions. Quarterly Journal of Economics 112, 1203–1250. 25 The reason for using udu instead of d 2 is that with the second function the conditions for consistency and asymptotic normality are stricter. 26 After the estimation, it is necessary to add 100 to the constant. U. Panizza / Journal of Public Economics 74 (1999) 97 – 139 139 Gastil, R.D., 1990. The comparative survey of freedom: Experience and suggestions. Studies in Comparative International Development 25, 25–50. Greene, W., 1993. Econometric Analysis, Prentice-Hall, New York. Gurr, T., Jaggers, K., Moore, W., 1989. The Polity II Codebook, University of Colorado at Boulder, Center for Comparative Politics. Hamilton, B., Mills, E., Puryear, D., 1975. The Tiebout Hypothesis and residential income segregation. In: Mills, E., Oates, W. (Eds.), Fiscal Zoning and Land Use Control, Heath-Lexington, Lexington, MA. ´ B., Powell, J., 1994. Pairwise difference estimators of censored and truncated regression Honore, models. Journal of Econometrics 64, 241–278. International Monetary Fund, 1975, 1980 and 1985. Government Finance Statistics Yearbook, Washington, DC. Jurion, B., 1983. A theory of public services with distance-sensitive utility. In: Thisse, J., Zoller, H. (Eds.), Location Analysis of Public Facilities, North Holland, Amsterdam. Mauro, P., 1995. Corruption and growth. Quarterly Journal of Economics 110, 681–712. McKelvey, R., 1976. Intransitivities in multidimensional voting models. Journal of Economic Theory 12, 472–482. Musgrave, R., 1959. The Theory of Public Finance, McGraw-Hill, New York. Oates, W., 1972. Fiscal Federalism, Harcourt Brace Jovanovich, New York. Powell, J., 1984. Least absolute deviations for the censored regression model. Journal of Econometrics 25, 303–325. Powell, J., 1986. Symmetrically trimmed least squares estimations for Tobit models. Econometrica 32, 1435–1460. Pryor, F., 1968. Public Expenditure in Communist and Capitalist Nations, George Allen and Unwin, London. Summers, R., Heston, A., 1991. Penn World Table (Mark 5): An expanded set of international comparisons. Quarterly Journal of Economics 106, 326–368. Taylor, C., Hudson M., 1972. World Handbook of Political and Social Indicators, Yale University Press, New Haven. Tiebout, C., 1956. A pure theory of local expenditures. Journal of Political Economy 64, 416–424. Wallis, J., Oates, W., 1988. Decentralization in the public sector: An empirical study of state and local government. In: Rosen H. (Ed), Fiscal Federalism: Quantitative Studies, University of Chicago Press, Chicago. Wheare, K., 1964. Federal Government, Oxford University Press, London. Wildasin, D., 1991. Income redistribution in a common labor market. The American Economic Review 81, 757–774. The World Bank, 1969. The World Tables, Washington, DC. World Bank, 1990. World Development Indicator.
© Copyright 2026 Paperzz