On the Measurement of Polarization Author(s): Joan-Maria Esteban and Debraj Ray Source: Econometrica, Vol. 62, No. 4 (Jul., 1994), pp. 819-851 Published by: The Econometric Society Stable URL: http://www.jstor.org/stable/2951734 Accessed: 11/05/2010 02:09 Your use of the JSTOR archive indicates your acceptance of JSTOR's Terms and Conditions of Use, available at http://www.jstor.org/page/info/about/policies/terms.jsp. JSTOR's Terms and Conditions of Use provides, in part, that unless you have obtained prior permission, you may not download an entire issue of a journal or multiple copies of articles, and you may use content in the JSTOR archive only for your personal, non-commercial use. Please contact the publisher regarding any further use of this work. Publisher contact information may be obtained at http://www.jstor.org/action/showPublisher?publisherCode=econosoc. Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission. JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship. For more information about JSTOR, please contact [email protected]. The Econometric Society is collaborating with JSTOR to digitize, preserve and extend access to Econometrica. http://www.jstor.org Econometrica, Vol. 62, No. 4 (July, 1994), 819-851 ON THE MEASUREMENT OF POLARIZATION BY JOAN-MARIA ESTEBAN AND DEBRAJ RAY' Suppose that a population of individuals may be grouped according to some vector of characteristics into "clusters," such that each cluster is very "similar" in terms of the attributes of its members, but different clusters have members with very "dissimilar" attributes. In that case we say that the society is polarized. Our purpose is to study polarization,and to provide a theory of its measurement.Our contention is that polarization,as conceptualizedhere, is closelyrelatedto the generationof socialtensions, to the possibilitiesof revolutionand revolt, and to the existence of social unrest in general. We take special care to distinguishour theory from the theory of inequality measurement.We derive measuresof polarizationthat are easily applicableto distributions of characteristicssuch as income and wealth. KEYWORDS:Clustering,convergence,inequality,income distribution,polarization,so- cial conflict. 1. INTRODUCTION THIS PAPER IS CONCERNED WITH THE CONCEPTUALIZATION and measurement of polarization.We shall argue that the notion is fundamentallydifferent from inequality,which has received considerableattention in the literatureon economic developmentand elsewhere.2 Let Y be a set of characteristicsor attributes.An individualwill be described by an element in this set. The prevailingstate of affairsin a society can then be captured by a distributionover Y, which describes the proportions of the populationpossessingattributesin any subset of Y. Consider a particular distribution on Y. Suppose that the population is groupedinto significantly-sized"clusters,"such that each cluster is very "similar" in terms of the attributes of its members, but different clusters have memberswith very "dissimilar"attributes.In that case we would say that the society is "polarized."Our purpose is to study these intuitivecriteriacarefully, and to providea theory of measurement. When introducinga new concept, one should start by showing that it is differentfrom other standardsummarystatisticsand by persuadingthe reader 1 Research on this project was started when Esteban was visiting the Indian Statistical Institute in 1987. The results reported here were obtained when Ray (then at the I.S.I.) visited the Instituto de Analisis Economico under the Sabbatical Program of the Ministry of Education, Government of Spain. We are grateful for funding from the Ministerio de Educacion y Ciencia (Grant No. PB90-0172) and EC FEDER Contract No. 92 11 07 005, and for supportive facilities at the I.S.I. We have benefited from long discussions with David Schmeidler, and have also received useful comments from Anthony Atkinson, Partha Dasgupta, Bhaskar Dutta, Michael Manove, Isaac Meilijson, Nicholas Stern, and participants of numerous seminars and conferences. The current version owes much to the comments of Andreu Mas-Colell and two anonymous referees. 2 For theoretical studies, see Kolm (1969), Atkinson (1970), Sen (1973), Dasgupta, Sen, and Starrett (1973), Fields and Fei (1978), and the numerous references in the excellent survey by Foster (1985). For empirical studies, the reader may consult Kuznets (1955), Kravis (1960), Adelman and Morris (1973), Weisskoff (1970), Ahluwalia (1976), and Fields (1980), to mention only a few. 819 820 J.-M. ESTEBAN AND D. RAY that the phenomenoncapturedby the new concept is indeed relevant.Our task is not easy for at least three reasons. First, one cannot rely on pre-existing intuitionor axiomatization.We shall have to introducethe concept and develop its intuition at the same time. Second, the theory of economic inequality-the obvious contender to the theory developed here-is based on a long and venerable tradition in welfare economics. This tradition is by now so deeply rooted in our way of thinkingabout distributions,that sometimeswe take the propositionsproven two decades ago as unquestionablyintuitive axioms.3Finally, while we see polarizationas a particularlyrelevantcorrelateof potential or open social conflict, "mainstream"economics has thus far paid little attention to this last issue.4 We begin with the obviousquestion:why are we interestedin polarization?It is our contention that the phenomenonof polarizationis closely linked to the generation of tensions, to the possibilitiesof articulatedrebellion and revolt, and to the existence of social unrest in general. This is especially true if the underlyingset of attributesis a variablesuch as income or wealth.A societythat is dividedinto groups,with substantialintra-grouphomogeneityand inter-group heterogeneityin, say, incomes,is likelyto exhibitthe featuresmentionedabove. At the same time, measured inequality in such a society may be low. The reader unconvinced of this last point may turn at once to Section 2, where the distinctionbetween polarizationand inequalityis made via a series of examples. The idea above is not novel at all. Marxwas possiblythe first social scientist to give a coherent interpretationof society as characterizedby the buildingup of conflictand its resolution.We are not interestedhere in his specifictheoryof the developmentof the capitalistsociety, as much as in his view of intergroup conflict dynamics,and the use of a concept such as polarizationto describe these dynamics.5Deutsch (1971)'saccountof the Marxiantheoryis particularly relevantfor our purpose: "As the struggleproceeds,'the whole societybreaksup moreand moreinto two hostile camps,two great, directlyantagonisticclasses:bourgeoisieand proletariat.'The classes polarize, so that they become internallymore homogeneousand more and more sharplydistinguishedfrom one another in wealth and power"(Deutsch(1971,p. 44)). This way of understandingsocial conflicthas been later taken up by contemporarysociologists(e.g., Simmel(1955) and Coser(1956))and politicalscientists 3 We find particularlyrevealingthe questionnaireresultsobtainedby Amiel and Cowell(1992). The majorityof students with an education in Economics ranked distributionsin a manner consistentwith the Lorenzordering,while studentswith other backgroundsproducedrankingsthat violatedthis ordering. 4 There is a strandof recent literaturethat sharesour view that social conflict,and in particular distributionalconflict,plays a crucialrole in the explanationof macroeconomicphenomena.For instance,Alesina et al. (1992), Alesina and Rodrik (1991), Benhabiband Rustichini(1991), and Perssonand Tabellini(1991)focus on the relationshipbetweendistributionalconflictand economic growth. 5 See also Lenin (1899),Chayanov(1923),and Shanin(1966). POLARIZATION 821 (e.g. Gurr(1970, 1980) and Tilly (1978)).6In fact, the broadview that a society split up into two well-definedand separated camps exhibits a high degree of potential (if not open) social conflict is neither new nor specificallyMarxist.It has also been shared by such differentpolitical analystsas BenjaminDisraeli, the nineteenth century Tory prime minister, or the US Kerner Commission, twentyfive years ago.7 This literaturediffersfrom studies on inequalityin one simpleyet fundamental way:in the formationof categoriesor groups,or "subsocieties,"thepopulation frequency in each category also carries weight.8 The following simple example is illustrative. Consider an initially equal society of peasant farmers. Suppose, now, that through a process that redistributes a certain fixed amount of agriculturalsurplus, a fraction a of these peasantsbecome rich kulaks at the expense of the remainder.Given the value of the surplus, for which values of a would we say that a "high" degree of differentiationis takingplace? Of course, there is no precise numericalanswer to this question,but certainlyvalues of a very close to zero or unitywould not be construedas creatinga sharpdivisionamongthe population.However,note that under every known measure of inequality, lower values of a would unambiguouslygive rise to higher inequality.9By effectively neglecting the population frequencyin each category, inequalitymeasurementdeparts from the study of differentiation. Anotherway to see the differenceis to recognizethat the axiomsof inequality measurement(or equivalently,second-orderstochastic dominance for meannormalizeddistributions)fail to adequatelydistinguishbetween "convergence" to the global mean and "clustering"around "local means."'0 Consider the following example, related to the question of economic "convergence"in the world economy,a topic that has recentlyreceived considerableattention." To be precise, consider the frequency distributionof national growth rates in 6Marx'scriticR. Dahrendorf(1959)himselfdoes not attackthis view of conflict,but ratherthat Marxoverlookedthe fact that individualcharacteristicsare multidimensional,so that differences within and similaritiesbetween social classes restrainedthe polarizationprocess (see Deutsch (1971)).Indeed, polarizationin some attributesuch as incomemay be counterbalancedby dissimilaritiesin other attributes,an objectioncommonto the theoryof inequality. Disraelicoined the term"twonations"to describethe Englishsocial structureof his time in his novel Sybil,publishedin 1845.The KernerCommission(1968)expressedconcernaboutthe ongoing divisionin the United States into two societies. 8By focussing on this one difference,we do not mean to deny the existence of others. In particular,it is difficultto remain unimpressedby the richnessof the political and sociological studies alluded to above. Nevertheless,this very richness opens the door to a possible lack of sharpnessat the conceptuallevel. The theoryof inequalitycan certainlynot be criticizedon these grounds.Our objective,then, is to retainthe sharpnessof a formalconstruction,capturingthe new featuresin the cleanestpossibleway. inequalitymeasure. 9To be precise,this would happenin the case of everyLorenz-consistent 10Landsbergerand Meilijson(1987)also note (in the contextof risk)that second-orderstochastic dominanceis not necessarilyassociatedwith transfersof probabilitymass from the center of the supI?ortof a distributionto its tails. See, for example,Barro(1991),Barroand Sala-i-Martin(1992),and Mankiw,Romer,andWeil (1992). 822 J.-M. ESTEBAN AND D. RAY per capita GDP. Suppose that over time, we were to observethat "low"growth rates were convergingto their commonmean, while the same is true for "high" growth rates. In terms of growth rates, then, two "growthpoles" would be forming-a clear emergenceof North and South. Polarizationin our sense may well be rising. But every inequality measure defined on growth rates, and consistentwith the Dalton TransfersPrinciplewould register an unambiguous decline!12 Indeed, there are a numberof social and economicphenomenafor which the knowledgeof the degree of clusteringor polarizationcan be more telling than a measure of inequality.Quite apart from the distributionof income, wealth, or growth rates, relevant economic examples include labor market segmentation, conceptsof the dual economyin developingcountries,or distributionof firmsby size in a given industry.In the broader arena of the social sciences, there are issues of social class or significantproblemsconcerningracial, religious,tribal, and nationalisticconflicts,which clearlyhave more to do with the clusteringof attributesthan with the inequalityof their distributionover the population. All this suggestsa need to develop a theory of the measurementof polarization. Indeed, the theory of inequalitymeasurementis possibly on far weaker ground in its claim to independent existence. Given the underpinningsof inequalitytheory as simplywelfare economicsapplied to a restricteddomain,13 one mightquestionthe value added of this specificattention.A good part of the literaturehas found its justificationin the assertionthat one should decompose distributionalquestions from questions of the aggregate product (or GDP). Even if the social welfarefunctionwere to enjoythis curiousproperty,'4there is still no need to treat the theory of inequality as separate from welfare economicsin general,distinguishedby propertiesintrinsicto the theoryitself. More closely related to the subject of this paper is the motivation for studying inequalityas a phenomenonclosely relatedwith social unrest.The openingpage of Sen's celebrated book, On Economic Inequality,asserts that "the relation between inequalityand rebellionis indeed a close one, and it runs both ways." We do not disagree with this position, but it remains to be seen empirically whether this connectionis specificallywith inequality,or with features that are 12 See Section2 for a numericalexample.Our preliminarystudiesusingthe WorldBankdata set reveal that the distributionby frequenciesover the annual growthindices by countriesis multimodal. We comparedthe behaviorof the polarizationmeasures developed in this paper with inequalityindices applied to the same data. Specifically,we find that while the Gini coefficient continuallydiminished over the period 1979-86, polarizationfalls too in 1981 but increases thereafter.We might add that we plan to applythe ideas developedin this studyto the empirical questionof "internationalconvergence."But that is the subjectof anotherpaper. For interesting recent researchthat bearson the issue of multimodalityin this context,see Quah(1992). 13 For measuresof relativeinequality,this is the domainover whichthe total sum of incomesis constant. 14 For studiesin this area, see Sheshinski(1972),Blackorbyand Donaldson(1978),Ebert(1987), and Dutta and Esteban(1992). POLARIZATION 823 commonto both polarizationand inequality,15and perhapsmore closely tied to the formerin cases of disagreement.16 We end our introductorydiscussion with a remark on the basic set of attributesthat we use for the developmentof our measurementtheory.Ideally, this set must include all individualattributesthat are relevant (at the social level) for creatingdifferencesor similaritiesbetween persons, possiblyrelative to a given announcedsocioeconomicpolicy.'7 But we will simplifythe analysis enormouslyby assumingthat the attribute can be captured by the values of some scalar variable. Indeed, we work with the natural logarithmof income (thoughthis particularinterpretationis formallyquite unnecessary).The idea is that percentage differences in income are a good proxy for socioeconomic differencesor similarities.More generally,even if the attributespace is multidimensional,our analysisis undisturbedif the differencesbetween attributescan be representedby a metric.But such mathematicalsleight of hand only sweeps a serious dimensionalityissue under the rug. So we do not defend our restriction except on groundsof tractability.We note, moreover,that exactlythe same criticismsmay be levelled againstthe theory of inequalitymeasurement.18 The rest of the paper is organized as follows. Section 2, which takes the reader througha series of examples,is designed to make two points. First, we arguethat there is an analyticaldistinctionbetween polarizationand inequality. Second, we submit that there is no simple (and acceptablyrich) partial order (like the Lorenz criterion in inequality theory) that will capture increasing polarization.Specifically,any measure of polarizationmust be "global" and "nonlinear,"in a sense that is made precise in Section 2. In Section 3, we formally develop and study polarization measures. Section 3.1 deduces an allowable class of measures, obtained by combininga behavioralmodel with certain intuitive assumptionsor axioms.Theorem 1 reveals that the allowable class of measures is quite sharp and amenable to empiricaluse. Section 3.2 (Theorem2) studies the maximalelements of polarizationmeasures,and reveals 15After all, there are a numberof situations(see Section 2) where polarizationand inequality measures agree in their rankings.In this intersectionfalls, for instance, the phenomenonof a disappearingmiddleclass, a matterof concernto sociologistsand some economistsin the contextof U.S. society.See, for example,Kostersand Ross (1988),Horriganand Haugen(1988),and Duncan, Smeeding,and Rodgers(1991). 16 For instance,the empiricalfindingsof Nagel (1974)show that social conflictis low both under complete equalityand under extremeinequality.Indeed, politicalscientistsworkingon inequality and conflict, such as Midlarski(1988) and Muller, Seligson, and Fu (1989) have alreadyserved notice that standardnotionsof inequalitymaybe inadequatefor the studyof conflict.Comingacross their work after a first draft of this paper was written,we see that their alternativenotions of patterned inequality and bifurcated inequality seem to be motivated by the same arguments we put forwardin this paper. A thoroughempiricalexplorationof our contentionsis the subject of a forthcomingpaper. 17 Indeed,the idea of usingspacesof policiesand measuringpolarizationas a state of affairsthat yield high resistanceto policieschosen at randomformsthe basis of an alternativedevelopmentof the theory.But this is beyondthe scope of the presentpaper. 18 While multidimensional variantsof inequalitytheory do exist, to go beyond the one-dimensional theorythey must imposeparticularrestrictionson how dimensionsare to be compared. 824 J.-M. ESTEBAN AND D. RAY them to be degenerate,symmetricbimodal distributions.In Section 3.3 (Theorem 3), we sharpenthe class of allowablemeasureseven furtherby the use of an additionalplausible assumption.In Section 3.4, we illustrate how these measures work by applyingthem to a set of examples, including the motivating examplesin Section 2. In Section 4, we extend our class of measuresbeyondthe simple model of Section 3, in a form that might be even more suitable for empiricalwork. Section 5 describespossible extensionsof the basic theory,and concludesthe paper. This section also containsremarkson the issue of population and income normalizationfor the measureswe obtain. 2. POLARIZATION: OVERVIEW AND SOME EXAMPLES The objectiveof this section is to study a numberof examplesin an informal way. Some of these examples will be elevated in the sequel to the status of axioms.At present, however,our statementsare not meant to be precise. They are meant to align your intuition to ours, to form a preliminaryjudgement regardingthe formalconcept we shall later introduce. Loosely speaking, every society can be thought of as an amalgamationof groups,where two individualsdrawn from the same group are "similar,"and from differentgroups,are "different"relativeto some given set of attributesor characteristics.The polarizationof a distributionof individualattributesmust exhibitthe followingbasic features. FEATURE 1: There must be a high degree of homogeneitywithineach group. FEATURE 2: There must be a high degree of heterogeneityacross groups. FEATURE 3: There must be a small number of significantlysized groups. In particular,groups of insignificantsize (e.g., isolated individuals)carry little weight. We start by comparingtwo alternativedistributionsof a given attribute(say income)over a population.Example1 below is designedto illustrateFeature 1. 1 2 3 4 5 6 7 8 9 10 FIGURE 1A FIGURE 1.-Diagrams 1 2 3 4 5 6 FIGURE 1B to illustrate Example 1. 7 8 9 10 POLARIZATION 825 EXAMPLE 1: In Figure la, the population is uniformlydistributedover ten values of income,spaced apartequallyat the points 1,2,..., 10. In Figure lb, we collapse this distributioninto a two-spikeconfigurationconcentratedequallyon the points 3 and 8. This diagramformalizesthe discussionof per capita GDP in the Introductionas a specificexample,or per capita growthrates if the attribute is reinterpretedappropriately. Which distributionexhibitsgreater"polarization"?We would argue that it is the latter distribution.Two groupsare now perfectlywell formed in the second diagram,while in the formerdiagramthe sense of groupidentityis more fuzzy. Under the second distributionpopulation is either "rich" or "poor,"with no "middle class" bridgingthe gap between the two, and one may be inclined to perceive this situationas more conflictualthan the initial one.'9 If you are still hesitant,thinkof the attributeas an opinion index over a given political issue, running from right to left. Many would agree that political conflict is more likely under a two-spike distribution-with perhaps not completely extremepolitical opinions,but sharplydefined and involvingpopulation groupsof significantsize-rather than under the uniformdistribution. But the point is this. If you admit the possibilityof greater polarizationin Figurelb, you are forced to departfromthe domainof inequalitymeasurement. For under any inequalitymeasure that is consistent with the widely accepted Lorenz ordering,inequalityhas come down in Figure lb relativeto la! EXAMPLE 2: This example illustrates Feature 2 above. It also shows that there are cases where increasingpolarizationalso conformsto Lorenz-worseningof the underlyingdistribution.ConsiderFigures2a and 2b. Think of the attribute as income. Figure2a is just the old Figure lb. In Figure2b, we also have a two point income distribution,but this time concentratedequallyon the incomes 1 and 10. Which has more polarization?Well, as far as intra-grouphomogeneity goes, there is nothingto choose between the two diagrams.But there is greater inter-groupheterogeneityin Figure 2b. So it should be fairly easy to conclude that there is more polarizationin Figure 2b. But note that this time income inequality,too, has worsened in Figure 2b (relativeto 2a). In particular,we do not claim that the notion of polarizationalways conflictswith that of inequality. EXAMPLE 3: With the help of Figure 3, we illustrate Feature 3. In Figure 3a, the population is distributedaccordingto a three-point distribution,with the attributesspaced equally. Think of the central and the right-handmasses as being of approximatelythe same size. Considera small shift of populationmass from the left extreme to the right extreme.The problemis immediatelyseen to be more complexthan those of the previousexamples.While it is true that the shift creates a greatertendencytowardsformingtwo sharplydefinedgroups,the 19 In this context,see Proposition9 in Coser(1956)(whichdrawson Simmel(1955)),wherebythe internalcohesionof groupsis associatedwith increasedconflictbetweengroups. 826 1 J.-M. ESTEBAN AND D. RAY 2 3 4 5 6 7 8 9 10 1 2 3 4 5 6 7 8 9 10 FIGURE 2B FIGURE 2A FIGURE 2.-Diagrams FIGURE 3A to explain Example 2. FIGURE 3B FIGURE 3.-Diagrams to explain Example 3. left mass maywell be instrumentalin creatingpart of the social tensionsthat do exist, and the net effect is far from clear. Consider,however,the case illustratedin Figure 3b, where the left mass is very tiny indeed comparedto the other two. In this case, the initial contribution of the left mass is vanishinglylow and it mightmake sense to arguethat the very same move now serves to increasepolarization.This is the substanceof Feature 3. While we do not adopt the example of Figure 3b as a basic assumptionfor our main result, we do explore its implicationsfor the choice of measure (see Section 3.3 and Theorem3). To us, it is reasonableas a basic axiom,thoughit is of interest to see how far we can go without it. The examples above are meant to show that the concept of polarization (whateverit is) is logically separate from that of economic inequality,and to bringout certaindefiningfeatures of the concept.The two remainingexamples in this section are meant to illustrate an entirely different set of issues. Specifically,we wish to argue that any reasonablemeasureof polarizationmust be global in nature,in a way that inequalitymeasuresare not. To see this point, recall the Dalton Principleof Transfersthat underliesthe Lorenzordering.The principle states that startingfrom any distributionof income, any transferof income from an individualto one richerthan him must increaseinequality.The principle is a local one. To apply it, it is unnecessaryto take account of the 827 POLARIZATION 1 2 3 4 5 6 7 8 9 10 FIGURE 4A FIGURE 4.-Diagrams 1 2 3 4 5 6 7 8 9 10 FIGURE 4B to illustrate Example 4. original distribution. In our thinking about the subject, we have found it impossibleto come up with a similarlocal prescriptionfor increasingpolarization. The examplesthat follow try to indicatewhy this is so. EXAMPLE 4: Consider the two income distributionsin Figures 4a and 4b. Each distributionhas point masses at the income levels 1, 5, and 10. In Figure 4a, half the population is equally divided between incomes 1 and 5, and the other half is at income 10. Consider,now, the fusion of the firsttwo groupsinto a single mass at the income level 3 (see arrowsin 4a). Has polarizationgone up? Note that the collectionof individualsat incomelevels 1 to 5 no longerhave any interpersonalanimosity;they are all united at income level 3. They now see as their "commonenemy"the sizeable groupat income level 10. Moreover,so far as the group at income 10 is concerned, they now see facing them a united group of individualsof similarsize. On average,this group is as differentfrom the 10-groupas it was before. It would, indeed, be difficult to defend the view that polarization has decreasedin this situation! But now turn to Figure4b. Here, the only differenceis that almostthe entire populationis dividedbetween the two income levels of 1 and 5, and there is a tiny fractionof individualsat income level 10. Performthe same experimentas before. You will notice that it is now practicallyimpossibleto make the same argumentas we made for the precedingcase. Because of the small size of the 10-group,"most"of the "polarization"in Figure 4b comes from the fact that there are two groupssituated at incomes 1 and 5. Put them together and you have wiped out most of the social tension. It is true that there is a group at 10, but it is a very small group, and any notion of continuity20would requirethat the intra-grouptension also be small in the new situation,so in Figure 4b the opposite argument-polarization has come down-is more convincing. 20 Simply send E to 0 in Figure 4b. At E = 0, polarization must come down by the argument of Example 2. So this must also happen for ? > 0 but small! 828 J.-M. 1 2 3 4 5 6 7 ESTEBAN 8 9 10 AND 1 D. RAY 2 3 4 6 7 8 9 10 FIGURE SB FIGURE SA 1 5 2 3 4 5 6 7 8 9 10 FIGURE Sc FIGURE 5.-Diagrams to illustrate Example 5. EXAMPLE 5: Reconsider Example 2. Imagine moving from Figure 2a to Figure2b by a series of smallchanges.In each change,equal fractionsof people at incomes 3 and 8 are removedand replacedat 1 and 10 respectively.The end result is, of course, Figure 2b. Figures 5a and 5c reproduce Figure 2, with Figure 5b displayingan intermediatescenariowhere the populationis equally divided among the income levels 1, 3, 8, and 10. Note that these changes generate a sequence of regressiveLorenz dominateddistributions. We have alreadyarguedthat 5c displaysmore polarizationthan 5a. But does 5b displaymore polarizationthan 5a? We claim that there is no unambiguous answerhere. While it is true that there are very differentgroupsin 5b (relative to 5a), there is also little group homogeneity.It all depends on whether one gives more weight to intergroupdifferences at the expense of within group homogeneity. We are not making any particularclaim as to a direction of change, but are only tryingto convince you that there is a genuine ambiguity here. These examples (4 and 5) are meant to illustrate the possibility that the searchfor a (reasonablyrich)partialorderfor increasesin polarizationcan be a difficultone. For one thing, the effect (on polarization)of a given change may POLARIZATION 829 depend on factorsthat are not directlyassociatedwith the change(e.g., the size of the 10-groupsin Figures 4a and 4b). This is what we meant above by the global nature of the concept. Moreover,the same directionsof change can be associatedwith different effects on polarization,dependingon the initial configurations.This point is illustrated by both Examples 4 and 5. Both these features are absent in the theory of inequalitymeasurement,embodied in the Lorenzordering. In the next section, we present a formal analysisthat attempts to capture some of the issues raised here. 3. POLARIZATION MEASURES 3.1. Axiomatic Derivation of a Class of Measures 3.1.1. Distributions and Polarization Measures The previous section should have already made clear that the concept of polarizationis somewhatcomplicated,and no single simple axiom(such as the Dalton Axiom in the contextof inequalitymeasurement)will serve to captureits essence. Accordingly,our aim is limited. We introduce a model of individual attitudes in a society that leads naturally to a broad class of polarization measures. On this class, we place certain axioms and narrowdown the set of allowablemeasuresconsiderably.Finally,we show that the measurewe obtain performs"well" in the context of the examplesearlier described. Our basic perceptualvariableis the naturallogarithmof income, denoted by y, and we shall presumethat its values lie in DR (zero incomesare not permitted). By an individual9 we shall mean an individualwho has income (with log equal to) 9. The choice of this variable is based on the presumption that only percentage differencesmatter. But any other scalar can be used as the basic perceptualvariablewith little overallconceptualdifference.21 In this paper,we consideronly those distributionswith supporton some finite set of incomes. This simplificationis made for expositoryconvenience.There are nontrivialadditionalissues involvedin dealingwith the case of distributions with infinitesupport.We commenton these brieflybelow, and refer the reader to Estebanand Ray (1991)for a detailed developmentof the theoryin this case. We begin with some backgrounddefinitionsand notation. For any positive integer n, (rr, y) (n1r ***, yn;y1,* , yn) is a distribution if y E WR,yi $ yj for all i, j, and nr> 0. The total population associated with (irr,y) is given by EL 11Ti. Denote by -9 the space of all distributions. A polarization measure (PM) is a mapping P: 9 -- R 21 It may be argued, for example, that large, percentage differencesin income make little differenceto an individualaftersome threshold.In that case, the basicperceptualvariablemightbe some strictlyconcavetransformof y. 830 J.-M. ESTEBAN AND D. RAY Throughout,we will suppose that the ranking induced by a polarization measure over two distributionsis invariantwith respect to the size of the population.This homotheticitypropertyis standardin the theory of inequality measurement(see, e.g., Foster (1985)). We refer to such a propertyas Condition H. CONDITION H: If P(rr, y) > P(rr', y') for two distributions (iir, y) and (r', y'), then for all A > 0, P(Arr,y) > P(Ar', y'). We now turn to the study of a behavioralmodel that will yield a class of polarizationmeasures. 3.1.2. A Model Our first concept deals with the fact that intra-grouphomogeneityaccentuates polarization.We proposethat an individualy feels a sense of identification with other individualswho have the same incomes as him. Thus the identification felt by an individualis an increasingfunctionof the numberof individuals in the same income class of that individual.Formally,we introducea continuous identification function I: R+-D>R+. Assume I(p) > 0 whenever p > 0. Formally, no furtherassumptionneed be made, though it is no surprisethat we will later deduce that I(p) is increasing(and more). Two remarksare in order.First,we have postulateda particularlysharpform of identificationwhereby individuals feel identified with people who earn exactlythe same income,but with no one else. This is defensibleonly if we think of incomes as point estimates of income classes or intervals.It is surely more reasonableto also allow for (a perhapssmaller)identificationwith neighboring individuals.Indeed, as we point out in detail in Section 4, this postulateleads to a drawbackwith our derived class of measures.We have neverthelesschosen this route in the interest of a cleaner model. Second, the sense of identificationmay depend not only on the numberof similarindividualsbut also on the commoncharacteristics(in this case, income) that these individualspossess. We commenton this extensionin Section 5.3. Next, we posit that an individualfeels alienated from others that are "far away"from him. This concept will deal with the fact that inter-groupheterogeneity accentuates polarization. Let a: R;+ R+, with a(O)= 0, be a nonde- creasingcontinuousfunction, to be called the alienationfunction. We assume that individual y feels an alienation a(8(y, y')) with an individual y', where 8(y, y') stands simplyfor absolute distance Iy - y'f. We pause here to explicitlytake note of an importantfeature subsumedin the definition. We have assumed that the concept of alienation (as well as identification)is perfectly symmetric.An individualwith low income feels the POLARIZATION 831 same alienation towards an individualwith high income as the latter feels towardsthe former. This is perhaps inappropriate,as it can be argued that a polarizedsociety arises from the alienationthat the poor feel towardsthe rich, and not the other way around.We discussan "asymmetric"model that captures these features in Section 5.2. Now we are in a position to put togetherthese two concepts.Whatwe wish to capture is the effectiveantagonismthat y feels towards y'. This is, roughly speakingjust the same as y's alienationvis-a-visy', but we wish to allow for the possibility that an individual's feelings of identification may influence the "effectivevoicing"of his alienation.22In the sequel, this influencewill turn out to be the criticalfeature that distinguishespolarizationfrom inequality. Accordingly,we allow for this influence (though we do not assume it). We postulate that the effective antagonism felt by y towards y' is given by a continuous function T(I, a), where a = a(8(y, y')) and I =I(p), with p the measureof people in the income class of individualy. This function T is taken to be strictly increasing in a whenever (I, a)>> 0. We assume further that T(I, 0) = 0.23 Finally,the total polarizationin the society is postulatedto be the sum of all the effective antagonisms: (1) P(7, Y) = n n E E iri7jT(I(7i),a(6(yi,yj))). i=1 j=1 The postulate (1) embodies two assumptions.First, we record that polarization depends on the "vector"of effectiveantagonismsin the society and on this postulate of alone; in this respect it is analogousto a Bergson-Samuelson-type nonpaternalism.Second it goes further and proposes an analogue of additive utilitarianism.Some justificationfor the additivepostulate may be found along lines suggestedby Harsanyi(1953) and others: that an impartialobserverwho might find himself in any position in the given distributionmight use the expected value of his effective antagonismto judge overallpolarization. On the other hand, we have left the structureof the functions I(-), a(-), and T(&)very general indeed. The choice of any one functionalform for each will yield a particularmeasureof polarization. 22 Thus, we are primarilyinterestedin notions of organized opinion,protest,etc. The notion of an individualwho runs amok,perhapsprovokedby his sense of isolation,is an importantone that we do not considerhere. 23 Note that these assumptionsformallypermit identificationto not have any influence at all (T(I, a) = a is permitted,for instance), or perhaps even have negative influence, though these featureswill disappearin the sequel. Note, moreover,that it is also possibleto weightone's feelings of alienationby invokingthe visibilityof the group from which the individualfeels alienated.But there is nothing so far in our discussionand exampleswhich appear to necessitate this. Indeed, whethersuch a weightingshould enter positivelyor negativelyis far from intuitivelyclear. So we ignorethis in the interestsof simplicity.See Section5.3 for more discussion. 832 J.-M. ESTEBAN AND D. RAY 3.1.3. Axiomatic Derivation We are interested in narrowingdown the class of allowable measures by imposingsome "reasonable"axioms. AXIOM 1: p q- q x 0 q y Data: p, q>> 0, p > q, 0 <x <y. Statement: Fix p > 0 and x > 0. There exists E > 0 and ,ut> 0 (possibly depending on p and x) such that if 8(x, y) < E and q < ,up, then the joining of the two q masses at their mid-point, (x + y)/2, increases polarization. The intuition behind this axiom should be very clear. The two right hand masses are individuallysmaller than p (see Data). Moreover,they are "very close" to each other (see Statement).In such a case, by pooling the two small masses we "identify"them while not changingthe average distance from the third mass. This should thereforeraise polarization.24 AXIoM 2: p r q 0 24 - x I y The axiom is actually stated in very weak terms. The axiom is invoked only when the distance between x and y is small, and also only when the q-masses are small relative to p. 833 POLARIZATION Data:(p,q,r)>>O, p>r, x>Iy-xl. Statement: There is E > 0 such that if the population mass q is moved to the right (towards r) by an amount not exceeding e, polarization goes up. This is also an intuitiveaxiom.The intermediatepoint mass q is at least as close to the r-massas it is to the p-mass.Additionally,the p-mass is largerthan the r-mass.So if only small locationalchangesin the q-massare permitted,the direction that brings it closer to the nearer and smaller mass should raise polarization. We now turn to our final axiom. AxIOM3: r q p 0 x -d d y Data: (p,q)>>0, x=y-x-d. Statement: Any new distributionformed by shifting population mass from the central mass q equally to the two lateral masses p, each d units of distance away, must increase polarization. This axiom is so intuitiveit hardlyrequirescomment.The axiom states that the disappearanceof a "middle class" into the "rich" and "poor" categories must increasepolarization. We may now state our main result.To do so, it will be necessaryto introduce the function f: R22- R, definedby (2) f(z,a) (1 +a)[z - Z +]2 It is possible to show that there exists a number a* > 0 such that maxz> 0 f(z, a) < 0 (resp. < 0) if and only if a < a* (resp. < a*). Moreover,it can be shownthat f(z, a*) < 0 for all but one point z*, where equalityholds. It is also possible to show analyticallythat a* E (1,2). Numerical computation reveals that a* - 1.6. Details of the proofs of these assertionsand the numerical computationare availableon request. 834 J.-M. ESTEBAN AND D. RAY We now have the followingtheorem. 1: A polarization measure P* of the family defined in (1) satisfies THEOREM Axioms 1, 2, and 3, and Condition H, if and only if it is of the form (3) P*(wr,y) n n i=l j=l =KE r1+arj Y- j for some constants K > 0 and ae (0, a* ] where a* - 1.6 (see above, equation (2)). This theorem dramaticallynarrowsthe class of allowablepolarizationmeasures. The only two degrees of freedom are in the constants K and a. The former is simply a multiplicativeconstant which has no bearing on the order, but which we use for population normalization(see Section 5.1 below). The constant a reflectsour deductionthat the identificationfunctionmust be of the form pa, where a > 0. Note that no a priori assumptionof how identification affects effective antagonismhad been postulated in the model. The deduction that this effect is indeed positive is not surprising,given the earlier discussion and examples.This is preciselywhat will distinguishpolarizationmeasuresfrom inequalitymeasures. It should be noted, however,that a cannot take on arbitrarypositivevalues, but must be bounded in the way indicatedby the theorem.Withoutthis bound, all the axioms cannot be satisfied and furthermore,the bound is needed to verifyother intuitivepropertiesof the measure(see below). Indeed, each one of the restrictionsimplied by the theorem is importantin signingthe directionof change in intuitiveexamples.The readermay turn right awayto Section 3.2, where a numberof examplesare workedout and the use of each of these conditionsis illustrated. It should be noted that the measure P* bears a strikingresemblanceto the Gini coefficient. Indeed barring the fact that we are using the logarithmof incomes,our measure would be the Gini if a were equal to zero. It is precisely the fact that populationweights are raised to a power exceeding unity in the formula above that gives rise to distinctlydifferentbehaviorof a polarization measure. Thus a may be treated as the degree of "polarizationsensitivity"of the derived measure. The larger is its value, the greater is the departure from inequality measurement.For more on this, see the discussion following the statementof Theorem3, and Section 3.4.2. With these remarksmade, we now turn to a proof. PROOF: (Necessity.) Define 0: lR2- R by 0Gr, S) T(IOrr),a(a)). First, we show that 0Qr, *) is linearfor all wr> 0. Considerthe case depicted in Axiom 1, with S(x, y) < e and q < Ap. Total polarizationis given by (4) P1 _pq[O(p, x) + 0(p, y)] + pq[O(q, x) + O(q, y)] + 2q20(q, IY-xl). 835 POLARIZATION When the two q's are put together at (x + y)/2, polarizationis (5) p2 pq [0( P Y )]+ 2pq [0(2q, xy)] So by Axiom 1 we have, combining(4) and (5), pL 'P 2J) 2q, 2 ]>P[(P, x) + O( y)] +p[O(q, x) + O(q, y)] + 2q0(q, Iy -xl). Passing to the limit as q -O 0, we get for each p > 0, (6) 20 p, 2 > (p,x)+ (P,Y) The above holds, it should be noted, for each (p, x) > 0 when y is "sufficiently"close to x. Nevertheless,it is possible to show that because 0(p, *) is continuousfor each p, 0(p, *) mustbe concave.The detailsof this argumentare but tedious, and are thereforeomitted. straightforward Next, pick any (p, x, x') >> 0 with x > x'. We claim that there exists D > 0 such that for all A E (0, D], 0(p, x + A) > 0(p, x' - A). To establish this claim, consider Axiom 2. For given (p, x, x') >>0, choose (q, r) >>O such that p > r and y such that y - x = x'. Then (p, q, r, x, y) satisfies all the conditionsof that axiom.Now, polarizationis initiallygiven by (7) Pa pr[O(p, y) + 0(r, y)] +pq[O(p, x +A) + O(q, x + A)] + qr[O(q, x'-A) + 0(r, x'-A)] evaluatedat A = 0. By the axiom,it mustbe the case that Pa > POfor all values of A E (O,x').25 Using this informationin conjunctionwith (7), we see that pq[{0(p, x +,A) - 0(p, x)} + {O(q,x +,A) - O(q,x)}] > qr[{O(q,x') - O(q, x' -,A)} + {0(r, x') - 0(r, x' - A)}] for all A E (0, x'/2). Passing to the limit as (q, r) -->(p, p), we deduce that O(p, x +,A) - O(p, x) > O(p, x') - O(p, x' -,A), for all 0 <,A < D x'/2. This establishesthe claim. Combiningthis claim with the earlierdeductionthat 0(p, *) is concave(and continuous)for all p, we may conclude at once26 that there exists a continuousfunction 4( ) with +(p) > O for all p > 0 such that for all (p, 8) > 0, O(P, a) = O( p)6 25 If this were not true, then keeping(p, q, r, y) fixed, there would exist some value x" E (x, y) such that Axiom2 wouldbe violatedwith x" in place of x. 26We omit the detailsof this deduction.One easyway is to note that by the concavityof 0(p,), o admitsright and left hand derivativesin its second argument.By interpretingthis claim as an additionalrestrictionon these derivatives,we maydeduce the linearityof 0 in its second argument. 836 J.-M. ESTEBAN AND D. RAY We show, now, that k(&)must be an increasingfunction. To do this, firstpick q > 0 and then recall the scenario of Axiom 1, with (p, x, y) chosen such that 5(x, y) < E and q < ,up. Total polarizationis given by Pi -pq[O(p)x +4(p)y] +pq[O(q)x + O(q)y] + 2q20(q)ly When the two q's are put together at (x + y)/2, polarizationis x +y x+y + 2pqb(2q) =2pq4(p) p2- -xj. 2 2 By combining these two observations and using Axiom 1, it follows that 4(2q)> f(q). Since q was an arbitrarypositive number,the continuousfunction +(q) must be increasing. We show, next, that 4(p) must be of the form Kpa, for constants (K, a) >> 0. To prove this we proceed as follows. Considerany two-pointdistributionwhere positivepopulationmasses p and q are situated at a distanceof one unit from each other. Total polarizationis given by pq[b(p) + +(q)]. Now note that for each (p, q, p') >>0, there exists q' > 0 such that + +(q)] =p'q'[O(p') + 4(q')]. By ConditionH, it follows that for all A > 0, (8) pq[b(p) (9) A2pq[O(Ap) + 4(Aq)] = A2p'q'[ (Ap') + O(Aq')]. Combining(8) and (9), we see that (10) o(p') + o(q') O(Ap') + O(Aq') +(P) + +(q) O(Ap) + O(Aq) Takinglimits in (10) as q -O 0 and noting that q' 0 as a result,we have for all (p, p', A) >> O, ( ( 1) ) -- () O(Ap) O(Ap') Put A = i/p and r = p'/p in (11). Note that p and r can be varied freely over Makingthe requiredsubstitutionsin (11), we arriveat the fundamental DR++. Cauchyequation (12) k(p)4(r) = 4(pr)4(l) for all (p, r) >>0, where 4( ) is continuous and strictly increasing(this last observationby our previous argument).The class of solutions to (12) (that satisfythe additionalqualifications)is completelydescribedby +(p) = Kpa for constants(K, a) >>0 (see, e.g., Aczel (1966, p. 41, Theorem3)). To summarize,we have proved so far that O(p,8) = Kp for (p, 8) > 0. It remainsto deduce the bounds on a. To do so, consider the case of Axiom 3. Polarizationis given by (13) P() 2d[(p-) + 4d(p 1p+a _ (q + 2) A)2+a, + (1 + 2,) 1+a - A) 837 POLARIZATION evaluatedat A = 0. The axiomimplies that the derivativeof P with respect to A must be nonpositivefor all values of (p, q) >>0. Performingthe necessary calculation and defining z as the ratio p/q, we see that this requirementis equivalentto the conditionthat f(z, a) < 0 for all z, where f(z, a) is definedin (2). But this implies that a < a*. (Sufficiency.)ClearlyP* as describedin the statementof the theoremsatisfies ConditionH. It remainsonly to verifyAxioms 1, 2, and 3. (VerifyingAxiom 1): Considerp, q, x, y as in the Axiom. Initially,polarization is p = (pl+aq + q+ap)(x + y) + q2+aly-xl. Putting the two q's together at (x + y)/2, Pt = (2p = (pl+aq +aq + 21+aql+ap)(x + ql +ap)(x +y)/2 + y) + (2 al) (x + y) ql +ap, so that P' >P whenever (2a - 1)(x +y)p > 2qIy -xl. It is easy, therefore, to find (e, ,)? >>0 so that Axiom 1 is satisfied. (VerifyingAxiom 2): Take p, q, r, x, y satisfyingthe conditions of Axiom 2. Polarizationis given by + ql+ap) + (y -x)(q1+ar + rl+ap). P=x(pl+aq (14) +rl+aq) +y(pl+ar Considera small increase in x by A. Then, using (14), the change in polariza- tion is A{(p1+aq + ql+ap) -(ql+ar + rl+aq)}. It is easy to verifythat this expressionis positive if p > r. (VerifyingAxiom 3): Considerthe case depicted in Axiom 3. Recall (14). One can verify that it will be sufficient to prove that for every (p, q)>>0, the derivative of P(A) evaluated at A = 0 is nonpositive, and that it is strictly negative for all but at most one ratio of p to q. For each ratio z =p/q, this derivative is given by f(z, a). Now refer to the assertions following (2) to complete the verification. Q.E.D. This completesthe proof of the theorem. 3.2. Maximal Elements of the Polarization Measure This subsection is devoted to examiningthe partial orderingsgenerated by the class of measures given by Theorem 1. Moreover,we show that for given income classes Y1,Y2,..., Yn and total population,the distributionthat assigns half the populationto the lowest income class and the remainderto the highest income class is more polarized than any other distribution,under any of the measuresgivenby Theorem1. In what follows,we shall take as fixedthe income classes and identifydistributionsby the vector of populationmasses. 2: The bimodal distribution (H/2, 0,. . ., 0, H/2) is more polarized THEOREM than any other distribution (rr,... measure P* in the class given by (3). , n) with total population H, under any 838 J.-M. ESTEBAN AND D. RAY PROOF: Because of the continuityof any given measureP* in the class given by Theorem 1, and given the compactness of the space of distributions,it suffices to show that any distribution not equal to (H/2, 0, .. ., 0, H/2) can be dominated by another in terms of polarization as measured by P*. In the argumentsto follow, we will considermeasureswith K normalizedto unity:no loss of generalityis involved.The followinglemmaswill be useful. LEMMA1: Supposethat thereexistsk, 1 < k < n, with thepropertythat Tk > 0 and {EW`?lw1- E>k? 1w) and {Ek_l'l '+a - E1 =k+ 1T1I+a}are nonzeroand of the samesign. Then,if the sign is positive(resp.negative),the newdistributioncreated by movingall massfrom k to k + 1 (resp. to k - 1) raisespolarizationunderP*. In what follows,we will denote Iyi - yjIby a(ij). Suppose that the condition of the lemma is met with sign positive (the negative case uses a symmetricargument).Denote by P and P' the values of polarizationbefore and after the change. By singling out the indices k and k + 1, we can write P as PROOF OF LEMMA 1: (15) P- n n n E Erja(ij) + 7Tk+' E 7ja(ki) j=1 i=1 j=l iok,k+1 jok,k+1 n + kE wl+aa(jk) j=1 n + .1+al E Trja(k + 1,j) j=l jok n a(k+1,j). +Wrk+ l E, wJil+a j=1 jik Now observe that for j > k, a(kj) = a(k + 1, j) + a(k + 1, k), while for j < k, a(kj) = a(k + 1, j) - a(k + 1, k). Consequently, we may rewrite (15) as n (16) P n n 1I rja(i;) E i=1 i$k,k+1 j=1 j?k,k+1 k+ k+ E 7rja(k + 1 j) j=1 jok n + (Wk+Wk+1) E w+aa(k + 1,j) j=1 j#k n + 7rl+aa(k + 1, k) E k-1 v-Ev j=k+l + lTka(k + 1, k) T1+a ( j=k+l j=1 k-1 n - E +a ) j=1 Recall now that P' is the measure achieved by shifting all mass from k to k + 1. 839 POLARIZATION Therefore n (17) P -r n n + rr1+arja(ij) E E i=1 i#k,k+l + 7k+l) (7k E j=1 j#k,k+l j=1 j#k rja(k +1, j) n +(k + 1, j). l+a a(k E +Wk+1) j=1 jik Subtracting (16) from (17), we have n (18) P + P1[(Wk 1k+l)a (k Ewvja(k+1,j) k1+a)] + j=l (k-1 +7kl?aa(k+l,k) n 7rj- vi E j=k+l k-1 n r+_ + wrka(k + 1,j=+ k) j j=l j ;1 wJ+ , E j=k+l Combiningthis (7)+a + w)) Note that because a > 0, (7k + Wk+k)l+a> observationwith (18) and the conditions of the lemma, we see that P' >P. Q.E.D. LEMMA2: Suppose that an income distribution has at least four nonzero mass points. Then there exists k, 1 < k < n, such that the conditions of Lemma 1 are met. PROOFOF LEMMA2: Let k1 be the second nontrivial mass point counting from the left, and k2 be the second nontrivialmass point counting from the right. Then, because there are at least four nonzero mass points, k1 < k2. We claim that the conditionsof Lemma1 hold for at least one of these two indices. Clearly,for either k = k1 or k = k2, i = 1 - Ei=k+l1i # 0. Case 1: Suppose that k = k1. Consider the subcase where n ki-1 (19) E E wii=1 wi>O. i=kl+l Because wi > 0 for at most one index i for 1 < i < k1, we have Eik 1-1+a. Consequently, by (19), k - 1 S7r 1+a= i=l 1 +a kl-l S \i=1l i >l \i= 1 +a n E i=kl+l i (E 1Yl li)l+a = n > E r1i+a. i=kl+l This means that the conditionsof Lemma 1 are met for k = k1. Next, considerthe subcasewhere (19) fails. Then it certainlymust be the case that (19) holds when k1 is replaced by k2. Now carry out exactly the same argumentfor k2 as we did for k1. 840 J.-M. ESTEBAN AND D. RAY Case2: Suppose that k = k2. Then use exactlythe same argumentas in Case Q.E.D. 1, with the roles of k1 and k2 reversed. We may now complete the proof of the theorem. If the number of mass points is at least 4, use Lemmas1 and 2. If the numberof mass points is 3, use Axiom 3. Finally,if the numberof mass points is 2, then use the computationin Q.E.D. Section 3.4.1 below. Lemmas1 and 2 are of independentinterestfor polarizationanalysis.Lemma 1 establishes a criterion of polarizationdominancewhich generates a partial ordering.Roughly speaking,this criterionsays that polarizationis raised if we shift a populationmass to a neighboringclass wheneverthe distributionof the populationon the side to which this neighborbelongs satisfiesthe followingtwo conditions:total population is less, and more "dispersed"than on the other side. Lemma 2 demonstrates that for every distributionthere always is an income class such that the previous conditions are satisfied. For additional discussionof this point, see Section 3.4.3, item (3). 3.3. An Additional Restriction The conclusionof Theorem 1 maybe sharpenedfurtherby the inclusionof an additionalassumption,whichwe find plausible.27This assumptionconcernsthe case studied in Example3 of Section 2, which, it will be recalled,was designed to illustratethe insignificanceof small groupsin the concept of polarization. AXIOM 4: r q Pi 0 x y Data: (p, q, r, ) >>O and q > r. Statement: There is ,u > 0 such that if p < ,ur and q - r < ,u, then a transfer of population mass from p to r will not decrease polarization. 27We decided against using this assumption directly in Theorem 1. Theorem 1 does achieve a dramatic simplification, and it may be useful to explicitly see which assumptions are driving the result obtained there. 841 POLARIZATION Note that Axiom 4 only applies if p is small relativeto the masses q and r, and if q - r. We note, too, that mass is transferredfrom p to only the smaller of the two remainingmasses,so the Axiomis quite weak. We refer the readerto Section 2, Example3, for a discussionof this point. THEOREM 3: Under Condition (H) and Axioms 1-4, every polarization measure must be of the form described in Theorem 1, with the additional restriction that a> 1. We note that Theorem 1 is deficient in one respect. Because of the weak natureof the axiomsemployed,Theorem1 fails to rule out the possibilitythat a maybe arbitrarilyclose to zero. As noted above,smallervalues of a indicate an increasinglack of "polarizationsensitivity,"and a greater degree of concordance with inequality measurement.The limiting case of a = 0, as we have alreadyobserved,is one of inequalitymeasurement,with our measureconverging to the Gini coefficient (defined on log incomes). En route, to the limit, Axioms 1 to 3 are satisfied, but in progressivelyweaker terms (with failure occurring,of course, only at the limit). The additionalrestrictionachieved in Theorem 3 rules out these cases altogether,boundingthe permissiblevalues of a below by unity, and impartinga minimaldegree of "polarizationsensitivity" to the entire class of measures.See Section 3.4.2 for additionalillustration. PROOF OF THEOREM 3: For given (p, q, r, A) as in Axiom 4, polarizationis given by (20) P A{[p 1+aq + q1 +ap] r r+l+aq] + + [q' + rl+ap]}. +2[p1+ar Differentiating(20) with respect to p under the understandingthat dq = 0 and dr = -dp, we see that (21) --= +(1+ )paq +q1+a + 2(1 +a)par-2pl (1 +a)raq -ql+a +a-2(1 + a)rap + 2r' +a. Using Axiom 4, it follows that evaluated at p = 0 and q = r, (1/A)(dP/dp) Passing to the limit as p -> 0 and r -p q in (21), we see that < 0. 1 dP a dp (= O,r=q) = -(1 +a)ql + 2q'+ So the requiredinequalityholds if and only if a > 1. This completes the proof of the proposition. Q.E.D. 842 J.-M. ESTEBAN AND D. RAY 3.4. Propertiesof the Polarization Measure in Particular Cases In this subsection,we apply the polarizationmeasurethat we have obtained to some simple and (intuitively) not-so-simple cases, to examine how the measure behaves. These cases include an analysis of the various "desirable" features of a polarizationmeasure, discussedin Section 2. In all the examples that follow, we normalizepopulationto a unit mass and set K = 1 in (3). 3.4.1. Two-pointDistributions Our first example studies aspects of Example 2 (Section 2). Consider the two-point income distributionwith weights 7r and 1 - r concentrated at incomes x and y. Polarizationis given by + (1w-g) ?awJ P[ w1+a(1-r) (y-x). It is immediatethat pulling apart the two groupswill increase polarization. Furthermore,by using the expression above, it can be verified that for given (x, y), polarizationis maximizedat rr= 0.5. This last feature uses the condition (provedin Theorem 1) that a < a.28 3.4.2. Three-pointDistributions We now take a look at different three-point distributions.Without loss of generality, denote by (0, x, y) the incomes (0 < x < y) and by (p, q, r) the correspondingpopulationweights.The general form of the measurenow reads P-[pl+q+ql+apx + [q1+a [pl+pr+rl+p]y r + r1+aq](y-x). One special case is where p = r. This yields a symmetricdistribution.As populationweight on this distributionis shifted from the center to the sides, we expect polarizationto rise. This is exactlythe sentimentof Axiom 3, which our polarizationmeasureis knownto satisfy. Another specialcase is that illustratedby Example4 in Section 2. Here q = r. The idea is to examinewhat happenswhen the two q-massesare fused at the midpointof their incomes.We arguedin Section 2 that polarizationshould rise if p is "large"relativeto q, but fall if the relativesizes were reversed.Denoting the polarizationvalues before and after the changeby P and P' respectively,we can reproduceexactlythe same argumentin the proof of Theorem 1 to see that P > P' if and only if p q 2(y-x) (2 - hl)(x + y) 28 Indeed, this feature holds as long as xx< 2. 843 POLARIZATION p q q d qd- FIGURE 6.-Weight-shifting fromcenterto sides. The behaviorof the measurethereforereinforcesour intuition.In particular,if p is large relativeto q, polarizationrises, while exactlythe opposite is true if p is small.29This is preciselythe kind of global sensitivitythat we referredto in Section 2. Apropos the discussionfollowingTheorems 1 and 3, note how clearly a acts as a measureof "polarizationsensitivity."In particular,the smallerthe value of a the highermust be the ratio of p to q before polarizationcan be said to have increasedunder the measure. 3.4.3. Distributions with Four or More Mass Points (1) We start by reproducingExample5 of Section 2. This example,it will be recalled,was designed(along with Example4), to show the globalnature of the measure.ConsiderFigure 6. Imagine that weight is being transferredequally from the two mass points nearer the center towards the sides: thus, we have initially the configuration (0, ', ', 0) and finally, the more polarized distribution (2, 0,0, '). Our purpose is to investigateintermediatebehavior. Use Figure6 to examinethe formulato be givenbelow. Initially,polarization is given by p = 2[p l+aq + 2(pl+aq + ql+ap] d + 2p2+a + ql+ap)(1 d + r7)d + 2q2+a(2 + 71)d = 2p2+a7d + 2[ p1+aq + ql +ap + q2+a] (2 + q)d = 2p2+a7d + 2[ pl+aqqlq 1+a](2 + j) d, using the fact that p + q = 1/2. With dp/dq = -1, the derivative of this 29Of course,the precisethresholdvaluesdependon the valuesof x and y. But that is exactlyas it shouldbe. 844 J.-M. ESTEBAN AND D. RAY expressionwith respect to q is -2(2 + a)pl+a 7d+ 2[ pl+a-(1 +a) paq + 2(1 + a)qa] (2 + 7)d. We wish to examinethe sign of this as q rangesover [0, ]. First note that at q = 2 (and p = 0), the derivativeis unambiguouslypositive. This reflects the intuitivelyappealingfeaturethat when the populationis alreadylargelybunched at the two extreme points, furtherbunchingwill serve to accentuatepolarization. At q = 0 (and p = 1), the derivative has the value d[2 - (1 + a)-q]2-a. This could be of either sign, dependingon the values of -1 and a. For instance,if -q is "small,"then the initial polarizationis relativelysmall and all moves to the extremes raise polarization.However, if qj> 2/(1 + a), initial polarizationis relativelylarge. In this case, the above analysis shows that as population is moved away from the central masses to the extremes, polarizationfirst goes down and then goes up,30ultimatelyattaininga higherlevel than the initialone. Under the conditionsyieldingTheorem3, this requirementis veryweak, saying only that -q> 1. The reader may wish to compare this with our intuitive discussionof Example5. (2) We turn next to the question of Dalton transfersthat are equalizing,to checkwhetherpolarizationcan rise in this case. Numericalexamplesare easy to find. Consider,for instance,the case of Example1 in Section 2. To be concrete, think of incomes in this example as absolute values (and not logarithms),to retain an explicit notion of Lorenz domination. It is possible to verify by numerical methods that polarizationwill rise as long as a exceeds approximately 0.3. This conditionis more than amplysatisfiedunder Theorem3. Indeed, it is easy to provide a general formula that checks whether the "bunching"of incomes initiallyuniformlydistributedover a finite number of income classes, into a smallernumberof income points, increasespolarization. Apart from being of possible interest for its own sake, this exercise shows that our polarizationmeasureis readilyamenableto analyticalcalculation. Consider n (logarithmic)income points spaced uniformlywith pairwisedistance d, and suppose that the population is uniformlydistributedon these points. Defining P(n, d) to be the value of polarizationthus obtained,we see that P( d) (2+ a) E () ( + (n-k)(n- k+ 2 k=1\2 1)) where in deriving the above expression, we recall that the sum 1 + 2 + *.. +m m(m + 1)/2 for any positive integer m. Using, in addition, the formula 1 + 22 + * +m2 = m(m + 1)(2m + 1)/6 for any positive integer m, the above simplifiesto yield = (22) 30 P(n, d) -dn-(2+a)n(n2 - 1)/3. It can be checkedthat a higherdegree of nonlinearityis not possiblein this case. 845 POLARIZATION Consider positive integers M and K. Suppose, now, that we start with MK income points spaced at uniformdistance, normalizedto unity. We "collapse" this distributionby taking adjacent sets of K points each and concentrating these at the averageincome levels of these sets. We now have a new uniform distributionover M points spaced K units apart.Using (22), we may conclude that polarizationhas increasedif (KM) (2+a)(K2m2 - 1) KM/3 >JKMQ-(2+a)(M2 - 1) M/3, which on simplificationyields the condition (23) K 2+a(M2 - 1) >K 2M2 - 1. It is immediate that for (23) to be satisfied for any given K, a > 0. This is intuitive.Recall that for a = 0, the polarizationmeasureessentiallyreducesto a Gini measure of inequality(in the perceptualvariablelog-income),so that all bunchingof incomes will bring the measure down by the Dalton principleof transfers.Note, too, that if bunchingresultsin a singlegroup(M = 1), polarization cannot go up, and this too is reflectedby the condition(23). Next, observethat for any K and M # 1, condition(23) is most stringentwhen M = 2. It follows, therefore,that for givenK, if polarizationis to rise under all such circumstanceswhich involve the bunchingof K groups, a necessaryand sufficientconditionfor this is (24) 3K2+a > 4K2 _ 1. Note that if the polarizationmeasuresatisfiesAxiom4, then a > 1 and (24) is automaticallysatisfiedfor all values of K > 2. (3) We end this section of the paper by observing that our polarization measure combines a preference for numerical equality at different income points with a preferencefor creatingbunchedpopulations.Thus, providedthat groups can somehowbe defined, "intra-group"equalityenhances polarization. One way of examiningthis is to generalizethe conditionsof Axiom 2. Consider a particularincome class k and examinethe distributionson either side of this class. Supposethat the followingtwo conditionshold: (i) total populationto the left of this class exceeds total populationto the right of this class, and (ii) the income distributionon the space of incomes to the left of this class is Lorenzdominated by the income distributionto the right of this class. Informally, higherincomes are more scarcelypopulated and more bunchedtogether.Then a shift of population weight from income class k to class k + 1 must raise polarization as defined by (3). The proof of this observation is implied by Lemma 1 and the fact that a > 0, and is omitted. 4. CROSS-IDENTIFICATION:AN EXTENDED CLASS OF MEASURES It is appropriateat this stage to reconsidera serious assumptionthat drives the derivationof the polarizationmeasure(3) in the previoussection. It will be recalled that our model posited a particularlysharp form of the identification 846 J.-M. ESTEBAN AND D. RAY 3/4 1/2 1/4 1/4 FIGURE 7A FIGURE 7.-Diagrams 1/4 FIGURE 7B 1/2 1/4 1/4 FIGURE 7c to illustratethe discontinuityproblem. function;namely,that an individualidentifieswith another in exactlythe same income class and with no one else. This extreme postulate is echoed in the measurein the form of an unsatisfactorydiscontinuity.3' Figure 7 reconsidersExample 4 in more detail. Begin with two population masses, say 1/4 and 3/4 at two income points, as shown in Figure 7a. Now suppose that the first mass and a subset of the second mass (of measure 1/4) begin to move towardseach other, leavingthe remaininghalf of the population where it is. Figures 7b and 7c illustratethis process. Initially,we would expect polarizationto fall and finally,when the two populationmasses are fused (just as in Example 4) we would have polarizationrise. All this is satisfiedby our derivedmeasure.The point is that when the masses are alreadyclose (say, as in Figure 7c) but not exactlysuperimposed,we would also expect polarizationto be increasingin the describedmovement(supportingthis by an argumentthat the identificationbetween the two 1/4-masses is growing).But our measure does not incorporatecross-identificationbetween groups, and so continuesto fall, only to jump up "in the limit" as the two masses exactlyfuse. The problem is easily resolved by permitting identificationacross income groupsthat are "sufficiently"close.32 But this resolutionis not costless. Admitting cross-identificationopens up a set of new issues. In particular, it is impossible to specify (or deduce) a priori the domain over which a sense of identificationprevails. Nevertheless,such an extended class of measures(indexed by the choice of the identificationzone) can be defined for the purpose of empiricalwork, and can be shown to satisfy the axioms in the previous section, as well as the properties discussed in Section 3.4. The choice of this zone must be left to the empirical researcher, for it depends on the degree of grouping or averaging in each of the "income classes" of the available data. 31 We are grateful to a referee for making this point, and for suggesting a discussion of it in the paper. 32 Indeed, an earlier version of our paper (Esteban and Ray (1991)) does just this. 847 POLARIZATION To illustrate an extended class, let D > 0 be such that if an income y' is within D of an income y, then there is some identificationbetween the two incomes. Formally,let w(d) be a positiveweightingscheme on [0, D] such that w( ) is a decreasingfunction with w(D) = 0. For a given distribution(r, y), define the identificationfelt by any memberof income class i as Ii--~~~~17 j( W yj) . j: IyI-yjLD The new measure induced by the weightingfunction w is then some scalar multipleof n (25) PW(7IY n E rjIia i=1 j=1 max{ yi - y,I- D, 0) A complete characterizationof such measures is beyond the scope of this paper. But note that these measures do resolve the discontinuitydescribed above. 5. OTHER EXTENSIONS AND CONCLUDING REMARKS 5.1. Income and Population Normalization As stated earlier,we use the logarithmof incomes as the perceptualvariable in our model. While this may appear nonstandardin the theory of inequality measurement,where absolute differences in income are noted (and subsequently normalizedby mean income), we find our approachsuitable for two reasons. The first is conceptual. We find it natural that individualsreact to percentagedifferencesin income ratherthan absolute differences,and we wish to keep these bilateral differencessymmetric.33The second reason is one of convenience.A numberof our axioms and examplesinvolvethe movementof population from one income class to another, which then alter aggregate income. With absolute income, the measurewould have to be renormalizedat each step before the axiomsdictatehow we want the measureto behave.This is unnecessaryin the present formulation.A corollaryis that the measurecomes prenormalizedin incomes. The same is not true of population.The naturalnormalizationto choose, of course, is to replace the populationweights rri,i = 1,.. ., n, by the population frequencies. This is tantamount to choosing K = [ELn ,i](2+a) in (3). But it is importantto be clear as to what such a normalizationimplies in the context of our model. In our model, polarizationamong small populationsis smaller than among larger groups,ceteris paribus.But a careful readingof the model will make it 33Thus, for example, the percentage difference in incomes, normalized by the income of the individual concerned, is not symmetric. While more complicated formulations are symmetric (such as the absolute difference of income divided by the sum of the two incomes), we felt that the logarithm would do just as well. 848 J.-M. ESTEBAN AND D. RAY clear that this refers to the relative importanceof the small population in a contextwhere it is embeddedwithin a largerpopulation.Thus polarizationin a small regionof a large countrymaymean little relativeto polarizationoccurring in that same countryat a national level. But if this small region is an entirely separatecountry,then the comparisonof the two countriesrequiresa scalingof the population. Population homogeneity of degree zero may then be legitimatelyinvokedwithoutviolatingthe basic tenets of the model. 5.2. Asymmetric Polarization Measures Earlier,we noted that a perfectlysymmetricmodel of polarizationmay not be entirely appropriate,especially in a context where the variable of interest is incomeor wealth.While the poor feel alienationfromthe rich,it maybe argued that the correspondingsense of distance felt by the rich from the poor should not enter in a symmetricway into the polarizationmeasure.We brieflydiscuss an extremeversion of this observation:the alienationfunctionfor individualy registerspositivevalues only for income values y' that are greaterthan that of individualy. Fortunately,we can exploit the formal structureabove with no change of notation if we don't mind changingthe concept a bit. Interpret 3(y, y') now as max{y'-y,O} (instead of Iy'-yI). The general class of measures studied in Section 3 may now be also put forwardfor this case. As before, a set of axioms may be imposed to narrow down this class of measures. This set will be somewhatdifferentfrom the axiomsof that section, insofaras the latter has an implicitbias towards symmetry(for example,Axiom 2). This class of polarization measures will certainly not yield symmetricdistributionsas its maximal elements. This last observationcan be verifiedeven without imposingany axiom at all. Considerthe two point distributionin Section 3.3.1, with the mass points p and (1 -p) situated at a distance d. Any asymmetricmeasure in the general class yields polarizationequal to P =p(1 -p)6(p, d). Suppose we were to maximize P with respect to a choice of p E [0,1]. We observethat if 0 is increasingand differentiablein p, with ao/ap> 0, then the maximum of the polarization measure must be attained at values of p that lie strictlybetween2 and 1.34 34Here is a quickproof. Note that any value of p that maximizesP mustbe strictlybetween 0 and 1. So the first-ordernecessarycondition: 0( p*, d)(1l- 2p* ) + p*l (-P *) dO(p*, d) = d ? op must hold at any values of p* that maximizeP. But now observe that because dO/Op> 0, this conditioncannot hold for any P* < 2 POLARIZATION 849 This shows that asymmetricpolarization models will not yield symmetric distributionsas maxilhalelements. 5.3. WeightedIdentification and Alienation Functions We commenton two possible extensions.First, the sense of identificationof an individualmay depend not only on the number of other individualswith similar attributes,but also on the attributesthemselves. In other words, the attributesmay confer "power"on the group to effectivelymanifestits sense of identification.In the case where the space of attributesis income,this consideration may be of particular relevance. An equal number of people in two differentincome categoriesmay possess an unequal degree of "effectiveidentification."The resourcesof the richergroupmay contributeto the abilityof that groupto form lobbies,organizeprotests,push particularpolicies, and in general to manifest itself as a unified entity. This extension is easily incorporatedinto the model that we have set up here. The identificationfunctionof an individual with income yi must now be extended to include not only the number rriwith that income but also yi itself: I = I(ri, yi). A typical member of the extended familyof polarizationmeasureswould then be n P*(7r,y) =KE n E I+a jYlYi-Yjl, i=1 j=1 which reducesto the currentfamilyin case /8 = 0. We have not axiomatizedthis extended family in the paper, though we conjecturethat this would not pose serious difficulties. Second, the alienationfelt by an individualy towardsanotherperson y' may also depend upon the numberof individualswith the latter income. In a certain sense, the notion of identificationalreadyincorporatesthis feature:the effective antagonismof the latter group towards y will have been weightedby its sense of identification.But it is certainlypossible to create additionalweighting.The point is that we have little to say, a priori, regardingthe effect of such a weighting on effective antagonism.Does it increase or decrease the effective voicingof protest?In our mind,the answeris unclear,thoughsuch an extension merits inquiry. 5.4. Methodological Remark Perhaps the most important drawbackof our work is that it combines a behavioralstoryof individualattitudeswith axiomaticrestrictionsplaced not on the story itself but separatelyon distributionsof characteristics.In this sense, there is perhaps a "deeper"axiomatizationof the concept of polarization,one that is parallelto the theoryof inequalitymeasurement.Our effortsso far have not been successful in uncoveringthis more fundamentaltheory. While we believe that some progress has been achieved by motivating, defining, and 850 J.-M. ESTEBAN AND D. RAY characterizinga new concept and its measure,it should certainlybe possible to do better. Instituto de Ana'lisis Economico, Universidad Autonoma de Barcelona, 08193 Bellaterra, Barcelona, Spain and Department of Economics, Boston University, 270 Bay State Rd., Boston, MA 02215, U.S.A. Manuscript received March, 1991; final revision received October, 1993. REFERENCES J. (1966): Lectures on Functional Equations and their Applications. New York and London: Academic Press. ADELMAN, I., AND C. T. MORRIS (1973): Economic Growth and Social Equity in Developing Countries. Stanford: Stanford University Press. AHLUWALIA, M. (1976): "Inequality, Poverty and Development," Journal of Development Economics, 3, 307-432. ALESINA, A., AND D. RODRIK (1991): "Distributive Politics and Economic Growth," NBER Working Paper No. 3668. ALESINA, A., S. OZLER, N. ROUBINI, AND P. SWAGEL (1992): "Political Instability and Economic Growth," mimeo. AMIEL, Y., AND F. A. COWELL (1992): "Measurement of Income Inequality: Experimental Test by Questionnaire," Joumal of Public Economics, 47, 3-26. ATKINSON, A. B. (1970): "On the Measurement of Inequality," Journal of Economic Theory, 2, 244-263. BARRO, R. J. (1991): "Economic Growth in a Cross-Section of Countries," Quarterly Joumal of Economics, 106, 407-443. BARRO, R. J., AND X. SALA-I-MARTIN (1992): "Convergence," Joumal of Political Economy, 100, 223-251. BENHABIB, J., AND A. RUSTICHINI (1991): "Social Conflict, Growth and Income Distribution," mimeo. BLACKORBY, C., AND D. DONALDSON (1978): "Measures of Relative Equality and their Meaning in Terms of Social Welfare," Journal of Economic Theory, 18, 59-80. CHAYANOV, A. V. (1923): The Theory of Peasant Economy (1966 edition edited by D. Thorner, B. Kerblay, and R. Smith). Homewood, IL: R.D. Irwin. COSER, L. A. (1956): The Functions of Social Conflict. Glencoe IL: The Free Press. R. (1959): Class and Class Conflict in Industrial Society. Stanford, CA: Stanford DAHRENDORF, University Press. DASGUPTA, P., A. SEN, AND D. STARRETr (1973): "Notes on the Measurement of Inequality," Journal of Economic Theory, 6, 180-187. DEUTSCH, M. (1971): "Conflict and its Resolution," in Conflict Resolution: Contributions of the Behavioral Sciences, ed. by C. G. Smith. Notre Dame: University of Notre Dame Press. DUNCAN, G., T. M. SMEEDING, AND W. RODGERS (1991): "W(h)ither the Middle Class? A Dynamic View," Levy Institute Conference on Income Inequality, Bard College. DUTrA, B., AND J. ESTEBAN (1992): "Social Welfare and Equality," Social Choice and Welfare, 9, 267-276. EBERT, U. (1987): "Size and Distribution of Incomes as Determinants of Social Welfare," Joumal of Economic Theory, 41, 23-33. ESTEBAN, J., AND D. RAY (1991): "On the Measurement of Polarization," Institute for Economic Development Discussion Paper 18. FIELDS, G. S. (1980): Poverty, Inequality and Development. Cambridge: Cambridge University Press. FIELDS, G. S., AND J. C. H. FEI (1978): "On Inequality Comparisons," Econometrica, 46, 303-316. FOSTER, J. (1985): "Inequality Measurement," in Fair Allocation, ed. by H. Peyton Young. Providence, R.I.: American Mathematical Society. ACZEL, POLARIZATION 851 GURR, T. R. (1970): Why Men Rebel. Princeton: Princeton University Press. (ED.) (1980): Handbook of Political Conflict: Theory and Research. New York: The Free Press. J. C. (1953): "Cardinality Utility in Welfare Economics and in the Theory of Risk-Taking," Journal of Political Economy, 61, 434-435. HORRIGAN, M. W., AND S. E. HAUGEN (1988): "The Declining Middle Class Thesis: A Sensitivity Analysis," Monthly Labour Review, 111, 3-13. KERNER COMMISSION (UNITED STATES) (1968): Report of the National Advisory Commission on Civil Disorders. Washington, D.C.: Gov't. Printing Office. KOLM, S-CH. (1969): "The Optimal Production of Social Justice," in Public Economics, ed. by J. Margolis and H. Guitton. London: Macmillan; New York: St. Martin's Press. KOSTERS, M. H., AND M. N. Ross (1988): "A Shrinking Middle Class?" The Public Interest, 90, 3-27. KRAVIS,I. (1960): "International Differences in the Distribution of Income," Review of Economics and Statistics, 42, 408-416. KUZNETS, S. (1955): "Economic Growth and Income Inequality," American Economic Review, 45, 1-28. LANDSBERGER, M., AND I. MEILIJSON (1987): "A Tale of Two Tails: Playing Bold and Some New Definitions of Risk," mimeo, Haifa University. LENIN, V. I. (1899): The Development of Capitalism in Russia, 1956 edition. Moscow: Foreign Languages Publishing House. MANKIw, N. G., D. ROMER, AND D. N. WEIL (1992): "A Contribution to the Empirics of Economic Growth," QuarterlyJournal of Economics, 107, 407-437. MIDLARSKI, M. I. (1988): "Rulers and the Ruled: Patterned Inequality and the Onset of Mass Political Violence," American Political Science Review, 82, 491-509. MULLER, E. N., M. A. SELIGSON, AND H. Fu (1989): "Land Inequality and Political Violence," American Political Science Review, 83, 577-586. NAGEL, J. (1974): "Inequality and Discontent: a Non-linear Hypothesis," World Politics, 26, 453-472. PERSSON, T., AND G. TABELLINI (1991): "Is Inequality Harmful for Growth?: Theory and Evidence," mimeo, Institute for International Economic Studies, University of Sweden. QUAH, D. (1992): "Empirical Cross-section Dynamics in Economic Growth," mimeo, London School of Economics. SEN, A. K. (1973): On Economic Inequality. Oxford: Clarendon Press. SHANIN, T. (1966): "The Peasantry as a Political Factor," Sociological Review, 14, 5-27. SHESHINSKI, E. (1972): "Relation between a Social Welfare Function and the Gini Index of Income Inequality," Journal of Economic Theory, 4, 98-100. SIMMEL, G. (1955): Conflict. Glencoe, IL: The Free Press. TILLY,C. (1978): From Mobilization to Revolution. New York: Random House. WEISSKOFF, R. (1970): "Income Distribution and Economic Growth in Puerto Rico, Argentina and Mexico," Review of Income and Wealth, 16, 303-332. HARSANYI,
© Copyright 2026 Paperzz