The Source of Some Paradoxes from Social Choice

The Source of Some Paradoxes
from Social Choice and
Probability
Saari, D.G.
IIASA Collaborative Paper
August 1986
Saari, D.G. (1986) The Source of Some Paradoxes from Social Choice and Probability. IIASA Collaborative Paper.
IIASA, Laxenburg, Austria, CP-86-025 Copyright © August 1986 by the author(s). http://pure.iiasa.ac.at/2860/
All rights reserved. Permission to make digital or hard copies of all or part of this work for personal or
classroom use is granted without fee provided that copies are not made or distributed for profit or commercial
advantage. All copies must bear this notice and the full citation on the first page. For other purposes, to republish,
to post on servers or to redistribute to lists, permission must be sought by contacting [email protected]
NOT FOR QUOTATION
WITHOUT THE PERMISSION
OF THE AUTHOR
THESOURCEOFSOMEPARAD0XE;SFROM
SOCIAL CHOICE AND PROBABILITY
Donald G. S a a r i
August 1986
CP-86-25
C o l l a b o r a t i v e P a p e r s r e p o r t work which h a s not been p e r f o r m e d solely
at t h e International Institute f o r Applied Systems Analysis and which
h a s r e c e i v e d only limited review. Views o r opinions e x p r e s s e d h e r e i n
d o not necessarily r e p r e s e n t t h o s e of t h e Institute, i t s National Member
Organizations, o r o t h e r organizations supporting t h e work.
INTERNATIONAL INSTITUTE FOR APPLIET SYSTEMS ANALYSIS
2361 Laxenburg, Austria
FOREWORD
This p a p e r o f f e r s a powerful, simple method f o r understanding many "paradoxes" in social choice and probability t h e o r y . The a p p r o a c h i s a geometrical one;
t h e underlying principle e m e r g e s from a wide v a r i e t y of examples ranging from
elections a n d agenda manipulation to gambling a n d conditional probabilities.
Alexander B. Kurzhanski
Chairman
System a n d Decision S c i e n c e s P r o g r a m
THE; SOURCE OF S O m P A W O E S FROM
SOCIAL CHOICE AND PROBABTLITT
DonaLd G. Saari
Department of Mathematics, Northwestern University, Evanston, Illinois
The social choice l i t e r a t u r e and t h e probability l i t e r a t u r e are filled with
descriptions of "paradoxes". A s w e show h e r e , many of them c a n b e explained and
extended by using t h e same, simple, geometric argument. Our extensions include
s e v e r a l new r e s u l t s about t h e intransitivities of election results o v e r subsets of alternatives, t h e cycles of agenda manipulation, gambling, and Simpson's paradox
from conditional probability. Furthermore, we prove t h a t t h e s e paradoxes must
accompany t h e modeling in a robust fashion. Our a p p r o a c h a p p e a r s t o be new, i t i s
elementary (based on t h e open mapping principal), and i t uncovers new examples.
Indeed, one point t h a t emerges i s t h e e a s e with which paradoxes (i.e., a p p a r e n t
contradictions in a relationship) can a r i s e .
Our argument extends beyond social choice and probability, but we emphasize
these two a r e a s because of t h e i r familiarity and t h e i r importance as standard
modeling tools f o r economics and decision analysis. Examples from probability a r e
discussed in Sections 2 and 3; examples from social choice a r e discussed in Section
3. To simplify t h e exposition, we use d i s c r e t e random variables. However, all this
work easily generalizes t o more general models.
The flavor of o u r r e s u l t s is indicated by the following two prototype examples.
In t h e following sections, w e show how they are r e l a t e d and how they c a n b e extended.
1.1 Conditional Probability and Simpson's Paradox
Suppose a c e r t a i n d r u g is t e s t e d in Chicago ( C ) a d in Los Angeles (C'). A test
g r o u p (T) r e c e i v e s t h e new d r u g , and a c o n t r o l g r o u p (T') r e c e i v e d t h e s t a n d a r d
t r e a t m e n t . Some people are r e t u r n e d t o h e a l t h (H), while o t h e r s are n o t (H').
Suppose t h a t in both communities t h e new d r u g i s judged t o b e successful b e c a u s e
i t c u r e s t h e sick with a h i g h e r r a t i o t h a n t h e s t a n d a r d t r e a t m e n t
Is i t possible f o r t h e a g g r e g a t e d t e s t r e s u l t s to h a v e t h e r e v e r s e conclusion
P(H:T)
< P(H:T')?
I t is, a n d t h i s i s known as Simpson's p a r a d o x . An explanation
(which d i f f e r s from t h a t given h e r e ) a n d how i t r e l a t e s to t h e "sure thing" principle i s given by Blyth (1972a). Examples with r e a l d a t a are given b y Wagner (1982).
This inconsistency phenomenon t u r n s o u t to b e a c h a r a c t e r i s t i c of models
based on conditional probability o r t h e combination of random v a r i a b l e s . When additional conditions are introduced, almost any imaginable extension c a n o c c u r . F o r
instance, s u p p o s e t h a t t h e tests are conducted using facilities provided b y univers i t i e s (U)a n d p r i v a t e l a b o r a t o r i e s (U'). T h e r e e x i s t examples whereby t h e new
d r u g i s unsuccessful a t e a c h university and at e a c h l a b o r a t o r y a s w e l l as in e a c h
community, b u t it is successful in t h e a g g r e g a t e , and t h e r e exist examples where
t h e conclusions oscillate with t h e level: t h e new d r u g i s successful a t e a c h of t h e
facilities, it i s unsuccessful in e a c h community, and i t i s successful in t h e a g g r e gate, etc.
The a p p r o p r i a t e r a t i o of s u c c e s s t o f a i l u r e in e a c h of t h e s e examples can b e
made t o e x c e e d a n y predetermined constant. This means t h a t t h e r e e x i s t examples
of d a t a w h e r e at e a c h facility t h e probability of regaining h e a l t h by u s e of t h e
d r u g i s at l e a s t twice t h a t obtained by t h e s t a n d a r d t r e a t m e n t ; in e a c h community,
t h e s t a n d a r d t r e a t m e n t i s at l e a s t t h r e e times b e t t e r t h a n t h e d r u g ; a n d a t t h e t o t a l
a g g r e g a t e d level, t h e d r u g is at least f o u r times b e t t e r t h a n t h e s t a n d a r d t r e a t ment!
1.2 Voting and Ranking Methods
The a g g r e g a t i o n of p r e f e r e n c e i s a c e n t r a l i s s u e in t h e social s c i e n c e s . A simple system i s voting. H e r e , s e v e r a l p a r a d o x e s o c c u r when t h e v o t e r s r a n k a s e t of
t h r e e a l t e r n a t i v e s la , b ,c 1 by using t h e stanC t r d plurality voting system. Suppose
t h e r e are nine v o t e r s where f o u r of them giv t h e ranking
c
> a > b , three
give
t h e ranking b
c
>b >a
> a > c , and two give t h e
ranking a
> b > c.
The g r o u p ' s r a n k i n g i s
with a tally of 4 : 3 : 2. This ranking is inconsistent with t h e f a c t t h a t a
majority of t h e v o t e r s (five of them) p r e f e r t h e bottom r a n k e d a l t e r n a t i v e a t o t h e
top ranked alternatives c .
I t might b e s u s p e c t e d t h a t a n election ranking of N a l t e r n a t i v e s must have
some relationship t o how t h e group r a n k s at l e a s t one of t h e p a i r s of a l t e r n a t i v e s .
This need not b e t h e case. F o r e a c h of t h e N(N-1)/2
p a i r s of a l t e r n a t i v e s desig-
n a t e , in a n a r b i t r a r y fashion, one of t h e a l t e r n a t i v e s . W e show t h a t t h e r e e x i s t
examples of v o t e r s ' rankings of a l t e r n a t i v e s s o t h a t (1) t h e plurality election
result is al
> a 2 > ... > a ~ and
, (2) f o r e a c h p a i r of t h e a l t e r n a t i v e s , a majority of
t h e s a m e v o t e r s p r e f e r t h e designated a l t e r n a t i v e .
2. THE GENERAL RESULT
The simple geometric p r o p e r t y w h e r e open s e t s are mapped t o o p e n sets i s t h e
unifying explanation f o r all t h p a r a d o x e s d e s c r i b e d in t h i s p a p e r . The following
s t a n d a r d statement (see Warner, 1970) suffices f o r what follows.
Proposition 1
Let F b e a smooth mapping from a n m-dimensional manifold M t o a n n dimensional manifold N where m
> n.
Let c b e a n i n t e r i o r point of N. Assume t h a t
p in F - l ( c ) i s a n i n t e r i o r point of M . If t h e Jacobian of F at p h a s maximal r a n k ,
t h e n t h e r e i s a n o p e n neighborhood of p t h a t i s mapped onto a n o p e n neighborhood
of c .
The proof of t h e following theorem i l l u s t r a t e s why t h e a b o v e p r o p e r t y i s t h e
s o u r c e of t h e paradoxes.
Theorem 1
Consider t h e example in Section 1.1where a d r u g i s compared with a s t a n d a r d
treatment.
C
Let
A
be
+ C',C,C1,CU,CU',C'U,
the
and C'U'.
variable
representing
the
seven
sets
F o r e a c h c h o i c e of A , designate which t e r m
from t h e p a i r (P(H:TA),P(H:TtA)) i s t o h a v e t h e l a r g e r value. Choose a c o n s t a n t
clA g r e a t e r t h a n unity and e x p r e s s e a c h p a i r as a r a t i o t h a t i s bounded below by
clL. T h e r e e x i s t finite examples of d a t a t h a t simultaneously satisfy all t h e s e v e n
specified inequalities.
To p r o v e t h e theorem, i t suffices t o show t h a t , f o r any c h o i c e of signs f o r t h e
seven quantities P(H:TA) - P ( H : T r A ) ,t h e r e e x i s t sample points t h a t r e a l i z e them.
View t h e s e quantities as defining t h e seven components of a mapping F into R7. The
choice of t h e signs identifies a n o r t h a n t B of R7.
The origin 0 of R 7 i s a boundary point f o r e a c h of t h e o r t h a n t s . This "comparison point" i s used in t h e following way. F i r s t , a point p in F - ~ ( o )i s found s o
t h a t (i) i t i s a n i n t e r i o r point of t h e domain, a n d (ii) t h e Jacobian of F at p h a s
maximal r a n k . According t o Proposition 1,F maps a n open neighborhood of p o n t o
an open neighborhood of 0 . This o p e n image set meets e a c h of t h e o r t h a n t s ; in
p a r t i c u l a r i t meets o r t h a n t B. T h e r e f o r e , t h e r e are sample points t h a t satisfy all
seven conditions simultaneously. The technical p a r t of t h e proof i s t o define t h e
domain s o t h a t F c a n b e r e p r e s e n t e d by a smooth mapping.
Proof
T h e r e are eight sets determined by t h e intersections of t h e sets T , C , U a n d
t h e i r compliments. They are:
S 1 = TCU S 2 = TCU' 3.3 = TC'U S 4 = TC'U'
S 5 = T'CU S 6 = T'CU' S7 = T'C'U S 8 = T'C'U'
T r e a t e a c h of t h e s e sets as a disjoint s p a c e . Let Xj designate t h e c h a r a c t e r i s t i c
function
of
H
in
S .
5 = X Z j + X 2 j - 1 ,j = 1,...,4, and
v a r i a b l e 5 , which i s t h e c h a r a c t e r i s t i c
Define
-
Zj = Y21 + Y 2 j - l , j = 1 3 . The random
function of H o v e r S z j + S 2 j - 1 , r e p r e s e n t s t h e r e s u l t s at t h e community level,
while Zj
= Y Z j + Y Z j r e p r e s e n t s t h e final a g g r e g a t e d r e s u l t s .
If z j denotes t h e value of P(Xj
I,j = 1,..., 8 . Let
= I), t h e n
z j i s in t h e unit i n t e r v a l
9 designate P ( S j ) in t h e s p a c e usk.The 9 v a r i a b l e s d e s c r i b e
a simplex in R 8 which i s denoted by S i ( 8 ) and defined by
These 1 6 v a r i a b l e s are in t h e 15-dimensional s p a c e M = l8X S i ( 8 ) .
By use of t h e s t a n d a r d relationship, f o r any set E ,
it follr u s t h a t t h e probabilities yl = P ( 5 = 1 ) a n d z j = P ( Z j = 1 ) a r e t h e r a t i o n a l
functions
and
C o m p a r i s o n m a p : Let F:M
+
R~ b e
where e j i s t h e unit v e c t o r in R7 with unity in i t s j t h component. The components
of F r e p r e s e n t P(H:TA )
- P ( H :TA ') as A
r a n g e s t h r o u g h i t s seven values.
O p e n m a p p i n g : Clearly, F i s a smooth mapping. That t h e Jacobian of F h a s maxi-
mal r a n k at some preimage of 0 i s a d i r e c t computation. Indeed, t h i s r a n k condition holds e v e r y w h e r e e x c e p t on a c e r t a i n lower-dimensional s u b s e t of M . These
points of lower r a n k are where e i t h e r t h e y values o r t h e z values are uniquely
determined b e c a u s e t h e corresponding p a i r s of x o r y are equal.
The signs chosen f o r t h e seven quantities determine a n o r t h a n t of R7,denoted
by B. By construction, all t h e sample points with t h i s behavior are in U = F-'(B).
By t h e continuity of F,U i s a n o p e n s e t ; w e must show t h a t i t i s nonempty. The Jacobian of F h a s maximal r a n k at some i n t e r i o r point of M in F-'(o), s o F maps a n open
s e t from M o n t o a n open set of 0. This open set meets B . Consequently F-'(B) i s
nonempty .
?\Text w e show t h a t U contains points t h a t c a n b e identified with a finite d a t a
s e t . Any r a t i o n a l point will suffice. A multiple of t h e common denominator of d is
t h e t o t a l number of subjects. The same multiple of t h e numerator of d c o r r e s p o n d s
t o t h e cardinality of S f , and i t s e r v e s as a multiple of t h e denominator of z j . Bec a u s e t h e r a t i o n a l points are dense, t h e r e i s a n infinite set of r a t i o n a l points in
F - ~ ( B ) . E a c h point c a n b e identified with a n infinite number of d i f f e r e n t finite
d a t a sets.
The s e t F-'(o): It remains t h a t t h e inequalities can b e bounded below by t h e designated constants. Once t h e values of
type z j
> dAXj +4
4
a r e specified, t h e inequalities are of t h e
with a similar relationship y and z . Let q = ( q l ,. . . , q , ) b e a point
in B. The set F-'@ ) i s given by equations of t h e form z j
- z j +4 = lqjl with
similar
equations f o r y and z . These equations define lower-dimensional hyperplanes in
t h e domain, s o t h e r e are points in U satisfying inequalities of t h e form
The assertion follows if t h e r e i s a point in U such t h a t t h e right-hand sides of
t h e s e inequalities are bounded below by d A , with similar statements f o r y and z .
These inequalities are satisfied if t h e values of t h e denominators on t h e right-hand
sides c a n be chosen t o be a r b i t r a r i l y small. This involves a d i r e c t computation
t h a t i s easily done because F-'(o) contains t h e intersection of 0
X
Si ( 8 ) and t h e
boundary of U. The points are chosen a r b i t r a r i l y close t o this set.
Comments:
( 1 ) The basic idea of t h i s proof extends t o all t h e paradoxes discussed h e r e .
Individual comparisons are one dimensional. When s e v e r a l comparisons
are made, they m u s t b e viewed as defining a comparison mapping F with a
higher-dimensional r a n g e space. A higher-dimensional s p a c e admits symmetries and cycles, s o it should be expected t h a t t h e s e cycles are manifested as paradoxes by t h e comparisons. To prove t h a t all t h e symmetries are admitted, locate a "comparison point" on t h e boundary of
e a c h of t h e comparison regions. Next, show t h a t t h e image of F includes
an open set about t h e comparison point. The intersection of t h i s open set
with e a c h comparison region is a nonempty open set. Because F i s continuous, t h i s means t h a t t h e r e i s a nonempty set of points in t h e domain
with t h e desired p r o p e r t i e s . To complete t h e proof, impose conditions s o
t h a t in each of t h e s e sets in t h e domain t h e r e exist points t h a t are identified with sample points from t h e model. This simple idea i s t h e essence of
o u r explanation f o r all t h e paradoxes in t h i s p a p e r .
(2) The number of possible, paradoxial relationships i s determined by t h e dimension of t h e domain f o r a comparison mapping.
If t h i s dimension
exceeds t h a t of t h e range, then t h e comparison mapping i s not "complete"; t h e r e exist additional relationships t h a t may define more complex
paradoxes.
A s a c o r o l l a r y , t h e a b o v e illustration and extension of
Simpson's p a r a d o x i s not t h e "best possible" r e s u l t . The domain of F i s
15 dimensional while t h e r a n g e i s only seven dimensional; e i g h t more com-
p a r i s o n s using t h e s e v a r i a b l e s c a n b e added. (They may involve d i f f e r e n t
levels of aggregation, waiting times, e t c . )
(3) O t h e r conclusions a r e derived from t h e p r o p e r t i e s of
(i)
F-l.
I t i s n a t u r a l t o determine t h e limits of a paradox. (This i s illustrated in Theorem 1 with t h e a s s e r t i o n t h a t dA i s not bounded above.)
Often, as f o r t h i s model, t h e s e limits are determined b y t h e p r o p e r t i e s of t h e points n e a r t h e intersection of F-'(o)
a n d t h e boundary
of t h e domain.
(ii) In o r d e r f o r a p a r a d o x (described by B) t o o c c u r , we may need a
c e r t a i n number of d a t a points. The minimal size i s given by t h e smal-
lest
"lowest
common denominator"of
t h e admissible points in
F - ~ ( B ) = U.
(iii) The probability t h a t a p a r a d o x (described by B) o c c u r s is given by
t h e measure of a probability distribution o v e r t h e o p e n set F-~(B).
(4) O t h e r conclusions are d e r i v e d from t h e s t r u c t u r e of t h e image of F. F o r
instance, t h e image contains a n open set a b o u t t h e origin, so it meets any
sector defined by a specified r a t i o of t h e outcomes; e.g.,
The a b o v e shows t h a t t h e r e are sample points t h a t satisfy t h e s e conditions.
(5) To avoid t h e a b o v e behavior, t h e Jacobian of F cannot b e of maximal
r a n k . This singularity c o n s t r a i n t becomes a necessary condition t o avoid
a p a r a d o x . Often, as f o r t h e a b o v e model, t h e s e lower-dimensional singul a r i t y conditions c o r r e s p o n d t o familiar c o n s t r a i n t s such as t h e "independence of random variables".
This a p p r o a c h c a n b e used as long as t h e components of a comparison mapping
are smooth functions. These components could b e functional combinations of p r o babilities, e x p e c t e d values, t h e v a r i o u s moments, waiting times, loss functions, decision r u l e s , c o r r e l a t i o n indices, s c a t t e r i n g indices, covariance, etc. If t h e open
mapping condition holds a t a comparison point, t h e n all possible comparisons c re
realized. In t h i s way i t is e a s y t o show t h a t t h e r e e x i s t examples illustrating, ' o r
instance, t h a t t h e expected value may satisfy E (X)
> E (Y),
yet E (J' (X))
< E (J' (Y))
f o r some monotonically increasing function 'j , and t h a t c e r t a i n decision r u l e s may
b e inconsistent with o t h e r measures. (Indeed, a d i s c r e t e version of this can b e
used t o explain t h e Arrow social choice paradox.) Theorem 2 i s t h e formal statement t h a t c o v e r s all t h e s e situations.
Before stating Theorem 2, w e formally define t h e s t r u c t u r a l relationship
between a comparison point and a comparison region.
Definition
Let a topological s p a c e N b e partitioned. A comparison point is a boundary
point f o r each partition set. For a given comparison point p , a comparison reg i o n is a partition set such t h a t t h e closure of its i n t e r i o r contains p .
Definition
Let F : M -,N b e a comparison mapping f o r a given model. A point in M is a n
admissible point if i t can b e identified with a sample f o r t h e model.
Theorem 2
Let F:M -,N b e a smooth comparison mapping where t h e dimension of M i s
bounded below by t h e dimension of N. Assume t h a t t h e admissible points form a
dense set in M . Let c b e a comparison point in N. If p in F-'(c) i s a n i n t e r i o r
point of M such t h a t t h e Jacobian of F at p h a s maximal r a n k , then t h e behavior
characterized by any comparison region of N is admitted.
Example
Consider t h e following dice game. Each of t h e players rolls his own weighted
die. (Each die i s marked in t h e standard fashion.) On each roll, t h e winner i s t h e
player t h a t rolled t h e l a r g e r f a c e value. For each choice of k = 1,...,4, t h e losing
player pays t h e winning player t h e difference between t h e f a c e values raised t o
t h e k t h power. For each of t h e f o u r choices of k , a r b i t r a r i l y s e l e c t a die t o have
t h e l a r g e r expected payoff, and then a r b i t r a r i l y select a die t o have t h e higher
probability of winning a roll. It i s a d i r e c t consequence of Theorem 2 t h a t t h e dice
can b e weighted in such a fashion t h a t all five selected conditions are satisfied
simultaneously. This illustrates t h e possible incompatibili .y among reward functions and t h e distributions.
A s a s p e c i a l case (K1 = I ) , t h e following demonstrates t h a t t h e more p r o b a b l e
of two e v e n t s may h a v e t h e longer waiting time.
Corollary 2.1
Let e a c h of t h e two u r n s U1 and U 2 contain r e d and black balls. Each u r n i s
randomly sampled without replacement. F o r a positive i n t e g e r k , l e t w j (k ) b e t h e
probability t h a t i t t a k e s at l e a s t k t r i e s b e f o r e a r e d ball is s e l e c t e d from u r n U j ,
j
= 1 , 2 . Let k l
# k 2 . F o r e a c h of t h e two p a i r s ( w l ( k S ) , w 2 ( k S ) ) ,=
s 1 3 , choose
t h e value t h a t is t o b e t h e l a r g e r . T h e r e e x i s t examples of d a t a s o t h a t both conditions are satisfjed simultaneously.
Outline of the proof
W e proof t h e s p e c i a l case where K1 = 1 and K2 = 2 . The g e n e r a l case follows
in much t h e same manner.
The domain of t h e comparison mapping F i s I x I x R + where R + i s t h e halfline of positive numbers. A point in t h e domain i s denoted by ( z ,y , z ) . Let
b e a mapping into R'.
t h e mapping
(W
A t t h e r a t i o n a l points in t h e domain, F c a n b e identified with
-
i ( l ) w ' ( 1 ) , w1 ( 2 ) - w ' ( 2 ) ) . This identification follows by assuming
t h a t t h e r e are zzn r e d a n d zn ( 1
- z) black
balls in U 1 and y n r e d and n (1--y )
black balls in U z , and by choosing a n a p p r o p r i a t e value f o r t h e p a r a m e t e r n .
The comparison point i s 0 = (0,O). Any domain point p of t h e form ( z , z , l )i s
in F - l ( 0 ) . The g r a d i e n t of t h e f i r s t component of F i s ( 1 ,
- 1,O); t h e g r a d i e n t of
t h e second component evaluated at p i s
[l + (l-Z=)zn, zn -1
where z
+ ( 1-22
)m;,
z n -1
(2 -1)z
( z n -1)'
I
= 1 . Clearly, t h e s e two v e c t o r s are linearly independent. This completes
t h e proof.
The conclusion of t h i s c o r o l l a r y holds even if o n e of t h e s e p a i r s is r e p l a c e d
with t h e p a i r of e x p e c t e d waiting times. However, t h e conclusion does n o t hold if
t h e sampling i s with replacement, o r if t h c number of balls i s t h e same f o r e a c h
urn. F o r e a c h of t h e s e models, t h e z terr.1 does n o t a p p e a r in t h e definition of F.
A s a r e s u l t , t h e t h i r d component of t h e I -adient is z e r o , a n d t h e second is t h e
negative of t h e f i r s t . Therefore, t h e Jacobian of t h e comparison mapping i s singul a r . This illustrates comment (5).
A more interesting paradox i s obtained by combining t h e model in Theorem 1
with t h e one given above. Here, s e v e r a l p a i r s of u r n s with r e d and black balls are
used. A s t h e contents of t h e u r n s a r e combined in a specified way, t h e urn with t h e
higher probability of selecting a r e d ball may change with t h e level of aggregation.
Furthermore, t h e waiting time t o select a r e d ball may vary. However, as demons t r a t e d above, t o obtain t h e s e examples, often w e need t h e e x t r a d e g r e e s of freedom o f f e r e d by varying t h e number of balls p e r urn. Also, t h e number of independent comparisons is bounded by t h e dmension of t h e domain.
S e v e r a l interesting paradoxes from population dynamics involve only a small
number of comparisons, so i t i s t o use Theorem 2 t o explain and extend them (see,
f o r example, t h e p a p e r by Vaupel and Yashin, 1985). However, often t h e s e examples, as given by Vaupel and Yashin, a r e based on continuous random variables. To
use Theorem 2, t h e continuous variables are approximated by d i s c r e t e valued random variables.
Alternatively, Proposition 1 can b e extended, in t h e obvious
fashion, t o permit M t o b e a function space. In this way, t h e examples of Vaupal
and Yashin c a n b e t r e a t e d directly.
Another s o u r c e of paradoxes subsumed by Theorem 2 i s Blyth's p a p e r (1972
b). One of his paradoxes with random variables X and Y h a s P(X
unity as desired, even though P ( X
< a ) <P(Y < a )
> Y)
as close t o
f o r all choices of a: This, of
course, i s a n example of t h e boundary behavior of t h e comparison mapping. Both
Blyth (1972 b) and Vaupel and Yashin (1985) d e s c r i b e t h e paradoxes in terms of examples. The above treatment explains and unites them, i t shows t h a t they can b e
extended in s e v e r a l ways, and i t proves t h a t t h e paradoxes are "robust" in t h a t
they are satisfied by open sets of examples.
Theorem 1 and its generalization to a set of N c h a r a c t e r i s t i c functions are
corollaries of Theorem 2. The only surprising f e a t u r e of t h e generalization i s t h a t
t h e dimension of t h e domain f o r a comparison mapping c a n b e v e r y large. To s e e
this, let t h e f i r s t N-1 c h a r a c t e r i s t i c functions define
zN-l sets.
The last random
variable i s t r e a t e d as a c h a r a c t e r i s t i c function on each set. Thus, t h e domain of a
comparison mapping has t h e dimension
comment (2) this means t h a t up t o
zN-l + (zN-l - 1 ) = zN - 1.
zN - 1 functional
According t o
relationships can b e defined
from t h e s e random vari3bles with possible concomitant unexpected behavior.
W e conclude t h i s section with a p a r t i a l c o n v e r s e f o r Theorem 2. I t asserts
t h a t if a c e r t a i n set of examples c a n b e found, t h e n examples of all t y p e s exist.
Such a r e s u l t i s of value b e c a u s e when t h e dimension of t h e r a n g e s p a c e i s sufficiently l a r g e i t may b e difficult t o verify t h e r a n k condition. However, t h e specified set of examples might b e identified by a computer s e a r c h . F o r simplicity, w e
r e s t r i c t a t t e n t i o n t o l i n e a r comparison maps.
Corollary 2.2
Suppose t h a t F i s a l i n e a r comparison mapping from a l i n e a r s p a c e t o a r a n g e
s p a c e R k . Assume t h a t t h e Zk o r t h a n t s of R k are comparison r e g i o n s . If t h e r e exi s t 2(k
+ 1examples,
e a c h in a d i f f e r e n t comparison region, t h e n F h a s maximal
r a n k and a l l possible comparisons are admitted.
Proof
The image of a l i n e a r s p a c e u n d e r a l i n e a r mapping i s a l i n e a r s p a c e . If t h i s
image s p a c e h a s dimension k , t h e n t h e conclusion follows. By assumption, k image
points c a n b e found t h a t do n o t lie in t h e same (k -1)-dimensional
subspace. This
completes t h e proof.
Extensions are obvious. F o r instance, t h e proof r e q u i r e s only k examples
t h a t are not in t h e same (k -1)-dimensional plane. F o r o t h e r c h o i c e s of comparison
regions, t h e emphasis i s placed on t h e geometry defined by t h e image points with
r e s p e c t t o t h e p r o p e r t i e s of t h e image set.
3 RANKING PARADOXES
A r i c h e r assortment of p a r a d o x i c a l behavior emerges from multivalued r a n dom variables. (This i s b e c a u s e t h e dimension of t h e domain f o r a comparison mapping i n c r e a s e s with t h e number of values admitted by a random variable.) W e illus-
trate t h i s with s e v e r a l new r e s u l t s a b o u t ranking and voting p r o c e d u r e s .
Our main r e s u l t s c o n c e r n weighted o r positional voting. This i s defined in t h e
following way.
To r a n k t h e N a l t e r n a t i v e s , a l , ...,aN, choose N s c a l a r weights
(zlil, ...,wN) where w j 2 wk if and only if j
<k
and where w
> w~
2
0. Each v o t e r
lists his ranking of t h e A! a l t e r n a t i v e s on a ballot. To tally a ballot, wj points are
assigned t o t h e j t h r a n k e d a l t e r n a t i v e s , j = 1,...,N.
In t h e obvious way, t h e
grq.rp's ranking of t h e a l t e r n a t i v e s i s determined by t h e sum of t h e assigned
w e ghts.
The weights define a voting v e c t o r WN = (wl, ...,wN) in RN. For plurality voting, t h e voting v e c t o r is (1,0, ...,0). Another well-known voting system, called t h e
Borda count, is defined by t h e voting vector BN = (N,N
- 1,...,1).
(If a voting vec-
t o r is a linear combination of BN and EN = (1,...,I ) , then we call t h e system a Borda
system. This i s because t h e election r e s u l t f o r any Borda system always a g r e e s
with t h e r e s u l t when BN i s used t o tally t h e ballots (see S a a r i (1982.))
This tallying p r o c e s s can b e identified with t h e expected value of multivalued
random variables. For N alternatives, t h e r e are N! different ways t o r a n k t h e N
alternatives. Since t h e sum assigned t o any alternative is a l i n e a r relationship,
t h e group's ranking i s not a l t e r e d should each sum be divided by t h e t o t a l number
of voters. This means t h a t t h e number of v o t e r s i s replaced with t h e fraction of
t h e v o t e r s with each ranking. In t h i s way t h e domain f o r this problem can b e identified with (the rational points in) t h e simplex Si(N!). The simplex i s in t h e positive o r t h a n t of a n N!-dimensional space. If Aj i s t h e random variable assigned t o
alternative a j , then P(A,
= w k ) denotes t h e fraction of t h e v o t e r s who r a n k t h e
j t h alternative in t h e K~~ place. Let A b e t h e v e c t o r valued random variable
(A1, ...,AN). A point in Si(N!) can b e viewed as being a probability distribution, and
s o t h e tally of t h e ballots c a n b e identified with t h e expected value E(A).
The following definitions are used in what follows.
Definition
A voters' profile i s a listing of each v o t e r ' s ranking of t h e N alternatives.
Definition
The
WN
voting
+ (wN,...,w
vector
WN
defines
a
reverse
neutral
system
if
= CEN = (C ,...,C) f o r some s c a l a r C.
A Borda system is always r e v e r s e neutral. An easy algebraic argument demon-
s t r a t e s t h a t t h e s p a c e of r e v e r s e n e u t r a l systems i s a hyperplane of R N with dimension 1 + [N/2] where
[I
denotes t h e "greatest integer function". A basis f o r
this hyperplane can b e computed directly. For N
r e v e r s e neutral.
For N
= 3 only t h e Borda systems are
= 4, a basis f o r t h e hyperplane i s given by E 4 , and
(2,1,1,0). F o r N = 5, a basis is E5,B5 and (2,1,1,1,0); etc.
Although weighted voting systems a r e a n important class of voting methods,
t h e i n t e r p r e t a t i o n of election results i s problematic. This i s illustrated by t h e following theorem which includes Example 2 a s a special case.
Theorem 3
Let N 2 3. Let ALk b e t h e subset [ a L...,
, ak 1. Let Wk b e t h e voting v e c t o r used
t o r a n k ALk. Assume t h a t
N(N
- )/2
wk
i s not r e v e r s e neutral, k = 3,...,N. For each of
p a i r s of alternatives, a r b i t r a r i l y designate one of t h e alternatives. A r -
bitrarily choose a ranking RKk f o r t h e set ALk,k
= 3, ...,N. There exist profiles of
v o t e r s such t h a t (i) t h e election r e s u l t f o r ALk is RKk ,k = 3, ...,N, and (ii) f o r each
p a i r of alternatives, a majority of t h e s e same v o t e r s p r e f e r t h e designated alternative.
The proof of this theorem and a n extension a r e given in Section 4.
A simple consequence of this theorem i s t h a t , if a Borda system is not used,
then t h e r e are profiles of v o t e r s where most of t h e v o t e r s p r e f e r a l t o a 2 , most
p r e f e r a 2 t o a s , most p r e f e r a l t o a 3 , y e t t h e election r e s u l t i s a 3 > a 2 > a l . The
implied ranking obtained by majority vote o v e r t h e p a i r s of alternatives i s t h e rev e r s a l of t h e election result! (In t h e example in Section 1.2, such a profile i s given
= 5 t h e r e i s a profile
of v o t e r s such t h a t majority votes determine t h e rankings a, > a, +l f o r j = 1 ,...,4,
f o r plurality voting.) A more striking example i s t h a t f o r N
a 5 > a l [ t h e s e five alternatives from a cycle1 a 4 > a j f o r j = 1,2,al > a 3 , and
aj
al
> a 5 f o r j = 2,3, and t h e plurality election r e s u l t s of A L j , j = 3,4,5, a r e
> a 3 > a 2 , a 2> a 3 > a l > a 4 , and a 3 > a l > a 5 > a 2 > a 4 respectively. Other
examples are limited only by t h e imagination of t h e designer.
By use of different techniques, Fishburn (1981) proved Theorem 3 f o r t h e special case N = 3. (More accurately, Fishburn gave a proof only f o r t h e f i r s t example above. However, i t i s possible t h a t his approach extends t o include o u r gen-
eral statement f o r N = 3.) F o r N
> 3,
t h e f i r s t conclusion without t h e second i s a
special c a s e of a r e s u l t given by S a a r i (1984).
The second p a r t of t h e theorem i s of independent interest. Essentially, i t as-
serts t h a t if t h e p a i r s of alternatives are ranked by majority voting, then any type
of cycle, subcycle, etc., c a n occur. To highlight this r e s u l t , we restate it.
Definition
Let N
=> 3.
F o r 1S k
<j
S
N, l e t Rkj b e t h e set lak
> a j , a j > ak j.
Let t h e
s p a c e of b i n a r y rankings BR b e t h e c a r t e s i a n p r o d u c t of t h e N(N -1)/2 sets R k j .
An element of BR i s a sequence t h a t imposes a n o r d e r i n g f o r e a c h of t h e N(N -1)/2
p a i r s of a l t e r n a t i v e s . These binary rankings need not b e t r a n s i t i v e , n o r need t h e y
satisfy a n y o t h e r consistency requirement.
Corollary 3.1
Let q b e a n element BR. T h e r e e x i s t examples of v o t e r s ' p r o f i l e s such t h a t ,
f o r e a c h p a i r of a l t e r n a t i v e s , a majority of t h e same v o t e r s h a v e t h e ranking
specified by q .
The remainder of t h i s section i s devoted to e x t r a c t i n g some of t h e consequences of
Theorems 3 and Corollary 3.1. We start by obtaining new r e s u l t s a b o u t t h o s e
schemes t h a t depend on majority v o t e s o v e r p a i r s of a l t e r n a t i v e s ( s e e t h e exposit o r y a r t i c l e by Nierni a n d R i k e r (1976).) F o r example, a n a l t e r n a t i v e is called a
Condorcet w i n n e r i t i s r e c e i v e s a majority v o t e when compared a g a i n s t e a c h of
t h e o t h e r a l t e r n a t i v e s . A Condorcet winner d o e s n o t always e x i s t (e.g., t h e a b o v e ,
second example), so o t h e r schemes h a v e been proposed to determine t h e winning
a l t e r n a t i v e . The following definition a p p e a r s to include a l l methods based on t h e
ordinal rankings.
Definition
A b i n a r y r a n k i n g method i s a nonconstant mapping from a s u b s e t of BR into
lal,...,aNj. That is, based on t h e ordinal rankings of p a i r s of a l t e r n a t i v e s , o n e of
t h e N a l t e r n a t i v e s is selected.
Examples
(1) A Condorcet winner i s a binary ranking method. The s u b s e t is t h e set of
a l l elements of BR where some o n e a l t e r n a t i v e i s p r e f e r r e d to all o t h e r
alternatives.
(2) An obvious extension of t h e Condorcet winner is t o select t h e a l t e r n a t i v e
t h a t wins t h e l a r g e s t number of pairwise comparisons. This extensio-.. admits a l a r g e r s u b s e t of elements from BR.
(3) Suppose N
- 1a l t e r n a t j v e s
are proposed t o r e p l a c e t h e s t a t u s quo, a l .
The s e l e c t e d a l t e r n a t i v e i s a l if and only if a l i s a Condorcet winner. If
not, t h e n from t h e set of t h o s e a l t e r n a t i v e s t h a t b e a t a l , s e l e c t t h e one
t h a t wins t h e most pairwise comparisons.
(4) A commonly used binary ranking method i s a n agenda.
Definition
Let N
> 3.
An agenda i s a n o r d e r e d listing of t h e N a l t e r n a t i v e s . The f i r s t two
listed a l t e r n a t i v e s are voted upon. The a l t e r n a t i v e receiving t h e majority v o t e i s
t h e n compared with t h e t h i r d listed a l t e r n a t i v e . This i t e r a t i v e , pairwise comparison p r o c e d u r e i s continued t o t h e end of t h e listing. The remaining a l t e r n a t i v e i s
t h e selected alternative.
The following statement extends s e v e r a l r e s u l t s from "agenda manipulation"
( s e e , for example, McKelvey (1976) and P l o t t and Levine (1978)). I t implies t h a t
t h e r i g h t t o set a n agenda f o r a meeting i s a potential s o u r c e of power ( t h e f i r s t
conclusion) t h a t may lead t o a n undesired outcome ( t h e second conclusion).
Corollary 3.2
Let N
> 3.
T h e r e e x i s t v e c t o r s ' p r o f i l e s and N agendas such t h a t , when t h e
same v o t e r s u s e t h e j t h agenda, t h e outcome i s a,. j = 1.....N. F o r N
> 3,
there
e x i s t v o t e r s ' profiles and N agenda so t h a t t h e above holds even though all the
voters p r e f e r a g lto a 4 ,a 4 t t o ] a s ,..., land] a
~ to -a ~
~ .
An i n t e r e s t i n g f e a t u r e of t h i s c o r o l l a r y i s t h a t f o r a fixed p r o f i l e of v o t e r s ,
t h e winning a l t e r n a t i v e v a r i e s o v e r a l l possible outcomes as t h e "seeding", o r t h e
choice of t h e agenda, changes. The proof depends on t h e fact t h a t majority, p a i r wise voting c a n define a n y d e s i r e d c y c l e and subcycle. Thus, t h i s conclusion extends t o t h e o t h e r binary ranking methods t h a t depend on t h e initial seeding. This
includes tournaments, w h e t h e r single, double, o r k -fold elimination, c e r t a i n
h i e r a r c h i c a l methods, etc. In p a r t i c u l a r , because a change in t h e seeding c h a n g e s
t h e definition of a binary ranking method, Corollary 3.2 i s a special case of t h e
following.
Corollary 3.3
Let Pl and P2 b e different mappings from BR to lal,...,aN1. There exist profiles of v o t e r s s o t h a t t h e outcome of t h e two binary voting methods differ.
The next r e s u l t compares t h e outcome of a binary ranking method with t h e
election r e s u l t s of a weighted voting system. A s special cases, i t shows t h a t a Cond o r c e t winner, or t h e r e s u l t of a n agenda, need not a g r e e with a n election ranking.
Corollary 3.4
Let N
> 3.
Let t h e s e t Alk b e ranked with t h e voting v e c t o r Wk ,K
= 3,...,N.
Let a binary ranking method b e given. Assume t h a t Wk,k = 3,...,N, is not r e v e r s e
neutral. Let Rkk , k
= 3,...,N, by any ranking of Alk, and l e t a j b e a n a r b i t r a r y ele-
ment in t h e image of t h e binary ranking method. There exist voters' profiles such
t h a t (i) t h e election r e s u l t of Alk,K = 3,...,N, and (ii) t h e binary ranking method
selects aj.
A s a consequence of this theorem, t h e r e is a profile of v o t e r s s o t h a t t h e i r plurality election ranking of Sk i s a l
> a 2 > ... > ak if
k is even and t h e r e v e r s e of t h i s if
k is odd, and t h e Condorcet winner is a l .
This chaotic s t a t e of a f f a i r s cannot b e eliminated if t h e selection method is
defined to combine, in some way, t h e election r e s u l t s o v e r all of subsets
Alk,K
= 2,...,N. For instance, in a run-off election, t h e lower ranked alternatives
are dropped, and t h e remaining set is r e r a n k e d in a s e p a r a t e election. The following definition extends this notion.
Definition
A d y n a m i c a l selection p r o c e s s consists of (i) a set of voting v e c t o r s
1WN,...,W3,(1,O)1,
(ii) r u l e s t h a t eliminate a specified, positive number of alterna-
tives from a s e t of k alternatives, k = 2,...,N, and (iii) a selection function. The
p r o c e d u r e is defined in t h e following way. The set AIN i s ranked by using WN.
Then, based on t h e elimination r u l e f o r t h e N alternatives, N-s
alternatives are
eliminated. The remaining set of s alternatives is ranked by using W,.
Iteratively,
this procedure i s continued. Based on t h e s e election rankings, t h e nonconstant
selection p r o c e d u r e selects one alternz tive.
Examples
(1) The s t a n d a r d "run-off" election i s a dynamical p r o c e d u r e . At e a c h s t e p ,
t h e bottom r a n k e d a l t e r n a t i v e i s eliminated. The s e l e c t e d a l t e r n a t i v e i s
t h e one remaining at t h e end of t h e p r o c e s s .
(2) This run-off p r o c e d u r e c a n b e generalized in t h e following way. Choose a
positjve i n t e g e r k < N. The elimination p r o c e d u r e i s t h e same as in (I),
but t h e selection r u l e s e l e c t s t h e t o p r a n k e d a l t e r n a t i v e from t h e election ranking of k alternatives. If k
k
= 2, t h i s i s t h e a b o v e p r o c e d u r e . If
= N , t h i s i s a s t a n d a r d election p r o c e d u r e .
(3) The run-off p r o c e d u r e c a n eliminate more t h a n one a l t e r n a t i v e at e a c h
s t a g e . F o r instance, a f t e r t h e N a l t e r n a t i v e s are r a n k e d , all b u t t h e t o p
two a l t e r n a t i v e s may b e dropped.
(4) Let a l r e p r e s e n t t h e s t a t u s qua, and let aj,
j = 2,...,N r e p r e s e n t t h e contending a l t e r n a t i v e s . Use WN t o r a n k t h e N a l t e r n a t i v e s . If a l i s t h e t o p
r a n k e d a l t e r n a t i v e , i t i s d e c l a r e d t h e winner. If i t i s not, t h e n eliminate
a l a n d r a n k t h e remaining a l t e r n a t i v e s with WN
The t o p r a n k e d a l t e r -
native from t h i s election i s d e c l a r e d t h e winner.
(5) The elimination r u l e may depend o n t h e a l t e r n a t i v e s . F o r instance, t h e
p r o c e s s d e s c r i b e d in (4)c a n b e modified t o eliminate not only a l b u t also
a l l a l t e r n a t i v e s r a n k e d below a l in t h e f i r s t election.
As a s p e c i a l c a s e , t h e following asserts t h a t t h e winner of a run-off election need
not b e a Condorcet winner.
Corollary 9.5
Let N
> 3.
Assume t h a t a binary ranking method and a dynamical selection
p r o c e s s are given. Suppose t h a t , f o r e a c h K r 3, t h e weight v e c t o r Wk i s not rev e r s e n e u t r a l . A r b i t r a r i l y s e l e c t aj from t h e r a n g e of t h e b i n a r y ranking method
and ak from t h e r a n g e of t h e dynamical process. T h e r e e x i s t p r o f i l e s of v o t e r s so
t h a t t h e b i n a r y ranking outcome i s aj while t h e dynamical method outcome i s at.
Recently t h e r e h a s b e e n i n t e r e s t in election p r o c e d u r e s where a v o t e r c a n
choose a voting v e c k r t o tally h i s ballot.
Definition
A m u l t i p l e voting s y s t e m used t o r a n k N alternatives i s determined by a set
MN of voting v e c t o r s where at least two of t h e s e v e c t o r s and EN are linearly independent. Each v e c t o r r a n k s t h e N alternatives on his ballot, and then h e selects
a v e c t o r from MN t o tally his ballot.
Examples
(1) Bullet
M
voting:
The
= (2.0,.0 (1..0
set
defining
( 1 1 0. 0
.
of
voting
vectors
is
This procedure w a s used during
t h e 1970s f o r some legislative offices in Illinois.
(2) Cardinal voting: The set MN contains all voting v e c t o r s where t h e components sum t o unity.
Occasionally, cardinal voting is used t o define
rankings f o r methods from decision analysis.
(3) Approval
voting:
The
defining
I (l,O,...,O),(l,l,O,...,O),..., (1,1,...,1,0) j.
set
of
N-1
vectors
is
For t h i s method, which w a s intro-
duced by R. Weber, among o t h e r s , and ithas been analyzed by Brams and
Fishburn (1982), a v o t e r indicates e i t h e r approval o r disapproval of
each alternative.
Often t h e r e s u l t s of multiple voting systems are compared with t h e Condorcet
winner. The following shows t h a t t h e r e s u l t s can b e incompatible.
Corollary 3.6
Let N r 3. Let a binary ranking method b e given. Let Mk define a multiple
voting system f o r dlk,k
= 3, ...,N. Assume t h a t , f o r each k ,Mk contains at l e a s t one
v e c t o r t h a t i s not r e v e r s e neutral. Let Rkk b e a ranking f o r dlk,and l e t aj b e a n
alternative in t h e r a n g e of t h e binary ranking method. There e x i s t voters' profiles s o t h a t (i) t h e multiple election r e s u l t f o r A l k i s Rkk,k = 3,...,N, and (ii) t h e
binary ranking outcome is aj.
A consequence of this r e s u l t is t h a t , f o r any choice of s , t h e r e e x i s t examples
where t h e alternative ranked in s t h place in an approval voting election i s t h e
Condorcet winner, and t h e r e e x i s t examples where t h e results based upon approval
v3ting are a4 > a 3 > a 2 > a l f o r t h e set of four alternatives, a l > a 2 > a f o r t h e
Q.
~ b s e of
t t h r e e alternatives, and a 2 i s a Condorcet winner.
I t follows from this a p p r o a c h t h a t t h e p r i n c i p a l cause of the social choice
paradoxes i s the difference between the d i m e n s i o n s of the d o m a i n a n d the
range of a comparison mapping (see Section 2 , comment ( 2 ) ) . To model a weighted
election f o r N alternatives, t h e domain S i ( N ! ) h a s dimension N!-1. The image i s
E ( A ) . Because t h e domain i s S i ( N ! ) , this image i s in t h e simplex Si ( N ) in R ~ (The
.
sum of t h e components of
wN define this
sume t h a t t h i s sum i s unity.)
simplex. Without loss of generality, as-
Thus, t h e r a n g e s p a c e h a s dimension N-1.
This
difference of N{(N-I)!-11 i s z e r o if and only if t h e r e are only two alteratives.
Therefore, if N
2
3, o t h e r relationships with resulting paradoxes can b e added.
This is illustrated by Theorem 3.
This dimensional argument also proves t h a t , f o r N
2
4, Theorem 3 i s not t h e
"best possible" r e s u l t . To see this, w e need t o d e s c r i b e t h e comparison mapping L
f o r Theorem 3. The f i r s t N components of L a r e given by E ( A ) , t h e next N-1 by
t h e expected value of t h e weighted voting method defined by W N - ~ e, t c . The last
N ( N - I ) / 2 components are given by t h e expressions P ( A k ) - P ( A , ) , k
<j .
Thus,
the range space f o r L is
Si ( N )
where
N
J
is
the
+ ( N -1)+...+3 +
X
interval
S i ( N -1) x.. .
[-ll]
I(N)(N - 1 ) / 2 1 = N~
X
S(3) x J
This
~-I)' (
range
-N - 1.
space
~
has
the
dimension
For t h e model described in t h e
theorem, t h e difference between t h e dimensions of t h e domain and t h e r a n g e is
N!
- N~ + N.
This value i s positive if and only if N
2
4.
Corollary 3.7
Let N
N!
2
- N~ + N
4. In addition t o t h e subsets of alternatives described in Theorem 3,
additional relationships involving t h e rankings of t h e N alternatives
can b e defined in such a way t h a t , f o r certain profiles of v o t e r s , t h e r e s u l t s are
independent of t h e rankings obtained in Theorem 3.
A simple dimensional argument shows t h a t even if Theorem 3 can b e extended from
nested sets of t h r e e o r more alternatives t o all possible subsets of alternatives,
f o r N t 4 additional relationships can still b e found.
To complete o u r description of L , notice t h a t t h e comparison value on each
simplex in t h e r a n g e i s t h e point of complete indifference N - ~ E ~
For
. each of t h e
intervals J , t h e comparison value i s 0. Thus, t h e comparison point i s
Therefore Theorem 3 is a n example of Theorem 2 where M and N are manifolds and
where t h e comparison point is not t h e origin of a Euclidean space.
The comparison mapping L i s linear. Therefore L -'(0)
space with dimension N!
of Si(N!).
- N~ + N.
If N
> 3 this s p a c e must
must b e a linear subi n t e r s e c t t h e boundary
A boundary point of Si(N!) corresponds to a profile of v o t e r s where
none of t h e v o t e r s r a n k t h e alternatives in c e r t a i n ways. This extreme boundary
behavior describes t h e limits of t h e voting paradox. A special case is described in
Corollary 3.2.
Corollary 3.8
Let N
2
4. The r e s u l t s in Theorem 3 can b e obtained with profiles of v o t e r s
where no v o t e r h a s c e r t a i n rankings of t h e alternatives.
W e have not t r i e d to find a general characterization of t h e boundary behavior.
The proof of Theorem 3 involves showing t h a t t h e linear comparison map has
maximal rank. The r a n k condition does not hold if Wk,k = 3,...,N are Borda vect o r s . I t t u r n s out t h a t L h a s corank (with r e s p e c t t o t h e r a n g e space) of at least
IN(N
- 1 ) / 2 ] - 1. This means t h a t although a Borda election ranking admits incon-
sistencies with r e s p e c t t o a given binary ranking method, not all possible inconsistencies are admitted. In p a r t i c u l a r , Theorem 3 does n o t hold if e v e n one of t h e
Wk is a Borda v e c t o r . A d i r e c t verification of this f o r N = 3 i s given in Section 4.
With only slight modifications, t h e s e r e s u l t s can b e used to d e s c r i b e c e r t a i n
ranking p r o c e d u r e s coming from probability and statistics. For instance, suppose
N forms are making t h e same product, and they are t o b e ranked based on t h e quality of t h e i r products. In Theorem 3, identify t h e "ithalternative" with t h e " i t h
firm" t h e " t th v o t e r " with t h e "jthv e c t o r sample" of t h e product taken from each
of t h e N firms, and t h e "jthv o t e r ' s p r e f e r e n c e ranking" with t h e linear "quality
ranking" of t h e products in t h e jth sample. The relationship ak
> a j means t h a t ,
based upon t h e samples, form k ' s product a p p e a r s t o b e s u p e r i o r to firm j ' s . It
follows from Corollary 3.1 t h a t any possible choice of binary rankings is realized
by an open set of d a t a points. Binary sampling a p p r o a c h e s need not lead t o a
linear ordering of t h e "quality of t h e firms". Indeed, in this way, t h e well-known
Steinhaus-Trybula paradox (Steinhaus and 'Trybula, 1959), where t h e final ranking
of t h r e e forms is a l
> a 2 , a 2 > a 3,
but a 3 > a l , becomes a special case of Corol-
l a r y 3.1.
I t follows from Theorem 3 that, even if t h e firms are ranked by use of weighted ranking methods, t h e r e s u l t s could b e difficult t o i n t e r p r e t . For instance, t h e
weight v e c t o r s W, = (LO, ...,0 ) correspond t o t h e natural ranking method based on
P ( q = maxIXf:j
E Ak
j). I t follows from t h e above t h a t , should some one firm b e
deleted, t h e revised ranking could drastically change. Other measures experience
similar problems. A similar effect o c c u r s f o r t h e scoring of athletic events where
a voter's ranking corresponds t o how t h e various teams are placed in a p a r t i c u l a r
event, etc.
A s a final amusing example, note t h a t a connoisseur is often described as a
person whose taste p r e f e r e n c e s a r e based upon s e v e r a l a t t r i b u t e s (e.g., t h e color,
t h e t a s t e , and t h e bouquet of a wine), and whose rankings are based on a n aggregation of them. If so, w e should not expect his binary comparisons t o define a transitive ordering. This is, of course, an N alternative version of t h e famous folklore
"pie" example (I p r e f e r "apple" t o "cherry", but if "blueberry" is available, then
my choice i s "cherry").
A s in Section 2, t h e r e e x i s t open sets in t h e domain which exhibit e a c h of t h e
above behaviors. Consequently, t h e s e examples cannot b e dismissed as being isolated; t h e behavior is robust. A s t h e number of agents increases ( t h e denominat o r s of t h e rational points become l a r g e r ) , so d o t h e number of t h e possible examples, which leads us t o t h e following corollary.
Corollary 3.9
Consider a system of weighted voting methods as described in Theorem 3. Let
Q denote a n outcome o v e r t h e various sets as described in Theorem 3. Let n (Q,m)
b e t h e probability t h a t t h e election r e s u l t f o r a group of m v o t e r s i s Q. Assume
t h a t t h e profiles of v o t e r s are uniformly distributed. Then, as m -, w , n ( Q , m ) app r o a c h e s t h e r a t i o of t h e area of L -'(Q) to t h e area of t h e simplex Si (N!).
For elementary number t h e o r e t i c reasons, t h e sequence [n(Q ,m )
monotone. The limit i s positive if L -'(Q)
may not b e
contains a n open set; t h i s i s t r u e whenev-
e r Q does not admit ties. For o t h e r distributions, t h e r a t i o is determined in a simil a r fashion, but with a different measure.
4. PROOFS
The proof of Theorem 2 i s obvious. To prove Theorem 3, w e f i r s t prove Coroll a r y 3.1.
Proof of Corollary 3.1
List t h e p a i r s of alternatives in t h e following o r d e r : t h e f i r s t p a i r is ( a l , a z ) ,
t h e second set of two p a i r s i s given by ( a j , a t ) . j
is given by (a, , a k +1), j
= 1.2, and t h e
j t h set of k p a i r s
= 1,...,k ; k = 3,...,N -1. A ranking of t h e N alternatives de-
fines a n tN(N -I)/ 2 ]-dimensional v e c t o r in t h e following way. The j
th
component
is determined by t h e ranking of t h e j t h p a i r of alternatives. This component i s 1 if
t h e f i r s t listed alternative i s p r e f e r r e d t o t h e second; otherwise, i t is -1. F o r example, t h e v e c t o r associated with t h e p r e f e r e n c e ranking a l
> a z > ... > a~ h a s
t h e value 1in all t h e components.
Because t h e N alternatives can b e ranked in N! different ways, t h e comparison map is a linear mapping from Si (N!) t o
J ~ ( ~ -where
~ ) J
' ~
is
t h e interyal
[-1.11
and t h e comparison point i s 0 . (this map defines a convex combination of t h e
above N! vectors.) We must show t h a t t h e r e i s a point p in t h e i n t e r i o r of Si (N!)
such t h a t (i) p i s in t h e preimage of 0 and (ii) t h e Jacobian of t h e comparison map
at p h a s full rank. Let p = (~!)-'(l,
...,1).
Because p i s t h e profile where t h e r e
are equal numbers of v o t e r s f o r e a c h of t h e N! possible ways t o r a n k t h e alternatives, p i s mapped t o 0.
The comparison mapping i s linear, s o
jt
h a s a matrix representation. The ma-
t r i x i s t h e Jacobian, and i t consists of t h e N! column v e c t o r s defined above. I t
remains t o show t h a t t h i s set of N! v e c t o r s includes N(N-1)/2
linearly indepen-
dent vectors.
Consider t h e v e c t o r s Vj ,j
f i r s t IN(N -1)/Z J
- ( j-1)
= 1,...,N(N -1)/2, where
5 has t h e value 1f o r t h e
component and -1 f o r t h e remaining components. This
set of v e c t o r s i s linearly independent. This is because they form a s q u a r e a r r a y
where t h e e n t r i e s on and above t h e diagonal from t h e lower left-hand c o r n e r t o t h e
u p p e r tight-hand c o r n e r are all equal t o 1. A l l t h e o t h e r entries are -1.
There a r e
zN v e c t o r s with
e n t r i e s of e i t h e r 1o r -1. Most of them are not re-
lated t o t h e described ranking method. So, t o complete t h e proof, i t remains t o
5 is associated with one of t h e N! rankings of t h e alternatives. The
choice of t h e components and t h e v e c t o r s 5 makes thi; fairly simple. The v e c t o r
show t h a t e a c h
Vl corresponds t o t h e ranking a
> a 2 > ... > a,,,.
Vectr
.- V2 has
-1 only in t h e last
component; this corresponds t o a transposition of aN and a N - , . These two alternatives are adjacent in t h e f i r s t ranking, s o t h e ranking f o r V2 c a n b e obtained from
t h e ranking f o r Vl by transposing these alternatives. This defines t h e ranking
a,
> a 2 > ... > U N - Z > a~ > a ~ - ~ .
Indeed, t h e only difference between Vj and
is in one component. This
component r e f l e c t s a change in t h e ranking of precisely one p a i r of alternatives.
By construction, t h e s e two alternatives are adjacent in t h e ranking Rkj t h a t is associated with
?.
Therefore, t h e ranking f o r V,
is obtained by transposing these
two adjacent alternatives in R k j . This completes t h e proof.
This proof is based on t h e f a c t t h a t the -1's in t h e square a r r a y correspond t o
the N-1
adjacent transpositions required t o move aN from last place in
> a 2 > ... > a~ t o first. This defines N -1 rankings where
aN > a l > a 2 > ... > a ~ - Next,
~ . move aN-l from what is now last
al
t h e last one is
place t o second,
etc.
Proof of Theorem 3
Let t h e weight v e c t o r s Wk,k = 3 ,...,N , b e as specified in t h e statement of t h e
theorem. With each ranking of t h e N alternatives, w e associate a v e c t o r with
[ N ( N -1) / 2 J
+ 3 + ... + N components.
The f i r s t N ( N -1) / 2 components a r e defined
as above. The next t h r e e are given by t h e a p p r o p r i a t e permutation of Wa t o
correspond t o t h e specified ranking. For instance, t h e ranking a 2 > a g > al i s
identified with t h e vector ( w 3 , w l l w 2 ) .In general, t h e s e t of k components is t h e
a p p r o p r i a t e permutation of Wk t o r e f l e c t t h e ranking of t h e k alternatives,
k = 3 ,..., N .
P (N-1)'2
X
The comparison mapping L which is a mapping from Si ( N ! ) t o
Si ( 3 ) X...
X
Si ( N ) is described in Section 3 . The point p described
above is mapped t o t h e comparison point ( 0 , O,..., 0 ;( 1/ 3)Ea ,...,( 1/ N)EN).
Because L is linear, i t s matrix representation defines t h e Jacobian. This mat r i x h a s N ! column' v e c t o r s with IN (N - I ) / 2 j
+ 3 + ... + N
= N' - 3 components.
(The dimension of t h e r a n g e space is smaller; i t h a s dimension N'-N
- 1.
The
difference results from t h e constraints defining t h e N -2 simplices Si ( k ) in the ima g e space.) W e must show t h a t t h e r e are N'
In t h e proof of the corollary,
in t h e f i r s t N ( N - 1 ) / 2
:i set
- 3 linearly independent vectors.
of N ( N -1) / 2 v e c t o r s t h a t are independent
compo~:~ntsw e r e found.
To obtain t h e remaining
! N ( N + 1 ) /2 1-3 independent vecto! s, t a k e t h e v e c t o r associated with each of t h e N !
rankings and add i t t o t h e v e c t o r associated with t h e r e v e r s a l of this ranking.
Each of t h e f i r s t N(N -I)/ 2 components of t h e v e c t o r associated with t h e r e v e r s e d
ranking will differ in sign from t h e original vector. Therefore, t h e sum v e c t o r s
will have z e r o s in each of t h e s e f i r s t N(N-1)/2
components. Consequently, t h e s e
new v e c t o r s are orthogonal t o t h e range s p a c e used in t h e proof of Corollary 3.1.
All w e need t o do is t o show t h a t t h e s e new v e c t o r s contain a set of 1N(N
+ I)/ 21-3
independent vectors.
For a r e v e r s e neutral v e c t o r , t h e s e new v e c t o r s are all multiplies of
all o t h e r c a s e s , t h e j t h component h a s t h e value w j
+ W ~ - j + l For
.
EN.
In
instance, f o r
t h e voting v e c t o r (4, 3, O), t h e new v e c t o r corresponding t o t h e rankings
a
b
>b >c
>c >a
and c
>b >a
i s (4, 6, 4). The v e c t o r corresponding t o a
>c >b
an
i s (4, 4, 6). In general, t h e s e new v e c t o r s would correspond to a voting
vector e x c e p t t h a t they do not satisfy t h e monotonicity condition. However, t h e
r e s u l t s given by S a a r i (1984) hold even f o r v e c t o r s t h a t do not satisfy t h e s e monotonicity p r o p e r t i e s . Therefore, t h e above reduces t o a special case of t h e one
given by S a a r i (1984). This completes t h e proof.
Proof that a Borda weight vector does not work for N = 3
Assume t h a t t h e alternatives are a , b , and c . assume t h a t t h e Borda weight
vector i s B3 = (3,2,1). The comparison mapping i s linear and its image includes t h e
comparison point (0,0,0;6,6,6). (B3 i s not normalized, and s o t h e sum of t h e components of S
i (3) i s 6.)
The comparison regions in S
i (3) are identified with t h e linear rankings of t h e
t h r e e alternatives. To obtain them, note t h a t if t h e a x e s of R~ are labeled in t h e
> y corresponds to a > b , y > z
to c > a , etc. In this way, t h e simplex
usual z,
y , z notation, then t h e region z
corresponds t o b
> c, z >z
corresponds
S
i (3) i s divided into six open sets which are defined by t h e intersection of t h e simplex with t h e t h r e e hyperplanes z = y ,y
= z , and
z
= z (see S a a r i (1978, 1982)).
Suppose t h a t t h e different behaviors described in t h e theorem hold for B3.
This means t h a t t h e image of t h e comparison mapping meets each of t h e six regions
of Si (3) as well as all of t h e open regions in J ~ In
. all, i t would meet 48 open regions. If this happens, then, by t h e linearity of t h e mapping and a comparison of
t h e dimensions of t h e domain and range, i t follows t h a t t h e mapping i s onto a neighborhood of t h e c ~ m p a r i s o npoint. This f o r c e s t h e matrix t o b e of r a n k five. To
show t h a t this
id
not so, w e list all six of t h e v e c t o r s and then e x t r a c t a four-
dimensional basis.
The v e c t o r s are as follows:
These six v e c t o r s admit a basis consisting of t h e f i r s t t h r e e v e c t o r s and t h e v e c t o r
(0,0,0; 1,1,1). Thus, t h e system has corank 2. Since t h i s last v e c t o r is orthogonal
t o t h e image space, t h e system has corank 2 with r e s p e c t t o t h e image space. This,
and t h e linearity of t h e mapping, means t h a t t h e comparison mapping h a s a nonzero
intersection with 1 2 of t h e 48 admissible comparison regions. If t h e mapping were
always consistent, then t h e mapping would meet only 3! = 6 regions. Thus, t h e mapping still admits s e v e r a l "inconsistent" conclusions. (In a p a p e r being p r e p a r e d ,
we c h a r a c t e r i z e t h e election rankings admitted by a Bol;da count.)
Extension of Theorem 3
The last p a r t of t h e proof of Theorem 3 is based on t h e work of S a a r i (1984)
which admits a wider variety of results. For example, f o r k alternatives, suppose
t h e r e a r e k -1 weight v e c t o r s W k ,which form, with Ek,a linearly independent set.
Arbitrarily choose k -1 rankings of t h e k alternatives. The theorem asserts t h a t
t h e r e exist voters' profiles s o t h a t when t h e s a m e voters rank t h e s e t of k alterna-
= 3,...,N with t h e i t h voting vector, then t h e outcome is t h e i t h specified
ranking of t h e alternatives. This is t r u e f o r all choices of i = 1,...,k -1 and k.
tives, k
A similar extension holds f o r Theorem 3. For each k = 3,...,N, choose
k
- 1- k / 2
voting vectors with t h e following property: (i) t h e voting v e c t o r is
not r e v e r s e neutral, and (ii) t h e s e t of k -k / 2 vectors, defined by Ek and t h e vect o r s formed by adding each of t h e k
- 1 - k / 2 voting
vectors t o i t s r e v e r s a l , is a
linearly independent s e t . For each voting vector, a r b i t r a r i l y choose a ranking of
t h e k alternatives. For each p a i r of alternatives, designate one of them. There
exist voters' profiles s o t h a t when t h e i t h voting vector is used t o r a n k t h e k alternatives t h e outcome is t h e assigned ranking i
= l , ...,k - l[/21, k = 3,...,N. For
each p a i r , a majority of t h e same v o t e r s p r e f e r t h e designated alternative. The
proof of this statement is a straightforward modification of t h e proof of Theorem
3.
Proof of Corollary 3.2
From Corollary 3.1, i t follows t h a t t h e r e is a n open set of v o t e r s ' profiles
where t h e outcome in pairwise elections i s t h e cycle
Consider t h e r e v e r s e d c y c l e a l
< aN < a ~ < ...
- <~a 2 < a l .
The following defines
a n agenda where a j will b e t h e winner. Let t h e agenda b e t h e N t e r m s in t h e rev e r s e d cycle t h a t start with t h e a l t e r n a t i v e immediately following a j a n d e n d s with
a j . F o r example, a 3 wins with t h e agenda [ a 2 , a l , a N , a-l
, ,..., a 4 , a 3 ] .
The second p a r t of t h i s c o r o l l a r y i s a consequence of t h e boundary p r o p e r t i e s
of L -l(0). The second p a r t of t h i s c o r o l l a r y i s a consequence of t h e boundary
p r o p e r t i e s of L-l(0). The profile, w h e r e a n equal number of v o t e r s h a v e e a c h of
the
three
rankings
a 3 > a q > ... > aN > a l
> a2 has
al
> a 2 > ... > a
~ , >aa 3~> ... > a ~ > a l l
and
t h e d e s i r e d p r o p e r t i e s . Note t h a t in e a c h p a i r -
wise comparison t h e winning a l t e r n a t i v e r e c e i v e s e i t h e r
2
of
t h e vote, o r all of it!
3
This i s t r u e f o r whichever agenda i s used and whichever a l t e r n a t i v e wins.
The dice example
This i s a s t r a i g h t f o r w a r d computation. However, t h e domain point used in t h e
image of t h e comparison point should c o r r e s p o n d t o two identical weighted, b u t not
f a i r , dice. The probability t h a t a p a r t i c u l a r f a c e will s u r f a c e is l e f t t o t h e end of
t h e computation. In o t h e r words, t h e r e are some complications in t h e computation
with two f a i r dice.
ACKNOWLEDGEMENT
This r e s e a r c h w a s s u p p o r t e d by NSF Grants IST-8111122 and IST-8415348.
I
a m pleased t o acknowledge conversations with W.W. Funkenbusch, Tom Miles, and
Hans Weinberger on t h e s e and r e l a t e d topics. Some of t h i s work w a s done while I
w a s visiting t h e Institute f o r Mathematics and i t s Applications (IMA) at t h e University of Minnesota. I would like t o thank my hosts, Hans Weinberger, George Sell,
and Leo Hurwicz f o r t h e i r kind hospitality during my stay. These r e s u l t s were
presented at a n IMA c o n f e r e n c e on mathematical political s c i e n c e in July 1984.
REFERENCES
Blyth, C. (1972a). On Simpson's paradox and t h e sure-thing principle. J. Am. S t a t .
Assoc. 67, 364-366.
Blyth, C. (1972b). Some probability paradoxes in choice from among random a l t e r natives. J. Am. S t a t . Assoc. 67, 366-373.
Brams, S., and Fishburn P. (1982). Approval Voting. Birkhauser, Boston.
Fishburn, P . (1981). Inverted o r d e r s for monotone scoring rules. Discrete Appl.
Math. 3 , 27-36.
McKelvey, R.D. (1976). Intransitivities in multidimensional voting models and some
implications for agenda control. J. Econ. Theory 12,472-482.
Niemi, R., and Riker W. (June 1976). The choice of voting systems. S c i . Am. 234,
21-27.
Plott, C.R., and Levine M.E. (1970). a model of agenda influence on committee decisions. Am. Econ. Rev. 68 146-160.
S a a r i , D.G. (1982). Inconsistencies of weighted summation voting systems. Math.
Oper. Res. 7 , 479-490.
S a a r i , D.G. (1984). The ultimate of chaos in weighted voting systems. Adv. Appl.
Math. 5, 286-308.
S a a r i , D.G. (October 1978). The Geometry of Departmental P o l i t i c s , t h e S c o r i n g
of Track Meets, a n d Arrow's Social Choice Theorem. Northwestern University
preprint.
Steinhaus, H., and Trybula S. (1959). On a paradox in applied probability. B u l l .
Acad. Polo S c i . , 7 , 67-69.
Vaupal, J .W., and Yashin, A.I. (1985). Heterogeneity's ruses: Some surprising ef-
fects of selection on population dynamics, The American S t a t i s t i c a n , 39,
176-185.
Wagner, C. (1982). Simpson's paradox in real life, The American S t a t i s t i c a n , 36,
46-48.