Testing Transitivity and other Properties Using a True and Error Model

This Pump Sucks:
Testing Transitivity with
Individual Data
Michael H. Birnbaum and
Jeffrey P. Bahra
California State University,
Fullerton
Transitivity of Preference
• If A > B and B > C then A > C.
• Satisfy it or become a money pump.
• But transitivity may not hold if data
contain “error.”
• And different people might have
different “true” preferences.
Tversky (1969)
• Tversky (1969) reported that
selected subjects showed a pattern
of intransitive data consistent with a
lexicographic semi-order.
• Tversky tested Weak Stochastic
Transitivity: If P(A>B) > 1/2 and
P(B>C) > 1/2 then P(A>C) > 1/2.
Issues
• Iverson & Falmagne (1985) argued
that Tversky’s statistical analysis was
incorrect of WST.
• Tversky went on to publish transitive
theories of preference (e.g., CPT).
Renewed Interest in
Intransitive Preference
• New analytical methods for analysis of transitivity
(Iverson, Myung, & Karabatsos; Regenwetter &
Stober, et al); Error models (Sopher & Gigliotti,
‘93; Birnbaum, ‘04; others).
• Priority Heuristic (Brandstaetter, et al., 2006);
stochastic difference model (González-Vallejo,
2002; similarity judgments, Leland, 1994; majority
rule, Zhang, Hsee, Xiao, 2006). Renewed interest
in Fishburn, as well as in Regret Theory.
Lexicographic Semi-order
•
•
•
•
•
•
•
G = (x, p; y, 1 - p). F = (x’, q; y’, 1 - q).
If y - y’ ≥ DL choose G (DL = $10)
If y’ - y ≥ DL choose F
If p - q ≥ DP choose G (DP = 0.1)
If q - p ≥ DP choose F
If x > x’ choose G; if x’ > x choose F;
Otherwise, choose randomly.
Priority Heuristic
• “Aspiration level” is 10% of largest prize,
rounded to nearest prominent number.
• Compare gambles by lowest consequences.
If difference exceeds the aspiration level,
choose by lowest consequence.
• If not, compare probabilities; choose by
probability if difference ≥ 0.1
• Compare largest consequences; choose by
largest consequences.
New Studies of Transitivity
• Work currently under way testing
transitivity using same procedures as
used in other decision research.
• Participants view choices via the
WWW, click button beside the
gamble they would prefer to play.
• Today’s talk: Single-S data.
Studies with Roman Gutierez
• Four studies used Tversky’s 5 gambles,
formatted with tickets or with pie charts.
• Studies with n = 417 and n = 327 with small
or large prizes ($4.50 or $450)
• No pre-selection of participants.
• Participants served in other risky DM
studies, prior to testing (~1 hr).
Three of Tversky’s (1969)
Gambles
• A = ($5.00, 0.29; $0, 0.79)
• C = ($4.50, 0.38; $0, 0.62)
• E = ($4.00, 0.46; $0, 0.54)
Priority Heurisitc Predicts:
A preferred to C; C preferred to E,
and E preferred to A.
Findings
• Results were surprisingly transitive, unlike
Tversky’s data (est. 95% transitive).
• Of those 115 who were perfectly reliable,
93 perfectly consistent with EV (p), 8 with
opposite ($), and only 1 intransitive.
• Differences: no pre-test; Probability
represented by # of tickets (100 per urn),
rather than by pies; Participants have
practice with variety of gambles, &
choices;Tested via Computer.
Pie Chart Format
Pies: with or without Numerical
probabilities
• 321 participants randomly assigned
conditions with probabilities displayed as
pies (spinner), either with numerical
probabilities displayed or without.
• Of 105 who were perfectly reliable, 84
were perfectly consistent with EV (prob),
13 with the opposite order ($); 1 consistent
with LS.
Findings
• Priority Heuristic predicted violations of
transitivity were rare and rarely repeated
when probability and prize information
presented numerically.
• Violations of transitivity are still rare but
more frequent when probability
information presented only graphically.
• Evidence of Dimension Interaction violates
PH and additive Difference models.
Response to BirnbaumGutierrez
• Perhaps the intransitivity only develops in
longer studies. Tversky used 20
replications of each choice.
• Perhaps consequences of Tversky’s gambles
diminished since 1969 due to inflation.
Perhaps transitivity occurs because those
prizes are too small.
Birnbaum & Bahra
• Collected up to 40 choices/pair per
person. (20 reps). 2 Sessions, 1.5 hrs,
1 week apart.
• Cash prizes up to $100.
• 51 participants, of whom 10 to win the
prize of one of their chosen gambles.
• 3 5 x 5 Designs to test transitivity
vs. Priority heuristic predictions
Notation-Two-branch
Gambles
•
•
•
•
G = (x, p; y, 1 - p); x > y ≥ 0
L = Lower Consequence
P = Probability to win higher prize
H = Higher consequence
LH Design
•
•
•
•
•
A = ($84, .50; $24)
B = ($88, .50; $20)
C = ($92, .50; $16)
D = ($96, .50; $12)
E = ($100, .50; $8)
LP Design
•
•
•
•
•
A = ($100, .50; $24)
B = ($100, .54; $20)
C = ($100, .58; $16)
D = ($100, .62; $12)
E = ($100, .66; $8)
PH Design
•
•
•
•
•
A = ($100, .50; $0)
B = ($96, .54; $0)
C = ($92, .58; $0)
D = ($88, .62; $0)
E = ($84, .66; $0)
Priority Heuristic
Predictions
• LH Design: E > D > C > B > A, but A > E
• LP Design: A ~ B ~ C ~ D ~ E, but A > E
• PH Design: A > B > C > D > E but E > A
One Rep = 2 choices/pair
First
A
B
C
D
E
A
1
1
1
1
Second Gamble
B
C
D
2
2
2
2
2
1
2
1
1
1
1
1
E
2
2
2
2
Analysis
• Each replication of each design has
20 choices; hence 1,048,576 possible
data patterns (220) per rep.
• There are 1024 possible consistent
patterns (Rij = 2 iff Rji = 1, all i, j).
• There are 120 (5!) possible transitive
patterns.
Within-Rep Consistency
• Count the number of consistent choices in
a replicate of 20 choices (10 x 2).
• If a person always chose the same button,
consistency = 0.
• If a person was perfectly consistent,
consistency = 10.
• Randomly choosing between 1 and 2
produces expected consistency of 5.
Intransitive and Consistent
LH
First
A
B
C
D
E
A
1
1
2
2
Second Gamble
B
C
D
2
2
1
2
2
1
2
1
1
2
1
1
E
1
1
2
2
Within-Replicate
Consistency
• The average rate of agreement was
8.63 (86% self-agreement).
• 46.4% of all replicates were scored
10; an additional 19.9% were scored 9.
LH Design: Overall Proportions
Choosing Second Gamble
First
A
A
B
0.58
0.61
0.64
0.70
C
D
E
Second Gamble
B
C
D
0.41 0.38
0.40
0.59
0.61 0.55
0.69 0.66
E
0.34 0.27
0.36 0.30
0.44 0.32
0.33
0.66
LP Design: Overall Proportions Choosing
Second Gamble
First
A
A
B
0.54
0.54
0.56
0.60
C
D
E
Second Gamble
B
C
D
0.44 0.43
0.42
0.55
0.56 0.53
0.59 0.57
E
0.42 0.36
0.42 0.38
0.45 0.40
0.41
0.56
PH Design: Overall Proportions
Choosing Second Gamble
PH
First
A
B
C
D
E
A
0.37
0.34
0.34
0.34
Second Gamble
B
C
D
0.61 0.64
0.61
0.37
0.35 0.33
0.33 0.35
E
0.64 0.64
0.63 0.65
0.64 0.64
0.63
0.34
Majority Data WST
•
•
•
•
LH Design A>B>C>D>E
LP Design A>B>C>D>E
PH Design E>D>C>B>A
Patterns consistent with special TAX
with “prior” parameters.
• But this analysis hides individual diffs
Individual Data
• Choice proportions calculated for
each individual in each design.
• These were further broken down
within each person by replication.
S# 8328 C = 9.6 Rep = 20
LH
A
A
B
C
D
E
Second Gamble
B
C
D
E
0.02 0.02 0.00 0.02
0.02 0.00 0.02
0.02 0.00
0.02
S# 8328 C = 9.8 Rep = 20
LP
A
A
B
C
D
E
Second Gamble
B
C
D
E
0.05 0.02 0.00 0.00
0.00 0.00 0.00
0.05 0.02
0.00
S# 8328 C = 9.9 Rep = 20
PH
A
A
B
C
D
E
Second Gamble
B
C
D
E
1.00 1.00 1.00 0.98
1.00 1.00 1.00
0.95 1.00
0.95
S# 6176 C = 9.8 Rep = 20; started with this pattern,
then switched to perfectly consistent with the
opposite pattern for 4 replicates at the end of the
first day; back to this pattern for 10 reps on day 2.
PH
A
A
B
C
D
E
Second Gamble
B
C
D
E
0.28 0.20 0.23 0.20
0.25 0.20 0.20
0.20 0.20
0.20
S# 684 C = 8.1 Rep = 14; an intransitive pattern
opposite that predicted by priority heuristic.
LP
A
A
B
C
D
E
Second Gamble
B
C
D
E
0.07 0.57 0.71 0.50
0.18 0.54 0.68
0.14 0.57
0.14
S# 7663 C = 6.3 Rep = 10; an intransitive pattern
consistent with priority heuristic, DP = 0.05. Few
reps and low self-consistency in this case.
PH
A
A
B
C
D
E
Second Gamble
B
C
D
E
0.15 0.75 0.75 0.90
0.45 0.55 0.65
0.30 0.55
0.30
Data Summary
• For n = 51, there are 153 matrices. Of
these, 90% were perfectly consistent
with WST: P(A,B) ≥ 1/2 & P(B,C) ≥ 1/2
then P(A,C) ≥ 1/2.
• 29 people had all three arrays fitting
WST; no one had all three arrays with
intransitive patterns.
Summary of WST
Individuals
Response
Pattern
A>B>C>D>E
LH
LP
PH
26
27
15
E>D>C>B>A
13
16
30
Other trans
5
3
2
LS
3
1
1
Other Intrans 4
4
3
29 People with 3 Perfectly
WST Patterns
LH
A
A
A
A
E
E
E
E
LP PH
A A
A E
E
A
E
E
A A
A E
E A
E E
Number
1
14 (prior TAX)
0
3
4
0
4
3
Within-Person Changes in
Preference Pattern
• Criterion: Person must show perfect
consistency (10 out of 10) to one pattern in
one replication, and perfect consistency to
another pattern on another replication.
• 15 Such cases were found (10%). There
may be other cases where the data are less
consistent.
Summary
• Recent studies fail to confirm systematic
violations of transitivity predicted by
priority heuristic. Adds to growing case
against this descriptive model.
• Individual data are mostly transitive.
• Next Q: From individual data, can we
predict, for example, from these data to
other kinds of choices by same person, e.
g., tests of SD?