Does Experience Teach? Professionals and

http://www.econometricsociety.org/
Econometrica, Vol. 78, No. 3 (May, 2010), 1143–1154
DOES EXPERIENCE TEACH? PROFESSIONALS AND
MINIMAX PLAY IN THE LAB
JOHN WOODERS
Eller College of Management, University of Arizona, Tucson, AZ 85721-0108, U.S.A.
The copyright to this Article is held by the Econometric Society. It may be downloaded,
printed and reproduced only for educational or research purposes, including use in course
packs. No downloading or copying may be done for any commercial purpose without the
explicit permission of the Econometric Society. For such commercial purposes contact
the Office of the Econometric Society (contact information may be found at the website
http://www.econometricsociety.org or in the back cover of Econometrica). This statement must
be included on all copies of this Article that are made available electronically or in any other
format.
Econometrica, Vol. 78, No. 3 (May, 2010), 1143–1154
NOTES AND COMMENTS
DOES EXPERIENCE TEACH? PROFESSIONALS AND
MINIMAX PLAY IN THE LAB
BY JOHN WOODERS1
Does expertise in strategic behavior obtained in the field transfer to the abstract setting of the laboratory? Palacios-Huerta and Volij (2008) argued that the behavior of
professional soccer players in mixed-strategy games conforms closely to minimax play,
while the behavior of students (who are presumably novices in strategic situations requiring unpredictability) does not. We reexamine their data, showing that the play of
professionals is inconsistent with the minimax hypothesis in several respects: (i) professionals follow nonstationary mixtures, with action frequencies that are negatively
correlated between the first and the second half of the experiment, (ii) professionals
tend to switch between under- and overplaying an action relative to its equilibrium frequency, and (iii) the distribution of action frequencies across professionals is far from
the distribution implied by minimax. In each respect, the behavior of students conforms
more closely to the minimax hypothesis.
KEYWORDS: Minimax, mixed strategy, Nash equilibrium.
1. INTRODUCTION
SEVERAL RECENT PAPERS have established that the on-the-field behavior of
sports professionals in strategic situations requiring unpredictability is consistent with the minimax hypothesis and its generalization to the theory of mixedstrategy Nash equilibrium. (See Walker and Wooders (2001) and Hsu, Huang,
and Tang (2007) for tennis, and Chiappori, Levitt, and Groseclose (2002) and
Palacios-Huerta (2003) for soccer.) This raises an important question: Does
expertise in strategic behavior obtained in a familiar setting, for example, the
tennis court or the soccer field, transfer to an unfamiliar one? If it does, a
key implication is that the nature of the subject pool is a critical ingredient of
whether results obtained in the laboratory are useful for predicting behavior in
the field.
Following a literature using professionals as subjects in experiments, Palacios-Huerta and Volij (2008) recruited professional soccer players and students to play two mixed-strategy games—a “penalty kick” game they introduced and the well known O’Neill (1987) game.2 They reported extraordi1
I have benefited from many discussions with Mark Walker about mixed-strategy play, and
my thoughts on this topic have been influenced by those discussions. I am grateful to Diego
Moreno, Amnon Rapoport, Jason Shachat, Matt Van Essen, Mark Walker, and Myrna Wooders
for helpful comments and to Matt for outstanding research assistance. I am also grateful to Ron
Oaxaca, Jonah Gelbach, and Keisuke Hirano for comments on statistical issues. A co-editor and
four anonymous referees provided valuable comments.
2
See, for example, Cooper, Kagel, Lo, and Gu (1999), Garratt, Walker, and Wooders (2004),
List (2003), and Alevy, Haigh, and List (2007).
© 2010 The Econometric Society
DOI: 10.3982/ECTA7970
1144
JOHN WOODERS
nary results: the play of soccer professionals conformed remarkably closely to
the behavior predicted by the theory, whereas the play of students did not.
The finding for professionals is surprising since a long line of experimental
research has demonstrated that student subjects do not play equilibrium in
mixed-strategy games.3
The present paper reexamines the Palacios-Huerta and Volij (henceforth
PH-V) data and argues that the behavior of soccer professionals is inconsistent with the minimax hypothesis in several respects: (i) professionals follow
nonstationary mixtures, with action frequencies that are negatively correlated
across the first and second half of the data, (ii) they tend to switch between
halves between under- and overplaying an action relative to its equilibrium
frequency, and (iii) the distribution of action frequencies across players is far
from the distribution implied by minimax. In each respect, the behavior of students conforms more closely to the minimax hypothesis.
Paradoxically, this reexamination is motivated by the fact that in one respect,
the actual play of professionals is too close to the theoretically expected play.
The minimax hypothesis applied to the O’Neill game calls for each player to
choose cards (1, 2, 3, or J(oker)) according to an independent and identically
distributed (iid) mixture that assigns probability 0.4 to the J card, and probability 0.2 to each of the non-J cards.4 In PH-V’s experiment, while 80 J’s are
expected after 200 plays, the probability a player chooses between 79 and 81
J’s is only 0171.5 Hence among the 40 subjects who were professionals, we
expect to find only 685 for whom actual play is this close to expected play. Remarkably, we find 16. Such an outcome is extremely unlikely under minimax
play.6 The professionals’ empirical card frequencies exhibit the same excessive
closeness to the theoretically expected frequencies for each of the non-J cards
as well.
PH-V hypothesized that professionals do not literally follow the iid minimax mixture, but rather they “. . . try to ‘match’ some probabilities they have,
consciously or unconsciously, in mind.” In other words, professionals choose
cards as though a law of small numbers applies, keeping their empirical card
frequencies close to the minimax expected frequencies.
To investigate this hypothesis, we partition the data in the simplest way, into
the first and the second half. If professionals act to match the minimax frequencies, then their empirical card frequencies should exhibit a similar conformity
to the minimax frequencies in each half as they do in the data overall. More3
Important experimental studies of mixed-strategy play include O’Neill (1987), Rapoport and
Boebel (1992), Mookherjee and Sopher (1994), Ochs (1994), Shachat (2002), and Rosenthal,
Shachat, and Walker (2003). See Camerer (2003) for an in-depth survey.
4
See Wooders and Shachat (2001) for an analysis of repeated binary outcome games.
5
To avoid the ace bias noted in Brown and Rosenthal (1990), PH-V followed Shachat (2002)
in labeling the strategies red, brown, purple, and green in their experiment, but used O’Neill’s
original labeling in describing their results.
6
The odds that 16 or more players choose between 79 and 81 J’s is about 1 in 1900.
1145
DOES EXPERIENCE TEACH?
over, the striking difference found between professionals and students should
also hold in each half.
We find no evidence that professionals match minimax frequencies. Even
though tests based on only half the data have less power, the hypotheses that
professionals choose cards individually (or jointly) according to the minimax
frequencies are rejected at too high a rate to be consistent with the minimax
hypothesis. They are rejected at similar rates for professionals and students.
The essential difference between professionals and students is how their play
changes between halves. The minimax hypothesis requires that each player
choose cards according to the same iid mixture at each round; hence, the frequencies with which a card is chosen in each half are statistically independent.
The hypothesis of independence is rejected for professionals, for each card in
the O’Neill game, in favor of negative correlation. For students, in contrast,
this hypothesis is only rejected for the J card. In this respect, the behavior of
students conforms more closely to the theory.
A related implication is that if a subject happens to play a card with, say, less
than its equilibrium frequency in the first half (i.e., he “underplays” it), this
has no bearing on the likelihood he will underplay it in the second half. We
show that professionals who underplay a card in the first half tend to overplay
it in the second half (and vice versa). Students, in conformity with the theory,
exhibit no such tendency.
The tendency of professionals to change their play across halves has a powerful effect: it causes their overall card frequencies to be too closely clustered
around the expected card frequencies. The Kolmogorov–Smirnov (KS) goodness of fit test applied to the professionals’ overall card choices resoundingly
rejects minimax play for each card individually and for all four cards jointly; the
distribution of card frequencies across professionals is far from that implied by
the minimax hypothesis. The same test yields a rejection only for the J card for
students.
2. THE PH-V EXPERIMENT
The penalty kick (P-K) game is a stylized representation of a penalty kick in
soccer, where the kicker (row) chooses whether to kick left (A) or right (B),
and the goalie (column) simultaneously chooses whether to cover left (A) or
right (B). In both games of PH-V’s experiment, the winner receives 1 Euro and
the payoff numbers are the probability that the row player wins.
A
B
A
0.60
0.90
B
0.95
0.70
P-K Game
1
2
3
J
1
0
1
1
0
2
1
0
1
0
3
1
1
0
0
J
0
0
0
1
The O’Neill Game
1146
JOHN WOODERS
Eighty professional soccer players—40 kickers and 40 goalies—participated.
For the P-K game, 40 professionals in 20 fixed pairs, with a kicker in the
row role and a goalie in the column row, played 150 rounds. For the O’Neill
game, another 40 professionals were paired in the same fashion, but played
200 rounds. A total of 160 college students participated, half with soccer experience and half without. To sharpen the contrast, we focus on the 80 students
without soccer experience. Like the professionals, 20 fixed pairs of students
played 150 rounds of the P-K game and 20 fixed pairs played 200 rounds of the
O’Neill game. In both games, the subjects played 15 practice rounds. Subjects
were not told the number of rounds to be played.
Comparing Professionals and Students in the O’Neill Game
Tables I and II show the empirical card frequencies of professionals and students, respectively, in the first half and the second half of the O’Neill game.
Considering the overall data, PH-V found that the null hypothesis that a subject chooses the J card with probability 0.4 is rejected at the 5% level for
only 3 professionals, but for 9 students (2 rejections are expected).7 For the
non-J cards, the minimax binomial model (MBM) is rejected at the 5% level
in only 4 instances for professionals, but in 13 instances for students (6 rejections are expected). Applying the Pearson goodness of fit test to each subject’s
choice of all four cards, the null hypothesis that the subject chose cards according to the minimax mixture is rejected at the 5% level for only 2 professionals,
while it is rejected for 7 students. The joint hypothesis that all 40 players choose
their cards according to the equilibrium mixture is not rejected for professionals (p-value 0988), while it is rejected at the 1% level for students (p-value
0006).
To examine whether the PH-V data are consistent with professionals choosing cards to “match” the minimax frequencies, we partition the data into the
first and last 100 rounds.8 If professionals are indeed matching frequencies,
then the close conformity of their overall empirical card frequencies to the
expected frequencies should also be found when each half of the data is considered in isolation.9 The tables show this is not the case. In particular, we note
the following observations:
• The null hypothesis that a player chooses J according to the MBM is rejected
at roughly three times the expected rate for both professionals and students.
7
These results are reported in Tables X and XIV, respectively, in PH-V and on the right hand
side of Tables I and II here.
8
Besides being the obvious one, this split has several other advantages: (i) It gives the PH-V
matching hypothesis its best chance since professionals might not match frequencies over very
short intervals. (ii) The statistical tests have the same power on each half and hence are comparable.
9
Subjects did not know the number of rounds to be played and hence could not match the
minimax J frequency by choosing, say, 30% J’s in the first half and 50% J’s in the second half.
1147
DOES EXPERIENCE TEACH?
TABLE I
PROFESSIONALS PLAYING O’NEILLa
First Half
Pair
Playerb
1
2
3
1
C
R
C
R
C
R
C
R
C
R
C
R
C
R
C
R
C
R
C
R
C
R
C
R
C
R
C
R
C
R
C
R
C
R
C
R
C
R
C
R
23
19
20
23
25
25
18
19
22
20
27∗
21
23
20
26
22
20
22
21
14
27∗
21
17
16
21
17
20
17
21
19
23
17
22
16
20
25
21
19
22
21
23
28∗
22
22
21
25
20
19
21
17
18
20
21
24
15
23
16
21
25
17
22
21
23
18
23
21
31∗∗
12∗∗
23
19
20
19
27∗
28∗∗
29∗∗
16
21
20
21
19
24
24
23
22
23
14
21
18
16
28∗
17
16
19
13∗
24
19
20
21
16
7∗∗
23
12∗∗
20
23
17
18
22
18
24
13∗
16
14
18
21
19
17
18
18
21
21
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
Second Half
J
30∗∗
29∗∗
35
33
31∗
36
41
44
41
35
38
43
37
43
35
36
44
36
38
62∗∗
28∗∗
46
40
43
39
44
27∗∗
53∗∗
32∗
49∗
41
50∗∗
33
35
32∗
42
40
43
36
39
†
‡
†
‡
‡
1
2
3
16
19
20
18
14
17
11∗∗
24
18
16
14
21
18
23
19
17
21
15
18
21
23
20
22
24
18
26
21
20
21
24
16
24
23
25
21
17
13∗
22
15
18
14
17
19
21
14
14
17
22
18
22
19
21
17
19
15
20
20
18
14
19
18
17
17
22
20
16
27
25
14
21
13∗
20
16
18
20
23
20
24
20
23
18
34∗∗
27∗
27∗
18
26
24
18
26
13∗
24
21
22
13∗
17
20
21
22
27∗
27∗
18
22
21
16
21
21
17
23
16
21
19
25
23
17
23
19
17
29∗∗
15
19
Overall
J
52∗∗
30∗∗
34
34
54∗∗
43
48
36
38
49∗
43
37
43
45
49∗
43
38
45
41
33
41
41
40
38
41
37
35
32∗
49∗
34
52∗∗
31∗
38
40
36
41
50∗∗
25∗∗
50∗∗
40
1
2
†
‡
3
J
∗∗ ∗∗
∗
∗
‡
‡
†
∗∗
∗∗
∗
∗∗
∗
∗∗
†
∗∗
‡
∗
∗
‡
∗
C
439∗∗ 442∗∗ 401
718∗∗
362∗∗ 352∗∗ 414
872∗∗
R
393 409 357∗∗ 841∗ ‡ 411 402 433∗ 754∗∗
Overall 832 851∗∗ 758∗ 1559
773 754∗ 847∗ 1626
a Symbols: ∗∗ and ∗ denote rejection of the minimax binomial model for a given card at the 5% and 10% level,
respectively. ‡ and † denote rejection of the minimax multinomial model at the 5% and 10% level.
b C and R denote Column and Row, respectively.
1148
JOHN WOODERS
TABLE II
STUDENTS PLAYING O’NEILLa
First Half
Pair
Playerb
1
2
3
1
C
R
C
R
C
R
C
R
C
R
C
R
C
R
C
R
C
R
C
R
C
R
C
R
C
R
C
R
C
R
C
R
C
R
C
R
C
R
C
R
16
19
15
19
20
16
17
13∗
17
11∗∗
25
16
20
21
20
22
15
14
27
33∗∗
15
19
20
25
16
17
19
18
17
14
23
21
19
22
25
26
28∗
25
24
24
19
28∗∗
19
16
21
24
18
16
20
19
24
19
12∗∗
20
20
21
22
19
15
27
28∗
14∗
19
21
18
26
19
20
21
19
21
22
21
14
21
21
25
19
21
16
23
22
17
14
20
21
20
15
17
22
21
27∗
19
22
15
17
19
24
25
20
21
19
13∗
17
17
18
16
23
17
17
17
22
18
20
17
20
19
24
17
14
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
C
398
R
395
Overall 793
404
401
805
Second Half
J
42
31∗
49∗
51∗∗
39
39
45
56∗∗ ‡
46
48∗
30∗∗
38
49∗
37
45
40
44
43
33
20∗∗ ‡
36
48
48
37
49∗
39
46
39
45
50∗∗
39
35
42
44
37
33
28∗∗ ‡
32∗
38
46
368∗ 830
398 806
766 1636
1
2
3
12∗∗
26
22
22
26
24
18
16
22
16
18
21
20
25
17
17
25
16
23
23
30∗∗
20
23
31∗∗
24
18
20
16
22
14
22
28∗
23
17
18
27∗
23
35∗∗
19
15
22
26
22
20
17
17
18
27∗
15
19
22
24
18
17
25
24
21
24
22
25
24
21
25
21
22
13∗
20
26
20
23
22
17
20
18
20
21
23
18
18
25
29∗∗
17
25
18
25
22
25
16
22
25
25
20
14
21
22
16
20
16
17
20
20
17
22
19
25
21
25
29∗∗
23
23
15
16
22
20
19
17
20
14
24
19
Overall
J
37
31∗
31∗
40
32
37
39
41
41
40
35
35
48∗
37
36
43
34
44
38
32
26∗∗
42
30∗∗
29∗∗
29∗∗
48∗
35
29∗∗
35
40
41
39
35
45
43
35
34
33
39
41
1
†
2
∗∗
3
J
∗∗
∗∗
∗∗
∗∗
†
‡
∗ ∗∗
‡
∗∗
∗∗
†
∗
∗∗
†
‡
∗
∗
∗∗ ∗∗
∗∗
∗∗
∗∗
‡
‡
‡
∗∗
∗∗
‡
∗∗ ∗
†
∗
∗∗
∗∗
†
‡
‡
‡
∗∗
‡
427 416 439∗∗ 718∗∗
‡ 427 426 386
761∗ †
∗∗
∗
† 854 842 825 1479∗∗
∗∗
∗
∗∗
∗
∗
‡
‡
a Symbols: ∗∗ and ∗ denote rejection of the minimax binomial model for a given card at the 5% and 10% level,
respectively. ‡ and † denote rejection of the minimax multinomial model at the 5% and 10% level.
b C and R denote Column and Row, respectively.
DOES EXPERIENCE TEACH?
1149
It is rejected at the 5% level for 7 professionals and 6 students in the first
half, and 7 professionals and 5 students in the second half. Only 2 rejections
in each half are expected under the null.
• For the non-J cards, the MBM is rejected at the expected rate for professionals and students in both halves. There are 6 and 4 rejections for professionals
and students, respectively, in the first half; there are 3 and 6 rejections for
professionals and students, respectively, in the second half. A total of 6 rejections (2 rejections per non-J card) are expected at the 5% level in each
half.
• The null hypothesis that a player chooses all four cards according to the minimax multinomial model (MMM) is rejected at the 5% level (by the Pearson
goodness of fit test) for 3 professionals and 3 students in the first half, and
for 3 professionals and for 4 students in the second half.
• The joint hypothesis that all the players in a given role choose a given card
according to the MBM is rejected more frequently for professionals than for
students. In the first half, it is rejected at the 5% level for professionals in
the column role for the 1, 2, and J cards and in the row role for the 3 card.
The same null is not rejected for students, for either role or any card. In the
second half, the joint MBM is rejected for professionals in the column role
for the 1, 2, and J cards and for the row role for the J card. For students, it is
rejected only for column playing the 3 and J cards.
• The joint hypothesis that all the players in a given role choose all four cards
according to the MMM is rejected at the 5% level for the row role, for both
professionals and students in the first half; it is not rejected for either professionals or students for either role or when the roles are combined in the
second half.
These results show that in each half there is little difference between professionals and students in terms of the number of 5% rejections of either the
MBM or the MMM, in contrast to the overall data.10 For both groups, we obtain more than the expected number of rejections of the MBM for the J card.
The joint MBM is rejected more frequently than expected, especially for professionals.
Also inconsistent with the PH-V matching hypothesis, there is no tendency
in either half for the actual play of professionals to be too close to expected
play. For example, in each half we expect 9.6 of 40 players to choose either 39,
40, or 41 jokers. In the first and second half there were, respectively, 7 and 8
professionals whose actual play was this close to expected play, while the analogous numbers for students are 6 and 10.
10
See Wooders (2008) for the p-values for these tests. The numbers of rejections of the MBM
and MMM continue to be remarkably similar for professionals and students if the data are split
by thirds or quarters.
1150
JOHN WOODERS
3. NEGATIVE CORRELATION AND SWITCHING
If professionals are not matching frequencies, then why are their overall empirical card frequencies too close to the minimax frequencies? We will show
that for each card, the frequencies with which it is played by professionals in
each half are negatively correlated. Indeed, professionals tend to switch between halves between under- and overplaying each card relative to its minimax
mixture.
In the equilibrium of the repeated O’Neill game, each player chooses cards
according to the same iid mixture at each round. An implication is that the
frequency with which a card is played in the first half is statistically independent
of the frequency with which it is played in the second half. Table III shows that
this is not the case. It reports, for each card, the value of the Spearman rank
correlation coefficient R between these frequencies.11
The choice frequencies of professionals are negatively correlated between
halves for each card, with the null hypothesis of independence rejected for each
card at the 5% significance level. For students, the same null is not rejected for
any of the non-J cards. For the J card, however, independence is rejected at
the 5% significance level, with the first- and second-half frequencies positively
correlated.
A related implication of the minimax hypothesis is that if a subject plays a
card with, say, a frequency below its equilibrium frequency in the first half, this
has no bearing on the likelihood he will underplay the card in the second half;
that is, a player is equally likely to switch as not. There were 36 professionals
who either strictly under- or overplayed the J card in both halves. Of these,
25 switched between underplaying the J card and overplaying it, and hence the
hypothesis that a player is equally likely to switch as not is rejected (p-value
0.029). For the 1, 2, and 3 cards there are, respectively, 32, 29, and 37 professionals who strictly underplayed or overplayed the card in both halves. With 26,
TABLE III
SPEARMAN RANK CORRELATION COEFFICIENTS
Professionals
J
1
2
3
Students
R
t
p-Value
R
t
p-Value
−0.3195
−0.5804
−0.3688
−0.3463
−2.08
−4.39
−2.45
−2.28
0.04432
0.00009
0.01900
0.02831
0.3159
0.2882
−0.0160
−0.0427
2.05
1.86
−0.10
−0.26
0.04731
0.07064
0.92087
0.79627
11
Under the null hypothesis of independence, the distribution of R is known and hence R can
be used to obtain a nonparametric test of independence. (See Gibbons and Chakraborti (2003,
pp. 422–431).) The Spearman R is computed using the web page http://faculty.vassar.edu/lowry/
corr_rank.html, authored by Richard Lowry, and it corrects for ties in ranks.
1151
DOES EXPERIENCE TEACH?
24, and 27 switches, respectively, the null hypothesis that the switching probability is 0.5 is rejected for each of these cards as well.12 In each case, the null is
rejected as there are too many switches.
The behavior of students in contrast, is consistent with this implication of the
minimax hypothesis. For the J card, there were 36 students who strictly underor overplayed the card in both halves and, of these, only 13 switched. For the 1,
2, and 3 cards, respectively, 17 of 34, 16 of 31, and 16 of 31 students switched.
The null hypothesis is not rejected for any card.
Switching and the Consequences for Overall Play
If a subject overplays a card in the first half, but underplays it in the second
half, then the sample variance will tend to be too small relative to the sample
variance under equilibrium play. Consider, for example, the 200-time repeated
matching pennies game. If a subject chooses H with probability 05 + γ in the
first half and 05 − γ in the second half, with 0 ≤ γ ≤ 05, then 100 heads are
expected, but the variance is only 50(1 − 4γ 2 ). Given a collection of subjects
whose play varies in this fashion, there will tend to be too many having chosen
approximately the expected number.13 Under the null hypothesis of a fair coin,
applying the binomial test there will tend to be too many subjects with large
p-values (and too few with small p-values).
Table IV shows the results of applying the KS test to the empirical distributions of 40 p-values obtained when the randomized binomial test is applied to
each card individually and the Pearson goodness of fit test is applied all four
cards jointly. For each card, the KS test resoundingly rejects the joint null hyTABLE IV
OVERALL KS TESTS OF CONFORMITY TO U[0 1]
Professionals
J
1
2
3
1–2–3–J
Students
KS
p-Value
KS
p-Value
1.477
2.332
1.917
1.693
2.434
0.025429
0.000038
0.001285
0.006456
0.000014
1.484
1.239
1.160
1.071
1.292
0.024464
0.092692
0.135789
0.201243
0.071098
12
Although each test is meaningful on its own, the tests are not independent. A subject who
switches from under- to overplaying J must switch from over- to underplaying at least one of the
non-J cards.
13
This also introduces a bias in favor of too few runs and away from the negative serial correlation (see Brown and Rosenthal (1990) and O’Neill (1991)) exhibited by students in the O’Neill
game.
1152
JOHN WOODERS
pothesis that professionals play according to the MBM; each empirical cumulative distribution function (cdf) has too many large p-values.14 In contrast, the
same null is not rejected for any one of the non-J cards for students. The hypothesis that all four cards are jointly chosen according to the minimax mixture
is rejected for professionals, but it is not rejected for students. The empirical
distribution of card frequencies for professionals is far from that implied by
the minimax hypothesis.
4. DISCUSSION
The differences between professionals and students found here in the
O’Neill game are exhibited in the P-K game as well. Table V reports the value
of the Spearman R between the first and second half (i.e., 75 of 150 plays) frequencies of right (B). Since there are only 20 subjects in each role, this test has
lower power for the P-K game than for the O’Neill game, where, since the equilibrium mixture was the same for both roles, we could pool the data for both
row and column. Nonetheless, for professionals, the null of independence is
rejected at the 10% level and just fails to be rejected at the 5% level, while
independence is not rejected for students.
Professionals in the column role tend to switch between under- and overplaying right, with 14 players switching. Under the null that professionals in
the column role are equally likely to switch as not, the probability of 14 or
more switches is 0.0577; the null just fails to rejected at the 10% level. Only
7 professionals in the row role switch, which is less than the number expected.
This is the only instance in which professionals exhibit no tendency to switch;
it is also the only instance in which PH-V rejected the joint minimax model for
professionals.
Table VI shows the results of applying the KS test to the distributions of
p-values obtained when the randomized binomial test is applied to the left–
right choices in the P-K game. For professionals, minimax play is rejected for
both roles, with the empirical cdf for the column role exhibiting too many large
TABLE V
SPEARMAN RANK CORRELATION COEFFICIENTS
Professionals
Row
Column
Students
R
t
p-Value
R
t
p-Value
−0.4153
−0.4354
−1.94
−2.05
0.06821
0.05522
−0.0982
−0.2155
−0.41
−0.94
0.68053
0.36153
14
See Wooders (2008) for a description of the randomized binomial test and for figures showing
the empirical cdf’s.
1153
DOES EXPERIENCE TEACH?
TABLE VI
OVERALL KS TESTS OF CONFORMITY TO U[0 1]
Professionals
Row
Column
Students
KS
p-Value
KS
p-Value
1.403
1.508
0.039058
0.021187
2.732
2.509
0.000001
0.000007
p-values. Minimax play is resoundingly rejected for students, with too many
small p-values for both roles.
Since there is no obvious reason why professionals (but not students) would
follow nonstationary mixtures, efforts to replicate the PH-V results seem especially important. Our results suggest that it may be useful to focus on the
whether professionals follow nonstationary mixtures. Although not an exact
replication, Levitt, List, and Reiley (2010) found no evidence that the behavior
of American Major League Soccer players conforms more closely to minimax
than does the behavior of students in the O’Neill game.
REFERENCES
ALEVY, J., M. HAIGH, AND J. LIST (2007): “Information Cascades: Evidence From a Field Experiment With Financial Market Professionals,” Journal of Finance, 62, 151–180. [1143]
BROWN, D., AND R. ROSENTHAL (1990): “Testing the Minimax Hypothesis: A Reexamination of
O’Neill’s Experiment,” Econometrica, 58, 1065–1081. [1144,1151]
CAMERER, C. (2003): Behavioral Game Theory, Experiments in Strategic Interaction. Princeton:
Princeton University Press. [1144]
CHIAPPORI, P., S. LEVITT, AND T. GROSECLOSE (2002): “Testing Mixed Strategy Equilibria When
Players Are Heterogeneous: The Case of Penalty Kicks in Soccer,” American Economic Review,
92, 1138–1151. [1143]
COOPER, D., J. KAGEL, W. LO, AND Q. L. GU (1999): “Gaming Against Managers in Incentive
Systems: Experimental Results With Chinese Students and Chinese Managers,” American Economic Review, 89, 781–804. [1143]
GARRATT, R., M. WALKER, AND J. WOODERS (2004): “Behavior in Second-Price Auctions by
Highly Experienced eBox Buyers and Sellers,” Working Paper 02-04, University of California
at Santa Barbara. [1143]
GIBBONS, J., AND S. CHAKRABORTI (2003): Nonparametric Statistical Inference. New York:
Dekker. [1150]
HSU, S., C. HUANG, AND C. TANG (2007): “Minimax Play at Wimbledon: Comment,” American
Economic Review, 97, 517–523. [1143]
LEVITT, S., J. LIST, AND D. REILEY (2010): “What Happens in the Field Stays in the Field: Professionals Do Not Play Minimax in Laboratory Experiments,” Working Paper 07-11, University
of Arizona. Econometrica (forthcoming). [1153]
LIST, J. (2003): “Does Market Experience Eliminate Market Anomalies,” Quarterly Journal of
Economics, 118, 41–71. [1143]
MOOKHERJEE, D., AND B. SOPHER (1994): “Learning Behavior in an Experimental Matching
Pennies Game,” Games and Economic Behavior, 7, 62–91. [1144]
OCHS, J. (1994): “Games With a Unique Mixed Strategy Equilibria: An Experimental Study,”
Games and Economic Behavior, 10, 202–217. [1144]
1154
JOHN WOODERS
O’NEILL, B. (1987): “Nonmetric Test of the Minimax Theory of Two-Person Zero-Sum Games,”
Proceedings of the National Academy of Sciences, 84, 2106–2109. [1143,1144]
(1991): “Comments on Brown and Rosenthal’s Reexamination,” Econometrica, 59,
503–507. [1151]
PALACIOS-HUERTA, I. (2003): “Professionals Play Minimax,” Review of Economic Studies, 70,
395–415. [1143]
PALACIOS-HUERTA, I., AND O. VOLIJ (2008): “Experientia Docent: Professionals Play Minimax
in Laboratory Experiments,” Econometrica, 76, 71–115. [1143]
RAPOPORT, A., AND R. BOEBEL (1992): “Mixed Strategies in Strictly Competitive Games: A Further Test of the Minimax Hypothesis,” Games and Economic Behavior, 4, 261–283. [1144]
ROSENTHAL, R., J. SHACHAT, AND M. WALKER (2003): “Hide and Seek in Arizona,” International
Journal of Game Theory, 32, 273–293. [1144]
SHACHAT, J. (2002): “Mixed Strategy Play and the Minimax Hypothesis,” Journal of Economic
Theory, 104, 189–226. [1144]
WALKER, M., AND J. WOODERS (2001): “Minimax Play at Wimbledon,” American Economic Review, 91, 1521–1538. [1143]
WOODERS, J. (2008): “Does Experience Teach? Professionals and Minimax Play in the Lab,”
Working Paper 08-04, University of Arizona. [1149,1152]
WOODERS, J., AND J. SHACHAT (2001): “On the Irrelevance of Risk Attitudes in Repeated TwoOutcome Games,” Games and Economic Behavior, 34, 342–363. [1144]
Dept. of Economics, Eller College of Management, University of Arizona,
401 McClelland Hall, Tucson, AZ 85721-0108, U.S.A.; [email protected].
edu.
Manuscript received June, 2008; final revision received December, 2009.