"Mises Redux" -- Redux: Fifteen Arguments against

"Mises Redux" -- Redux: Fifteen Arguments against Finite Frequentism
Author(s): Alan Hájek
Source: Erkenntnis (1975-), Vol. 45, No. 2/3, Probability, Dynamics and Causality (Nov., 1996),
pp. 209-227
Published by: Springer
Stable URL: http://www.jstor.org/stable/20012727
Accessed: 15/03/2010 17:43
Your use of the JSTOR archive indicates your acceptance of JSTOR's Terms and Conditions of Use, available at
http://www.jstor.org/page/info/about/policies/terms.jsp. JSTOR's Terms and Conditions of Use provides, in part, that unless
you have obtained prior permission, you may not download an entire issue of a journal or multiple copies of articles, and you
may use content in the JSTOR archive only for your personal, non-commercial use.
Please contact the publisher regarding any further use of this work. Publisher contact information may be obtained at
http://www.jstor.org/action/showPublisher?publisherCode=springer.
Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed
page of such transmission.
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of
content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms
of scholarship. For more information about JSTOR, please contact [email protected].
Springer is collaborating with JSTOR to digitize, preserve and extend access to Erkenntnis (1975-).
http://www.jstor.org
ALAN HAJEK
MISES REDUX" - REDUX: FIFTEEN ARGUMENTS AGAINST
FINITE FREQUENTISM
ABSTRACT. According to finite frequentism, the probability of an attributeA in a finite
class B
reference
fifteen
arguments
is the relative
against
frequency
this position.
1.
The most
of actual
occurrences
of A within
B.
I present
INTRODUCTION1
of probability
is frequentism.
interpretation
a
the
that
coin
lands
heads when
says:
Roughly,
probability
frequentism
tossed is the number of times that the coin lands heads, divided by the total
widely
accepted
of times that the coin is tossed; the probability
that a radium atom
so decay, divided
is
of
atoms
in
the
number
radium
1500
that
years
decays
so
on.
of
the
number
and
total
radium
This
should
sound familiar
atoms;
by
?
?
all too familiar
for somehow
this notion still pervades much scientific
it
But
should be rejected, as Iwill argue here ?
about
probability.
thinking
fifteen times over.
number
or philosophically
To philosophers
inclined scientists,
the demise of
is familiar, I admit, even though it hasn't quite been universally
frequentism
are many of the arguments
that Iwill present here
accepted.2 Familiar too
?
were
some
of
them
Richard
indeed,
inspired by
Jeffrey's "Mises Redux"
I
to
it
will
be
useful
have
them
hope
(1977)?though
gathered in one place.
in this paper are new, as far as I am aware. So even if
arguments
the fact that there is bad news for frequentism
is old news, I hope it is
news
how
much
bad
there
newsworthy
just
really is.
Other
The
stance
important
here.
that one
Iwill
takes on
begin
issues
by saying
of philosophical
some
friendly
is
methodology
about
the
role
things
of intuition in the philosophical analysis of objective probability, by way
of preparation for the unfriendly things that Iwill say about frequentism
as such an analysis.
two frequentist analyses ?finite fre?
Iwill distinguish
space limitations require
quentism and hypothetical frequentism.
Although
me to confine my discussion
to the former, many of the arguments
that I
will adduce will count equally against both.
69
Erkenntnis 45: 209-227, 1997.
1997
Kluwer
Academic Publishers. Printed in theNetherlands.
?
210
ALAN HAJEK
OBJECTIVE PROBABILITY: INTUITIONS AND ANALYSIS
2.
and science. The
is, I claim, a concept of both commonsense
Probability
on
street
and
it
the
understands
person
recognizes
(at least to some extent)
in locutions such as "the probability
that this coin lands heads is 1/2"; the
and understands
it (at least to some extent)
in locu?
recognizes
to be spin 'up' with probability
such as "this electron
is measured
1/2". Commonsense
and science are joined by a two-way street: scientific
scientist
tions
theories
are, after all, invented by people who share the folk's conceptual
and who
seek to refine it; and commonsense,
in turn, partly
apparatus,
some
as
of
these
scientific
ideas become popu?
refinements,
incorporates
concept of gravity, for example, was once a part of neither
nor science, and now it is part of both. I believe
the same is
true of the concept of probability.
larized.
The
commonsense
seem
economists...
statisticians, physicists,
Many computer scientists,
or
is
relative
to speak as if probability
ifs
end
of
buts,
simply
frequency?no
a
as
We
to
mistaken.
would
do
better
think
of
it
is
This
surely
putative
story.3
one which both informs
notion of probability,
analysis of our pretheoretical
scientific notion of probability.
by a more
sophisticated
a
term that one is free to define
not
after
is
technical
all,
just
'Probability',
to our
as one pleases. Rather,
is answerable
it is a concept whose
analysis
intuitions, a concept that has various associated platitudes
(for example: "if
and
is informed
has probability
greater than 0, then X can happen"). Thus, it is unlike
or 'material
like 'complete metric
space' or 'Granger causation'
are
with
definitions
which
there
which
there
for
conditional',
stipulative
X
terms
platitudes. What
'probability'
arguing, and no associated
or
or 'if ... then',
like 'space'
'causation'
is like, instead, are concepts
It is fair game to
of analyses.
that can be the subject matter
concepts
to
it
is
fair
and
such
game
certainly
dispute frequentism.
analyses;
dispute
about my project
I say this early on, to forestall any possible puzzlement
from various computer
that I have already encountered
here (a puzzlement
...
statisticians
scientists,
).
is at best an analysis of objective probability,
Furthermore,
frequentism
is no
sensible
for example, from
objective chance (to be distinguished,
or degree of belief.) As we will
it
see, however,
probability,
subjective
cannot be even that.
have some sort
and relative frequency
This is not to deny that probability
sometimes
called
connection.
theorists, logical probabilists,
propensity
Subjectivists,
that an event with probability
all agree, for example,
and so on presumably
to occur roughly half of the time, in some senses of
half should be expected
and 'roughly' (I would
words
the
say this is another
'should',
'expected',
can
a
out
relative frequency
I concede
that finding
platitude). Moreover,
70
of close
FIFTEENARGUMENTSAGAINST FINITEFREQUENTISM
211
the only
way of finding out
a
I do not deny the existence of some interesting
the value of
probability.
their identification.
the two; I am only disputing
between
relationship
notion
So far I have taken as a starting point our commonsensical
as an analysis
of that
and I have regarded frequentism
of probability,
from another direction.
(a bad one). But we could come to frequentism
often
the best
be
?
and
even
sometimes
of probability. More
is, as I said at the outset, an interpretation
a
the
of
of
axioms
it
is
theory?
putative interpretation
probability
precisely,
So starting with a primitive,
those provided by Kolmogorov.
traditionally,
Frequentism
function P, defined over a certain set-theoretic
substructure,
uninterpreted
we
come to a
and
is non-negative,
which
normalized,
additive,
might
a
to play.
P.
Here
have
role
of
intuitions
again,
frequentist understanding
to do with our intuitive notion of
that have nothing
toKolmogorov's
axioms, and so in some sense provide
probability
an interpretation
think of mass, or length, or volume, which are
of them
can be suitably normalized.
and additive, and which
clearly non-negative
commonsense
tells us
They are not even in the running, however, because
For many
quantities
conform
thatprobability is simply something else. (Just try substituting any of them
into the platitudes above!) Incidentally, this also shows that it is too glib to
as long as
of probability
is provided
say that a satisfactory
understanding
we find a concept of importance to science that conforms to the axioms-for
that does not narrow down the field enough. In any case, the more strictly
our commonsensical
concept would
project of analysing
as the project of analysing our commonsensical
concept of
even
we
if
the
had
done
remain
would
causation,
say,
already
job for the
as
in
it
science.
appears
concept
philosophical
remain, much
3.
VERSIONS OF FREQUENTISM
to distinguish
two variants of actual frequentism,
It is necessary
and these
to
from hypothetical
actual
the prob?
frequentism. According
frequentism,
an
or
to
event
is
of
attribute
be
its
identified
actual
relative
with
ability
for all the requisite
there is no need to 'leave the actual world',
frequency:
if there happen to be infinitely many events or
facts are right here. Now,
attributes of the requisite
sort, then we cannot simply count the number
and divide this by the total number of trials, since this will
of 'successes'
take the indeterminate
relative
frequency
In that case, we take the limit of the
trial, as n tends to infinity. Hypothetical
form oo/oo.
up to the nth
frequentism keeps the intuition that probability is such limiting relative
frequency, but applies when the actual world does not furnish the infinitely
It thus identifies probability with a counterfactual
trials required.
many
71
212
ALAN HAJEK
relative
limiting
frequency:
trials.
the limiting
relative
frequency
if there were
infinitely many
I cannot discuss
since
the problems with hypothetical
here; and
frequentism
the infinite variant of actual frequentism
suffers from many of the
same problems, my discussion
of it is best left for another occasion
also.4
So let me focus solely, then, on the version of actual frequentism
in which
trials of the relevant sort - for short, finite
there are only finitely many
frequentism:
FINITE FREQUENTISM: The probability of an attributeA in a finite
class B
reference
within
is the relative
frequency
of actual
occurrences
of A
B.
Venn (1876), in his discussion of the proportion of births of males and
is nothing but that proportion"
(p. 84, his
"probability
at
its purest. Reichenbach
This I take to be finite frequentism
in the end looked more
inclinations
also, although his account
concludes:
females,
emphasis).
had such
accounts were pursued
in
finite frequentist
like hypothetical
frequentism;
and Putnam among others.
detail by Cramer, Hempel
thus be thought
Of course, all of this was a long time ago. It might
interest as far as the philosophical
is at best of historical
that frequentism
I have already emphasized
literature is concerned?its
currency elsewhere
?
more
in
these
times. Up to a
of
much
scrutiny
enlightened
unworthy
more
I guess; but only up to a point. As I indicated
this is true enough,
inclined
still has its proponents
among philosophically
earlier, frequentism
some
even
of
the
criticisms
and
Furthermore,
statisticians,
philosophers.
point,
accounts
that
here have some force against more sophisticated
presented
have grown out of frequentist
soil, for example Lewis'
(1994) 'best system'
as
at
I
the
will
to
chance,
argue
point.
appropriate
objective
approach
Indeed,
account of objective
chance could be
if an empiricist
and I suspect that many of
look a lot like frequentism,
could be adapted accordingly
against any such account.
scruples should not give
frequentist with serious empiricist
one wonders
that didn't
given
these arguments
Any aspiring
up on finite frequentism
comes at a considerable
lightly. The move
to hypothetical
say,
frequentism,
price, one that an empiricist should be
is really the only version that upholds
to pay. Finite frequentism
unwilling
that might make frequentists
inclinations
scientific
the anti-metaphysical,
at
first
In
us
first
in the
of
blush, it is an attractive theory. It
any case,
place.
are
it apparently
well understood;
is a reductive analysis, whose primitives
makes
the epistemology
metaphysical
of probability
72
straightfoward;
unlike
the classical
FIFTEENARGUMENTSAGAINST FINITEFREQUENTISM
213
it appears to be about the world; and it
and logical theories of probability,
seems to be inspired by actual scientific practice.
at second blush, it does not look nearly so good: it runs afoul
However,
of many
important
intuitions
that we
have
Or
so I will
of them?
It might
about probability.
argue.
4.
THE ARGUMENTS
are almost
We
make
ready for the arguments.
that I am uneasy
suspicious
Why
about
so many
them, substituting
quantity
for
buckets'
'leaky
metaphor
philosophical
one is better than fifteen leaky
to paraphrase him, one watertight
arguments:
ones. Suffice to say that I think the arguments here are pretty watertight,
and some of them are pretty decisive on their own.
you
for quality.
It recalls
Flew's
reason for giving so many arguments
is this. You might agree with
cannot be an analysis of our concept of (objective)
think that it is, so to speak, a success?
however,
you might
probability;
One
me
that frequentism
one that captures an important and
of the concept,
ful partial
analysis
central strand in our thinking about probability
(even if there are other
even
In
I feel some temptation
such strands).
moments,
my ecumenical
to concede
this: perhaps we should let a thousand flowers bloom, with
one of them. Or you might
think that frequentism
is a
of
of
the
surrogate for
concept
cleaned-up
probability?a
good explication
even
a messy,
one
confused
suitable for
vague, and
concept,
ambiguous,
use in science and clear-headed
discourse.
But I do think that the many
frequentism
being
successively
chip away at frequentism's
possible domain,
its
interest.
steadily reducing
reason for giving so many arguments
is that it shows just how
Another
are
for retrenching frequentism
in favor of some close
dim
the prospects
arguments
here
to itmight prompt one to
relative of it. A single class of counterexamples
in order to save it.What I hope tomake clear, by piling
add a single epicycle
are not
on ever more arguments
is that the problems
against frequentism,
an artifact
with
of some particular presentation
cosmetic
surgery. No
a little clever
of it, ones that would go away
the problems with frequentism
run deep.
the title and fanfare, I don't want to be too fussy about how the
are counted. Not all of the arguments are completely
indepen?
arguments
dent of each other; in fact, several of them might be regarded as stemming
Despite
from a single intuition (that probability
On
of 'counterfactual
independence').
73
can obey a certain sort
I might
the other hand, elsewhere
statements
214
ALAN HAJEK
the one
two or more
that could be
heading
arguments
the arguments
in a way that I hope is natural.
separated. Iwill distinguish
Iwill begin with some general arguments
that I think are telling against
form
of
then, arguments
any
frequentism;
specifically
against finite fre?
combine
under
quentism.
GENERAL PROBLEMS CONCERNING ANY VERSION OF FREQUENTISM
1. The Reference
We
Class Problem
events
think that various
and indeed we
bilities,
have unconditional
straightforwardly
proba?
even have theories that tell us what some of these
are. But it seems that frequentism
delivers only conditional
- or
in any case, relativized probabilities.5
Von Mises
probabilities
(1957)
"It is only the notion of probability
in a given collective
writes:
which
probabilities
is unambiguous" (p. 20). Suppose I am interested inmy probability of
dying by age 60. What I want is an unconditional probability. I can be
reference classes: the set of all living things; the set of all
set
the set of all non-smoking
all males;
the
of
males who exercise
humans;
the set of all philosophers;
the set of all Woody
Allen
fans
occasionally;
... Each of these reference
classes will have its own associated
relative
for death by age 60. But I'm not interested inmy probability
of
frequency
in various
placed
death qua philosopher,
say. To repeat, Iwant
Here we confront the notorious
'reference
an unconditional
probability.
a given event
class problem':
or attribute has more than one relative frequency;
to the fre?
and according
or
means
event
more
one
that
the
this
attribute
has
than
quentist,
probability.
is some
have been quick to mock Car
irony here. Frequentists
on
the grounds that itmust always be relativized
napian logical probability
to a choice of language, and no single language seems to be the canoni?
cal one. But a parallel problem
is practically
alluded to in the very name
must
'relative frequency'
to a choice
frequencies
always be relativized
no
of reference class, and
single reference class seems to be the canonical
There
one.
I see only one possible way out for the frequentist. He should insist that
is really conditional;
all probability
and that a putative unconditional
prob?
is really elliptical for a conditional
statement
ability statement
probability
is tacit. He could maintain
in which
the condition
that probability
theory
could
between
still do a lot of work
conditional
for us. For example, knowledge
of inequalities
we
be
all
that
need
in order to
might
probabilities
control
our environment
ficially.
(When you
in desirable ways,
see that the conditional
74
modifying
our behavior
probability
of death by age 60,
bene?
FIFTEENARGUMENTSAGAINST FINITEFREQUENTISM
215
is substantially
you
greater than it is given non-smoking,
given smoking,
- at
see a good reason to quit smoking
least when you have ruled out other
such as the existence of a common cause,
for this correlation,
explanations
also
on the sole basis
of conditional
rather than seeing the reference
it.
embrace
In short,
information.)
probability
as
a
class problem
the frequentist
problem,
could
a way out of the reference class prob?
Perhaps this gives the frequentist
that
this
'eliminativism'
he
admit
should
lem. But
regarding unconditional
is somewhat
science seems to abound
radical, if only because
probability
with
statements
longer pretend
And of course he can no
probability.
an interpretation of Kolmogorov's
axioms.
of unconditional
to be giving
2. Typing Events
may Change
the Probability
princi?
Every event, in all itsmyriad detail, is unique. This is just Leibniz's
are
to
events.
of
So
if
of
the
indiscernibles,
you
identity
applied
going
ple
to group an event with others, you will have to allow differences
between
them. But
difference
to be a guarantee
that these differences
to the probability. The thought must be that whatever
there needs
make
no
the differ?
over
are, they are not relevant. The bulge in the carpet then moves
to the notion of 'relevance'.
It had better not be probabilistic
relevance, on
pain of circularity. But what is it, then?
ences
drains from finite frequentism
Plausibility
especially when the putative
or too small. Unfortunately
reference
for the
class is too heterogeneous,
in tandem, so that solving one tends to
these problems work
frequentist,
can be enhanced by raising the admis?
the other. Homogeneity
exacerbate
sion standards into a reference class, demanding greater similarity between
or events; but that reduces
that can be admitted.6
the individuals
events
3. Probabilities
Distant Events
of Local
Events
the number
can be Counterfactually
of
individuals
Independent
or
of
Let me first continue my plea on behalf of commonsense,
with a little homi?
Sometimes
arguments against a philo?
ly on philosophical
argumentation.
to
show
that
the
sophical position attempt
position has internal difficulties
some
are of this form). On the
of
my arguments against frequentism
(and
other hand, sometimes
intuitions
arguments begin with commonsensical
that are supposed to be dear to us, and then deploy these intuitions against
a philosophical
position with which
they clash. For example, we might
begin with the following
hand is a matter solely
the fire, my
hand,
commonsensical
intuition: this fire's burning my
involving a small region of space-time
containing
and little else; and then deploy
this intuition against
75
216
ALAN HAJEK
Hume's
often
with
tion
and
are
'Intuition-based'
regularity account of causation.
arguments
as 'internal-difficulty'
not as damaging
and
when
faced
arguments;
an intuition-based
a proponent
of the philosophical
argument,
posi?
in question might simply retort that the argument is
question-begging,
that we should revise our intuitions. Nevertheless,
on the
depending
and the weight
that we attach to common
strength of those intuitions,
can
some
such
still
have
sense,
arguments
pull on us. Much
philosophy?
the greater part of it, Iwould
in just this way. (Think espe?
say-proceeds
famous
the Chinese Room,
cially of philosophy's
thought experiments:
and so on.) The
Twin Earth,
argument
that Iwant
to turn to now
is of this
form.
is a radium
Here
1/2. If radium
atoms
atom;
distant
its probability
of decaying
in 1500 years is
from it in space and time had behaved differ?
ently, this probability would still have been 1/2.1 submit that probability
statements
about
things can be (and perhaps
typically
are) counterfactually
independent of what other things ofthat kind happen to do; and probability
statements
about
can be (and perhaps
typically are) counter
of distant events of the same kind. But according
to
this is not so.
local events
factually
independent
relative frequentism,
statements about a thing at a time can be
And I submit that probability
of the behavior ofthat
counterfactually
independent
thing at other times.
The chance that this coin lands heads now does not depend on how the
? as
itwere, the coin doesn't have to 'wait and
coin will land in the future
it happens to do in the future in order to have a certain chance of
now. To put the point crudely, though vividly:
heads
it's almost as if
landing
in something
like backward causation from future
the frequentist believes
see' what
Put more carefully:
the frequentist believes
that
on the chance that the
of the coin places constraints
is counterfactually
coin lands heads now9 and in that sense, that chance
on
the future behavior.
Indeed, if the coin has yet to be tossed,
dependent
to the
determines
that current chance, according
the future behavior
fully
results
to current
the future
chances.
behavior
I think the frequentist has things backwards:
surely it is
frequentist.7
of landing heads that gives rise to its statistics, rather
the coin's probability
than the other way round.
finite
And
Hume's
so
the same fate as
suffers much
Frequentism
The fact that this flame burned my hand does
other flames happen to be contiguous with other
it is in general.
theory of causation.
not depend on whether
hands getting burned. The
this argument is that probability,
matter
than that.
private
some
of my arguments have force
I
earlier
that
said
briefly:
Digressing
more
that
of objective
analyses
probability
against certain
sophisticated
like causation,
intuition behind
is a far more
76
FIFTEENARGUMENTSAGAINST FINITEFREQUENTISM
217
of frequentism,
could be thought of as refinements
(1994).
notably Lewis'
that
It runs roughly as follows. The laws of nature are those regularities
are theorems of the best theory: the true theory of the universe
that best
of
(that is, the probability
strength, and likelihood
simplicity,
course of history, given the theory).
If any of the laws are
these laws say they are. It seems
then the chances are whatever
probabilistic,
balances
the actual
to Lewis,
the probability
that this radium atom decays
that according
in
1500 years does depend on what other, perhaps distant atoms (and perhaps
?
at least this will be so under the
not just radium atoms) happen to do
are
that
there
sufficiently many such atoms. For if many such
assumption
atoms
say, much
had decayed,
earlier
than they actually
did
(something
the best theory will admit has positive chance), then plausibly the best
?
it would
have
decay laws
to have reasonable
In particular, plausibly
likelihood.
law for radium would have been different, and hence so too the
had different
have
theory would
to in order
needed
the decay
radioactive
for this particular radium atom.
decay probability
I think that the next two arguments also carry some weight
against the
and perhaps even the two after that (with some smallish
Lewis analysis,
of (finite) frequentism.
but let us return to our discussion
modifications);
Concern
4. An Argument
from
Let me
a variation
pursue
on the
'counterfactual
I am
theme.
independence'
'argument from
(1980) famous
inspired here by Kripke's
to which
Lewis'
counterpart
theory, according
against
across
identified
worlds:
genuinely
possible
entities
concern'
cannot
be
to Lewis]
ifwe say 'Humphrey might have won the
[According
we are not talking
election (if only he had done such-and-such)',
about something
that might have happened
toHumphrey
but to
someone
a
else,
"counterpart".
Probably, however, Humphrey
someone else, no matter how much
could not care less whether
resembling
him, would
have been victorious
in another possible
world, (p. 45)
from concern
Arguments
of B9 then our concerns
A
cannot
defeasible,
contingent
the form:
"If A were
the correct
analysis
such-and-such;
they are not; hence
be the correct analysis of BV To be sure, such arguments
are
our
concerns
and what
to be is, I suppose,
a highly
happen
matter.
philosophical
contingent
have
-1
would
be
as thought experiments
can be a source of
our
are surely highly
to
them
responses
though
Still, much
insight?even
believe
such arguments
77
can be also. And
they may
serve a
218
ALAN HAJEK
distinctive
trading as they do not only on our beliefs,
function,
our desires
(fears, regrets,...
but also on
).
It is natural to think thatmy probability of dying by a certain age is a
of me
property
though von Mises
(or perhaps me plus my immediate environment).
Natural,
goes out of his way to deny it: "We can say nothing about
the probability of death of an individual even ifwe know his condition of
life and health in detail. The phrase
it refers
'probability of death', when
to a single person, has no meaning
at all for us" (p. 11). Also: "It is utter
nonsense
to say, for instance, thatMr. X, now aged forty, has the
probability
0.011 of dying in the course of the next year" (pp. 17?18). But surely it is
that Mr. X really cares about, when
just such 'nonsense'
about his probability
of death. Of course, he may well
about his fellow citizens
too, and he may be concerned
he is concerned
care
unselfishly
to find out how
high the death rate is among people of his type. But to the extent that
the other people can drop out of the
his concerns are directed to himself,
can drop
say that Humphrey's
counterparts
picture (much as Kripke would
concern about losing the
out of the picture when
it comes to Humphrey's
election). The statistics about others like him may give him good evidence
as to his own chance of dying, but the fact
cares about
that he ultimately
? one
a
of
is a fact about himself
by
expressed
meaningful
'probability
death' statement that refers to a single person.
GENERAL PROBLEMS CONCERNING ACTUAL FREQUENTISM (BOTH FINITE
AND INFINITE)
5. Actual
Here
is another
variant
though sufficiently
once
of Autrecourt
object,
Commits
Frequentism
one cannot
of
the
said words
the existence
to a Surprisingly
Rich Ontology
'counterfactual
argument,
independence'
separate treatment, I think. Nicolas
to the effect that, from the existence
of one
to merit
different
deduce
one
the existence
of others. But
of a non-trivial
to finite
according
for an event does
probability
of other, similar events in the actual world. The fact that
for dying by age 60 proves that I am not alone
I have a non-trivial probability
some probability
to
the
Or consider
in the world,
according
frequentist.
frequentism,
imply the existence
own mind:
that itwill deteriorate by age
for example,
to
this means
the finite frequentist,
0.1. According
60 with probability
like mine experience
such
that a tenth of the people out there with minds
?
are
course
that
there
other
such
of
60
which
deterioration
implies
by age
for you!
there's a quick argument against solipsism
minds. Now
statement
about my
the falsehood
here. It's not that implying
I'm being a little facetious
? on
the contrary. And of course the would-be
is a bad thing
of solipsism
78
FIFTEENARGUMENTSAGAINST FINITEFREQUENTISM
219
will simply deny that there are any non-trivial
solipsist-frequentist
prob?
abilities
about my mind. What's
of
though, is that statements
troubling,
about a mind, an object, or an event, seem to be simply irrel?
probability
evant to the existence
of other minds,
other objects, other events of the
same sort, right here in the actual world. Moreover,
we can even put lower
on how many such entities there are ? for example, at least 9 other
in the case just considered,
and of course some multiple
minds
of 10minds
in total. It is often true that the required things do indeed exist, and in the
bounds
numbers required (at least 9 other minds, for example). But sometimes
? see
the problem of the single case below; and in any case,
do not
they
these
things simply don't seem to be implied by the corresponding probability
statements.
6. Actual
=
Frequentism
Operationalism
about Probability
of probability
The finite frequentist
sounds a lot like an opera?
definition
tional definition.
Like the operational
of temperature
in terms
definitions
or mental
states in terms of actu?
of actual thermometer measurements,
in terms of the results of
al behavior, we have probability
being defined
some actual 'measurement'
(put in scare quotes, since the results might not
this
be
in
case, the results of trials of the relevant sort.
observed):
always
is mistaken
for the probability
of the probability
itself.
?
course.
so
hard
of
has hit
And rightly
the argu?
times,
Operationalism
are well known. To rehearse just one of these arguments: we want
ments
can be misleading
to be able to say that measurements
("the thermometers
were poorly calibrated"),
but an operational
definition
doesn't
let us say
'Measurement'
that. Likewise,
if the frequentist
has his way, we can't say that the chance
of the coin landing heads really was
1/2, but that there was an unusually
high proportion of tails in the actual
be a very natural thing to say.
7. Chance
Why
do we
is Supposed
believe
to Explain
in chances?
sequence
of tosses. And
Stable Relative
Because
we
yet that could
Frequencies
observe
that various
relative
are stable; and that is exactly what we would expect
frequencies
are
if there
chances with similar values. We posit chances
in
underlying
order to explain the stability of these relative frequencies.
But there is no
of events
explaining
to be done
not
be those
just is actual relative frequency: you can't
explain something by reference to itself. Here I am echoing a well-known
argument due to Armstrong
(1983) against the 'naive regularity theory' of
are
lawhood
laws
(that
simply true universal
generalizations).
Compare:
we posit laws of nature in order to explain regularities,
so they had better
simply
if chance
regularities,
as a naive
79
regularity
theory
of lawhood
220
would
ALAN HAJEK
it. (Indeed,
have
the demise
of frequentism
to that of the
is parallel
in many
naive
respects.)
regularity theory
I have presented firstly some general arguments
that work equally well
?
of frequentism
that I have mentioned
and
against any of the versions
refinements
indeed, in some cases, even against more sophisticated
thereof;
and then some further arguments
against actual frequentism,
irrespective
of the size of the reference classes. But finite frequentism
also has its own
all really stemming from simple mathematical
characteristic
problems,
ratios of (finite) natural numbers.
facts about
PROBLEMS SPECIFIC TO FINITE FREQUENTISM
8. Attributes
Chance
with No Occurrences
have Undefined
Relative
Frequencies.
Gaps
are undefined
frequencies
no
value. But
determinate
0/0 has
Relative
probabilities
for attributes
I contend
that have no occurrences:
can have
that such attributes
nonetheless.
each with a single die. In the first world,
Imagine two different worlds,
the die is tossed a number of times, but in the second it is never tossed.
There's a sense inwhich both dice can be said to have well-defined
proba?
to finite frequentism,
bilities for landing 6, say, but according
only the first
in the first world,
the die is weighed,
does. By analogy,
both
dice have masses.
it is never weighed;
nonetheless,
von
considerations
similar
Mises
adduces
Ironically,
but in the second
tomine
about dice
that probability
is relative
to argue for his opposite
conclusion
a
a
a
of
of
6
is
"The
given die and
physical property
probability
frequency:
to itsmass, specific heat, or electrical resistance"
is a property analogous
(p.
ought to
14). Exactly! But taking the analogy at face value, the conclusion
is an intrinsic property of chance devices
be that probability
(such as dice)
to
a
those other
theorist
The
say.
analogy
might
something
propensity
von
to
of
because
his
Mises
thoroughgoing
appealed
properties presumably
in order
on operationalism.
He regarded the mass of a die,
bordering
positivism,
as the limit of a sequence of ever improving measurements
for example,
of the mass. But who among us now would want to say that?
All
this may
devolve
into a clash of intuitions,
as so many
philosophical
debates do (which is not to say that the debates areworthless); but I think
that the intuitions
examples
on my side are perhaps
section.
in the next
80
even more
compelling
in the
FIFTEENARGUMENTSAGAINST FINITEFREQUENTISM
9. IfB Occurs
Once,
221
0 or 1: Local Determinism
A has Probability
toss a certain
suppose that we
the relative frequency
Now
of heads
coin exactly once. It lands heads. Then
is 1. But we don't want to be committed
to saying that the probability
of heads is 1, since we want to allow that it
device. (Change the example to Stern-Gerlach
could be an indeterministic
measurements
of electron spin, if you think that coin tosses are determin?
inwhich
there is only the one coin, if you think
istic; and imagine a world
there are any, are relevant.)
that the results of tossing other coins, when
an
once
event that only happens
to any sensible
And in general,
(according
for typing it) does not automatically
do so with
an a priori argument for local pockets of determinism
standard
Such
probability
is surely
1.
too
good to be true!8
now a radioactive
atom that obeys an
as
so
but
it
there
is exactly one such
decay law,
happens,
atom in the entire history of the universe (cf. Lewis'
(1994) "unobtainium").
of decay is 0 or 1, over any time interval,
Are we to say that its probability
Of
course
it isn't true. Consider
indeterministic
simply
because
for each
such
interval
the relative
frequency
of decays
is
either 0 or 1? So with probability 1 it decays exactly when it does? This
contradicts
our supposition
that it obeys
an indeterministic
decay
law. An
innocuous
surely.
supposition,
are most naturally regarded as being unrepeatable ?
experiments
game, a horse race, a presidential
election, a war, a death, certain
in the very early history of the universe. Nonetheless,
it
chancy events
seems natural to think of non-extreme
some
to
of
probabilities
attaching
Many
a football
them. This, then, is another notorious
called problem
of the single case.
10. Universal
Generalizations
statements
Certain
are
for example, universal
think that non-trivial
'single
problem
for frequentism:
and Existential
Statements
case'
of their very
claims.
in virtue
and existential
generalizations
the so
logical form:
Some people
to such statements ?
attach
probabilities
that all ravens are black is 0.9', or 'the
be, 'the probability
that there exist tachyons is 0.1 '. If there is sense to be made of
probability
such probabilities,
then it is not the frequentist who can make
it, for such
(objective)
as it might
to be true or false. How
only get one opportunity
a universal
or an existential
in which
generalization,
statements
cases
true? What
is the reference
do you count
is
statement,
instances are 'all ravens are
class, 0.9 of whose
one
I suppose
black'
instances?
could imagine counting possible worlds:
the
of
all
10%
(in
limit?)
possible worlds are 'there exist tachyons'-worlds.
But this is hardly an attractive proposal,
and in any case, it is certainly not
finite frequentism.
81
222
ALAN HAJEK
some further, non-fre
frequentist might acknowledge
sense
covers
cases
that
such
of
probability
("let a thousand flowers
quentist
...
bloom
that frequentism
still holds sway elsewhere.
The
"), insisting
to
sorts
certain
of
statements
this
is
of
argument
identify
probability
point
An
ecumenical
that people have found quite intelligible, even though they are (virtually)
on a frequentist
unintelligible
are not so ecumenical.9
analysis.
And
various
real-life
frequentists
So far, the problems have involved very low numbers of instances of the
attributes in question, namely 0 or 1. So the reaction might be: "frequentism
was never meant
to handle cases inwhich
there are no statistics, or only a
single data point; but in decent-sized
in the catchy
intuition encapsulated
samples itworks just fine." This is the
but all-too-vague
slogan "Probability
The reaction
is wrong-headed:
problems
is long run relative frequency."
remain even if we let our finite number
11. Intermediate
The
problem
guaranteed
terministic
Probabilities
'
of trials be as large as we
in an Deterministic
like.
World
are
of the single case was that certain relative frequencies
are
of
inde?
the
results
to be extreme (0 or 1), even when
they
because
for frequentism,
This is an embarrassment
processes.
such indeterminism is thought to be incompatiblewith extreme (objective)
?
cannot be probabilities.
hence those relative frequencies
probabilities
on
it would
its head: determinism,
let's turn this thought
Now
seem, is
a
in
determin?
intermediate
with
probabilities:
(objective)
incompatible
are
so
0 or 1.
chances
all
is
and
istic world,
objective
chancy,
nothing
But
determinism
is no obstacle
Remember
to there being relative frequencies
the probability
Venn's example:
that
of a
these values.
of male births among all births. This
is simply the proportion
a
that determines
but the process
is
1/2;
roughly
presumably
proportion
nonetheless.
sex
well
be
deterministic
could
baby's
lie between
male
birth
12. Finite
Frequentism
Consider
a coin
with
Generates
Spurious
Biases
that is perfectly
fair, meaning
by this that it lands heads
Yet itmight not come
for
tails.
equal to 1/2, and likewise
1/2 of the time in actual tossing. In fact, itwould be highly
probability
heads
up
exactly
to do so in a huge number of tosses, say 1,000,000.
unlikely
to
itwould be more than unlikely
If the number of tosses is 1,000,001,
?
thinks
So the finite frequentist
it would be downright
do so
impossible.
fair. Put
in saying that the coin is perfectly
then be wrong
that we would
an
coin
that
is
it
to
finite
any
analytictnxth
frequentism,
simply: according
a
there's
Now
is
biased.
an
of
times
number
odd
that is tossed
startling bit
of a priori
reasoning for you.
82
FIFTEENARGUMENTSAGAINST FINITEFREQUENTISM
223
we do not need to leave our finite frequentist
to
arm-chairs
Likewise,
of all n-sided
dice that are tossed a number of
'discover'
the biasedness
times that is not divisible by n (a coin can be regarded, after all, as just the
=
If only
2). And so on for other chance processes.
special case inwhich n
so
matters
could be settled
all empirical
easily!
to the possible biases of the coin, or
there is a 'graininess'
Furthermore,
the dice. Toss them n times; the relative frequencies must all be multiples
assure us that various coins
of 1/n. So not only can the finite frequentist
and dice are biased
he can even put severe constraints on the possible
extents
He
of the biases!
should
that the coin
to reply to all of this: "When we say
the temptation
mean
we
is fair,
that it lands heads with probability
really
to
and
for tails."10 Firstly, there is no
likewise
1/2,
equal
resist
approximately
that the fair coin will
guarantee
even approximately
half the
a
one
fair
that
coin,
Secondly,
imagine
genuinely
an
moreover
odd number of times; but the finite frequentist
is tossed
thinks
an
that this is on a par with
red
uncolored
imagining
object
namely,
we
time.
land heads
can at least
imagining gibberish. Finally, we should not let too much hinge on the
choice of the example. Consider
if you prefer certain Stern-Gerlach
spin
are perhaps
which
'fairer' than the coin is; or consider the
measurements,
half-life of radium; or whatever
your favorite example might be.
the longer the finite run of coin tosses (or whatever),
the
it is that the relative frequency
the
value
that
exactly equals
To be sure, the probability
that the relative frequency
is near
Ironically,
more unlikely
it 'should'.
the value that it 'should' equal increases. But if frequentism
is supposed
near enough is not good enough.
be an analysis of probability,
13. Finite
Frequentism
Generates
Spurious
to
Correlations
Let us say thatA is spuriously correlated with B
if P(A\B)
^ P(A)9
and yet A and B are not causally
related. The finite frequentist will see
over
correlations
all
the
spurious
place. We can be pretty sure that, say, the
of people who die by the age of 60 is not exactly the
relative frequency
same in general as it is among people who wear green shirts. In fact, we
can be absolutely
sure that this is so if the smaller sample size happens not
to divide the larger one. To see the point, pretend that there are 10 people
in our sample reference class, and that 7 of them wear green shirts. (Note
that 7 does not divide 10.) Then all relative frequencies within thewhole
the green shirt sample
of 1/10, while within
sample must be a multiple
a
no
must
be
of
1/7.
there
is
of 1/10
Now,
way for a multiple
they
multiple
to equal a multiple
cases
of 1/7 (apart from the trivial
of 0 and 1, which
are uninteresting).
The finite frequentist
translates this as: there is no way
83
224
ALAN HAJEK
a correlation - presumably,
to agree. In other words,
for the probabilities
?
a spurious one
between death by the age of 60, and the wearing
of green
it is startling that such results can
in this case. Again,
shirts, is guaranteed
be derived
aprioril
14. All Irrational
Probabilities,
'GoMissing'
a good
There's
Rational
and Infinitely Many
Probabilities,
sense
in which
most
of the numbers
between
0 and
1 are
irrational
aren't). Yet a finite
relative
Thus,
(uncountably many are, only countably many
can never take an irrational value.
frequency
to probabilities
is necessarily
which
gives such values
its
of
to finite frequentism,
subject matter.
irrespective
any
theory
false, according
That's certainly a
to finite
according
For example,
law for radium is false for all time periods
frequentism,
for decay?which
is to say that it is false
that have irrational probabilities
almost everywhere.
and very similar to one that we
1 (? la Reichenbach,
Reply number
an irrational value as closely as we like,
saw above): we can approximate
a
we
have
sufficiently
large (finite) number of trials.
provided
quick
of quantum
the radioactive
refutation
Counter-reply:
that probability
again,
mechanics!
this misses
the point. The
relative frequency,
thesis before
us is not
but that it is relative
is approximately
an identification
of
with
relative frequen?
have
probability
frequency.
we
can
values as
that
it
Of
course,
probability
approximate
cy.
implies
?
values
as
we
like with relative frequency
anything approximates
closely
but it is a much
itself as closely as we like!
stronger claim. The point
We
relative frequen?
in justifying
might be appropriate
approximation
but our topic is
for discovering
tism as good methodology
probabilities;
not
its
the analysis of probability,
methodology.
are such things as
Reply number 2: Bite the bullet, and deny that there
about
irrational probabilities.
Counter-replies:
laws
probabilistic
radioactive
No
experiment
Firstly, this would
is more complicated
could
mean
ever reveal
their existence.
that the truth about various
than we
think. For
the
instance,
than
smooth
rather
involve step functions,
decay laws would
curves. Secondly,
the reply smells of positivism.
Thirdly, we
exponential
even
irrational probabilities,
that instantiate
can imagine possible worlds
not
want
one
do
We
of
them.
not
to
be
if the actual world turns out
surely
is not only false, but logically false.11
to say that quantum mechanics
are ruled out by finite
a
infinitesimal
the
probabilities
way,
fortiori
(By
?
of
version
and
yet such prob?
indeed, by any
frequentism
frequentism
an
For
to
role
have
abilities may nonetheless
sympathetic
play.
important
84
FIFTEENARGUMENTSAGAINST FINITEFREQUENTISM
discussion
pp.
of infinitesimal
see for example
probabilities,
Skyrms
225
(1980),
177-187.)
to finite frequentism,
Moreover,
according
infinitely many rational prob?
also. This is related to the point Imade earlier about
abilities
'go missing'
the 'graininess'
rational values
of finite
for a given sample size. All
frequencies,
the endpoints of the grains will be ineligi?
to the finite frequentist.
according
relative
that fall between
ble as probability
values,
Note that the last few arguments did not require any assumptions
about
what we take to be the relevant reference classes. As I indicated before, I
about including in the reference class of a certain coin, the
have misgivings
results of tossing other, very different or distant coins. But even waiving
the last arguments
still go through. Include if you like
those misgivings,
the results of various other coins when determining
the probability
for this
coin; indeed, include if you like the results of all coins that ever were tossed,
are tossed, and ever will be tossed. Since there will still be only finitely
many trials in the reference class, still the frequentist will have to say: all
to be rational, and in fact, all multiples
will be guaranteed
probabilities
of a certain finite fraction;
spurious correlations with
and it is discoverable
chosen factors can be guaranteed;
that if the total number
15. Non-Frequentist
Symmetry, Derivation
of tosses
is odd,
Considerations
from
the coins
Enter
other appropriately
from the arm-chair
are biased.
our Probabilistic
Judgments:
Theory...
cases above as fatal for finite frequentism,
cannot easily be bitten. We know that
bullets
that
they provide
so
cannot
to be biased, because we
coins and dice
easily be 'shown'
have independent
sometimes
grip on what their various chances are. We
We
should
regard
the various
because
of radioactive decay cannot so easily be 'shown' to
know that probabilities
be rational, because quantum mechanics
There are other
says otherwise.
?
sources of our probability
besides
relative
for
judgments
frequencies
and derivation from scientific theories
symmetry considerations,
example,
that we
to. When
subscribe
there's a conflict between
already
one
of
other
and
these
the latter often wins.
sources,
frequency
5.
In this space,
relative
CONCLUSION
I could not give voice
to various
responses
to these arguments
on behalf of finite frequentists (although I did give voice to quite a few).
reject the starting points of some of the arguments,
that were
other arguments
'intuition-based';
they would
They would
doubtless
particularly
those
85
ALAN H?JEK
226
That should
grant me, remaining untroubled
by their conclusions.
seem
to
most
debates
go the same way.
philosophical
hardly be surprising:
I do, however,
think that finite frequentism
is about as close to being
perhaps
as a serious
refuted
once we
ever gets. This
position
philosophical
of
how
the
probabilities
separated
question
have
becomes
clear
are discovered
from the question of what probabilities are. (A good way to find out if a
'bachelor'
is to ask him; but we wouldn't want to analyse
to
To
in
the
the form
put my position
question.)
'yes'
reasonable
bad
'Finite frequentism:
methodology,
analysis'.12
is a bachelor
man
as one who
of a slogan:
answers
NOTES
I
This paper is an edited version of the first half of my talk "Thirty Arguments Against
Frequentism", presented at the Luino Conference. Iwanted this paper to reflect that talk,
while
meeting
the reasonable
that this volume
constraints
length
So
required.
I have
omit?
tedmy lengthy discussion of hypothetical frequentism, hoping to present that on another
occasion.
I thank
21 know
this from
the editors
various
for their forbearance.
I have
conversations
had,
though
such
catching
frequentists
out
of the closet and in print is not so easy. Shafer (1976) comes close in his definition of
one of the possible
outcomes
of the time that a particular
the proportion
[of a
the chance
ofthat
outcome"
is called
tends to occur
(p. 9), and closer
experiment]
later.
"tends
to" four pages
still when
he drops the qualification
3
a
for the
mathematical
abstraction
is
but
"The
word
Frieden
Witness
'probability'
(1991):
"
term 'frequency
of occurrence'
(p. 10).
intuitively more meaningful
chance:
"...
random
4
The discussion takes place inmy manuscript "Fifteen Arguments Against Hypothetical
the second
Frequentism",
half
talk at Luino.
of my
5
and a
The distinction, I take it, is between a conditional probability of the form P(B\A)
relativized probability of the form Pa{B). The former presupposes thatP{A) is defined,
the latter does not.
6
somewhat
This resembles
for
the
'best'
of
theory
between
the tension
and strength
simplicity
account
to Lewis'
(1994)
central
the universe,
in the competition
of chance:
raising
the standards for admission is like an increase in strength, with a corresponding loss in
on
More
simplicity.
7
better
It is hardly
corresponding
coin
be-tossed
that shortly.
to propose
has
that
instead
frequencies
an undefined
of
chance
evolve
the chances
over
evolve
relative
time:
landing
for
heads
mean
as the
exactly
that the yet-to
'chance
gap'
over
that would
?
see the
time
objection
below.
81 cannot
tive chance.
pause for further discussion
I admit that the connections
on
the connection
I assume
here
between
determinism
are not uncontroversial.
and objec?
See Lewis
(1986) pp. 117-121 for a fuller treatment.
9
Frieden,
for example
-
see footnote
3.
10
I suppose we already knew that no actual coin lands heads with probability 1/2 exactly,
and tailswith probability 1/2 exactly, if only because some tiny amount of probability goes
to the coin landing on its edge; and perhaps even some (tinier still) amounts of probability
go
to the coin
landing
on each
of the two edges
of
its edge.
II
Here I construe logic broadly to include analytic truths.
86
FIFTEENARGUMENTSAGAINST FINITEFREQUENTISM
227
121am grateful tomany people for discussions of thismaterial, especially Jim Bogen, Alex
Byrne, Fiona Cowie, Ned Hall, David Hubert, Marc Lange, Brian Skyrms, Nigel Thomas,
Jim Woodward,
and Lyle
Zynda.
REFERENCES
Armstrong, D. M.: 1983, What is a Law ofNature? Cambridge University Press.
B. R.:
Frieden,
1991, Probability,
Statistical
Optics,
and Data
Testing,
Springer-Verlag.
Jeffrey,Richard: 1977, 'Mises Redux', inR. E. Butts and J.Hintikka (eds.), Basic Problems
inMethodology and Linguistics.
Kripke, Saul: 1980,Naming and Necessity, Oxford University Press.
Lewis, David: 1986, Philosophical Papers, vol. II,Oxford University Press.
Lewis,
David:
1994,
'Humean
Supervenience
Debugged',
Mind
103,473-490.
Reichenbach, Hans: 1949, The Theory of Probability, University of California Press.
Shafer, Glenn: 1976, AMathematical Theory of Evidence, Princeton University Press.
Brian:
Skyrms,
Venn,
John:
von Mises,
1980, Causal
1876,
Richard:
The Logic
Necessity,
of Chance,
1957, Probability,
Yale
1980.
2nd
ed., Macmillan
and Co.
Truth
and Statistics,
Macmillan.
California Inst, of Technology
Division of Humanities and Social Sciences
228-77
Pasadena CA 91125
U.S.A.
87