Causation and Context

Causation and Context
Dr Rani Lill Anjum
University of Tromsø / University of Nottingham
[email protected]
Introduction
Causation is often claimed to contain necessity. That C causes E with necessity. But is
necessity a necessary aspect of causation, and if it is, in what way can we say that causation
involves necessity? I want to argue that causation is essentially context-dependent, which
means that necessity in this connection can at least not mean that C causes E independently of
everything else. I will do this by focusing on how conditionals are always true or false
according to a certain set of background conditions, and if we change these background
conditions, true conditionals can become false and vice versa. This should be relevant for how
we understand causation, as hypotheses typically have a conditional form: ‘If C then E’.
1. Conditionals and hypotheses
The truth functional interpretation of ‘If A then B’ defines conditionals as being nothing but a
certain combination of truth-values assigned to ‘A’ and ‘B’:
A
T
T
F
F
B A⊃ B
T
T
F
F
T
T
F
T
But this seems to dissolve the very conditional relation that ‘if’ is supposed to express.
Consider the following valid inferences for the material conditional, none of which are valid
for ‘If A then B’:
1.
2.
3.
4.
(A & B) ⇒ ((A ⊃ B) & (B ⊃ A))
(¬A & ¬B) ⇒ ((A ⊃ B) & (B ⊃ A))
B ⇒ ((A ⊃ B) & (¬A ⊃ B))
¬A ⇒ ((A ⊃ B) & (A ⊃ ¬B)
A correct understanding of conditional is important for understanding causation, as the
conditional form is used to express hypotheses: ‘If C then E’.
With a material conditional understanding of ‘if’ we get that:
All hypotheses that are not tested will come out as true.
This is the so-called problem of counterfactuals, where conditionals with false antecedents are
defined as true: ¬A ⊃ (A ⊃ B).
The most commonly proposed solution is to treat conditionals with false antecedents as
special cases.
1
2. Counterfactuals or subjunctives?
The very problem of counterfactuals stems from a confusion. One typically expects the
distinction between indicatives and counterfactuals to be indicated by the grammatical mood
of the statement, as in the following pair of conditionals:
1. If Oswald didn’t kill Kennedy, then someone else did it.
2. If Oswald hadn’t killed Kennedy, someone else would have.
The fact that we would accept (1) in the indicative mood but not necessarily (2) in the
subjunctive mood, is used as an argument for a separate logical treatment of
counterfactuals/subjunctives. But this is a mistake, because:
i)
ii)
The subjunctive form does not indicate counterfactuality, but hypotheticality.
The change of truth conditions for (1) and (2) is due to a change in background
conditions, not the change of grammatical mood from indicative to subjunctive.
For all conditionals, the change of grammatical mood from indicative and subjunctive will not
change the truth conditions of the conditional, as long as the background conditions remain
the same:
a. If my head gets chopped off, I will die.
b. If my head got chopped off, I would die.
c. If my head had gotten chopped off, I would have died.
(a), (b), and (c) have the same set of truth conditions. This is because they all refer to the same
conditional relation and to the same set of background conditions. Hence they all have a
subjunctive, hypothetical relation, or what is often (mis)taken for the counterfactual aspect of
conditionals.
3. Indicatives and subjunctives
David K. Johnston notice that most conditionals with an apparent indicative form originally
had an explicit subjunctive grammatical marking in Old English. We can recognise these
originally subjunctive conditionals in that a change in the mood from indicative to subjunctive
does not change the conditional’s truth conditions, like in (a), (b), and (c) above. Thus
according to Johnston’s analysis, these conditionals are all essentially subjunctive.
But not all conditionals behave like this, which is why we get the divergence in truth
conditions for the pair of conditionals (1) If Oswald didn’t kill Kennedy, then someone else
did it, and (2) If Oswald hadn’t killed Kennedy, someone else would have. The reason for the
divergence in truth conditions is the divergence in background conditions:
In (1) it is assumed that Kennedy is killed and that someone killed him. It is an open question
whether Oswald did it, but if he didn’t do it, then someone else did. Accordingly (1) seems to
follow logically from the background conditions that someone killed Kennedy. For every x, if
x didn’t kill Kennedy, then someone else did it. But this is not a subjunctive relation like the
one in (2).
2
In (2), however, we are asked to disregard the fact that Kennedy is killed, and predict what
would have happened if Oswald hadn’t killed him. It is assumed that Oswald was the one who
actually killed Kennedy, and that more people were prepared to kill him if Oswald didn’t.
Johnston calls conditionals of the type (1) non-subjunctive because they don’t have a
corresponding subjunctive conditional with the same set of truth conditions. Other nonsubjunctive conditionals are:
a. If the butler didn’t do it, then the maid did it.
b. If Jane bet on heads, she won.
c. If John took that plane, he is dead.
What seems to be common for the non-subjunctive conditionals is that they seem to follow
from a set of de facto background conditions. In (a) we seem to know that something has been
done and that only the butler or the maid could have done it. In (b) we seem to know that the
coin landed heads, and in (c) we seem to know that everyone on that plane died. Hence these
conditionals are different that the ones where we would be in a situation of not knowing what
will actually happen, but predicting that if the butler will not do it, then the maid will, or that
if Jane bets on head, she will win, or even worse, that if John takes that plane, he will die.
Those would all be subjunctive conditionals, involving prediction and hypotheticality.
4. Preliminary conclusion: Conditionals are essentially subjunctive
Johnston’s findings indicate that a conditional is essentially subjunctive, and that so-called
counterfactuals are paradigmatic examples of conditionals, not special cases that should be
treated separately from other conditionals. Assuming that conditionals have their logical
properties according to factuality or counterfactuality, one misses the point that it is the
relation − not the relata − that is in focus. And this relation is essentially subjunctive.
But what is the subjunctive aspect? I will say that this is the hypothetical and causative aspect
of the conditional: We hypothetically assume the antecedent and predict what will or would
follow from it.
It is the essence of a hypothesis, scientific or other, that it is hypothetical, causative and
subjunctive.
5. Causation versus regularities
Hume demonstrates that we cannot directly observe a causal relation between some A and B.
We can observe that:
i. A and B
ii. A before B
iii. some sort of contact between A and B
But we cannot observe that:
iv. B because A
3
Hume’s analysis of causation shows us how we can never infer a causal relation between two
particulars only from their joint occurrence, or even constant conjunction. Now this is exactly
what is done in the material conditional, where every conditional ‘If A then B’ is taken as true
unless A is true and B is false. In other words, every hypothesis that is not directly falsified is
considered verified.
Focusing solely on the truth-values of the statements ‘A’ and ‘B’, the material conditional is
bound to miss out on the conditional relation expressed between them in a conditional ‘If A
then B’. But this is not a flaw only of truth functionality and the material conditional. All
approaches to causal relations that can only deal with the occurrence or non-occurrence of
some particular A and B are bound to end up in some sort of logical atomism where causal
matters simply are reduced to the joint occurrence of A and B.
If Hume is right in his analysis of causation, we are unable to distinguish between
regularities, where B always (at least up to this point) follows A, and causation, where B
happens because of A.
So why should this be a problem? If we always had full control on what B would follow what
A, then do we really need an additional relation of causation between them? Isn’t regularities
enough?
No, we do need causation to:
a) account for hypothetical cases, and mere potentialities
b) explain how a causal relation sometimes is prevented by contextual factors.
6. Causation: Categorical or conditional?
When we say that ‘All Fs are Gs’ this can be taken as a categorical or conditional statement.
This makes a logical difference:
Interpreted as a categorical, ‘All Fs are Gs’ is taken to express that the class of Fs is contained
in the class of Gs. Examples of claims that are naturally interpreted as categorical are:
a) All men are mortal.
b) All even numbers are divisible on 2.
The categorical is understood as stating a logical relation, where one will not allow or expect
counterexamples. The finding of an x that is F but not G would necessarily be taken as a
genuine counter-example to the general statement. The finding of a black swan made the
classification of all swans as white invalid (sic!). Hence one single counter-example would
force us to reject our original claim and revise our classification.
In this way, the categorical interpretation does not open for context-sensitivity in the same
respect as conditionals, as no background conditions are thought to make an even number not
divisible on 2 or make a man not mortal. Fs are classified as Gs. Hence there is no
hypotheticality, induction, ceteris paribus conditions or prediction involved in this
interpretation.
4
Conditionals
Interpreted conditionally, ‘All Fs are Gs’ is taken to express that for any x, if Fx then Gx.
Examples of claims that are naturally interpreted as conditional are:
a) If I drop this pen, it will fall.
b) If a dog doesn’t get water for three days, it will die.
The conditional is understood to state a hypothetical, causal, empirical relation, where we
predict that F will cause G. Still we get a problem of induction, since there is always a chance
of some interfering factor that may interfere and prevent the relation. Hence we also need to
add a ceteris paribus clause to conditionals, saying that ‘If F then G, all background
conditions being equal’.
The logical form of causation must be the conditional, rather than the categorical. Just like the
conditional, a causal relation is always sensitive to a change of background conditions. But
the conditional aspects of such causal relations are not first of all found in whether or not they
are expressible as a conditional ‘If some triggering factor, then this manifestation’. Such an
understanding of the conditionality of causation is too simple. This is evident when we
consider how causal relations, no matter how necessary, might be prevented from manifesting
themselves given certain background conditions of interfering factors.
This is why in our reasoning about causal relations we need a ceteris paribus clause as an
added premise in order to ensure that ‘If C then E’, while no such clause is needed for a
classification. Since causal relations can always be prevented by an external factor, the
problem of induction is indeed a genuine problem for causation.
This also points to the important role of background conditions in relation to explanations of
causal relations. Just like the background conditions are what can prevent a causal relation it
is also what’s making the ground for a causal relation: A causal relation must always be
found, justified and explained outside the immediate context of some C and E.
An example:
Lack of fuel causes a car to stop, − but only if it is a fuel-driven car! This is so obvious to us
that it feels stupid to even mention it. But let’s say that we wanted to find out if there was a
causal relation between lack of fuel and the car stopping, only by looking to A and B as such:
the lack of fuel and the car stopping. That wouldn’t help us at all. We have to check out the
motor of the car to see how it works. We can say that the motor of a car, together with all the
other pieces, is one part of the background conditions that we need to understand causal
relations in the car. If it’s an electric car, then it couldn’t have been the lack of fuel that
caused the car to stop. And we know that because we know something about how cars work.
Thus we would need a ceteris paribus clause for conditionals and any causal relation, in the
sense that no F alone will ever be enough to cause G. A conditional ‘If F then G’ is only true
according to a certain set of background conditions. This means that the ceteris paribus clause
will only play the role of fixing the set of background conditions: ‘If F then G, all things
being equal, meaning; given no change in this particular set of background conditions that
makes the causal relation true.’
5
7. Some claims about causation and necessity
If we want to understand causation, we need to focus on the relation, not the relata. This is the
same with conditionals: A conditional statement ‘If A then B’ cannot be reduced to two
particulars A, B and some relation R. To avoid that ‘If A then B’ just points to a regularity,
where A is in general followed by B, we seem to think that we need an additional clause of
necessity to get causation from regularities. But what should necessity mean in this
connection? − Surely not a strict conditional, where A ⇒ B in the sense that A ⊃ B is a
tautology?
Sometimes the necessity involved in causation seems to be mistaken for some sort of logical
necessity, where something is regarded as completely independent of contextual elements.
But then ‘C causes E’ would have to mean something like ‘E follows C independently of
everything else’. But it is actually an essential part of causation that it is not so.
Just like a conditional is true only relative to a certain context, a causal relation is necessary
only according to a certain set of background conditions. This necessity is empirical and a
posteriori, not logical and a priori. But it follows with empirical necessity that E follows from
C, given this particular set of background conditions. If we got the background conditions
wrong, or there was some element we didn’t know of, then we must revise our theory.
But this doesn’t mean that the adding a ceteris paribus clause to a causal relation means ‘If C
then E, unless not’. The ceteris paribus clause points to the relevant background conditions.
But how do we know what these relevant background conditions of a causal relation are?
Well, that is our job to us to find out if we want to understand a phenomenon.
8. Understanding causation by way of failure
Only when we understand how a causal relation can be prevented, will we understand what
the causal relation is, or what the phenomenon is. In this way, understanding causation must
happen by way of understanding the apparent counter-examples. − Not by trying to confirm
and generalise the hypothesis to see if C is always followed by E with logical necessity.
Furthermore, because the generality that is part of causation cannot be one of logical
necessity, but must be an empirical, context-sensitive necessity, we cannot fight the problem
of induction. But as with the ceteris paribus clause, induction is not something to fight to
guarantee that there actually is a causal relation between C and E. Instead, the fact that a
relation is vulnerable to induction only points to the empirical and informative part of
causation.
If causation was a relation of logical necessity and complete context-independence, we would
have to ask for its usefulness in science and our everyday lives. Then causation would be
nothing but a logical, analytic, tautological, and self-evident matter. And who needs that?
9. Some conclusive remarks about causation
What I have argued in this paper is that the logical form of causal statements must be
conditional rather than categorical or material. I have also tried to say something about the
nature of conditionals and what kind of generality that must be involved in statements about
6
causal matters. My main aim has been to show that when we state that ‘C causes E’, then this
should be understood as a genuinely conditional claim, saying that ‘If C then E’.
If correct, my analysis of conditionals should be informative for how we understand
causation. But because causal relations are in the world while conditional statements are in
language, the one will never be reducible to the other. Also, causal relations can also be
expressed in other ways than in the conditional form. The point is that an insight into
conditionals also represents an insight into causation. The following characteristic of
conditionals should therefore also tell us something essential about causation.
Conditionals are:
subjunctive in essence, involving prediction
hypothetical, not factual or counterfactual
context-dependent, not independent of everything else
a posteriori, not a priori or logically true
more or less general, not universal
involving induction, allowing counter-examples without rejecting the general
hypothesis
− in need of a ceteris paribus clause, fixing the set of background conditions
− irreducible, not reducible to some C, E and additional relation R.
−
−
−
−
−
−
To compare, causal relations are:
−
−
−
−
−
−
−
−
approached by way of prediction
a matter of potentiality
context-dependent; no F will ever alone be sufficient to cause G
empirical
a matter of more or less generality, not necessity
subject to the problem of induction, as it can be prevented by external factors
in need of a ceteris paribus clause, fixing the set of background conditions
always found, justified and explained outside the immediate context of some
C, E and additional type of necessity
7