In Defense of Reverse Inference

Brit. J. Phil. Sci. 65 (2014), 251–267
In Defense of Reverse Inference
Edouard Machery
ABSTRACT
Reverse inference is the most commonly used inferential strategy for bringing images of
brain activation to bear on psychological hypotheses, but its inductive validity has recently been questioned. In this article, I show that, when it is analyzed in likelihoodist
terms, reverse inference does not suffer from the problems highlighted in the recent literature, and I defend the appropriateness of treating reverse inference in these terms.
1
2
3
4
5
Introduction
Reverse Inference
Reverse Inference Defended
3.1 Typical reverse inferences are fallacious
3.2 No quick and easy fix
3.3 A likelihoodist defense of reverse inference
3.4 An example
Appropriateness of the Likelihoodist Approach
4.1 Likelihoodist reverse inference is not applicable
4.2 Cognitive neuroscientists are not interested in comparative conclusions
4.3 Reverse inference and negative hypotheses
4.4 Likelihoodist reverse inference may confuse cognitive neuroscientists
4.5 Bayesian reverse inferences should be preferred to likelihoodist reverse
inferences
Conclusion
1 Introduction
Cognitive neuroscientists hold that neuroscientific data, particularly images of
brain activation obtained by means of fMRI, PET, or other technologies, can
be used to support or undermine psychological hypotheses about the processes and events underlying the capacity to read, recognize faces, ascribe
beliefs, or make moral judgments. To fulfill this goal, cognitive neuroscientists
and cognitive scientists have examined various inferential strategies for
ß The Author 2013. Published by Oxford University Press on behalf of British Society for the Philosophy of Science. All rights reserved.
doi:10.1093/bjps/axs044
For Permissions, please email: [email protected]
Advance Access published on April 24, 2013
252
Edouard Machery
bringing neuroimages to bear on psychological hypotheses (Sternberg [2001],
[2011]; Shallice [2003]; Poldrack and Wagner [2004]; Henson [2005], [2006],
[2011]; Poldrack [2006], [2010]). In this article, I will discuss the most influential of these inferential strategies1: reverse inference.2 Despite its popularity,
the inductive validity of reverse inference has recently been questioned
(Henson [2005]; Poldrack [2006]), and an increasing number of cognitive
neuroscientists now view this inferential pattern with skepticism. In this article, I defend reverse inference: I argue that when reverse inference is recast in
likelihoodist terms, patterns of brain activation provide genuine evidence for
and against psychological hypotheses.
Here is how I will proceed: In Section 2, I describe reverse inference and I
illustrate its use in cognitive neuroscience. In Section 3, I show that when it is
cast in likelihoodist terms, reverse inference is immune to the main objections
against it. In Section 4, I defend the appropriateness of treating reverse inference in likelihoodist terms.3
2 Reverse Inference
When cognitive neuroscientists reverse infer, they conclude that a particular
psychological process or event (be it a cognitive process, an emotion, a mood,
etc.) is recruited by an experimental task from the fact that a particular pattern
of brain activation is elicited during this task. As it is typically used in cognitive neuroscience, reverse inference rests on two premises and has the following structure:
Argument structure of typical reverse inferences:
(A) When psychological process, p, is recruited by a task, pattern of brain
activation, E, is likely to be found.
(B) In task T, pattern of brain activation, E, was found.
(C) Hence, psychological process p was recruited by task T.
The conclusion, (C), does not, and is not meant to, follow deductively from
premises (A) and (B). Rather, reverse inferences are, and are meant to be,
inductive: Premises (A) and (B) are meant to provide a non-conclusive reason
to hold (C). Premise (A) is typically supported by surveying, often informally,
the existing literature in neuroimagery. Such literature reviews examine
whether tasks that are believed to recruit psychological process p are likely
1
2
3
For instance, Anderson ([2010a]) notes the frequency of reverse inferences in Neuroeconomics:
Decision Making and the Brain.
For a discussion of another influential strategy—Henson’s ([2005], [2006]) function-to-structure
inference or forward inference—see (Machery [2012]).
In this article, ‘likely’ does not have the technical sense that it has in the likelihoodist literature.
Rather, ‘likely’ will be used in its everyday sense as a synonym of ‘probable’.
In Defense of Reverse Inference
253
to elicit pattern E. Premise (B) describes the experimental data to which
reverse inference is applied.
Consider, for instance, Greene et al.’s ([2001]) article on moral judgements.
While undergoing brain scanning, participants were presented with sixty dilemmas that were classified as non-moral, moral-impersonal, or moralpersonal. The footbridge trolley dilemma (in which participants judge whether
it is permissible to push someone from a footbridge in order to stop a runaway
trolley that is about to kill five people) illustrates the class of moral-personal
dilemmas, while the bystander trolley dilemma (in which participants judge
whether it is permissible for a bystander to push a lever that will divert a
runaway trolley that is about to kill five people onto a sidetrack where the
trolley will kill a single person) illustrates the class of moral-impersonal dilemmas. Greene and colleagues singled out the brain areas that responded
more to moral-personal dilemmas than to moral-impersonal and non-moral
dilemmas—BA 9 and 10 in the medial frontal gyrus, BA 31 in the posterior
cingulate gyrus, and BA 39 in the angular gyrus (bilateral)—as well as those
areas that responded less to the former class of dilemmas than to the latter
class of dilemmas—BA 46 in the middle frontal gyrus (right) and BA 7/40 in
the parietal lobe (bilateral). The second premise of Greene et al.’s reverse
inference describes this pattern of brain activation. Furthermore, Greene
and colleagues review the empirical literature showing that BA 9 and 10,
BA 31, and BA 39 are activated in tasks recruiting emotional processing: citing
four cognitive-neuroscientific studies, they write that ‘[r]ecent functional imaging studies have associated each of these areas with emotion’ ([2001], p.
2107). Citing two studies, they also assert that BA 46 and BA 7/40 are
‘both associated with working memory’. This informal, succinct literature
review provides evidence for the first premise of their reverse inference. On
the basis of the pattern of brain activation found in their experiments and of
their succinct literature review, Greene et al. ([2001], p. 2106) conclude that
[. . .] from a psychological point of view, the crucial difference between
the trolley dilemma and the footbridge dilemma lies in the latter’s
tendency to engage people’s emotions in a way that the former does not.
The thought of pushing someone to his death is, we propose, more
emotionally salient than the thought of hitting a switch that will cause a
trolley to produce similar consequences, and it is this emotional response
that accounts for people’s tendency to treat these cases differently.
Greene and colleagues’ use of brain imagery data is a crystal-clear instance
of reverse inference, and it can be easily recast along the lines of the argument
structure presented above:
(A) When emotional processing is elicited by a task, BA 9 and 10, BA 31,
and BA 39 are likely to be activated.
254
Edouard Machery
(B) Reading moral-personal dilemmas resulted in greater activation of BA
9 and 10, BA 31, and BA 39 than reading moral-impersonal and
non-moral dilemmas.
(C) Hence, moral-personal dilemmas (e.g. the footbridge trolley dilemma)
elicit greater emotional processing than moral-impersonal (e.g. the
bystander trolley dilemma) or non-moral dilemmas.
Consider, more briefly, an additional example. Richeson et al.’s ([2003])
experiment was meant to test the hypothesis that interracial interactions
weaken people’s executive resources (as shown by, for example, Richeson
and Shelton [2003]) because executive function is a finite resource that can
be temporarily depleted. White participants were given an implicit association
test (IAT), interacted with a black participant, then completed a Stroop task4;
in a distinct experiment, they were presented with black faces while undergoing brain scanning. For present purposes, let’s focus on this latter experiment,
which involves a reverse inference. Richeson and colleagues found that, while
viewing black faces, activation in the right dorsolateral prefrontal cortex
(DLPFC) correlated with people’s implicit bias toward Blacks, as measured
by the IAT. This provides the content of the second premise of their reverse
inference. Based on the cognitive-neuroscientific literature, they assume that
reliance on executive function activates the right DLPFC: as they put it
([2003], p. 1323), citing about twenty studies, ‘we based the present investigation on emerging literature in cognitive neuroscience that has identified a
complex circuit of brain structures—consisting, in part of dorsolateral prefrontal cortex (DLPFC) and anterior cingulated cortex (ACC)—that supports
executive control’. This literature provides empirical support for the first
premise of their reverse inference. The conclusion of their reverse inference
is that, the greater one’s implicit bias is, the more one attempts to control it,
and thus the more one’s executive function is taxed (which contributes to the
explanation of why more biased individuals tend to do worse on the Stroop
task after having met a black experimenter). This reverse inference can also be
recast along the lines of the argument structure presented earlier:
(A) When executive function is recruited, the DLPFC is likely to be
activated.
(B) When presented with black faces, the greater a participant’s implicit
bias, the more the DLPFC was activated.
(C) Hence, when presented with black faces, the greater a participant’s
implicit bias, the more people’s executive function was recruited.
4
In a Stroop task, participants are asked to read the names of colors that are printed in a color
different from the colors named (for example, participants are asked to read ‘yellow’ when it is
written in red).
In Defense of Reverse Inference
255
In these two cases, as well as in many other cases, cognitive neuroscientists
conclude that a task recruits a particular psychological process or elicits a
particular mental state on the grounds that the brain areas activated in the
task are known to be activated when this process is recruited or when this
mental state is elicited. But are they justified in so doing? Under which conditions are cognitive neuroscientists justified in treating patterns of brain
activation as evidence for the conclusions of reverse inferences? In the remainder of this article, I will attempt to delineate these conditions.5
3 Reverse Inference Defended
3.1 Typical reverse inferences are fallacious
When they reverse infer, cognitive neuroscientists typically treat patterns of
brain activation as evidence that particular psychological processes are recruited by particular tasks on the grounds that these patterns of brain activation are likely to be found when these processes are recruited. The problem is
that, for a pattern of activation, E, to be evidence for the conclusion that a
psychological process, p, is recruited by a task, T, it is not sufficient that E be
likely to occur when p is recruited, because E could be even more likely to
occur when psychological processes other than p are recruited. So, a particular
pattern of brain activation constitutes evidence for the conclusion of a reverse
inference if and only if it is more likely to occur when the psychological process under consideration is recruited by an experimental task than when it is
not recruited.
This observation is the gist of Poldrack’s ([2006]) influential critique of
reverse inference as it is typically applied in cognitive neuroscience (see also
Henson [2005]; Harrison [2008]): it is fallacious to infer from a particular
pattern of brain activation and from the fact that tasks recruiting a particular
psychological process, p, are likely to elicit this pattern that the task under
consideration recruits p because one then does not take into account the probability of obtaining this pattern of brain activation when p is not recruited.
Poldrack puts this point in simple Bayesian terms. According to Bayesians
(for example, Earman [1992]; Howson and Urbach [1993]), a piece of evidence,
E, supports a hypothesis, H, if and only if the posterior probability of H
conditional of E is larger than its prior probability. In turn, as H and H
are mutually exclusive and jointly exhaustive,6 this inequality holds if and only
E is more likely if H is true than if H is false. Thus, a particular pattern of brain
5
6
In the terminology of Bandyopadhyay and Brittan ([2006]), I will focus on ‘the evidence question’—under which conditions do patterns of brain activation provide evidence for the conclusions of reverse inferences?—rather than ‘the belief question’—under which conditions are
cognitive neuroscientists justified in believing the conclusion of a reverse inference?
I am grateful to a reviewer for highlighting this assumption.
256
Edouard Machery
activation supports the hypothesis that psychological process p is recruited by
T if and only if the probability of the occurrence of this pattern of brain
activation if p is recruited, is higher than the probability of its occurrence if
p is not recruited.
3.2 No quick and easy fix
One may think that a small fix is all that is needed to remedy the invalidity of
reverse inferences (e.g. Harrison [2008], pp. 536–7). Cognitive neuroscientists
should compare the probability of the occurrence of the pattern of brain activation observed during T if p is recruited and the probability of its occurrence if p is not recruited, instead of just taking into account the former
probability. A pattern of activation provides evidence for the conclusion
that p is recruited by T if and only if the former probability is larger than
the latter.
While cognitive neuroscientists would be justified in treating patterns of
brain activation as evidence for the conclusions of reverse inferences if this
necessary and sufficient condition were met, this Bayesian characterization of
reverse inference is not very useful for two distinct reasons. First, in many
cases, cognitive neuroscientists have no sense of the probability of obtaining a
particular pattern of brain activation if psychological process p is not recruited
by experimental tasks and, as a result, they do not know whether the observed
pattern of activation gives them a reason to conclude that the psychological
process of interest was involved during the task under consideration. Second,
as meta-analyses and systematic literature reviews of the activation of particular brain areas across experimental tasks show (for example, for the precuneus, see Cavanna and Trimble [2006]; for the posterior cingulate cortex, see
Vann et al. [2009]; see also Anderson [2010b]; Yarkoni et al. [2010]), many
brain areas are such that they are activated when many different psychological
processes are recruited. As a result, the probability of obtaining a particular
pattern of brain activation when the psychological process of interest is not
recruited is likely to be high. In this case, observing the relevant pattern of
brain activation can only provide at best weak evidence, if any evidence at all,
for the psychological hypothesis under consideration.
At this point, it may be tempting to simply reject every inference that a
particular psychological process is recruited by an experimental task on the
basis of the activation of the brain during this task. Influenced by Poldrack,
many cognitive neuroscientists and cognitive scientists now regard reverse
inference with skepticism,7 and some recommend that it be completely
7
See, for example, (McCabe and Castel [2008], p. 345; Young and Saxe [2008], p. 1404; Greene
and Paxton [2009], p. 2 of the supporting information; Gonsalves and Cohen, [2010], p. 746).
In Defense of Reverse Inference
257
discarded. For instance, Anderson ([2010a], p. 152) writes that ‘[t]he solution
to the problems exemplified by neural correlate work and reverse inference is
not to use the techniques with caution, but to stop doing it’. However, this
rejection would deprive cognitive neuroscientists of the most commonly used
inferential strategy for bringing neuroscientific evidence to bear on psychological hypotheses, and one should only discard reverse inference altogether if
there is no alternative formulation of the conditions under which cognitive
neuroscientists are justified to treat patterns of brain activation as evidence for
the conclusions of reverse inferences.
3.3 A likelihoodist defense of reverse inference
The first step for developing this alternative formulation involves distinguishing two conceptions of evidence. According to the first conception (endorsed
by Bayesians), the evidential relationship is a relation between an empirical
result and a particular hypothesis: an empirical result supports (undermines,
or neither supports nor undermines) a particular hypothesis. When evidence is
understood this way, an empirical result supports a hypothesis if and only if it
is more likely if the hypothesis is true than if it is false. According to a second
conception (embraced by likelihoodists such as, for example, Edwards [1972];
Royall [1997]; Forster and Sober [2004]; Sober [2008]; Blume [2011]; Taper
and Lele [2011]), the evidential relationship is a relation between an empirical
result and two competing hypotheses8: an empirical result supports a particular hypothesis more than (or less than or to the same degree as) another hypothesis. In this case, evidence is comparative. When evidence is understood
this way, an empirical result supports a hypothesis over another one if and
only if it is more likely if the former hypothesis is true than if the latter is true.
Reverse inference can thus be recast in likelihoodist terms: cognitive neuroscientists are justified in treating a pattern of brain activation, E, as providing
evidence for the hypothesis that a task, T, recruits a first psychological process, p1, over the hypothesis that it recruits another psychological process, p2,
if and only if E is more likely to be found when p1 is recruited than when p2 is
recruited.9
So recast, reverse inference does not suffer from the problems highlighted
earlier in this section. First, there is now nothing fallacious in overlooking the
probability that a pattern of brain activation occurs when a particular psychological process is not recruited. Second, it is not of limited utility: neither
cognitive neuroscientists’ ignorance of this probability nor its possible high
value stand in the way of using patterns of brain activation as evidence for
8
9
Or more than three if more than two hypotheses are competing.
As noted by an anonymous reviewer, this is reminiscent of Lipton’s inference to the best explanation (Lipton [1991]).
258
Edouard Machery
particular psychological hypotheses. Cognitive neuroscientists only need to
take into account the probability that the pattern of brain activation under
consideration occurs when the specific psychological processes, which are
mentioned in the competing hypotheses, are recruited by experimental tasks.
3.4 An example
If one views reverse inference in Bayesian terms, one is forced to conclude that
Greene and colleagues invalidly inferred that moral-personal dilemmas elicit
more emotional processing than moral-impersonal and non-moral dilemmas.
They did not take into account the probability that the medial frontal gyrus,
the posterior cingulate gyrus, and the angular gyrus are activated when psychological processes other than emotional processing are recruited by experimental tasks. Furthermore, it is unclear whether taking into account this
probability so as to counter the invalidity charge would have helped Greene
and colleagues. Because, as Klein ([2011]) argues, the neuroscientific literature
suggests that the medial frontal gyrus, the posterior cingulate gyrus, and the
angular gyrus are activated when processes other than emotional processing
are involved in a task. Greene et al.’s findings could have provided at best
weak evidence for their hypothesis.
In contrast, when Greene and colleagues’ reverse inference is recast in likelihoodist terms, it becomes irrelevant that the medial frontal gyrus, the posterior cingulate gyrus, and the angular gyrus are activated when a variety of
psychological processes other than emotional processing are recruited by experimental tasks. Let’s see in detail how this works.
In a likelihoodist framework, one always compares several particular
hypotheses, and one examines the probability of the evidence in light of
these hypotheses. The relevant hypotheses for Greene and colleagues are the
following ones:
H1: People respond differently to moral-personal and moralimpersonal dilemmas because the former elicit more emotional processing than the latter.
H2: People respond differently to moral-personal and moralimpersonal dilemmas because the single moral rule that is applied to
both kinds of dilemmas (for example, the doctrine of double effect)
yields different permissibility judgments.
Greene and colleagues favour H1, whereas other authors (for example,
Nichols and Mallon [2006]) favour H2. Greene and colleagues correctly treated the observed pattern of activation as evidence for the hypothesis they
favour if the probability that the medial frontal gyrus, the posterior cingulate
gyrus, and the angular gyrus are activated during emotional processing is
In Defense of Reverse Inference
259
larger than the probability that they are activated when a moral rule or a
moral principle such as the doctrine of double effect is applied to a case. As
this inequality holds, evidence does support the hypothesis that
moral-personal dilemmas elicit more emotional processing than
moral-impersonal and non-moral dilemmas over the hypothesis that a single
moral rule was applied to both kinds of dilemmas (for another example, see
Section 4.2).
4 Appropriateness of the Likelihoodist Approach
In this final section, I examine five objections that could be raised against the
likelihoodist treatment of reverse inference presented in this article. These
objections assert successively that likelihoodist reverse inference is not applicable; that, even if it is applicable, it is not useful for cognitive neuroscientists;
that, in any case, it often does not differ from Bayesian reverse inference; that,
even if it is applicable, useful, and differs from Bayesian reverse inference; it is
confusing; and finally that, even if it is applicable, useful, differs from
Bayesian reverse inference, and is not distinctively confusing, it is not needed.
4.1 Likelihoodist reverse inference is not applicable
According to the likelihoodist framework developed here, reverse inference
can only be useful if cognitive neuroscientists have some sense of how likely it
is that some pattern of activation will be observed if two distinct psychological
processes, p1 and p2, are recruited by experimental tasks. However, one could
object that this knowledge is often lacking.10
This objection rightly notes that likelihoodist reverse inference can only be
used when two competing psychological hypotheses, H1 and H2, are available
and when, based on the existing literature, cognitive neuroscientists expect
two different patterns of brain activation if p1 is recruited and if p2 is recruited.
In some research contexts, this knowledge will be missing, and likelihoodist
reverse inference won’t be applicable there. On the other hand, in some contexts, this knowledge is available, and conceiving reverse inference in likelihoodist terms allows cognitive neuroscientists to apply reverse inference
there.
One may wonder how in practice one determines the probabilities that a
particular pattern of brain activation occurs if p1 is recruited and if p2 is
recruited, and whether determining these probabilities relies on past reverse
inferences.11 While past reverse inferences may play a role, they do not seem
necessary. To determine whether it is likely that one would observe a
10
11
I owe this objection to Matt Bateman.
I owe this question to Rik Henson.
260
Edouard Machery
particular pattern of brain activation if, say, the hypothesis that psychological
process p1 is involved, one can look at other tasks (tasks different from T) that
are known to recruit p1 based on behavioural data, neuropsychological data
such as dissociations, self-reports, and so on. To illustrate, consider Greene
et al. ([2001]). Many tasks are known to recruit emotions on the basis of
behavioural data, physiological data, and self-reports. One can examine
which brain areas tend to be activated in these tasks in order to get a sense
of how likely it is that a particular pattern of activation is found if emotional
processing is elicited. Importantly, this procedure assumes that the neural
correlate of a particular psychological process or of a particular psychological
state (for example, fear) does not vary from one task to the other.
4.2 Cognitive neuroscientists are not interested in comparative
conclusions
If reverse inference is understood in likelihoodist terms, it is intrinsically comparative: reverse inference can only show that evidence supports a particular
psychological hypothesis more than a competing one. One may thus object
that cognitive neuroscientists are not merely (if at all) interested in comparative claims; rather, they want to provide evidence for their hypothesis, understood in non-comparative terms. For instance, Greene and colleagues’ goal
was to provide evidence for the hypothesis that moral-personal dilemmas elicit
more emotional processing than moral-impersonal dilemmas and non-moral
dilemmas, not merely to provide evidence supporting this hypothesis over a
competing hypothesis (for example, that a single moral rule was applied to
both kinds of dilemmas). Understood in likelihoodist terms, reverse inference
is thus unable to fulfill cognitive neuroscientists’ goals.12
There are two mutually-consistent responses to this objection: first, appearances notwithstanding, cognitive neuroscientists who use reverse inference are
in fact sometimes, and perhaps often, interested in comparative claims;
second, when they are not, they are typically mistaken to use reverse inference.
I briefly elaborate on these two responses.
First, when cognitive neuroscientists use brain images to support a psychological hypothesis, they are sometimes weighing on a controversy between two
competing psychological hypotheses that disagree about the nature of the
processes involved in some task. Typically, one of these two hypotheses entails
that a first psychological process is recruited by this task, while the other
hypothesis entails that a distinct process is recruited. In these cases, cognitive
neuroscientists’ goal is, at least implicitly, comparative, and reverse inference
conceived in likelihoodist terms is well suited to this goal.
12
I owe this objection to Michael Anderson, Felipe de Brigard, and Russ Poldrack.
In Defense of Reverse Inference
261
Indeed, it is easy to recast some uses of reverse inference in cognitive neuroscience in comparative terms, even when reverse inference was not originally
presented to support a comparative conclusion. The discussion of Greene
et al. ([2001]) in Section 3.4 illustrates this claim.
More important, some existing cognitive–neuroscientific studies are explicitly comparing two psychological hypotheses, and such comparisons amount
to applying implicitly the likelihoodist reverse inference described in this article. Fernandes et al.’s ([2005]) goal was to test two alternative hypotheses that
had been put forth to explain the effect of divided attention during episodic
recollection.13 According to the first hypothesis, H1, episodic retrieval is disrupted by a concurrent task because retrieval and this task compete for general
attentional resources. According to the second hypothesis, H2, this disruption
is due to the fact that retrieval and the concurrent task ‘compete for specific
structural representations’ (p. 1115), viz. to the fact that the concurrent task
involves retrieving some information (for example, phonological and word
representations) that is related to what one tries to remember. While undergoing brain scanning, participants were asked to perform a word-recognition
task either without a competing task (full-attention condition) or with either a
word-related or a digit-related concurrent task (divided-attention condition).
Based on the existing literature about the neural correlates of verbal processing, the use of domain-general attentional resources, and memory retrieval,
Fernandes and colleagues held that, if H1 were true—if memory retrieval taps
into general attentional resources—then a particular pattern of brain activation would be observed: more activation in the DLPFC for the word-related
task than for the digit-related task and no difference between the word-related
and digit-related task in the neural correlates of verbal processing and memory
retrieval (left inferior frontal, left temporal, and left supramarginal cortex), as
well as no difference between these two tasks in the areas related to memory
retrieval (hippocampus, parietal lobe, precuneus). They also held that, if H2 is
true, this pattern of brain activation should not be observed. Rather, a different pattern should be found, involving a difference between the word-related
and digit-related tasks in the neural correlates of verbal processing and
memory retrieval. The latter pattern having been observed, they drew the
following comparative conclusion ([2005], p. 1115): ‘Results also support a
component-process model of retrieval which posits that MTL-mediated
(medial temporal lobe) retrieval does not compete for general cognitive resources but does compete for specific structural representations’. This is a
clear (albeit implicit) likelihoodist use of reverse inference.
However, it is also true that some other uses of reverse inference cannot be
so recast because reverse inference is then not used (even implicitly) to weigh
13
I am grateful to Felipe de Brigard for this example.
262
Edouard Machery
on a controversy between two psychological hypotheses. In these cases, reverse inference will typically not be applicable. The reason is that reverse
inference cannot be put in likelihoodist terms, and has to be recast in
Bayesian terms. Then, on pain of committing a fallacy, cognitive neuroscientists have to take into account the probability of observing the pattern of
activation obtained in the task under consideration if the psychological hypothesis is false. But, this probability is often unknown, and, when it is known,
it often turns out to be high (because many psychological processes produce
this pattern of activation).
It is unclear how often reverse inference can be recast in likelihoodist terms,
but one can be confident that this is often possible when brain imagery data
are used for psychological purposes—to better understand which cognitive
processes or psychological states are involved in a particular task—because
cognitive neuroscientists then start with existing psychological hypotheses
about these processes or states. They bring a new form of data to bear on a
few competing hypotheses.
A philosophically-minded reader could perhaps push the objection under
consideration one step further and ask whether the interpretation of reverse
inference in likelihoodist, comparative terms is committed to the controversial
claim that evidential support is necessarily comparative.14 However, this interpretation is compatible with the view that there are two distinct notions of
evidential support: a comparative and a non-comparative notion. But then,
our reader could object, if I do not reject the notion of non-comparative
evidential support, why do I think that cognitive neuroscientists should be
aiming for contrastive evidential support rather than non-comparative evidential support? The reason is that, in many inferential contexts in cognitive
neuroscience (Section 3), it is dubious that non-comparative evidential support can be obtained, while contrastive evidential support is available. In these
contexts, cognitive neuroscientists should aim for the latter.
4.3 Reverse inference and negative hypotheses
The third objection follows up on the second objection. Some psychological
controversies do not pit a first hypothesis, H1—that a particular psychological
process, p1, is recruited by a task—against a competing hypothesis, H2—that
another particular psychological process, p2, is recruited by this task. Rather,
they pit H1 against its negation, namely, the hypothesis that it is not the case
that p1 is recruited by this task. For this kind of controversy, showing that
14
I owe this objection to a reviewer.
In Defense of Reverse Inference
263
P(EjH1) > P(EjH2) amounts to showing that P(EjH1) > P(EjH1), and likelihoodist and Bayesian reverse inferences are identical.15
My response can be brief as it is similar to the response provided above.
When a psychological controversy has this form, reverse inference will often
not be applicable.
4.4 Likelihoodist reverse inference may confuse cognitive
neuroscientists
The fourth objection is also connected to the comparative nature of likelihoodist reverse inference. One could concede that reverse inference can be put in
likelihoodist terms, but object that putting it in these terms would be dangerous because cognitive neuroscientists are likely to forget its comparative
nature. They are thus likely to interpret the conclusions of likelihoodist reverse
inferences in non-comparative terms.16 For instance, cognitive neuroscientists
may be prone to conclude from Greene et al. ([2001]) that evidence suggests
that moral-personal dilemmas elicit more emotional processing than
moral-impersonal dilemmas and non-moral dilemmas. This conclusion is
not comparative and it is not supported by the likelihoodist reverse inference.
This objection should be resisted. That a particular form of inference can be
misused or misunderstood is no argument against it; it merely shows that
cognitive neuroscientists and psychologists should be cautious. Indeed,
every form of inference carries some risks of misunderstanding and misapplication, and, if carrying such a risk were a ground for rejecting a form of
inference, scientists would be left with no way to draw conclusions from
data. For instance, statisticians and methodologists have often noted that
null hypothesis significance testing prompts psychologists to mistake significance values for the posterior probability that the null hypothesis is true (for
example, Morrison and Henkel [1970]; Oakes [1986]; for discussion, see
Machery [unpublished]). Closer to home, reverse inference conceived in
Bayesian terms can also lead to misuses or misunderstandings because cognitive neuroscientists may overlook that the validity of a Bayesian reverse inference depends on the probability of obtaining the pattern of activation
under consideration if the psychological hypothesis is false, or they may confuse the claim that evidence supports a particular hypothesis for the claim that
the posterior probability of this hypothesis is high.
15
16
Mazviita Chirimuuta and Michael Anderson raised an objection along these lines.
I owe this objection to Russ Poldrack.
264
Edouard Machery
4.5 Bayesian reverse inferences should be preferred to likelihoodist
reverse inferences
Finally, one could object that I have given short shrift to the quick fix of
reverse inference presented in Section 3.2. When cognitive neuroscientists
are able to estimate the probability of obtaining the pattern of activation
under consideration if the psychological hypothesis is false, and if this probability is smaller than the probability of observing the pattern of activation
under consideration if the psychological hypothesis is true, they are justified in
treating the relevant pattern of brain activation as providing non-comparative
evidence for the conclusion of the relevant reverse inference.17
The problem, however, is that for many areas we do not have such estimates. Fortunately, when reverse inference is recast in likelihoodist terms,
cognitive neuroscientists can do without these estimates. In other cases, because the probability of obtaining the pattern of activation under consideration if the psychological hypothesis is false is high, the probability of a
pattern of activation if process p is recruited and if it is not recruited are
similar, and the pattern of activation under consideration does not provide
evidence for the psychological hypothesis of interest. For instance, Yarkoni
et al. ([2011]) showed that the anterior cingulate and anterior insula are active
in many different tasks. Fortunately, the similarity of the probability of a
pattern of activation if process p is recruited and if it is not recruited is not
relevant when reverse inference is recast in likelihoodist terms.
Finally, it is important to see that our capacity to predict with a surprising
accuracy which of a number of tasks, a participant is engaged in on the basis of
brain imagery data (for example, Poldrack et al. [2009]; Yarkoni et al. [2011])
does not justify relying on Bayesian reverse inference. These studies are only
able to provide an estimate of P(EjH) and to compute P(HjE) because they
restrict the space of possible hypotheses to a group of known, mutually-exclusive hypotheses. In effect, they are comparing P(EjH1) to P(EjH2) to
P(EjH3), and so on. If anything, this is in the spirit of likelihoodist reverse
inference.
5 Conclusion
Reverse inference is a common inferential strategy in cognitive neuroscience.
It allows cognitive neuroscientists to mend the bridge between the brain and
the mind by turning brain images into data that bear on psychological hypotheses. From a Bayesian point of view, typical reverse inferences are invalid
because they do not take into account the probability that the pattern of
brain activation would have been obtained if the psychological hypothesis
17
I owe this objection to Russ Poldrack.
In Defense of Reverse Inference
265
were false. Reformulated in Bayesian terms, reverse inference seems often of
little use as too often either we do not know the relevant probabilities or the
pattern of brain activation would probably have been obtained if the psychological hypothesis were false. By and large, cognitive neuroscientists have
concluded from this analysis that reverse inference is at best a suspect form
of inference, while persisting in using it to draw conclusions about the mind.
However, these problems are avoided if reverse inference is recast in likelihoodist terms. So recast, reverse inference allows cognitive neuroscientists to
use brain images to determine which of two competing psychological hypotheses is best supported. A likelihoodist approach to reverse inference is partly
restrictive: the range of its correct application is narrower than the range of its
typical applications. Cognitive neuroscientists must be comparing two distinct
psychological hypotheses: task T recruits psychological process p1 versus task
T recruits psychological process p2 (Sections 4.2 and 4.3); they must have a
good sense of how likely it is that a pattern of activation would be observed if
p1 and if p2 were recruited (Section 4.1); and they must keep in mind that, so
understood, reverse inference can only lead to comparative conclusions
(Sections 4.4). When these conditions are not met, reverse inference is typically
not applicable. On the other hand, a likelihoodist approach is only partly
restrictive: It allows reverse inference to be used when these conditions are
met (see, e.g. Fernandes et al. [2005]); it does not recommend getting rid of
reverse inference entirely (as, for example, Anderson [2010a] does); and it does
not depend on obtaining the kind of information that is needed to apply a
Bayesian reverse inference.
Acknowledgements
Many thanks to Michael Anderson, Felipe de Brigard, Mazviita Chirimuuta,
Clark Glymour, Rik Henson, Alexa Morcom, Russ Poldrack, Mark Sprevak,
and two anonymous reviewers for comments on previous versions of this
article.
Department of History and Philosophy of Science
University of Pittsburgh
1017CL, Pittsburgh, 15260 PA, USA
[email protected]
References
Anderson, M. L. [2010a]: ‘Review of Neuroeconomics: Decision Making and the
Brain’, Journal of Economic Psychology, 31, pp. 151–4.
Anderson, M. L. [2010b]: ‘Neural Reuse: A Fundamental Organizational Principle of
the Brain’, Behavioral and Brain Sciences, 33, pp. 245–313.
266
Edouard Machery
Bandyopadhyay, P. S. and Brittan, G. C. [2006]: ‘Acceptibility, Evidence, and Severity’,
Synthese, 148, pp. 259–93.
Blume, J. D. [2011]: ‘Likelihood and Its Evidential Framework’, in P. S.
Bandyopadhyay and M. Forster (eds), Hanbook of Philosophy of Statistics,
Oxford: Elsevier, pp. 493–512.
Cavanna, A. E. and Trimble, M. R. [2006]: ‘The Precuneus: A Review of its Functional
Anatomy and Behavioural Correlates’, Brain, 129, pp. 564–83.
Earman, J. [1992]: Bayes or Bust: A Critical Examination of Bayesian Confirmation
Theory, Cambridge, MA: MIT Press.
Edwards, A. W. F. [1972]: Likelihood: An Account of the Statistical Concept of
Likelihood and Its Application to Scientific Inference, Cambridge: Cambridge
University Press.
Fernandes, M. A., Moscovitch, M., Ziegler, M. and Grady, C. [2005]: ‘Brain Regions
Associated with Successful and Unsuccessful Retrieval of Verbal Episodic Memory
as Revealed by Divided Attention’, Neuropsychologia, 43, pp. 1115–27.
Forster, M. and Sober, E. [2004]: ‘Why Likelihood’, in M. Taper and S. Lele (eds),
Likelihood and Evidence, Chicago: University of Chicago Press, pp. 153–65.
Gonsalves, B. D. and Cohen, N. J. [2010]: ‘Brain Imaging, Cognitive Processes, and
Brain Networks’, Perspectives on Psychological Science, 5, pp. 744–52.
Greene, J. D. and Paxton, J. M. [2009]: ‘Patterns of Neural Activity Associated with
Honest and Dishonest Moral Decisions’, Proceedings of the National Academy of
Science, 106, pp. 12506–11.
Greene, J. D., Sommerville, R. B., Nystrom, L. E., Darley, J. M. and Cohen, J. D.
[2001]: ‘An fMRI Investigation of Emotional Engagement in Moral Judgment’,
Science, 293, pp. 2105–8.
Harrison, G. [2008]: ‘Neuroeconomics: A Rejoinder’, Economics and Philosophy, 24,
pp. 533–44.
Henson, R. [2005]: ‘What Can Functional Neuroimaging Tell the Experimental
Psychologist?’, The Quaterly Journal of Experimental Psychology, 58A, pp. 193–233.
Henson, R. [2006]: ‘Forward Inference Using Functional Neuroimaging: Dissociations
versus Associations’, Trends in Cognitive Sciences, 10, pp. 64–9.
Henson, R. [2011]: ‘How to Discover Modules in Mind and Brain: The Curse of
Nonlinearity, and Blessing of Neuroimaging. A Comment on Sternberg’,
Cognitive Neuropsychology, 28, pp. 209–23.
Howson, C. and Urbach, P. [1993]: Scientific Reasoning: The Bayesian Approach,
Chicago: Open Court.
Klein, C. [2011]: ‘The Dual Track Theory of Moral Decision-Making: A Critique of the
Neuroimaging Evidence’, Neuroethics, 4, pp. 143–62.
Lipton, P. [1991]: Inference to the Best Explanation, London: Routledge.
Machery, E. [2012]: ‘Dissociations in Cognitive Neuroscience and in
Neuropsychology’, Philosophy of Science, 79, pp. 490–518.
McCabe, D. P. and Castel, A. D. [2008]: ‘Seeing Is Believing: The Effect of Brain
Images on Judgments of Scientific Reasoning’, Cognition, 107, pp. 343–52.
Morrison, D. E. and Henkel, R. E. [1970]: The Significance Test Controversy: A Reader,
New Brunswick, NJ: Transaction Publishers.
In Defense of Reverse Inference
267
Nichols, S. and Mallon, R. [2006]: ‘Moral Dilemmas and Moral Rules’, Cognition, 100,
pp. 530–42.
Oakes, M. [1986]: Statistical Inference: A Commentary for the Social and Behavioural
Sciences, Chichester: John Wiley and Sons.
Poldrack, R. A. [2006]: ‘Can Cognitive Processes be Inferred from Neuroimaging
Data?’, Trends in Cognitive Sciences, 10, pp. 59–63.
Poldrack, R. A. [2010]: ‘Mapping Mental Function to Brain Structure: How Can
Cognitive Neuroimaging Succeed?’, Perspectives on Psychological Science, 5,
pp. 753–61.
Poldrack, R. A., Halchenko, Y. and Hanson, S. J. [2009]: ‘Decoding the Large-Scale
Structure of Brain Function by Classifying Mental States across Individuals’,
Psychological Science, 20, pp. 1364–72.
Poldrack, R. A. and Wagner, A. D. [2004]: ‘What Can Neuroimaging Tell us about the
Mind?’ Current Directions in Psychological Science, 13, pp. 177–81.
Richeson, J. A. and Shelton, J. N. [2003]: ‘When Prejudice Does Not Pay: Effects of
Interracial Contact on Executive Function’, Psychological Science, 14, pp. 287–90.
Richeson, J. A., Baird, A. A., Gordon, H. L., Heatherton, T. F., Wyland, C. L.,
Trawalter, S. and Shelton, J. N. [2003]: ‘An fMRI Investigation of the Impact of
Interracial Context on Executive Function’, Nature Neuroscience, 6, pp. 1323–8.
Royall, R. M. [1997]: Statistical Evidence: A Likelihood Paradigm, London: Chapman
and Hall.
Shallice, T. [2003]: ‘Functional Imaging and Neuropsychology Findings: How Can
They Be Linked?’, NeuroImage, 20, pp. S146–54.
Sober, E. [2008]: Evidence and Evolution: The Logic behind the Science, Cambridge:
Cambridge University Press.
Sternberg, S. [2001]: ‘Separate ModiEability, Mental Modules, and the Use of Pure and
Composite Measures to Reveal Them’, Acta Psychologica, 106, pp. 147–246.
Sternberg, S. [2011]: ‘Modular Processes in Mind and Brain’, Cognitive
Neuropsychology, 28, pp. 156–208.
Taper, M. L. and Lele, S. R. [2011]: ‘Evidence, Evidence Functions, and Error
Probabilities’, in P. S. Bandyopadhyay and M. Forster (eds), Handbook of
Philosophy of Statistics, Oxford: Elsevier, pp. 513–34.
Vann, S. D., Aggleton, J. P. and Maguire, E A. [2009]: ‘What Does the Retrosplenial
Cortex Do?’, Nature Reviews Neuroscience, 10, pp. 792–802.
Yarkoni, T., Poldrack, R. A., Nichols, T. E., Van Essen, D. C. and Wager, T. D. [2011]:
‘Large-Scale Automated Synthesis of Human Functional Neuroimaging Data’,
Nature Methods, 8, pp. 665–70.
Yarkoni, T., Poldrack, R. A., Van Essen, D. C. and Wager, T. D. [2010]: ‘Cognitive
Neuroscience 2.0: Building a Cumulative Science of Human Brain Function’, Trends
in Cognitive Sciences, 14, pp. 489–96.
Young, L. and Saxe, R. [2008]: ‘The Neural Basis of Belief Encoding and Integration in
Moral Judgement’, NeuroImage, 40, pp. 1912–20.