Stop, Crying, Stop Crying : When 1 + 1 = 1

Master de Sciences Cognitives
Spécialité : Neurosciences Cognitives
Memoire
Stop, Crying, Stop Crying :
When 1 + 1 = 1
Using fMRI to investigate how the Brain merges single words
into sentences
Y-Lan Boureau
Directeur de stage : Christophe Pallier
Directeur d’unité : Stanislas Dehaene
Unité Inserm 562 - CEA - SHFJ
4 pl. du Général Leclerc
91401 Orsay
· 2006 ·
Abstract
A characteristic of natural languages is that meanings of single words in a sentence do not simply
add up into the meaning of the sentence. Instead, they combine in a non-trivial way dictated by the
rules of syntax.
Intuitions of investigators looking for an “area of syntax” in the brain have been guided by the pattern
of syntactic impairments in aphasic patients with a lesion in Broca’s area.
Indeed, imaging studies manipulating complexity in sentences by making use of a large array of syntactic transformations, and comparing these complex sentences to simple ones, have found increased
activation in Broca’s area, but also in left posterior temporal areas.
However, they might have missed core components of syntax (also present in simple sentences) by
subtracting them away.
Studies comparing activation to sentences and to non-sentence stimuli may be better suited to capture
core components of syntax. They generally pointed to an activation of areas in the left posterior and
anterior temporal lobe, but not the frontal lobe.
We conducted two fMRI experiments to investigate the transition from individual words to sentences.
An earlier fMRI study had found adaptation in the left temporal lobe when listening to sentences with
repeated semantic content. The first experiment sought to refine these results by disentangling adaptation to syntax-dependent meaning (meaning of a word after it has been attributed a thematic role)
and to individual, syntax-independent word meaning. Confusing early results call for modifications of
the initial design.
The second experiment tracked the phrase-building process by incrementally adding syntactic structure to strings of twelve words (from 12 isolated words to one 12-word phrase in five steps).
In pilot subjects, areas displaying a progressive increase in activation were found in the left angular/posterior middle temporal gyrus and the left anterior middle temporal gyrus.
Some evidence suggests that among these, the posterior areas subserve attribution of thematic roles to
verb arguments, whereas the anterior areas are in charge of merging small phrases into bigger phrases.
Potential ways to modify the experiment to bring about a dissociation between these processes are
discussed.
Acknowledgments
I am extremely grateful to Anne-Dominique Devauchelle, Christophe Pallier, Stanislas Dehaene and
Luigi Rizzi for being such pleasant people to work with.
I would also like to thank Antoinette Jobert for helping me to run the fMRI experiment.
Contents
1 Introduction
1
2 Some background in linguistics and imaging of syntax
2.1 A light primer on linguistics . . . . . . . . . . . . . . . .
2.1.1 Phonology, semantics, syntax, pragmatics . . . .
2.1.2 More about syntax . . . . . . . . . . . . . . . . .
2.1.3 Merging and transforming . . . . . . . . . . . . .
2.2 Imaging literature on syntax . . . . . . . . . . . . . . . .
2.2.1 Scope . . . . . . . . . . . . . . . . . . . . . . . .
2.2.2 The bias towards Broca’s area . . . . . . . . . .
2.2.3 Sentences vs. sentences . . . . . . . . . . . . . .
2.2.4 Sentences vs. non-sentences . . . . . . . . . . . .
2.2.5 Priming of syntactic structure . . . . . . . . . . .
2.3 So where do we stand ? . . . . . . . . . . . . . . . . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
3
. 3
. 3
. 4
. 4
. 7
. 7
. 7
. 9
. 13
. 16
. 17
3 Disentangling word form, single word meaning, and thematic roles
3.1 The first study . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3.1.1 Methods and results . . . . . . . . . . . . . . . . . . . . . . . . .
3.1.2 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3.2 The second study . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3.2.1 Design . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3.2.2 Materials and methods . . . . . . . . . . . . . . . . . . . . . . . .
3.2.3 Behavioral results . . . . . . . . . . . . . . . . . . . . . . . . . .
3.2.4 Confusing imaging results . . . . . . . . . . . . . . . . . . . . . .
3.2.5 Possible other explanations . . . . . . . . . . . . . . . . . . . . .
3.3 Further directions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
19
19
19
21
21
22
22
23
24
24
26
4 Core syntax at the phrase level
4.1 Reconciling Devauchelle et al. and Noppeney &
4.2 Tracking constituents . . . . . . . . . . . . . . .
4.3 Material and methods . . . . . . . . . . . . . .
4.3.1 Subjects . . . . . . . . . . . . . . . . . .
4.3.2 Design . . . . . . . . . . . . . . . . . . .
4.3.3 Stimuli . . . . . . . . . . . . . . . . . .
4.3.4 Task . . . . . . . . . . . . . . . . . . . .
4.3.5 Procedure . . . . . . . . . . . . . . . . .
4.3.6 fMRI Scanning Technique . . . . . . . .
4.3.7 Data Analysis . . . . . . . . . . . . . . .
4.4 Results . . . . . . . . . . . . . . . . . . . . . . .
4.4.1 Behavioral results . . . . . . . . . . . .
4.4.2 Network activated by all conditions . . .
4.4.3 Linear contrast . . . . . . . . . . . . . .
4.5 Discussion . . . . . . . . . . . . . . . . . . . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
27
27
28
29
29
29
29
29
31
31
31
32
32
32
32
34
iii
Price
. . . .
. . . .
. . . .
. . . .
. . . .
. . . .
. . . .
. . . .
. . . .
. . . .
. . . .
. . . .
. . . .
. . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
4.5.1
4.5.2
Areas . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
Patterns of activation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
5 General discussion and further directions
5.1 Adaptation to semantics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
5.2 Looking for core syntax and constituent-building areas . . . . . . . . . . . . . . . . . . .
5.3 Further directions concerning the core-syntaxe study . . . . . . . . . . . . . . . . . . . .
5.3.1 Controlling for mere sequence effects . . . . . . . . . . . . . . . . . . . . . . . . .
5.3.2 Decreasing variability between subjects by introducing tighter control on stimuli
5.3.3 Differentiating between agency and phrase building . . . . . . . . . . . . . . . . .
5.3.4 Digging constituency further: priming by constituents vs. adjacent strings of
words . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
41
41
41
42
42
42
42
Bibliography
48
Abbreviations table
49
iv
43
Chapter 1
Introduction
When you listen to someone talking in your mother tongue, words flow like a breeze and blend seamlessly into a meaningful message. Actually language is so deeply rooted into your brain that even
when you are trying to focus on something else, and wish you would not be distracted by what people
around you are saying, you simply cannot help parsing sentences and turning noise into meaning.
Yet the hard time people have trying to learn foreign languages should suffice to remind us that
using and understanding language is far from being an inherently easy task.
The contrast between the computational difficulty of parsing language and the experienced effortless use of natural language points towards the existence of specialized modules in the human brain.
Indeed, human brain mapping’s debut more than one century ago was all about language, and the loss
of the capacity to produce it following damage to Broca’s area (Broca, 1861).
Tremendous progress has been made in brain imaging since then, yet language still defies understanding. Evidence often proves either inconclusive or even contradictory. Studies have progressively
drifted from trying to find a hypothetical area for language, to trying to break down language into its
more simple components, and mapping those on the brain. Thus, the study of language is now often
restricted to the study of one particular aspect of it, to various degrees of specificity, ranking from the
study of semantics or syntax, to that of highly specialized subcomponents of language, as for instance
the study of passivization(Grodzinsky & Friederici, 2006).
However, trying to disentangle autonomous processes in language is quite a tough task. Parts of
language like phonology, semantics, and syntax are supposed to be working together and support one
another towards a common goal; there is evidence (Tyler & Marslen-Wilson, 1977; Altmann & Steedman, 1988) that language processing is not made up of autonomous networks that operate serially on
the input they receive, but of highly interconnected processors that interact all the time, with information flowing relentlessly back and forth from one processor to the other so that they all converge as
efficiently as possible on the final interpretation of language input. Communication through language
has to be redundant in order to be robust, to allow for understanding in a various number of nonoptimal conditions, for instance in a noisy environment(Miller et al., 1951). So language subprocessors
should not be expected to operate independently from one another. They all concur to the same goal.
Thus, all endeavours to tease apart pieces of language always exert a kind of violation on the natural
processing of language. Given the amount of cross-talk between levels of language processing, supposed
to be highly parallel in nature (Scott & Johnsrude, 2003), it appears quite unlikely that it might be
possible to shut down a specific subcomponent of language processing without influencing the others.
Yet such an approach cannot be dispensed with as there is no all-purpose language area in the brain,
but rather, a network of areas that subserve processes used in language (Grodzinsky & Friederici, 2006)
Let us now focus more specifically on syntax. Trying to pinpoint where it is implemented in the
1
Chapter 1: Introduction
2
brain implies both discriminating syntactic processes from non-syntactic ones - that is, distinguishing
syntax from semantics, pragmatics, and phonology - , and, within syntax, teasing apart different types
of operations.
Studies of syntactic processes in the brain have tackled the problem of mapping the syntactic brain
from various angles that have yielded somewhat conflicting results. However, bearing in mind that
syntax is no unitary process should help make sense of these results.
The first part of this report provides a short review of the literature and examines if a unified
framework can be sketched. Background in linguistics is provided as well.
I shall then move on to present the work that I have done, which consists in two fMRI experiments.
The first part, conducted conjointly with Anne-Dominique Devauchelle, comes as a logical following
to an initial fMRI study that was carried out by Devauchelle et al. (2006). This study compared
adaptation to syntactic and semantic features by monitoring decreases in activation following repetition
of sentences that did or did not share semantic ans syntactic features.
No adaptation to syntactic features was found, but robust adaptation to semantic features was seen
in the left middle temporal gyrus (L-MTG). However, it was unclear if the habituation was to word
form, individual word meanings (syntax-independent meaning), or attribution of the same thematic
roles (syntax-dependent meaning, a.k.a. “who did what to whom”), as the same thematic situation
was presented in various syntactic guises.
The study I shall present was designed to separate habituation to each of these three factors by
varying them as independently as possible. But the results we obtained have led us to think that the
experimental design failed to meet the requirements for efficient parameters estimation. The design is
currently being revised.
The second experiment looked directly at activations, to capture the phrase-building process that
underlies what we shall call “core syntax”, allowing to build larger phrases from small phrases.
We used an incremental design that we shall explain in far more detail below, the underlying intuition being that if some neural tissue engages in merging words into phrases, then its activation should
be higher when building one big phrase than when building separate smaller trees.
Finally, I propose directions for further research.
Chapter 2
Some background in linguistics and
imaging of syntax
In this chapter, I review the broad directions imaging research has taken to tackle syntax. Studies are
often interested with only one aspect of language; but when trying to break down language into its
parts, the first questions that arise are, what those parts are, and how specific they should be.
Before starting our review, we shall start off by giving some background in linguistics and see what
distinct components can be isolated in language.
2.1
2.1.1
A light primer on linguistics
Phonology, semantics, syntax, pragmatics
Understanding language implies converting auditory input into a meaningful message. However, this
is not a one-step conversion and a number of levels can be distinguished. The division of labour we
provide here is quite schematic for clarity’s sake, but the reader should keep in mind that all those
levels interact and cooperate into a common goal.
The first step (with which phonology is concerned) is to turn the auditory flow, an input that is
analogical in nature (variation in air pressure), into a sequence of phonemes, that is, into a digital
output (phonemes are a finite set of features that is specific to every single language); for instance, a
continuum between sounds “ba” and “pa” will not be perceived as such by a native French speaker,
but rather as a discontinuous leap from “ba” to “pa”, in a winner-take-all fashion(Liberman et al.,
1967).
The next step is then to turn that sequence of phonemes into words; that is, the sequence of
phonemes is compared against a dictionary until a match is found, and the corresponding meaning is
retrieved. Finally, all words combine into a sentence.
Semantics is the study of meaning, at every level, from word (or even parts of words) to sentences;
however, meanings of words do not simply add up when words are stringed together to form sentences:
the way their meanings combine is dictated by syntax. Thus, syntax can be viewed as a catalogue of
sophisticated ways to combine meanings, or even meaningless utterances. We shall enter into more
details about syntax in the next paragraph.
The last step is, integrating the message into a context and a network of expectations, guesses,
and implicit rules of communication (e.g., the expectation that someone who speaks to you intends
to convey an informational message and not some irrelevant sentence that merely “has a meaning”).
The study of relevance and interpretation of language in context is the goal of pragmatics (Sperber &
3
Chapter 2: Some background in linguistics and imaging of syntax
4
Wilson, 1986).
2.1.2
More about syntax
We are going to dwell somewhat longer on syntax, as it is sometimes unclear what syntax is, and what
syntax does, and it is crucial for our work to know what we are after when we are looking for the neural
correlates of syntax. Syntax is what makes language so powerful, and what makes it such a fine tool
when compared to syntactically poor ways to communicate like pidgins. Perhaps the contribution of
syntax to language (as compared to isolated words, or to mere juxtaposition of words, which anyway
is already a kind of syntax) can be seen as a shift from a 1D world to a 3D one, with the intermediate
2D world corresponding to basic, elementary juxtapositional syntax.
What steps could be distinguished in an increasingly complex interpretation of a string of words ?
As an example, take the sentence “it is the boy that the girl hits”. In a syntax-deprived world, you
would have isolated representations for the words “boy”, “girl”, “hit”, that would not even blend into
a common meaning. The lowest possible level of integration would be, representing the words in a
completely disconnected way, so that hearing three words one right after the other, or ten days apart,
would make no difference.
Then word meanings can combine with one another linearly. There is a girl, a boy, and there is
hitting, but importantly, the boy and the girl are the ones who are involved in the hitting.
A big progress is then to actually assign roles to participants; this is syntax already, and syntax tells
you whether it is the girl hitting the boy, or the other way round. We shall refer to that level of
syntactic processing as “core syntax”, as a short-hand.
But there is more to syntax than simple role assignment, which is already a considerable achievement per se. Syntax provides people with an exquisite way of shifting topics while still retaining the
same distribution of roles between participants; you know you should focus on the boy when hearing
“it is the boy that the girl hits”, which would not be the case if you were simply told “the girl hits
the boy”. That more sophisticated aspect of syntax allows to convey information in a far more refined
way; but it depends heavily on correct functioning of basic syntax.
Let us now turn to linguistic theories of syntax to see if they make that kind of distinction between
core syntax and the postulated more refined syntax.
2.1.3
Merging and transforming
Theories of syntax refer heavily to Chomsky’s work. Chomskyan theories come broadly in three guises.
The coverage provided here will be somewhat simplistic (the interested reader is referred to Chomsky
(1957, 1986); Haegeman (1991); Chomsky (1995); Lasnik (2002) for more detailed presentations).
The three distinct stages of Chomskyan theories of syntax are usually known as the Standard Theory (Chomsky, 1957), the Government and Binding theory (Chomsky, 1986; Haegeman, 1991) , and
the Minimalist Program (Chomsky, 1995; Lasnik, 2002).
I shall not enter into much detail; the crucial point I want to make is that all those three theories
have retained throughout a division between two distinct layers of syntax: that in charge of building
phrases (D-structure, X-bar theory, or MERGE), and that in charge of moving and binding those
phrases in a kind of second-order treatment of sentences (the sophisticated, but somewhat less indispensable part).
5
2.1 A light primer on linguistics
Core syntax
The first stage of syntactic processing involves building phrases out of isolated words. An example is
given below. The phrase “a black cat” is called a determiner phrase because it behaves as a whole set
of such phrases that generally contain a determiner and a noun (but not necessarily: “John” is a DP
as well).
DP
H
H
D
H
NP
HH
NP
AP
a
A’
N’
A
N
black
cat
Here, the important point to make is that the words “a”, “black” and “cat” are no longer isolated
entities, but are tied together in a tree and can be manipulated as a single entity (“a black cat” behaves
like “John”).
A node in a tree is called a constituent.
The meaning of the phrase depends crucially on the tree that has been built. For example:
1. (a)
S
H
H
DP
PP
The girl
HH
said
H
VP
HH
H
HH
CP
PPP
P
PP
PP
that she was coming on Thursday
Chapter 2: Some background in linguistics and imaging of syntax
2. (b)
S
HHH
6
H
H
H
DP
PP
The girl
H
VP
H
H
H
H
H
H
VP
H
H
said
H
H
CP
PPP
P
PP
PP
P
on Thursday
P
that she was coming
This example shows that semantic and syntactic interpretation are tightly interleaved at this level.
It roughly corresponds to building Chomsky’s D-structure until the Government and Binding theory,
but the term has been abandoned in the Minimalist framework. In the Minimalist framework, this
level seems to map on MERGE; however, we shall keep the term “core syntax” because of unsufficient
knowledge of the refinements of Chomsky’s MERGE.
Thematic roles assignment A predicate (where the predicate can be a verb, an adjective, a noun,
or a preposition) comes with arguments, that can be described as the participants that have some part
to play in the event being described by the predicate.
The roles that the arguments are assigned by the predicate are usually referred to as thematic roles,
or θ-roles. Common examples of θ-roles are: the theme, which is roughly the topic of the discussion
(e.g. Mary in the sentence “I called Mary yesterday”), the agent, which is causing the event to happen
or has some conscious control on the event (e.g. Mary in “Mary called her mother”), the experiencer,
which is undergoing the event (e.g. Mary in “Mary likes flowers”).
Assignment of θ-roles is crucial for the understanding of action, particularly when it comes to
knowing who did what to whom. The mapping of θ-roles on the arguments of a predicate is at the
core of the interface between syntax and semantics.
Second level syntax
We shall call second level syntax the part of the syntax that moves around the constituents built by
core syntax (or smaller pieces as well). In the standard theory, transformations would be part of that
second level syntax. For us, second level syntax will be all the more refined parts of syntax that do
something different than merely merging two words. In the minimalist framework, that would probably
correspond to move.
What syntax are we after ?
Looking for an area involved in syntactic processing in the brain appears to be too vague: the two
levels we have presented should at least be distinguished. That point should remain present in the
reader’s mind now that we are about to tackle our review of research on imaging of language, as it
can explain why lots of different studies uniformly labelled “looking for syntax” yielded various results.
7
2.2
2.2.1
2.2 Imaging literature on syntax
Imaging literature on syntax
Scope
Phonology, semantics and syntax have been studied in a large number of imaging studies; surprisingly
few attempts have been made to study pragmatics.
Semantics and syntax are intimately connected as we shall see in more depth ; sentences are often
syntactically ambiguous and semantics lead people to the right choice (remember the classical example
“time flies like an arrow”, that will never be spontaneously interpreted as a twin sentence to “fruit
flies like a banana”). Moreover, syntax can be viewed as nothing more than an auxiliary to semantics,
with its existence as a separate entity (in Jabberwocky speech for example 1 ) being a mere side-effect.
The boundaries between semantics and syntax are somewhat tricky to draw; for instance, does the
attribution of thematic roles regard syntax, or semantics ? Our review will quite often drift from
syntax towards semantics and back again.
The connection with pragmatics is not an easy one either. Let us take the example of attribution
of θ-roles once again. How much is that influenced by our pragmatic knowledge of the world ? When
reading a sentence like “the cat chases the mouse”, do we really need syntax to understand who did
what to whom ?
Taking a closer look at the interaction between syntax and pragmatics might be an interesting topic
for research.
2.2.2
The bias towards Broca’s area
Before looking at papers, we shall remind the reader with a few facts that should be kept present when
interpreting the results of research.
Broca’s famous lectures (Broca, 1861) about his patient who was incapable of producing grammatically structured utterances was the first unambiguous demonstration that a localized chunk of brain
tissue could subserve a particular function; at the time, Broca’s area was supposed to be involved in
speech production.
Later on, Wernicke’s patients completed the picture with what was perceived as a comprehension
deficit. The binary opposition was then one between speech production and speech comprehension,
with Broca’s aphasics being unable to produce speech while Wernicke’s aphasics could not understand
it. Following quite logically, conduction aphasia, a condition with impaired repetition, was associated
with damage to the pathway connecting Broca’s to Wernicke’s areas - namely, the arcuate fasciculus
(Geschwind, 1965, 1979).
It was not until the 1970s (Caramazza & Zurif, 1976) that that position was revised; according to
the new - now classic - assignment, Broca’s area was the one involved in syntax, whereas Wernicke’s
area received the charge of semantics. Indeed, Broca’s aphasics were described as producing mostly
speech without function words, whereas Wernicke’s aphasics were freely substituting words with irrelevant ones, while keeping the syntactic structure of sentences intelligible.
At first, it could look like some kind of progress. The opposition had moved to more abstract
grounds; but it was still a binary one, that tried to squeeze language into a clear-cut, neat framework. However challenged this view has been (e.g. Grodzinsky (2000)), it still survives happily in
the textbooks and the minds. Yet a careful study of lesions shows that there is neither a necessary
nor a sufficient causal link between a lesion in Broca’s area, and Broca’s aphasia, or purely syntactic
impairments. The same holds for Wernicke’s area and fluent aphasia.
1 Jabberwocky speech is grammatical speech which uses nonwords instead of existing content words. It is named after
Lewis Caroll’s poem in Alice in Wonderland
Chapter 2: Some background in linguistics and imaging of syntax
8
Figure 2.1: Broca and the brain of his patient Mr Tan
Figure 2.2: Broca’s and Wernicke’s areas and the arcuate fasciculus: the incomplete network for
language
9
2.2 Imaging literature on syntax
We shall review a number of papers that point towards syntax recruiting areas in the temporal
lobe, perhaps even more crucially than in the frontal lobe. Other papers argue that semantic tasks
are most often mapped to the anterior frontal cortex (e.g. Brodmann area 47, considered by some
authors to be part of Broca’s area. See Bookheimer (2002)). However, the classical mapping of syntax
on Broca’s area and semantics on Wernicke’s area has strongly biassed researchers into looking for a
confirmation of that mapping, rather than looking for a more adequate one. Thus, some studies (e.g.
Kang et al. (1999); Embick et al. (2000)) did not even include anterior temporal areas in the part of
the brain they imaged, while they had been hypothesized to be critically involved in syntax by earlier
papers(Mazoyer et al., 1993) .
It is important to bear those facts in mind when reading about papers looking for syntax. We shall
now move on to the review proper, that will fall into three parts: we first review papers that have
sought to disentangle syntax from semantics by comparing sentences to other sentences. We then turn
to studies that compared activation to sentences to activation to non-sentences stimuli. Finally, we
review what has been done using habituation paradigms.
2.2.3
Sentences vs. sentences
Broca’s area and complexity
Several studies (Just et al., 1996; Stromswold et al., 1996; Ben-Shachar et al., 2003, 2004)have been
successful in finding activation in Broca’s area in relation to syntactic complexity (see fig.2.3). The
idea is generally to oppose sentences with increased complexity, to simpler sentences, while balancing
for the number of words and other factors; thus, syntax can be isolated from semantics as there is
arguably no difference in semantic contents of syntactically complex and syntactically simple sentences.
Studies have used various ways to increase complexity, involving the presence of object-relative vs.
subject relatives (Ben-Shachar et al., 2004), center-embedded vs. right-branching relatives(Stromswold
et al., 1996), non-canonical order vs. canonical order(Ben-Shachar et al., 2003). Increased complexity
is generally assessed by behavioral reading-time measures or ratings, or rely on previous such measures.
Manipulations of syntactic complexity have consistently led to activation in Broca’s area (see Kaan
& Swaab (2002) for a review, cf. fig.2.3).
However, it should be noted that hearing simple sentences did not seem to activate primarily Broca’s
area (and activated temporal areas instead), and a great deal of complexity had to be introduced to
light up the area (see Kaan & Swaab (2002) for a review, cf. fig.2.5 ). This is a first clue pointing to
an identification of that type of syntax as the second level, not core, one.
Ben-Shachar et al. (2003, 2004) have provided a nice account of activation in Broca’s area, relating
it not to a general and somewhat vague increase in overall complexity (Just et al., 1996) or working memory load (Kaan & Swaab, 2002), but explicitly to transformations, in a way that relates to
Grodzinsky’s hypothesis about Broca’s aphasia being linked to trace deletion, based on the observation
that Broca’s aphasics’ deficit roughly boiled down to an incapacity to link antecedent to their traces
(Grodzinsky, 2000). Ben-Shachar et al. (2003, 2004) propose that Broca’s area is in charge of moving
antecedents to their traces, and indeed they observe increased activation in Broca’s area in sentences
that contain movement (e.g. cleft sentences and object relatives) relatively to sentences where movement is much smaller. Care is taken that a number of potential generators of complexity (number of
content words, number of embeddings, number of verb arguments) are balanced across conditions.
Ben-Shachar et al. (2003) also found higher activation in the posterior superior temporal sulcus bilaterally with transformational sentences, and in the left posterior superior temporal sulcus in relation
to the number of arguments required by a verb. We shall return to this later.
Chapter 2: Some background in linguistics and imaging of syntax
10
Figure 2.3: Map of activations found by studies that compared complex sentences and simple sentences.
From Kaan & Swaab (2002)
11
2.2 Imaging literature on syntax
Figure 2.4: Variation of activation in Broca’s area according to SOV or OSV order depends on the
nature of the verb and on the θ-role assignment to its arguments. An account based solely on syntactic
transformations as in Ben-Shachar et al. (2004) would predict higher activation in OSV noncanonical
order than SOV no matter what. From Bornkessel et al. (2005)
Broca’s area and linearization
However, a paper by Bornkessel et al. (2005) represents a recent challenge to this proposal. In Benschachar et al.’s view, the role of Broca’s area should be specified purely in terms of grammatical
functions; that is, a moved subject has to get back to where the subject is expected to be, and the
same goes for the object.
If that conception is true, then the activation in Broca’s area should be greater in sentences where
arguments have been displaced in a non-canonical order, than in sentences with standard order - SOV
(Subject Object Verb) if the language in question is one with standard SOV order, no matter what
their meaning is.
Bornkessel have studied sentences in German with various word orders. German case-marking
being explicit (so that an accusative masculine name, e.g. den Mond, can be distinguished from a
nominative (der Mond) or a dative (dem Mond) one) makes word order relatively free; however, there
is still a preference for SOV (the canonical order).
Indeed, activation in Broca’s area is higher when the object is presented first, and the verb is an
active one like schlagen (to hit), as was predicted by Benschachr’s interpretation. Importantly however,
the pattern is reversed when the verb is what is called an object-experiencer verb, e.g. gefallen (to
please), with Broca’s area being significantly more activated with SOV order than OSV (see fig.2.4).
Thus, Broca’s area seems to be recruited when moving arguments to conform to a standard θ-roles
hierarchy that would rule that experiencer should appear before theme, rather than a purely syntactic
SOV hierarchy. The fact that activations for object-experiencer verbs lie between the levels of activations associated with processing active verbs might reflect that both hierarchies come into play.
Importantly, this study also found increased activity of left pSTS when an early attribution of
θ-roles based on case-marking had to be revised when the verb appeared.
Another study (Grewe et al., 2005) shows that Broca’s area fails to be activated more when sentences in German are presented with OSV order than standard SOV order, but the object is not a
determiner phrase, but a pronoun instead. The authors argue that this is because there is a rule stating that a pronoun should generally precede non-pronominal arguments; thus, Broca’s area seems to
conform to a number of hierarchical linearization rules and not only the purely syntactic ones proposed
by Ben-Shachar et al. (2003).
Finally, Friederici et al. (2006) have found a parametrical increase of activation in Broca’s area as
a function of the number of moved verb arguments in German, which lends support to both a view
Chapter 2: Some background in linguistics and imaging of syntax
12
of Broca’s area subserving syntactic transformations, and an account in terms of reordering elements
according to various hierarchies.
Syntax as opposed to semantics
Kang et al. (1999) have tried to isolate syntax from semantics by comparing minimal verb phrases
(only two words) that contained either a syntactic (e.g. grew heard) or a semantic (e.g. ate suitcases)
violation to normal phrases (e.g. ate apples). They found higher activation in Broca’s area (Left BA
44) related to the syntactic condition, while BA 45,10 and 46 showed laterality differences, with the
semantic condition being associated with right-lateralization, and the syntactic condition with leftlateralization.
Unfortunately, they did not include the anterior temporal lobe in their imaging sessions, so that
the involvement of areas in the temporal lobe below the AC-PC line cannot be discussed.
Dapretto & Bookheimer (1999) have used another approach to disentangle semantics and syntax.
Their paradigms relied on the task to focus subjects’ attention either on semantics or syntax: subjects
had to make a judgement as to whether two sentences had the same meaning or were different; in the
semantic condition, sentences differed by a content word (e.g. “The lawyer questioned the witness” vs.
“The attorney questioned the witness”), and in the syntactic condition, they differed by their word
order (e.g. “West of the bridge is the airport” vs. “The bridge is west of the airport”). Behavioral
measured ensured that both tasks were equally difficult.
They found comparatively higher activation in a part of Broca’s area (Brodmann area BA 44, part
opercularis) in the syntactic condition, and in the lower portion of the left inferior frontal gyrus (LIFG, BA 47, part orbitalis), taken by some authors to be part of Broca’s area too) in the semantic
condition.
It should be noted that the activation of Broca’s area in the syntactic condition is consistent with
previously presented hypotheses that Broca’s area is recruited when there is some kind of movement
from a non-canonical order to the canonical one, the latter being specified either syntactically or thematically.
The activation of the L-IFG in the semantic task is consistent with a number of studies that have
implicated that area in tasks involving lexical processing (see Bookheimer (2002) for a review on semantic processing).
The two studies by Dapretto & Bookheimer (1999) and Kang et al. (1999) have been criticized by
Embick et al. (2000) on the ground that they used tasks that “lacked an experimental history”. Embick et al. (2000) get back to a classical violation detection task, where subjects had to report whether
sentences contained one or two violations, that were either grammatical (e.g. inverting a noun and its
determiner) or spelling errors.
The authors report a significantly greater activation in Broca’s area than in Wernicke’s area or in the
angular gyrus/supramarginal gyrus in the syntactic condition. It should be noted that the grammatical
errors introduced were errors in word order, which makes these results consistent with the linearization
hypothesis stated earlier.
As is the case with the study by Kang et al. (1999), the authors were primarily concerned with attributing roles to Broca’s and Wernicke’s areas, and used an ROI approach that only imaged slices
above the AC-PC plane. Therefore, no data on the lower temporal lobe are available from their study.
Problems with this approach
A problem with those kinds of approaches is that they are bound to look for a subpart of syntax
that has as few links to semantics as possible. Remembering the distinction between core syntax
and second-level syntax, we can ask what part of syntax they are tracking. Given that core-syntax is
tightly linked to semantics as it is about combining meanings and attributing thematic roles, it is likely
13
2.2 Imaging literature on syntax
Figure 2.5: Map of activations found by studies that compared sentences and non-sentence stimuli.
From Kaan & Swaab (2002)
that studies are often biassed towards second-level syntax. Indeed, the contrasts used often oppose
complex sentences to less complex ones - that is, sentences to sentences, so that any common factor
gets substracted out. Core syntax might indeed be one of those missed common factors.
2.2.4
Sentences vs. non-sentences
When comparing sentences to non-sentence stimuli, activation is usually found in the temporal, not
frontal lobe, as we shall now review. See fig.2.5 from Kaan & Swaab (2002).
An early study: stories vs. words
The most natural approach to tackle phrase-level syntax is to compare activation to sentences, to
activation to word lists. Indeed, an early study by Mazoyer et al. (1993) has used PET to compare
activation to sentences and word lists, while orthogonally varying semantic content (the words inside
the sentences could be congruent words, incongruent words, or pseudowords).
Processing of meaningful stories (semantically congruent condition) activated the left middle temporal
gyrus, the left and right temporal poles, and a superior prefrontal area in the left frontal lobe, in
addition to regions activated by single-word comprehension.
Importantly, only the bilateral temporal poles were consistently activated more in the sentence
condition than in the word list condition, regardless of the semantic condition, pointing towards a role
of that area in core syntax.
Chapter 2: Some background in linguistics and imaging of syntax
14
Sentences vs. environmental sounds
Support to this hypothesis has come from a study by Humphries et al. (2001), who compared activation
to sentences describing a short scene to activation to environmental sounds that matched as closely as
possible the semantic content of the sentence (e.g. “the tires hissed and the car crashed” compared to
the corresponding sounds).
They found increased activation for sentences in anterior superior temporal lobe bilaterally, and in left
posterior superior temporal regions.
Breaking down the sentence: semantics, prosody
The evidence presented in the previous sections point towards an involvement of anterior temporal
regions bilaterally, and left posterior superior temporal regions, in the processing of sentences.
There is a marked difference between sentences and word lists at the level of syntax, but sentences
and word lists also differ at the levels of semantics and prosody. Several studies have tried to tease
apart those effects.
Friederici et al. (2000), using a design that crossed sentence conditions (sentences vs. word lists)
and word type conditions (content words vs. pseudowords), found an increase of activation in the
planum polare bilaterally and in the deep portion of the left frontal operculum in the sentence conditions.
Vandenberghe et al. (2002) tried to disentangle syntax and semantics by using semantically unrelated words (e.g. “The branches of the traffic harmed the shopkeeper’s further tea” as opposed to
“The baby is staying with his grandmother”), and crossing this semantic condition with a syntactic
one (sentences vs. scrambled order sequences). They found increased activation of the left anterior
temporal pole to sentences compared to scrambled versions of sentences (coordinates: -44, -6, -24), and
a similar but weaker response in the left anterior superior temporal sulcus (coordinates: -64,-10,-12)
and the left posterior middle temporal gyrus (coordinates: -50,-58,-12) Only a subset of the voxels
in these areas, situated in the most anterior part of the temporal pole, showed an interaction with
semantics (that is, the increase in activation due to sentence structure was higher when the words
were semantically realted). No increased activation was observed in the left inferior frontal cortex
during reading sentences compared to scrambled word sequence, or with the inverse comparison. A
higher activation was observed in the left anterior temporal pole during reading semantically random
sentences and their scrambled versions compared to normal sentences and their scrambled versions.
Humphries et al. (2006) took that approach one step further by adding pseudowords conditions
to the semantically congruent and incongruent conditions, while still crossing this factor with the
syntactic structure one, thus yielding six different conditions.
Effects of syntactic structure were observed in the left anterior STS and left angular gyrus, which
showed greater activation to sentences than to word lists. On the other hands, widespread, bilateral
temporal lobe areas and the angular gyrus displayed greater activation to semantically congruent stimuli than either incongruent or pseudoword stimuli, probably reflecting increased semantic integration.
The fact that the left angular gyrus responded both to semantic and syntactic structure might
add evidence to the view that superior posterior temporal regions are involved in processing semantic
informations in the context of sentences.
The left anterior temporal lobe was “relatively insensitive to manipulations of semantic structure”,
which suggests that this area might be more purely syntactical than the other areas involved in sentence
processing.
15
2.2 Imaging literature on syntax
Figure 2.6: From Humphries et al. (2006)
Chapter 2: Some background in linguistics and imaging of syntax
16
Figure 2.7: From Humphries et al. (2006)
2.2.5
Priming of syntactic structure
Some caveats
A problem to studies relying on violation detection is that they rely on the assumption that an area
naturally associated with syntax will work harder when a violation occurs; however, the prediction is
quite tricky as to what direction the effect will be, with top-down processes being so important.
If the subject tries to bypass the violation and makes his best at parsing the sentence, then indeed
higher activation should be expected. But if the subject disengages from the task as soon as a violation
is detected, then lower activation will be predicted. This is especially the case in experimental settings
where the subject might learn to expect violations and to ignore them.
Thus, areas that are detected using violation techniques might reflect error detection rather than increased normal processing.
The same holds for jabberwocky and semantically incongruous sentences (e.g. “the truck in the cloud
ate too many chimneys”): it might still be possible - but effortful - to parse them and to try and build
a representation of their meaning. Then a subject would have the choice between either engaging,
or giving up altogether. There would then be a step-like discontinuity in activation, with activation
climbing and then suddenly dropping off when the difficulty crosses some threshold.
Habituation paradigms, that seek to identify areas in the brain that show decreased activation
following repetition of the feature of interest, do not have that problem and might be better suited to
pinpoint normal processing of language.
Two studies have used habituation paradigms (where activation in an area involved in a task gets
lower when material sharing features related to the task is presented repeatedly) to track syntactic
processing.
Noppeney & Price (2004) used four different syntactic structures in a context of local parsing ambiguity, and found a decrease in activation in left anterior temporal cortex (coordinates: -42, 3, -27),
pointing toward an involvement of that area in syntactic structure processing.
17
2.3 So where do we stand ?
Devauchelle et al. (2006) have compared habituation to to syntactic and semantic features No
habituation to syntax was seen. However, habituation to semantics was observed in the left medium
temporal gyrus and sulcus; we shall enter into more detail on these two studies in the next chapter.
2.3
So where do we stand ?
Broca’s area does not seem to subserve core-syntax
Importantly, activation in the left frontal lobe has been reported with studies involving sentences
with increased syntactic complexity (Keller et al., 2001; Caplan et al., 1998, 1999, 2000; Just et al.,
1996; Stromswold et al., 1996), syntactic violations (Moro et al., 2001; Embick et al., 2000; Kuperberg
et al., 2000; Ni et al., 2000; Kang et al., 1999), and implicitly induced syntactic analysis (Dapretto &
Bookheimer, 1999), but not when sentences do not involve complex syntactic analysis (Schlosser et al.,
1998; Friederici et al., 2000; Kuperberg et al., 2000). See fig.2.3 and 2.5
Moreover, Broca’s aphasia has been reported to lead to specific impairments to assigning a referent
to a pronoun (Vasic et al., 2006) or an antecedent to a trace (Grodzinsky, 2000), and there is evidence
that activity in Broca’s area increases parametrically with the number of displaced verb arguments
(Friederici et al., 2006), and shows sensitivity to how canonical the order of θ-roles presentation is
(Bornkessel et al., 2005).The evidence reviewed points to an involvement of Broca’s area not in core
syntax, but in second-level syntax.
Importance of the temporal lobe
Left anterior temporal regions In contrast to what has been reported of Broca’s area, activation of
the left anterior temporal lobe has been found consistently when contrasting sentences to environmental
sounds or word-lists regardless of their semantic content and of the prosody used (Mazoyer et al., 1993;
Humphries et al., 2001, 2005, 2006; Vandenberghe et al., 2002; Stowe et al., 1999). See fig. 2.5.
Left posterior superior temporal regions We have also reviewed evidence that the activity in
left pSTG and pSTS increases with verb argument complexity (Ben-Shachar et al., 2003), and errors
in the early attribution of θ-roles in case-marked languages (Bornkessel et al., 2005). This evidence has
lead authors to postulate a role for left pSTS in the interface between syntax and semantics (Vandenberghe et al., 2002; Bornkessel et al., 2005; Friederici et al., 2006; Humphries et al., 2006): posterior
temporal areas would subserve attribution of θ-roles and reinterpretation of individual words in the
context of the sentence.
Additional support for this hypothesis come from the lesion studies that show that Wernicke’s
aphasics have an impaired performance on complex verbs (Shapiro et al., 1993); according to a study
by Pinango & Zurif (2001), Wernicke’s aphasics show a deficit in semantics at the level of sentence
beyond their deficit in lexical-semantic retrieval.
Indeed, that view fits well with evidence from event-related brain potential studies, that have identified a late centro-parietal positivity (labelled P600) around 600 msec after the onset of a critical word
that introduces a syntactic violation or a ’garden-path’ (Friederici et al., 1993; Osterhout et al., 1994).
Patients with lesions in the posterior portion of the left temporal lobe including the pSTG have been
reported to show a selective absence of the P600 (Friederici & Kotz, 2003).
As proposed in Bornkessel et al. (2005), a view of left posterior superior temporal regions attributing agency fits well with a conception of posterior STS playing an important role in the identification
of agency more generally (Frith & Frith, 1999), based on observations that this region engages in the
processing of biological and implied motion, moral judgements, and tasks involving theory of mind
Chapter 2: Some background in linguistics and imaging of syntax
18
(reviewed in Frith & Frith (2003)). Indeed, all these tasks attribute a thematic representation to an
object or person.
Chapter 3
Disentangling word form, single
word meaning, and thematic roles
Here, I shall first present briefly the study that was run by Devauchelle et al. (2006); I shall then
explain in more detail the second experiment that logically stemmed from the first one.
3.1
The first study
The aim of the study was to distinguish areas responsible for syntactic and semantic processing, using
a habituation paradigm.
Four different repetition conditions were devised to disentangle syntax and semantics; repeated
sentences shared either:
1. both semantic and syntactic features (repetition of the exact same sentence four times)
2. semantic features (the same words were repeated and had the same θ-roles, but syntax was
varied)
3. syntactic features (the sentences shared the same syntactic tree)
4. neither syntactic nor semantic features (the sentences had different content words and different
syntactic trees)
An example is given below, see fig.3.1.
3.1.1
Methods and results
Methods
Briefly, during a session, subjects were presented with twenty blocks of four sentences that fell into one
of the four above mentionned conditions (see fig.3.2). There were six different sessions per subjects.
Visual and auditory modalities alternated across sessions.
To ensure that subjects attended to the stimuli, they were instructed to press a button in a few
sentences.
Results
Habituation to syntax No adaptation to syntactic structure was observed in either perceptual
modality.
Habituation to semantics Adaptation was observed in left inferior frontal and middle temporal
regions, as can be seen on fig.3.3.
19
Chapter 3: Disentangling word form, single word meaning, and thematic roles
20
Figure 3.1: Example of stimuli used by Devauchelle et al. (2006) to probe for adaptation to syntax
and semantics
Figure 3.2: Design of Devauchelle et al. (2006)
21
3.2 The second study
Figure 3.3: Results of Devauchelle et al. (2006): no adaptation was found to syntactic structure, but
there was adaptation to semantic features
3.1.2
Discussion
Absence of habituation to syntax
The absence of habituation to syntax is at odd with Noppeney & Price (2004), who did find adaptation
in the left anterior temporal gyrus while also using a habituation paradigm. I shall discuss that
discrepancy in the next chapter as an introduction to my own study.
Adaptation to semantics
Let us try and examine how similar sentences were in the “semantically similar” condition.
The similarity of the sentences can be broken down into three parts:
1. similarity of the word form
2. similarity of individual content words meaning
3. similarity of θ-roles mapping
It is impossible to conclude from the first experiment to which of those three sets of repeated features
brain areas adapt, and if some of the activated brain areas adapt more selectively to one particular set
of features. Therefore, a second study was designed to disentangle these mixed influences.
3.2
The second study
In order to disentangle single word semantics, word form, and θ-roles mapping, we 1 devised an experiment that varied each of those three factors as independently as possible (full orthogonality could
not be achieved because the three factors are correlated; e.g., it is generally not possible to find two
1 This
experiment was run conjointly by Anne-Dominique Devauchelle and myself
Chapter 3: Disentangling word form, single word meaning, and thematic roles
22
Same single-word meaning
Same θ-roles
Different θ-roles
Le flic a assommé le bandit.
Le bandit a assommé le flic.
Le policier a matraqué le délinquant.
Le délinquant a matraqué le policier.
C’est le bandit que le flic a assommé.
C’est le flic que le bandit a assommé.
C’est le délinquant que le policier a matraqué.
C’est le policier que le délinquant a matraqué.
Le bandit a été assommé par le flic.
Le flic a été assommé par le bandit.
Le délinquant a été matraqué par le policier.
Le policier a été matraqué par le délinquant.
C’est par le flic que le bandit a été assommé.
C’est par le bandit que le flic a été assommé.
C’est par le policier que le délinquant a été matraqué. C’est par le délinquant que le policier a été matraqué.
Different single-word meaning
Les convives ont acclamé l’hôte.
C’est l’hôte que les convives ont acclamé.
L’hôte a été acclamé par les convives.
C’est par les convives que l’hôte a été acclamé.
Table 3.1: Full array of conditions
words that share the same word form but have different individual meanings).
More specifically, here is how variations were obtained in each of the dimensions (single-word
semantics, θ-roles, word form):
1. single-word semantics: words were changed for words of an unrelated semantic field (e.g. “the
priest buys a bag” vs. “the girl hits the boy”)- in this case, neither of the other two dimensions
was preserved.
Therefore, variations along the two remaining dimensions were embedded in the “same singleword semantics” conditions and could be crossed orthogonally in a 2x2 factorial design:
2. θ-roles assignement: subject and object of a transitive verb were swapped (e.g. “the lawyer hits
the girl” vs. “the girl hits the lawyer”)
3. word form: words were changed for synonyms (e.g. “the miss strikes the attorney” vs. “the girl
hits the lawyer”).
Moreover, these variations were further crossed with variations in the syntactic structure, with
sentences being either simple (subject DP, transitive verb, object DP), cleft, passivized, or passivized
and cleft, so that habituation to syntax could also be examined.
A full grid of conditions is provided table 3.1.
3.2.1
Design
3.2.2
Materials and methods
Subjects 6 healthy, right-handed, French native speakers participated to the study after having
given written informed consent.
Design 180 pairs of sentences (a primer sentence and a target sentence) were presented.
Repetition was reduced from four in Devauchelle et al. (2006) to only two sentences, because results from Devauchelle et al.’s experiments had shown that habituation was already present at the first
23
3.2 The second study
repetition.
The target sentence was always a noncleft active one. The primer sentence could be:
• noncleft active, noncleft passive, cleft active or cleft passive, with equal 25% probability
• semantically related (80% trials) or not related to the target sentence (no similarity at the level
of single-word meaning, 20% trials)
The 80% semantically related primer sentences were further divided into four groups by crossing
two factors (first factor: word form; second factor: θ-role assignment):
• half “same words” sentences (identical individual content words), half “different words” sentences
(synonyms)
• half “same θ-roles assignment” sentences (thematic subject and object were not moved), half
“different θ-roles assignment” sentences (subject and object were swapped)
Stimuli 180 blocks of 20 sentences corresponding to the 20 different primer types were generated.
Each subject was presented with exactly one pair of sentences from each single block. Conditions were
assigned to trials in a pseudo-randomized way.
As to plausibility, sentences were:
• 25% plausible and reversible (that is, they remained plausible when subject and object were
swapped; e.g. “le docteur a interrogé le patient”)
• 25% plausible and not reversible (e.g. “Le garçon a ciré ses chaussures”)
• 25% implausible and reversible (e.g. “le spaghetti a adoré les noisettes”)
• 25% implausible and not reversible (e.g. “le paquet a livré le facteur”)
All target sentences were built according to the same syntactic structure: DP + present perfect
verb + DP and had three content words.
Task Subjects were instructed to press one button when the sentence presented was judged plausible,
and the other when the sentence was judged implausible. The assignment of buttons to plausibility
conditions was reversed after two sessions. Subjects underwent a small training outside the scanner to
ensure they understood the task.
Procedure There were four sessions of 12 minutes for each subject, corresponding to the presentation
of 45 pairs of sentences. Sentences were presented using the rapid serial visual presentation (RSVP)
technique: words were projected one at a time for 270 msec on a screen in white letters against a black
background. After each sentence, the screen went black for a time that varied randomly between 4
and 6 sec for subjects 1 to 4 , 2 and 8 seconds for subjects 5 and 6, mean 5 sec. The wider interval
of variation was introduced in the last session to decrease statistical correlation of stimuli which could
have prevented satisfying modelization.
3.2.3
Behavioral results
Response time data across the four pilot subjects showed that there was a strong behavioral priming
effect of individual word meaning, θ-role attribution, word form (see fig.3.4). There was no effect of
syntactic structure of the primer sentence.
Chapter 3: Disentangling word form, single word meaning, and thematic roles
24
Figure 3.4: Effect of semantic priming on response time
3.2.4
Confusing imaging results
The initial plan was to compare habituation for different categories of semantic priming. However,
analysis of fMRI time series yielded weird activation and contrast maps. Not only did we fail to find
a decrease in activation, but the results of overall activation did not even seem to make a lot of sense.
This has led us to think that the design of the experiment was probably flawed, with regressors
being too correlated to allow for a reliable estimation of the model. However, I shall briefly go through
possible alternative explanations for the absence of results.
3.2.5
Possible other explanations
The stimuli Synonyms are never true synonyms. Therefore, if the adaptation found in the first
study was due to adaptation to individual word meaning, then that adaptation could disappear when
synonyms were used as primers instead of the real word, as their meaning only approximates that of
the target word.
Yet activation should have been reduced all the same in half the cases, when the very same words
were used as primer and target, and that was not found to be the case.
Moreover, the behavioral analysis shows that priming by synonyms was efficient in terms of reducing
reaction times.
Are the stimuli too short ? The target sentences were quite simple (affirmative active sentences
that seldom contained more than six words), which was not the case in Devauchelle et al. (2006).
25
3.2 The second study
Figure 3.5: Effect of semantic priming on response time
Chapter 3: Disentangling word form, single word meaning, and thematic roles
26
Perhaps these simple sentences led to small activations, the variations of which could more easily be
drowned in task-related variations.
Semantic incongruency The task used was a plausibility task. In order to make judgements about
plausibility as effortless as possible, sentences were often grossly meaningless, with unrelated semantic
words (e.g. “le paon a mangé le nuage”). Thus, adaptation to θ-roles might have been prevented
by the semantic incongruency of words, that might have discouraged the subject to try and attribute
θ-roles.
However, adaptation should have been observed for the sentences that were plausible, and that
was not the case (that is, when plausible sentences used the very same words, to avoid any problems
caused by the choice of synonyms on the one hand, and semantically incongruent words on the other
hands. This should have replicated the former study by Devauchelle et al. (2006)).
The task The subjects did not have to perform any particular task in the first study, apart from
pressing a button once in a while when instructed to. No task was used either in Noppeney & Price
(2004). Habituation paradigms in syntax might be particularlry sensitive to tasks.
We have used a plausibility judgment task, that has been used previously in imaging studies of language (Stromswold et al., 1996). Feedback from subjects showed that the task was indeed not that easy.
The task itself might have disturbed adaptation processes. If the task of making a plausibility
judgement is intellectually effortful, then subjects might have tried to reduce it to its minimum by
turning it into a linguistic similarity judgement:
1. They have made a judgement for the primer sentence
2. As they do not want to take great pains again making a judgement about the target sentence,
they compare it first against the first one to see if they have the same meaning (because if they
do, there is no longer any need to make a judgement)
3. If meanings do not match, they make a decision about plausibility again.
This could explain why we found areas that appeared to be more activated with target sentences
than primer sentences.
In particular, a possibility might be that the culprit for failure to model successfully the data is an
interaction between design and task. Indeed, we used a rapid event paradigm, that has been shown
(Josephs & Henson, 1999) to allow for good detection of differences in activation, but poor detection
of activations against baseline with the kind of SOA we used, especially as our design did not contain
a “silence” condition against which the other conditions could be compared.
The task may have led to intractably high variability from one trial to the other, so that small
differences related to semantic processing might have been drowned in parasit differences related to the
task, especially as the sentences used were often short and simple (this is true of every target sentence);
all conditions involving language, no “sanity check” could be performed to check that activated areas
seemed to make some sense.
3.3
Further directions
A slightly different version of the experiment is to be run soon. The task will be removed to replicate
the conditions of the first study by Devauchelle et al. (2006), and perhaps some silence added to allow
for more reliable statistical estimation of sheer activations. The findings of the first study should at
least be replicated.
Given the presence the behavioral priming effect, it is unlikely that priming is simply not working in
this case.
Chapter 4
Core syntax at the phrase level
I shall first start by comparing studies by Noppeney & Price (2004) and Devauchelle et al. (2006), and
from then move on to the pilot experiment I have run.
4.1
Reconciling Devauchelle et al. and Noppeney & Price
Both of the studies I am going to discuss now looked for habituation to syntactic structures; in one
case, no habituation was found, while in the other, an area was identified in the temporal lobe.
Noppeney & Price (2004)’s study compared activation to the presentation of sentences that were
syntactically ambiguous.
Two types of syntactic ambiguities were used: clause boundary ambiguity and reduced relative/main clause ambiguity (see examples table 4.1).
Subjects were simply asked to read the sentences silently, and informed that their eye-movements
would be monitored to ensure that they paid attention.
Sentences were presented by blocks of five sentences in either a primed or an unprimed condition;
in the primed condition, the five sentences were of the same syntactic type, while in the unprimed
condition, the types of the sentences were randomized.
The primed condition resulted in decreased left anterior temporal activation, providing evidence
that this area is involved in syntactic processing.
These results show that syntactic structure can indeed lead to habituation, ruling out an explanation
of Devauchelle et al.’s results based solely on an inadequacy of the habituation procedure.
However, there could be at least two explanations to this discrepancy:
• the number of different syntactic structures used was four in Noppeney & Price (2004), and
about twenty in Devauchelle et al. (2006)
• the study used a context of local ambiguity, which is not the case in Devauchelle et al.
Let us focus on the second explanation.
Devauchelle et al. manipulated syntactic structure in a way that made large use of the possibilities
offered by second level syntax, and the syntactic contrasts used, involving movement, cleft sentences,
passivization, etc, were all contrasts relevant to second-level (not core) syntax, and had not much to
say about core-level syntax.
Conversely, Noppeney & Price’s experiment used sentences where the syntactic contrast bore specifically on the core-level tree being built. Moreover, ambiguity was maintained throughout the experiment, which might have focused attention of the subjects specifically on the object of ambiguity - that
27
Chapter 4: Core syntax at the phrase level
28
Clause boundary ambiguity
1. Preferred reading: late closure.
e.g.: Before the director left the stage the play began.
2. Nonpreferred reading: early closure.
e.g.: after the headmaster had left the school deteriorated rapidly.
Reduced relative/main clause ambiguity
1. Preferred reading: simple active.
e.g.: The artist left his sculptures to the British Museum.
2. Nonpreferred reading: reduced relative.
e.g.: The child left by his parents played table football.
Table 4.1: Examples of sentences with the four different syntactic forms, Noppeney & Price (2004)
is, the tree being built -, as in a kind of challenge where the goal was to resolve the ambiguity (which
is this case amounted to guessing the underlying syntactic tree).
Thus, core-syntax processes might have been amplified by that context of ambiguity, which might
have made the adaptation easier to capture.
4.2
Tracking constituents
As reviewed in the background chapter, previous studies have provided some evidence that the core
process of syntax - the process that underlies all syntax, that is, chaining together two isolated words
to build a phrase - might be carried out in the anterior portion of the left temporal cortex.
The approach taken in the study I shall now describe stems from the intuition that if an area is involved in core syntactic processing, that is, is somehow in charge of building constituents from smaller
chunks of language, then it should vary quite monotonically with the number of words that build up
into a tree. Roughly speaking, the bigger the tree to build, the harder a tree-building area has to work.
More explicitly, it seems reasonable to expect that a node is harder to build when the leaves it
merges are “heavier”; that is, building a DP from D “the” and NP “cat” would be easier than building
a DP from D “the” and NP “cat that my mother found on the street”, so that we hope that activation
of this area for building the whole phrase “the cat that my mother found on the street” would be
higher than the sum of the activations for “the cat” and “my mother found a cat on the street”.
So, building the tree for the sentence: “the black cat chases the white mouse” is expected to lead
a hypothetical syntax area to activate more than the sums of activities engendered by processing
“the black cat”, “the white mouse”, “she chases him”, even if the overall number of nodes and types
of nodes (two DPs from a D, an AP and an NP, and a TP from two DPs and a tensed verb) are the same.
Building on this intuition, we devised an experiment where a constant number of words could be
combined into syntactic trees of progressively larger sizes. So to speak, we tried to fill the gap between
sentence and isolated word processing.
What’s twelve words ? Leaving aside all non-homogeneous assignments, twelve words can be a
twelve word constituent, two six word constituents, three four word constituents, four three word constituents, six two word constituents, or twelve isolated words. An area involved in building phrases
from smaller phrases should display a progressive increase in activation when constituents get bigger.
29
4.3 Material and methods
This is exactly what we looked for in this study.
Relying merely on the number of words in a constituent to predict activation might seem a little
too vague, as there are a number of other factors that come into play when it comes to evaluating
complexity of a structure. For instance, it might not seem reasonable to predict that it is harder to
build a representation for the phrase “the very very very very very very very very very very big car”,
than for the phrase “please let her help me fold these sheets.” Care was therefore taken that the most
accessible factors were controlled: thus, the number of content words did not vary across conditions
(indeed, stimuli were generated without obvious biasses, as the same content words were used across
conditions to ensure that the design was well balanced); passivization and negation were avoided, as
well as object relatives. Furthermore, only monosyllabic words were used, so that the syllabic input
was carefully controlled.
4.3
4.3.1
Material and methods
Subjects
This study is still in its early stages of development, so only two healthy, right-handed, native French
speakers with no history of neurological damage have participated in it so far, after giving written
informed consent.
4.3.2
Design
144 twelve-word series were presented to subjects in a slow event design in two runs of 16 minutes.
The series of words could be of six different conditions, according to whether they were formed of 1,
2, 3, 4, 6, or 12 word constituents (see table 4.2).
Single trials were presented in a pseudorandomized order to balance transition probabilities between
conditions.
4.3.3
Stimuli
Stimuli were generated by blocks of six sentences. Each sentence contained six monosyllabic content
words, and one dependent clause. Content words were then rearranged across sentences to obtain
blocks of 6 12 word series corresponding to each of the five remaining conditions. Function words were
not necessarily conserved, nor agreement or tense features.
In order to avoid subjects rearranging shuffled content words to get the original sentence back, content words were shuffled across sentences rather than keeping always the same content words together
in the same 12 word string.
All sentences were active, affirmative ones, with no object relative and a word order that tried to
be as canonical as possible. All the words used were monosyllables.
4.3.4
Task
Subjects were instructed to press a button whenever they saw a proper name. They were not explicitly
asked to understand the meaning of the sentences; however, syntactic parsing is a highly automatized
task, so we expected to see syntactic activation nonetheless. In order to avoid the task being a mere
“capital letter” spotting task, we presented all words in lower-case font, regardless of whether they
were a proper name or not.
Chapter 4: Core syntax at the phrase level
Conditions
One word
mur nous boire la des chez scène bas toile laisses très en
je belle nous moite sans tu brique par chez eau soie lâchent
sans rêche pour tombe longent sous frais un je mains chien des
pour rends cour chez soif je fibre des une geste par glisse
noir ils joie sans fais hors dans sueur elle jambe des donne
suie des peu elles toi bruit maude ils hors crains chanvre pour
Two words
le mur la scène des bas pour boire trop belle lâche ça
en brique une eau tout moite en soie donne ça une laisse
longe la trop frais ma main ils tombent le chanvre un chien
la cour la soif rends les ils glissent en toile le faire
tout noir une joie la sueur mes jambes une fibre un geste
la suie si simple crains le sans bruit pour maude trop rêche
Three words
longe la cour tu as soif glisse sans bruit ces deux chiens
noir de suie tant de joie sur sa jambe lâche la laisse
mur en brique trop de sueur sans un geste une fibre rêche
des mains moites tout ce chanvre bas de soie elle est fraı̂che
boire cette eau une belle toile en faire trop il y tombe
une simple scène donnes en moi craindre pour soi maude les rend
Four words
le mur en brique tout moite de sueur le chanvre est rêche
de la belle toile il longe la cour geste de la main
est noir de suie donnent de la fibre il craint le bruit
boire de cette eau le bas de soie il lâche la laisse
la soif est fraı̂che glisse sur sa jambe un chien qui tombe
ils en font trop une joie si simple maude rend la scène
Six words
le mur qui longe la cour le bas se soie qui tombe
la brique est noire de suie ses jambes glissent sans un bruit
boire de cette eau si fraı̂che le chanvre est une fibre rêche
par une simple soif de joie ça donne de la belle toile
la sueur de ses mains moites maude qui lui fait un geste
ils craignent de rendre la scène il lâche la laisse du chien
Twelve words
le mur en brique qui longe la cour est noir de suie
boire cette eau fraı̂che quand on a soif est une joie simple
il craint que la sueur rende ses mains moites sur la scène
le bas de soie qui tombe glisse sur sa jambe sans bruit
le chanvre est une fibre rêche qui donne de la belle toile
il lâche la laisse de son chien quand maude fait un geste
Table 4.2: Sample block
30
31
4.3 Material and methods
A proper name was inserted in 1 over 6 sentences (twice in every condition) and could occur anywhere in the sentence.
4.3.5
Procedure
Rapid serial visual presentation was used, to control for visual input and eye movements, while not
preventing syntactic processing. No clue whatsoever informed the subjects as to the structure of the
twelve word series he was presented; that is, no punctuation mark was inserted to trace boundaries
between constituents. This choice was made to ensure that conditions differred minimally visually.
A fixation cross was displayed for two seconds at the beginning of each trial to make sure that the
subjects were prepared to pay attention, and avoid any “surprise effect” at the first word.
Twelve word series were then presented one word at a time, with a word duration of 300 msec. Then
the screen went black for 8400 msec, so that the duration of a complete trial was 14 sec.
Subjects underwent a small training session outside the scanner to ensure they understood the
task, and to prepare them to the unfamiliar nature of the stimuli.
4.3.6
fMRI Scanning Technique
Imaging data were collected on a Bruker 3T scanner with standard head coil. Functional images
used EPI (TR=2.4 sec, TE=30 msec, Matrix=64x64 ; Voxel size=3.5x3x3mm; Number of slices = 40.
The anatomical scan used a 3D gradient echo sequence (TI=2530 msec, TE=3.3 msec, TR=8.6 msec;
voxels=1.33x1.2x1.2 mm).
4.3.7
Data Analysis
Data preprocessing and analysis were performed using SPM2 software. All volumes were realigned
to the first volume and resliced using a sinc interpolation. The time series (fMRI) in each voxel was
high-pass filtered to 1/60 Hz and globally normalized with proportional scaling.
A General Linear Model was estimated for each of the two subjects, using standard HRF as a basis
function. The model consisted of 13 activation conditions: 6 conditions for reading twelve-word strings
that made up constituents of one, two, three, four, six or twelve words, where no response had been
given, 1 condition corresponding to the pressing the button, and 6 conditions shadowing the first six
conditions for trials where a response had been emitted.
Another model with five FIR regressors per conditions was also estimated, to have a better feel of
the shape of thre responses. For this model, baseline activation was taken as the fixation of the cross
and the 2 first seconds after presentation of the first word of a trial.
A third model using Fourier basis functions set was also calculated, in order to look for any effect
of conditions on the phase of the response. Indeed, the 6 response-less conditions had a very different
phase pattern than the 6 with-response conditions; but no convicing difference could be found across
the 6 conditions of interest, so I shall not show data on the phase here.
In all models, nuisance covariates included the realignment parameters (to account for motion artefacts).
A linear contrast, postulating a linear increase across conditions, was used to locate areas showing
the expected monotonic pattern of activation (1 word < 2 word < 3 word < 4 word < 6 word < 12
word condition). We report activations at a significance threshold of p < 0.001, uncorrected.
Chapter 4: Core syntax at the phrase level
32
Figure 4.1: Areas activated by reading words
4.4
4.4.1
Results
Behavioral results
Subjects were asked to press a button whenever they saw a proper name. A proper name was inserted
in 1 over 6 sentences and could occur anywhere in the sentence. Thus, a subject had to detect 24
proper names on the whole.
Subjects performed significantly above chance. The second subject performed well with only two
false alarms and no misses. The first one was somewhat less reliable, and missed 4 proper names, while
pressing for two false alarms (e.g. pressing for “gorge”).
4.4.2
Network activated by all conditions
An activation of bilateral occipital, temporal areas was seen following presentation of words. Word
reading relative to baseline resulted in predominantly left-lateralized activation in the left superior
temporal sulcus, middle temporal gyri spreading into the temporal poles bilaterally. Activation was
also observed in the left inferior frontal gyrus and parietal lobe.
4.4.3
Linear contrast
The linear contrast was chosen to detect areas presenting an increase across conditions. It should be
noted that the contrast was linear according to condition numbers, not numbers of words, because we
expected the activation to increase more steeply in the low indice conditions.
The areas that were selected by this contrast were highly left lateralized.
The regions found for the two subjects could not be said to really overlap, however both subjects presented roughly the same pattern of areas responding to the linear contrast: one area in the
L-AG/L-MTG, one area more anteriorly in the L-MTG, and some patches of activation in the L-aMTG.
The more anterior areas displayed increasing activations, whereas the areas in the L-AG displayed
deactivation for the low-indice conditions, and activations not significantly different from baseline in
33
4.4 Results
Figure 4.2: Voxels responding to the linear contrast, p<0.001, uncorrected
Chapter 4: Core syntax at the phrase level
Table 4.3: MNI coordinates - clusters > 5 voxels
Subject 1
Coordinates
Region
x
y
z
T score Extent
Left angular gyrus
-48 -60 27
4.71
22
Left middle temporal gyrus
-42 -54 15
4.78
8
-54
6
-27
4.15
9
Left middle frontal gyrus
-45
6
51
4.08
12
Left precentral gyrus
Left superior frontal gyrus
Left medial superior frontal gyrus
Left inferior frontal gyrus
pars opercularis
Left inferior temporal gyrus
Right precentral gyrus
34
Subject 2
Coordinates
x
y
z
T score
-51 -66 21
3.56
-45 -57 12
3.99
-57 -15 -6
4.86
-39
-27
-9
-57
-6
57
42
9
63
24
51
24
3.96
4.11
4.30
4.16
7
6
9
8
-33
15
3
-21
-39
72
4.06
3.99
6
16
the higher-order conditions.
One of the two subjects also displayed a monotonic increase in activation in Broca’s area (L-IFG).
Other activations were found in various locations in one subject but they are not discussed as they
were not present in the other subject.
4.5
4.5.1
Discussion
Areas
The results could seem somewhat disappointing, as they show very patchy response to the linear
contrast. However, the most activated voxels were found in areas of the temporal lobe or the temporoparietal junction that were repeatedly implicated in the processing of sentence vs. isolated words
(Mazoyer et al., 1993; Vandenberghe et al., 2002; Stowe et al., 1998, 1999; Noppeney & Price, 2004;
Humphries et al., 2001, 2005, 2006; Kaan & Swaab, 2002) and the interface between syntax and
semantics (left posterior superior temporal areas, left temporo-parietal junction, left angular gyrus),
as I have reviewed in the second chapter (Ben-Shachar et al., 2003; Pinango & Zurif, 2001; Shapiro
et al., 1993; Shapiro & Levine, 1990; Bornkessel et al., 2005; Humphries et al., 2006).
4.5.2
Extent
9
6
12
Patterns of activation
Left inferior frontal gyrus As reviewed in the second chapter, a number of studies suggest that
Broca’s area is involved in linearizing displaced constituents and dealing with sophisticated parts of
the syntax.
There is not much room for movement and sophistication in a single word world, and even a twoword world. It seems reasonable to say that the more the words, the higher the probability that some
part of sophisticated syntax will sneak in. The activation found in Broca’s area for one of the subjects
fits that account, with a step-like increase followed by a smoother curve.
Left anterior temporal regions As can be seen on figs. 4.3 and 4.4, patterns of activation seemed
to be different in the left angular gyrus (L-AG) and anterior middle temporal gyrus (L-aMTG).
Activation in the middle anterior regions seemed to follow a quite smooth curve while the number of
words in a constituent increased, whereas activation in the more posterior regions seemed more of an
“all-or-none” type.
35
4.5 Discussion
Figure 4.3: Estimated parameters of the HRF model for the first subject
Chapter 4: Core syntax at the phrase level
Figure 4.4: Estimated parameters of the HRF model for the second subject
36
37
4.5 Discussion
Figure 4.5: Regions found in the study by Vandenberghe et al. (2002) (adapted from Vandenberghe
et al. (2002))
Chapter 4: Core syntax at the phrase level
38
These patterns do make some sense in the light of recent proposals regarding the roles of posterior and anterior temporal regions in language. As we have reviewed earlier, there is considerable
evidence linking the left anterior temporal regions to the processing of sentences, irrespective of their
semantic content (Mazoyer et al., 1993; Vandenberghe et al., 2002; Humphries et al., 2001, 2006) . If
this area is in charge of building trees, and it is not surprising that activation should increase gradually.
We had foreseen a steeper increase in the low indice conditions (which is why we chose a contrast
that predicted the same increase when going from 2 to 3 word phrases, or when going to 6 to 12 word
phrase). Rather, the activation seemed to plateau around the 6 word condition.
An explanation could be that the 12 word condition was the only one that did not contain an
unexpected discontinuity (unexpected because the subject had no clue about the condition they were
in until a word that could not be integrated into the tree appeared), thus allowing smooth reading. It
would be interesting to see if the plateau would be observed at 12 word phrases too, in a paradigm
where conditions would go from isolated words to, say, 18 word phrases.
Left posterior temporal regions/Left angular gyrus The posterior temporal regions have been
postulated to play a crucial part in the interface between syntax and semantics, particularly in the
attribution of agency and mapping of θ-roles on arguments (Ben-Shachar et al., 2003; Pinango & Zurif,
2001; Shapiro et al., 1993; Shapiro & Levine, 1990; Bornkessel et al., 2005; Humphries et al., 2006)..
If this is true, what activation pattern would we predict ?
In order for arguments to exist, there has to be space for them; that is, at least one verb and
one argument, and probably one word more to allow for a determiner. A steep increase in activation
of an area involved in argument processing should be expected, with the discontinuity corresponding
to the constraint that there need to be room for the argument. Indeed, only activations (that is,
deactivations) observed in conditions 1 and 2 differ significantly from baseline activation in the L-AG
in subject 1, and this is almost true of subject 2 as well (the 4 word phrase condition hardly reaches
significance).
Interestingly, the observed difference in activity for conditions 1 and 2 are deactivations, not activations.
This should not come as a surprise, as the left angular gyrus has been shown to be part of a network
of area that is spontaneously active during so-called “rest” (Binder et al., 1999). I shall get back to
this shortly.
A role for the L-AG and posterior MTG in attributing syntax-dependent meaning to words is
particularly interesting given how close these areas are to the left temporo-parietal junction (L-TPJ),
which has been postulated to mediate attribution of beliefs (Saxe, 2006). See fig. 4.7.
Attributing θ-roles to arguments of a verb might be seen as a very elementary preliminary to
attributing mental states to people; in both cases, a representation has to be reinterpreted in the light
of the context it is embedded in.
Of particular interest is the evidence that deaf children that do not sign natively have impaired
theory of mind when compared to native ASL signers, even when the task does not involve language
(de Villiers & de Villiers, 2000; Woolfe et al., 2002; Peterson & Siegal, 1995).
Thus, if we consider areas L-pSTS, L-AG and L-TPJ as a whole:
1. People with lesions in posterior temporal regions have impaired argument processing (Shapiro
et al., 1993; Shapiro & Levine, 1990) and impaired reinterpretation of words according to the
sentence they are in (Pinango & Zurif, 2001)
39
4.5 Discussion
Figure 4.6: Regions postulated to play a crucial role in understanding others’ minds. From Saxe et al.
(2004)
Figure 4.7: Activation in the bilateral TPJ is higher for stories involving Theory of Mind and attribution of false beliefs than stories about objects or descriptions of people or objects. Adapted from Saxe
& Kanwisher (2003)
Chapter 4: Core syntax at the phrase level
40
2. pSTS shows increased activation with verbs that require more arguments (Ben-Shachar et al.,
2003) and when a false thematic mapping has to be reinterpreted (Bornkessel et al., 2005)
3. People with lesions in the L-TPJ have impaired theory of mind (Samson et al., 2004)
4. bilateral TPJ is activated when attributing false beliefs to people (Saxe et al., 2004; Saxe &
Kanwisher, 2003)
5. These areas are spontaneously active at rest, so that they display deactivation in perceptual
tasks (Binder et al., 1999)
6. Late signers of ASL have impaired theory of mind
Let us try and tie this evidence together: there might be some kind of gradient of increasingly
sophisticated attribution of specific semantic content from posterior superior temporal lobe to the
temporo-parietal junction; attribution of agency and intention to persons requires the capacity to represent thematic functions, that is, to adapt the semantic content of a representation to a context of
interaction.
Attributing a θ-role to a noun is like creating a tight link between that noun and the verb which
takes it as argument, and this link is somewhat independent from the rest of the world.
Perhaps representing someone’s beliefs relies on this kind of ability to build a priviledged and “protected” (that is, isolated from the rest of the world) relationship between two entities. If I say, “the cat
chases the dog, and the dog runs away”, I have in fact two dogs: an accusative dog and a nominative
dog. I am able to replicate my representation of the dog as many times as needed with a particular
θ-role for each single verb of which it is an argument, and the θ-role attributed by a verb will not leak
out of the verb-argument pair.
This capacity to represent one single object with bilateral relationships (to different entities) that
do not interfere with one another might underlie the capacity to attribute beliefs to people: the marble
can be both inside the basket for Sally and inside the box for Anne, just as the dog is at the same
time Accusative and Nominative.
So how can we tell the story ?
During rest, we represent a thousand things that have all kind of interactions, and these areas
are active. If given a perceptual task that has nothing to do with attribution of states, then the
area deactivates. If the task requires some attribution of thematic roles, the area is as active as in
rest. Finally, if the task involves a great deal of reasoning about intentions and beliefs, then the areas
activate above baseline for real.
Chapter 5
General discussion and further
directions
5.1
Adaptation to semantics
Starting from the results of Devauchelle et al. (2006) which gave evidence for semantic habituation in
the left middle temporal gyrus, we ran an experiment to disentangle adaptation due to word form,
meaning of individual words, and attribution of θ-roles in the sentence.
We obtained results that varied widely (“wildly” might even be better suited here) between subjects and seemed to indicate that something was wrong with the design or the task, which prevented
statistical modelling from yielding any informative results.
Nice behavioral priming results showed that the stimuli used were promising; moreover, the results
of the statistical analysis did not look like “normal” results which would simply fail to show the type of
adaptation we were longing for, but rather, like something went completely wrong in the way, probably
at the level of the temporal presentation of events.
Despite a number of modifications to the design between the first four subjects and the remaining
two (e.g. increasing the jitter between conditions to decrease the correlation between conditions), the
results remained puzzling and call for more substantial modifications.
While we are still trying to find out what could lead to the initial puzzling results, we are planning
to clean the experiment from a number of possible sources of trouble, by removing the task, introducing
silent conditions, and perhaps even turning to a slow event related paradigm.
5.2
Looking for core syntax and constituent-building areas
The second study aimed to investigate the neural correlates of constituent building during silent wordlist reading.
Subjects were submitted to increasingly large constituents while controlling for the number of words,
content words, and the number of syllables (as only monosyllabic words were used).
Results of the two pilot subjects pointed towards involvement of areas strongly left-lateralized, in the
temporal lobe. Although the areas that appeared do not fully overlap, the results nonetheless showed
some consistency across subjects, in that the linear contrast we tested pointed in both cases towards
the recruitment of three clusters in the left angular/posterior middle temporal gyrus, anterior middle
temporal gyrus medially and in a more anterior part.
This consistency is particularly interesting in view of accumulating evidence of an involvement of these
41
Chapter 5: General discussion and further directions
42
areas in the normal processing of sentences, with various interactions with single-word level semantics
(Humphries et al., 2006, 2005, 2001; Noppeney & Price, 2004; Vandenberghe et al., 2002; Mazoyer
et al., 1993).
5.3
5.3.1
Further directions concerning the core-syntaxe study
Controlling for mere sequence effects
One might argue that the activation found is due not to processing of constituents, but with processing
of sequences of adjacent words. The paradigm used for the pilot study does not allow to separate
constituency and adjacency. An additional control condition will be introduced, where chunks of
sentences made up of an increasing number of adjacent words are presented.
5.3.2
Decreasing variability between subjects by introducing tighter control on stimuli
Results of the two pilot subjects show that the parametric increase of syntactic structure leads to
monotonic activity in similar areas across subjects, yet the overlap is not really satisfying.
The approach to generate stimuli was to eliminate a number of potentially confusing factors, while
keeping complete freedom within the framework of some constraints (number of content words, number
of syllables, no object relative, no passive, no negation).
This strategy can be viewed as an “opt-out” one, as it starts from the whole pool of grammatical
sentences and excludes some of them according to the given criteria. It allowed for wider coverage of
possible syntactic trees, but this might have been at the expense of consistency in activations across
subjects.
Shifting from this “opt-out”strategy to an “opt-in” one, where we start from scratch and decide
that such and such are the only sentence patterns that will appear in the stimuli, would allow for a
tighter control of stimuli, and would ensure that the syntactic structures presented are matched as
closely as possible across subjects.
We plan to introduce a tighter filter of possible syntactic structures, but have not decided yet how
wide the coverage of possible sentences will be.
Conversely, we consider abandoning the “monosyllabic word” constraint, as it does not seem obvious that the benefits of retaining it match its cost. Removing it from the design would allow a better
control of the other, more critical constraints.
5.3.3
Differentiating between agency and phrase building
The areas we found to respond to a linear contrast according to the number of words in maximal
constituents have received increasing interest from investigators looking for syntax, as reviewed in
chapter 2.
Here is the proposed mapping 1 that might guide ideas:
1. left posterior superior temporal areas are involved in retrieving meaning of single words as modified by the sentence context in which they appear. This corresponds to activation related to
treatment of argument properties of predicate, θ-roles attribution
1 It
should be noted that studies that agree on the involvement of areas in the left temporal lobe vary in terms of
lateralization. We keep the safer “left-lateralized” assumption, especially as our data were strongly left-lateralized.
43
5.3 Further directions concerning the core-syntaxe study
2. left anterior middle temporal areas might be involved in concatenating individual meanings of
words into the meaning of a sentence
3. more anterior parts of the left anterior middle temporal gyrus and the left temporal pole seem
to subserve sentence building even with pseudowords or semantically incongruent words
Keeping that framework in mind, it is tempting to modify the experimental design to see if the
activation we find in these areas is separable.
As we have proposed in the discussion, the step-like increase in activation in the left posterior superior temporal areas could reflect the surge of θ-roles mapping on arguments, while the smoother curves
in more anterior parts of the temporal lobe would correlate with integrating syntactic and semantic
structures into larger ones.
Thus an exciting perspective would be to separate these activations by manipulating independently
arguments and tree depth; a way to achieve that separation without changing much of the design would
be to split some conditions (e.g., four word and six word conditions) between an “argument heavy”
and an “argument light” condition; for instance, contrasting “ma soeur parle à son fils le merle chante
un bel air” and “le fils de ma grande soeur le beau merle tout en noir”. If the proposed hypotheses
are true, then we would expect activation of the posterior superior temporal, but not anterior middle
areas to display an interaction of argument-type and number-of-word conditions.
A more refined design could involve comparisons between phrases where content DPs (as opposed
to pronouns) are arguments and phrases where the same DPs appear but in an A bar position, thus
receiving no θ-role, while making the underlying syntactic tree deeper, e.g.: “the dog bit my uncle”
vs. “my uncle’s dog bit them”.
If the hypothesis that the left posterior superior temporal areas subserve reinterpretation of meaning in the context of a sentence is true, then it should be less activated in the case where the internal
argument of the verb is a pronoun than in the case where it is a content DP.
On the other hand, an area recruited to integrate individual meanings into sentences and syntactic
phrases into larger phrases should activate more when the tree is deeper, that is, in the second phrase.
Thus, splitting conditions according to this kind of considerations might shed light on the role of
these various areas, and help tease apart processes involved in sentence processing at large.
5.3.4
Digging constituency further: priming by constituents vs. adjacent
strings of words
Investigating the influence of priming a sentence by some of it constituents would be another way to
identify areas involved in processing constituents, while keeping the framework of a parametric increase
in the size of constituents.
The idea would be to compare activation to target sentences (e.g. “le bas de soie qui tombe glisse
sur sa jambe sans bruit”) that would be primed by a string of words in one of four conditions:
1. constituents of the target sentence (e.g. “sans bruit sur sa jambe bas de soie”)
2. constituents of an unrelated sentence (e.g. “le soir la peau douce bout des doigts”)
3. nonconstituent strings of adjacent words of the target sentence (e.g. “soie qui tombe glisse sur
sa bas de” )
4. nonconstituent strings of adjacent words of an unrelated sentence (e.g.“lui parlent mère quand
ils il rêve de”)
Chapter 5: General discussion and further directions
44
The prediction would be that in a tree-building area, activation to sentences that were primed by
their constituents would be lower compared to sentences that were primed to adjacent strings of words.
It would be interesting also to add another condition, with primer being made up of adjacent words
that do not form a constituent, yet are the beginning of one, e.g. priming “je crois que tu as tort”
by “crois que tu” (adjacent), “je crois que” (adjacent + beginning of constituent), “tu as tort” (contituent). If an area is recruited only to perform the merging operation, then it should not show more
adaptation to the adjacent + beginning of constituent condition; conversely, if an area is recruited to
keep on-line chunks of sentences that still need to be integrated, then this area might display more
adaptation to the adjacent + beginning of constituent condition.
Concluding remarks
In this report, I have presented a review of the literature on syntax processing in the brain. In order
to have a clearer view of the results investigators have obtained when looking for syntax in the brain, I
have proposed that it is necessary to make a distinction between at least two levels of syntax, with the
more basic level being in charge of building phrases and the more sophisticated taking those phrases
and playing around with them.
From the review of the literature, it appears that left frontal language areas seem to come into play
whenever a sequence of words is presented, that does not follow some kind of canonical order stipulated
in terms of syntax, θ-roles and perhaps other hierarchies as well.
Left posterior superior temporal areas have been repeatedly reported to show increased activation
in relation to interface between semantics and syntax; that is, when the meaning of a word has to
be refined according to its appearance in a precise sentence. This includes argument processing and
θ-roles attribution.
Still in the left temporal lobe, more anterior areas that were often neglected when studying language
are beginning to attract more attention. Among those, it seems that more posterior areas show a
greater sensitivity to semantics, while more anterior areas extending to the anterior temporal pole
seems to respond primarily to sentence structure regardless of their semantic content.
I have presented results of two pilot subjects of an experiment which tried to trace down syntax in the
brain by using a parametric increase in the size of constituents that were presented. The areas that
emerged from the analyses in both subjects are situated in the left temporal lobe and could correspond
to the proposed mapping of sentence processing on left posterior superior and anterior middle temporal
areas.
Finally, I have presented further directions to explore that could help tease apart components of
sentence processing.
Bibliography
Altmann, G. & Steedman, M. (1988). Interaction with context during human sentence processing.
Cognition, 30.
Ben-Shachar, M., Hendler, T., Kahn, I., Ben-Bashat, D. & Grodzinsky, Y. (2003). The neural reality
of syntactic transformations: evidence from functional magnetic resonance imaging. Psychol Sci ,
14, 433–440.
Ben-Shachar, M., Palti, D. & Grodzinsky, Y. (2004). Neural correlates of syntactic movement: converging evidence from two fMRI experiments. Neuroimage, 21, 1320–1336.
Binder, J.R., Frost, J.A., Hammeke, T.A., Bellgowan, P.S., Rao, S.M. & Cox, R.W. (1999). Conceptual
processing during the conscious resting state. a functional mri study. J Cogn Neurosci , 11, 80–95.
Bookheimer, S. (2002). Functional MRI of language: new approaches to understanding the cortical
organization of semantic processing. Annu Rev Neurosci , 25, 151–188.
Bornkessel, I., Zysset, S., Friederici, A.D., von Cramon, D.Y. & Schlesewsky, M. (2005). Who did what
to whom? The neural basis of argument hierarchies during language comprehension. Neuroimage,
26, 221–233.
Broca, P. (1861). Remarques sur le siège de la faculté du langage articulé, suivies d’une observation
d’aphémie. Bulletin de la Société Anatomique, XXXVI, 330–357.
Caplan, D., Alpert, N. & Waters, G. (1998). Effects of syntactic structure and propositional number
on patterns of regional cerebral blood flow. J Cogn Neurosci , 10, 541–552.
Caplan, D., Alpert, N. & Waters, G. (1999). PET studies of syntactic processing with auditory sentence
presentation. Neuroimage, 9, 343–351.
Caplan, D., Alpert, N., Waters, G. & Olivieri, A. (2000). Activation of Broca’s area by syntactic
processing under conditions of concurrent articulation. Hum Brain Mapp, 9, 65–71.
Caramazza, A. & Zurif, E.B. (1976). Dissociation of algorithmic and heuristic processes in language
comprehension. Brain and Language, 3, 572–582.
Chomsky, N. (1957). Syntactic Structures. Mouton, La Haye.
Chomsky, N. (1986). Knowledge of Language Its Nature Origin and Use. Praeger, New-York.
Chomsky, N. (1995). The Minimalist Program. MIT Press.
Dapretto, M. & Bookheimer, S.Y. (1999). Form and content: dissociating syntax and semantics in
sentence comprehension. Neuron, 24, 427–432.
de Villiers, J. & de Villiers, P. (2000). Children’s Reasoning and the Mind . Hove.
Devauchelle, A.D., Oppenheim, C., Rizzi, L., Dehaene, S. & Pallier, C. (2006). An fMRI study of
syntactic and semantic adaptation. In Cognitive Neuroscience Society Annual Meeting.
45
BIBLIOGRAPHY
46
Embick, D., Marantz, A., Miyashita, Y., O’Neil, W. & Sakai, K.L. (2000). A syntactic specialization
for Broca’s area. Proc Natl Acad Sci U S A, 97, 6150–6154.
Friederici, A.D. & Kotz, S.A. (2003). The brain basis of syntactic processes: functional imaging and
lesion studies. Neuroimage, 20 Suppl 1, S8–17.
Friederici, A.D., Pfeifer, E. & Hahne, A. (1993). Event-related brain potentials during natural speech
processing: effects of semantic, morphological and syntactic violations. Brain Res Cogn Brain Res,
1, 183–192.
Friederici, A.D., Meyer, M. & von Cramon, D.Y. (2000). Auditory language comprehension: an eventrelated fMRI study on the processing of syntactic and lexical information. Brain Lang, 74, 289–300.
Friederici, A.D., Fiebach, C.J., Schlesewsky, M., Bornkessel, I.D. & von Cramon, D.Y. (2006). Processing Linguistic Complexity and Grammaticality in the Left Frontal Cortex. Cereb Cortex .
Frith, C.D. & Frith, U. (1999). Interacting minds–a biological basis. Science, 286, 1692–1695.
Frith, U. & Frith, C.D. (2003). Development and neurophysiology of mentalizing. Philos Trans R Soc
Lond B Biol Sci , 358, 459–473.
Geschwind, N. (1965). Disconnexion syndromes in animals and man. i. Brain, 88, 237–294.
Geschwind, N. (1979). Specializations of the human brain. Sci Am, 241, 180–199.
Grewe, T., Bornkessel, I., Zysset, S., Wiese, R., von Cramon, D.Y. & Schlesewsky, M. (2005). The
emergence of the unmarked: a new perspective on the language-specific function of Broca’s area.
Hum Brain Mapp, 26, 178–190.
Grodzinsky, Y. (2000). The neurology of syntax: language use without Broca’s area. Behav Brain Sci ,
23, 1–21; discussion 21–71.
Grodzinsky, Y. & Friederici, A.D. (2006). Neuroimaging of syntax and syntactic processing. Curr Opin
Neurobiol , 16, 240–246.
Haegeman, L. (1991). Introduction to government and binding theory. B. Blackwell Cambridge, Mass.,
USA.
Humphries, C., Willard, K., Buchsbaum, B. & Hickok, G. (2001). Role of anterior temporal cortex in
auditory sentence comprehension: an fMRI study. Neuroreport, 12, 1749–1752.
Humphries, C., Love, T., Swinney, D. & Hickok, G. (2005). Response of anterior temporal cortex to
syntactic and prosodic manipulations during sentence processing. Hum Brain Mapp, 26, 128–138.
Humphries, C., Binder, J.R., Medler, D.A. & Liebenthal, E. (2006). Syntactic and semantic modulation
of neural activity during auditory sentence comprehension. J Cogn Neurosci , 18, 665–679.
Josephs, O. & Henson, R.N. (1999). Event-related functional magnetic resonance imaging: modelling,
inference and optimization. Philos Trans R Soc Lond B Biol Sci , 354, 1215–1228.
Just, M.A., Carpenter, P.A., Keller, T.A., Eddy, W.F. & Thulborn, K.R. (1996). Brain activation
modulated by sentence comprehension. Science, 274, 114–116.
Kaan, E. & Swaab, T. (2002). The brain circuitry of syntactic comprehension. Trends Cogn Sci , 6,
350–356.
Kang, A.M., Constable, R.T., Gore, J.C. & Avrutin, S. (1999). An event-related fMRI study of implicit
phrase-level syntactic and semantic processing. Neuroimage, 10, 555–561.
Keller, T.A., Carpenter, P.A. & Just, M.A. (2001). The neural bases of sentence comprehension: a
fMRI examination of syntactic and lexical processing. Cereb Cortex , 11, 223–237.
47
BIBLIOGRAPHY
Kuperberg, G.R., McGuire, P.K., Bullmore, E.T., Brammer, M.J., Rabe-Hesketh, S., Wright, I.C.,
Lythgoe, D.J., Williams, S.C. & David, A.S. (2000). Common and distinct neural substrates for
pragmatic, semantic, and syntactic processing of spoken sentences: an fMRI study. J Cogn Neurosci ,
12, 321–341.
Lasnik, H. (2002). The minimalist program in syntax. Trends Cogn Sci , 6, 432–437.
Liberman, A.M., Cooper, F.S., Shankweiler, D.P. & Studdert-Kennedy, M. (1967). Perception of the
speech code. Psychological Review , 74, 431–461.
Mazoyer, B.M., Dehaene, S., Tzourio, N., Frak, V., Murayama, N., Cohen, L., Lévrier, O., Salamon, G., Syrota, A. & Mehler, J. (1993). The cortical representation of speech. Journal Cognitive
Neuroscience, 5, 467–479.
Miller, G.A., Heise, G.A. & Lichten, W. (1951). The intelligibility of speech as a function of the context
of the test materials. JEP , 41, 329–335.
Moro, A., Tettamanti, M., Perani, D., Donati, C., Cappa, S.F. & Fazio, F. (2001). Syntax and the
brain: disentangling grammar by selective anomalies. Neuroimage, 13, 110–118.
Ni, W., Constable, R.T., Mencl, W.E., Pugh, K.R., Fulbright, R.K., Shaywitz, S.E., Shaywitz, B.A.,
Gore, J.C. & Shankweiler, D. (2000). An event-related neuroimaging study distinguishing form and
content in sentence processing. J Cogn Neurosci , 12, 120–133.
Noppeney, U. & Price, C.J. (2004). An FMRI study of syntactic adaptation. J Cogn Neurosci , 16,
702–713.
Osterhout, L., Holcomb, P.J. & Swinney, D.A. (1994). Brain potentials elicited by garden-path sentences: evidence of the application of verb information during parsing. J Exp Psychol Learn Mem
Cogn, 20, 786–803.
Peterson, C.C. & Siegal, M. (1995). Deafness, conversation and theory of mind. J Child Psychol
Psychiatry, 36, 459–474.
Pinango, M.M. & Zurif, E.B. (2001). Semantic operations in aphasic comprehension: implications for
the cortical organization of language. Brain Lang, 79, 297–308.
Samson, D., Apperly, I.A., Chiavarino, C. & Humphreys, G.W. (2004). Left temporoparietal junction
is necessary for representing someone else’s belief. Nat Neurosci , 7, 499–500.
Saxe, R. (2006). Uniquely human social cognition. Curr Opin Neurobiol , 16, 235–239.
Saxe, R. & Kanwisher, N. (2003). People thinking about thinking people. the role of the temporoparietal junction in ”theory of mind”. Neuroimage, 19, 1835–1842.
Saxe, R., Carey, S. & Kanwisher, N. (2004). Understanding other minds: linking developmental psychology and functional neuroimaging. Annu Rev Psychol , 55, 87–124.
Schlosser, M.J., Aoyagi, N., Fulbright, R.K., Gore, J.C. & McCarthy, G. (1998). Functional MRI
studies of auditory comprehension. Hum Brain Mapp, 6, 1–13.
Scott, S.K. & Johnsrude, I.S. (2003). The neuroanatomical and functional organization of speech
perception. Trends Neurosci , 26, 100–107.
Shapiro, L.P. & Levine, B.A. (1990). Verb processing during sentence comprehension in aphasia. Brain
Lang, 38, 21–47.
Shapiro, L.P., Gordon, B., Hack, N. & Killackey, J. (1993). Verb-argument structure processing in
complex sentences in Broca’s and Wernicke’s aphasia. Brain Lang, 45, 423–447.
BIBLIOGRAPHY
48
Sperber, D. & Wilson, D. (1986). Relevance: Communication and Cognition. Harvard University Press,
Cambridge MA.
Stowe, L.A., Broere, C.A., Paans, A.M., Wijers, A.A., Mulder, G., Vaalburg, W. & Zwarts, F. (1998).
Localizing components of a complex task: sentence processing and working memory. Neuroreport,
9, 2995–2999.
Stowe, L.A., Paans, A.M., Wijers, A.A., Zwarts, F., Mulder, G. & Vaalburg, W. (1999). Sentence comprehension and word repetition: a positron emission tomography investigation. Psychophysiology,
36, 786–801.
Stromswold, K., Caplan, D., Alpert, N. & Rauch, S. (1996). Localization of syntactic comprehension
by positron emission tomography. Brain Lang, 52, 452–473.
Tyler, K.T. & Marslen-Wilson, W. (1977). The on-line effects of semantic context on syntactic processing. Journal of Verbal Learning and Verbal Behavior , 16, 683–692.
Vandenberghe, R., Nobre, A.C. & Price, C.J. (2002). The response of left temporal cortex to sentences.
J Cogn Neurosci , 14, 550–560.
Vasic, N., Avrutin, S. & Ruigendijk, E. (2006). Interpretation of pronouns in VP-ellipsis constructions
in Dutch Broca’s and Wernicke’s aphasia. Brain Lang, 96, 191–206.
Woolfe, T., Want, S.C. & Siegal, M. (2002). Signposts to development: theory of mind in deaf children.
Child Dev , 73, 768–778.
Abbreviations table
L (e.g. in L-IFG)
R
Anatomy
Left
Right
a (e.g. in L-aSTS)
p
Anterior
Posterior
IFG
STS
STG
MTG
TPJ
AG
fMRI
Inferior Frontal Gyrus
Superior Temporal Sulcus
Superior Temporal Gyrus
Middle Temporal Gyrus
Temporo-parietal Junction
Angular Gyrus
Functional magnetic resonance imaging
SOV
OSV
DP
NP
ASL
Linguistics
Subject Object Verb
Object Subject Verb
Determiner Phrase
Noun Phrase
American Sign Language
49