I spy with my little eye: Detection of temporal violations in event

International Journal of Psychophysiology 76 (2010) 1–8
Contents lists available at ScienceDirect
International Journal of Psychophysiology
j o u r n a l h o m e p a g e : w w w. e l s e v i e r. c o m / l o c a t e / i j p s y c h o
I spy with my little eye: Detection of temporal violations in event sequences and the
pupillary response
Susanne Raisig ⁎, Tinka Welke, Herbert Hagendorf, Elke van der Meer
Department of Psychology, Humboldt-Universität zu Berlin, Unter den Linden 6, 10099 Berlin, Germany
a r t i c l e
i n f o
Article history:
Received 25 June 2009
Received in revised form 11 January 2010
Accepted 12 January 2010
Available online 20 January 2010
Keywords:
Event knowledge
Predictions
Temporal violation
Conflict-monitoring
Pupillary response
Locus coeruleus
Anterior cingulate cortex
a b s t r a c t
Scripts of familiar activities store the temporal order of events. This enables us to generate predictions about
which event will follow another. When an event does not unfold in the chronological order, a mismatch
arises between the predictions and the external sensory input which is perceived as a conflict. The detection
of this mismatch is accomplished by a comparison mechanism (Zacks et al., 2007; Barsalou, 2009). We have
applied pupillometry to investigate the nature of this comparison process. We further tested for individual
differences in the efficiency of the mismatch detection. Participants were presented the title of an event
sequence to trigger predictions about the order in which events would unfold. Subsequently, three script
events were presented one at a time. The events either unfolded in the correct chronological order or
included temporal violations at different points within the event triplet. Violations of the temporal order had
to be detected. As soon as it was detected, the trial had to be terminated. We found that a temporal violation
elicited a large pupillary response in all individuals indicating that the comparison between the predictions
and the external sensory input was accomplished online and worked equally well for all individuals.
However, not all individuals terminated the trial after having detected the violation. Results showed that
efficient individuals who responded adequately had a greater pupillary response than inefficient individuals
suggesting that they invested more cognitive resources. The results are discussed in light of theories of
behavioral performance and conflict-monitoring.
© 2010 Elsevier B.V. All rights reserved.
1. Introduction
The ability to predict and foresee is a skill that often fails on us. The
weather forecast is probably the most prominent example of
predictions going wrong. However, every one of us is excellent at
predicting and foreseeing certain events that will occur in the near
future. We do it all day, every day. For example, when I reach out my
hand to grasp the cup standing on the table, most are able to predict
that I will move the cup to my mouth in order to drink. Crucial for
predictions is the knowledge about the temporal order in which
events unfold over time. Like in the example above, I can foretell
someone's actions based on my event-based knowledge that after
lifting a cup, it is usually brought to one's mouth. Similarly, predictions
are formed about events in complex event sequences. For example,
when I enter a restaurant I can predict that a waiter will show me to a
table: next, he will give me a menu, then he will ask me what I would
like to eat, and so forth. This knowledge is stored in scripts in
long-term memory (Schank and Abelson, 1977). The script structure
has been compared to an event chain that preserves the correct,
chronological temporal order of events, which enables accurate predic-
⁎ Corresponding author. Tel.: + 49 30 2093 9382; fax: + 49 30 2093 9361.
E-mail address: [email protected] (S. Raisig).
0167-8760/$ – see front matter © 2010 Elsevier B.V. All rights reserved.
doi:10.1016/j.ijpsycho.2010.01.006
tions about future events through priming. In the restaurant script, the
event order food primes the following event eat food, which in turn primes
the event pay the bill. In this way, all component events of the sequence
can be retrieved and future-oriented predictions are possible. The study
presented here focuses on these script-based predictions and investigates
cases when predictions are not fulfilled. This happens, for instance, when
an event does not unfold in the correct, chronological order but rather
violates the temporal order. Naturally, such an event will be unpredicted
and unexpected. Consider the following example: in a restaurant, I sit
down at a table and expect to be handed a menu by the waiter. After
having read the menu, I predict that the waiter will ask for my order,
which I will receive after a certain amount of time. If, however, I only sit
down at a table after having received my food, my predictions clearly do
not match the event I am currently encountering.
It is crucial that violations of the temporal order are detected in
order to adjust behavior or re-analyze the situation (in the above case,
for example, one would come to the conclusion after a re-analysis that
one is in a fast-food restaurant where the event sit down at table takes
place after the event receive food). Therefore, it is necessary to
monitor whether anything unexpected and unpredicted has occurred.
An ongoing event must be continuously compared to the predictions
to identify mismatches between them. Zacks et al. (2007) suggest that
upon perceiving an event, a mental representation is formed in
working memory that depicts what is currently going on. They call
2
S. Raisig et al. / International Journal of Psychophysiology 76 (2010) 1–8
this representation an event model. Apart from the sensory input,
predictions are also integrated top-down into the event model from
script structures in long-term memory. When the sensory input (i.e.,
the unfolding event) does not match the predictions within the event
model, an “error detection mechanism” becomes active. According to
Barsalou (1999), who adopts a somewhat different approach, perceptual and motor aspects of an experience are stored as perceptual
symbols in modality-specific areas of the brain. Upon knowledge
retrieval from memory, the necessary aspects of the experience are
simulated or re-enacted which is assumed to take place in working
memory. These simulations produce predictions that are then
compared to the sensory input when events occur (Barsalou, 2009).
Thereby, it is assessed whether events unfold as predicted or not.
Despite the differences, both accounts assume that the comparison
between the external sensory input and the predictions that have
been derived from the internal mental representation is based on an
online comparison process that constantly monitors for mismatches
between predictions and what is actually taking place.
Unpredicted and unexpected events as well as mismatches have been
shown to lead to specific psychophysiological changes, for example,
changes in the pupillary response. In an auditory oddball task where a
target tone had to be detected among distracters, Steinhauer and Zubin
(1982) found an inverse relationship between stimulus probability and
pupil size: the largest pupillary response was evoked by the rare,
unpredictable stimulus, while the highly predictable, frequent stimulus
evoked the least change in pupil size. Analogous results were found by
Gilzenrat, Nieuwenhuis, Jepma, and Cohen (in press). Correctly detected
target tones that occurred only in 20% of the cases elicited significantly
larger pupil dilations than correctly ignored distracter tones that were
highly frequent. Mismatches lead to similar effects in the pupillary
response as unpredicted events: in the Stroop Color–Word task, a
response conflict emerges in incongruent trials because of the mismatch
between the color word and the ink color. Apart from increasing response
times, high conflict trials also elicit greater changes in pupil size than nonconflict, congruent trials (Brown et al., 1999; Siegle et al., 2008) because
the former require additional cognitive resources for detecting, processing and resolving the conflict. This interpretation is consistent with the
assumption that the pupil accurately and reliably reflects cognitive
resource consumption. Research has demonstrated repeatedly that pupil
dilation is positively correlated with task difficulty, that is, it increases
when more cognitive resources are required (e.g., Hess and Polt, 1964;
Beatty, 1982; Kahneman, 1973; Just et al., 2003). The pupillary response
has the advantage that it is an online measure depicting the cognitive
resource consumption during the complete processing stage prior to
response. Therefore, it is very well suited to index the moment-tomoment changes in resource consumption.
The results described above provide evidence that unpredicted
events, mismatches and the conflicts they impose require additional
cognitive resources which can be measured with the pupillary
response. The present study applies this measure to unpredicted
events in event sequences in order to investigate the “error detection
mechanism” that compares the external sensory input and the
predictions that have been derived from the internal mental
representation (cf., Zacks et al., 2007; Barsalou, 2009). In a previous
study, we presented the title of an event sequence followed by three
component events that were presented simultaneously on the
computer screen. Participants had to detect errors in the temporal
order and had to decide whether the temporal order of the triplet was
correct or not (Raisig et al., 2007). Results suggested that each event
was indeed compared to the underlying mental representation.
However, due to the mode of presentation we could not specify the
exact nature of this comparison process. In order to investigate the
comparison process in the present study, the title of the event
sequence (e.g., going to a restaurant) was presented first to activate
the component events in long-term memory as well as their temporal
order and to produce predictions about which event will succeed
another. Subsequently, the three component events were presented
sequentially. This mimicked a continuous flow of events and each
event could be compared to the predictions that had been formed. In
an error detection task participants were asked whether the order of a
presented event triplet was correct or whether there was an error in
the temporal order (i.e., a temporal violation). For example, the three
events (A) enter the restaurant–(B) sit down at table–(C) read the
menu constituted the event sequence going to a restaurant. The
pupillary response was recorded continuously during processing to
depict the changes in cognitive resource consumption that were
associated with the event that was presented. In condition correct/
correct, events unfolded in the chronological order as stored in
long-term memory and therefore predictions were satisfied (see
Table 1). However, when events did not unfold in the chronological
order but rather violated the temporal order, a mismatch occurred.
These temporal violations could occur at different points of the event
triplet (see Table 1).
We assume that the detection of the temporal violation evokes a
significant increase in the pupillary response at the moment it occurs.
More specifically, in conditions false/correct and false/false, where the
temporal violation occurs upon presentation of the second event, the
task-evoked pupillary response should be greater than in conditions
correct/correct and correct/false, where there is no temporal violation
at this point. Upon presentation of the third event, the pupillary
response should be greater in condition correct/false than in condition
correct/correct because of the temporal violation that occurs.
Another important difference to our previous study (Raisig et al., 2007)
is that participants were encouraged to abbreviate a trial as soon as
possible, that is, as soon as they detected a temporal violation. Hence,
conditions false/correct and false/false should be abbreviated after
presentation of the second event. However, because we were using
event triplets we were able to compare cases where the temporal
violation was detected successfully with cases where the task was
continued despite a temporal violation. Specifically, if detection of the
temporal violation was accurate, conditions false/correct and false/false
should be terminated after the second event by an overt behavioral
response. We call these cases false/correct (abbreviated) and false/false
(abbreviated). Because a behavioral response terminated a trial, the
abbreviated conditions will naturally have shorter reaction times. We
assume that performance in the error detection task will be subject to
individual differences. That is, there will be individuals who show good
error detection in that they will efficiently abbreviate most, if not all, of the
possible trials. And there will be bad error detectors who will not
abbreviate the conditions false/correct and false/false. There are two
possible explanations to account for bad error detection: either the
temporal violation is simply not detected due to an inadequate
comparison process. Or the temporal violation is detected but participants
fail to inhibit further processing. With the pupillary response we
attempted to adjudicate between these possibilities. In the former case,
Table 1
Conditions and their temporal violations.
Condition
Triplet
Temporal violations
Correct/correct
A–B–C
Correct/false
A–C–B or
B–C–A
False/correct
B–A–C or
C–A–B
False/false
C–B–A
No temporal violation: the events unfold in
the correct temporal order.
Late temporal violation: the second event fulfills
the predictions; the third event violates
the temporal order and leads to a mismatch
between the predictions and the sensory input.
Early temporal violation: the second event violates
the temporal order and leads to a mismatch
between the predictions and the sensory input; the
third event is chronologically related to the second.
Early temporal violation: the second event violates
the temporal order and leads to a mismatch
between the predictions and the sensory input;
the third event again violates the temporal order.
S. Raisig et al. / International Journal of Psychophysiology 76 (2010) 1–8
the temporal violation should not evoke a pupillary response. If, however,
the second event elicits a large pupillary response but the task is
nevertheless continued, the latter explanation would be supported.
No matter which explanation holds, however, results from the
literature suggest that the individual differences in our task may arise
due to the amount of cognitive resources that are invested into the task.
Van der Meer, Beyer, Horn, Foth, Bornemann, Ries et al. (2010)
compared highly intelligent students (IQ > 120) with students of
average intelligence. The highly intelligent students showed better
task performance accompanied by greater pupil dilation. High-IQ
individuals seemed to dedicate more cognitive resources to the
processing of a high-level cognitive task than their peers as seen in a
greater pupillary response. This result suggests that the pupillary
response can discriminate between individuals' resource consumption.
Therefore, we applied pupillometry to investigate individual differences
in the detection of temporal violations in event sequences. If individual
differences are accounted for by the amount of cognitive resources that
are dedicated to the task, we expect that the pupillary response will be
greater in individuals who show good task performance than in
individuals how do not respond efficiently to the temporal violation.
2. Method
2.1. Participants
Thirty psychology students of Humboldt-Universität zu Berlin
participated in the experiment. They received either course credit or
€10 for participation. All were German native speakers. Seven
participants had to be excluded from the analysis of the pupil data
due to extensive blinking artifacts or other technical problems so that
23 participants remained in the analyses (12 female; mean age: 24.9,
SD = 6.4).
2.2. Stimuli and design
Event triplets consisted of three component events from event
sequences that had been produced and standardized in a script
generation task (Raisig et al., 2009). The events were arranged to
produce four conditions with temporal violations at different points
within the triplet. Table 1 shows the possible arrangements. In
condition correct/correct, events were always arranged in the form of
A–B–C. In condition correct/false events can be arranged either A–C–B
or B–C–A. In condition false/correct there are also two possibilities,
namely, B–A–C or C–A–B. Condition false/false again has only one
possible arrangement, which is C–B–A.
The main variable that we manipulated in a repeated measures
design was the temporal order of events. Fifty percent of the triplets
adhered to the correct temporal order so that the second and third event
fulfilled the script-based predictions when they were presented. Of the
chronologically triplets, 30 were targets while 60 triplets were filler
items. The filler items did not come from the pool of standardized events
sequences. Fillers were designed in a fashion similar to that of correct
triplet except that completely different event sequences were used.
Because fillers were not derived from the standardized pool of events,
they were not included in any analyses. The remaining 50% of the
triplets had an incorrect temporal order. Of these, 30 included one late
temporal violation (condition correct/false), 30 included one early
temporal violation (condition false/correct) and 30 were in the inverse
temporal order, that is, they included two temporal violations (condition
false/false). Reaction times (in milliseconds) and error rates were
recorded as indices of response latency and accuracy, and the pupillary
response (in millimeter deviations from a baseline) was continuously
recorded over the whole processing period as an index of cognitive
resource consumption.
3
2.3. Apparatus
The pupil size of the right eye was recorded with the iView X
Hi-speed system (SensoMotoric Instruments) with a sampling rate of
240 Hz and accuracy above .03 mm. The participant's head was placed
onto a chin rest above which the camera was positioned so that the eye
could be tracked continuously during the experiment. The experiment
was programmed using Presentation for Windows Version 12.1, which
was compatible with the iView X system. Presentation sent triggers
marking the on- and offsets of stimuli as well as participants' response.
Participants responded with the left and right control key on a standard
computer keyboard. Answer-response key assignment was randomized
between participants. Illumination in the lab was kept constant for all
testing sessions (luminance ≈ 300 lx (lux)).
2.4. Procedure
Participants were seated at the table and the camera was prepared
for recording. On the computer screen instructions appeared
explaining the task and asking for quick but accurate response.
Participants were explicitly encouraged to respond as soon as a
decision concerning the correctness of the event triplet could be made
by pressing either the right or left control key (assignment of yes and
no response to the left or right control key was varied between
participants). Practice trials followed in which participants were
informed whether their response was correct or not. After the practice
trials the experimental block began. Each trial started with a fixation
cross in the middle of the screen lasting for 2000 ms. Afterwards the
name of an event sequence appeared for 1000 ms. When it
disappeared, the first event appeared on left of the centre of the
screen. After 1500 ms, the second event appeared in the middle of the
screen next to the first one. If a temporal violation occurred at this
point a decision was possible and participants were instructed to
terminate the task by pressing the respective control key. If no
temporal violation occurred at this point and/or no response was
given within a 1500 ms time window, the trial continued and the
third event was presented on the right to the centre of the screen. The
third event remained on the screen until a response had been made.
After response (whether after the second event or third event) a mask
consisting of Xs replaced the areas on the screen where an event had
been. The mask was presented for 2000 ms and was used as a
refraction period for the pupil. Reaction times and pupil size were
recorded from the presentation of the name of the event sequence
onwards.
2.5. Data analysis
Prior to data analysis conditions were recoded. Conditions false/
correct and false/false were recoded as false/correct (abbr.) and false/
false (abbr.), respectively, when the trial was abbreviated within the
1500 ms time window after presentation of the second event. When a
response was given only after the third event, the conditions kept their
original label of false/correct and false/false, respectively. It could
happen that the third event was presented just at the moment when the
participant was about to press the response key. Such cases were
identified and all reactions to conditions false/correct and false/false
within a 300 ms time window after appearance of the third event were
also coded as false/correct (abbr.) and false/false (abbr.), respectively.
We chose this time window because we consider this time window to
be too short to read the event and embed it into a situational context
(cf., Barsalou et al., 2008). Because of this re-coding we ended up with
6 conditions (without fillers).
Erroneous responses were excluded from all the analyses of reaction
times and pupil data. Reaction times that were greater than +2 SD or
less than −2 SD from the mean were identified and replaced by the
mean reaction time ±2 SD for each condition and participant. The pupil
4
S. Raisig et al. / International Journal of Psychophysiology 76 (2010) 1–8
curve was smoothed using a computer algorithm developed in-house
(box filter-technique). Missing data in the curve due to blinks (sudden
large changes in pupil diameter) were corrected by interpolation. All
data correction was done with Matlab Version 7.4. 13% of trials
(including the filler condition) had to be discarded altogether due to
large artifacts that could not be interpolated. The pupil trace was
averaged stimulus-locked to the presentation of the name of the event
sequence. The pupil size within different time windows was determined
for each participant and compared between conditions to index the
amount of cognitive resources at different processing stages. The time
windows will be specified in the Results section. The pupil size was
expressed as a millimeter deviation from the baseline that was calculated as the mean pupil size during a 200 ms time window prior to
presentation of the activity name (baseline correction).
Reaction times, error rates and pupil size of conditions correct/
correct, correct/false, and the non-abbreviated conditions false/
correct and false/false were submitted to an analysis of variance
(ANOVA). To identify differences between conditions we calculated
repeated contrasts where each condition was compared to the
previous condition. Effect sizes of focused comparisons are indicated
as r-values. The abbreviated conditions false/correct (abbr.) and false/
false (abbr.) were compared separately with a paired t-test. Both data
sets were normally distributed. The Greenhouse–Geisser correction
was applied to all repeated measures with more than one degree of
freedom. Significance level was set at p < .05. Effect sizes are indicated
as r-values for each repeated contrast.
Participants were grouped as efficient or inefficient responders,
respectively, according to their performance in the detection task.
Participants who in conditions false/correct and false/false majorly
responded after the second event (i.e., in more than 70% of the cases)
were classified as efficient responders (n = 7). Participants who majorly
read all three events were classified as inefficient responders (n = 7).
Only those participants were grouped who could be classified clearly as
either one responder type or another. To test whether responders
differed regarding the amount of cognitive resources they dedicated to
the task, the pupil sizes in each condition were compared. Further, to
test whether efficient responders were not only more efficient but also
faster and more accurate in all conditions, reaction times and error rates
were compared between responder groups.
3. Results
3.1. Detection of temporal violations
Table 2 shows the mean reaction times, error rates and pupil sizes
for each condition as well as the total number of curves that were used
to compute the mean pupil size.
3.1.1. Reaction times
There was a main effect of condition (F(3, 66) = 11.13, p < .001). The
longest reaction time occurred in condition correct/false (see Table 2).
Repeated contrasts revealed that it was significantly greater than the
reaction time in condition correct/correct (F(1, 22) = 70.35, p < .01,
r = .9) and the non-abbreviated condition false/correct (F(1, 22) = 25.5,
Table 2
Mean reaction times (in ms), error rates (in %), pupil sizes (in mm-deviation from
baseline) and number of curves in each condition.
Condition
RT
Errors
Pupil
Curves
Correct/correct
Correct/false
False/correct
False/false
False/correct (abbr.)
False/false (abbr.)
5282
5627
5246
5377
3795
3902
5.8
14.4
15.4
4.3
1.5
1.1
.26
.33
.31
.30
.36
.37
610
548
278
370
316
264
p < .01, r = .7). In addition, the difference between the non-abbreviated
conditions false/correct and false/false was significant (F(1, 22) = 4.63,
p < .05, r = .4).
Naturally reaction times were shorter for the abbreviated conditions
false/correct (abbr.) and false/false (abbr.) than for the non-abbreviated
conditions. Although these conditions were equal (i.e., both required a
pairwise comparison of temporally inverse events) the difference
between these conditions reached significance (t(22) = −2.1, p = .049).
3.1.2. Error rates
Table 2 shows the mean error rates in percent. There was a main
effect of condition (F(3, 66) = 9.03, p < .001). Most errors were made
in the condition correct/false and the non-abbreviated condition false/
correct. Repeated contrasts showed that condition correct/false
differed significantly from condition correct/correct (F(1, 22) = 19.5,
p < .001, r = .7) and the non-abbreviated condition false/correct
differed significantly from the non-abbreviated condition false/false
(F(1, 22) = 16.5, p < .01, r = .7). No significant difference was found
between the abbreviated conditions false/correct (abbr.) and false/
false (abbr.) (t(22) = .58).
3.1.3. Pupillary response
Fig. 1 shows the pupil trace over time beginning at the fixation
cross (time point − 1.00). It must be noted beforehand that the pupil
is a rather slow reacting parameter. Data by Beatty (1982) or Zimmer
(1984) have shown that the pupil only starts to show a task-evoked
response 300–500 ms after the stimulus.
Generally, the shape of the pupil curve reflected the course of the
trial. At time point zero when the title of the event sequence was
presented (vertical dashed line) the pupil size decreased reaching its
minimum size after 2 s. After presentation of the first event (first vertical
grey line) the pupil steadily began to increase in all conditions. The slope
was steepest for the conditions false/correct (abbr.) and false/false
(abbr.). After about 3 s the pupil reached a plateau that was highest for
the abbreviated conditions false/correct (abbr.) and false/false (abbr.).
Conditions correct/correct and correct/false reached the lowest
plateaus. After this plateau the pupil resumed to increase in response
to the presentation of the second event (at 2.5 s, vertical grey line). To
this point, the task consisted merely of an event pair that was either
chronologically related (no temporal violation) or not (temporal
violation). In conditions false/correct (abbr.) and false/false (abbr.)
that were terminated at this point, the pupil reached its final peak at
around 4 s ms. In the other conditions that had not been terminated at
this point, the pupil curve reached a second plateau (at about 4.8 s) that
was highest for the non-abbreviated conditions false/correct and false/
false. After this plateau the pupil began to increase again in response to
the presentation of the third event (at 4 s, vertical grey line). It reached
its final peak at about 5.5 s.
To test the assumption that an early temporal violation that occurred
upon presentation of the second event elicited an increased pupillary
response, we determined the pupil sizes within the time window 3.2
and 4.8 s (see Fig. 1 and Table 3). This time window was chosen on
grounds of the shape of the pupil curve: the presentation of the second
event elicited a pupillary response at about 3.2 s that lasted until the
second plateau at about 4.8 s was reached. Within this time window, we
considered only conditions that had not been abbreviated (i.e., conditions correct/correct, correct/false, and the non-abbreviated conditions false/correct, and false/false) because the pupillary response is not
superimposed by decision or motor responses in these conditions. We
collapsed the pupil sizes of conditions correct/correct and correct/false
since these conditions are equal at this moment within the trial (i.e.,
they include no temporal violation at this point). The same was done for
the non-abbreviated conditions false/correct and false/false that both
include a temporal violation at this point. The mean pupil sizes were
then submitted to a paired t-test. We found that the mean pupil size
differed significantly between conditions with a temporal violation and
S. Raisig et al. / International Journal of Psychophysiology 76 (2010) 1–8
5
Fig. 1. Mean pupillary response (in mm from baseline) for each condition. Time 0 indicates the presentation of the event sequence (dashed vertical line). Each grey vertical line
indicates presentation of the events.
conditions without a temporal violation (t(22) = −3.23, p < .001)
whereby the temporal violation evoked a significantly greater pupillary
response (Table 3). Within this time window, the pupil reached its final
peak in the abbreviated conditions false/correct (abbr.) and false/false
(abbr.) (see Fig. 1). Peak dilations between these conditions were not
significantly different (t(22)= −.19, p = .85).
When the third event was presented, the pupil increased further
until it reached its peak. The peak dilations of the conditions correct/
correct, correct/false, and the non-abbreviated conditions false/correct,
and false/false were compared. The main effect of condition failed to
reach significance (F(3, 66) = 2.2, p = .09). Repeated contrasts were
nevertheless performed. They showed that the pupil size of condition
correct/false was significantly greater than the pupil size of condition
correct/correct (F(1, 22) = 10.21, p < .01, r = .56). There was no
significant difference between the non-abbreviated conditions false/
correct and false/false regarding pupil diameter although a difference
had been detected in reaction times.
3.2.1. Reaction times
Efficient responders were always faster in conditions correct/
correct, correct/false and the non-abbreviated conditions false/
correct, and false/false. The main effect of responder group reached
significance (F(1, 12) = 5.99, p < .05). Like in the overall analysis the
condition correct/false produced the longest reaction times (see Table 4).
The condition by responder interaction effect was not significant
(F = 1.05). When comparing reaction times of the abbreviated conditions, we also found a significant difference between the responder
groups (F(1, 12) = 11.3, p < .05) but no difference between conditions
(F = 3.18).
3.2.2. Error rates
Efficient responders made more errors in trials when all three
events were read (conditions correct/correct, correct/false and the
non-abbreviated conditions false/correct, and false/false; see Table 4).
This difference reached significance (F(1, 12) = 7.7, p < .05). The condition by responder interaction effect was not significant (F = 1.24).
3.2. Individual differences
Table 4 shows the mean reaction times, error rates and pupil sizes
for each responder group.
Table 3
Pupil size (in mm-deviation from baseline) at plateau (3.2–4.8 s).
Conditions Temporal relation
of the event pair
1 and 2
3 and 4
Pupil size at plateau
All
Efficient
(n = 23) responders
(n = 7)
No temporal
.16
violation
Temporal violation .24
Table 4
Mean reaction times (in ms), error rates (in %) and peak pupil sizes (in mm-deviation
from baseline) between groups.
Condition
Responder
RT
Errors
Pupil
Correct/correct
Inefficient
Efficient
Inefficient
Efficient
Inefficient
Efficient
Inefficient
Efficient
Inefficient
Efficient
Inefficient
Efficient
5557
5003
6009
5286
5599
4813
5675
5059
3998
3686
4216
3688
3
9
10
24
13
19
6
5
2
1
1
1
.14
.28
.23
.39
.18
.45
.23
.42
.34
.39
.31
.41
Correct/false
False/correct
Inefficient
responders
(n = 7)
.19
.06
.36
.15
False/false
False/correct (abbr.)
False/false (abbr.)
6
S. Raisig et al. / International Journal of Psychophysiology 76 (2010) 1–8
When comparing the abbreviated conditions, we found no difference
between the responder groups (F = 1.6) and no interaction effect
(F = 1.1).
3.2.3. Pupillary response
We compared the mean pupil sizes within the time window 3.2 to
4.8 s between responder groups. As in the overall analysis, we collapsed the mean pupil sizes of conditions correct/correct and correct/
false and of the non-abbreviated false/correct and false/false,
respectively. A temporal violation evoked a greater increase in pupil
size (cf. Table 3). Further, the mean pupil size in this time window
differed between groups whereby efficient responders had a significantly greater pupil size than inefficient responders (F(1, 12) = 8.1,
p < .05). Within this time window, the pupil of the abbreviated
conditions reached its final peak (see Fig. 1). There was no significant
difference in the peak value between the two abbreviated conditions.
There was also no significant differences between the responder
groups (F = 1.24).
When the third event was presented, the pupil increased further
until it reached its peak. The peak dilations of the conditions correct/
correct, correct/false, and the non-abbreviated conditions false/correct
and false/false were compared. There was a main effect of responder
group (F(3, 36) = 7.40, p < .05) whereby the efficient responders always
reached a significantly greater pupil size than the inefficient responders
(see Table 4).
4. Discussion
The current experiment addressed questions concerning the
detection of temporal violations in event sequences that occur when a
non-chronological event is presented. Such an event does not match the
predictions that have been derived from the internal representation
(i.e., script) of the event sequence in long-term memory. We
investigated the comparison process that is thought to monitor for
such mismatches (Zacks et al., 2007; Barsalou, 2009) with regards to the
efficiency of error detection and individual differences. We found that
an event that did not match the predictions elicited a greater pupillary
response than an event that matched the predictions. Importantly, we
could show that the performance in the error detection task was subject
to individual differences. Participants who were good error detectors
and showed efficient task performance also had a greater pupillary
response. In the following we will first briefly discuss the reaction time
data. Subsequently, we will turn to the pupil data discussing how the
data contribute to our knowledge of how temporal violations in event
sequences are detected. Finally, we will venture to relate our findings to
current theories of performance and conflict-monitoring.
4.1. Reaction times
We could replicate the pattern of the reaction time data from our
previous study (Raisig et al., 2007) where the condition correct/false
also produced the longest reaction times. Although the condition
correct/correct also required reading and processing all three events it
has a processing benefit because the correct temporal order of the
events is adhered to and consequently predictions are fulfilled. Hence,
the temporal violation introduced by the third event in condition
correct/false does not satisfy the predictions and leads to the significant
increase in reaction times. The non-abbreviated conditions false/correct
and false/false show similar reaction times to condition correct/correct.
This had also been the case in our former study. We will return to this
point further down in the discussion.
One strikingly different result from our 2007 study was the meaningful difference in reaction times between the non-abbreviated conditions false/correct and false/false. Unexpectedly, we also found a
significant difference between the abbreviated conditions false/correct
(abbr.) and false/false (abbr.) in the current study. This result can be
explained by the distance effect that reaction times decrease when the
temporal distance between event increases (e.g., Nottenburg and
Shoben, 1980). Because the event pairs in the abbreviated condition
false/false (abbr.) always took the form of C–B it had a smaller temporal
distance than the event pairs in the condition false/correct (abbr.) that
had a great temporal distance (20 event pairs took the form C–A). The
greater temporal distance was confirmed by ratings of temporal distance that we obtained post-hoc from new participants. The distance
effect might also account for the difference in the non-abbreviated
conditions.
4.2. Changes in pupillary response due to temporal violations
We will first consider the conditions that were not abbreviated (i.e.,
correct/correct, correct/false, and the non-abbreviated conditions false/
correct and false/false): the waveform of the pupil showed that each event
elicited a change in pupil size regardless of whether the event introduced a
temporal violation or not. This is to be expected, however, because it
reflects the processing of incoming sensory information. Although an
increase in pupil size was apparent in all conditions, the extent of the
increase differed between conditions. In the non-abbreviated conditions
false/correct and false/false the pupil increased substantially upon
presentation of the second event due to the temporal violation it inflicted
as compared to the conditions correct/correct and correct/false where a
chronological event was presented and therefore predictions were
fulfilled at this point. As the trial continued and the third event was
presented, condition correct/false led to the greatest increase in pupil size
due to the temporal violation in this condition. The pupil sizes of the
non-abbreviated conditions false/correct and false/false were not significantly different from the chronological condition correct/correct. These
results allow some conclusions regarding the nature of the comparison
process. Because the pupil measures cognitive resource consumption
continuously throughout processing, it depicts online the moment-tomoment changes in cognitive effort that is related to the detection of
temporal violations. Taking into account that the pupillary response
slightly lags behind the presentation of stimuli (cf. Zimmer, 1984; Beatty,
1982), the pattern of the pupil waveform suggests that temporal
violations and the mismatches they constitute were detected the moment
they occurred. It can be concluded, therefore, that the representation of
the sensory input that depicts the actual state is rapidly and continuously
compared to the predicted state in working memory in order to react to
the mismatch — in our case by terminating the task, in more natural cases
by re-analyzing the situation or adapting one's behavior. Therefore, the
comparison process enables a rapid adaptation to guide ongoing
processing and optimize behavior (cf. Zacks et al., 2007).
The question that arises is how good the online comparison
process actually is? We found that many participants did not show a
behavioral reaction to the temporal violations and the mismatches
arising from them. Rather, they showed inefficient response behavior
and did not abbreviate the conditions false/correct and false/false
although this would have been possible. Other participants, however,
were efficient and abbreviated most trials of conditions false/correct
and false/false. Are the individual differences in the behavioral
performance due to the quality of the comparison process? The
pattern of the pupil curves and the results obtained from the first time
window (3.2–4.8 s) do not speak for such a conclusion. Instead results
suggest that the temporal violation and the mismatch it inflicted were
indeed detected by all individuals alike but only few individuals were
efficient in giving an overt response. Two findings are in favor of this
conclusion: first, in the overall as well as the group analysis the pupil
always increased in response to the temporal violation that occurred
upon presentation of the second event. This indicates that at least the
autonomic pupillary response to the temporal violation was adequate.
Second, in the non-abbreviated conditions false/correct and false/false
the pupil increased only minimally upon presentation of the third
event suggesting that the new sensory input was only processed
S. Raisig et al. / International Journal of Psychophysiology 76 (2010) 1–8
superficially, that is, without processing its temporal properties. The
finding that reaction times were similar between the correct
condition correct/correct and the non-abbreviated conditions false/
correct and false/false also speaks for superficial processing of the
third event in the non-abbreviated conditions. Therefore, it can be
concluded that inefficient responders failed to perform adequately
despite the detection of a temporal violation.
This conclusion leads to the question why some participants failed to
respond adequately? We assumed that individual differences in the error
detection task were accounted for by the amount of cognitive resources
that were dedicated to the task. In line with our assumption we found that
efficient participants had larger pupil sizes in the first time window and
reached greater peak dilations than inefficient responders. Because the
pupillary response is directly related to the amount of cognitive resource
consumption (cf., Just et al., 2003; van der Meer et al., 2010) we can
conclude that efficient participants indeed put more effort into the task
than inefficient responders. This can even be observed on item-level: in
the overall analysis including all participants, abbreviated trials can be
considered as reflecting efficient trials. When looking at the pupil
waveforms of these abbreviated trials, we find that they show a steeper
slope, that is a greater pupillary response, from presentation of the first
event onwards. This suggests that more cognitive effort was put into these
trials by all participants. Of course other factors may also influence how
well and efficient errors are detected and responded to. Our results,
however, show that efficiency in error detection is directly related to the
amount of cognitive resources that were dedicated to a task.
We have argued so far that efficient responders dedicated more
cognitive resources into the task. However, they nevertheless made more
errors than inefficient responders in the non-abbreviated conditions
correct/correct/, correct/false, false/correct, and false/false. When looking
at the distribution of errors among the conditions, we find that errors
occurred particularly in conditions correct/false and false/correct. These
conditions are known to produce increased error rates (Raisig et al., 2007).
In condition correct/false, for example, the temporal order of the first and
the second event is evaluated and no temporal violation is detected. This
increases the response tendency of accepting the item as being correct.
The third event that is presented introduces a temporal violation which is
perceived as mismatch or conflict, respectively. Apart from the conflict
between the representational level (i.e., prediction) and the perceptual
level (i.e., sensory input), an additional conflict occurs on the response
level: the initial response tendency of accepting the item must be
overridden and changed into a rejecting response resulting in longer
reaction times and more errors. It seems that efficient individuals who
were generally good at detecting and reacting to a conflict between the
representational and the perceptual levels were strongly affected by the
conflict on the response level. The same explanation might apply to the
non-abbreviated condition false/correct. However, only few items fell into
that condition in the efficient group because they abbreviated the majority
of items. Hence, the interpretation that efficient responders are especially
affected by the conflict on the response level would have to be tested in
order to gain stronger support for this claim.
4.3. Integration of the results into theories of performance and conflictmonitoring
The efficient responders in our task can be described as being highly
task-engaged: they dedicate many cognitive resources into the task as
seen in the strong task-evoked pupillary response, which ameliorates
their performance. Inefficient responders, on the contrary, show less task
engagement and a lesser task-evoked pupillary response. So far, our
results clearly link performance and task engagement to the task-evoked
pupillary response. Recent findings suggest that the task-evoked pupillary
response reflects the phasic activity of the locus coeruleus (LC), an area in
the brainstem that releases the neurotransmitter norepinephrine (NE)
(Gilzenrat et al., in press). Crucially, the LC–NE system has also been
related to behavioral performance. The framework of Aston-Jones and
7
Cohen (2005) posits that accurate performance and high task engagement
is accompanied by bursts of activation of the LC neurons in response to
motivationally significant stimuli (i.e., phasic mode). Although it is beyond
the scope of this paper to scrutinize the relationship between behavior
and LC activity, we want to relate our results to this framework because it
has been suggested that the LC may be critically involved in the
comparison process that is considered to underlie the detection of errors
in event sequences (Zacks et al., 2007). The close relation between the
behavioral and the psychophysiological levels that we have observed (i.e.,
efficient responding accompanied by increased pupillary response) seems
compatible with the assumption that task engagement and accuracy of
performance are related to LC activity. That is, because the pupil indexes
LC activity (Gilzenrat et al., in press) performance in our task can also be
related to LC phasic activity.
As outlined above, the task-evoked pupillary response is considered an
index of LC phasic activity. However, in the literature the pupillary
response is mostly related to the cognitive workload and the cognitive
resources that are consumed during a task (e.g., Just et al., 2003). We
found that a temporal violation elicited a large pupillary response. We
interpreted that the temporal violation was perceived as a mismatch or
conflict which increased the need for cognitive resources similar to the
response conflict in the Stroop task (e.g., Siegle et al., 2008). A brain area
repeatedly associated with conflict-monitoring and error detection is the
anterior cingulate cortex (ACC) (cf., Botvinick et al., 2001). It has been
suggested that the ACC becomes active when tasks are difficult. Conflicts
make a task cognitively demanding thereby indexing the demand for
cognitive resources and mental effort (Rushworth et al., 2004; Walton et
al., 2003). Hence, the ACC may be “importantly involved in linking mental
effort to the autonomic changes that typically accompany it” (Botvinick et
al., 2004, p. 545).
Considering our results in the light of the framework by Aston-Jones
and Cohen (2005) as well as ACC functioning (Botvinick et al., 2004) our
work is in line with work emerging in this field that assumes a connection
between ACC, LC and pupillometry. It has been speculated, for example,
that the LC system responds to ACC conflict detection (Cohen et al., 2000)
and neuroanatomic evidence suggests that the ACC has projections to the
LC (Rajkowski et al., 2000) where it is thought to influence LC activity
(Aston-Jones and Cohen, 2005). If indeed ACC projects signals to the LC the
strength of the activation of the ACC might influence the strength of the LC
phasic response and consequently directly determine behavioral performance. An imaging study by Hester, Fassbender, and Garavan (2004)
demonstrated that the degree of ACC activation was directly related to
performance in a conflict-monitoring task (Go/No-Go task). Inattentive
individuals (as measured by self-reported absentmindedness) displayed
reduced ACC activation and performed worse on the conflict-monitoring
task than individuals who described themselves as attentive and showed
higher ACC activation. It remains to be investigated whether high ACC
activation in attentive individuals is also accompanied by LC phasic
activity and a large pupillary response, respectively. This would lend some
support to the presumed ACC–LC–pupillometry connection and would
allow us to conclude that efficient individuals in our task had a strong LC
phasic activation due to strong ACC activation. However, the work in this
field is still in its infancy and therefore this interpretation of our results
remains speculative.
What we can conclude from our study without speculation is that
pupillometry is a useful measure in error detection and conflictmonitoring tasks because it depicts the moment-to-moment resource
consumption and therefore gives insight into the time course of the
error detection. More importantly, it allowed us to identify the source of
the individual differences that apparently were due to the amount of
cognitive resources that were dedicated to the task at hand.
Acknowledgements
We would like thank Roman Purkhart for laboratory assistance and
helpful discussion. This research was supported by the German Science
8
S. Raisig et al. / International Journal of Psychophysiology 76 (2010) 1–8
Foundation (DFG) grant ME1362-10 awarded to Elke van der Meer and
Herbert Hagendorf. We thank Sander Nieuwenhuis and an anonymous
reviewer for valuable comments on an earlier version of this paper.
References
Aston-Jones, G., Cohen, J.D., 2005. An integrative theory of locus coerules-norepinephrine
function: adaptive gain and optimal performance. Annual Review of Neuroscience 28
(1), 403–450.
Barsalou, L., 1999. Perceptual symbol systems. Behavioral and Brain Sciences 22, 577–660.
Barsalou, L.W., 2009. Simulation, situated conceptualization, and prediction. Philosophical
Transactions of the Royal Society B: Biological Sciences 364 (1521), 1281–1289.
Barsalou, L.W., Santos, A., Simmons, W.K., Wilson, C.D., 2008. Language and simulation
in conceptual processing. In: de Vega, M., Glenberg, A.M., Graesser, A.C. (Eds.),
Symbols, Embodiment, and Meaning. Oxford University Press, Oxford, pp. 245–284.
Beatty, J., 1982. Task-evoked pupillary responses, processing load, and the structure of
processing resources. Psychological Bulletin 91 (2), 276–292.
Botvinick, M.M., Braver, T.S., Barch, D.M., Carter, C.S., Cohen, J.D., 2001. Conflict monitoring
and cognitive control. Psychological Review 108 (3), 624–652.
Botvinick, M.M., Cohen, J.D., Carter, C.S., 2004. Conflict monitoring and anterior
cingulate cortex: an update. Trends in Cognitive Sciences 8 (12), 539–546.
Brown, G., Kindermann, S.S., Siegle, G.J., Granholm, E., Wong, E.C., Buxton, R.B., 1999. Brain
activation and pupil response during covert performance of the Stroop Color Word
task. Journal of the International Neuropsychological Society 5 (04), 308–319.
Cohen, J.D., Botvinick, M.M., Carter, C.S., 2000. Anterior cingulate and prefrontal cortex:
who's in control? Nature Neuroscience 3, 421–423.
Gilzenrat, M.S., Nieuwenhuis, S., Jepma, M., and Cohen, J.D. (in press). Pupil diameter
tracks changes in control state predicted by the adaptive gain theory of locus
coeruleus function. Cognitive, Affective, & Behavioral Neuroscience.
Hess, E.H., Polt, J.M., 1964. Pupil size in relation to mental activity during simple
problem-solving. Science 143, 1190–1192.
Hester, R., Fassbender, C., Garavan, H., 2004. Individual differences in error processing: a
review and reanalysis of three event-related fMRI studies using the GO/NOGO task.
Cerebral Cortex 14 (9), 986–994.
Just, M.A., Carpenter, P.A., Miyake, A., 2003. Neuroindices of cognitive workload:
neuroimaging, pupillometric and event-related potential studies of brain work.
Theoretical Issues in Ergonomics Science 4, 56–88.
Kahneman, D., 1973. Attention and Effort. Prentice Hall, Englewood Cliffs, NJ.
Nottenburg, G., Shoben, E.J., 1980. Scripts as linear orders. Journal of Experimental
Social Psychology 16 (4), 329–347.
Raisig, S., Welke, T., Hagendorf, H., van der Meer, E., 2007. Investigating dimensional
organization in scripts using the pupillary response. Psychophysiology 44 (6), 864–873.
Raisig, S., Welke, T., Hagendorf, H., van der Meer, E., 2009. Insights into knowledge
representation: the influence of amodal and perceptual variables on event
knowledge retrieval from memory. Cognitive Science 33, 1252–1266.
Rajkowski, J., Lu, W., Zhu, Y., Cohen, J.D., Aston-Jones, G., 2000. Prominent projections
from the anterior cingulate cortex to the locus coeruleus (LC) in rhesus monkey.
Society for Neuroscience Abstracts 26, 838.15.
Rushworth, M.F.S., Walton, M.E., Kennerley, S.W., Bannerman, D.M., 2004. Action sets and
decisions in the medial frontal cortex. Trends in Cognitive Sciences 8 (9), 410–417.
Schank, R.C., Abelson, R.P., 1977. Scripts, Plans, Goals and Understanding: Inquiry into
Human Knowledge Structures. Erlbaum, Hillsdale, NJ.
Siegle, G.J., Ichikawa, N., Steinhauer, S., 2008. Blink before and after you think: blinks
occur prior to and following cognitive load indexed by pupillary responses.
Psychophysiology 45 (5), 679–687.
Steinhauer, S.R., Zubin, J., 1982. Vulnerability to schizophrenia: Information processing
in the pupil and event-related potential. In: Usdin, E., Hanin, I. (Eds.), Biological
Markers in Psychiatry and Neurology. Pergamon Press, Oxford, pp. 371–385.
van der Meer, R., Beyer, R., Horn, J., Foth, M., Bornemann, B., Ries, J., et al., 2010. Resource
allocation and fluid intelligence: insights from pupillometry. Psychophysiology 47,
158–169.
Walton, M.E., Bannerman, D.M., Alterescu, K., Rushworth, M.F.S., 2003. Functional
specialization within medial frontal cortex of the anterior cingulate for evaluating
effort-related decisions. Journal of Neuroscience: The Official Journal of the Society
for Neuroscience 23 (16), 6475–6479.
Zacks, J.M., Speer, N.K., Swallow, K.M., Braver, T.S., Reynolds, J.R., 2007. Event perception: a
mind-brain perspective. Psychological Bulletin 133 (2), 273–293.
Zimmer, K., 1984. Ansätze der psychophysiologischen Indikation von Wissensrepräsentation:
Die Pupillomotorik als sensbiler Indikator semantischer Informationsverarbeitungsaktivität. In: Klix, F. (Ed.), Gedächtnis, Wissen, Wissensnutzung. VEB Deutscher Verlag der
Wissenschaft, Berlin, pp. 137–155.