Comparing Switch Costs: Alternating Runs and Explicit Cuing

Journal of Experimental Psychology:
Learning, Memory, and Cognition
2007, Vol. 33, No. 3, 475– 483
Copyright 2007 by the American Psychological Association
0278-7393/07/$12.00 DOI: 10.1037/0278-7393.33.3.475
Comparing Switch Costs: Alternating Runs and Explicit Cuing
Erik M. Altmann
Michigan State University
The task-switching literature routinely conflates different operational definitions of switch cost, its
predominant behavioral measure. This article is an attempt to draw attention to differences between the
two most common definitions, alternating-runs switch cost (ARS) and explicit-cuing switch cost (ECS).
ARS appears to include both the costs of switching tasks and the switch-independent costs specific to the
first trial of a run, with the implication that it should generally be larger than ECS, but worse is that the
alternating-runs procedure does not allow these costs to be separated. New data are presented to make
these issues concrete, existing data are surveyed for evidence that ARS is larger than ECS, and
implications of conflating these measures are examined for existing theoretical constructs.
Keywords: cognitive control, task switching, alternating-runs procedure, explicit cuing procedure
directly separate their effects. The aim here is to ground these
issues in a study to make them concrete, survey published data for
indirect evidence that the two measures really are different, and
finally, illustrate the relevance to some existing control-related
theoretical constructs.
Questions about how people set, focus on, and switch among the
short-term goals that govern everyday behavior are central in the
study of cognitive control. One experimental domain in which
these questions are addressed is task switching (for surveys, see
Logan, 2003; Monsell, 2003). In a typical task-switching experiment, the participant performs one of two simple tasks on each of
a (usually large) number of trials. The stimulus presented on each
trial, at least in the study reported shortly here, is a randomly
selected digit, and the task is either to judge the digit’s parity (even
or odd) or to judge its magnitude (higher or lower than 5). The
correct task for a given trial, or run of consecutive trials depending
on the procedural variant, is signaled by a task cue of some kind.
The usual empirical finding of interest, generally modulated by
some experimental manipulation, is switch cost: Performance on
switch trials, in which the task differs from the previous trial, is
worse than performance on repeat trials, in which the task is the
same as on the previous trial. Switch cost is often taken to reflect
the operation of processes involved in cognitive control (e.g.,
Monsell, 2003) but in any case is richly documented enough to call
for some sort of theoretical explanation.
Good explanations depend on good construct validity, and there
are two validity problems with how the construct of switch cost
has been reported and discussed in the literature. First, although
the two most common task-switching procedures measure switch
cost differently, this distinction has not been raised in comparative
surveys of task-switching data (e.g., Logan, 2003; Monsell, 2003).
This lack of qualification may explain some of the heterogeneity
that can be found across different studies of what is nominally the
same quantity. Second, one of the two measures is structurally
confounded, meaning that (a) its variance could be attributed to
either of two independent variables and (b) there is no way to
The Problem: Two Different Operational Definitions
The two most common task-switching procedures are explicit
cuing and alternating runs. In the explicit-cuing procedure, a task
cue is presented on every trial to indicate what task to perform on
that trial. Cue selection is randomized, such that the task could
switch from one trial to the next but could also repeat. Switch
cost— explicit-cuing switch cost (ECS)—is measured simply by
comparing switch trials with repeat trials.
In the alternating-runs procedure, in contrast, trials are grouped
into predictable-length runs that alternate between tasks (e.g.,
AABBAABB . . .). The task (A or B) is cued by the position of the
current trial within a run and usually also by the spatial location of
the trial stimulus (e.g., Rogers & Monsell, 1995) or other perceptual cue (Koch, 2003; Waszak, Hommel, & Allport, 2003). Switch
cost—alternating-runs switch cost (ARS)—is measured by comparing the trial in Position 1 of a run to the trial in Position 2. By
design, this position variable (levels 1 and 2) correlates perfectly
with the switching variable (levels switch and repeat) in that the
Position 1 trial is the switch trial and the Position 2 trial is the
repeat trial. Thus, whereas explicit cuing compares switch and
repeat within Position 1 (there being no Position 2), alternating
runs compares switch and repeat between Positions 1 and 2. A
critical issue, of course, is whether this confounding of position and
switching in the alternating-runs procedure is a mere technicality or
whether there is reason to believe that position and switching both
contribute to variance in this measure of switch cost.
For illustration of these two different switch-cost measures, an
experiment was conducted by use of a hybrid design that combines
central characteristics of the two procedures at issue. In this
experiment, trials were grouped by task into predictable-length
runs, as in alternating runs. However, a randomly selected task cue
was presented just before the Position 1 trial, as in explicit cuing.
This work was supported by ONR Grant N00014-06-1-0077. I thank
Rich Carlson, Tim Gamble, and Stephen Monsell for their comments and
suggestions for improvement.
Correspondence concerning this article should be addressed to Erik M.
Altmann, Department of Psychology, Michigan State University, East
Lansing, MI 48824. E-mail: [email protected]
475
ALTMANN
476
Thus, this design affords two measures of switch cost: One derived
by comparing switch and repeat trials on Position 1 (comparable
with ECS) and one derived by comparing Positions 1 and 2 on
switch runs (comparable with ARS). This procedure also affords a
measure of first-trial cost, which is the difference in performance
between Positions 1 and 2 of repeat runs. First-trial cost is an index
of processes that are deployed only on the first trial of a run and
that are independent of whether the task switches. With respect to
the alternating-runs procedure, the issue developed here is that
ARS may well include first-trial cost as well as switch cost.
The present design also preserves a feature of many alternatingruns studies, which is a constant response–stimulus interval (RSI)
between trials. A number of previous studies have demonstrated
first-trial cost in a situation in which the RSI differs for Positions
1 and 2 (Allport & Wylie, 1999, 2000; Altmann, 2004a; Altmann
& Gray, 2002; Gopher, Armony, & Greenshpan, 2000; Kramer,
Hahn, & Gopher, 1999; Waszak et al., 2003). For example, in
Experiment 4 of Allport and Wylie’s (2000) study, which was
designed to probe first-trial cost, RSI before Position 1 was 2 s,
whereas RSI between positions of a run was only 300 ms. Similarly, in Altmann’s (2004a) study, RSI before Position 1 was 1 s,
and RSI between positions within a run was zero. In these situations, first-trial effects could have emerged from the change in
event timing between Positions 1 and 2 (Gopher et al., 2000).
Indeed, the term restart cost has been used to describe these
effects, reflecting a hypothetical role for the pause coming before
Position 1, which, although brief, could have put the system into
some state from which it needs to be restarted. There is no such
pause in many alternating-runs studies (e.g., Koch, 2005; Rogers
& Monsell, 1995), so to help make the case that ARS may include
first-trial cost in these situations also, it was useful to show that
first-trial cost can arise in a situation in which RSI is uniform
between all trials.
block with the experimenter present to answer questions and
continued with 10 regular blocks. Each block contained 60 trials
and was followed by online feedback and a chance to rest. The
feedback asked participants to be more accurate if the block score
was below 90% and to be faster if the block score was 100%.
A trial began with onset of a trial stimulus and ended with the
participant’s response. A task cue appeared every other trial and
remained visible until the response for that trial. The subsequent
uncued trial was governed by the same task, meaning that trials
were grouped by task into runs of two. The RSI was 1,000 ms, and
the cue–stimulus interval (CSI) was 100 ms, also as in Koch’s
(2005, Experiments 2 and 3) study. The screen flashed after an
error, adding 400 ms after error trials.
Task cues were randomly selected subject to the constraint that
at most three repeat cues could occur in a row (as in Koch, 2005).
Trial stimuli were randomly selected subject to the constraint that
immediate and lag-two repetitions were not allowed.
Design and Analysis
The practice block and the first trial of remaining blocks were
excluded from analysis. Error trials and trials following error trials
were excluded from latency analysis. Latencies were trimmed by
taking medians. Latencies and errors were examined with a 2 (run
type: switch, repeat) ⫻ 2 (position in run: 1, 2) analysis of
variance. Task (parity, magnitude) was included in the analysis,
but its effects are not reported. Various simple-effects analyses of
variance were used to examine the costs labeled in Figure 1.
Results
The latency results appear in Figure 1, separated by run type
(switch, repeat) and position (1, 2). Latency on switch runs (1,007
Method
Switch run
Participants
Task cues and trial stimuli were taken from a recent alternatingruns study (Koch, 2005). A task cue was a square (indicating the
parity task) or diamond (indicating the magnitude task), unfilled
and 3.8 cm on a side, presented in white in the center of a dark
computer display. A trial stimulus was a digit from the set 1 to 9
excluding 5, presented with a height of 0.8 cm, also in white in the
center of the display (surrounded by the task cue). The parity task
was to judge whether the trial stimulus was even or odd, and the
magnitude task was to judge whether the trial stimulus was higher
or lower than 5. The Z and ?/ keys of a QWERTY keyboard were
used to respond to both tasks. Response-to-key mapping was
randomized between participants.
Procedure
Participants were tested individually in sessions lasting roughly
30 min. A session began with online instructions and a practice
1100
Response latency (ms)
Materials
Repeat run
1200
Fifteen Michigan State University undergraduate students participated in exchange for credit toward a course requirement. Two
additional participants were excluded from analysis because their
overall accuracy was below 90%.
ECS analog
ARS analog
1000
First-trial cost
900
800
1
2
Position in run
Figure 1. Response latencies from the experiment, showing measures
analogous to alternating-runs switch cost (ARS) and explicit-cuing switch
cost (ECS) as well as a measure of first-trial cost.
COMPARING SWITCH COSTS
ms) was marginally slower than on repeat runs (957 ms), F(1,
14) ⫽ 4.5, p ⫽ .052, MSE ⫽ 16,625. Latency on Position 1 (1,096
ms) was slower than on Position 2 (867 ms), F(1, 14) ⫽ 21.5, p ⬍
.001, MSE ⫽ 73,075. Run type and position interacted, F(1, 14) ⫽
7.9, p ⫽ .014, MSE ⫽ 8,391.
In terms of the costs labeled in Figure 1, the ARS analog was
reliable, as reflected in a simple effect of position on switch runs,
F(1, 14) ⫽ 24.5, p ⬍ .001, MSE ⫽ 46,517. The ECS analog was
also reliable, as reflected in a simple effect of run type on Position
1, F(1, 14) ⫽ 10.1, p ⫽ .007, MSE ⫽ 13,955. Finally, first-trial
cost was reliable, as reflected as a simple effect of position on
repeat runs, F(1, 14) ⫽ 14.2, p ⫽ .002, MSE ⫽ 34,949.
Error rates were 2.47% on Position 1 switch, 2.78% on Position
1 repeat, 2.01% on Position 2 switch, and 2.29% on Position 2
repeat. There were no main effects of run type or position on error
rates, and there was no interaction ( ps ⬎ .182).
Discussion
The data in Figure 1 illustrate, in the context of this hybrid
procedure, the distinctions raised above between ARS and ECS.
Of particular interest is the first-trial cost incurred on Position 1 of
a repeat run, which is almost exactly the difference between the
quantities representing analogs of ARS and ECS. This first-trial
cost represents a measure of whatever processes are triggered on
the first trial of a run but are not related to switching tasks. Any
such cost incurred in an alternating-runs study will be included
when Position 2 is subtracted from Position 1 to compute ARS.
These results also show that first-trial cost can arise in a situation in which event timing, in particular the RSI, is equal for all
trials, which is often the case in alternating-runs studies (e.g.,
Koch, 2005; Rogers & Monsell, 1995). Thus, it now seems difficult to argue that first-trial costs will be absent from ARS as long
as the particular implementation of alternating runs uses uniform
event timing.
Although these results suggest that first-trial cost contributes to
ARS, they are only indirect evidence that it does, because this
procedure is not actually alternating runs. This lack of direct
evidence reflects not so much a deficit in the present hybrid
procedure but a deeper structural limitation of the alternating-runs
procedure, which is that it has no repeat run that would allow such
evidence to be developed, one way or the other. Without a repeat
run, there is no way to estimate first-trial cost as in Figure 1 and
no way to compute switch cost net of first-trial cost by comparing
a Position 1 repeat trial with a Position 1 switch trial. The implication is that if a given manipulation in an alternating-runs study
affects ARS in some way, it is impossible to know whether this
manipulation is having its effect on first-trial processes, on switchrelated processes, or on both. The further and perhaps surprising
implication is that despite the deep roots of the alternating-runs
procedure in the task-switching domain (Fagot, 1994; Rogers &
Monsell, 1995), it offers little leverage on the question of what
processes cause switch cost, which from the outset has been the
predominant question in this literature. The explicit-cuing procedure does not have this problem, because ECS is computed within
position, which, as operationalized here in terms of predictablelength runs, is always simply 1.
That said, any procedure will have its drawbacks, and explicitcuing is no exception. Indeed, there is a line of work premised on
477
the notion that ECS itself is confounded because the cost of
switching tasks is entangled with the cost of switching task cues,
with an attendant theoretical proposal that task-switch cost reduces
simply to cue-switch cost (Logan & Bundesen, 2003, 2004). This
theoretical proposal itself is weak (Altmann, in press; Mayr, 2006;
Monsell & Mizon, 2006; see also Forstmann, Brass, & Koch, in
press), and the confounding of task switching and cue switching is
actually present in any task-switching procedure that involves task
cues, including alternating runs. However, a more easily detectable
problem with explicit cuing, which is not necessarily shared by
alternating runs, is that it does not allow cue-related processing to
be separated from processing that starts with onset of the trial
stimulus. Thus, for example, explicit-cuing studies alone cannot
tell us whether manipulations of the CSI before a trial affect only
cue-related processing, as at least one model implies (Logan &
Bundesen, 2003, 2004) or whether they also affect later stages like
stimulus encoding or response selection. To develop evidence on
this point, uncued trials need to be included in the procedure,
implying a run structure of some sort, and indeed, in an experiment
similar to the one reported here in which CSI before Position 1 was
manipulated, both Position 2 and Position 1 were affected, suggesting that cue encoding was not the only process that changed
(Altmann, in press). The larger point is that these and any similar
issues that might emerge as we refine our understanding of procedures for studying cognitive control are important but separate
from the issue addressed here, which is an under-appreciated
problem with the alternating-runs procedure that the explicit-cuing
procedure does not share.
Evidence That One Switch Cost Is Larger Than the Other
Even if the alternating-runs procedure, lacking repeat runs, does
not afford a direct test of whether ARS includes a first-trial
component, one can still approach the question indirectly. This
section first develops a theoretical case that a functional adaptation
could produce first-trial cost in the alternating-runs procedure, then
reviews empirical data at two different levels— closely-matched
comparisons of the two procedures and unmatched comparisons—
suggesting that ARS is in fact larger than ECS, which is the
prediction if ARS spans a process that ECS does not.
A Theoretical Perspective
Given a situation in which trials are predictably grouped into
runs, consider the possibility that the cognitive system exploits this
regularity by opting to process the task cue on the first trial of a
run—whatever the cue, switch or repeat—in some way that it does
not process task cues on later trials of the run. The function of this
first-trial process might be to encode (Altmann, 2002) or retrieve
(Mayr & Kliegl, 2000, 2003) a mental representation of the current
task, which could then serve the system for the duration of that run,
until the next run starts. Such a first-trial strategy, whatever its
specific processing details, could have benefits even if, as assumed
here, all trials, and not just the first of a run, are accompanied by
task cues. For example, it would save having to establish the
relevant mental representation anew on every trial. It would also,
by virtue of applying to switch cues and repeat cues alike, spare the
system a decision process that would otherwise be necessary to
478
ALTMANN
distinguish whether the current cue represents a different task than
the one performed on the previous trial.
If a first-trial strategy like this were deployed when trials are
grouped into alternating runs, not just when trials are grouped into
randomized runs, then at least some of ARS would reflect first-trial
cost, plus any additional cost arising because Position 1 happens to
be a switch trial. This would be true even if a first-trial strategy
was deployed only stochastically within a given study, by some
participants on all trials, all participants on some trials, or some
other combination. (As hinted above, an alternate strategy available in the typical alternating-runs procedure, in which every trial
is accompanied by an external cue, would be to acquire task
information anew from the cue on every trial.) If the aggregate data
from a given study include any meaningful proportion of runs on
which the system deployed a first-trial strategy, then ARS for that
study will include some first-trial cost, as well as the cost of
switching tasks (the Position 1 trial also being a switch trial). Thus,
to have confidence that ARS contains no first-trial cost, one would
need to develop evidence that such a strategy was, in effect, never
used. This is a high burden that no extant alternating-runs study
can be said to have met. A conservative conclusion would therefore be that ARS, as commonly reported in alternating-runs studies, includes some amount of first-trial cost linked to at least
stochastic use of a first-trial strategy.
This analysis suggests that a cleaner implementation of the
alternating-runs procedure would be to present a task cue only on
the first trial of a run, as in the experiment reported earlier, rather
than cuing every trial. The original motivation for cuing every trial
was to helpfully relieve the experimental participant of the need to
carry the task in memory (Rogers & Monsell, 1995), but of course
this does not prevent the system from carrying the task in memory,
if this happens to be the more efficient strategy (e.g., Gray, Sims,
Fu, & Schoelles, 2006). In general, there is no way to prevent use
of memory short of inducing appropriate lesions, and indeed a fact
of life is that we often use memory to store information (like
access codes or phone numbers) because this affords fast access. In
effect, then, helpfully relieving the participant of the need to carry
a memory load unhelpfully relieves the analyst of any control over
where the participant stores information about the current task.
Cuing only the first trial of a run would limit the strategic choices
available to participants and thus make at least Position 2 latencies,
for example, interpretable in terms net of perceptual task-related
operations.
trials in the alternating-runs procedure. Despite these procedural
differences, however, there is no harm in probing performance
differences between alternating-runs and explicit-cuing procedures, and four studies to date have done so.
In the first study, Tornay and Milan (2001) reported three
experiments comparing the two procedures. In their Experiments 1
and 2, there appears to have been no interaction of the procedure
variable (alternating runs vs. explicit cuing) with switching (switch
vs. repeat). However, in their Experiment 3, in which slightly
longer runs were used in the alternating-runs procedure, ARS did
in fact seem to be larger than ECS. Figure 2 shows these results.
The solid line shows the alternating-runs (predictable) condition,
and the dashed line shows the explicit-cuing (random) condition.
The abscissa shows a generalized position variable that drops the
predictability constraint to accommodate a comparison between
predictable runs in the alternating-runs procedure and randomly
occurring runs in the explicit-cuing procedure (the relevance of
distinguishing these two kinds of runs is addressed later in this
section). ARS is simply the difference, along the solid line, between shift trials and first repetition. ECS is the difference, along
the dashed line, between shift trials and an average of first repetition and second repetition, weighted toward the former because
of the lower binomial probability of reaching two repeat trials in a
row. Comparing these representations of ARS and ECS, it seems
reasonably clear that ARS is larger, and although no statistics
involving the switching variable (switch vs. repeat) were reported
for this experiment, the interaction of the procedure and general-
Empirical Evidence From Closely Matched Experiments
From an empirical angle, there is in fact a circumstantial case
that ARS is generally larger than ECS, consistent with the proposal
that ARS includes some amount of first-trial cost. The case can
only be circumstantial because the two procedures are fundamentally different in ways that may affect the cost of switching tasks.
The most salient difference may be the predictability of tasks in the
alternating-runs procedure, which should largely eliminate any
effects of task uncertainty from trial to trial and may facilitate
anticipatory processing for pending trials in a way that the explicitcuing procedure cannot. Certainly the probability of a task switch
is known to affect performance (e.g., Dreisbach, Haider, & Kluwe,
2002; Monsell & Mizon, 2006), and this is roughly one half in the
typical explicit-cuing procedure but either zero or one on different
Figure 2. Response times (RTs) for alternating runs (predictable condition) compared with explicit cuing (random condition), from Experiment 3
of Tornay and Milan’s (2001) study. The abscissa represents a generalized
definition of the position variable (shift trials ⫽ Position 1, first repetition ⫽ Position 2, second repetition ⫽ Position 3) that accommodates both
predictable-length and randomly occurring runs. Reprinted, with permission, from “A More Complete Task-Set Reconfiguration in Random Than
in Predictable Task Switch,” by F. J. Tornay and E. G. Milan, 2001,
Quarterly Journal of Experimental Psychology, Human Experimental Psychology, 54(A), 785– 803, Figure 3. Copyright 2001 by Experimental
Psychology Society.
COMPARING SWITCH COSTS
ized position variables—the two variables shown in Figure 2—was
reliable, F(2, 34) ⫽ 3.51, p ⬍ .05.
In a second study, Monsell, Sumner, and Waters (2003) reported
one experiment that compared alternating runs with explicit cuing.
The results of their Experiment 2 appear in Figure 3. Here, the
alternating-runs (predictable) condition appears in the left panels,
and the explicit-cuing (random) condition appears in the right
panels. The different lines represent different intervals (in milliseconds) separating the response on one trial from onset of the
stimulus for the next trial (this is the RSI). The abscissa again
shows the generalized position variable that drops the predictability constraint to accommodate explicit-cuing runs. ARS is the
difference, along each line in the upper left panel, between the 1
and 2 bins on the abscissa. ECS is the difference, along each line
in the upper right panel, between the 1 bin on the abscissa and an
average of the 2, 3, and 4 bins, weighted toward the 2 bin. Within
each RSI, but most dramatically for the 50 condition, ARS was
again larger than ECS. Statistics involving the switching variable
(switch vs. repeat) were not reported, but the interaction between
the procedure (alternating runs vs. explicit cuing) and position in
run variables was again reliable, F(3, 33) ⫽ 9.77, p ⬍ .001.
In a third study, Milan, Sanabria, Tornay, and Gonzalez (2005)
also reported one experiment that compared alternating runs with
explicit cuing. The results of their Experiment 1 appear in Figure
4, in a format similar to Figure 2. ARS is the difference, along the
Figure 3. Response times (RTs) for alternating runs (predictable condition) compared with explicit cuing (random condition), from Experiment 2
of Monsell et al.’s (2003) study. The abscissa again represents a generalized definition of the position variable that accommodates both
predictable-length and randomly occurring runs. Reprinted, with permission, from “Task-Set Reconfiguration With Predictable and Unpredictable
Task Switches,” by S. Monsell, P. Sumner, and H. Waters, 2003, Memory
& Cognition, 31, 327–342, Figure 6. Copyright 2003 by Psychonomic
Society.
479
solid (predictable) line, between the 0 and 1 bins on the abscissa.
ECS is the difference, along the dotted (random) line, between the
0 bin on the abscissa and an average of the 1 and 2 bins, weighted
toward the 1 bin. ARS was larger than ECS, and although statistics
involving switching (switch vs. repeat) were not reported, the
interaction between the variables shown in Figure 4 was again
reliable, F(2, 34) ⫽ 7.96, p ⬍ .01.
In all three studies discussed above, the original aim was to
compare predictable runs in an alternating-runs condition with
randomly-occurring runs in an explicit-cuing condition, the latter
collected by finding unpredicted runs in the data post hoc. For
present purposes, the relevance of this difference between conditions lies in the difference in opportunities afforded the cognitive
system. Under the theoretical perspective developed earlier,
predictable-length runs should allow the system to save itself some
processing on later trials of a run by setting up a functional
representation of the current task on the first trial and then using
this for the duration of the run. Such a strategy would produce
confounds in ARS, as developed earlier. In explicit cuing, however, there are no predictable runs to exploit, so one might expect
the system to treat every trial somewhat like a first trial. This in
turn predicts that Positions 2 and later in an explicit-cuing run
should incur longer latencies than Positions 2 and later in an
alternating-runs run. This is empirically the case in at least Figures
3 and 4 (with little difference between the conditions in Figure 2).
The story may be more complicated, however, given the trend for
latencies to decrease across an explicit-cuing run, as seen particularly in Figure 3 (right panel; see also Meiran, Chorev, & Sapir,
2000). This trend could reflect facilitation of cue-related processing by repetition priming (e.g., Altmann, 2005; Sohn & Anderson,
2001), if this happens to build up over successive cue repetitions,
or conceivably an effect of a lengthening streak of repeated events
within a randomized event stream (Altmann & Burns, 2005), or
some other mechanism. The details of whatever explanation
emerges may suggest that the repeat trial in some particular position of a randomly occurring run is the best estimate of the offset
of switch-related processes in explicit cuing, which in turn would
make it important, for present purposes, to check whether differences between ARS and a revised ECS still stand. In purely
procedural terms, however, the best match for the Position 2 trial
used to compute ARS is the Position 2 trial of an explicit-cuing
run, which would produce the smallest ECS (see Figure 3) and thus
the largest difference between ARS and ECS.
Finally, in a fourth study that undertook a different kind of
comparison between the two procedures, Koch (2005) reported
three experiments in each of which participants performed the first
five blocks under one procedure, switched to the other procedure
for the sixth block, then switched back to the original procedure for
a seventh block. In his Experiments 1 and 2, in which the first and
last procedure was alternating runs, the switch in between to
explicit cuing produced no reliable change in switch cost. However, in his Experiment 3, in which the first and last procedure was
explicit cuing, the switch in between to alternating runs did in fact
produce an increase in switch cost, F(1, 15) ⫽ 11.8, p ⬍ .05.
The generalization across these four studies is that, in all cases
in which ARS and ECS differed reliably, ARS was larger, supporting the notion that ARS includes components that ECS does
not.
ALTMANN
480
Figure 4. Response times for alternating runs (predictable condition) compared with explicit cuing (random
condition), from Experiment 1 of Milan, Sanabria, Tornay, and Gonzalez’s (2005) study. The abscissa again
represents a generalized definition of the position variable (0 ⫽ Position 1, 1 ⫽ Position 2, 2 ⫽ Position 3) that
accommodates both predictable-length and randomly occurring runs. Reprinted from Acta Psychologica, 118,
E. G. Milan, D. Sanabria, F. J. Tornay, and A. Gonzalez, “Exploring Task-Set Reconfiguration With Random
Task Sequences,” 319 –331, Copyright 2005, with permission from Elsevier.
Empirical Evidence From Unmatched Experiments
Apart from these few closely-matched comparisons of alternating runs and explicit cuing, both procedures are popular, and many
studies have been published using one or the other. Therefore, if
ARS is on balance larger than ECS, this difference should manifest
in a sufficiently large unmatched-samples comparison. Table 1
shows the results of such an analysis. Studies selected for inclusion
were those involving healthy adults, in which cues were mapped
one-to-one to tasks (see Altmann, in press) and in which there was
a manipulation of the CSI between onset of the task cue and onset
of the subsequent trial stimulus. This last criterion was adopted
from another meta-analysis (Altmann, 2004b) simply to produce a
tractable sample size here, but it also happens to control nicely for CSI
duration, which was nearly equal for the two sets of studies (608 vs.
605 ms). (This control is relevant because CSI is often cited as
modulating switch cost, at least as switch cost is construed in, e.g.,
Logan, 2003; Monsell, 2003; see Altmann, 2004b, for a review of
contrary evidence.) Mean ARS across studies was 172 ms, and mean
ECS was 94 ms, F(1, 32) ⫽ 14.2, p ⫽ .001, MSE ⫽ 3,486.
Available empirical evidence, then, both from studies that match
the two procedures as closely as possible and from a broader
survey of studies that each use one procedure or the other, suggests
that ECS spans less processing than ARS—that subtracting repeat
trials from switch trials in explicit cuing cancels processing costs
that subtracting Position 2 trials from Position 1 trials in alternating runs does not. The remaining question is whether this matters—whether, after a dozen years of task switching (in the modern
era) and hundreds of studies, it may nonetheless be worthwhile
now to adopt a greater degree of precision in discussing switch
costs.
Implications: Failure-to-Engage and Stimulus Priming
Accounts of . . . What?
To summarize, the basic issue is that ARS and ECS have
different operational definitions, with the contrast sharpened by
conceptualizing ARS as ECS plus any first-trial costs (see Figure
1). The deeper methodological problem for ARS is that the
alternating-runs procedure affords no way to disentangle switchrelated processes from switch-independent, first-trial related processes. The reaction to these issues in the task-switching community will surely range from boredom to irritation. On one hand,
much of the procedural detail is in some sense obvious, and studies
have been published comparing the two procedures, so some
attention has been paid. On the other hand, everyone in the
community (myself included) has at one point or another referred
to these two measures interchangeably in framing research questions or results or in captioning existing findings, so we all stand
accused of some methodological sloppiness. Is it possible, therefore, to point to specific examples where such conflation has cost
us something, to illustrate the common interest in greater precision?
Three examples are offered here, the first two involving important constructs that are difficult to interpret because almost all the
relevant data were collected with the alternating-runs procedure.
The first line of work involves the failure-to-engage hypothesis
and its predictions concerning variance in response latencies (De
Jong, 2000; De Jong, Berendsen, & Cools, 1999; Lien, Ruthruff,
Remington, & Johnston, 2005; Nieuwenhuis & Monsell, 2002).
The empirical finding is that, at long CSIs, fast responses on
switch trials are as fast as the fastest responses on repeat trials, but
slow responses on switch trials are much slower than the slowest
responses on repeat trials. This pattern is consistent with a reconfiguration process that has work to do on switch trials (Monsell,
2003; Rogers & Monsell, 1995) and either completes this work
during a long CSI, if it can engage in a timely fashion, or engages
only after stimulus onset, in which case the time to do the work is
charged to response latency. Mixing the two types of trials—those
in which reconfiguration completes during the CSI and those in
which reconfiguration only starts after stimulus onset— explains
the empirical pattern in terms of some switches costing no extra
response latency and others costing the full amount of reconfigu-
COMPARING SWITCH COSTS
Table 1
Comparison of Alternating-Runs and Explicit-Cuing Switch
Costs
Study author(s)
Alternating-runs studies
De Jong (2000)
De Jong et al. (1999)
De Jong (2001)
Fagot (1994)
Kleinsorge (1999)
Koch (2003)
Koch (2005)
Kramer et al. (1999)
Kray & Lindenberger (2000)
Monsell et al. (2003)
Nieuwenhuis & Monsell (2002)
Oriet & Jolicoeur (2003)
Rogers & Monsell (1995)
Yeung & Monsell (2003)
Mean across studies
Explicit-cuing studies
Altmann (2004a)
Altmann (2004b)
Arrington (2002)
Besner & Care (2003)
Bojko et al. (2004)
Garavan (1998)
Hahn et al. (2003)
Hubner et al. (2004)
Kleinsorge et al. (2005)
Kleinsorge & Gajewski (2004)
Koch (2001)
Mayr (2002)
Mayr & Keele (2000)
Meiran (1996)
Meiran (2000a)
Meiran (2000b)
Meiran et al. (2000)
Meiran & Marciano (2002)
Schuch & Koch (2003)
Tornay & Milan (2001)
Mean across studies
Experiment
Mean
CSI
Mean
switch
cost
1, 2
2
2a
3
1–3
1
1 vs. 2
2, 3a
1a
1, 2
1
1, 2
2, 3
4
750
733
900
200
450
600
500
750
700
600
563
520
540
700
608
160
105
170
197
30
272
199
308
143
137
105
278
175
128
172
1, 2
1–3
2
1
1a
2
1
1–4
2b
1
1 vs. 3, 4
1
4
1–5
1, 2
1
2–4
1
1–4c
1, 2
500
500
700
375
300
305
450
675
675
550
500
317
400
982
891
516
785
1270
550
850
605
100
100
129
59
6
201
120
89
101
116
180
33
98
75
70
67
73
94
110
67
94
Note. Mean cue–stimulus interval (CSI) and mean switch cost are simple,
unweighted averages of CSIs and switch costs, respectively, in all conditions of all listed experiments in a given study, excluding any general
switch (or mixing) costs.
a
Young conditions only. b Correctly cued trials only. c Go conditions
only.
ration time. These studies were all framed in terms of a reconfiguration perspective on cognitive control. Yet, because they all used
alternating runs, their switch trials were also Position 1 trials, and
their repeat trials were also Position 2 trials, so they equally well
support a model in which the functional processes that engage
during some CSIs but not others are those that execute on the first
trial of a run—any run, switch or repeat. Thus, despite how it has
been framed, the failure-to-engage hypothesis may well turn out to
be an important element of a model in which the processes that
implement cognitive control have nothing to do with switching
tasks.
The second example is a line of work that characterizes switch
cost as a consequence of negative effects of stimulus priming. The
481
basic finding is that performance in response to new stimuli,
namely those that have never been used together with the competing task, is better than performance in response to old stimuli,
namely those that have been used together with the competing task
(Allport & Wylie, 2000, Experiment 5; Waszak et al., 2003;
Waszak, Hommel, & Allport, 2005). In these studies, a stimuluspriming effect registered on Position 1 of a switch run when
switching was from a picture-naming task to a word-reading task
(and registered on all positions when switching was in the other
direction). However, because these results were gathered by use of
alternating runs, with no repeat runs interleaved, they are silent on
whether this effect on Position 1 reflected changes in switching
processes or changes in first-trial processes. More recently, one
explicit-cuing study has linked stimulus priming to an increase in
ECS (Koch & Allport, 2006) but also reported a main effect on
response latency. To know whether this main effect reflected a
change in first-trial processes, one would need to reproduce it
using randomized runs, which would allow a test of the effect on
first-trial cost (see Figure 1).
The third example concerns the four aging studies included in
Table 1. In the three that used alternating runs (De Jong, 2001;
Kramer et al., 1999; Kray & Lindenberger, 2000), differences in
ARS between young and older adults were 159 ms, 282 ms, and (a
nonsignificant) 80 ms, respectively. In the one that used explicit
cuing (Bojko, Kramer, & Peterson, 2004), the difference in ECS
was a smaller (also nonsignificant) 33 ms. Were it not for the
“switching” terminology used in the first three studies, they might
suggest that first-trial costs, not task-switch costs, were what
increased with age, which could lead to very different guidance
from these studies about which mechanisms to target in future
cognitive-aging studies.
These three examples do not reflect an exhaustive search, so
there may be others, given that the alternating-runs procedure was
present at the inception of the modern task-switching era (Fagot,
1994; Rogers & Monsell, 1995) and remains in use today (e.g.,
Lien et al., 2005; Taube-Schiff & Segalowitz, 2005; Waszak et al.,
2005). However, what they may suffice to show is that precision in
reporting and discussion of switch cost is not just good in principle, in a domain broadly concerned with this construct, but is a
prerequisite for developing empirical results like those sketched
above into diagnostically useful constraints on models of cognitive
control.
Summary
The aim here has been to draw attention to a difference in how
switch cost is operationally defined in the two most common
task-switching procedures—a difference that in the literature is
generally ignored, even in integrative surveys (e.g., Logan, 2003;
Monsell, 2003). ARS is derived through comparison of Positions 1
and 2 of switch runs and thus may include costs specific to the first
trial of a run as well as costs of switching tasks. Whether it does
or not cannot be tested directly, because the alternating-runs procedure affords no Position 1 repeat trial that would cancel such
costs when used as a baseline. Nonetheless, if there is a realistic
possibility that performance in a context of predictable-length runs
involves a first-trial strategy of some kind, in which whatever cue
is presented at the start of a run is generically transformed into an
internal task representation that governs until its relevance expires,
ALTMANN
482
then first-trial costs cannot be ruled out, and ARS should be treated
with caution, as a confounded measure. In contrast, ECS is derived
by comparing switch trials and repeat trials with no position confound, such that any costs specific to the first trial of a predictablelength run—a run of length 1, in this instance— cancel in the comparison; explicit-cuing has shortcomings of its own, but at least it does
not suffer a structural confound between effects of switching tasks and
effects of position within a predictable-length run.
This analysis makes a prediction—that ARS should in general
be larger than ECS—that is supported by available empirical
evidence from the few studies that have compared the two procedures under relatively controlled circumstances and from a broader
survey of unmatched studies. It also suggests that one would need
to elaborate at least two lines of work in the task-switching
domain, one on the failure-to-engage hypothesis and the other on
the negative effects of stimulus priming, using a randomized-runs
procedure to identify whether effects reported in the original
studies were switch-related or first-trial related. More generally,
this analysis suggests that it would be useful to start acknowledging that ARS and ECS have different operational definitions, rather
than citing them together as if they are identical measures, as is
currently the standard (e.g., Logan, 2003; Monsell, 2003).
References
Allport, A., & Wylie, G. (1999). Task-switching: Positive and negative
priming of task-set. In G. H. Humphreys, J. Duncan, & A. Treisman
(Eds.), Attention, space, and action: Studies in cognitive neuroscience
(pp. 273–296). New York: Oxford University Press.
Allport, A., & Wylie, G. (2000). Task-switching, stimulus-response bindings, and negative priming. In S. Monsell & J. Driver (Eds.), Attention
and performance XVIII: Control of cognitive processes (pp. 35–70).
Cambridge, MA: M.I.T. Press.
Altmann, E. M. (2002). Functional decay of memory for tasks. Psychological Research, 66, 287–297.
Altmann, E. M. (2004a). Advance preparation in task switching: What
work is being done? Psychological Science, 15, 616 – 622.
Altmann, E. M. (2004b). The preparation effect in task switching: Carryover of SOA. Memory & Cognition, 32, 153–163.
Altmann, E. M. (2005). Repetition priming in task switching: Do the
benefits dissipate? Psychonomic Bulletin & Review, 12, 535–540.
Altmann, E. M. (in press). Task switching is not cue switching. Psychonomic Bulletin & Review.
Altmann, E. M., & Burns, B. D. (2005). Streak biases in decision making:
Data and a memory model. Cognitive Systems Research, 6, 5–16.
Altmann, E. M., & Gray, W. D. (2002). Forgetting to remember: The
functional relationship of decay and interference. Psychological Science,
13, 27–33.
Arrington, C. M. (2002). Explorations in task space: Similarity effects on
task switching. Unpublished doctoral dissertation, Michigan State University, East Lansing.
Besner, D., & Care, S. (2003). A paradigm for exploring what the mind
does while deciding what it should do. Canadian Journal of Experimental Psychology, 57, 311–320.
Bojko, A., Kramer, A. F., & Peterson, M. S. (2004). Age equivalence in
switch costs for prosaccade and antisaccade tasks. Psychology and
Aging, 19, 226 –234.
De Jong, R. (2000). An intention-activation account of residual switch
costs. In S. Monsell & J. Driver (Eds.), Attention and performance
XVIII: Control of cognitive processes (pp. 357–376). Cambridge, MA:
M.I.T. Press.
De Jong, R. (2001). Adult age differences in goal activation and goal
maintenance. European Journal of Cognitive Psychology, 13, 71– 89.
De Jong, R., Berendsen, E., & Cools, R. (1999). Goal neglect and inhibitory limitations: Dissociable causes of interference effects in conflict
situations. Acta Psychologica, 101, 379 –394.
Dreisbach, G., Haider, H., & Kluwe, R. H. (2002). Preparatory processes
in the task-switching paradigm: Evidence from the use of probability
cues. Journal of Experimental Psychology: Learning, Memory, and
Cognition, 28, 468 – 483.
Fagot, C. (1994). Chronometric investigations of task switching. Unpublished doctoral dissertation, University of California, San Diego,.
Forstmann, B. U., Brass, M., & Koch, I. (in press). Methodological and
empirical issues when dissociating cue-related from task-related processes in the explicit task-cuing procedure. Psychological Research.
Garavan, H. (1998). Serial attention within working memory. Memory &
Cognition, 26, 263–276.
Gopher, D., Armony, L., & Greenshpan, Y. (2000). Switching tasks and
attention policies. Journal of Experimental Psychology: General, 129,
308 –339.
Gray, W. D., Sims, C. R., Fu, W.-T., & Schoelles, M. J. (2006). The
soft-constraints hypothesis: A rational analysis approach to resource
allocation for interactive behavior. Psychological Review, 113, 461–
482.
Hahn, S., Andersen, G. J., & Kramer, A. F. (2003). Multidimensional set
switching. Psychonomic Bulletin & Review, 10, 503–509.
Hubner, M., Kluwe, R. H., Luna-Rodriguez, A., & Peters, A. (2004).
Response selection difficulty and asymmetrical costs of switching between tasks and stimuli: No evidence for an exogenous component of
task-set reconfiguration. Journal of Experimental Psychology: Human
Perception and Performance, 30, 1043–1063.
Kleinsorge, T. (1999). Response repetition benefits and costs. Acta Psychologica, 103, 295–310.
Kleinsorge, T., & Gajewski, P. D. (2004). Preparation for a forthcoming
task is sufficient to produce subsequent shift costs. Psychonomic Bulletin
& Review, 11, 302–306.
Kleinsorge, T., Gajewski, P. D., & Heuer, H. (2005). Task sets under
reconstruction: Effects of partially incorrect precues. Quarterly Journal
of Experimental Psychology: Human Experimental Psychology, 58(A),
521–546.
Koch, I. (2001). Automatic and intentional activation of task sets. Journal
of Experimental Psychology: Learning, Memory, and Cognition, 27,
1474 –1486.
Koch, I. (2003). The role of external cues for endogenous advance reconfiguration in task switching. Psychonomic Bulletin & Review, 10, 488 –
492.
Koch, I. (2005). Sequential task predictability in task switching. Psychonomic Bulletin & Review, 12, 107–112.
Koch, I., & Allport, A. (2006). Cue-based preparation and stimulus-based
priming of tasks in task switching. Memory & Cognition, 34, 433– 444.
Kramer, A. F., Hahn, S., & Gopher, D. (1999). Task coordination and
aging: Explorations of executive control processes in the task switching
paradigm. Acta Psychologica, 101, 339 –378.
Kray, J., & Lindenberger, U. (2000). Adult age differences in task switching. Psychology and Aging, 15, 126 –147.
Lien, M.-C., Ruthruff, E., Remington, R. W., & Johnston, J. C. (2005). On
the limits of advance preparation for a task switch: Do people prepare all
the task some of the time or some of the task all of the time? Journal of
Experimental Psychology: Human Perception and Performance, 31,
299 –315.
Logan, G. D. (2003). Executive control of thought and action: In search of
the wild homunculus. Current Directions in Psychological Science, 12,
45– 48.
Logan, G. D., & Bundesen, C. (2003). Clever homunculus: Is there an
endogenous act of control in the explicit task-cuing procedure? Journal
of Experimental Psychology: Human Perception and Performance, 29,
575–599.
COMPARING SWITCH COSTS
Logan, G. D., & Bundesen, C. (2004). Very clever homunculus: Compound stimulus strategies for the explicit task-cuing procedure. Psychonomic Bulletin & Review, 11, 832– 840.
Mayr, U. (2002). Inhibition of action rules. Psychonomic Bulletin &
Review, 9, 93–99.
Mayr, U. (2006). What matters in the cued task-switching paradigm: Tasks
or cues? Psychonomic Bulletin & Review, 13, 794 –799.
Mayr, U., & Keele, S. W. (2000). Changing internal constraints on action:
The role of backward inhibition. Journal of Experimental Psychology:
General, 129, 4 –26.
Mayr, U., & Kliegl, R. (2000). Task-set switching and long-term memory
retrieval. Journal of Experimental Psychology: Learning, Memory, and
Cognition, 26, 1124 –1140.
Mayr, U., & Kliegl, R. (2003). Differential effects of cue changes and task
changes on task-set selection costs. Journal of Experimental Psychology: Learning, Memory, and Cognition, 29, 362–372.
Meiran, N. (1996). Reconfiguration of processing mode prior to task
performance. Journal of Experimental Psychology: Learning, Memory,
and Cognition, 22, 1423–1442.
Meiran, N. (2000a). Intentional reconfiguration and involuntary persistence
in task set switching. In S. Monsell & J. Driver (Eds.), Attention and
performance XVIII: Control of cognitive processes (pp. 377–399). Cambridge, MA: M.I.T. Press.
Meiran, N. (2000b). Modeling cognitive control in task-switching. Psychological Research, 63, 234 –249.
Meiran, N., Chorev, Z., & Sapir, A. (2000). Component processes in task
switching. Cognitive Psychology, 41, 211–253.
Meiran, N., & Marciano, H. (2002). Limitations in advance task preparation: Switching the relevant stimulus dimension in speeded samedifferent comparisons. Memory & Cognition, 30, 540 –550.
Milan, E. G., Sanabria, D., Tornay, F. J., & Gonzalez, A. (2005). Exploring
task-set reconfiguration with random task sequences. Acta Psychologica,
118, 319 –331.
Monsell, S. (2003). Task switching. Trends in Cognitive Sciences, 7,
134 –140.
Monsell, S., & Mizon, G. A. (2006). Can the task-cuing paradigm measure
an “endogenous” task-set reconfiguration process? Journal of Experimental Psychology: Human Perception and Performance, 32, 493–516.
483
Monsell, S., Sumner, P., & Waters, H. (2003). Task-set reconfiguration
with predictable and unpredictable task switches. Memory & Cognition,
31, 327–342.
Nieuwenhuis, S., & Monsell, S. (2002). Residual costs in task switching:
Testing the failure-to-engage hypothesis. Psychonomic Bulletin & Review, 9, 86 –92.
Oriet, C., & Jolicoeur, P. (2003). Absence of perceptual processing during
reconfiguration of task set. Journal of Experimental Psychology: Human
Perception and Performance, 29, 1036 –1049.
Rogers, R. D., & Monsell, S. (1995). Costs of a predictable switch between
simple cognitive tasks. Journal of Experimental Psychology: General,
124, 207–231.
Schuch, S., & Koch, I. (2003). The role of response selection for inhibition
of task sets in task shifting. Journal of Experimental Psychology: Human
Perception and Performance, 29, 92–105.
Sohn, M.-H., & Anderson, J. R. (2001). Task preparation and task repetition: Two-component model of task switching. Journal of Experimental
Psychology: General, 130, 764 –778.
Taube-Schiff, M., & Segalowitz, N. (2005). Linguistic attention control:
attention shifting governed by grammaticized elements of language.
Journal of Experimental Psychology: Learning, Memory, and Cognition, 31, 508 –519.
Tornay, F. J., & Milan, E. G. (2001). A more complete task-set reconfiguration in random than in predictable task switch. Quarterly Journal of
Experimental Psychology: Human Experimental Psychology, 54(A),
785– 803.
Waszak, F., Hommel, B., & Allport, A. (2003). Task-switching and longterm priming: Role of episodic stimulus-task bindings in task-shift costs.
Cognitive Psychology, 46, 361– 413.
Waszak, F., Hommel, B., & Allport, A. (2005). Semantic generalization of
stimulus-task bindings. Psychonomic Bulletin & Review, 11, 1027–1033.
Yeung, N., & Monsell, S. (2003). The effects of recent practice on task
switching. Journal of Experimental Psychology: Human Perception and
Performance, 29, 919 –936.
Received July 7, 2006
Revision received September 30, 2006
Accepted December 11, 2006 䡲