Individual Differences in Inhibitory Control and Children`s Theory of

Child Development, July/August 2001, Volume 72, Number 4, Pages 1032–1053
Individual Differences in Inhibitory Control
and Children’s Theory of Mind
Stephanie M. Carlson and Louis J. Moses
This research examined the relation between individual differences in inhibitory control (IC; a central component of executive functioning) and theory-of-mind (ToM) performance in preschool-age children. Across two
sessions, 3- and 4-year-old children (N 107) were given multitask batteries measuring IC and ToM. Inhibitory control was strongly related to ToM, r .66, p .001. This relation remained significant controlling for
age, gender, verbal ability, motor sequencing, family size, and performance on pretend-action and mental state
control tasks. Inhibitory tasks requiring a novel response in the face of a conflicting prepotent response (Conflict scale) and those requiring the delay of a prepotent response (Delay scale) were significantly related to
ToM. The Conflict scale, however, significantly predicted ToM performance over and above the Delay scale and
control measures, whereas the Delay scale was not significant in a corresponding analysis. These findings suggest that IC may be a crucial enabling factor for ToM development, possibly affecting both the emergence and
expression of mental state knowledge. The implications of the findings for a variety of executive accounts of
ToM are discussed.
INTRODUCTION
Children’s knowledge about the mind is a foundational domain of cognitive development (Wellman &
Gelman, 1998). Critical developments in children’s
“theory of mind” (ToM) take place in the preschool
period, when a wide range of folk psychological abilities become apparent (see Flavell & Miller, 1998). For
example, 3-year-olds typically perform very poorly on
measures assessing their understanding that beliefs
can be false (Wimmer & Perner, 1983), that appearances may not reflect reality (Flavell, Flavell, & Green,
1983), and that different individuals may perceive the
same scene in different ways (Flavell, Everett, Croft,
& Flavell, 1981). These younger preschoolers frequently
state that beliefs and appearances always match reality,
and that there can only be a single (realist) perspective on any state of affairs. Yet, by the time they are 5
years old, children have an appreciation of these matters that in core respects is recognizably adult-like.
The most common interpretation of these and related
findings is that younger children lack concepts of
belief, mental representation, or representation more
broadly—for example, photographs (Flavell, 1988;
Gopnik, 1993; Gopnik & Wellman, 1994; Perner, 1991).
Consistent with this account of conceptual change are
findings that performance is highly resistant to task
simplification (Flavell, Green, & Flavell, 1986; Moses
& Flavell, 1990) and that various subsets of ToM tasks
are significantly intercorrelated (controlling for age
and, in some studies, verbal ability), suggesting the
presence of a common conceptual core (Flavell,
Green, & Flavell, 1989; Frye, Zelazo, & Palfai, 1995;
Gopnik & Astington, 1988; Moore, Pure, & Furrow,
1990; Taylor & Carlson, 1997; Wimmer & Hartl, 1991).
Although conceptual changes undoubtedly make
significant contributions to ToM development, other
factors may also play critical roles. One potentially
important factor, explored in the present investigation, is executive functioning. The construct of executive functioning subsumes those processes that serve
to monitor and control thought and action, including
self-regulation, planning, behavior organization, cognitive flexibility, error detection and correction, response inhibition, and resistance to interference (Eslinger, 1996; Zelazo, Carter, Reznick, & Frye, 1997).
Executive functioning might affect both the emergence and expression of ToM (Moses, 2001). With respect to emergence, it seems clear that children would
require a certain level of executive ability before they
could even begin to construct complex concepts of
mental life (see Russell, 1996). That is, without some
capacity to distance themselves from current stimuli,
they would surely never be able to reflect on representations of those stimuli. In this view, children
might fail ToM tasks because they lack critical concepts, but executive functioning would be implicated
in the acquisition of these concepts. With respect
to expression, successful performance on many ToM
tasks requires that children override prepotent (i.e.,
dominant, habitual) tendencies to reference reality
(Carlson, Moses, & Hix, 1998). Accordingly, in this view,
children’s failures on standard measures of false be© 2001 by the Society for Research in Child Development, Inc.
All rights reserved. 0009-3920/2001/7204-0007
Carlson and Moses
lief, appearance-reality, deception, and other aspects
of ToM may stem not from pure conceptual limitations, but rather from problems translating conceptual knowledge into successful action.
In both ways, then, executive functioning may represent a crucial enabling factor for ToM advances. If
so, individual differences in executive functioning
should account for substantial variation in preschoolers’ ToM performance. The present research examined this important, but understudied, possibility. In
particular, it identified inhibitory control (IC) as a
prime candidate for an executive ability that might relate to ToM.
Development of IC
Inhibitory control is the ability to inhibit responses
to irrelevant stimuli while pursuing a cognitively represented goal (Rothbart & Posner, 1985). Inhibitory
functioning is thought to contribute to individual differences, developmental changes, or both in a wide
array of cognitive abilities, including intelligence,
attention, memory, and reading comprehension, as
well as performance on a variety of Piagetian tasks
(Dempster, 1992; Diamond, 1990; Harnishfeger &
Bjorklund, 1993). It has also been implicated in the development of emotion regulation, conscience, and social competence (Kochanska, Murray, Jacques, Koenig,
& Vandegeest, 1996; Kopp, 1982).
Important developments in IC take place in the
first 6 years of life, with marked improvement between age 3 and 6 (Diamond & Taylor, 1996; Frye et
al., 1995; Gerstadt, Hong, & Diamond, 1994; Jerger,
Martin, & Pirozzolo, 1988; Kochanska et al., 1996;
Livesey & Morgan, 1991; Reed, Pien, & Rothbart,
1984). Two general kinds of tasks have been used to
assess IC in the preschool period. The first includes
measures of children’s ability to delay, temper, or altogether suppress an impulsive response when a task
calls for it. For example, Kochanska et al. (1996) gave
children a Gift Delay task, in which an experimenter
instructed children not to peek while the experimenter
noisily wrapped a present for them. Children’s waiting ability on this and other “delay” tasks improves
across the preschool-age period. The second category
includes measures that require children to respond a
certain way in the face of a highly salient, conflicting
response option. On these “conflict” tasks, the object
is not only to withhold an impulsive response but also
to provide a novel response that is incompatible with
the prepotent one. For example, following Luria’s
(1966) pioneering work, Gerstadt et al. (1994) presented 3- to 7-year-old children with cards depicting
either the sun or moon and instructed them to say
1033
“night” in response to sun cards and “day” in response
to moon cards. As with delay tasks, children’s performance on this and similar conflict tasks improves
gradually over the preschool-age period.
The behavioral evidence for development of IC
dovetails nicely with research on brain maturation in
childhood. The frontal lobes are believed to be heavily involved in inhibitory processes, and in executive functioning more generally (Luria, 1973; Passler,
Isaac, & Hynd, 1985). Indeed, Luria and others have
argued that the capacity for inhibition is the hallmark
of frontal lobe function. Much of the evidence for this
claim comes from brain-damaged adults, but inhibitory deficits have also been found in children born
with phenylketonuria (PKU), a metabolic disorder
that depletes levels of the neurotransmitter dopamine
in the frontal brain region (Diamond, Prevor, Callender,
& Druin, 1997; Welsh, Pennington, Ozonoff, Rouse, &
McCabe, 1990). In addition, there is some research indicating that frontal lobe lesions in children result in
perseveration and lack of inhibition (Dennis, 1991).
Although the frontal lobes develop most rapidly during infancy, they undergo a further growth spurt between about 4 and 7 years of age (Luria, 1973; Thatcher,
1992; see also Hudspeth & Pribram, 1990). The protracted development of IC is thus very likely to be related to the fact that the prefrontal cortex is a slowly
maturing brain area (Stuss, 1992).
Common Links between IC and ToM
There are several reasons to suspect that developments in IC and ToM might be related. First, as just
described, important developmental changes occur
in children’s IC in the preschool years, the very time
at which remarkable advances are made in their ToM.
Second, as with IC, evidence from brain-imaging
studies implicates the frontal lobes as the seat of ToM
abilities (Baron-Cohen et al., 1994; Fletcher et al., 1995;
Goel, Grafman, Sadato, & Hallett, 1995; Sabbagh &
Taylor, 2000). Although none of these studies included children, left frontal activity has been shown
to relate to preschool-age children’s social competence, a potential by-product of an intact ToM (Fox,
Schmidt, Calkins, & Rubin, 1996). Third, in addition
to ToM deficits, autistic individuals (even those with
relatively normal IQ) show impairments on classical
executive functioning tasks, such as the Wisconsin
Card Sort and Tower of Hanoi (Hughes & Russell,
1993; Ozonoff, Pennington, & Rogers, 1991). Finally,
and most directly, successful performance on ToM
tasks would seem to require well-developed IC skills.
For example, on false-belief, appearance-reality, and
deception tasks children clearly must inhibit their
1034
Child Development
own prepotent knowledge of current reality to respond in terms of less salient representations of reality. Younger preschoolers, who are equipped with notoriously poor inhibitory skills, may not be able to
resist the temptation to reference reality on these
and related tasks. In line with an executive “expression” account, then, it is conceivable that children do
not altogether fail to take account of false beliefs,
misleading appearances, and the like but, reacting
impulsively, fail to disengage from a default response (Carlson et al., 1998; Leslie & Polizzi, 1998;
Russell, Jarrold, & Potel, 1994). As children’s IC develops through the preschool-age period, they become better at resisting interference from prepotent
sources and hence their ToM performance improves.
Consistent with this latter hypothesis, a common
denominator in studies suggesting early ToM competence is that, in one way or another, the procedures
appear to reduce the inhibitory burden on children
(Freeman, Lewis, & Doherty, 1991; Mitchell & Lacohee,
1991; Moses, 1993; Robinson & Mitchell, 1995; Wellman
& Bartsch, 1988; Zaitchik, 1991). For example, 3-yearolds perform poorly when required to deceive by
pointing to an empty location; as in other ToM tasks
they consistently point to the actual location of an object
(Russell, Mauthner, Sharpe, & Tidswell, 1991; Sodian,
1991). We have argued, however, that because pointing to things accurately is a very well-reinforced response, young children may have great difficulty inhibiting such behavior (Carlson et al., 1998). Direct
comparisons were conducted of children’s deception
by means of (1) pointing, and (2) similar but novel
methods that do not impose as great a burden on their
IC skills. Specifically, children were given the chance
to deceive an experimenter about the location of a
hidden toy by means of a misleading pictorial cue or
a large arrow. Three-year-olds reliably failed to deceive by pointing and, similar to Russell et al. (1991),
showed little sign of learning across trials despite corrective feedback. In contrast, when the novel strategies were available, children quickly began to deceive
and did so at moderately high levels across three
studies (Carlson et al., 1998; see also Couillard &
Woodward, 1999; Hala, Russell, & Maberley, 1999).
In sum, IC and ToM abilities are closely related on
several grounds. They appear to share a common developmental timetable (substantial growth in the preschool
period) and a common brain region (prefrontal cortex),
and their joint absence appears to yield a common
psychopathology (autism). Moreover, success on many
ToM tasks clearly requires a certain level of IC and manipulating inhibitory demands has predictable effects.
The aim of the present research was to provide a
direct test of the inhibitory hypothesis by examining
whether the children who show inhibitory deficits
also have ToM difficulties. A series of studies by Frye
et al. (1995) is relevant to this individual-differences
approach. In an adaptation of the Wisconsin Card
Sort Task, children were asked to sort picture cards
according to one dimension (e.g., shape) and then to
sort the same cards by another dimension (e.g., color).
There was a strong age effect for this task, with
3-year-olds more likely than 4- and 5-year-olds to perseverate on the old sorting procedure after the rule
had been changed. Frye et al. also gave children four
ToM tasks, including measures of false belief (unexpected contents, representational change), appearancereality, and the distinction between pretense and reality. Performance on the card sort task was correlated
with ToM.
Although these results suggest an executive account of ToM, they are limited in several ways. First,
the correlations did not hold up over age in every
study (Frye et al., 1995) and for every ToM task. Second, children’s intelligence was not assessed. Thus, it
might not have been executive functioning but rather
a more general cognitive ability that accounted for the
obtained relation. Third, a mental state control task
(Gopnik & Astington, 1988) was not included. In this
task, after watching an experimenter replace the existing contents of a box with new contents, children
are asked to report what was previously inside the
box, before it was opened. The task thus has a very
similar structure to the representational change falsebelief task but does not require an appreciation of
mental states. Although typically included as a linguistic control (Gopnik & Astington, 1988; Gopnik &
Slaughter, 1991), the task might also be thought of as
an inhibitory measure; after all, children who mistakenly responded in terms of the current rather than the
prior contents could well have had a general problem
disengaging from current reality. The fact that even
children who perform well on such measures continue to have difficulties with ToM tasks might then
indicate that executive functioning could not be a
substantial contributor to ToM development. It is
therefore critical to the “emergence” version of the executive account that any correlations between tasks
like the card sort and ToM remain when performance
on mental-state control tasks is held constant. Finally,
although Frye et al. (1995) included a variety of ToM
measures, they assessed children on only one executive measure: the card sort. On the one hand, then, executive abilities might have been measured with low
reliability, possibly explaining the somewhat patchy
nature of the correlations; on the other, there is always
the danger that the obtained correlations somehow
resulted from the idiosyncrasies of the card sort task
Carlson and Moses
rather than from individual differences in more general
executive abilities. Moreover, the inclusion of only a
single executive measure made it difficult to distinguish among different executive accounts of ToM development, a point returned to later in the Discussion.
Goals of the Present Study
The present study sought to provide a thorough
examination of the relation between preschool children’s IC skills and their ToM. To achieve this aim, a
large sample of 3- and 4-year-olds were tested in two
sessions during which they completed batteries of
well-established measures of IC and ToM, as well as a
measure of verbal ability and mental state control tasks.
The ToM battery included measures of false belief, deceptive pointing, and appearance-reality. The IC battery included the two a priori groups of measures discussed earlier: conflict tasks and delay tasks. Parental
report data on children’s IC were also collected using
the Children’s Behavior Questionnaire (CBQ; Rothbart,
Ahadi, & Hershey, 1994) to provide a broad assessment of their inhibitory abilities. It was predicted that
individual differences in IC would be significantly related to performance on ToM tasks independent of
age, gender, verbal ability, and performance on the
mental state control tasks.
In addition, a motor sequencing task was included.
This task assesses general executive planning ability
under speeded conditions but does not involve specific inhibitory demands in the way of a response conflict or delay. Thus, although it shares some features
with the other executive tasks, it was not predicted to
be as closely related to ToM. Finally, data were also
collected on children’s ability to perform pretend actions and on the number of siblings they had. Both
factors have been found to relate to ToM (Jenkins &
Astington, 1996; Perner, Ruffman, & Leekam, 1994;
Taylor & Carlson, 1997; Youngblade & Dunn, 1995).
Moreover, it has been hypothesized that pretense and
the ability to understand symbolic relations are related to IC (DeLoache, 1999; Elias & Berk, 1999), and it
is certainly conceivable that family size might affect
the development of IC. Thus, there was interest in
whether any relation between IC and ToM would survive when these other factors were controlled.
METHOD
Participants
Participants were 107 preschool-age children (M 3
years, 11 months; range 3,3–4,11), sixty-two 3-yearolds (M 3,8; 34 boys and 28 girls) and forty-five
1035
4-year-olds (M 4;5; 17 boys and 28 girls) were included. An additional 2 children refused to participate
and a further 4 children did not return for Session 2.
Children were recruited by telephoning parents included in a database derived from birth announcements. The sample was predominantly White, reflecting the demographics of the community (Eugene, OR)
from which it was drawn.
Procedure
Children were tested individually in two videotaped sessions that lasted approximately 45 min, and
were, on average, 8 days apart. The measures consisted
of a verbal ability test, a ToM battery, two mental state
control tasks, an IC battery, a motor sequencing task, a
pretend-actions task, and a parental report measure of
IC. Each measure is described in detail in the following
section. The order of tasks in Session 1 was Peabody
Picture Vocabulary Test–Revised (PPVT-R), Day/Night;
Standard Location False Belief, Pinball; Mental State
Control (Contents), Card Sort, Appearance-Reality,
Gift Delay, Pretend Actions, and Grass/Snow. The order of tasks in Session 2 was Contents False Belief (Self
and Other); Kansas Reflection-Impulsivity Scale for
Preschoolers (KRISP), Mental State Control (Location),
Bear/Dragon, Deceptive Pointing, Tower Building,
Motor Sequencing, Whisper, Explicit Location False
Belief, and Spatial Conflict.1 Approximately half of the
ToM tasks were administered in each session and were
never given back-to-back. The same primary experimenter (E) and one of three assistants tested all chil1 A fixed order is standard practice in individual-differences research. If one is interested in locating individuals with respect to
one another in a multidimensional space, it is critical that the individuals be exposed to identical stimulus contexts. That context includes not only the stimuli themselves but also the order in which
they are presented. Of course, the correlations that are then obtained
may be affected by the particular order chosen but, in contrast to
the case in which means are compared in a repeated-measures design, the problem is not solved by counterbalancing order (or by
randomizing order). When drawing inferences about means, it is
advisable to counterbalance order because the resulting overallcondition means have an unambiguous interpretation: they represent the averages of the means across the counterbalanced orders
and, in that sense, the effects of order are “controlled.” In the case of
correlations, however, the correlations within each order do not
fully constrain the overall correlation that emerges when the data
are collapsed across orders. For example, suppose the correlation
between two variables is positive within each of the counterbalanced orders. Under those conditions there is no guarantee that the
overall correlation will be positive, let alone that it will be the average of the within-order correlations. The nature of the correlation
will depend on the shapes of the within-order scatterplots and their
locations with respect to one another (i.e., on the variable means
within each order). For that reason, interpreting correlations from
designs in which order has been counterbalanced is hazardous.
1036
Child Development
dren. Parents were given the CBQ at Session 1 and all
returned it at Session 2.
Measures
Verbal Ability
Children were given the PPVT-R (Dunn & Dunn,
1981), in which they were asked, for each of a series of
items, to select from a set of four pictures the one best
illustrating the meaning of an orally presented word.
The PPVT-R correlates significantly with other intelligence measures, such as the Wechsler Preschool
and Primary Scale of Intelligence–Revised (Carvajal,
Parks, Logan, & Page, 1992) and the verbal subscale
of the Stanford-Binet IV (Hodapp, 1993), as well as
with ToM (Taylor & Carlson, 1997).
Theory-of-Mind Battery
Location false belief. Using a version of Wimmer and
Perner’s (1983) standard unexpected location false belief task, two puppets (Bert and Ernie) played with a
ball briefly and then Bert put the ball in a blue container
and left. Ernie retrieved the ball, played briefly with it,
and then put it away in a red container and left. Finally,
Bert returned, wanting to play with the ball, at which
point children were asked the false-belief question
(“Where does Bert think the ball is?”) followed by the
reality question (“Where is the ball really?”).
In addition, using a version of Wellman and Bartsch’s
(1988) “explicit” location false-belief task, children were
shown a picture of a boy named John who wanted to
find his kitten. E stated that John’s kitten was really in
the closet [pointing to a picture of a closet door], but
that John thought his kitten was under the bed [pointing
to a picture of a bed]. Children were then asked,
“Where will John look for his kitten?” and “Where is
the kitten really?” For each location task, children were
credited with passing if they answered both belief
and reality questions correctly. Nine children erred
on the reality question in the standard version (four
3-year-olds and five 4-year-olds) and 27 did so in the
explicit version (nineteen 3-year-olds and eight 4-yearolds). The greater number of errors in the explicit
version is perhaps not surprising given that in this
version children were merely told about the real location
of the desired object rather than observing it directly.
Contents false belief. Following the procedure of
Gopnik and Astington (1988) and Perner, Leekam, and
Wimmer (1987), children were shown a Band-Aid box
and asked what they thought was inside. After discovering that the box actually contained crayons, the
lid was closed and children were asked about their
own former false belief: “When you first saw this box,
before you opened it, what did you think was inside,
Band-Aids or crayons?” They were then told that Ernie
had never looked inside the box before and were asked,
“What does he think is inside, Band-Aids or crayons?”
Children were scored for their knowledge of their own
former belief as well as Ernie’s current false belief. Responses from four 3-year-olds were omitted because
they refused to speculate about either their own or
Ernie’s belief. Due to experimenter oversight, a “reality”
control question was not included with this task.
Deceptive pointing. Children participated in a game
in which the object was to trick an experimenter about
a hidden object’s location. Following a procedure by
Carlson et al. (1998), children were shown two
opaque containers (one blue, one gray) and a toy car.
E1 placed the toy car in the blue container and demonstrated the car’s location by pointing to the box.
The children were then asked to place the car in the
other container and to point to that box. After this
warm-up, the children were told that E2 was going to
leave the room. After E2 exited, the children put the
car in one of the two boxes. In a conspiratorial tone of
voice, E1 encouraged the children to play a trick by
pointing so E2 would not find the car. E2 returned and
asked, “Where’s the car?” E2 looked in the box indicated and E1 provided feedback as to whether E2 had
been tricked. In a second trial using an identical procedure, E1 rather than E2 left the room. Data for one
child on the second trial were not included for subsequent analysis due to experimenter error.
Appearance-Reality. Following the procedure of Flavell et al. (1983, 1986), each child was shown two objects that had misleading appearances; one involved a
discrepancy between real and apparent identity (a piece
of sponge painted to look like a rock), the other, a discrepancy between real and apparent color (a picture of
a red castle that looked black when held behind a green
filter). For each stimulus, children were shown how the
object appeared, as well as the true identity or true color
of the object. E repeated that it appeared one way but
was really another, and then children were asked the
appearance question, “When you look at this right now,
does it look [like a sponge/red] or does it look [like a
rock/black]?” Children were also asked the reality question, “What [color] is this really and truly, [a sponge/
red] or [a rock/black]?” Children received credit on
each task if they answered both questions correctly.
Mental State Control Tasks
The Mental State Control tasks were designed to
parallel the Contents and Location False-Belief tasks,
but were devoid of reference to mental states.
Carlson and Moses
Contents. Following a procedure by Gopnik and
Astington (1988), children first opened a box containing a toy pig. E suggested they remove the pig and put
a toy horse inside instead. After doing so, the box was
closed and E asked, “Now what is inside?” (One child
refused to answer and so was given the answer by E.)
Children were then asked, “When I first showed you
the box, before we opened it, what was inside it then?
Was there a horse inside or was there a pig inside?”
This question has a very similar syntactic form to the
test question in the Contents False-Belief task for self.
Location. Children were asked to place a block inside a blue box, then to remove the block, and place it
inside a green box. E asked, “Now where is the block?”
(Three 3-year-olds and two 4-year-olds were corrected.) E then asked, “When we first put the block
in a box, before we moved it, where was it then? Was
it in this box [green] or this box [blue]?” On each
mental state control task children received credit if
they answered both the “now” and “before” questions correctly.
Inhibitory Control Battery
On each IC measure, children were required to respond counter to a prepotent tendency. Table 1 lists
the 10 IC tasks, as well as a description of the prepotent
and correct responses for each. Two trained research
assistants who were unaware of the major hypotheses
of the study coded the sessions from videotape. Intercoder reliability (computed as percent agreement) was
obtained for 30 cases on all executive functioning
measures except the computerized ones (Pinball and
Spatial Conflict). Discrepancies were resolved by a
third coder.
Day/Night. In this Stroop-like task (Gerstadt et al.,
1994), E first verified that children associated the sun
with daytime and the moon with nighttime. They were
then instructed to say “day” when shown a black card
depicting the moon and stars, and to say “night” when
shown a white card depicting a yellow sun. Two practice trials ensued in which children were shown one
of each type of card. If a child answered incorrectly, E
repeated both rules and repeated the practice trial as
necessary. Children then received 16 test trials without feedback in a fixed random order. Four children
(three 3-year-olds, one 4-year-old) did not complete
this task. Coder agreement was 100%.
Grass/Snow. This was another Stroop-like task in
which children responded by pointing instead of
speaking (for a similar task, see Passler et al., 1985).
After verifying that children could name the colors
of grass and snow, E introduced a large board with
a solid-white card attached to the upper-left corner, a
solid-green card attached to the upper-right corner,
and two fabric cutouts shaped like hands centered below the cards. Children were instructed to point to the
white card when E said “grass” and to the green card
when E said “snow” (The procedure was first demonstrated by E and then repeated by the child). Two (or
more) practice trials and 16 test trials followed in the
same fixed random order as the Day/Night task. Three
children (two 3-year-olds, one 4-year-old) were not
able to complete the task. Coder agreement was 100%.
Table 1 Prepotent Responses and Correct Responses on the Inhibitory Control Tasks
Inhibitory Control Task
Day/Night
Grass/Snow
Spatial Conflict
Card Sort
Bear/Dragon
Pinball
Gift Delay
Tower Building
KRISP
Whisper
1037
Prepotent Response
Correct Response
Say “day” for the sun and say
“night” for the moon
Point to green for “grass,” point
to white for “snow”
Press the button on the same side
as the picture
Sort by a previously successful
dimension
Follow the commands of both
animals
Release the plunger immediately
Peek while E wraps gift
Place all the blocks oneself
Point to a similar picture right
away
Call out the names of familiar
characters
Say the opposite of what the
picture shows
Point to the color that is opposite
to its associate
Press the button that matches the
picture, irrespective of location
Sort by a new dimension
Do what the Bear says, but not
what the Dragon says
Wait for a “Go!” signal
Wait without peeking
Give E turns placing blocks
Wait to examine all pictures
before choosing exact match
Whisper the names
Note: E experimenter; KRISP Kansas Reflection-Impulsivity Scale for Preschoolers.
1038
Child Development
Spatial Conflict. In this task, children attended to a
stimulus in one spatial location but were required to
respond at a different location. Following Gerardi
(1997; see also Simon, 1990), children participated in a
warm-up exercise in which they had to tap one of two
toys (a doll or an elephant) on the head, depending on
which matched a toy presented by E. Children then
sat at a table facing a computer monitor. Two picture
cards were inserted into clear, plastic envelopes covering large buttons were located on either side of the
child. Children were told to press the relevant button
whenever its accompanying picture appeared on the
screen. Between trials they were instructed to place
their hands on top of two felt cutouts in the shape of
hands. Eight practice trials followed in which the picture appeared in the center of the screen. If children
pressed the button with the matching picture, a high
beep sounded and the picture became animated for
3.5 s. If they pressed the nonmatching button, a low
beep sounded and the stimulus disappeared. A looming stimulus (concentric squares) appeared between
trials to direct attention to the screen. The test trials
consisted of four blocks (different picture pairs) of
eight trials each. On four trials the picture appeared
on the same side of the screen as the button with
the matching picture card (spatially compatible); on
the other four trials it appeared on the opposite side
(spatially incompatible). The order of trials within
blocks was randomized. Instructions were repeated
before each block of trials. An Apple Mac IIci computer recorded measures of accuracy and reaction
time in milliseconds. Data were not available for six
3-year-olds due to computer failure.
Bear/Dragon. The Bear/Dragon task (Kochanska et
al., 1996; Reed et al., 1984) is a simplified version of
“Simon Says” in which children are required to selectively suppress commanded actions. E first asked
children to imitate the following 10 actions: (1) stick
out your tongue, (2) touch your ears, (3) touch your
teeth, (4) touch your eyes, (5) clap your hands, (6) touch
your feet, (7) touch your head, (8) touch your tummy,
(9) touch your nose, and (10) wave your hand. E then
introduced two puppets; the first was described as a
“nice Bear”— “So when he talks to us, we will do
what he tells us to do.” The second was described as a
“naughty Dragon” — “So when he talks to us, we
won’t listen to him. If he tells us to do something,
we won’t do it.” Practice trials followed in which E
moved the Bear’s mouth and said [in a high-pitched
voice], “Touch your nose,” and then moved the
Dragon’s mouth and said [in a low, gruff voice],
“Touch your tummy.” Children passed the practice if
they followed the Bear’s command but ignored the
Dragon’s. All children but one succeeded on the Bear
practice the first time. Children who failed five practice trials in a row using the Dragon puppet were told
that a second E would help them on a final practice
trial by holding their hands down on the table. Ten
test trials followed (five Bear trials and five Dragon
trials, alternating order) in which children were given
no assistance; they were reminded of the rules after
five trials regardless of performance. Children received scores ranging from 0 to 3 on each Dragon trial
(0 a full commanded movement, 1 a partial commanded movement, 2 a wrong movement, 3 no
movement). Coder agreement was 98%.
Card Sort. The Card Sort was modeled after Frye et
al.’s (1995, Experiment 2) procedure in which children were instructed to sort cards according to one
criterion (shape) and then by a different criterion
(color), thus requiring them to inhibit the old sorting
rule. Children were shown two black containers with
slots cut out of the lids. A drawing of a red rabbit was
attached to one container and a drawing of a blue
boat to the other. E then produced a stack of cards depicting red rabbits, blue rabbits, red boats, and blue
boats, explaining that in the shape game all the rabbits should be placed in the rabbit container and all
the boats in the boat container. After demonstrating
with a blue rabbit and a red boat card, E asked children to sort five cards in the following order: blue rabbit, red boat, blue boat, red rabbit, and blue rabbit.
The rule was repeated before each trial and children
were praised when they sorted correctly. No child
made errors in this preswitch phase. E then announced they were going to switch to the color game
and told children that they should place all the blue
cards in the blue container and all the red cards in the
red container. Five postswitch trials followed in which
E repeated the rule before handing children a card,
but did not give feedback. Two of the postswitch trials
were compatible with the rules of the prior shape
game (Trial 2, red rabbit; Trial 3, blue boat) and three
were incompatible (Trial 1, red boat; Trial 4, red boat;
and Trial 5, blue rabbit). The critical score was the
number correct on the incompatible trials. Coder
agreement was 100%.
Pinball. This task measured children’s ability to
suppress an act until they received a signal (Kochanska
et al., 1996; Reed et al., 1984). E presented children
with a tabletop pinball machine and demonstrated
how to use the plunger to spring the ball forward. The
objective was to make the ball land in one of six holes
so that a colorful character would pop up. After practicing with the plunger, E asked children to hold it
back all the way until E said “Go!” The plunger was
outfitted with a digital timer that began counting when
it was pulled all the way back and stopped when it
Carlson and Moses
was released half-way or more. Six trials followed,
with delays for E to say “Go!” of 10, 15, 25, 15, 20, and
10 s, respectively. Children were reminded to wait
Until E said “Go!” before each trial. The mean waiting
time across trials constituted the pinball score. One
3-year-old was unable to pull out the plunger and
was therefore excluded from participating.
Gift Delay. This measure called for delay of gratification (Kochanska et al., 1996). E told children there
was a present for them but wanted it to be a “big surprise.” E asked children to sit in a chair facing away
and to try not to look while E wrapped the gift. She
then noisily wrapped the gift over a period of 60 s.
Finally, E invited children to open their present (a
small, toy animal). Coding included (1) a peeking score
(0 turning fully around to peek, 1 peeking over
the shoulder, 2 no attempt to peek), (2) the total
number of times children peeked, and (3) latency to
peek over their shoulder or fully turn around (60 s for
full compliance). Coding reliability was as follows:
peeking score, 100%; number of peeks, 80% (never off
by more than 1). Reliability for reaction times (RTs)
was within .5 s for 83% and within 1 s for 87% of doublecoded cases.
Tower Building. Following a procedure by Kochanska
et al. (1996), children were asked to take turns with E
in building a tower using 20 wooden blocks (two trials).
After a brief demonstration of turn-taking, E deliberately waited before placing each block until children
spontaneously signaled they were giving E a turn.
The coding of each trial reflected the proportion of
blocks placed by E out of the total number of blocks
(ideally .50). Higher scores denoted greater IC. Each
child received an overall Tower score that reflected a
weighted average of number of blocks placed by E
across two trials (completed towers were weighted
more heavily than short towers that fell down). Coder
agreement was 100%.
KRISP. The KRISP (Wright, 1971) is a measure
adapted for use with younger children from the
Matching Familiar Figures task (Kagan, Rossman, Day,
Albert, & Phillips, 1964). The task was designed to
measure cognitive reflectivity, that is, children’s ability
to slow down and reflect before making a response.
Children were required to match a target picture to
one from a set (ranging from four to six pictures in
size) in which all but one differed from the target
in minor details, for example, line drawings of cats
with slightly different whiskers, ears, and tails. All
children were given five practice trials in which E provided assistance and feedback. They then received 10
test trials with up to three errors permitted on a trial
before continuing. Trials increased in difficulty as the
game progressed (e.g., larger picture arrays, more subtle
1039
details in the drawings). Coding included an accuracy
score (total number of errors, reverse scored; agreement was 100%) and average latency to respond by
pointing to a picture (the first choice on each trial). Reaction times were within .5 s on 96% of the double-coded
cases. Three 3-year-olds did not complete the task.
Whisper. This task required voluntary lowering of
the voice (Kochanska et al., 1996). To warm up, E first
asked children to whisper their names; E then asked
children to whisper the names of 10 cartoon characters presented on 10 different laminated cards. Six of
the characters were familiar (Big Bird, Pocahontas,
Donald Duck, Snow White, the Beast from the movie
Beauty and the Beast, and Mickey Mouse) and four were
unfamiliar to most children of this age (Huckle, Elmer
Fudd, Petunia, and Fat Albert). The unfamiliar characters were included so that when a familiar character
would appear, children might be more tempted to shout
out its name. E spoke in a whisper throughout and reminded children to whisper after the first five trials. Scoring was as follows: 0 a shout, 1 a normal or mixed
voice, and 2 a whisper. Coder agreement was 96%.
Motor Sequencing Measure
The Motor Sequencing task is an executive measure that requires motor speed and planning ability
but not IC. It was adapted from Welsh, Pennington,
and Groisser (1991) to make it easier for 3-year-olds to
perform. Children were shown a musical keyboard
with four differently colored keys. E demonstrated
touching each key in a row with her index finger and
then began the sequence again. Children practiced
until they performed two consecutive scales successfully. They were then instructed to repeat the sequence
over and over as fast as they could until E said “Stop!”
Children were scored on the number of sequences
they completed in 10 s without touching the same key
twice or skipping a key. Coder agreement was 87%
(disagreements were never off by more than a count
of one sequence). Data were missing for two 3-year-olds
due to experimenter error.
Pretend Actions
Children completed a Pretend-Actions task that assessed their developmental level of pretend play
(Overton & Jackson, 1973). Following a warm-up in
which E demonstrated pretending (sleeping), E then
asked children to perform four pretend actions: (1)
brush your teeth with a toothbrush, (2) comb your
hair with a comb, (3) drink with a cup, and (4) put on
sunglasses. For each action, children’s responses were
coded as involving either a body part (e.g., when
1040
Child Development
Table 2 Percentage of Children Who Passed Theory-of-Mind Items as a Function of Age
Theory-of-Mind Item
3-Year-Olds
(n 62)
4-Year-Olds
(n 45)
Total
(N 107)
Age Differences
(Pearson 2)
False belief (location)
Standard
Explicit
10
34
49
69
26
49
20.8***
12.8***
False belief (contents)
Self
Other
40
33
40
62
40
45
0, ns
9.1**
Deceptive pointing
Trial 1
Trial 2
61
54
80
82
69
66
4.3*
9.1**
Appearance-Reality
Identity
Color
47
55
69
87
56
68
5.2*
12.2***
Note: For the 3-year-olds: n 58 on False belief (contents) Self, n 61 on False belief (Contents) Other,
and n 61 on Deceptive pointing Trial 2. ns not significant.
* p .05; ** p .01; *** p .001.
asked to pretend to brush their teeth, they used their
finger as a toothbrush), or a symbolic object (e.g., they
pretended to hold an imaginary toothbrush). Children received a score indexing the number of times,
out of four, that they used an imaginary object. Coder
agreement was 94%.
“is usually able to resist temptation when told he/she
is not supposed to do something.” Scores on the IC
subscale have been found to relate positively to several of the inhibition measures used in the present
study (Gerardi, 1997; Kochanska, Murray, & Coy, 1997;
Kochanska et al., 1996). Parents also provided information on the number of siblings at home.
Parent Ratings of IC
The CBQ (Rothbart et al., 1994), a parent report
measure of children’s temperament, was included to
obtain a global assessment of children’s self-control
behavior exhibited in familiar settings over a protracted time period (the 6 months prior to the study).
The questionnaire included 195 statements representing
16 different temperament subscales. Parents (mostly
mothers) rated each statement for how true it was for
their child on a scale from 1 (extremely untrue) to 7
(extremely true). The IC subscale was of particular interest in the present study. It included items such as
RESULTS
Theory-of-Mind Assessment
Children were given four kinds of ToM tasks with
two items per task. As is clear from Table 2, 4-yearolds generally outperformed 3-year-olds. The two
item scores within each task type were significantly
correlated, rs(dfs ranged from 103 to 107) .35, ps .01,
and thus were aggregated to form four overall ToM
measures. As Table 3 shows, all four measures were
significantly intercorrelated. Because the tasks appeared
Table 3 Raw and Partial Correlations among the Theory-of-Mind Measures
Task
False belief (location)
False belief (contents)
Deception
Appearance-Reality
False Belief
(Contents)
.47*** (.31**)
Deception
.40*** (.18*)
.36*** (.21*)
AppearanceReality
Theory-of-Mind
Battery
.58*** (.35***)
.46*** (.28**)
.46*** (.26**)
.61*** (.39***)
.53*** (.37***)
.50*** (.30**)
.64*** (.42***)
Note: Partial correlations controlling for age, gender, and verbal ability are shown in parentheses. N 107 except in correlations involving False belief (contents): N 106.
* p .05; ** p .01; *** p .001.
Carlson and Moses
to be tapping a common underlying construct, a ToM
battery score (number correct divided by total number
of items completed) also was calculated for each participant, Cronbach’s .76, average item-total r(106) .57. Whenever aggregation was required, means (sum
of scores divided by the total number of items completed) rather than sums were computed to accommodate the small amount of missing data. The item-total
correlations were uniformly high (see Table 3). Itemtotal correlations for all analyses were corrected (i.e., the
relevant item was first removed from the battery before
computing the item-total correlation).
Performance on the ToM battery was significantly related to age, gender (girls outperformed boys), and verbal ability, rs(107) .52, .24, and .62, ps .001, .02, and
.001, respectively. Performance on each of the four ToM
measures was significantly related to age and verbal
ability, and Appearance-Reality was related to gender,
1041
rs(dfs ranged from 106 to 107) .24, ps .05. It should
be noted, however, that the interrelations among the
various ToM tasks were independent of the effects for
age, gender, and verbal ability, as indicated by the
partial correlations shown in parentheses in Table 3.
Inhibitory Control Assessment
Behavioral Measures
Mean scores on each of the IC measures are shown
in Table 4. As they did on the ToM measures, 4-yearolds generally outperformed 3-year-olds. The dependent measures that were significantly correlated
within each IC task were first standardized and then
averaged to create a single score for each task. The
number of practice trials required (reverse scored)
and percentage correct on the test trials of Day/Night
Table 4 Mean Performance on the Inhibitory Control Measures as a Function of Age
Inhibitory Task
3-Year-Olds
4-Year-Olds
Total
Day/Night
No. practice trials
Percentage correct
3.4 (1.8)
59.9 (34.5)
2.9 (1.2)
65.7 (35.4)
3.2 (1.6)
62.4 (34.8)
2–10
0–100
ns
ns
Grass/Snow
No. practice trials
Percentage correct
2.5 (1.0)
60.4 (35.3)
2.4 (.8)
77 (26.9)
2.5 (.9)
67.4 (32.9)
1–7
0–100
ns
t(102) 2.6**
Spatial Conflict
Percentage correct
RT correct
96.7 (3.4)
2.2 (.4)
95.5 (7.7)
1.9 (.4)
96.2 (5.9)
2.0 (.4)
69–100
1.1–3.0
Bear/Dragon
No. dragon practice
Total dragon score
2.1 (1.6)
12.1 (5.4)
1.5 (1.3)
13.9 (3.3)
1.8 (1.5)
12.9 (4.7)
1–6
0–15
t(105) 2.1*
t(105) 2.0*
1.0 (1.3)
1.8 (1.4)
1.4 (1.4)
0–3
t(105) 3.1**
Pinball
Average wait time
12.9 (5.1)
14.4 (3.7)
13.5 (4.6)
0–16
ns
Gift Delay
Peeking score
No. times peeked
Latency to peek
1.3 (.8)
.9 (1.1)
41.2 (24)
1.6 (.7)
.5 (.8)
48.6 (19.5)
1.4 (.8)
.7 (1.0)
44.3 (22.4)
0–2
0–4
0–60
t(105) 1.7
t(105) 1.9
t(105) 1.7
.3 (.2)
.4 (.1)
.33 (.2)
0–.5
t(105) 2.7**
20.6 (5)
4.7 (3.8)
25 (4.1)
3.3 (1.1)
22.5 (5.1)
4.1 (3.0)
7–29
0–29
t(104) 4.8***
t(102) 2.4*
1.7 (.5)
1.8 (.6)
1.7 (.5)
0–2
Card Sort
No. correct (postswitch
incompatible)
Tower
Proportion turns to E
KRISP
Accuracy score
RT correct
Whisper
Mean score
Range
Age Difference
ns
t(99) 3.1**
ns
Note: Standard deviations are shown in parentheses. RT Median reaction time in s. E experimenter. KRISP Kansas ReflectionImpulsivity Scale for Preschoolers. N 107 except on the following variables: Day/Night practice (106); Day/Night percentage correct
(103); Grass/Snow practice (105); Grass/Snow percentage correct (104); Spatial Conflict (101); Pinball (106); KRISP accuracy (106); and
KRISP RT correct (104).
* p .05; ** p .01; *** p .001; p .10.
Note: Partial correlations controlling for age, gender, and verbal ability are shown in parentheses. Number of participants ranged from 97 to 107 due to some missing data. All tests are
two-tailed. KRISP Kansas Reflection-Impulsivity Scale for Preschoolers.
* p .05; ** p .01; *** p .001.
.38*** (.24**)
.63*** (.49***)
.39*** (.21*)
.48*** (.25**)
.55*** (.40***)
.47*** (.38***)
.38*** (.24**)
.40*** (.22*)
.50*** (.27**)
.41*** (.31**)
.29** (.20)
.41*** (.33***)
.04 (.06)
.33** (.24**)
.36*** (.28**)
.13 (.03)
.28** (.20*)
.17 (.08)
.19 (.06)
.10 (.07)
.44*** (.26**)
.32** (.14)
.14 (.17*)
.37*** (.18*)
.40*** (.30**)
.37*** (.27**)
.38*** (.23*)
.16 (.05)
.25* (.08)
.15 (.00)
.22* (.04)
.22* (.08)
.30** (.22*)
.27** (.18*)
.18 (.07)
.22* (.08)
.11 (.01)
.19* (.06)
.25** (.14)
.25** (.17*)
.30** (.22*)
.42*** (.34***)
.28** (.20*)
.19* (.06)
.26** (.15)
.25* (.14)
.48*** (.35***)
.40*** (.29**)
.35*** (.18*)
.35*** (.25**)
.49*** (.34***)
.38*** (.24**)
.18 (.06)
.32** (.17)
.44*** (.33***)
Day/Night
Grass/Snow
Spatial Conflict
Card Sort
Bear/Dragon
Pinball
Gift Delay
Tower
KRISP
Whisper
1.
2.
3.
4.
5.
6.
7.
8.
9.
10.
10
9
8
7
6
5
4
3
2
Inhibitory Task
Correlations among the Inhibitory Control Measures
and Grass/Snow were significantly correlated, r(103) .30 and r(105).19, ps .01 and .05, respectively. On
the Spatial Conflict task, accuracy and median RTs
on the incompatible trials were significantly related,
r(101) .25, p .02. On the tasks that included an RT
measure, it was predicted that RTs would be longer
for children who had difficulty with the task, but only
on trials for which they answered correctly. Thus RT
was only analyzed for correct responses. In addition,
median rather than mean RTs were calculated to reduce
the impact of outliers. The results were reexamined
using RTs for all responses, as well as an aggregate RT
measure collapsed across tasks; the major findings
were unchanged. Total scores on the Dragon trials of
the Bear/Dragon task were significantly related to the
total number of Dragon practice trials (reverse scored),
r(107) .78, p .001. All three Gift Delay scores were
highly correlated; rs(107) ranged from .83 to .87, ps .001. On the KRISP, accuracy was related to median
RTs, r(104) .26, p .01. The Card Sort, Pinball,
Tower, and Whisper tasks each had only one dependent measure, and so no aggregation was possible.
As shown in Table 5, the IC measures were moderately intercorrelated and appeared to tap a common
underlying construct, ( .76, average item-total
r(92) .47. Consequently, the 10 measures were standardized and averaged to create a composite IC battery.
IC battery scores were correlated with age, gender,
and verbal ability, rs(107) .44, .30, and .60, ps .001,
.01, and .001, respectively. Scores were significantly correlated with age on seven of the individual
measures: Grass/Snow, Spatial Conflict, Card Sort,
Bear/Dragon, Pinball, Tower, and KRISP. Girls significantly outperformed boys on half of the IC measures:
Grass/Snow, Spatial Conflict, Card Sort, Tower, and
KRISP. All 10 measures were significantly related to
verbal ability. As is clear from Table 5, however, the
relations among the measures generally persisted in a
partial correlation analysis controlling for age, gender, and verbal ability, again suggesting that the measures reflect a common underlying core.
A principal-components analysis of the 10 IC measures was conducted to determine whether factors
might emerge resembling the a priori division of tasks
into Conflict and Delay categories. A two-factor solution (accounting for 48% of the variance) followed by an
oblique rotation yielded the most interpretable pattern. The factor loadings are displayed in Table 6. The
two factors contained items that corresponded closely
to the theoretical grouping of tasks described earlier.
The six tasks loading most highly on the first factor
were aggregated to form a Conflict scale, ( .75, average item-total r(95) .50; the four tasks loading
most highly on the second factor were aggregated to
Inhibitory
Control Battery
Child Development
Table 5
1042
Carlson and Moses
Table 6 Loadings of Inhibitory Control Measures on Factors in
a Principal-Components Analysis
1043
the Delay scale, rs(107) .38, .35, and .29, respectively, ps .01.
Factor Loadings
Inhibitory Task
Card Sort
Day/Night
Grass/Snow
Whisper
Bear/Dragon
Spatial Conflict
KRISP
Tower
Pinball
Gift Delay
Factor 1,
Conflict
.79
.72
.67
.64
.52
.34
.08
.06
.11
.03
Factor 2,
Delay
.08
.11
.25
.03
.29
.29
.86
.68
.61
.58
Note: N 107. KRISP Kansas Reflection-Impulsivity Scale for
Preschoolers.
form a Delay scale, ( .56, average item-total r(103) .42. Scores on the two scales were significantly related, r(107) .46, p .001.
Parental Report
Finally, data from the IC subscale of the CBQ were
analyzed. As expected, parents’ ratings of children’s
IC were correlated with performance on several inhibitory tasks: Grass/Snow, r(104) .41; Spatial Conflict, r(101) .21; Card Sort, r(107) .25; Bear/Dragon,
r(107) .23; Pinball, r(106) .32; and the KRISP, r(104) .22, all ps .05. In addition, parent ratings were
related to the IC battery, the Conflict scale, and
Relation between IC and ToM
Raw Correlations
As shown in Table 7, all but three of the raw correlations between the IC and ToM measures were significant. In addition, every IC measure was significantly related to children’s ToM battery scores and,
conversely, every ToM measure was significantly related to the IC battery. The correlation between the two
batteries themselves was especially high (.66). Further,
both the Conflict and Delay scales were significantly related to the ToM battery as well as to each
ToM measure. Surprisingly, however, the parent report measure of IC was only weakly related to the
ToM battery, r(107) .13, p .19, and was not significantly related to any of the individual ToM measures,
rs ranged from .09 to .12. That said, one item from the
IC subscale of the CBQ was significantly related to the
ToM battery: being good at games like “Simon Says”
and “Mother, May I?” r(73) .34, p .01. (Although all
parents completed the CBQ, the n is lower on this item
because many did not witness their child playing these
games and so checked “not applicable.”) These selfcontrol games are very similar to some of the inhibitory tasks performed in this study (e.g., Bear/Dragon).
Controlling for Confounding Variables
Because ToM and IC were also significantly related
to age, gender, and verbal ability, the analysis was re-
Table 7 Raw and Partial Correlations between Inhibitory Control and Theory-of-Mind Measures
Inhibitory Control Measure
False Belief
(Location)
False Belief
(Contents)
Deception
AppearanceReality
Theory-of-Mind
Battery
Day/Night
Grass/Snow
Spatial Conflict
Bear/Dragon
Card Sort
Pinball
Gift Delay
Tower
KRISP
Whisper
Conflict scale
Delay scale
.33** (.22*)
.36*** (.13)
.26** (.06)
.38*** (.17*)
.34*** (.09)
.26** (.09)
.26** (.11)
.27** (.08)
.26** (.11)
.30** (.18*)
.49*** (.24**)
.36*** (.08)
.24* (.11)
.26** (.07)
.15 (.01)
.31** (.16)
.36*** (.20*)
.25* (.12)
.29** (.18*)
.28** (.14)
.31** (.11)
.30** (.19*)
.40*** (.20*)
.39*** (.22*)
.28** (.18*)
.35*** (.18*)
.37*** (.25**)
.55*** (.45***)
.27** (.07)
.25* (.13)
.17 (.03)
.11 (.06)
.29** (.06)
.26** (.16)
.55*** (.40***)
.27** (.06)
.24* (.08)
.38*** (.14)
.20* (.04)
.49*** (.31**)
.27** (.03)
.27** (.12)
.33** (.19*)
.34*** (.16)
.45*** (.18*)
.40*** (.31**)
.50*** (.24**)
.49*** (.26**)
.36** (.23*)
.45** (.21*)
.33** (.11)
.57*** (.42***)
.40*** (.12)
.33** (.17*)
.34*** (.19*)
.32** (.10)
.42*** (.09)
.41*** (.31**)
.64*** (.41***)
.49*** (.23*)
Inhibitory control battery
.50*** (.20*)
.47*** (.27**)
.50*** (.31**)
.57*** (.31**)
.66*** (.41***)
Note: Partial correlations controlling for age, gender, and verbal ability are shown in parentheses. N 107. For correlations with False belief
(contents), N 106. KRISP Kansas Reflection-Impulsivity Scale for Preschoolers.
* p .05; ** p .01; *** p .001.
1044
Child Development
conducted, partialling out the effects of these variables.
Not surprisingly, the magnitude of the correlations
was reduced (see Table 7), but six of the IC measures
remained significantly related to the ToM battery, and
the remainder all correlated in the predicted direction.
Of note is that all four ToM measures were still significantly correlated with the IC battery. Moreover, the
relation between the ToM and IC batteries remained
highly significant r(101) .41, p .001. Similarly high
correlations between the batteries were found for
both 3-year-olds, r(56) .40, and 4-year-olds, r(38) .41, ps .01. The ToM battery also remained significantly correlated with the Conflict and Delay scales,
.41 and .23, respectively. The Conflict–ToM partial
correlation was marginally larger than the Delay–
ToM correlation, t(104) 1.94, p .10. The Conflict–
ToM partial correlation was larger than the Delay–
ToM correlation for both age groups: rs(56) .38
versus .22 for 3-year-olds; rs(38) .45 versus .25 for
4-year-olds. This difference was significant for 4-yearolds, t(42) 2.08, p .05, but not 3-year-olds. Finally,
the CBQ item concerning self-control games also remained significantly related to ToM, r(68) .26, p .02.
Assessed next was whether the relation between
IC and ToM remained strong after variables other than
age, gender, and verbal ability were controlled. The
first assessment was whether the relations might stem
from general executive abilities as opposed to IC, per
se. Recall that the Motor Sequencing task had been included as a measure of the former but not the latter.
On the Motor Sequencing task, children completed an
average of 2.1 musical sequences in a 10-s period (SD 1.6, range 0 to 6). Four-year-olds (M 2.7, SD 1.6)
were better at this task than 3-year-olds (M 1.7, SD 1.4), t(103) 3.3, p .01. Motor Sequencing was significantly related to the IC and ToM batteries, rs(105) .35 and .40, ps .01 and .001, respectively. These
correlations fell below significance controlling for
age, gender, and verbal ability, rs(100) .11 and .15,
respectively. Not surprisingly then the relation between IC and ToM remained significant controlling
for Motor Sequencing performance as well as age,
gender, and verbal ability, r(99) .39, p .001, providing at least preliminary evidence that the relation
might be mediated specifically by IC.
Second, in addition to the ToM tasks, children completed Mental State Control tasks analogous to the
Location and Contents False-Belief Versions, but not
requiring belief attribution. Four-year-olds outperformed 3-year-olds on both the Mental State Control
Location (71% versus 52% correct) and Contents (84%
versus 65% correct) tasks, 2s(1, N 107) 4.1 and
5.2, ps .05 and .03, respectively. Because the two
tasks were highly correlated, r .40, p .001, they were
aggregated for the subsequent analyses. The Mental
State Control composite was significantly related to
both IC and ToM batteries, rs(105) .49 and .49, respectively, ps .001. As noted earlier, in prior research,
when children failing such tasks were excluded, performance on ToM tasks remained poor in younger
preschoolers. It should be noted, therefore, that the
IC–ToM partial correlation (controlling for age, gender,
and verbal ability) remained strong in the subset of
children who passed both of the Mental State Control
tasks (n 56), r(51) .46, p .001.
Third, on the four Pretend-Actions tasks, 4-yearolds made marginally more symbolic-object gestures
(M 1.6, SD 1.4) than 3-year-olds (M 1.1, SD 1.2), t(105) 1.8, p .08. Pretend-Actions scores were
related to both the IC and ToM batteries, rs(107) .26
and .22, ps .01 and .05, respectively. It might have
been the case that the relations obtained were a byproduct of the fact that pretense is related to both IC
and ToM. But, this was not so: the IC–ToM relation remained significant controlling for pretense in addition
to age, gender, and verbal ability, r(101) .41, p .001.
Finally, some earlier studies found that children
with more siblings performed better on tests of false
belief than those with fewer siblings (Jenkins and
Astington, 1996; Perner et al., 1994; Ruffman, Perner,
Naito, Parkin, & Clements, 1998). Children in the
present study had a mean of 1.3 siblings (SD .9,
range 0–5). In contrast to earlier studies, both the
raw correlation between number of siblings and ToM
battery scores, and the partial correlation controlling
for age, gender, and verbal ability, were not significant, rs(107, 102) .01 and .11. Similarly, both the
raw and partial correlations between number of siblings and the IC battery were not significant, rs(107,
102) .01 and .11. Not surprisingly, then, the IC–ToM
relation remained strong controlling for number of
siblings in addition to age, gender, and verbal ability,
r(101) .40, p .001. The pattern of findings was the
same when (1) the number of older siblings only (see
Ruffman et al., 1998), (2) the subset of children who
had three or fewer siblings (n 103; see Perner et al.,
1994), and (3) false-belief tasks only were examined.
In sum, the IC–ToM relation held up strongly across
a series of correlational analyses controlling for a variety of potentially confounding factors. In addition,
when variance shared with all these factors (i.e., age,
gender, verbal ability, Motor Sequencing, Mental State
Control tasks, Pretend Actions, and number of siblings) was statistically removed, the IC–ToM relation
again proved to be highly robust, r(96) .33, p .001.
Moreover, when a parallel analysis (partialling the
same variables with the exception of Mental State Control tasks) was conducted on the subset of children
Carlson and Moses
(n 75) who correctly answered at least five out of six
control questions (i.e., the two reality questions on the
False-Belief tasks and the four Mental State Control
Task questions), a similar pattern was found, r(66) .43, p .001. It is noteworthy that the findings of this
last analysis make it unlikely that the correlations
emerged because of individual differences in children’s comprehension of or attention to the tasks. It
could be argued that the inclusion of a large number
of IC tasks might have taxed children’s executive
skills to an unusually heavy extent, leaving those with
weaker skills especially prone to executive errors on
the ToM tasks. If so, the IC–ToM correlation would be
inflated relative to normal circumstances. If the number of tasks had such an effect, however, it would be
expected that the correlation would be larger later in
the testing sessions relative to earlier. No evidence of
this kind was found: the IC–ToM partial correlation
(controlling age, gender, and verbal ability) was about
the same in the first half of both sessions as in the second half, rs(102) .25 and .26, ps .02 and .01, respectively. Similarly, in an analysis across studies,
Perner and Lang (1999) found no evidence that the
correlation between executive ability and ToM was
higher in those studies in which a large number of
variables were included.
Which IC Measures Best Predict ToM?
The next assessment looked at which IC measures
or subsets of measures contributed most to the variance in ToM battery scores. First, the Conflict and Delay scales were entered into a multiple regression along
with age, gender, and verbal ability. This analysis revealed that the Conflict scale was a highly significant
predictor of ToM, holding the control variables and
the Delay scale constant, t(101) 4.2, p .001. The
Delay scale, however, did not make a significant unique
contribution to variance in ToM over and above the
control variables and the Conflict scale, p .13.
Next, a stepwise analysis examined which individual inhibitory tasks best predicted ToM performance.
After age and verbal ability were included in the model,
the Bear/Dragon and Whisper tasks (both Conflict
tasks) accounted for further unique variance in ToM
battery scores, ts(92) 4.0 and 2.2, ps .001 and .05,
respectively. Similar analyses were performed for
the individual ToM measures. For Appearance-Reality,
Bear/Dragon and Whisper were again significant predictors of ToM performance over and above verbal
ability and age, ts(92) 2.7 and 2.3, ps .01 and .03, respectively. For Deceptive Pointing, the Bear/
Dragon task was a significant predictor (beyond verbal ability), t(94) 5.1, p .001. For Location False
1045
Belief, the Day/Night task was a significant predictor
(beyond verbal ability and age), t(93) 2.3, p .03.
For Contents False Belief, the Card Sort and Gift Delay
tasks were strong predictors (beyond verbal ability),
ts(93) 2.0 and 1.7, ps .05 and .10, respectively.
Direction of Causality
Because the data being reported are correlational,
strong inferences concerning the causal nature of the
relation between IC and ToM cannot be drawn; IC
might foster ToM development or the reverse. Still,
suppose there were many more children who had
strong IC skills and weak ToM abilities than who had
strong ToM skills but weak IC skills. Such a pattern
might suggest that IC was at least developmentally
primary, and hence more likely to be causally implicated
in ToM advances. (Strong inferences about developmental primacy require that the variables of interest
are at least interval scales, an assumption that may or
may not have been met in the present case, see Dixon,
1998.) No evidence was found, however, of this pattern
(or of a reverse one indicating that ToM might be developmentally primary). Specifically, in a median split
analysis of the IC and ToM composite variables, the
majority of children (79) were either above the median on both variables or below it on both. Moreover,
roughly equal numbers of children fell in the “offdiagonal” cells: 15 children were high on IC and low
on ToM, whereas 13 children demonstrated the reverse pattern, McNemar 2 .04, ns. The number of
children in the off-diagonal cells remained roughly
equal (and very low) when more stringent criteria
were used to classify children as high or low on IC
and ToM, for example, including only those cases that
were in the top and bottom 10% of the distributions,
or that were at least one SD above or below the means.
Hence, in and of themselves, the data leave the direction of causality unresolved. We return to this issue in
the discussion.
Can IC Account for All Variance in ToM?
Given the strength of the IC–ToM relation, the question arose as to whether the ToM measures would hold
together when IC was held constant. When age, gender,
verbal ability, and IC battery scores were controlled,
three of the measures (Location False Belief, Contents
False Belief, and Appearance-Reality) remained significantly intercorrelated, and each was related to the
ToM battery (see Table 8). Somewhat surprisingly, the
Deception task failed to correlate with the two falsebelief measures. Nevertheless, it remained significantly correlated with the overall battery, as well as
1046
Child Development
Table 8 Partial Correlations among Theory-of-Mind Measures, Controlling for Age, Gender, Verbal
Ability, and Inhibitory Control (Battery Scores)
Task
False belief (location)
False belief (contents)
Deception
Appearance-Reality
False Belief
(Contents)
.27**
Deception
.13
.14
AppearanceReality
.31**
.22*
.19*
Theory-of-Mind
Battery
.34***
.30**
.22*
.35***
Note: N 107 except in correlations involving False Belief (Contents): N 106.
* p .05; ** p .01; *** p .001.
with Appearance-Reality. Hence, in the present study
although ToM and IC shared substantial common
variance, they clearly did not tap entirely coextensive
abilities. Other factors, including individual differences in children’s conceptions of the mind, surely
made their own independent contributions to variance in ToM performance.
DISCUSSION
The findings of this research indicate that developments in executive functioning are indeed closely interwoven with the dramatic changes taking place in
children’s ToM in the preschool-age period.
Summary of Findings
The data from the present study replicate the common findings that 4-year-olds outperform 3-year-olds
on ToM tasks and that these tasks are interrelated
over and above factors such as age, gender, and verbal
ability (Flavell et al., 1989; Frye et al., 1995; Gopnik &
Astington, 1988; Moore et al., 1990; Taylor & Carlson,
1997). In addition, the mean levels of performance on
the ToM tasks found in the present study were, by
and large, comparable with those typically obtained,
ruling out that the current sample population was
somehow unrepresentative or that the context in
which the tasks were embedded led children to respond in an aberrant manner. Similarly, the present
data confirm earlier findings suggesting that inhibitory skills show marked improvement in the preschool-age years (Frye et al., 1995; Kochanska et al.,
1996; Luria, 1966; Reed et al., 1984). Moreover, consistent with Kochanska et al. (1996, 1997), the various behavioral measures of IC formed a coherent battery
with high internal consistency. As did the ToM tasks,
these measures generally remained intercorrelated
when control factors were held constant, and they
were also related to parent report measures of IC.
Most importantly, the major hypothesis concern-
ing the relation between IC and ToM was overwhelmingly confirmed. The ToM battery was significantly
related to each of the aggregate IC measures (IC battery, Conflict scale, and Delay scale), as well as to all
of the individual behavioral measures of IC. Moreover, the four specific ToM measures also related to
the aggregate IC measures and virtually all of the individual measures. Of course, raw correlations might
have overestimated the strength of a relation because
of variance that was shared with extraneous factors.
In this respect, it was found that both ToM and IC
were related to age, gender, and verbal ability. Nonetheless, many of the correlations were still significant
when these other variables were controlled; the pattern of relations remained especially strong at the aggregate level. Further, the correlations persisted when
still other factors—family size and performance on
Mental State Control, Motor Sequencing, and Pretend
Actions tasks—were held constant, illustrating that
the relation between IC and ToM was a remarkably
robust one. Finally, the magnitude of the relation was
similarly high for both 3- and 4-year-olds, indicating
that IC and ToM are closely intertwined throughout
the preschool-age period.
These results are broadly consistent with those of
Frye et al. (1995), who found that performance on a Card
Sort task was significantly related over and above age
to Contents False Belief, representational change, and
(in one study) Appearance-Reality. In the present study,
the Card Sort was significantly correlated with all
four ToM measures initially, although it remained significantly related to only Contents False Belief when
age, gender, and verbal ability were partialled. The Card
Sort did, however, load highly on the Conflict scale
which was strongly related to the ToM battery over
and above these controls. These findings highlight the
importance of aggregation across multiple measures
for greater psychometric precision (Rushton, Brainerd,
& Pressley, 1983). For example, an examination of Frye
et al.’s research, in which only a single executive task
was used and there was no aggregation across ToM
Carlson and Moses
tasks, or an examination of any single IC measure in
combination with any single ToM measure in the
present study, might easily lead one to come away with
the mistaken impression that the relation between domains is either weak, mercurial, or perhaps even nonexistent. When multiple indicators of both constructs
are examined, however, the true strength of the relation begins to emerge because variance idiosyncratic
to any particular task is cancelled out across measures.
This research also extends Frye et al.’s (1995) findings in other important ways. First, by controlling for
individual differences in verbal ability it was found
that the IC–ToM relation persisted. Second, the relation also remained strong when other factors (gender,
pretense, number of siblings) that are (or could well
be) related to both IC and ToM were held constant,
and when children who erred on ToM control questions were excluded. Thus, considerable support was
obtained for the possibility that the relation is specific
to these constructs rather than a by-product of other
confounding variables. Third, a major advance in the
present study concerned the inclusion of Mental State
Control tasks. As argued earlier, such tasks might be
viewed as tapping IC skill, and indeed it was found
that they did relate to the IC measures. The IC–ToM
relation held up, however, even in the subset of children who performed perfectly on these Control tasks.
Hence, earlier findings that false-belief performance
in younger children remains poor when those who fail
tasks of this kind are excluded (Gopnik & Astington,
1988; Moses & Flavell, 1990; Wimmer & Hartl, 1991) did
not rule out IC as a major contributing factor to ToM
development. Finally, as discussed next, the inclusion
of a variety of conceptually different measures of IC
allowed for the ability to begin to distinguish among
competing executive accounts.
Which Aspects of Executive Functioning
Are Implicated?
The findings of this research strongly suggest that
executive functioning is centrally implicated in ToM
development. Parallel results have been reported for
both normal (Hughes, 1998a, 1998b; Perner & Lang,
1999) and “hard-to-manage” preschoolers (Hughes,
Dunn, & White, 1998) using a smaller number of inhibitory tasks. It is plausible, then, that some form of
executive functioning deficit severely constrains
young children’s ability either to acquire or demonstrate an appreciation of mental life.
The hypothesis guiding the present research was
that limited inhibitory skills are at the heart of this
deficit. Executive functioning, however, encompasses
many diverse cognitive abilities and thus it is conceiv-
1047
able that some executive ability other than IC was responsible for the observed pattern of relations. In this
respect, a motor sequencing task was included as a
measure of executive capacity that did not impose an
inhibitory burden on children. Consistent with the
IC hypothesis, performance on this task did not correlate with ToM once age, gender, and verbal ability were
controlled. Of course, because only one task of this
type was included, it cannot be definitively ruled out
that other, more general aspects of executive functioning subsumed the relation between IC and ToM.
Indeed, other aspects of executive functioning have
been found to correlate with ToM. For example, both
working memory and planning ability correlate moderately with false belief, controlling for age and, in some
cases, verbal ability (Bischof-Kohler, as cited in Perner,
in press; Davis & Pratt, 1996; Gordon & Olson, 1998;
Hughes, 1998b; Keenan, 1998; Keenan, Olson, & Marini,
in press). Nevertheless, some recent findings suggest
that IC tasks may be more strongly related to ToM
than tasks measuring these other executive abilities
(Carlson, Moses, & Breton, 2001).
That said, inhibition, by itself, may be insufficient
to account for the relation between executive function
and ToM. Working-memory skills may be implicated
as well. Recall that the present study included tasks
falling into Conflict and Delay categories, a classification subsequently confirmed in a principal-components
analysis. Although the two scales were significantly
correlated, the Conflict scale was a somewhat better
predictor of ToM than was the Delay scale. Specifically, in a regression analysis, the Conflict scale predicted performance on the ToM battery over and
above controls and the Delay scale, whereas the Delay scale was not significant in a corresponding analysis. Moreover, at the level of individual measures,
the Bear/Dragon and Whisper tasks—both from the
Conflict scale—explained unique variance in ToM
battery scores over and above the controls. The abilities tapped by Conflict tasks may thus have been
more central to ToM reasoning than those tapped by
Delay tasks.
In our view, the key factor differentiating these
task types is working memory. Most of the Delay
tasks required children to merely inhibit their responses (e.g., to wait until the experimenter gave the
signal on the Pinball task); in contrast, the Conflict
tasks required children not only to inhibit an inappropriate response but to activate a conflicting, novel response (e.g., on the Day/Night task, resist saying “day”
in response to the sun card and at the same time substitute the novel “night” response). This additional
processing requirement—holding conflicting alternatives in mind—suggests that the working-memory
1048
Child Development
demands are greater on the conflict tasks. In this respect, the conflict tasks are similar to many ToM tasks:
the latter require children to inhibit a dominant response (in terms of the actual state of affairs) as well
as activate an incompatible, novel response (in terms
of someone’s representation of that state of affairs).
The combination of inhibitory skill and working
memory may thus be critical to ToM reasoning (cf.
Diamond, 1990; Russell, 1997). Consistent with this
analysis, we recently found that Conflict tasks are significantly correlated with working memory, whereas
Delay tasks are not (Carlson et al., 2001).
Although the discriminant findings for Conflict
and Delay tasks can be readily accommodated within
an IC framework, they are also, to some extent, consistent with Frye et al.’s (1995) hypothesis that only tasks
having an embedded rule structure should correlate
with ToM. On this hypothesis, both standard ToM
tasks and certain executive tasks require children to
employ a higher order rule to select the condition (or
perspective) from which to reason. The Card Sort, for
instance, is said to involve an embedded rule structure consisting of (1) setting conditions (e.g., “if” sorting by shape or color ), (2) antecedent conditions (e.g.,
“and if” a certain shape or color is presented on a card),
and (3) consequent actions (e.g., “then” place the card
in its designated location). According to Frye et al., just
as 3-year-olds have difficulty switching dimensions or
setting conditions on the Card Sort task, they also have
difficulty switching perspectives on ToM tasks, such as
self versus other in false belief, before versus after in
representational change, and looks like versus is in
appearance-reality.
Not all executive tasks have an embedded rule
structure. In particular, the Delay tasks from the
present study had a simple rule structure. For example, in the Gift Delay task, there was only a single set
of rules to be followed, that is, in the context of this
game, to wait until the experimenter said that the child
could open the gift, versus opening it without delay
in everyday life. The fact that the Conflict tasks were
more strongly related to ToM than the Delay tasks
might then be seen as support for the embedded rule
use account. For several reasons, however, we do not
favor such an account. First, whether a task such as
the false-belief task is described as requiring simple
or embedded rules appears to be somewhat arbitrary
(see Perner, Stummer, & Lang, 1999). Second, one of
the conflict measures in the present study—the Whisper task—had a simple rather than embedded rule
structure (i.e., whisper in the context of this task, versus talk in a normal voice in other contexts), and yet
was one of the strongest predictors of ToM. Third, although the Conflict scale was a stronger predictor
than the Delay scale, the latter was nevertheless moderately correlated with ToM, suggesting that general
inhibitory problems were implicated independently
of rule structure. Finally, representational tasks with
the very same rule structure relate very differently to
executive ability. For example, although performance
on false-belief tasks is strongly correlated with IC,
that on “false”-photograph tasks is not (Sabbagh,
Moses, & Shiverick, 2001).
What Role Does Executive Functioning Play?
At the outset it was suggested that executive functioning might be implicated in either the expression
of children’s ToM or in its very emergence. As Russell
(1996) argued, young children’s ToM failures could
stem from either an executive performance problem
that prevents them from showing what they already
know (i.e., from translating their knowledge into performance) or from a more deep-seated executive competence problem that limits their ability to conceive of
mental states in the first place (i.e., lacking inhibitory
or executive skills, young children may simply be unable to reflect on multiple mental representations).
The present findings are compatible with either of
these executive accounts. As noted earlier, however,
findings from other studies, in which children reveal
a stronger appreciation of mental life when the inhibitory demands of tasks are reduced, are clearly compatible with a performance account (Carlson et al.,
1998; Hala et al., 1999; Leslie & Polizzi, 1998; Mitchell
& Lacohee, 1991; Moses, 1993; Robinson & Mitchell,
1995; Zaitchik, 1991). That said, inhibitory manipulations do not always have the predicted effects (Russell,
1996) and, even when they do, younger children’s performance remains far from perfect (Wellman, Cross, &
Watson, 2001). Hence, although performance factors are very likely implicated in ToM advances, their
role may be limited (although see Moses, 2001).
Instead, much of the impact of executive functioning
may occur at the level of ToM competence. Consistent
with this possibility, some recent findings (Moses,
Carlson, Stieglitz, & Claxton, 2001; Perner & Lang,
1999) indicate that executive tasks not only correlate
with ToM tasks (such as those in the present study), in
which a prepotent response option is available, but
also with those in which there is no apparent prepotent option. In these latter tasks—for example, Moore
et al.’s (1990) mental state certainty task, O’Neill and
Gopnik’s (1991) sources of knowledge task, and Wimmer and Mayringer’s (1998) false-belief explanation
task—younger children appear to respond randomly
rather than systematically favor an incorrect option.
The fact that executive function relates to such tasks
Carlson and Moses
cannot be accommodated within a performance account; instead, the correlations may arise because executive abilities are required for initial mastery of the
concepts themselves.
Whatever the precise role played by executive factors in ToM development, it seems very likely that
conceptual changes are also at work. Executive or inhibitory constraints alone clearly underdetermine the
nature of changes taking place in young children’s
ToM. For one, the present findings showed that even
when variance shared with inhibitory functioning (and
with factors such as age, gender, and verbal ability)
was removed, the ToM tasks still cohered as a group,
suggesting the presence of a common conceptual core
binding together performance on these tasks. Moreover, to the extent that inhibitory/executive skills are
involved, they may well interact with conceptual
abilities. For example, on a simple executive account,
one might expect children to have as much difficulty
with pretense and unfulfilled-desire tasks as with
false-belief tasks. After all, in each of these cases children need to set aside their own salient knowledge to
ascribe an apparently incompatible mental representation to another or to themselves. The fact that young
children reveal considerably greater knowledge of
pretense and desires than of false beliefs clearly speaks
against such an account (Bartsch & Wellman, 1995;
Flavell, Flavell, Green, & Moses, 1990; Gopnik &
Slaughter, 1991; Lillard & Flavell, 1992; Repacholi &
Gopnik, 1997; but see Moore, et al., 1995).
A modified executive account, in which inhibitory
functioning interacts with conceptual difficulty, can
accommodate these findings; prepotency may be relatively easy to override in the case of some mental
states, but very difficult to overcome in the case of
others. Specifically, inhibitory demands should be
greatest when there is normative pressure on a mental state to conform to reality (Carlson et al., 1998;
Moses, 1993). Pretense, for example, does not purport
to reflect reality (in fact, quite the opposite), and so
the threshold level of IC required to reason about pretense may be quite low. At the other end of the spectrum, beliefs are supposed to reflect reality and so the
inhibitory threshold may be very high. Desires may
fall somewhere in between because, although individuals would like their desires and reality to be
aligned, the pressure to conform is on reality rather than
the mental state (i.e., when our desires are unfulfilled
one typically attempts to change the world rather
than the desires; see Searle, 1983). Consistent with this
interactive analysis, we recently found that preschoolers’ IC was more strongly related to performance on
false-belief tasks than to performance on matched
pretense and desire tasks (Moses et al., 2001).
1049
Finally, we have argued that executive functioning
is causally implicated in ToM development. But of
course our findings are correlational and so it is possible that the direction of causality is the opposite—
that ToM advances engender improvements in executive functioning. Indeed, Perner and colleagues (Perner,
in press; Perner & Lang, 1999; Perner et al., 1999) have
argued that metarepresentational skills are necessary
for children to exert volitional control over actions.
Those authors proposed that to succeed on executive
tasks, children must be able to represent their action
goals as well as the impediments to those goals, such as
prior learning or habitual response tendencies. In this
view, such self-monitoring processes require a means
of expressing higher order states, that is, a ToM (see also
Dennis, 1991). Thus, an underdeveloped or impaired
ToM could cause executive functioning deficiencies.
The present finding that ToM and IC were related
is thus compatible with Perner’s (in press) account. The
fact, however, that Conflict tasks were more strongly
associated with ToM than were Delay tasks would appear to speak against it; in Perner’s account, advances in
ToM ought to facilitate similar improvements on all
tasks requiring executive inhibition (as Perner terms
it). Moreover, other findings are also inconsistent
with his account. For example, if ToM advances are
primary, then reducing inhibitory demands should
not affect performance. As discussed earlier, however,
reducing the inhibitory burden does generate improved performance, at least in some cases. Further,
Hughes (1998a) found executive functioning at age 3
to 4 years better predicts ToM 1 year later than ToM
predicts executive functioning across that age span.
Specifically, children’s performance on inhibitory
tasks (e.g., detour reaching, Luria’s hand game
(Luria, Pribram, & Homskaya, 1964), and set shifting)
at Time 1 accounted for 20% of the variance in ToM
scores at Time 2 controlling for the effects of age, verbal ability, and initial ToM scores. In contrast, Time 1
ToM scores were uncorrelated with four out of five executive function tasks at Time 2 (and surprisingly, accounted for only 6% of Time 2 ToM variance). Such a
pattern would not be predicted by a simple ToM-toexecutive control causal sequence. That said, additional research—including training studies as well as
the collection of longitudinal data beginning at an even
earlier age, when self-control and an understanding
of mental states are first emerging—is surely needed
to clarify the issue of causal direction.
In conclusion—the present analysis suggests that
developments in IC facilitate, and may well be necessary, for ToM advances; clearly, however, IC is not sufficient for those advances. As Flavell and Miller (1998)
argued, even with a fully mature executive system,
1050
Child Development
children would still have much to learn about mental
states. Nevertheless, the current findings indicate that
IC may well play a crucial enabling role in both the
emergence and expression of children’s ToM.
ACKNOWLEDGMENTS
This research was supported by dissertation research
awards to the first author from the American Psychological Association and the Psychology Department of
the University of Oregon. The authors wish to thank
Andrew Craven, Sarah Swisher, Becky Zeien, Sherri
Hooker, Kirsten Spoljaric, Casey Breton, and Mary
Clark for assistance with data collection and coding.
Thanks also go to Marjorie Taylor, Mary Rothbart, and
the anonymous reviewers for helpful suggestions.
ADDRESSES AND AFFILIATIONS
Corresponding author: Stephanie M. Carlson, Department of Psychology, University of Washington,
Box 351525, Seattle, WA 98195-1525; e-mail: carlsons@
u.washington.edu. Louis J. Moses is at the University
of Oregon in Eugene.
REFERENCES
Baron-Cohen, S., Ring, H., Moriarty, J., Schmitz, B., Costa, D.,
& Ell, P. (1994). Recognition of mental state terms: Clinical findings in children with autism and a functional
neuroimaging study of normal adults. British Journal of
Psychiatry, 165, 640–649.
Bartsch, K., & Wellman, H. M. (1995). Children talk about the
mind. New York: Oxford University Press.
Carlson, S. M., Moses, L. J., & Breton, C. (2001). How specific
is the relation between executive functioning and theory of
mind? Contributions of inhibitory control and working memory. Unpublished manuscript, University of Washington.
Carlson, S. M., Moses, L. J., & Hix, H. R. (1998). The role of
inhibitory control in young children’s difficulties with
deception and false belief. Child Development, 69, 672–
691.
Carvajal, H. H., Parks, J. P., Logan, R. A., & Page, G. L.
(1992). Comparisons of the IQ and vocabulary scores on
Wechsler Preschool and Primary Scale of IntelligenceRevised and Peabody Picture Vocabulary Test-Revised.
Psychology in the Schools, 29, 22–24.
Couillard, N. L., & Woodward, A. L. (1999). Children’s comprehension of deceptive points. British Journal of Developmental Psychology, 17, 515–521.
Davis, H. L., & Pratt, C. (1996). The development of children’s theory of mind: The working memory explanation. Australian Journal of Psychology, 47, 25–31.
DeLoache, J. (1999, April). IC and symbol use. In G. O. Deak
(Chair), Flexibility and perseveration in early problemsolving. Symposium conducted at the biennial meeting
of the Society for Research in Child Development, Albuquerque, NM.
Dempster, F. N. (1992). The rise and fall of the inhibitory
mechanism: Toward a unified theory of cognitive development and aging. Developmental Review, 12, 45–75.
Dennis, M. (1991). Frontal lobe function in children and adolescence: A heuristic for assessing attention regulation,
executive control, and the intentional states important for
social discourse. Developmental Neuropsychology, 7, 327–
358.
Diamond, A. (1990). Developmental time course in infants and
infant monkeys, and the neural bases of inhibitory control
in reaching. Annals of the New York Academy of Sciences, 608,
637–676.
Diamond, A., Prevor, M. B., Callender, G., & Druin, D. P.
(1997). Prefrontal cortex cognitive deficits in children
treated early and continuously for PKU. Monographs of
the Society for Research in Child Development, 62(4, Serial
No. 252).
Diamond, A., & Taylor, C. (1996). Development of an aspect
of executive control: Development of the abilities to remember what I said and to “Do as I say, not as I do.” Developmental Psychobiology, 29, 315–334.
Dixon, J. A. (1998). Developmental ordering, scale types, and
strong inference. Developmental Psychology, 34, 131–145.
Dunn, L. M., & Dunn, L. M. (1981). Peabody Picture Vocabulary Test-Revised: Form M. Circle Pines, MN: American
Guidance Service.
Elias, C., & Berk, L. E. (1999, April). Self regulation in young
children: Is there a role for sociodramatic play? Poster presented at the biennial meeting of the Society for Research
in Child Development, Albuquerque, NM.
Eslinger, P. J. (1996). Conceptualizing, describing, and measuring components of executive function: A summary. In
G. R. Lyon & N. A. Krasnegor (Eds.), Attention, memory
and executive function (pp. 367–395). Baltimore: Brookes.
Flavell, J. H. (1988). The development of children’s knowledge about the mind: From cognitive connections to
mental representations. In J. W. Astington, P. L. Harris, &
D. R. Olson (Eds.), Developing theories of mind. Cambridge,
U.K.: Cambridge University Press.
Flavell, J. H., Everett, B. A., Croft, K., & Flavell, E. R. (1981).
Young children’s knowledge about visual perception:
Further evidence for the Level 1–Level 2 distinction. Developmental Psychology, 17, 99–103.
Flavell, J. H., Flavell, E. R., & Green, F. L. (1983). Development of the appearance-reality distinction. Cognitive Psychology, 15, 95–120.
Flavell, J. H., Flavell, E. R., Green, F. L., & Moses, L. J. (1990).
Young children’s understanding of fact beliefs versus
value beliefs. Child Development, 61, 915–928.
Flavell, J. H., Green, F. L., & Flavell, E. R. (1986). Development of knowledge about the appearance-reality distinction. Monographs of the Society for Research in Child Development, 51(1, Serial No. 212).
Flavell, J. H., Green, F. L., & Flavell, E. R. (1989). Young children’s ability to differentiate appearance-reality and
Level 2 perspectives in the tactile modality. Child Development, 60, 201–213.
Carlson and Moses
Flavell, J. H., & Miller, P. H. (1998). Social cognition. In
D. Kuhn & R. S. Siegler (Eds.), W. Damon (Series Ed.),
Handbook of child psychology: Vol. 2. Cognition, perception,
and language (5th ed., pp. 851–898). New York: Wiley.
Fletcher, P. C., Happe, F., Frith, U., Baker, S. C., Dolan, R. J.,
Frackowiak, R. S. J., & Frith, C. D. (1995). Other minds in
the brain: A functional imaging study of “theory of
mind” in story comprehension. Cognition, 57, 109–128.
Fox, N. A., Schmidt, L. A., Calkins, S. D., & Rubin, K. H.
(1996). The role of frontal activation in the regulation
and dysregulation of social behavior during the preschool years. Development and Psychopathology, 8, 89–102.
Freeman, N. H., Lewis, C., & Doherty, M. (1991). Preschoolers’ grasp of a desire for knowledge in false belief prediction: Practical intelligence and verbal report. British Journal of Developmental Psychology, 9, 139–157.
Frye, D., Zelazo, P. D., & Palfai, T. (1995). Theory of mind and
rule-based reasoning. Cognitive Development, 10, 483–527.
Gerardi, G. M. (1997). Development of executive attention and
self-regulation in the third year of life. Unpublished doctoral dissertation, University of Oregon, Eugene.
Gerstadt, C. L., Hong, Y. J., & Diamond, A. (1994). The relationship between cognition and action: Performance of
children 3.5–7 years old on a Stroop-like day-night test.
Cognition, 53, 129–153.
Goel, V., Grafman, J., Sadato, N., & Hallett, M. (1995). Modeling other minds. Cognitive Neuroscience and Neuropsychology, 6, 1741–1746.
Gopnik, A. (1993). How we know our minds: The illusion of
first-person knowledge of intentionality. Behavioral and
Brain Sciences, 16, 1–14.
Gopnik, A., & Astington, J. W. (1988). Children’s understanding of representational change and its relation to
the understanding of false belief and the appearancereality distinction. Child Development, 59, 26–37.
Gopnik, A., & Slaughter, V. (1991). Young children’s understanding of changes in their mental states. Child Development, 62, 98–110.
Gopnik, A., & Wellman, H. M. (1994). The theory theory. In
L. Hirschfeld & S. Gelman (Eds.), Mapping the mind: Domain specificity in cognition and culture (pp. 257–293).
New York: Cambridge University Press.
Gordon, A. C. L., & Olson, D. R. (1998). The relation between
acquisition of a theory of mind and the capacity to hold in
mind. Journal of Experimental Child Psychology, 68, 70–83.
Hala, S., Russell, J., & Maberley, L. (1999). The role of social inhibition and response mode in the windows task. Unpublished manuscript, University of Calgary, Calgary, Canada.
Harnishfeger, K. K., & Bjorklund, D. F. (1993). The ontogeny
of inhibition mechanisms: A renewed approach to cognitive development. In M. L. Howe & R. Pasnak (Eds.),
Emerging themes in cognitive development: Vol. I. Foundations (pp. 28–49). New York: Springer-Verlag.
Hodapp, A. F. (1993). Correlation between Stanford-Binet
IV and PPVT-R scores for young children. Psychological
Reports, 73, 1152–1154.
Hudspeth, W. J., & Pribram, K. H. (1990). Stages of brain
and cognitive maturation. Journal of Educational Psychology, 82, 881–884.
1051
Hughes, C. (1998a). Finding your marbles: Does preschoolers’ strategic behavior predict later understanding of
mind? Developmental Psychology, 34, 1326–1339.
Hughes, C. (1998b). Executive function in preschoolers:
Links with theory of mind and verbal ability. British Journal of Developmental Psychology, 16, 233–253.
Hughes, C., Dunn, J., & White, A. (1998). Trick or treat? Uneven understanding of mind and emotion and executive
dysfunction in “hard to manage” preschoolers. Journal of
Child Psychology and Psychiatry and Allied Disciplines, 39,
981–994.
Hughes, C., & Russell, J. (1993). Autistic children’s difficulty
with mental disengagement from an object: Its implication for theories of autism. Developmental Psychology, 29,
498–510.
Jenkins, J. M., & Astington, J. W. (1996). Cognitive factors
and family structure associated with theory of mind development in young children. Developmental Psychology,
32, 70–81.
Jerger, S., Martin, R. C., & Pirozzolo, F. J. (1988). A developmental study of the auditory Stroop effect. Brain and Language, 35, 86–104.
Kagan, J., Rossman, B. L., Day, D., Albert, J., & Phillips, W.
(1964). Information processing in the child: Significance
of analytic and reflective attitudes. Psychological Monographs, 78(1, Whole No. 578).
Keenan, T. (1998). Memory span as a predictor of false belief understanding. New Zealand Journal of Psychology,
27, 36 – 43.
Keenan, T., Olson, D. R., & Marini, Z. (in press). Working
memory and children’s developing understanding of
mind. Australian Journal of Psychology.
Kochanska, G., Murray, K., & Coy, K. C. (1997) Inhibitory
control as a contributor to conscience in childhood:
From toddler to early school age. Child Development, 68,
263–277.
Kochanska, G., Murray, K., Jacques, T. Y., Koenig, A. L., &
Vandegeest, K. A. (1996). Inhibitory control in young
children and its role in emerging internalization. Child
Development, 67, 490–507.
Kopp, C. B. (1982). Antecedents of self-regulation: A developmental perspective. Developmental Psychology, 18, 199–214.
Leslie, A. M., & Polizzi, P. (1998). Inhibitory processing in
the false belief task: Two conjectures. Developmental Science, 1, 247–253.
Lillard, A. S., & Flavell, J. H. (1992). Young children’s understanding of different mental verbs. Developmental Psychology, 28, 626–634.
Livesey, D. J., & Morgan, G. A. (1991). The development of
response inhibition in 4- and 5-year-old children. Australian Journal of Psychology, 43, 133–137.
Luria, A. R. (1966). Higher cortical functions in man. New
York: Basic.
Luria, A. R. (1973). The working brain: An introduction to neuropsychology. New York: Basic.
Luria, A. R., Pribram, K. H., & Homskaya, E. D. (1964). An
experimental analysis of the behavioral disturbance produced by a left frontal arachnoidal endothelioma (meningioma). Neuropsychologia, 2, 257–280.
1052
Child Development
Mitchell, P., & Lacohee, H. (1991). Children’s early understanding of false belief. Cognition, 39, 107–127.
Moore, C., Jarrold, C., Russell, J., Lumb, A., Sapp, F., & MacCallum, F. (1995). Conflicting desire and the child’s theory of mind. Cognitive Development, 10, 467–482.
Moore, C., Pure, K., & Furrow, D. (1990). Children’s understanding of the modal expression of certainty and uncertainty and its relation to the development of a representational theory of mind. Child Development, 61, 722–730.
Moses, L. J. (1993). Young children’s understanding of
belief constraints on intention. Cognitive Development, 8,
1 – 25.
Moses, L. J. (2001). Executive accounts of theory of mind development. Child Development, 72, 688–690.
Moses, L. J., Carlson, S. M., Stieglitz, S., & Claxton, L. J.
(2001). Executive function, prepotency, and children’s theories of mind. Manuscript in preparation.
Moses, L. J., & Flavell, J. H. (1990). Inferring false beliefs
from actions and reactions. Child Development, 61, 929–945.
O’Neill, D. K., & Gopnik, A. (1991). Young children’s ability
to identify the sources of their beliefs. Developmental Psychology, 27, 390–397.
Overton, W. F., & Jackson, J. P. (1973). The representation of
imagined objects in action sequences: A developmental
study. Child Development, 44, 309–314.
Ozonoff, S., Pennington, B. F., & Rogers, S. J. (1991). Executive function deficits in high-functioning autistic individuals: Relationship to theory of mind. Journal of Child
Psychology and Psychiatry, 32, 1081–1095.
Passler, M. A., Isaac, W., & Hynd, G. W. (1985). Neuropsychological development of behavior attributed to frontal
lobe functioning in children. Developmental Neuropsychology, 1, 349–370.
Perner, J. (1991). Understanding the representational mind.
Cambridge, MA: MIT Press.
Perner, J. (in press). The meta-intentional nature of executive functions and theory of mind. In P. Carruthers &
J. Boucher (Eds.), Language and thought. Cambridge,
U.K.: Cambridge University Press.
Perner, J., & Lang, B. (1999). Development of theory of mind
and executive control. Trends in Cognitive Sciences, 3,
337–344.
Perner, J., Leekam, S. R., & Wimmer, H. (1987). Three-yearolds’ difficulty understanding false belief: Representational limitation, lack of knowledge or pragmatic misunderstanding? British Journal of Developmental Psychology,
5, 125–137.
Perner, J., Ruffman, T., & Leekam, S. R. (1994). Theory of
mind is contagious: You catch it from your sibs. Child Development, 65, 1228–1238.
Perner, J., Stummer, S., & Lang, B. (1999). Executive functions and theory of mind: Cognitive complexity or functional dependence? In P. D. Zelazo, J. W. Astington, &
D. R. Olson (Eds.), Developing theories of intention: Social
understanding and self-control. Mahwah, NJ: Erlbaum.
Reed, M., Pien, D. L., & Rothbart, M. K. (1984). Inhibitory
self-control in preschool children. Merrill-Palmer Quarterly, 30, 131–147.
Repacholi, B. M., & Gopnik, A. (1997). Early reasoning
about desires: Evidence from 14- and 18-month-olds. Developmental Psychology, 33, 12–21.
Robinson, E. J., & Mitchell, P. (1995). Masking of children’s
early understanding of the representational mind: Backwards explanation versus prediction. Child Development,
66, 1022–1039.
Rothbart, M. K., Ahadi, S. A., & Hershey, K. L. (1994). Temperament and social behavior in children. Merrill-Palmer
Quarterly, 40, 21–39.
Rothbart, M. K., & Posner, M. I. (1985). Temperament and
the development of self-regulation. In L. Hartlage & C. F.
Telzrow (Eds.), The neuropsychology of individual differences: A developmental perspective (pp. 93–123). New York:
Plenum.
Ruffman, T., Perner, J., Naito, M., Parkin, L., & Clements,
W. A. (1998). Older (but not younger) siblings facilitate
false belief understanding. Developmental Psychology, 34,
161–174.
Rushton, J. P., Brainerd, C. J., & Pressley, M. (1983). Behavioral development and construct validity: The principle
of aggregation. Psychological Bulletin, 94, 18–38.
Russell, J. (1996). Agency: Its role in mental development.
Hove, U.K.: Erlbaum.
Russell, J. (1997). How executive disorders can bring about
an inadequate theory of mind. In J. Russell (Ed.), Autism
as an executive disorder (pp. 256–304). Oxford: Oxford
University Press.
Russell, J., Jarrold, C., & Potel, D. (1994). Executive factors
in preschoolers’ strategic deception. British Journal of Developmental Psychology, 12, 301–314.
Russell, J., Mauthner, N., Sharpe, S., & Tidswell, T. (1991).
The “windows task” as a measure of strategic deception
in preschoolers and autistic subjects. British Journal of Developmental Psychology, 9, 331–349.
Sabbagh, M., Moses, L. J., & Shiverick, S. (2001, June). Inhibitory control and preschoolers’ understanding of false
representations. Poster presented at the annual meeting
of the American Psychological Society, Toronto, ON,
Canada.
Sabbagh, M. A., & Taylor, M. (2000). Neural correlates of
theory of mind reasoning: An event-related potential
study. Psychological Science, 11, 46–50.
Searle, J. (1983). Intentionality. Cambridge, U.K.: Cambridge
University Press.
Simon, J. R. (1990). The effects of an irrelevant directional cue
on human information processing: The “Stroop effect”
versus the “Simon effect.” Acta Psychologica, 73, 159–170.
Sodian, B. (1991). The development of deception in young
children. British Journal of Developmental Psychology, 9,
173–188.
Stuss, D. T. (1992). Biological and psychological development
of executive functions. Brain and Cognition, 20, 8–23.
Taylor, M., & Carlson, S. M. (1997). The relation between individual differences in fantasy and theory of mind. Child
Development, 68, 436–455.
Thatcher, R. W. (1992). Cyclic cortical reorganization during
early childhood. Brain and Cognition, 20, 24–25.
Wellman, H. M., & Bartsch, K. (1988). Young children’s reasoning about beliefs. Cognition, 30, 239–277.
Carlson and Moses
Wellman, H. M., Cross, D., & Watson, J. (2001). Meta-analysis
of theory-of-mind development: The truth about false
belief. Child Development, 72, 655–684.
Wellman, H. M., & Gelman, S. A. (1998). Knowledge acquisition in foundational domains. In D. Kuhn & R. S. Siegler (Eds.), W. Damon (Series Ed.), Handbook of child psychology: Vol. 2. Cognition, perception, and language (5th ed.,
pp. 523–574). New York: Wiley.
Welsh, M. C., Pennington, B. F., & Groisser, D. B. (1991). A
normative-developmental study of executive function:
A window on prefrontal function in children. Developmental Neuropsychology, 7, 131–149.
Welsh, M. C., Pennington, B. F., Ozonoff, S., Rouse, B., &
McCabe, E. R. B. (1990). Neuropsychology of earlytreated phenylketonuria: Specific executive function
deficits. Child Development, 61, 1697–1713.
Wimmer, H., & Hartl, M. (1991). Against the Cartesian
view on mind: Young children’s difficulty with own
false beliefs. British Journal of Developmental Psychology,
9, 125 – 138.
Wimmer, H., & Mayringer, H. (1998). False belief under-
1053
standing in young children: Explanations do not develop before predictions. International Journal of Behavioral Development, 22, 403–422.
Wimmer, H., & Perner, J. (1983). Beliefs about beliefs: Representation and constraining function of wrong beliefs in
young children’s understanding of deception. Cognition,
13, 103–128.
Wright, J. C. (1971). KRISP (Kansas Reflection-Impulsivity
Scale for Preschoolers). Unpublished manuscript, University of Kansas, Lawrence.
Youngblade, L. M., & Dunn, J. (1995). Individual differences
in young children’s pretend play with mother and sibling: Links to relationships and understanding of other
people’s feelings and beliefs. Child Development, 66,
1472–1492.
Zaitchik, D. (1991). Is only seeing really believing? Sources
of true belief in the false belief task. Cognitive Development, 6, 91–103.
Zelazo, P. D., Carter, A., Reznick, J. S., & Frye, D. (1997).
Early development of executive function: A problemsolving framework. Review of General Psychology, 1, 1–29.