Two dogmas of conceptual empiricism

COGNITION
Cognition 65 (1998) 103–135
Two dogmas of conceptual empiricism: implications for
hybrid models of the structure of knowledge
Frank C. Keil*, W. Carter Smith, Daniel J. Simons, Daniel T. Levin
Department of Psychology, Uris Hall, Cornell University, Ithaca, NY 14853, USA
Abstract
Concepts seem to consist of both an associative component based on tabulations of feature
typicality and similarity judgments and an explanatory component based on rules and causal
principles. However, there is much controversy about how each component functions in
concept acquisition and use. Here we consider two assumptions, or dogmas, that embody
this controversy and underlie much of the current cognitive science research on concepts.
Dogma 1: Novel information is first processed via similarity judgments and only later is
influenced by explanatory components. Dogma 2: Children initially have only a similaritybased component for learning concepts; the explanatory component develops on the foundation of this earlier component. We present both empirical and theoretical arguments that these
dogmas are unfounded, particularly with respect to real world concepts; we contend that the
dogmas arise from a particular species of empiricism that inhibits progress in the study of
conceptual structure; and finally, we advocate the retention of a hybrid model of the structure
of knowledge despite our rejection of these dogmas.  1998 Elsevier Science B.V.
Keywords: Dogmas; Conceptual empiricism; Hybrid models
1. Introduction
As this issue’s opening article by Rips and Sloman makes clear, mature concepts
have two central components: one that is largely associative and one that is explanatory. Proposals for such a hybrid stretch back at least to Locke and are very much
with us today (Neisser, 1967; Sloman, 1996). The general acceptance of the hybrid
model of thought and knowledge unifies the papers in this special issue of Cognition.
Yet, controversy remains about how these components are involved in cognitive
* Corresponding author. Tel.: +1 607 2556365; fax: +1 607 2558433; e-mail: [email protected]
0010-0277/98/$19.00  1998 Elsevier Science B.V. All rights reserved
PII S00 10-0277(97)000 41-3
104
F.C. Keil et al. / Cognition 65 (1998) 103–135
development and in adult learning and use of concepts. These controversies are
fueled by two distinct, but related assumptions (or Dogmas) that dominate most
views of the origin and structure of concepts.
Dogma 1: Any new category is understood by first processing similarity and only
later by considering causal or explanatory principles. The explanation-based component only arises given sufficient opportunity for cognitive reflection.
Dogma 2: Infants and young children initially represent categories using the
association component of concepts. Only with development does the more abstract,
explanatory component of concepts emerge.
Both of these dogmas presuppose that learning is perceptually driven, progressing
from the simple processing of sensory features to forming complex representations
and abstract thoughts; this presupposition eventually dooms them as reasonable
models of knowledge acquisition. Indeed, we argue that the two dogmas fail both
under principled considerations and under the bright light of experimental data. Yet,
these dogmas are implicit in most models of concept development and of adult
processing of categories. If these dogmas fail as explanations of category and concept acquisition and use, why, then, are they so prevalent and widespread? We show
how empiricist biases influencing the choice of stimuli used to study concepts and
the interpretations of data from these experiments have perpetuated and sometimes
even reified these dogmas. First, however, we must briefly review the need for both
the similarity and the explanatory components of concepts.
2. Do similarity and explanation account for separate aspects of concepts?
Many aspects of human thought seem to rely on relatively automatic processing,
driven largely by the statistical properties of instances. Some aspects of our categorization, induction, and concept acquisition clearly are influenced by tabulations
of feature frequencies (e.g. how frequently a feature occurs among members of one
category versus others) and probabilistic comparisons to stored exemplars (see
Smith and Medin, 1981 for a summary of earlier work). Indeed, the emergence
and successes of the Roschean view of concepts in the 1970s suggested that probabilistic representations might sufficiently explain all of categorization, displacing
earlier rule-based ‘classical’ models (Rosch and Mervis, 1975). These successes
were particularly seductive because such models were able to account for much
of categorization without delving into ‘cause’ and ‘explanation’. Indeed, because the
typicality of a feature often is directly related to its causal importance in determining
category membership, models of categorization based on feature typicality alone
seemed able to eliminate the explanatory component of concepts. For example,
whiteness is both highly typical of and causally central to polar bears. Causally
central features are those that are closely tied to other critical properties and are
linked to the origins or essence of the object. ‘White’ is causally central to polar
bears because the property is closely linked to survival in snow covered environments. Any other color would impair a polar bear’s ability to hunt. Less central
properties (e.g. a bear’s tail shape) could more easily be variable without affecting
F.C. Keil et al. / Cognition 65 (1998) 103–135
105
the underlying nature or the basic behaviors of the bear. The causal mechanisms
(evolutionary in this case) underlying the color of the polar bear explain why whiteness is a typical property. Given the substantial correlation between typicality and
causal structure for real world categories, experimenters may miss a critical aspect
of our concepts unless they empirically separate these components.
In many other cases, however, the properties most typically associated with
members of a kind are not the most causally or explanatorily central. For example,
virtually every washing machine ever encountered is white, yet we know that whiteness is irrelevant in determining that an object is a washing machine. Thus, although
we do use typicality in categorization and induction, in many cases something other
than mere similarity is needed, at least partly because some typicality information is
not relevant. That ‘something else’ involves intuitions about why things are similar—about the causal forces underlying the similar properties themselves. Moreover, even variable properties can be causally central if the patterns of variation are
linked to causal features of underlying category membership. For example, the
variable colors of chameleons help determine category membership. Given that
chameleons can change their colors, we rarely if ever encounter two chameleons
that share identical coloring. Thus, the color variability for an individual chameleon
leads to variability across chameleons as well. The variability itself is the feature that
ties individuals to the larger category. Successful categorization requires us to
ignore the typicality structure and rely on causal explanations for how color is
involved in chameleon ecology. The same point can be made in cases where variability is even more lawfully and predictably related to causal interactions with the
environment. For example, the color of iron varies considerably as a consequence of
its surface being oxidized into rust. This variation is intimately linked to the interaction of iron’s chemical nature and the environment. Even social categories such as
‘teenager’ can lead to variable, but nonetheless often depressingly predictable patterns of appearance. A concept of ‘teenager’ relies on causal accounts of how teens
interact with their peers and adult society.
In some domains, the difference between similarity and rule-based categorization
is even more striking. Categories such as odd numbers, prime numbers, and triangles
are governed by formal, precise rules that transcend associative relations. Adults
know and understand these logical constraints, but still easily and consistently judge
some exemplars of these categories to be better than others (Armstrong et al., 1983).
That is, even categories that are clearly defined by explicit rules can show typicalityor frequency-like effects. In other domains as well, we can clearly separate judgments based on typicality and on rules. For example, we can all appreciate that
kinship terms such as ‘uncle’ have a dual representation. The typical uncle is a man
about the same age as your parents who brings you gifts or sends cards on major
holidays and often talks about things he and your mother or father did together as
children. The rule-defined uncle is an individual of any age who shares a biological
relationship to your mother or father. Whether or not he visits or brings gifts is
irrelevant. Likewise, we know about the kinds of typical events that make one New
Years’ Eve similar to another, but we also know a rule defining that holiday.
Although the contrast between rules and typicality seems stark, we cannot fairly
106
F.C. Keil et al. / Cognition 65 (1998) 103–135
argue that these components are so discrete and independent that they function with
little or no interaction. In fact, the cases described highlight the danger of generalizing from only one class of things. With nominal kinds such as triangle and uncle,
the components may be fairly separable. However, for natural kinds, the interdependence of typicality and explanation is much more evident. If natural kinds are the
most prevalent and typical type of category, such interdependence might well be the
norm.
3. Natural kinds: the necessity of the hybrid
Knowledge of natural kinds clearly illustrates the need for a hybrid model of
concepts. Surely the first human knowledge was largely of natural kinds in a world
empty of artifacts. Thus, it is safe to assume that our first ways of meaningfully
carving up the world and understanding its structure centered around our knowledge
of natural kinds. Today, our knowledge about natural kinds is closely linked to the
contemporary sciences, but how did these sciences ever get started? What cognitive
processes account for the success of science in making predictions and increasing
our understanding of the natural world? This question underlies much of the modern
history and philosophy of science (Salmon, 1989). It should also be one of the
foundational questions of cognitive science.
Science works because it links observations of associated variables with explanations of the causal mechanisms underlying those associations. Natural science is a
union of the associative and the explanatory, of similarity and cause. For example,
consider the class of metals occupying column 1B in the periodic table (i.e. copper,
gold and silver) which tend to share many physical properties (reactivity, volatility,
and malleability, etc.). We attribute these physical similarities to the nature of their
electron shells: the causal mechanisms underlying the observable similarities
depend on the behaviors of the electrons. Yet, even if the physical properties of
these metals can be explained in terms of their electron shells, questions about the
causal mechanisms responsible for the behavior of electrons remain. Diligently, we
turn to particle physicists to find an explanation. They attribute the similarities to
causal interactions of subatomic particles like quarks. We can proceed in this manner (e.g. what explains the behavior of quarks?) until eventually, this chain of linked
mechanisms runs out and we must resort to less precise notions of what properties
are causally relevant: ‘at the end of every explanatory regress we must perforce shift
from causal mechanisms to causal powers’ (Harré, 1988, p. 142). Causal powers
refer to our knowledge about the dispositions of entities to engage in some kinds of
causal interactions but not others (see Harré and Madden, 1975; Harré, 1988). The
distinction between causal mechanisms and causal powers mirrors the distinction
between specific theories and framework theories (Wellman, 1990), where the frameworks constrain the kinds of information likely to be incorporated into a specific
theory; that is, they highlight which properties are likely to be relevant to a particular
causal explanation (see Brown, 1990; Gelman, 1990). With the accumulation of new
observations, specific theories and explanations develop and sometimes are reinter-
F.C. Keil et al. / Cognition 65 (1998) 103–135
107
preted in different theoretical terms. These new observations often take the form of
correlational evidence. Thus, further theory growth relies on an ever-growing similarity database. But this database is itself at least partially constrained by the
assumption that similarities and correlations occur for a reason. We either seek to
incorporate the observations into existing specific theories, or occasionally, if anomalous observations begin to accumulate, we attempt to significantly revise those
theories (Kuhn, 1962).
For natural kinds then, there are at least three levels of conceptual understanding:
precise knowledge of the causal mechanisms by which properties and kinds interact,
notions of what kinds of properties are causally central in a domain (as well as what
kinds of causal patterns and dispositions might be associated with a kind), and
finally, a database of properties that tend to co-occur within a kind (i.e. the ‘similarity’ component). In terms of the hybrid structure of concepts, the explanatory
component consists of both knowledge of precise mechanisms and framework
notions of causal powers and patterns. This explanatory part draws on the ‘similarity
database.’ New additions to that database, however, do not arise from a theoryneutral stance, but are in turn constrained by the explanatory component. That is,
some similarities will be more readily noticed than others because they are more
congruent with available systems of explanation. Each component, therefore, supports and helps guide the other. Indeed, without such mutual constraints each component would be dysfunctional.
Why do appeals to raw, theory-neutral similarity fail? Why cannot progressively
higher-order tabulations of similarity fully account for our conceptions of natural
kinds? The reason is simple: there are too many possible frequencies and correlations to be tabulated. Indeed the set of such possible tabulations is theoretically
infinite. As has now been discussed in many places (Goodman, 1972; Keil, 1981;
Murphy and Medin, 1985), depending on the features chosen for comparison, any
two arbitrarily chosen objects can be maximally similar to each other. For example,
a cloud and a white feather might be considered quite similar if color and ability to
float in air are the comparison features, whereas they could not be more different if
size and functional affordances are the comparison features. In the real world, every
instance of every object we encounter has an indefinitely large number of features,
any of which could be tabulated. Any attempt to store information about objects in
terms of feature frequencies would fail without constraints on the inputs to such
tabulations. Moreover, the whole notion of features as atomic primitives is suspect
given that they can either proliferate or evaporate depending on the theoretical
perspective adopted (see Wisniewski and Medin, 1994 for discussion). Thus, the
idea of a theory-neutral set of primitive features is a fiction that, when acknowledged, makes the need for an explanatory component all the more obvious.
Empirical evidence bolsters this plausibility argument. In learning artificially
constructed categories, in which the features are theory-neutral, adults are poor at
detecting feature correlations even when doing so would enhance concept learning
(Murphy and Wisniewski, 1989). Intuitive theories help people to assimilate statistical correlations (Wright and Murphy, 1984), and such intuitive notions of causation allow false beliefs to persist even when the statistical evidence contradicts the
108
F.C. Keil et al. / Cognition 65 (1998) 103–135
belief (e.g. Chapman and Chapman, 1967, 1969; Wright and Murphy, 1984). Thus,
bottom-up statistical patterns do not always drive reasoning: we often use high level
schema to impose interpretations on statistical patterns (see also Nisbett et al., 1983).
Finally, in seeking to attribute the cause of an effect to some factor, adults prefer
information about specific, underlying mechanisms over information about the covariation of surface events (Ahn et al., 1995).
In short, models of knowledge representation based solely on associative learning
among properties are hopeless without some way of sharply limiting the set of
properties to be associated. Empiricist approaches to knowledge representation
cling to the assumption that such constraints will be imposed by the perceptual
system. Accordingly, our sensory systems must provide a limited set of perceptual
primitives so that exhaustive tabulations over those primitives are computationally
possible. This assumption is nothing more than an article of faith and has never been
even faintly satisfied. Even with a stock of a few hundred primitive features (e.g.
colors, shapes, textures, sizes, and surface patterns), the number of potential associations to notice becomes massive. Add temporal and spatial factors to the tabulations (e.g. one feature is noticed at roughly the same time and place as another) and
the computational task explodes once again.
Without constraints on the ‘tabulation space’ sufficiently more powerful than
those imposed by the sensory system, associations alone cannot explain categorization of natural stimuli. In much of human cognition, those additional constraints
come through rules, explanatory relations, and notions of mechanism. In other
species, they may be less explicit and rule like, but still are radically different
from pure association. If concepts were learned solely through association, those
properties that co-occur more frequently should be more readily associated. Yet,
classic work on phobias and taste aversion suggest that a single co-occurrence may
indelibly link two properties or events. Many species form ecologically important
links between properties of objects in the world in ways that run against the frequency of occurrence. Rats will associate nausea with the taste of a food ingested 8
hours earlier rather than with a physical trauma occurring minutes earlier (Garcia
and Koelling, 1966; Rozin and Kalat, 1971). Also, monkeys more readily associate
fear with snakes than with rabbits or flowers even when they have had equal exposure to both classes of entities (Cook and Mineka, 1989, 1990). These preferred
linkages cannot be explained easily by different saliency of features; instead, the
animal is predisposed to be especially sensitive to those associations that make sense
for that organism’s ecological niche.
Humans show similar constraints in their understanding and categorization of
natural kinds; higher-order criteria determine the particular frequencies and correlations we choose to notice and remember. When learning about a new kind of animal,
we expect certain property relations to be causally and explanatorily useful. These
expectations can override similarity-based information. Although the nature of these
constraints is still unclear (they could take a range of forms from causal intuition to
precise rules), our concepts of natural kinds are a blend of frequency information
and constraints on the particular relations we choose to consider. In fact, we could
not have concepts of natural kinds without both components, any more than science
F.C. Keil et al. / Cognition 65 (1998) 103–135
109
could proceed without both theoretical expectations to guide our inquiry and feature
tabulations to gather as yet uninterpretable information. Hybrid concepts are necessary to adequately represent any natural kind.
The hybrid model of concepts occupies the middle ground in the continuum
offered by Rips and Sloman and acknowledges that both similarity and rules are
needed without reducing one to the other. Adopting a hybrid model neither entails a
particular representational format for each component nor specifies the nature of the
interaction between the components. We still need to know more about how the two
components interact and how each is represented. Over many different versions of
the hybrid model of concepts, the two dogmas outlined above are the primary
assumptions that have shaped current theory and research. Our main purpose here
is to place these dogmas in a broader context and to discuss why they must fail for
any model of natural kind concepts.
4. Dogma 1—New categories are initially similarity-based: only with time do
we come to apply rules and explanations
Although, at first glance, it seems plausible that adults acquire new knowledge via
an association to theory shift, research on causal attribution and the development of
domain specific expertise in adults directly contradicts this characterization. This
research shows that causal theories often exert a powerful influence on the acquisition process itself and furthermore that associationistic principles are not eventually
replaced by explanatory knowledge but remain important even in experts’ classification schemes.
Traditional models of causal attribution propose that people use information
about the covariation of factors and effects to determine what caused an event
(e.g. Kelley, 1967; Cheng and Novick, 1990, 1992). But an examination of the
information search strategies people use to develop explanations for events challenges this view (Ahn et al., 1995). In seeking to learn why an event occurred, people
solicit information about hypothesized causal mechanisms much more often than
they seek information about co-occurring events (Ahn et al., 1995). For example,
when trying to discover why John had a car accident on Route 9 last night, people
tend to ask questions that either propose specific underlying mechanisms (e.g. ‘Was
John drunk?’ ‘Did John’s brakes fail?’) or that seek more information about the
elements described (e.g. ‘Was there something wrong with John’s car?’ ‘Was there
something peculiar about last night?’). Generally, they do not ask questions about
the frequency with which John has car accidents, the frequency of car accidents last
night, or the frequency of accidents on that particular road. So, in learning about a
novel event, people do not often seek information about covarying factors, nor do
they seem to use such information when given it directly. However, as Ahn et al.
(1995) point out, these findings do not diminish the importance of covariation
information: such information may be used to generate hypotheses or to confirm
the presence of a particular underlying mechanism.
Covariation models of causal attribution, with their emphasis on common and
110
F.C. Keil et al. / Cognition 65 (1998) 103–135
distinctive ‘factors’ are analogous to similarity-based accounts of concept formation, in which categories are learned by attending to common versus distinctive
features. A still-thriving tradition of research on adult concept learning is based
on the premise that raw computations of similarity relations adequately describe
an early state of concept learning. Much of this research uses simple, semantically
meaningless stimuli that vary continuously on two dimensions to test predictions of
universal exemplar models of category structure. For example, the Generalized
Context Model (GCM; Nosofsky, 1984, 1986) predicts classification and recognition behavior by assessing the similarity between a test stimulus and stored representations of other, previously acquired exemplars (i.e. the two categories of
artificial stimuli used in a given experiment). This similarity measure is a weighted
function of the features across which a test stimulus is compared to stored exemplars. Stimuli that are very similar to a large number of stored exemplars from either
category will provoke a recognition response, whereas stimuli that are more similar
to exemplars from one category than those of another will be classified accordingly.
This ability to predict performance across a number of tasks using simple generalizable principles is one of the most important aspects of the GCM model and other
similarity based models.
Nosofsky (1992) provides the groundwork for a plausible interface between these
generalizable similarity computations and a more rule-based component of concepts
by suggesting that exemplar models be understood in terms of representation-process pairs (Anderson, 1976) whereby representations are given separate status from
the processes that work on them. Accordingly, similarity-based exemplar spaces
represent a stable ground upon which various processes (e.g. summed similarity)
operate to produce predictable patterns of behavior. This kind of model implies that
associative principles are fundamental in organizing concepts, but that theory laden
processes can supplement these basic representations to produce a complete explanation of more complex concepts (such as real world categories). The most concrete
mechanism purported to govern the interaction of the associative and explanatory
components is selective attention. Both the GCM and the earlier Context Theory
(Medin and Schaffer, 1978) suggest that selective attention changes the weighting
given to each stimulus feature in different task contexts. Thus, selective attention
takes into account the relevance of particular features to particular classification
goals. Therefore, a resulting hybrid model of concepts might borrow this distinction
between representations and processes by positing theory and explanation as processes akin to selective attention that operate a posteori on stable associative networks by determining which features will be most heavily weighted in a given
comparison. Advocates of this kind of model appeal to a combination of basic
perceptual filters and task-specific principles to constrain the properties represented
and thus render the similarity comparison process more plausible than a brute force
associative engine (Goldstone, 1994; Medin et al., 1993).
Such models directly instantiate Dogma 1 whereby learning begins as an accumulation of a series of exemplars encoded according to domain-general associative
principles and is gradually enriched by attentive processes that select critical features more carefully, which, in turn, drives the emergence of more sophisticated,
F.C. Keil et al. / Cognition 65 (1998) 103–135
111
perhaps causal, principles relating those features. This progression seems to account
for the learning of categories of artificial stimuli devoid of real-world meaning. The
use of such stimuli effectively eliminates the possibility that existing explanatory
knowledge can contribute to the organization of novel categories. Thus, this instantiation of Dogma 1 seems to follow by necessity: it is difficult to imagine how a rulebased ‘process’ can function until a sufficient number of exemplars have been coded
associatively into a sufficient ‘representation’.
However, when items in a novel category do evoke real world knowledge, the
importance of causal theories and explanations to even the earliest stages of the
acquisition of new concepts becomes evident. Unlike the arbitrary categories
described above, natural kinds and even humanmade implements have features
that tend to be causally interwoven. When features are causally interrelated, explanatory knowledge speeds both category learning and the identification of features
(Murphy and Allopena, 1994). In Murphy and Allopena’s study, participants learn to
classify novel animals with either interrelated or non-interrelated features. For
example, one animal might have ‘pointed ears’ and ‘spots’ and another animal
might ‘eat meat,’ and have ‘sharp teeth.’ Having pointed ears does not causally
entail the presence of spots or any other surface marking. However, ‘eating meat’
likely entails the presence of ‘sharp teeth’. Species that eat meat have generally
evolved sharp teeth. Participants who learn categories with causally interrelated
features are faster and more accurate at identifying isolated features than subjects
who learn non-interrelated features. Moreover, causally interrelated, but infrequently mentioned features are identified just as quickly and are rated as being
just as typical as frequently mentioned features. In contrast, participants who
learn non-interrelated features rate infrequently mentioned features as atypical
and identified such features more slowly than frequently mentioned features (Murphy and Allopena, 1994). This effect is not likely to be merely a consequence of
earlier stored correlations about the features of animals. Even if one taught a new,
plausible causal mechanism, it should produce the same effects in a replication of
the Murphy and Allopena study.
When the features available for learning novel instances of natural categories are
interrelated, people use knowledge of these interrelations to guide learning a new
category without having to attend to raw feature frequencies. But when the features
are not interrelated, people must resort to strategies using only the raw statistical
frequencies of features. Thus, knowledge about a domain can aid in learning about
new categories in a domain.
Knowledge from one domain can also be used during the initial learning of
another domain, as shown by research on the acquisition of expertise. Not only
do novices immediately use domain-specific causal theories, they also recruit theories from other domains when analyzing new information. Learners seem compelled to immediately organize new information using causal theories. Yet
similarity is still central to concept learning; it highlights associations needing
further explanation. Thus, explanation and association both play important roles
throughout the course of learning.
In one of the best studied areas of expertise, medicine, there is ample evidence
112
F.C. Keil et al. / Cognition 65 (1998) 103–135
that novices use causal knowledge right from the start. The presence of such theories
is perhaps most evident in the reasoning errors made by both novices and experts.
Beginning medical students assimilate and misconceptualize new medical knowledge because they rely on causal theories learned in school and everyday life. For
example, students may recruit knowledge about kitchen plumbing when trying to
learn properties of the circulatory system (Feltovich et al., 1989). As a result, they
fail to understand the contribution of the elasticity of arteries in amplifying the
power of the heart to pump blood because kitchen pipes are rigid. Even experts in
medicine sometimes mistakenly assume that an enlarged heart implies overstretching of its muscle filaments because they incorrectly recruit knowledge of skeletal
muscles which can become stretched enough to lose efficiency (Feltovich et al.,
1989).
In addition to recruiting knowledge from other domains novices rely heavily upon
causal theories within the new domain. They may even recruit causal theories more
frequently than experts. For example, Boshuizen and Schmidt (1992) asked students
and experts to diagnose case summaries, and found that students engaged more
frequently in explicit biomedical reasoning. When asked, experts could produce
more biomedical information, but only relatively infrequently did they spontaneously refer to this information in their protocols. The acquisition of medical
expertise begins with biomedical knowledge (i.e. rules and theories), and only
later is enriched by associations with the examples gained from practical experience
(Boshuizen and Schmidt, 1992). Even in cases where the medical school curriculum
emphasizes an exemplar-based approach, specific examples are almost always connected with pre-existing, causal, biomedical knowledge (Norman and Schmidt,
1992).
Experts may even use associations more rapidly than novices. Novices appear to
use a seemingly unwieldy process of backward chaining in which reasoning progresses from initial hypotheses to the observation of disease features and back to a
subsequent revision of the diagnosis if the features do not match the hypothesis
(Patel and Groen, 1991). Experts, on the other hand, appear to use forward chaining
in which reasoning jumps directly from features to hypotheses or disease schemas
(Patel and Groen, 1991). This procession from features to hypotheses is typical of an
associative process, yet it is observed more frequently in later stages of the learning
process, not in the initial stages.
Although the medical knowledge literature supports neither a shift from exemplars to causal theories nor a shift from perceptual to conceptual processing, other
contrasts between experts and novices have been taken to support Dogma 1. The best
known example comes from the classification of physics problems. Chi et al. (1981)
found that experts classify problems according to deep structural principles (e.g. the
kinds of physical laws applicable to solving the problem) but that novices classify
based on surface features (e.g. the kinds of objects involved in the physical mechanism under examination). Expert and novice math problem solvers (as defined by
Scholastic Aptitude Test scores) also seem to differ in their use of association
(Novick, 1988). Expert students showed positive transfer from an example word
problem to a new one which shared a structural component—the particular math-
F.C. Keil et al. / Cognition 65 (1998) 103–135
113
ematical solution procedure. Novices, on the other hand, attempted to transfer based
on a match between the surface features of the old and new problems and therefore
often selected the wrong solution procedure.
In both examples, however, the things identified as ‘surface’ features are more
complex than primitive perceptual attributes: ‘surface features’ include non-concrete terms and structural relations between objects. For the physics problems, surface structure was defined as ‘(a) the objects referred to in the problem (e.g. a spring,
an inclined plane); (b) the literal physics terms mentioned in the problem (e.g.
friction, center of mass); or (c) the physical configuration described in the problem
(i.e. relations among physical objects such as a block on an inclined plane)’ (Chi et
al., 1981, p. 125). In Novick’s (1988) mathematical problem solving task, two
problems were said to share surface similarity if they involved the same kind of
activity (e.g. gardening). The characterization of such abstract, relational features as
mere surface features seems to lead inherently to an underestimation of the depth of
processing in which novices engage. Although it might be the case that novices
encode problems more shallowly than experts, it is equally plausible that novices are
relying on a different (possibly more domain-general) set of causal principles to
code the problems. If this were true, then describing novice’s reasoning as ‘shallow’
and more instance-bound would be a mischaracterization.
For example, college students who learn to solve a physics problem involving a
constant change in the speed of some object will often spontaneously transfer the
appropriate equation from this base problem to a target problem concerning a constant change in the rate of population growth. These same participants are less likely
to use this equation to solve a problem concerning a constant change in the rate of
increase in attendance at an annual fair (Bassock, 1996). Because participants
represent speed and population growth as continuously changing quantities, whereas
they represent increases in attendance at an annual fair as a discretely changing
quantity, transfer between the former problems is less difficult than transfer from the
physics problem to the attendance problem (Bassock, 1996). Thus, novices may
engage in sophisticated causal reasoning and mental-model building even if they
do not happen to exploit the particular mathematical principles or structural correspondences considered important by the experimenter. Furthermore, the activities
described in a problem often do signal a particular formal procedure: expert problem
solvers readily use such content, or ‘surface,’ cues to select appropriate solution
schemas (Blessing and Ross, 1996).
In any problem solving situation, even novices exclude certain features from
consideration. For example, in assessing the force required to push a block up an
inclined plane, what novice would seriously consider the color of the block, the
gender of the pusher, or whether the plane ends in the back of a Ryder or a Hertz
rental truck? Rather, the typical novice would employ a whole series of implicit and
explicit theories about the properties of the plane’s surface, the tilt of the plane, and
the weight of the object. A strict empiricist might argue that the process of importing
this information is one of sophisticated association. Perhaps the learner has directly
experienced the process of pushing something along a surface, walking up a plane,
and carrying heavy and light objects. Therefore, each problem element might be
114
F.C. Keil et al. / Cognition 65 (1998) 103–135
associated with a relevant experience. Such a model would, of course, have to be
modified to include more abstract associations than simple perceptual features. It
would require representations of prior problem solving experiences with particular
ways of selecting features. Such an approach is, of course, not new to learning theory
in either its behaviorist (e.g. the concept of a discriminative stimulus) or neural net
instantiation (e.g. adaptive resonance theory; see Carpenter and Grossberg, 1987).
However, by allowing more abstract associations, this model cannot explain
behavior solely through the interaction of a domain-general learning system with
the structure of the environment. One of learning theory’s major goals is to explain
behavior without positing complex representations or domain-specific rules. The
need for constraints on feature selection forces learning theory to adopt just this
type of complex, domain-specific apparatus. Whether feature selections are driven
by domain-specific theories or discriminative stimuli, both require modifications to
supposedly objective associations in order to allow the domain-general apparatus to
accommodate domain-specific regularities. Such modifications imply that existing
knowledge, whether it be association weights, connections, propositions, or images,
cannot be ignored when modeling even the simplest interactions with a stable
environment. To claim that a carefully structured neural network that has been
fed precisely selected information can model behavior only using domain general
learning rules is to purposely black-box all of the domain-specific processes that
would be necessary to get the net to explain anything about real world behavior.
In summary, the development of expertise in adults does not support Dogma 1.
Not only do novices immediately recruit domain specific causal theories or theories
from other domains when acquiring new concepts, but experts continue to rely on a
blend of similarity-based computations and explanatory reasoning. Association and
similarity remain important throughout the course of learning, neither predominating nor becoming relegated to the periphery. Clearly, similarity, perceptual or otherwise, can play an important role in isolating potentially important correlations when
theory runs out or an initial hypothesis is contradicted. But similarity will rarely be
the sole basis for conceptualizing: in real world contexts, expertise develops from a
symbiosis of preconceived theory and experience with new material.
In addition to the expertise differences explicitly adopted by Dogma 1, a second,
less frequently mentioned assumption was prominent in many earlier discussions of
concepts and is still tacitly presupposed in contemporary discussions. The assumption rests on the following line of reasoning: (a) The similarity-based component of
concepts is simpler than the explanation-based aspect (this assumption is clearly
linked to the belief that novices are driven by similarity); (b) Simpler aspects of
cognition are performed more rapidly than complex aspects; (c) Simple, quickly
formed aspects of cognition often serve as preprocessing for more complex aspects;
(d) therefore, in real-time use of concepts (e.g. categorization and induction), the
similarity-based component is processed first, and only later does the explanationbased component come into play.
The idea that initial ‘rough and ready’ or ‘quick and dirty’ processing relies more
heavily on similarity was proposed explicitly in the 1970s (Smith et al., 1974) and
has been left largely unchallenged by contemporary work. Yet, this assumption may
F.C. Keil et al. / Cognition 65 (1998) 103–135
115
not describe real-world cognition. Even in the briefest glance at a scene, viewers
could have theory-based expectations about which features count. For example, a
brief glimpse of a whale may lead to the perception of a fish or of a mammal
depending on which features the viewer considers given a prior theory that emphasizes some features over others. Of course ignorance may lead to mistakes (e.g.
observers may misclassify bats as birds and dolphins as fish), but mistakes reflect the
use of different theory-based expectations not a complete reliance on similarity. Our
perceptions are often influenced by what was causally and explanatorily relevant in
the past. Only with stimuli so arbitrary and decontextualized that no prior information could be relevant will the ‘similarity first’ rule of processing apply.
Although the conclusion that initial processing of new entities is driven solely by
perceptual primitives and the laws of association may be correct under such decontextualized circumstances, we cannot generalize this conclusion to cases of conceptual apprehension in the real world. In some intriguing studies, researchers have
found that people with psychological disorders show Stroop-interference effects for
words related to their own individual diagnosis (for review, see Williams et al.,
1996). These studies suggest that a priori thoughts or emotions can create schemas
that affect early stages of perceptual processing. Research on natural scene processing provides further evidence suggesting that the initial steps in encoding objects
and scenes involve mutual constraint satisfaction in which scene context facilitates
object recognition (Biederman et al., 1982; Rayner and Pollatsek, 1992). Other
research on rapid identification of pictures shows it is possible to detect an object
that does not belong to a target category even when each image is visible for only
114 ms (Intraub, 1981). To detect an exemplar that is not a member of a specified
category requires participants to activate category knowledge at a higher level than
perceptual similarity; perceptual similarity is less useful in this case because the
participant does not know what features define the target. Therefore, on-line expectations about upcoming events can direct early visual processing in searching for
something as abstract as the non-occurrence of a given category of object. In both of
these cases, early object recognition processes are sensitive to the surrounding scene
context and current task demands, which implicate the use of more abstract knowledge than perceptual primitives. We are not resurrecting aspects of the ‘new look’ in
perception (e.g. Bruner and Goodman, 1947; Bruner and Postman, 1947), a movement that foundered in a methodological morass, but we are suggesting that abstract
conceptual information is involved in the interpretation and use of information from
the earliest stages of processing.
5. Dogma 2—Children initially rely on associations and only later begin to use
theories to constrain their concepts
The notion has long persisted that children have different patterns of thought and
different concepts than adults. Such apparent differences suggest the need for a
transition between undeveloped and mature thinking. The transition most frequently
proposed to account for these differences is one in which similarity precedes and
116
F.C. Keil et al. / Cognition 65 (1998) 103–135
acts as the cradle for rules and explanation. In this view, knowledge emerges through
ever more complex associations over perceptual primitives. The child is seen as
initially nothing more than a frequency and correlation detector, and the massive
collections of associations eventually yield impressions of causality, rules, and
explanation. In other words, young children are initially instance bound, but gradually develop principled ways of understanding. Many theorists have argued for a
shift from concrete, instance-based knowledge to abstract, rule-based concepts. For
example, Vygotsky (1962) argued that the development from similarity to rules
parallels the acquisition of language; language serves as a tool for stating and
using rules. Others argued for a shift from concrete representations to abstract
ones (Werner and Kaplan, 1963) or from accidental features to essential ones
(Inhelder and Piaget, 1969).
Despite the prevalence of the concrete to abstract shift in models of development,
no one has ever been able to describe how senses of explanation, mechanism, and
cause gradually arise out of statistical operations over primitives. Models based on
the second dogma offer no plausible account of how the second part of the hybrid
might emerge from the first. Threshold models in which associations of sufficient
strength become causal cannot account for our ability to reject causal explanations
for some cases of perfect, but indirect or arbitrary correlations. Thus, the shadow of a
flagpole at any given time of day may be perfectly correlated with its height, but no
one assumes that the shadow of the flagpole causes it to have a particular height. In
the 1950s in the US, hair length was almost perfectly correlated with gender, but no
one thought that gender caused hair to be of different lengths or that hair length
somehow caused an individual to be a certain gender. In contrast to such threshold
models, which do not capture our ability to posit mediating variables or to reject
nonsensical explanations, recent papers have argued that theory often precedes, or
operates in concert with considerations of similarity (e.g. Simons and Keil, 1995).
The second dogma also encounters the problem discussed earlier in the context of
adult concepts of natural kinds. If children merely tabulate feature frequencies, how
do they know which frequencies and correlations to encode? Even if the features
considered are constrained to a relatively small set of perceptual primitives, the
number of possible relations to tabulate seems overwhelming. Thus, in principle,
children could not acquire a rule-based concept of a natural kind from similarity
relations alone. Not surprisingly, then, research with human infants increasingly
suggests a sensitivity to features and relations that are anything but the sorts of
perceptual primitives embraced by those adopting Dogma 2. Young infants appear
to perceive causal relations over mere contiguities (Leslie, 1995), to evaluate the
rationality of an agent’s actions in relation to its inferred goals (Gergely et al., 1995),
to intermodally integrate information at birth (Gibson and Walker, 1984), to imitate
high order relational patterns (Meltzoff, 1988), and to discriminate biological from
non-biological motion (e.g. Fox and McDaniel, 1982). All of these abilities seem to
require representations of interactions among features or entities. Apparently, infant
concepts are not so impoverished that they are trapped by similarity of perceptual
features.
Several recent findings from work with preschool-aged children also raise serious
F.C. Keil et al. / Cognition 65 (1998) 103–135
117
questions about the developmental progression described by Dogma 2 (Simons and
Keil, 1995; Wellman and Gelman, 1992). For example, children’s inductions about
novel, unobservable properties seem to be based preferentially on information about
category membership as opposed to information about perceptual similarity (Gelman and Markman, 1986, 1987). In such tasks, preschool children are shown an
animal or artifact which is given a label. They are then told about an unobservable
property or part of that object and are asked which of several other objects have that
part. The test objects vary in their perceptual similarity to the original object and in
whether they have the same label as the original. Thus, the task directly pits category
membership against perceptual similarity. Dogma 2 would suggest that preschoolers
form categories using perceptual similarity alone, and that only later do children
begin to use more abstract rules to form categories. If so, then inductions of unseen
properties should be based on the similarity of the original and queried object.
However, even 2-year-old children override perceptual similarity and attribute
unobservable parts to members of the same category (Gelman and Coley, 1990;
Gelman and Markman, 1986). These children also seem to know which properties
should be inferred on the basis of category membership; they know that category
membership is more central than perceptual similarity. Although these findings do
not directly speak to the developmental origins of preschooler’s concepts, they do
suggest that children’s categories are based on more than perceptual similarity. The
findings suggest that ‘even before children can make use of subtle perceptual cues to
determine category membership, they readily use category labels as the basis of their
inferences’ (Gelman and Coley, 1990, p. 803).
Other evidence suggests that preschoolers develop knowledge of unobservable
internal parts in the opposite order to that suggested by Dogma 2. Children initially
have abstract expectations which guide their search for more concrete information
(Simons and Keil, 1995). In these studies, children were shown a target animal or
machine along with a set of potential ‘insides’: the insides of an animal, of a
machine, a pile of blocks, and a pile of rocks. On each trial, children were asked
to pick the insides that belonged with the target animal or machine. Even preschool
children expected the insides of animals and machines to differ, but they did not
have specific expectations for the physical appearance of the insides. Children consistently chose different insides for animals and machines. However, their mistakes
across several studies were particularly revealing. In one study, when they picked
the wrong insides for animals, they picked the rocks rather than the machine insides.
In a second study, when they chose the wrong insides for machines, they tended to
choose the blocks rather than the animal insides. Although children lack concrete
knowledge of the insides of animals and machines, they have abstract expectations
about the differences between these categories and use the expectations to guide
their inductions about unfamiliar properties. By 8 years, children have acquired
enough concrete knowledge that they rarely choose the wrong insides (Simons
and Keil, 1995). Abstract expectations about the differences between animals and
machines guide children’s search for more concrete details; concepts of animals and
machines become more like those of adults by refining broad initial expectations
with specific concrete details. Although studies of children’s knowledge of insides
118
F.C. Keil et al. / Cognition 65 (1998) 103–135
do not directly assess the relationship between perceptual similarity and rule based
components of concepts, they do suggest that young children are guided by rule-like
expectations for the differences between superordinate level categories. Preschoolers know that only natural kind insides belong in animals and only artificial
insides belong in machines. They can use this abstract distinction to draw inferences
without having to rely exclusively on perceptual groupings.
Some have argued that demonstrations like these are based on an excessively
impoverished notion of perceptual similarity. As a result, early categorization is
falsely portrayed as non-perceptual, and thus by necessity conceptual, when a richer
view of perceptual similarity makes clear the primacy of perception in development
(e.g. Jones and Smith, 1993). We need not enter that debate here (but for related
commentary see Barsalou, 1993; Gelman and Medin, 1993; Mandler, 1993; Mervis
et al., 1993; and Smith and Jones, 1993), but we should note that these more
elaborated forms of ‘perceptual similarity’ have little in common with the notion
put forth in Dogma 2. Instead, these richer versions of perceptual similarity include
highly relational and abstract structural properties and are heavily influenced by
context and relevant prior experience. There may be good reasons for attempting to
define a more principled perceptual/conceptual boundary, but they are beyond the
scope of this article.
If infants are sensitive to more complex forms of information from the start, they
must be biased to notice certain feature patterns over others. What, then, is the nature
of these biases and how do they transcend the laws of a raw similarity system? One
developmental model posits very crude initial sensitivity to relational patterns that
pick out some limited domain of information such as the human face. These crude
sensory biases isolate face-relevant information and then learning occurs via
domain-general mechanisms (Johnson et al., 1991). Thus, infants might first pick
out and attend to human faces by relying on the crude triangular configuration of the
eyes and mouth (see Johnson, 1992). Such information may be sent to one region of
the cortex which, other than being the endpoint for face related configural information, has no a priori specialization for face processing. This modern variant of
empiricism accounts for distinct causal explanations by adopting minimal prior
biases (possibly embodied in the neural architecture of the perceptual system)
which help to separate distinct domains into separate representational systems.
An alternative approach argues for initial sensitivity to many complex patterns at
both perceptual and cognitive levels. These might range from the particular temporal
patterning of social interactions to having expectations about the mechanics of
physical objects (Spelke et al., 1992). At a minimum, this approach requires early
sensitivity to informational patterns which uniquely specify important kinds in the
world. Positing sensitivity to complex patterns or invariants in the environment does
not require that infants have theories, conscious or otherwise, or even sets of beliefs,
but it does require a learning system that is tuned to particular kinds of causal
patterning and that uses such tuning to build and structure knowledge in a domain.
Despite these arguments and evidence against Dogma 2, it continues to resist extinction. One reason for its persistence arises from the tendency of some to mistake other
developmental patterns for versions of Dogma 2.
F.C. Keil et al. / Cognition 65 (1998) 103–135
119
6. Mistakenly interpreting developmental trends as evidence for Dogma 2
In recent years, more sophisticated versions of Dogma 2 have arisen that are not
so easily rendered implausible. Rather than simply positing a shift from similarity to
rules, recent models argue for more subtle transitions. For example, the acquisition
of word meanings and their corresponding concepts seems to follow a ‘characteristic-to-defining shift’ (Keil and Batterman, 1984; Keil, 1989). Other similar developmental trends include a shift from processing surface to deep features (Chi, 1992)
and a gradual transition from noting properties, to detecting relations between properties, and finally to detecting relations between relations (Gentner and Toupin,
1988). In all of these cases, the first kind of knowledge seems more like similarity
and the second more like rules and/or explanations. Real developmental changes
have inspired all of these claims (as well as all earlier ones), but a closer look at the
nature of children’s knowledge suggests that the changes have little to do with the
pattern described by Dogma 2.
Consider, for example, the shift from characteristic to defining features. Young
children regularly identify instances as members of a category on the basis of shared
typical or characteristic features even when ‘defining’ ones are absent. For example,
young children claim that a gift-giving, male, unrelated adult is an uncle, but an
adolescent who is their father’s brother is not. These children seem to be tabulating
the features most frequently associated with uncles and neglecting any use of the
kinship rule. Older children accurately reject such cases, but accept ones that have
almost no characteristic features when a defining feature is present (Keil, 1989).
A closer look at the characteristic to defining shift, however, reveals that it is not
truly a shift from similarity to rules, or from association to definitions. Even the
youngest children never rely solely on tabulations of the most perceptually salient
features associated with instances and adults rarely rely exclusively on rules or
explanations. The features children tabulate in demonstrations of a characteristic
to defining shift are always constrained; they limit their consideration even of
characteristic features to those likely to be relevant. For example, even if every
‘uncle’ a child encounters is seen wearing glasses, it is unlikely that the child
would weigh this feature as heavily as certain social, behavioral, and personality
traits that are characteristic of uncles. Typicality is always harnessed to the particular explanatory framework currently active; which typical features are taken to be
‘characteristic’ depends critically on the child’s cognitive ‘point of view’.
More broadly, younger children can differ from older ones in two ways that give
the illusion of reinforcing Dogma 2: (1) they know less about the world; and (2) they
may have different biases about what information they regard as explanatorily
relevant. In the first case, children might well be ignorant of specific causal mechanisms or rules underlying a phenomena and therefore may be forced to rely on notions
of causal frameworks much sooner than adults. Furthermore, given the necessary
vagueness of those frameworks, they may in turn have to rely more heavily on
similarity to structure the information they encounter. However, they do not and
could not ever use similarity exclusively. At most, there is a shift in the ratio of their
use of similarity-based vs. explanation-based knowledge. But by neglecting the less
120
F.C. Keil et al. / Cognition 65 (1998) 103–135
precise aspects of explanatory knowledge (such as notions of causal potency),
researchers might mistakenly attribute to children a full association to explanation
shift. In the second case, when trying to assimilate new information, a child may
implicitly adopt an inappropriate theory (e.g. psychological versus biological; see
Carey, 1985) that makes it appear as if she is ignoring critical features and simply
tabulating features indiscriminately when, in fact, she is using a theory, albeit one
inappropriate to the domain.
7. Causal potency of properties: an initial study
If explanatory knowledge constrains initial tabulations of similarity, how is such
knowledge represented? The early explanatory component of concept representation
might consist of a discrimination between causally central and causally peripheral
properties even when specific mechanisms are unknown. Such knowledge must
distinguish between causal centrality and typicality of properties (even though
they are often closely related). Knowledge of this distinction is easy to demonstrate
in adults. For example, adults view the property ‘curvedness’ as equally typical of
bananas and boomerangs, but do not regard it as equally central to determining
category membership (Medin and Shoben, 1988). Although a straight ‘banana’
could still be a banana, a straight ‘boomerang’ would be a stick. Research on the
causal centrality of properties typically asks participants to consider a hypothetical
situation in which a typical property (curvedness) is not present, and then to judge
whether this counterfactual vitiates the inclusion of an instance in a category. We
have used such counterfactual questions to examine the properties that adults believe
are causally central to categories such as animals, artifacts and non-living natural
kinds. Importantly, judgments of centrality are distinct from judgments of typicality;
the most typical properties are not always thought to be causally central (Keil and
Smith, 1996).
Although several researchers have considered the distinction between typicality
and centrality with adults, the development of notions of causal centrality has
remained largely unexplored. Children under 6 years of age usually do not greet
sets of counterfactual questions with much pleasure. Often, they simply refuse to
entertain the counterfactual (but see Harris et al., 1996). However, by describing a
novel category and posing less structurally complex questions, we have been able to
explore intuitions about causal centrality in children as young as 5 years.
In the following study, we read stories about a novel kind of animal (‘glicks’) and
a novel kind of machine (‘nilards’) to 5-, 7-, and 9-year-olds. Each story described
six property types: size, weight, color, surface markings, number of important inside
parts, and appearance of functional outside parts. Subsequently, we asked children if
other instances of the described category had to share the same property as those
described in the story, or whether something could still belong to the category if it
differed on a particular property type (e.g. if a something had a different color than
the glicks described, could it still be a glick?). Because both the machine and animal
stories described the same property types, we were able to assess to degree to which
F.C. Keil et al. / Cognition 65 (1998) 103–135
121
children believed that a particular property type was relevant to a decision about
category membership for different ontological kinds1.
8. Method
8.1. Participants
Sixty-eight children participated: 22 5-year-olds, 24 7-year-olds, and 22 9-yearolds (mean ages, 5 years 6 months, 7 years 6 months, 9 years 6 months). Children
were tested individually outside their preschool classroom. Each session lasted
roughly 10 min.
8.2. Materials and procedure
The stimuli consisted of two animal stories and two machine stories. Each child
heard one animal and one machine story, with half of the children of each age
hearing a ‘glick’ and ‘nilard’ pair and the other half hearing a ‘bleek’ and ‘jullet’
pair. The artificial labels, ‘glick’ and ‘bleek,’ each referred to a novel kind of animal;
‘nilard’ and ‘jullet’ each referred to a novel kind of machine. The order of presentation of the stories was counterbalanced across subjects. Each story contained a set of
property types that could in fact apply to either an animal or machine, and only an
introductory statement told the children the category (animal or machine). Because
each child heard an animal story and a machine story, the stories were not word for
word duplicates in terms of the properties described: although both stories indicated
the color of the ‘glick’ or ‘nilard’, in one story this color was black while in the other
story this color was, for example, yellow. This design was necessary to avoid
situations in which a child might confuse the two stories; however both stories
contained the same six property types. (See Tables 1 and 2 for examples).
After hearing a story, children responded to a series of questions about the properties mentioned in the story. Specifically, they were asked whether the object they
heard about would be the same kind of thing if a target property were changed. For
example, they might be asked ‘Do you think that all Glicks have to have black
stripes on their backs, or could something still be a Glick even if it didn’t have black
stripes on its back?’ We explored six different property types presented in a pseudorandom order for each child: surface markings (e.g. stripes), the number of internal
parts, the shape of external parts, color, size, and weight.
To make sure children understood the task, two control questions were asked:
‘Could something still be a Glick even if it didn’t have dirt on its tail?’ and ‘Could
something still be a Glick if it was made out of butter?’ All but one child answered
these control questions correctly (‘yes’ for the first question and ‘no’ for the second);
this child’s data were not included in the analyses.
1
To avoid confusion, we distinguish between property types and particular properties. That is, color is a
property type, while ‘red’ is a particular property or instantiation of that property type.
122
F.C. Keil et al. / Cognition 65 (1998) 103–135
Table 1
Animal story and questions read to 5-, 7-, and 9-year-olds
There is a kind of animal called a Glick. Have you ever heard of a Glick?
I certainly had not, but the other day, I was walking through the woods, and
I saw one. Do you want to hear what the Glick looked like? Well, it was
brown, had black stripes on its back, was about this big [gesture] and
weighed about 10 pounds. It also had 26 really important parts on the inside
of it. The Glick I saw really liked to eat berries, and had four parts on the
outside of it that it used to pick apart the berries. It also was sitting in a tree
with 16 branches and had a little bit of dirt on its tail.
Do you think Glicks really have to have dirt on their tails, or could something still be a Glick even if it didn’t have dirt on its tail?
Do you think Glicks have to be the same size as the one I saw, or could
something still be a Glick even if it was a different size?
Do you think that Glicks have to be brown, or could something still be a
Glick even if it was a different color?
Do you think that all Glicks have to have black stripes on their backs, or
could something still be a Glick even if it didn’t have black stripes on its
back?
Do you think that all Glicks have to weigh the same as the Glick I saw, or
could something still be a Glick even if it weighed something different?
Do you think that all Glicks have the same number of important inside
parts, or could something still be a Glick even if it had a different number of
important inside parts?
Do you think that all Glicks have the same kind of parts on the outside of it
or could something still be a Glick and have different looking parts on the
outside?
Could something be made of butter and still be a Glick?
[Dirt]
[Size]
[Color]
[Surface markings]
[Weight]
[# Of inside parts]
[Shape of outside parts]
[Butter]
8.3. Results
A belief that the hypothetical object with the changed property was still a member
of the labeled category was coded as ‘0’, and a belief that changing the property
precluded category membership was coded as ‘1’. These scores correspond to how
relevant a child thought a particular attribute was to determining an object’s category membership. And, when averaged across children, they provide the proportion
of children of each age group who believed that property to be central.
The mean responses for each age group are represented graphically in ‘spider
web’ plots (see Figs. 1–3). Each axis radiating from the center represents one
property, and distance from the origin indicates the degree to which children judged
the property to be central (i.e. that it had to be shared by all members of the labeled
category). For example (see Fig. 1), whereas about 75% of the children (averaging
across age) thought that all animals called ‘glicks’ had to have outside parts that
looked the same, only 20% of the children thought that all machines called ‘nilards’
had to share similar looking outside parts.
A 3 (Age: 5, 7, or 9 years) by 2 (Kind: animal or machine) by 6 (Property: stripes,
color, size, weight, # of internal parts, shape of external parts) mixed-design
ANOVA with Kind and Property as repeated measures compared children’s
123
F.C. Keil et al. / Cognition 65 (1998) 103–135
Table 2
Machine story and questions read to 5-, 7-, and 9-year-olds
There is this kind of machine called a Nilard. Have you ever heard of a
Nilard? I certainly had not, but the other day, I was walking through the
hardware store, and I saw one. Do you want to hear what the Nilard looked
like? Well, it was yellow, and had green stripes on it, was about this big
[gesture] and weighed about 50 pounds. It also had 30 really important parts
on the inside of it. The Nilard I saw is used to dig holes in the ground, and
had 5 parts on the outside of it that would dig those holes. It was sitting on a
shelf and had some dust on it.
Do you think Nilards really have to be dusty, or could something still be a
Nilard even if it wasn’t dusty?
Do you think Nilards have to be the same size as the one I saw, or could
something still be a Nilard even if it was a different size?
Do you think that Nilards have to be yellow, or could something still be a
Nilard even if it was a different color?
Do you think that all Nilards have to have green stripes on their backs, or
could something still be a Nilard even if it didn’t have green stripes on its
back?
Do you think that all Nilards have the weigh the same as the Nilard I saw, or
could something still be a Nilard even if it weighed something different?
Do you think that all Nilards have the same number of important inside
parts, or could something still be a Nilard even if it had a different number
of important inside parts?
Do you think that all Nilards have the same kind of parts on the outside of it
or could something still be a Nilard and have different looking parts on the
outside?
Can something be made of butter and still be a Nilard?
[Dirt]
[Size]
[Color]
[Surface markings]
[Weight]
[# Of inside parts]
[Shape of outside parts]
[Butter]
responses to the property changes for each story2. The analysis revealed significant
main effects only of Property (F(5,58) = 21.94, P , 0.0001) and Kind (F(1,62) = 22.69,
P , 0.0002) and a significant Property × Kind interaction (F(5,58) = 21.87, P ,
0.0001)3. No other main effects or interactions reached significance, although the
main effect of Age approached significance (F(2,62) = 2.481, P = 0.092). Post-hoc
analyses confirmed that each age group showed this same pattern of results. For each
age group a 2 (Kind) by 6 (Property) repeated measures ANOVA revealed significant main effects of, and a significant interaction between, kind and property: For 5year-olds, Kind F(1,21) = 3.80, P = 0.065, Property F(5,17) = 4.83, P = 0.006, Property by Kind F(5,17) = 6.39, P = 0.002; for 7-year-olds, Kind F(1,21) = 7.09, P = 0.015,
Property F(5,17) = 6.97, P = 0.001, Property by Kind F(5,17) = 7.91, P , 0.0006; for
9-year-olds, Kind F(1,20) = 13.81, P = 0.001, Property F(5,16) = 16.37, P , 0.0001,
2
Three participants (two 7-year-olds and one 9-year-old) were excluded from these overall analyses
because their responses to one of the property questions were ambiguous and were coded as missing
values. These subjects were included in analyses of the individual properties for which they gave unambiguous responses.
3
We do not report MSEs here because they are meaningless for repeated factors having more than two
levels. In mixed designs such as ours, the standard univariate tests of significance, with their accompanying neatly partitioned sums of squares, are invalid due to violations of the assumption of compound
symmetry (see Winer, 1971). We thank Richard Darlington for pointing this out.
124
F.C. Keil et al. / Cognition 65 (1998) 103–135
Fig. 1. Mean proportion of children (averaging across ages) who judged that all members of a novel
category of animals or machines must share a given property. Each axis of the radial graph represents one
of the properties about which children were asked. Distance from the center corresponds to the proportion
of children (out of 68 total) who indicated that all members of a particular category had to share the same
property (e.g. having black stripes) as the exemplar described in the story. Thus, high values on an axis
indicate the relative centrality of the designated property to category membership. The dashed plot line
shows the response profile for the novel machine category and the solid plot line shows the response
profile for the novel animal category.
Property by Kind F(5,16) = 8.23, P , 0.0006. In addition, we conducted follow-up
analyses to examine the effect of category for each property individually. For each
property, we subtracted responses to the machine story from responses to the animal
story to get a single difference score for each participant. Six separate one-way
ANOVAs compared the three age groups (5, 7, and 9 years) for each of the six
properties. No significant main effects were found for any property (color, F(2,65) =
2.31, P . 0.10; surface markings, F(2,65) = 1.66; weight, F(2,65) , 1; inside parts,
F(2,65) , 1; outside parts, F(2,65) = 1.67; size, F(2,65) , 1). Thus, children of all ages
tested were sensitive to differences among the properties they were asked to consider and to the differential relevance of particular attributes to animals vs.
machines.
In order to directly assess the significance of the differences in the profiles for
animals and machines, we averaged across the age groups and conducted McNemar
tests for each property4. Here, the McNemar test considers cases in which children
responded that changing that property casts doubt on category membership for one
kind but not on the other (e.g. all ‘glicks’, a novel animal, have to have stripes, but all
‘nilards,’ a novel machine, do not). Children significantly differentiated animals and
4
The McNemar test compares responses on two related, dichotomous variables based on a x2-distribution. The standard x2-test is inappropriate when a single participant would be counted in multiple cells.
F.C. Keil et al. / Cognition 65 (1998) 103–135
125
Fig. 2. Proportion of children at each age tested (5-, 7-, and 9-years-old) who judged that all members of a
novel category of machines must share a given property. Small dashed line shows response profile of 5year-olds; solid line shows response profile of 7-year-olds; and large dashed line shows response profile of
9-year-olds. To interpret this radial graph, see detailed description of similar graph in Fig. 1.
Fig. 3. Proportion of children at each age tested (5-, 7-, and 9-years-old) who judged that all members of a
novel category of animals must share a given property. Small dashed line shows response profile of 5year-olds; solid line shows response profile of 7-year-olds; and large dashed line shows response profile of
9-year-olds. To interpret this radial graph, see detailed description of similar graph in Fig. 1.
126
F.C. Keil et al. / Cognition 65 (1998) 103–135
Table 3
Number of children (n = 68 total) who judged that a property change was, or was not, relevant to
membership in an Animal vs. a Machine novel category
Changed property
Irrelevant to
both animal
and machine
categories
Relevant to
both animal
and machine
categories
Relevant to
animal but
not to machine
categories
Relevant to
McNemar x2b
machine but
not to animal
categories
Kind of outside partsa
# Of inside parts
Color
Surface markings
Size
Weight
19
10
41
26
44
39
10
32
2
7
3
8
34
20
18
34
2
5
2
6
7
1
19
16
30.44***
7.54**
4.84*
31.11***
13.76**
5.76*
a
Three participants gave ambiguous responses and are not counted for this property.
Whenever the expected frequency of a McNemar cell is below 5, the McNemar x2 tends to be inflated; in
these cases, we relied on the exact binomial formula to obtain a P-value. x2 degrees of freedom=1.
***P , 0.0001; **P , 0.005; *P , 0.05.
b
machines for all six properties. Changing the shape of outside parts had a greater
effect on animals than machines (i.e. changing the shape of outside parts disrupted
category membership more for animals than for machines). Similarly, changing the
number of inside parts, the surface markings, and the color had a greater effect on
animals than machines. Changing the size and the weight mattered more for
machines than for animals. See Table 3 for the frequencies of the four possible
response patterns.
9. Discussion
This study revealed that children as young as 5 years recognize the differential
importance of specific property types for animals and machines. The majority of
children indicated that all members of a novel animal category must share the same
color, surface markings, number of inside parts and appearance of outside parts5.
The same children thought that members of a novel machine category could vary
along these attributes yet still belong to the category. In contrast, children indicated
that members of an animal category could vary more in size and weight than could
members of a machine category.
Interestingly, the pattern of responses shows no systematic developmental
changes (as confirmed by the post hoc analyses; see Figs. 2 and 3). This consistency
5
As long as the outside parts of a particular machine perform the same function as other members of its
class, the actual appearance of those parts might vary; but the same is not likely to be true for members of
an animal class: if the outside parts look different despite serving the same function for the animal, we
would probably think that the animal belonged to a different class. For example, both squirrels and
nuthatches dig holes to cache food, but they do not have similarly shaped parts for digging those
holes. In contrast, different lamps may have highly dissimilar parts for turning them on and off, yet we
would hardly consider this difference significant enough to warrant classifying lamps on the basis of the
appearance of their on/off switches.
F.C. Keil et al. / Cognition 65 (1998) 103–135
127
indicates that children of different ages had the same notions about which types of
properties were most central to each category. Across a substantial developmental
period notions of centrality for these sets of properties remain remarkably stable.
Given the similarity of response profiles across ages 5, 7, and 9, there is no basis to
presume that children even younger than those tested here would have radically
different knowledge about the fundamental differences in the causal relevance of
particular properties to particular kinds of things.
We have suggested that children were responding on the basis of causal notions.
Children might not, however, have clear notions of the causal reasons for the
differential importance of, say, color, to animals versus machines. Instead, such
notions might develop from an accumulation of observations about property variation rates within different kinds. If so, the children in this study may have based their
responses not so much on an understanding of the causal importance of property
types to particular ontological kinds but rather on this accumulation of frequencybased information6. The children’s protocols suggest otherwise. Although children
varied in their verbal precocity, those children who did justify their responses often
did so in causal terms. For example, a large proportion of 5-year-olds spontaneously
appealed to notions of ‘growth’ to explain why all ‘glicks,’ a kind of animal, did not
have to be the same size. We did not conduct a systematic protocol analysis (partly
because younger children are notoriously bad at articulating explanations), but we
suggest that if children were asked to choose between a covariation explanation and
a causal explanation (similar to Ahn et al., 1995) for these results, they would
overwhelmingly prefer the causal account.
These data may not, then, rule out interpretations that appeal to frequency tabulations, but they do pose a challenge to such accounts. If younger children are insensitive to causal potency and must rely solely on correlational evidence, then when do
they start using causal notions—do causal notions click into place once a certain
threshold of correlational evidence has accumulated? But then, where do such causal
notions come from? Certainly children are not explicitly taught why, for example,
surface markings are relevant to what an animal is but not to what a machine is.
Furthermore, correlational accounts, in which information accrues from concrete
instances in the absence of later-developing abstract notions, would predict that
local explanations would develop first only to be replaced later by abstract notions
of how the mechanisms of growth, inheritance, and human intention apply broadly
to animals or artifacts in general. That is, children should first generalize locally
before generalizing observations about properties to all animals as a broad class. But
in our study, the animal is a novel one, and the description gives no information that
would allow local generalizations—indeed we forced children to consider ‘glicks’
as a specific type of animal, a very broad level of classification (i.e. as opposed to a
specific type of mammal, or pet, or insect). Finally, such accounts would seem to
have particular difficulty accounting for children’s belief that all animals of the same
kind should have the same number of inside parts, given that children very likely
have not ever seen, much less counted, the inside parts of various animals.
6
We thank an anonymous reviewer for pointing this out.
128
F.C. Keil et al. / Cognition 65 (1998) 103–135
These results also accord with a related task in which two groups of 8-year-olds
were familiarized with a single set of features, half of which characterized one
category, half of which characterized a contrasting category (Barrett et al., 1993).
One group of children was told that the features described two types of animals,
while the other group was told that those same features described two types of tools.
Subsequently, children’s classification of test items (with novel combinations of the
familiar features) differed, depending on whether or not the features had been
introduced as belonging to two kinds of animals or two kinds of tools. Thus,
these older children’s knowledge was structured and guided by intuitive notions
of the differential importance of certain features to animals vs. tools.
As suggested by the infant research reviewed earlier, these basic notions of causal
centrality may emerge early, possibly as early as sensitivity to the typicality of
properties. We are not suggesting that infants innately ‘know’ patterns of causal
centrality for different kinds (although even that possibility has not been ruled out).
Instead, we are suggesting that the ability to perceive and learn causal patterns may
be just as fundamental as the ability to learn typicality and frequency distributions;
typicality and causal centrality may go hand in hand in development.
10. Differential knowledge of local causal mechanisms
Knowledge about things such as living kinds and machines does develop. A
framework of causal understanding of the differential centrality of properties for
different kinds is only that—a framework. Within this framework, a great deal of
cognitive growth and conceptual change occurs. Although the results of our study
revealed no developmental differences, dramatic developmental changes can occur
even for the properties we examined. If we queried details about the causal mechanisms underlying a given property, undoubtedly, we would see developmental
changes. However, our questions required only a preliminary understanding of
general causal patterns. We did not ask children, for instance, about the inner workings of a gasoline engine, the adaptive value of claws vs. hooves, or how being white
helps a polar bear. This distinction between detailed knowledge of local causal
mechanisms and less precise frameworks is an important one, as we emphasized
earlier. Failure to acknowledge this distinction has sometimes led to mistaken
endorsements of Dogma 2 and may foster false controversies regarding developmental change when some research focuses on local mechanism knowledge and
other research emphasizes general explanatory frameworks (see Simons and Keil,
1995 for discussion).
We have recently begun to examine more carefully which aspects of causal
knowledge emerge early in development to form a relatively fixed explanatory
framework and which show change with increasing age. A full account of this
distinction will require an extensive set of studies, but the general point can be
illustrated through thought experiments. The study reported in this paper demonstrated that, during a developmental period marked by many changes in detailed
knowledge of living kinds and artifacts, a general framework of causal knowledge is
F.C. Keil et al. / Cognition 65 (1998) 103–135
129
likely to exist early on, side by side with a sensitivity to typicality information. This
somewhat counterintuitive prediction required an initial empirical demonstration.
One other, more intuitive pattern will almost certainly be found, although the details
of the developmental course can only be revealed through future studies:
For sufficiently complex mechanisms, even most adults will not have a precise
causal understanding (e.g. Au, in press). Thus, we might expect there to be relatively
stable causal intuitions across lay people and little developmental change in these
intuitions, but more dramatic, though highly specific, expert-novice differences.
That is, only the most sophisticated experts would have such knowledge, and
even for them it would likely be incomplete. An extreme example would be
properties that no one outside of the leading laboratories has ever encountered.
But even when almost everyone has heard of a property, only a few experts
may understand how it is involved in specific mechanisms. For example, color is
a salient property for both machines and animals, but only expert chemists might be
able to explain the chemical composition of paint pigments or the biological underpinnings of skin and fur pigments. Furthermore, such experts are likely to be the only
ones who could explain how the organic and inorganic compounds in animals and
machines produce colors differently. Similarly, only certain ecologists would be
able to explain in detail how a particular pattern of coloration distinct to a species
has adaptive value. For each of the properties used in our study, we could pose
similar sorts of questions about mechanisms that only a tiny fraction of adults would
know. Moreover, it would be almost trivial to demonstrate that some aspects of
knowledge about any particular mechanism take a lifetime of devoted study to
acquire.
Aside from such cases of extreme expertise, lay knowledge of both causal
mechanisms and typicality often will develop during childhood. Without further
careful studies, we cannot easily predict the developmental time course for particular properties, but few would doubt that many such cases exist. For example,
children of different ages would likely give quite different answers to questions
about the local mechanisms at work in such domains as biology, chemistry, and
physics: ‘Is the process that makes things rust like the process that makes people get
grey hair, or is it more like the process that makes batteries run down?’ In our
preliminary studies we have found that answers to such detailed questions about
color, size, and weight may change with age. For example, 5-year-olds do not
distinguish between an animal and a machine when asked which is more likely to
change its weight during the course of a day, but older children and adults think
animals are more likely to show weight fluctuations.
11. Dogma 2: summary
Children do learn as they get older, but the sorts of things they are capable of
learning may not undergo qualitative changes such as a shift from similarity to
causal frameworks or rules. Young children surely know fewer detailed mechanisms
in domains ranging from the germ theory of disease to the actions of levers and
130
F.C. Keil et al. / Cognition 65 (1998) 103–135
fulcrums. They will also know less about typicality distributions of disease types and
machine types. They may also acquire some new frameworks of causal understanding. However, our study suggests that such causal frameworks can be stable across
large periods of development. Even when children default to typicality to distinguish
classes of entities, they do not necessarily undergo a shift from strictly similaritybased reasoning to explanation-based reasoning. The illusion of this shift results
from our neglect of their more abstract forms of non-associative knowledge.
12. The unnecessary commitments of empiricism
We have not yet explained why the two dogmas have influenced so much research
in cognitive science. One powerful reason may be a tendency within cognitive
science to adopt the several unnecessary commitments of the empiricist approach
to the acquisition of knowledge. The strongest version of empiricism adopts the
view that all knowledge and constraints on learning are acquired through the senses.
This extreme view is untenable. Even the British empiricists (e.g. Locke) understood
the need for constraints on knowledge acquisition; the sensory organs are clearly
tuned to different kinds of information. Rather, they argued that there are no specialized knowledge acquisition devices tuned to different kinds of information.
Instead a single general learning capacity accounts for all knowledge acquisition
(e.g. Hume’s account of learning about causation). Most nativists objected not to the
absence of innate beliefs or knowledge but to the lack of domain-specific knowledge
acquisition systems. They believed that different systems were biased to acquire
different kinds of knowledge, that knowledge is constrained both by the sensory
organs and by cognitive biases as well.
If these were the only differences between nativists and empiricists (and in fact
they seem to be the only reliable, principled differences), then neither group should
be particularly disposed towards the two dogmas. However, many empiricists tend
to make additional assumptions that do not logically follow from their fundamental
position but which lead to the dogmas. Specifically, they assume that the domaingeneral learning mechanism relies on association across a stock of perceptual primitives provided by the sensory apparatus. Accordingly, they believe that new
associations must build on earlier associations such that higher-order relations can
only be represented after all of their constituents are firmly in place. Following
Hume, researchers often assume that causal relations can only be appreciated
after they are built from pre-existing correlational constituents.
There is, however, no obvious reason why a domain-general learning device must
rely on associations among sensory primitives. In fact, the strong form of this
empiricist model is clearly false. Infants perceive intermodally at birth and seem
to perceive causal relations as early as they have been tested (e.g. Leslie, 1982, 1984;
Leslie and Keeble, 1987). In addition, newborns can imitate the facial expressions
and body movements of their parent, revealing a sensitivity to particular kinds of
complex, relational information from the start. This message, of course, was the key
theme of the Gibsonian view of perceptual development. The Gibsons argued that
F.C. Keil et al. / Cognition 65 (1998) 103–135
131
we are all capable, at any age, of picking out complex relational patterns in the
environment, especially those that form invariants that could be used to guide action
(Gibson, 1966, 1979; Gibson, 1991). Their position was not clearly nativist or
empiricist because it did not commit to a domain-general or domain-specific system
for detecting invariants. We see no reason why a similar approach cannot be
extended to the apprehension of relations that are more cognitive in nature (an
extension that was not proposed by the Gibsons). Perhaps a domain-general learning
system could be sensitive to and immediately encode such relations as cause, containment, and temporal precedence. If so, we could quickly learn abstract relational
patterns that have immediate behavioral consequences. Such a learning system
would fit with empiricist principles to the extent that it applied equally well to all
sorts of information. There appear to be some innovative and clever attempts to
further such a position. Mandler’s early image schemas could possibly be interpreted
in this way (see Mandler, this issue). For example, containment may be a fundamental image schema accessible to young infants (Baillargeon et al., 1995).
The nativist perspective argues that different aspects of the mind are innately
tuned or optimized for picking up different kinds of high-level relational information. Thus, one aspect may be optimized for understanding physical, mechanical
causation and another for social causation. The difference between empiricist and
nativist accounts is closely linked to the number of distinct learning systems. If there
are only two different learning systems that are optimized for two very large
domains (e.g. social relations and mechanical forces), the differences between
empiricist and nativist accounts are less dramatic. Alternatively, if there are thousands of distinct systems with highly local biases and prejudices, the differences
between nativists and empiricists are large but potentially uninteresting as each
highly local bias appears trivial and hardly the basis for a system of knowledge or
explanatory insight. The middle ground between these extremes, with about a dozen
different learning systems, may best capture the heterogeneous causal structure of
the world without sacrificing coherency.
Our purpose here is not to determine whether particular kinds of knowledge
acquisition problems are best solved by empiricist or nativist approaches, nor is it
to argue that only one of these approaches fits with hybrid views of concepts. We do
suggest, however, that a certain brand of empiricism seems to lead naturally to the
two dogmas. Unfortunately, that brand seems to dominate many current models of
how we acquire and use concepts.
Acknowledgements
F.K. was supported by NIH grant R01-HD23922 and D.S. was supported by a
Jacob K. Javits fellowship. Many thanks to Bethany Richman for helping to design
the study and for collecting all of the data. Thanks also to the children, parents, and
teachers at the Ithaca area daycares and schools who made this research possible.
We thank three anonymous reviewers for comments that helped to strengthen and
clarify our arguments in several areas.
132
F.C. Keil et al. / Cognition 65 (1998) 103–135
References
Ahn, W., Kalish, C.W., Medin, D.L., Gelman, S.A., 1995. The role of covariation versus mechanism
information in causal attribution. Cognition 54, 299–352.
Anderson, J.R., 1976. Language, Memory, and Thought, Earlbaum, Hillsdale, NJ.
Armstrong, S., Gleitman, L., Gleitman, H., 1983. What some concepts might not be. Cognition 13, 263–
308.
Au, T., Romo, L., in press. Mechanical causality in children’s ‘‘folkbiology’’. In: Medin, D.L., Atran, S.
(Eds.), Folkbiology. MIT Press, Cambridge, MA.
Baillargeon, R., Kotovsky, L., and Needham, A., 1995. The acquisition of physical knowledge in infancy.
In: Sperber, D., Premack, D., Premack, S.J., Causal Cognition: A multidisciplinary debate, Clarendon
Press, Oxford.
Barrett, S.E., Abdi, H., Murphy, G.L., Gallagher, J.M., 1993. Theory-based correlations and their role in
children’s concepts. Child Development 64, 1595–1616.
Barsalou, L.W., 1993. Challenging assumptions about concepts. Cognitive Development 8, 169–180.
Bassock, M., 1996. Using content to interpret structure: effects on analogical transfer. Current Directions
in Psychological Science 5, 54–58.
Biederman, I., Mezzanotte, R.J., Rabinowitz, J.C., 1982. Scene perception: detecting and judging objects
undergoing relational violations. Cognitive Psychology 14, 143–177.
Blessing, S.B., Ross, B.H., 1996. Content effects in problem categorization and problem solving. Journal
of Experimental Psychology: Learning, Memory, and Cognition 22, 792–810.
Boshuizen, H.P.A., Schmidt, H.G., 1992. On the role of biomedical knowledge in clinical reasoning by
experts, intermediates and novices. Cognitive Science 16, 153–184.
Brown, A.L., 1990. Domain-specific principles affect learning and transfer in children. Cognitive Science
14, 107–133.
Bruner, J.S., Goodman, C.C., 1947. Value and need as organizing factors in perception. Journal of
Abnormal Social Psychology 42, 33–44.
Bruner, J.S., Postman, L., 1947. Emotional selectivity in perception and reaction. Journal of Personality
16, 69–77.
Carey, S., 1985. Conceptual change in childhood, MIT Press, Cambridge, MA.
Carpenter, G.A., Grossberg, S., 1987. A massively parallel architecture for a self-organizing neural
pattern recognition machine. Computer Vision, Graphics, and Image Processing 37, 54–115.
Chapman, L.J., Chapman, J.P., 1967. Genesis of popular but erroneous diagnostic observations. Journal of
Abnormal Psychology 72, 193–204.
Chapman, L.J., Chapman, J.P., 1969. Illusory correlation as an obstacle to the use of valid psychodiagnostic signs. Journal of Abnormal Psychology 74, 272–280.
Cheng, P.W., Novick, L.R., 1990. A probabilistic contrast model of causal induction. Journal of Personality and Social Psychology 58, 545–567.
Cheng, P.W., Novick, L.R., 1992. Covariation in natural causal induction. Psychological Review 99,
365–382.
Chi, M.T.H., 1992. Conceptual change within and across ontological categories: Examples from learning
and discovery in science. In: Giere, R. (Ed.), Cognitive models of science: Minnesota studies in the
philosophy of science, University of Minnesota Press, Minneapolis, MN.
Chi, M.T.H., Feltovich, P.J., Glaser, R., 1981. Categorization and representation of physics problems by
experts and novices. Cognitive Science 5, 121–152.
Cook, M., Mineka, S., 1989. Observational conditioning of fear to fear-relevant versus fear-irrelevant
stimuli in rhesus monkeys. Journal of Abnormal Psychology 98, 448–459.
Cook, M., Mineka, S., 1990. Selective associations in the observational conditioning of fear in rhesus
monkeys. Journal of Experimental Psychology: Animal Behavior Processes 16, 372–389.
Feltovich, P.J., Spiro, R.J., Coulson, R.L., 1989. The nature of conceptual understanding in biomedicine:
the deep structure of complex ideas and the development of misconceptions. In: Evans, D.A., Patel,
V.L. (Ed.), Cognitive Science in Medicine: Biomedical Modeling, MIT press, Cambridge MA, pp.
113–172.
F.C. Keil et al. / Cognition 65 (1998) 103–135
133
Fox, R., McDaniel, C., 1982. The perception of biological motion by human infants. Science 218, 486–
487.
Garcia, J., Koelling, R.A., 1966. The relation of cue to consequence in avoidance learning. Psychonomic
Science 4, 123–124.
Gelman, R., 1990. First principles organize attention to and learning about relevant data: number and the
animate-inanimate distinction as examples. Cognitive Science 14, 79–106.
Gelman, S.A., Coley, J.D., 1990. The importance of knowing a dodo is a bird: categories and inferences in
2-year-old children. Developmental Psychology 26, 796–804.
Gelman, S.A., Markman, E.M., 1986. Categories and induction in young children. Cognition 23, 183–
209.
Gelman, S.A., Markman, E.M., 1987. Young children’s inductions from natural kinds: the role of categories and appearances. Child Development 58, 1532–1541.
Gelman, S.A., Medin, D.L., 1993. What’s so essential about essentialism? A different perspective on the
interaction of perception, language, and conceptual knowledge. Cognitive Development 8, 157–168.
Gentner, D., Toupin, C., 1988. Systematicity and surface similarity in the development of analogy.
Cognitive Science 10, 277–300.
Gergely, G., Nádasdy, Z., Csibra, G., Bı́ró, S., 1995. Taking the intentional stance at 12 months of age.
Cognition 56, 165–193.
Gibson, E.J., 1991. An odyssey in learning and perception, MIT Press, Cambridge, MA.
Gibson, E.J., Walker, A.S., 1984. Development of knowledge of visual and tactual affordances of
substance. Child Development 55, 453–460.
Gibson, J.J., 1966. The senses considered as perceptual systems, Houghton-Mifflin, Boston.
Gibson, J.J., 1979. The ecological approach to visual perception, Houghton-Mifflin, Boston.
Goldstone, R.L., 1994. The role of similarity in categorization: providing a groundwork. Cognition 52,
125–157.
Goodman, N., 1972. Problems and Projects, Bobbs-Merrill, New York.
Harré, R., 1988. Modes of explanation. In: Hilton, D.J. (Ed.), Contemporary science and natural explanation: commonsense conceptions of causality, Harvester Press, Brighton, Sussex.
Harré, R., Madden, E.H., 1975. Causal powers: a theory of natural necessity. Totowa, NJ: Rowman and
Littlefield.
Harris, P.L., German, T., Mills, P., 1996. Children’s use of counterfactual thinking in causal reasoning.
Cognition 61, 233–259.
Inhelder, B., Piaget, J., 1969. The early growth of logic in the child, classification and seriation. New
York: W.W. Norton.
Intraub, H , 1980. Rapid conceptual identification of sequentially presented pictures. Journal of Experimental Psychology: Human Perception and Performance 7, 604–610.
Johnson, M.H., 1992. Imprinting and the development of face recognition: From chick to man. Current
Directions in Psychological Science 1, 52–55.
Johnson, M.H., Dziurawiec, S., Ellis, H., Morton, J., 1991. Newborns’ preferential tracking of face-like
stimuli and its subsequent decline. Cognition 40, 1–19.
Jones, S.S., Smith, L.B., 1993. The place of perception in children’s concepts. Cognitive Development 8,
113–139.
Keil, F.C., 1981. Constraints on knowledge and cognitive development. Psychological Review 88, 197–
227.
Keil, F.C., 1989. Concepts, kinds, and cognitive development, MIT Press, Cambridge, MA.
Keil, F.C., Batterman, N., 1984. A characteristic-to-defining shift in the development of word meaning.
Journal of Verbal Learning and Verbal Behavior 23, 221–236.
Keil, F.C., Smith, W.C., 1996. Is there a different ‘basic’ level for causal relations? Paper presented at the
37th annual meeting of the Psychonomic Society (November), Chicago, IL.
Kelley, H.H., 1967. Attribution theory in social psychology. Nebraska Symposium on Motivation 15,
192–238.
Kuhn, T.S., 1962. The structure of scientific revolutions, University of Chicago Press, Chicago.
Leslie, A.M., 1982. The perception of causality in infants. Perception 11, 173–186.
134
F.C. Keil et al. / Cognition 65 (1998) 103–135
Leslie, A.M., 1984. Spatiotemporal continuity and the perception of causality in infants. Perception 13,
287–305.
Leslie, A.M., 1995. A theory of agency. In: Sperber, D., Premack, D., Premack, A.J. (Eds.), Causal
Cognition: A multidisciplinary debate, Oxford: Clarendon Press, pp. 121– 141.
Leslie, A.M., Keeble, S., 1987. Do 6-month-olds perceive causality? Cognition 25, 265–288.
Mandler, J.M., 1993. On concepts. Cognitive Development 8, 141–148.
Medin, D.L., Shoben, E.J., 1988. Context and structure in conceptual combination. Cognitive Psychology
20, 158–190.
Medin, D.L., Schaffer, M.M., 1978. Context theory in classification learning. Psychological Review 85,
207–238.
Medin, D.L., Goldstone, R.L., Gentner, D., 1993. Respects for similarity. Psychological Review 100,
254–278.
Meltzoff, A.N., 1988. Infant imitation after a 1-week delay: long-term memory for novel acts and multiple
stimuli. Developmental Psychology 24, 470–476.
Mervis, C.B., Johnson, K.E., Scott, P., 1993. Perceptual knowledge, conceptual knowledge, and expertise:
comment on Jones and Smith. Cognitive Development 8, 149–156.
Murphy, G.L., Allopena, P.D., 1994. The locus of knowledge effects in concept learning. Journal of
Experimental Psychology: Learning, Memory, and Cognition 20 (4), 904–919.
Murphy, G.L., Medin, D., 1985. The role of theories in conceptual coherence. Psychological Review 92,
289–316.
Murphy, G.L., Wisniewski, E.J., 1989. Feature correlations in conceptual representations. In: Tiberghien,
G. (Ed.). Advances in cognitive science: Vol. 2. Theory and applications, Ellis Horwood, Chichester,
UK, pp. 23–45.
Neisser, U., 1967. Cognitive Psychology. Englewood Cliffs, NJ: Prentice-Hall.
Nisbett, R.E., Krantz, D.H., Jepson, C., Kunda, Z., 1983. The use of statistical heuristics in everyday
inductive reasoning. Psychological Review 90, 339–363.
Norman, G.R., Schmidt, H.G., 1992. The psychological basis of problem based learning. Academic
Medicine 67, 557–565.
Nosofsky, R.M., 1984. Choice, similarity, and the context theory of classification . Journal of Experimental Psychology: Learning, Memory and Cognition 10, 104–114.
Nosofsky, R.M., 1986. Attention, similarity, and the identification-categorization relationship. Journal of
Experimental Psychology: General 115, 39–57.
Nosofsky, R.M., 1992. Similarity scaling and cognitive process models. Annual Review of Psychology
43, 25–53.
Novick, L.R., 1988. Analogical transfer, problem similarity, and expertise. Journal of Experimental
Psychology: Learning, Memory and Cognition 14, 510–520.
Patel, V.L., and Groen, G.J., 1991. The general and specific nature of medical expertise: a critical look. In:
Ericsson, K.A., Smith, J. (Ed.), Toward a general theory of expertise, Cambridge University Press,
New York, pp. 93–125.
Rayner, K., Pollatsek, A., 1992. Eye movements and scene perception. Canadian Journal of Psychology
46, 342–376.
Rosch, E., Mervis, C.B., 1975. Family resemblances: studies in the internal structure of categories.
Cognitive Psychology 7, 573–605.
Rozin, P., Kalat, J.W., 1971. Specific hungers and poison avoidance as adaptive specializations of
learning. Psychological Review 78, 459–486.
Salmon, W.C., 1989. Four decades of scientific explanation, University of Minnesota Press, Minneapolis,
MN.
Simons, D.J., Keil, F.C., 1995. An abstract to concrete shift in the development of biological thought: the
insides story. Cognition 56, 129–163.
Sloman, S., 1996. The empirical case for two systems of reasoning. Psychological Bulletin 119, 3–22.
Smith, E.E., Medin, D.L., 1981. Categories and concepts, Harvard University Press, Cambridge, MA.
Smith, E.E., Shoben, E.J., Rips, L.J., 1974. Structure and process in semantic memory: a featural model
for semantic decisions. Psychological Review 81, 214–241.
F.C. Keil et al. / Cognition 65 (1998) 103–135
135
Smith, L.B., Jones, S.S., 1993. Cognition without concepts. Cognitive Development 8, 181–188.
Spelke, E.S., Breinlinger, K., Macomber, J., Jacobson, K., 1992. Origins of knowledge. Psychological
Review 99, 605–632.
Vygotsky, L.S., 1962. Thought and language (E. Hanfmann and G. Vakar, Trans.), MIT Press, Cambridge, MA.
Wellman, H.M., 1990. The child’s theory of mind, MIT Press, Cambridge, MA.
Wellman, H.M., Gelman, S.A., 1992. Cognitive development: Foundational theories of core domains.
Annual Review of Psychology 43, 337–375.
Werner, H., Kaplan, B., 1963. Symbol formation: An organismic-developmental approach to language
and the expression of thought, Wiley, New York.
Williams, J.M.G., Mathews, A., MacLeod, C., 1996. The emotional Stroop task and psychopathology.
Psychological Bulletin 120, 3–24.
Winer, B.J., 1971. Statistical principles in experimental design, 2nd ed., McGraw Hill, New York.
Wisniewski, E., Medin, D.L., 1994. On the interaction of theory and data in concept learning. Cognitive
Science 18, 221–281.
Wright, J.C., Murphy, G.L., 1984. The utility of theories in intuitive statistics: the robustness of theorybased judgments. Journal of Experimental Psychology: General 113, 301–322.