Visual Imagery and Consciousness

Visual Imagery and Consciousness
by
Nigel J.T. Thomas
California State University, Los Angeles,
Los Angeles, CA, USA
To appear in:
W.P. Banks (Ed.), Encyclopedia of Consciousness. Academic Press/Elsevier.
This penultimate draft ©Nigel J.T. Thomas, 2008.
Suggested keywords: image, visualization, consciousness, mental imagery, imagination, mind's
eye, perception, intentionality, imageless thought, quasi-pictorial, propositional, enactive,
sensorimotor, mental representation
Please note:
As is normal with commissioned pieces like this, there were some editorial restrictions on how it
should be written and structured. The title and topic were assigned to me (as well as the
requirement for a glossary), and if I had had my way I would not have restricted myself to just
visual imagery, but would have tried to explicitly cover imagery in other sense modes too. I believe
that all sensory modes of imagery are important in cognition and in our experience, and that our
consciousness of imagery in all modes should, and can, be understood within the same theoretical
framework. That said, however, visual imagery has been studied far more, and thus is better
understood, than imagery in any other mode, so it is not inappropriate to make it the main focus of
discussion.
But quite apart from this, the editorial policies of the Encyclopedia of Consciousness meant that
not only was I held to a quite strict length limit, and unable to use footnotes (or italics, for some
reason), but, more importantly, I was not permitted to give any citations within the text. In my
opinion, the latter was a very unfortunate and inappropriate restriction for an article dealing with
this sort of controversial subject matter (I protested to the editorial staff, but to no avail). The
"further reading" list that I was mandated to include in no way compensates for the lack of
appropriate in-text citations. The literature explicitly devoted to the problems of understanding
imagery as a conscious phenomenon is meager indeed. Most of the more important and influential
ideas about the subject are not to be found in works principally or even largely concerned with the
issue, but, rather, in brief passages, or sometimes just as implicit assumptions, within works whose
main focus lies elsewhere. In my view, the published article would have been much more useful to
readers if I had been able to provide copious citations.
I have been working on a revised version, to be made available both at the ASSC web site and my
own at http://www.imagery-imagination.com. I intend that it will include proper citations, as well as
notes that will expand upon, qualify, and, where appropriate, defend many of the claims that I
make. However, circumstances now make it unlikely that this can be made ready very soon, so I am
making this penultimate draft of the published version available here at the ASSC site now.
2
Visual Imagery and Consciousness
Nigel J.T. Thomas
California State University, Los Angeles,
Los Angeles, CA, USA
I.
II.
III.
IV.
V.
VI.
Introduction
Defining Imagery: Experience or Representation?
Historical Development of Ideas about Imagery
Subjective Individual Differences in Imagery Experience
Theories of Imagery, and their Implications for Consciousness
Conclusion
Glossary
Hard problem - In this article, the "hard problem" of consciousness refers to the problem of
understanding how a brain state (a spatio-temporal pattern of neuronal excitation, itself
reducible to a complex dance of molecules and ions around membranes) could possibly be a
conscious experience. Posed this way, the problem is unsolved, and may well be insoluble. It
does not necessarily follow that consciousness cannot be scientifically understood, however. The
problem may be ill-posed.
Imageless thought - A thought or thought content that is (i) consciously experienced, but (ii),
unlike mental imagery, has no sensory or "perception-like" character. The existence of such
thoughts has been highly controversial ever since the idea was first introduced into psychology
in the early twentieth century.
Intentionality - The property, possessed by many, perhaps all, mental acts or entities (such as
mental images, beliefs, desires, and thoughts in general), of being about, of, or directed at
something. The "thing" in question (sometimes referred to as the "intentional object" of the
image, thought, or whatever) may be real or imaginary (one may have a thought about a unicorn
quite as well as a thought about a horse). This technical, philosophical concept of intentionality
is only indirectly (if at all) related to the ordinary language notion of having an intention (to do
something), or of doing something intentionally (i.e, on purpose rather than inadvertently).
Mental Imagery - Quasi-perceptual experience: that is, experience that subjectively resembles
perceptual experience, but which occurs in the absence of the relevant perceptual stimuli. It is
generally acknowledged that imagery may occur in any sense mode – visual, auditory (including
what is sometimes called "inner speech," or "thinking in words"), olfactory, kinesthetic, etc. – or
even in several simultaneously. However, visual mental imagery, also colloquially referred to as
"visualization," "seeing in the mind's eye," "picturing," etc., has been by far the most extensively
atudied and discussed.
3
Mentalese - The hypothetical, innate and unconscious, "language of thought," held by some
cognitive scientists to be the symbolic format in which information is represented within the
brain, and which the brain, considered as a computer, uses in its computations. It supposedly
resembles the languages people actually speak in having a combinatorial syntax and an arbitrary
semantics. Clearly we are not consciously aware of our mentalese representations as such, but if
they do indeed exist they may be the vehicles of the conscious thoughts that we experience as
mental imagery and "inner" speech.
Retinotopically Mapped Visual Cortex - Certain visual processing areas of the cortex of the
brain, most notably the primary visual cortex (V1), that are structured as (rather distorted and
low resolution) maps of the light sensitive retina upon which light is focused at the back of the
eye. Adjacent regions of cortex, in these areas, correspond to adjacent areas on the retina such
that, during vision, the two-dimensional spatial pattern of excitation of the cortical neurons
corresponds topologically to the pattern of illumination, the optical image, on the retina.
Introduction
For most people visual mental imagery is a common, frequent experience. We often recall past
events, or imagine possible ones, by forming mental images of them. Our dreams also seem to
consist largely of mental imagery. Indeed, many philosophers and psychologists have held that,
together with immediate perceptual experience, imagery makes up the entirety of consciousness
(although clearly not just visual imagery, but also imagery in other sense modes, especially inner
speech, auditory or vocal-motor imagery spoken words). If we are to fully understand
consciousness, we will certainly need to understand mental imagery.
Philosophers have studied mental imagery for many centuries, experimental
psychologists have studied it for well over a hundred years, and more recently it has attracted the
attention of cognitive scientists and neuroscientists, but many basic questions still remain
unresolved. There is controversy not only over what cognitive and neural mechanisms are
responsible for imagery, but also over its function (if any) in cognition. Some regard mental
imagery (or some closely related notion, such as "perceptual symbols" or "image schemata") as
providing the necessary vehicle for all thought; some regard it as important only for certain types
of cognitive task (such as judging spatial relationships); and yet others regard it as a functionally
insignificant conscious epiphenomenon of the unconscious cognitive processes that really
constitute our thinking.
Like percepts, mental images bear intentionality. That is to say, they are always images
(or percepts) of something or other, of some (real or imaginary) "object." (Some philosophers
regard intentionality and consciousness as very closely related phenomena.) However, unlike
percepts, images occur in the absence of their object. You cannot perceive a cat when no cat is
present, but you can imagine a cat (or any other perceptible thing) at any time. Furthermore,
(mistakes aside) you cannot perceive a cat be other than where and how it actually is, but,
whenever you want to, you can imagine (i.e., form an image of) a cat as anywhere, or in any
4
condition. Images thus seem well suited to function as mental representations, allowing us to
think of things as they currently are not, and thus to recall the past, plan for the future, fantasize
about the unreal, and speculate about the unknown. Since ancient times, this has generally been
believed to be their cognitive function.
Defining Imagery: Experience or Representation?
Mental imagery is commonly defined as a form of experience: quasi-perceptual experience,
experience that subjectively resembles the experience we have when we actually perceive
something (see glossary). This implies that we are unlikely to be able to understand imaginal
consciousness unless we understand perceptual consciousness (and perhaps vice-versa).
Unfortunately, we do not yet have such an understanding (or, at least, one that is generally
agreed upon). It also, however, implies that imagery is always and necessarily conscious: if
something is not consciously experienced it cannot be mental imagery.
If, on the other hand, mental imagery is defined as being a form of mental representation,
as some contemporary cognitive theorists suggest, the tight conceptual linkage between imagery
and consciousness is broken, and it becomes conceivable that images (imaginal representations)
might sometimes play a role in cognition without our being consciously aware of them. There is
some evidence suggesting that this does indeed occur. For instance, experimental studies of
verbal memory have found that nouns for which it is easy to think of a corresponding image
(mostly words for concrete things, such as "dog," "ship," or "skyscraper") are more readily
remembered than nouns for which it is difficult to think of an image (mostly abstractions, such
as "truth," "nation," or "size"). However, this mnemonic effect of "imagability" seems to occur
quite regardless of whether any relevant images are actually consciously experienced (or, at
least, of whether the subjects report or recall experiencing them). One interpretation of this
finding is that image representations may be spontaneously evoked by the concrete words, and
may play a role in making these words more memorable, even when they do not rise to
consciousness.
Unlike the experiential conception of imagery with which we started, this
representational conception suggests that we might be able to understand the nature and function
of imagery without giving any attention to the fact that it is (at least sometimes) consciously
experienced, and, in fact, most empirical and theoretical cognitive science research on imagery
over the past few decades has proceeded on that assumption. The problem of the imaginal
consciousness has, very largely, been ignored or set aside, while questions and controversies
about its representational function (and, more recently, its neural instantiation) have been
pursued enthusiastically. Furthermore, little if any attention has been given to the question of
what might differentiate those cognitively active image representations that do enter
consciousness from those that do not.
But in any case, imagery probably cannot be satisfactorily characterized purely in terms
of its representational function. It is difficult or impossible to differentiate it from other forms of
5
actual or possible mental representation, without either, on the one hand, begging some very
controversial questions about its nature (about whether, for example, the relevant representations
are somehow picture-like), or, on the other hand, appealing to the experiential conception of
imagery. We could (and perhaps should) say that mental images are just those mental
representations whose presence to mind has the potential to give rise to quasi-perceptual
experiences. If so, however, the representational conception of imagery is made conceptually
dependent upon the experiential conception, and we cannot even begin truly to understand
imagery unless we take its conscious nature into account.
Historical Development of Ideas about Imagery
Contemporary scientific controversies about mental imagery by no means turn entirely on
matters of empirical fact. Disagreements turn largely upon conceptual issues, and upon differing
ideas about what questions a theory of imagery most needs to answer. For example, is it crucial
to consider the conscious and intentional nature of imagery when we seek to understand its
cognitive mechanisms and functions, or are such considerations irrelevant and distracting? An
awareness of the historical contexts from which the various contemporary research programs
emerged is indispensable in understanding and adjudicating between such differing perspectives.
Scholars disagree as to whether ancient thinkers such as Plato and Aristotle, whose works
contain the earliest known discussions of cognition, had anything like our modern concept of
consciousness. However, it is clear that they did have the concept of mental imagery. Plato
tentatively suggests that memory might be analogous to a block of wax into which our
perceptions and thoughts stamp impressions (i.e. memory images); he also speaks,
metaphorically, of an inner artist who paints pictures in the soul, and suggests that imagery may
be involved in the mechanisms by which the rational mind exerts its control over the animal
appetites.
However, it is Plato's successor Aristotle who provides the first systematic account of the
role of imagery in cognition. In Aristotelian psychological theory, images play much the same
role that the rather broader notion of mental representation plays in modern cognitive science.
He held that mental images play an essential role in memory and thought: memory is the recall
to mind of images of past events, and "It is impossible to think without an image". He also held
that images underpin the meaningfulness of language, and play a key role in motivation. Were it
not for mental images, he thinks, our speech would be empty noise, like coughing, and our desire
for (or fear of) something would only motivate our actions when that something was actually
being perceived. However, a mental image of the desired (or feared) thing enables us to think
about it in its absence, and thus can sustain our motivation at other times. Aristotle also posits a
mental faculty, phantasia (usually translated as "imagination"), that is closely allied to (or
perhaps even structurally identical with) the general faculty of sense perception, and is
responsible for creating our mental images. Some suggest that this faculty amounts to his
conception of consciousness.
6
In the wake of Aristotle's work, images generallycontinued to be seen as the principal
vehicles of mental content. This view lasted through later antiquity and the middle ages and into
the era of modern philosophy, as instigated by Descartes. In the work of Descartes and his
successors the mind was explicitly understood as conscious, and the contents of consciousness
were known as "ideas." Descartes himself seems to have at least two distinct concepts of "idea."
What he calls a "clear and distinct idea" is a direct (perhaps propositional) mental grasp of the
essence of something. It exists within consciousness itself, but is not an image . However, he
also hypothesizes that when we see, imagine, or remember something previously seen, a
pictorial image is formed deep within the brain (on the surface of the pineal gland). Such
images, which he also quite explicitly calls ideas, do not exist within the mind as such (since, for
Descartes, the conscious mind is immaterial, and distinct from the brain), but they are somehow
presented to the mind and are the immediate causes of our conscious perceptual, memory, and
imaginative experiences.
The British Empiricist philosophers' concept of "idea" seems to combine aspects of both
of the notions found in Descartes. Although some scholars today question whether John Locke
really believed ideas to be picture-like, it is natural to interpret him as thinking so, and there is
little room for doubt that his Empiricist successors, most notably Berkeley and Hume, conceived
of ideas as images. Hume distinguishes "impressions" (i.e., percepts) from "ideas" that arise from
the memory or the fancy, but he thinks that these differ only in the intensity with which we
experience them, and he seems to conceive of all visual impressions and ideas as consciously
experienced, picture-like images. Unlike the image-ideas of Descartes, however, for the
Empiricists image-ideas are themselves entirely mental, and exist only inasmuch as we are
conscious of them. Indeed, Hume's so called "bundle" theory of the mind suggests that these
images actually constitute consciousness: the mind is nothing but a bundle of impressions and
ideas.
Although it certainly had its critics, this Empiricist conception of mind continued to be
influential up to, and beyond, the emergence of scientific psychology in the late 19th century.
Pioneering experimental psychologists such as Wilhelm Wundt in Germany and William James
in the U.S.A. thought of psychology as the study of consciousness and of the images (and
emotional "feelings") that populate it. However, in the early 20th century a group of psychologists
based in Würzburg in Germany, led by Oswald Külpe, reported that systematic introspection of
thought processes under controlled laboratory conditions had led them to recognize that, as well
as images, the mind also contains "imageless thoughts." These were, supposedly, conscious
experiences (they had been discovered by introspection after all) and cognitive (they were
evoked during reasoning, and were not merely emotional feelings), but, unlike imagery, they had
no sensory character.
These claims proved extremely controversial. Some psychologists, such as Wundt,
argued that the introspective methodology employed by the Würzburg psychologists was
inherently unscientific and unreliable. Others, such as Edward Titchener, argued that
competently conducted introspective investigations revealed no evidence of imageless thoughts.
7
A bitter and irreconcilable dispute, the so called "imageless thought debate," developed within
psychology. It is important to be clear that no party to this debate was claiming that thought in
general is imageless; rather, the point at issue was whether any non-imaginal, but nevertheless
conscious, thought contents exist. Although the factious arguments about this have long since
died down, the issue has never truly been resolved. Some people still hold (although perhaps
more often as an implicit assumption, rather than an explicitly defended view) that all the
cognitive contents of consciousness have a sensory character, either as actual sensations or
percepts, or else as mental images; others hold (again, often implicitly) that there are also nonsensory conscious thought contents. These latter are sometimes referred to as states of "fringe
consciousness," and may be described as "feelings" of, for example, "familiarity," "rightness,"
"confidence," etc., (or their opposites).
Historically, however, the upshot of the deadlock of the imageless thought debate was a
general discrediting of introspective methodologies in psychology, and even of the very idea that
consciousness could be studied scientifically. John B. Watson argued that consciousness was an
inherently unscientific notion, and, as a central plank of this argument, questioned the very
existence of mental imagery. He urged that psychology should be reconceived as the study of
behavior rather than the study of consciousness, and the Behaviorist movement that he instigated
came to dominate the field for the next several decades (very roughly, from about 1920 to about
1960). Few, if any, of the Behaviorist psychologists who succeeded Watson went quite as far as
he did toward denying the reality of consciousness and imagery, but almost all of them, in
practice, treated them as beyond the reach of science. During the same period, the rise of the
analytical philosophy movement (with its emphasis on formal logic and language as the keys to
understanding knowledge and the mind), and particularly the influence of Ludwig Wittgenstein,
also led many philosophers to discount the significance of imagery, or simply to ignore it. The
upshot was that neither imagery nor consciousness received much serious attention during the
Behaviorist era.
A revival of interest in imagery, however, was an important component of the cognitive
psychology movement that challenged and eventually displaced Behaviorism as the dominant
psychological paradigm in the 1960s and '70s. New experimental methods for studying imagery
were devised that did not depend on purely subjective introspection, and evidence emerged
suggesting that imagery plays a large role both in memory (as shown mainly be experiments on
verbal memory), and spatial thinking (as shown by experiments on "mental rotation" and
"mental scanning"). A strong empirical case was built up for the functional importance of
imagery in cognition, and a vigorous (and continuing) debate ensued about the nature of the
cognitive mechanisms responsible for imagery.
However, there was not a comparable revival of scientific interest in consciousness until
the 1990s. As cognitive theories of imagery developed in the 1960s through the 1980s and
beyond, only cursory attention was given to the fact that it is a conscious phenomenon (or to the,
arguably, closely related fact that it bears intentionality). Even today (the early 21st century),
cognitive scientists and neuroscientists often treat imagery purely as a form of representation,
8
and offer no substantive account of how images are able to represent (almost certainly it is not
because they resemble their objects), or of how they come to be consciously experienced.
Subjective Individual Differences in Imagery Experience
Since the pioneering work of Francis Galton in the nineteenth century there has been a tradition
of research into individual differences in the subjective experience of imagery. Galton prepared
a questionnaire in which he asked his subjects to recall their morning breakfast table, and to
consider "the picture that rises before your mind's eye." They were then asked to comment on a
number of subjective aspects of that image, such as its brightness, its clarity, and the distinctness
of the colors. One of the best knowing findings of Galton's study is that some small minority of
the questionnaire respondents reported that they were unable to visualize anything whatsoever.
Unfortunately, no systematic research has since been done on this phenomenon, and the issue
remains very poorly understood (although recent research has directly contradicted Galton's
related claim that scientists are particularly likely to be weak or "non-" imagers).
Subsequent researchers have refined Galton's questionnaire technique in various ways,
most importantly, perhaps, by introducing numerical scales along which the subjects can rate the
strength of the image attribute of interest. Most often, this attribute is "vividness," but there have
also been attempts to use the questionnaire technique to measure such things as how easily
people can transform their mental images (by asking them to form an image and then change it
in some specified way, and then rate the difficulty of doing so), or how frequently they use
images of one or another sense mode in their thinking. In the early 20th century there was
considerable interest in classifying people into "imagery types" according to which sensory mode
of imagery (visual, auditory, haptic, etc.) they preferred. It was hoped that these "types" might
prove to be correlated with other independently measurable aspects of the person's psychological
makeup. However, no clear cut pattern of findings emerged.
Perhaps the most extensively and successfully used imagery questionnaire of recent times
is the Vividness of Visual Imagery Questionnaire (VVIQ) devised by David Marks. Taking the
VVIQ involves being asked to visualize a series of specified objects or scenes and to rate the
vividness of each resulting image on a five point scale ranging from "no image at all" to "as clear
and vivid as normal vision." An average score is then calculated.
Clearly research of this sort deals with imagery as a conscious phenomenon. It relies
upon the subjects' verbal, introspective reports of their subjective experience. Furthermore, there
has been much effort to find correlations between a people's scores on imagery questionnaires
and other psychological variables. The results, however, have generally been disappointing.
Even where reproducible correlations have been found, it has often proven difficult to make
much theoretical sense out of them. For example, there is some evidence that vivid imagers (as
measured by the VVIQ) remember pictures (color photographs) rather better than less vivid
imagers; however, surprisingly, it has also been found that the less vivid imagers (by the VVIQ
again) actually remember specific shades and hues of color better. Some research suggests that
9
when people form mental images they have elevated levels of activity in the retinotopically
mapped visual cortex of their brain, and that this activity is greater for more vivid imagers.
However, other researchers fail to find any elevated levels of activity in these areas during
imagery (they find elsewhere in the brain). Perhaps most surprisingly and disappointingly,
however, virtually no sign of any correlation has been found between people's vividness ratings
and their performance (speed or accuracy) in various visuo-spatial thinking and problem solving
tasks, even though, subjectively, such tasks very much depend on imagery.
There are a number of conceptual concerns about the validity of questionnaires of this
type. Most obviously, as the rating is necessarily purely subjective, and as nobody can
experience another's image, there is no way to tell whether the scales are being applied in a
consistent way. Perhaps an image that one person thinks of as "clear and reasonably vivid"
another might rate as "vague and dim," simply because different subjective standards are being
applied.
Furthermore, most people will probably agree that the vividness of their imagery can vary
markedly from time to time, circumstance to circumstance, and image to image: Some memories
may come back to us (often for no very apparent reason) as especially vivid images, whereas
others (or even the same ones on other occasions) are recalled only vaguely and dimly. Thus it
may be that these tests do not measure a person's true capacity for having vivid images, but only,
at best, the vividness that their images tend to have under the circumstances of filling out an
imagery questionnaire.
Finally, it is unclear quite what "vividness" truly means; whether it is really a well
defined, coherent attribute of experience. Perhaps different subjective image features such as
(for example) clarity, apparent brightness, level of discriminable detail, stability of the image
(whether it can be held in consciousness for a time, or quickly fades), and so forth, in fact vary
independently of one another, but tend to get indiscriminately lumped together as "vividness."
Some recent research suggests that introspective ratings of certain more fine grained aspects of
subjective imagery experience may be more meaningful than simple vividness ratings.
Theories of Imagery, and their Implications for Consciousness
There is much controversy in the contemporary scientific and philosophical literature about the
underlying nature and mechanisms of imagery. The rival theories fall into three broad classes,
each of which have distinctive implications for our understanding of imaginal consciousness:
! Picture theory (its contemporary advocates often prefer to call it "quasi-pictorial," or
"analog" theory)
! Description theory (also, rather misleadingly, known as "propositional" theory)
! Enactive theory (there is no established name for this type of theory; different versions
are known as "motor theory," "sensorimotor theory," "perceptual cycle theory,"role taking
theory," and "perceptual activity theory".)
10
Because of the vigorous, high-profile dispute (known as the "analog-propositional"
debate, or sometimes just "the imagery debate") that flared up between advocates of the first two
theory types in the 1970s, and that continues to rumble on even today, picture and description
theories have become by far the better known of these theoretical alternatives. Unfortunately,
this debate been conducted almost entirely without regard to the conscious nature (or, indeed,
the intentionality) of imagery. Each of the three theory types purport to be able to account for the
full range of established empirical facts about imagery, and, in fact, each is flexibly enough
conceived that it is likely to prove difficult to choose between them on narrowly empirical
grounds. That being so, it seems reasonable that our evaluation of them should look to broader
theoretical concerns, such as their respective implications for the theory of consciousness.
Picture theory
Picture theory, as the name suggests, holds that visual mental images may be identified with
representations in the mind and/or brain that are in some significant way like pictures, or, at
least, that represent things in much the same way that pictures represent things. (Whatever that
way may be. Although it is widely believed that pictures represent through resemblance, it does
not seem likely that resemblance can be the basis of mental representaton. Resemblance is a
fundamentaly subjective relationship that is recognized when a conscious mind focuses upon
certain similarities between things while, at the same time, discounting or ignoring other,
equally objectively real dissimilarities. In order to do this, the mind must already be able to
represent the things, and their relevant aspects, to itself.)
Picture theory is undoubtedly the oldest and most widely accepted theory of imagery.
After all, since prehistoric times people have used pictures as a means of causing one another to
have visual experiences of things that are not really there, and the analogy with mental imagery
(also experience of things that are not really there) is all too easily drawn. The theory goes back
at least to Plato, and some rudimentary version of it seems to be deeply entrenched in "folk"
(i.e., lay or commonsense) beliefs about how the mind works. Indeed, many of the expressions
used to talk about imagery in ordinary colloquial language (such as, "mental picture," "the mind's
eye," and even "image" itself) seem to be derived from this "folk" version of picture theory. The
fact that the theory is embedded in our language in this way can make it difficult to express or
understand criticisms of it, or to clearly grasp alternative accounts of the actual phenomena.
Nevertheless, many twentieth and twenty-first century philosophers have raised telling
objections to it, leading some of them to be accused (quite unjustifiably) of denying or doubting
that people have quasi-visual experiences.
However, the picture theory today is not just a folk theory. From the 1970s onward,
cognitive scientists, most particularly Stephen Kosslyn, have developed the basic idea of the
mental image as an inner picture into a sophisticated and detailed scientific model designed to
account for a wide range of experimental findings. More recent versions of the theory hold that
the visual mental image is embodied as a spatially extended two-dimensional pattern of neural
excitation in the retinotopically mapped visual cortices of the brain. Such patterns, isomorphic to
11
the pattern of optical stimulation on the retinae of the eyes, are known to occur during vision.
Kosslyn holds that our mental images are similar patterns, but generated from internal sources
rather than sensory input.
This is not the place to review all the alleged empirical shortcomings of quasi-pictorial
theory. However, both introspective and empirical evidence suggest a number of ways in which
the experience of imagery and the experience of looking at a picture vary. Perhaps the strongest
such evidence comes from experiments involving pictures that invite more than one
interpretation, such as the Necker cube and the duck-rabbit (figure 1). Subjects find it very much
harder to see both interpretations in their mental images of these than they do when the picture is
physically displayed in front them, and this applies even when the picture is their own drawing,
based upon their mental image.
<Figure 1 near here. Caption: The Necker cube and the duck-rabbit.>
However, Kosslyn maintains that the pattern of neural excitation that (he thinks)
constitutes the mental image is only a picture in an extended, metaphorical sense: a "quasipicture." After all, unlike regular pictures (drawings, photographs, projected optical images,
etc.), we do not need to see it with our eyes in order to derive information from it, or consciously
experience it. The neural excitation pattern is like a picture only inasmuch as it represents spatial
relationships within the two-dimensional projection of the represented visual scene by
topologically equivalent spatial relationships in the spatially extended pattern of excitation.
Kosslyn maintains that because his theory does not identify mental images with pictures in the
full, literal sense, but only with these "quasi-pictures," it avoids all of the many powerful
objections that have been raised against naïve "folk" versions of picture theory. Whether this is
so, however, remains controversial.
Although the modern version of pictorial theory can account for a large amount of
empirical data about how people use mental images as representations, its supporters have given
very little attention to the question of how it might account for the fact that imagery is (at least
sometimes) consciously experienced. There seem to be two basic ways in which this issue might
be approached. Either the quasi-picture in the cortex might be conscious in and of itself, or,
alternatively, consciousness arises when some other, more inward cognitive structure, a "mind's
eye" as it were, somehow "quasi-sees" the quasi-picture.
The first option, however, runs immediately into the notorious "hard problem" of
consciousness in its most intractable form: we really have no idea how a pattern of excitation in
part of the brain (excitation that is ultimately nothing but an elaborately choreographed dance of
molecules and ions around and through membranes) could possibly be, or give rise to, a
conscious experience of any sort, let alone an experience of a spatially extended structure
isomorphic to the excited brain region.
12
In any case, the second option appears to be favored by Kosslyn, the leading
contemporary advocate of picture theory. He speaks of (and diagrams) a "mind's eye function"
examining the quasi-pictorial image in the brain, and some of his explanations of empirical
findings seem to depend upon this idea. Perhaps then, just as ordinary visual consciousness
arises when people take in visual information with their eyes, imaginal consciousness arises
when the mind's eye takes in visual information from the inner picture.
But the "mind's eye" strongly suggests that the theory commits the homunculus fallacy.
To whom might this eye belong if not to a conscious little man inside my head, who experiences
the inner world of quasi-pictures in a way analogous to that in which I, a whole person,
experience the external world through my bodily eyes? Compounding this worry is the fact that,
at a formal, structural level (abstracting away from implementational details), Kosslyn's quasipictorial theory of imagery looks very like a more detailed version of Descartes' theory of
imagery. For Descartes too, mental images are material quasi-pictures formed within the brain
and consciously experienced not simply because they are there, but because they are somehow
"seen" by the soul. Kosslyn makes it clear that he does not intend to follow Descartes in
attributing our consciousness of these brain pictures to an immaterial soul (a ghostly
homunculus) beyond the reach of science; but although quasi-pictorial theory may not
necessarily imply Cartesian dualism, it does seem to be at least committed to Cartesian
materialism, the notion that subjective experience depends upon some structure in the brain that
(like a soul or a homunculus) acts as a conscious inner spectator of internal representations.
Quasi-pictorialists seem to have two comebacks to the homunculus objection. One is to
point out (as Kosslyn has often done) that a computer simulation of quasi-pictorial theory has
been implemented, and that there can be no homunculus (at least of any objectionable sort)
within a computer. Unfortunately, however, it turns out that the computer program in question
makes no attempt whatsoever to model the conscious nature of imagery. Like most of Kosslyn's
work, it is concerned to explore how certain sorts of spatial thinking might be achieved through
imagery, via such operations as scanning linearly across a quasi-picture, or rotating it into a new
orientation. The program performs such operations on internal data structures that are supposed
to model, at a functional level, the brain quasi-pictures posited by the theory, and it displays its
results in the form of actual pictures that are rotated or shifted across the computer's display
screen in relevant ways. No attempt is made to simulate the hypothesized "mind's eye function,"
and consciousness, either real or simulated, does not enter into the matter at all unless and until
some human operator looks at the screen. It is true that there is no homunculus in the computer,
but, inasmuch as the system models consciousness at all, it models it by co-opting a full sized
conscious human being to play the the homunculus role.
Not coincidentally, the program also omits to model the intentionality of imagery. The
fact that the two-dimensional patterns it produces and manipulates may look like pictorial
representations to human onlookers (and were designed to look that way by the programmers) is
quite irrelevant to the program's functioning. For the computer, they represent nothing.
13
The second comeback, however, is not dismissed quite so easily. Perhaps cognitive
science can render the homunculus and its "mind's eye" innocuous by showing that it can be
reduced to a set of computational or neural processes. Quasi-pictorial theory was originally
developed in the context of the "information processing" paradigm which dominated perceptual
theory at the time (the 1970s), and which still deeply influences the way many cognitive
scientists think about perception. In essence, "information processing" theory regards vision as a
one way flow of visual information, in through the eyes and then through a series of processing
stages in the brain until it is eventually transformed into a representation, or a set of
representations, suitable for guiding behavior. (This is an oversimplification, but not, in this
context, a misleading one. Developed information processing theories generally call for a
significant degree of top-down modulation of the bottom-up flow of information from the sense
organs. Nevertheless, the bottom-up flow is seen as dominating and driving perception.) A
common assumption (usually implicit) is that this final set of representations, this ultimate
product of the visual information processing system, is the immediate cause of conscious visual
experience.
(Some information processing theorists may prefer to think of visual consciousness not
as something attaching to representations, but, rather, as something arising from the workings of
the processing system as a whole, or even of the whole organism, including the sense organs and
the muscles that support behavioral response. However, this approach is not open to the pictorial
imagery theorist, who is directly committed to holding that we have conscious experience of
inner representations.)
According to quasi-pictorial imagery theory, one of the earlier stages of this visual
information processing is the creation of quasi-pictorial representations in the brain. These may
be derived from actual present visual input from the eyes (when we are actually seeing), or they
may be created from stored data in memory (when we are remembering or imagining), but in
either case they must be passed through several more stages of processing in order to extract
useful information from them. It is these further stages that constitute the "mind's eye function."
Thus, it can be argued, the mind's eye has a principled and independently motivated role within
the broader theory of vision. As scientists are actively investigating and seem to be making
progress in understanding the computational and neural mechanisms of visual information
processing, this "mind's eye" function is not so much a non-explanatory homunculus as a
promissory note drawn against the expected success of an ongoing research program.
But even if information processing theory does provide the right framework for
understanding visual perception (and not everyone thinks it does), the output of all the
processing is just more representations, instantiated as patterns of neural activity. Once again, it
seems that we must say either that these representations are conscious in and of themselves,
which brings us smack up against the "hard problem" once again, or else we need another
homunculus to read them and be conscious of what they represent. Perhaps this homunculus,
also, might be reduced to a further series of stages of information processing, but this would only
14
lead us to the same place yet again, and so on in unending regress. We still do not begin to
understand how quasi-pictures could be, or could produce, conscious imaginal experiences.
Description theory
Description theory, whose most important advocate has been Zenon Pylyshyn, began as an
attempt to understand how the phenomena of imagery could be fitted into a computational
theory of the mind. Pylyshyn holds that the way computers (and, by extension, brains) represent
information is necessarily more like language than like pictures. The syntax, and, indeed, the
vocabulary, of the hypothetical internal brain language (sometimes called "mentalese") might be
very different from that of any language that anyone speaks, but it is language-like because it
ultimately consists of symbolic tokens that represent things in the world in much the same way
as the words of a language do, not through resemblance but through some sort of arbitrary
correspondence relation.
In the case of "natural" languages, like English or Chinese, this correspondence (which
determines, for instance, that the word "dog" refers to dogs) is established by social convention.
In the case of humanly written computer programs the external reference of symbolic tokens
(when they have one) is set by the programmer. Matters are much less clear when we come to
hypothetical symbolic tokens in the brain, and there is great controversy over how, or whether,
the "symbol grounding problem" (the problem of understanding how the "words" of mentalese
might be able to refer to things outside the brain) can be solved. Nevertheless, much work in
cognitive science proceeds on the assumption that a solution is there to be found.
If the symbolic tokens that comprise computational representational systems represent as
words do, then it seems to follow that, by analogy with the way language represents visual
scenes, the brain-computer represents visual scenes with descriptions. From this perspective,
such mentalese descriptions are the end-product of perceptual information processing, and thus
responsible for our perceptual experiences. Of course, we do not experience the descriptions as
being descriptions, but, it is assumed, they nevertheless constitute our perceptual experience. If
mentalese descriptions of visual scenes are retrieved from memory rather than being the result of
present sensory input, or if they are constructed out of bits and pieces of various descriptions in
memory, then we have the experience of mental imagery.
Pylyshyn claims that, since descriptions can be partial, incomplete, and can leave out all
sorts of information (not only details, but sometimes even facts about the global structure of a
scene) description theory can explain the frequently indefinite and ambiguous nature of our
imagery better than picture theory can. However, it is clear that the theory's main motivation is
the belief that it better respects fundamental facts about the nature of computational
representation.
If anything, however, description theorists have paid even less heed than quasi-pictorial
theorists to the question of how images (or, come to that, percepts) come to be consciously
15
experienced. Clearly, we are not aware of mentalese descriptions as such, but the idea seems to
be that we can, at least sometimes, be consciously aware of what they represent. Like the quasipictorialists, however, description theorists appear to be implicitly committed to the view that
mental representations, including the elementary symbolic tokens of mentalese, and the symbol
complexes that make up visual descriptions (and, thus, mental images), are physically
instantiated as patterns of brain excitation (just as representations in a computer are instantiated
as electrical charge patterns in RAM chips and CPU registers). This brings us to exactly the
same "hard problem" as before: it seems impossible to conceive how patterns of brain excitation
(which, themselves, are reducible to movements of ions, etc.) could be, or give rise to, conscious
experience as we know it.
Enactive Theory
Quasi-pictorial and description theories are both attempts to explain how imagery, quasi-visual
experience, might be explained within the context of an information processing theory of
perception. They disagree over whether it should be identified with representations frm an early
or late stage of visual processing, and over the format of those representations. The enactive
theory of imagery depends upon a quite different way of conceiving of perception, one that was
pioneered in the twentieth century by James Gibson, and is more recently exemplified in "active
vision" techniques used in robotics, and the "sensorimotor" or "enactive" theory of perception
advocated by J. K. O'Regan and Alva Noë, amongst others. Instead of regarding vision as, at
root, a matter of information flowing in through the eyes into the brain, and toward some
internal center of consciousness, enactive theory regards it as a matter of the visual system
actively seeking out and extracting (or "picking up") desired information from the environment.
Seeing is not like taking a photograph (even a digital photograph whose file gets passed on to a
computer for further processing); it is more akin to performing a series of (many, rapid)
scientific tests and measurements on the information bearing ambient light that surrounds us
(what Gibson called the "optic array"). Rather than being the passive reception of information,
vision (and perception in general) is a purposive process of, as it were, asking questions about
our surroundings, and actively seeking out the answers. Visual consciousness is the experience
of being engaged in this exploratory, questioning, information seeking activity, involving not
only the making of the tests and measurements themselves, but also the continual adjustment of
our expectations, and thus our exploratory behavior itself, in response to their findings.
Consider how we identify an object by touch. If something is simply pressed against our
skin, we can tell very little about it. However, if we actively explore it, moving our fingers
around to feel its shape and texture, seeking out corners and edges, squeezing it to assess its
hardness, and so on, we can discover a great deal. What we learn derives not simply from the
sensations we feel in our skin, but, crucially, from the way those sensations change in response
to the purposeful, information-seeking movement of our fingers. Enactive visual theory holds
that vision works in fundamentally the same sort of way, except that the exploratory,
information-seeking behavior mostly happens so quickly and automatically that its details are
unavailable to introspection.
16
Eye movements are perhaps the most obvious (and experimentally accessible) aspect of
this visual information-seeking behavior – our eyes constantly, and purposefully, but largely
unconsciously, flit rapidly around to take in information from different points of interest – and
the enactive theory of imagery finds support in a number of recent experiments showing that the
stimulus-specific eye movement patterns produced when a subject examines a complex visual
stimulus are (quite unconsciously) re-enacted when the same subject later forms a memory
image of that stimulus.
However, eye movements cannot be the whole story. There clearly are bottom-up, inner
representations involved in human vision. For example, the pattern of illumination on the retina
does indeed produce a corresponding pattern of excitation in the primary visual cortex, and this
carries information about the scene momentarily before our eyes. From the enactivist
perspective, however, such representations are not conscious or even mental in any very
meaningful sense. Like the optical image formed inside a camera, the excitation pattern of
primary visual cortex carries data about our surroundings, but this only gives rise to visual
knowledge and awareness inasmuch as the data is purposefully explored and queried. It is this
active, exploratory questioning of a data source that turns the mere passive reception of visual
information (something that a camera can do) into true perception.
It is important to note, also, that the particular pattern of this exploratory questioning,
and the particular tests and measurements brought into play (both external eye movements and
internal data analysis processes), will be quite different when different sorts of things are being
looked at. For example, in order to see a cat we would need to go through a set of exploratory
actions appropriate to cat-seeing. These might include moving our eyes to focus on the likely
location of particular characteristic cat features (pointed ears, gently curving tail, etc.), but will
also undoubtedly include purely internal processes whereby further questions are answered by
querying the data flowing bottom-up into the brain. The key idea is that there is a specific
structured set of exploratory queries, what has been called a visual routine, appropriate to cat
seeing (or even Kitty-curled-up-asleep seeing), and a different set for each other type of thing
that we are able to recognise.
So, if visual experience arises from the active seeking for, and finding, of information,
imagery, quasi-visual experience, arises when we actively seek certain information, and persist
in going through the motions of looking for it, even though it is not there to be found. To
visually recognize a cat is to run through a specific, cat-recognizing visual routine, and to
imagine a cat is to perform (or partially perform) this same visual routine when there is no cat
present. The experience is different from actual seeing because there is no perceptual feedback
from the cat itself (and we must, as it were, force ourselves to go on looking for cat features
even though it is becoming ever more clear that we will not find them) but it is similar to real
cat-seeing inasmuch as the pattern of perceptual exploration that we perform is similar.
Arguably, this reflects the phenomenology of imagery, which is subjectively both like and unlike
true seeing.
17
From this theoretical perspective, to have a conscious visual experience is not to have a
representation in one's brain, but to act, and to be interacting with the world, in a certain way. To
have imagery is to act in the same sort of way, but to fail to interact. Actions (unlike brain states)
have inherent intentionality, just as mental images do: actions are necessarily about, or directed
at, something (possibly something imaginary, as when we search for the leprechaun's gold), just
as images are always images of something. When I enact my perceptual routine for looking at,
and recognizing, a cat, my action is intended to enable me to see a cat, and in a real sense it
remains so, it remains cat-directed, even when I enact it knowing that no cat is actually there to
be seen.
However, to point out that actions have intentionality is not, thereby, to explain how they
come to have that intentionality, and to say that mental images are really actions rather than
entities (representational brain states) is not, in itself, to explain how they come to be conscious.
Much more work needs to be done to make these matters clear. Although it has been argued that
enactive (or "sensorimotor") perceptual theory can explain perceptual consciousness, such
claims remain very controversial. Nevertheless, the enactive theory of imagery clearly radically
reframes the problem of explaining imaginal consciousness. No longer is it (as for quasipictorial and description theories) a problem of explaining how a brain state can be a conscious
(and intentional) state. Rather the problem is to give a scientific account of action and its
intentionality (a problem we faced anyway), and then to explain how such action can somehow
constitute consciousness. This is, no doubt, a difficult problem, but it is not the same as the
"hard problem."
Conclusion
At the time of writing, imagery is still a rather neglected topic within the broader, and growing,
field of consciousness studies. This is in rather stark contrast to the importance it was accorded
by earlier, pre-Behaviorist students of consciousness. Indeed, the Behaviorist revolt against
consciousness, which let to several decades of scientific neglect of the topic, seems to have been
fueled, in considerable part, by frustration at the factious and irreconcilable "imageless thought"
controversy, and it clearly involved the rejection of imagery quite as much as the rejection of
consciousness itself. Perhaps recent students of consciousness have been reluctant to fully
engage the topic of imagery because it has already proven so controversial within cognitive
science, and because the best known theories that have been developed in that context do little to
illuminate imagery's conscious nature. However, this situation may now be changing, and a
developing understanding of imagery may come, once again, to be seen as an integral and
essential aspect of our developing understanding of consciousness itself.
Further Reading
Baars, B. J. (ed.) (1996). Mental imagery. Special issue of the journal Consciousness and
Cognition, 5 (#3).
18
Berman, D. and Lyons, W. (2007). The first modern battle for consciousness: J. B. Watson's
rejection of mental images. Journal of Consciousness Studies, 14 (#11), 5-26.
Dean, G. M. & Morris, P. E. (2003). The relationship between self-reports of imagery and spatial
ability. British Journal of Psychology, 94, 245–273.
Ellis, R. D. (1995). Questioning consciousness: the interplay of imagery, cognition, and emotion
in the human brain. Amsterdam: John Benjamins.
Kosslyn, S. M., Thompson, W. L. and Ganis, G. (2006). The case for mental imagery. Oxford:
Oxford University Press.
Marks, D. F. (1999). Consciousness, mental imagery and action. British Journal of Psychology,
90, 567-585.
Morris, P. E. and Hampson, P. J. (1983). Imagery and consciousness. London: Academic Press.
O'Regan, J. K. and Noë, A. (2001). A sensorimotor account of vision and visual consciousness.
Behavioral and Brain Sciences, 24, 939-1031.
Pylyshyn, Z. W. (2002). Mental imagery: in search of a theory. Behavioral and Brain Sciences,
25, 157-237.
Richardson, J. T. E. (1999). Mental imagery. Hove, U.K.: Psychology Press.
Sartre, J.-P. (1940). The psychology of imagination. (Translated from the French by B.
Frechtman, New York: Philosophical Library, 1948.)
Schwitzgebel, E. (2002). How well do we know our own conscious experience? The case of
visual imagery. Journal of Consciousness Studies, 9 (#v-vi), 35-53.
Thomas, N. J. T. (1999). Are theories of imagery theories of imagination? An active perception
approach to conscious mental content. Cognitive Science, 23, 207-245.
Thomas, N. J. T. (2007). Mental imagery. In Zalta, E. N. (ed.) The Stanford encyclopedia of
philosophy. Stanford, CA: CSLI. Online publication:
http://plato.stanford.edu/entries/mental-imagery/
Thompson, E. (2007). Look again: phenomenology and mental imagery. Phenomenology and the
Cognitive Sciences, 6, 137-170.
Tye, M. (1991). The imagery debate. Cambridge, MA: MIT Press.