Listeners invest in an assumed other’s perspective despite

Cognition xxx (2011) xxx–xxx
Contents lists available at ScienceDirect
Cognition
journal homepage: www.elsevier.com/locate/COGNIT
Listeners invest in an assumed other’s perspective despite cognitive cost
Nicholas D. Duran ⇑, Rick Dale, Roger J. Kreuz
Department of Psychology, University of Memphis, Memphis, TN 38152, United States
a r t i c l e
i n f o
Article history:
Received 17 February 2011
Revised 3 June 2011
Accepted 13 June 2011
Available online xxxx
Keywords:
Perspective-taking
Audience design
Visuospatial processing
Mental rotation
Mousetracking
a b s t r a c t
We explored perspective-taking behavior in a visuospatial mental rotation task that
requires listeners to adopt an egocentric or ‘‘other-centric’’ frame of reference. In the current task, objects could be interpreted relative to the point-of-view of the listener (egocentric) or of a simulated partner (other-centric). Across three studies, we evaluated
participants’ willingness to consider and act on partner-specific information, showing that
a partner’s perceived ability to contribute to collaborative mutual understanding modulated participants’ perspective-taking behavior, either by increasing other-centric (Study
2) or egocentric (Study 3) responding. Moreover, we show that a large proportion of participants resolved referential ambiguity in terms of their partner’s perspective, even when
it was more cognitively difficult to do so (as tracked by online movement measures), and
when the presence of a social partner had to be assumed (Studies 1 and 2). In addition, participants continued to consider their partner’s perspective during trials where visual perspectives were shared. Our results show that participants will thoroughly invest in
either an other-centric or egocentric mode of responding, and that perspective-taking
strategies are not always dictated by minimizing processing demands, but by more potent
(albeit subtle) factors in the social context.
Ó 2011 Elsevier B.V. All rights reserved.
1. Introduction
When two people discuss the world around them, they
adapt their utterances to particular perspectives in order to
be understood. Suppose you, the reader, wish to know
where Al Green’s world-famous Full Gospel Tabernacle
church is. If the authors knew you were from Memphis,
we could respond: ‘‘By Graceland.’’ If you had a novice’s
perspective on Memphis, we might adorn the description
with more detail, such as: ‘‘About 20 min south of downtown Memphis, off Highway 55, also near Graceland.’’
(e.g., Fussell & Krauss, 1992; Isaacs & Clark, 1987). If our
spatial perspectives differed in some way (e.g., we were
chatting over the phone), I might give you directions by
tailoring them to your spatial perspective, such as ‘‘On your
left will be Graceland, then Al Green will be just. . .’’ (e.g.,
⇑ Corresponding author. Tel.: +1 909 569 2488; fax: +1 901 678 1336.
E-mail address: [email protected] (N.D. Duran).
Mainwaring, Tversky, Ohgishi, & Schiano, 2003; Schober,
1993, 1995). These adaptive processes are sensitive to a
variety of factors, and sometimes speakers mix strategies
during an interaction (Tversky, Lee, & Mainwaring, 1999).
Research conducted during the past couple of decades
has revealed a diversity of constraints on how these speech
acts are shaped by perspective (Schober, 1998, 2009). In
general, when conditions are right, and cognitive constraints are satisfied, speakers seem to employ perspectives adaptively (see also Brown-Schmidt, 2009).
Relatively less is known, however, about how the addressee herself adapts to utterances inherently involving
a spatial perspective. As noted above, speakers are neither
always careful nor explicit with perspective choice, and listeners themselves can face moments of ambiguity. Above,
when ‘‘On your left. . .’’ was used, it was assumed that the
traveler is headed south from downtown Memphis. Yet
conversational conditions do not always allow such ready
assumptions. In the construction of directions relative to
0010-0277/$ - see front matter Ó 2011 Elsevier B.V. All rights reserved.
doi:10.1016/j.cognition.2011.06.009
Please cite this article in press as: Duran, N. D., et al. Listeners invest in an assumed other’s perspective despite cognitive cost. Cognition
(2011), doi:10.1016/j.cognition.2011.06.009
2
N.D. Duran et al. / Cognition xxx (2011) xxx–xxx
some map, ‘‘on the left’’ is cognitively challenging and demands a referent object, as does ‘‘near’’ and ‘‘far’’ though
they may be less challenging (Franklin & Tversky, 1990).
Earlier studies have suggested that perspective consistency facilitates comprehension (Black, Turner, & Bower,
1979; Tversky et al., 1999), and that different perspectives
comprehended from text may allow varying levels of cognitive flexibility (Brunyé, Rapp, & Taylor, 2008). Many
studies have explored the role of particular perspectives
for memory encoding and access of maps or routes
(McNamara, Sluzenski, & Rump, 2008; Mou, Liu, & McNamara, 2009), and such memories and related inferences
are sensitive to factors such as familiarity with a terrain
or environment, and aspects of a terrain like relative object
salience (Taylor & Tversky, 1992, 1996). In addition, extensive work by Carlson and colleagues has shown that comprehenders might maintain activation across multiple
perspective types before one is selected (Carlson, 1999;
Carlson & Covey, 2005), a notion that Taylor and colleagues
confirmed using online measures and event-related potentials (Taylor, Naylor, Faust, & Holcomb, 1999).
Though this extensive work on perspective comprehension and memory has done much to reveal underlying cognitive processes, we revisit a basic experimental design to
ask the question: What spatial perspective do listeners
adopt in the face of fundamental ambiguity? The work just
reviewed almost always resolves the ambiguity for the listener in the task. For example, Taylor et al. (1999) ‘‘force’’
the participants to one or another perspective and observe
the effects on processing (see also Black et al., 1979; Lee &
Tversky, 2005; Samson, Apperly, Braithwaite, Andrews, &
Bodley Scott, 2010; Tversky et al., 1999). The research on
speakers’ choices (e.g., Schober, 1993) is more open-ended
– the tendency for speakers to adopt one or another framework in an open-ended way has been the focus of conversational spatial perspective-taking work.
To study the underlying cognitive processes involved in
open-ended listener perspective-taking, we constructed a
simple scenario modeled after Schober (1993) and Tversky
and Hard (2009), but modified it to tap into the online
comprehension of referential terms. This classic experimental design has been used frequently to study production (as reviewed above), but rarely have the
spontaneous tendencies of listeners been studied (e.g., Fitneva & Song, 2009). In such cases, do listeners assume a
consistent reference-frame over the course of an interaction? Previous research suggests that perspective switch
costs are present during comprehension (Black et al.,
1979), but in this research the listener was not given the
opportunity to spontaneously choose a perspective under
ambiguity.
In addition, will taking another person’s spatial perspective (e.g., their left) always induce cognitive difficulty?
As reviewed above, speakers sometimes choose the addressee’s perspective, and here we test the same question with
listeners. If taking another’s perspective is cognitively challenging, will participants choose it as much as they would
their own reference frame? With the cognitive expense of
taking a perspective outside of one’s own, our design permits open-ended exploration of the selection tendencies
of comprehenders to take on their own or another’s per-
spective. One could predict that cognitive load alone, without sufficient interactive constraints, favors taking the
simplest perspective (often identified as one’s own, from
Levelt, 1989 to Epley, Keysar, Van Boven, & Gilovich,
2004, though there exists no consensus).
1.1. The social perspective-taking task
We modeled our task on the design of Schober (1993),
who used a basic object-identification task to study perspective taking in speakers. Speakers saw two identical objects placed on a two-dimensional surface, and had to
induce listeners to choose one of these objects. The speaker
and listener could occupy different positions, under relative rotation of 90° or 180°, on this two-dimensional surface (designated on a simulated, drawn table on a piece
of paper). Speakers thus had to formulate utterances using
spatial descriptions that involved particular perspectives.
For example, choosing her own perspective, a speaker
could simply say: ‘‘The object to my right.’’ This perspective is often termed the egocentric perspective, because
its frame of reference is the speaker herself. The speaker
may also say ‘‘The object to your left,’’ thus taking the other
person’s perspective, which we will refer to as ‘‘othercentric.’’ Other possible descriptions can be used, such as
‘‘object-centered’’ (‘‘Choose the one nearest the airplane
picture’’). In addition, speakers may simply produce
ambiguous statements, like ‘‘The one on the left’’ when
their positions are misaligned, and leave it to the listener
to figure things out.
General findings using this design are that speakers
take the addressee’s perspective (i.e., they are othercentric) when communicating with a simulated partner,
but will coordinate perspectives with actual partners when
they are present (Schober, 1993), and will choose objectcentered descriptions at times to avoid choosing an egocentric or other-centric perspective (Schober, 1995). In
addition, they will adapt these utterances under different
task conditions, such as directing their partners to the location versus asking about the location (Mainwaring et al.,
2003). Others have found that taking the other-centric perspective may involve cognitive cost (Horton & Keysar,
1996). This simple task has thus revealed the adaptive tendencies of speakers in choosing spatial perspectives when
formulating directions.
More recently, Tversky and Hard (2009) employed this
sort of task in a simple experiment that showed that the
mere implied presence of another person shaped the spatial perspective used by survey respondents. In a singletrial study, they presented participants with a picture of
two objects and queried them about where one of the objects was located. The presence of a person in the picture,
implying an ‘‘other’’ reference frame, induced usage of
other-centric responses (e.g., ‘‘on his left’’). In a second
study, they emphasized action in the instructions, and induced further other-centric descriptions. The authors concluded that the very presence of another person results in
the use of ‘‘external’’ reference frames, and because this
was done with only minimal cuing, there is a natural and
spontaneous tendency to embody the spatial perspectives
of another’s actions or potential actions.
Please cite this article in press as: Duran, N. D., et al. Listeners invest in an assumed other’s perspective despite cognitive cost. Cognition
(2011), doi:10.1016/j.cognition.2011.06.009
N.D. Duran et al. / Cognition xxx (2011) xxx–xxx
One implication from Tversky and Hard (2009), and
from the results of Schober (1993), is that participants
are willing, even with very minimal and non-interactive
cues, to take the perspective of another, despite the fact
that some have suggested that this perspective is not the
default and is cognitively costly (Epley et al., 2004; Horton
& Keysar, 1996). However, in these previous studies, no online measure of cognitive processing was extracted to confirm either of these findings in the context of the
perspective-taking task. To do so here, we created a scenario that required participants to select and present objects to a simulated partner. The generated actions were
analyzed for overall time of execution, as well as for more
nuanced and graded signatures of difficulty that would be
problematic, if not impossible, to capture with simple button presses.
Another unaddressed issue is that neither Schober
(1993), nor Tversky and Hard (2009), explored the tendencies of the listener, who may be equally willing to invest in the other-centric perspective during occasionally
ambiguous exchanges. In the following three studies,
our task offers answers to these basic questions. First,
we show that even with a cognitively costly other-centric perspective, requiring measurable mental rotation
to take on this perspective, comprehenders of directions
will invest other-centrically. We show that this happens
even during trials when there is a shared perspective –
when the egocentric perspective is all that is needed to
solve the task – participants still pay the cognitive cost
to ‘‘transform’’ the visual array to accommodate an assumed partner. In a second and third study, we show
that comprehenders’ perspective-taking behavior is influenced by the pragmatic constraints in the communicative environment. Participants may seek to optimize
mutual understanding with their partner, and depending
on an assessment about the partner’s ability to do the
same, other-centric interpretations can increase or decrease dramatically. Like Tversky and Hard (2009), we
employed a very simple task, but embedded it in a sequence of trials and captured online cognitive measures.
And like these authors, we show that there is spontaneous, strategic investment in other-centrism despite its
difficulty.
2. Study 1: Listener’s perspective choice in an
ambiguous spatial layout
We simulate a simple interaction where a person,
standing at a table, asks another person to hand him a
folder from a set of two folders laid out on the tabletop,
as in ‘‘Give me the folder on the left.’’ Such a request would
be straightforward if both speaker and addressee were
standing side-by-side with a shared perspective of the table. However, if the speaker were to walk around the table,
now facing the addressee from the opposite side, the same
request presents a coordination problem for the addressee.
Should the interpretation be grounded in one’s own frame
of reference, or that of the other? And if addressees are unable to ask for further clarification (for example, the speaker becomes distracted or momentarily leaves the room),
3
what are the constraints in the communicative context
that might influence their decision?
The above scenario was implemented in a computerbased environment where requests for a folder were made
by a virtual speaker, whose location around a simulated table changed from trial to trial. For some of these trials the
requests were ambiguous, allowing participants to respond
either ego- or other-centrically. To make a decision on how
to respond, we hypothesize that participants are likely
considering many factors, with arguably the most important being the characteristics of the speaker and his intended meaning.
For this task, there are two speaker-related influences
that have particular relevance. First, although the virtual
speaker sounds natural from turn-to-turn, the speaker is
nevertheless delivering pre-recorded speech. As a result,
participants could react by viewing this simulated partner
as a nonentity that cannot possibly possess a unique
perspective, thus making the egocentric response the only
viable option. The recording could also fail to activate
other-centric inclinations that, with a real partner, would
normally lead to other-centric responding. Second, given
the lack of explicit feedback, there is no way of knowing
the speaker’s intent, thus there is little risk in choosing a
‘‘wrong’’ response. Therefore, it is reasonable to suspect
that the interpretation that requires the least amount of
cognitive effort, namely, the egocentric response, would
be preferred.
Yet, the limitations of the simulated partner could also
be the very basis for taking the partner’s perspective. Such
behavior would occur if participants were approaching the
interaction as a collaboration for establishing mutual
understanding. According to the work of Clark and colleagues, conversational partners optimize understanding
by attempting to minimize the effort for both themselves
and their partner (Clark, 1996; Clark & Krych, 2004; Clark
& Wilkes-Gibbs, 1986). Moreover, if it appears that one of
the partners is failing to understand (Horton & Gerrig,
2002, 2005), or is likely to find the interaction difficult
(Bortfeld & Brennan, 1997; Brennan, 1990), the other will
expend greater cognitive effort to ensure mutual understanding. These assessments of the partner are rapidly
made, and can be influenced by knowledge of the other’s
needs, characteristics, or limitations at the start of the
interaction, as in adjustments based on a partner’s age or
a known language disorder (Newman-Norlund et al.,
2009; Perkins & Milroy, 1997). Here, not only must an
assessment be made in the service of comprehension, but
it must also be extended to a partner who is incapable of
collaboration. If the participant accepts the burden of
ensuring mutual understanding, and the limitations of
the recorded speaker are deemed sufficient, we would expect higher rates of other-centric responding.
Finally, a central claim emphasized earlier is that the
other-centric response requires greater cognitive effort.
To be clear, we consider cognitive effort to be the amount
of mental work devoted to linguistic comprehension. But
unlike the many language studies where effort is confined
to retrieving words or parsing sentences, here effort is extended to a larger pragmatic and visuospatial context. To
interpret a verbal request other-centrically, not only must
Please cite this article in press as: Duran, N. D., et al. Listeners invest in an assumed other’s perspective despite cognitive cost. Cognition
(2011), doi:10.1016/j.cognition.2011.06.009
4
N.D. Duran et al. / Cognition xxx (2011) xxx–xxx
participants consider the abilities of their partner, but also
their partner’s location in space. As such, cognitive effort
can be operationalized in the same ways as those used in
well-established research on visuospatial perspective
transformations (see Zacks and Michelon (2005) for a
review).
In this previous research, it has been shown that it
takes longer to describe objects from locations that are
offset from one’s own location (i.e., egocentric). Moreover,
the greater the deviation from the egocentric perspective,
the greater the time it takes to respond. This effect also
holds when describing objects from another person’s
perspective (Schober, 1993, 1995), regardless of whether
the person is real or simulated (Amorim, 2003; Schober,
1993) or perspective choice is spontaneous or forced
(Samson et al., 2010). Our scenario allows participants
to freely adopt the perspective of a simulated partner,
and to do so at varying locations of difficulty. This difficulty is implemented by offsetting the partner’s location
by 90°, 180°, and 270° relative to the participant. The
further participants must mentally rotate from their
own egocentric perspective to ‘‘view’’ the table from their
partner’s egocentric perspective, the greater the cognitive
effort invested.
Fig. 1. The experimental interface for capturing egocentric or othercentric behavior in a single computerized task. Note: The numeric labels
are for explanatory purposes only and were not seen by participants.
time (although payment would be forfeited per Amazon
Mechanical Turk rules).
2.1. Participants
2.2. Method
We recruited 89 participants using Amazon’s Mechanical Turk.1 Mechanical Turk is an online service that allows
users to post tasks that require human expertise, like rating
the usability of websites or compiling lists of business addresses. Tens of thousands of potential ‘‘workers’’ complete
tasks on their computers and receive a nominal payment
in return. Recently, cognitive psychologists and other scientists have begun exploring the potential of Mechanical Turk
for behavioral research (e.g., Dale & Duran, in press; Dale,
Duran, & Morehead, submitted for publication; Frank &
Gibson, 2011). Respondents are found to exhibit consistently
high inter-subject reliability (Kittur, Chi, & Suh, 2008; Snow,
O’Connor, Jurafsky, & Ng, 2008; Sorokin & Forsyth, 2008),
and maintain excellent reliability and significant correlations with laboratory-collected data (Munro et al., 2010).
Moreover, because participants are not necessarily aware
that they are being tested, nor is an experimenter present,
demand characteristics are reduced.
Participants in the current study (i.e., ‘‘task’’) were paid
40 cents for contributing responses over a 7–12 min
period. To identify unique responders, the IP address of
each participant’s computer was recorded and associated
with their data. If multiple sets of data had the same IP
address, we discarded these sets because the same participants may have completed the task more than once. If IP
addresses did not originate from the United States, these
data too were discarded. Once we identified unique
responders, the IP addresses were deleted from all files.
It should also be noted that prior to committing to the
study, participants were aware of the general purpose of
the study and that they were free to terminate at any
1
www.mturk.com.
The experiment was implemented in Adobe Flash and is
accessible from an Internet web browser.2 Upon navigating
to the correct webpage, participants read brief instructions
informing them that they would have to respond to simple
requests from a simulated partner.3 The participants then
proceeded by clicking a button with their computer mouse
to reveal an image of an empty tabletop that was centered
within a 330-by-330-pixel region on their computer screen.
After a brief pause (around 500 ms), a recording of their
male partner’s voice was automatically played. The partner
directed the participant to select one of two folders that
would appear on the tabletop. Each folder was 67-by-50pixels in size. The partner’s directions always began with
the phrase ‘‘Give me the folder that is. . .’’ and was completed
with, ‘‘on the right,’’ ‘‘on the left,’’ ‘‘in the back,’’ or ‘‘in the
front.’’ After hearing these directions, participants clicked
on a ‘‘GO’’ label at the bottom of the screen (centered
65-pixels below the tabletop image). At this point, the orientation of the folders and the location of the partner and
participant became visible. The folders were arranged in
one of four possible orientations: top-right diagonally
(Fig. 1, Position 2 and 3), top-left diagonally, vertically, or
horizontally, whereas the participant’s location was always
located at a ground position of 0° (Fig. 1, Position 1). The
participant’s task was to select a folder and drag it to the
2
A version of the experiment can be accessed at: cognactive.org/
perspectiveTask.
3
This partner was labeled as either an ‘‘Employer’’ or ‘‘Employee’’ in
Studies 1 and 2. These designated labels were used to test a separate
research question on social identity. No clear and consistent patterns in
responses were found between these labels. The analyses here are based on
the combined datasets.
Please cite this article in press as: Duran, N. D., et al. Listeners invest in an assumed other’s perspective despite cognitive cost. Cognition
(2011), doi:10.1016/j.cognition.2011.06.009
N.D. Duran et al. / Cognition xxx (2011) xxx–xxx
partner’s position around the table. The partner could appear in any of the four positions of 0°, 90°, 180°, or 270°
(Fig. 1, Position 4).
The partner’s verbal directions were strategically paired
with particular folder/partner combinations to create 20
shared-perspective and 20 critical trials. It is during the
critical trials that a participant’s current spatial perspective
taking is revealed. For example, based on the layout of
Fig. 1, if the participant hears, ‘‘Give me the folder on the
left,’’ and selects the folder at Position 2, they are acting
egocentrically (‘‘folder on my left’’). But if the folder at Position 3 is selected, the participant is now acting ‘‘othercentrically’’ (‘‘folder on your left’’). Given the same layout,
a shared-perspective trial would occur if the participant
now hears, ‘‘Give me the folder in the front.’’ Regardless
of whether the participant takes an egocentric or othercentric perspective, the folder at Position 2 is aligned with
both perspectives and should always be selected. These trials serve as a type of control.
As participants moved toward a folder, we were rapidly
sampling the x, y coordinates of their mouse cursor (similar
to Brennan, 1990). These coordinate positions were recorded every 25 ms (a sampling rate of 40 Hz) from the
moment the ‘‘GO’’ region was clicked to the initial selection of a folder. The ‘‘GO’’ region is a point of calibration
that ensures all responses begin at the same location. The
subsequent temporal and trajectory analyses are based
on these movements.
In presenting the stimuli to participants, we randomly
shuffled shared-perspective and critical trials, ensuring
that no more than three trials of the same type ever occurred in a sequence. In each list, all possible combinations
of folder orientation and partner location were seen at
least twice – except for horizontal and vertical orientations
when participants were located at 90° or 270°. At these
combinations, the directions are no longer ambiguous
and are likely to bias perspective choice. For example,
imagine the scenario where the speaker is offset by 90°
and the folders are laid out vertically. If the speaker asks
for the folder on the right, the participant is forced to consider the speaker’s perspective because ‘‘right’’ is interpretable only from where the speaker is located. Additional
constraints made it impossible for the speaker in critical
trials (nonshared-perspective) to appear at the 0 degree
location (shared-perspective), and for the speaker in
shared trials to appear at the 180 degree location (nonshared-perspective). In the end, the 20 critical trials consisted
of 4 trials each at the 90 and 270 degree position, and 12 at
the 180 degree position. The same was true for the 20
shared-perspective trials, except now with 12 trials at
the 0 degree position. Finally, for trials at the 90 and 270
degree positions, and separately at the 0 and 180 degree
positions, there were an equal number of folders oriented
top-right and top-left diagonally, and from each direction
type. Trials at the 0 and 180 degree position also included
the horizontal and vertical orientations.
5
and excessive delays. Guessing is evident in the sharedperspective trials where a response to the partner’s directions should always converge on a single folder. If this target folder was not selected, then the response was flagged
as an error. Furthermore, if a trial lasted for more than ten
seconds, it too was flagged. Participants who made two or
more errors were excluded. A total of eight participants, or
8.98% overall, were removed. In addition, 101 trials, or
3.16% overall, were removed because the response time
to select a folder exceeded three standard deviations above
the mean (i.e., greater than 3986 ms).
2.3.1. Patterns of egocentric and other-centric responders
Based on the consistency of responses across critical
trials, participants were classified as egocentric, other-centric, or mixed responders. To identify category membership, we computed proportions of egocentric and othercentric trials for each participant. If proportion scores exceeded 0.70 for one of the two response categories, then
the participant was considered to be a member of that category. Participants scoring below 0.70 were labeled as
mixed responders. There were only 8 (or 9.76%) mixed
responders. Participants tended to be on the extreme ends
of a bimodal distribution, with 31 participants (38.80%)
classified as egocentric and 43 participants (52.44%) as
other-centric. However, this difference was not statistically
significant, {2(1) = 1.95, p = 0.16.
2.3.2. Greater processing costs in taking another’s perspective
In this analysis, we evaluate whether other-centric
responders are mentally rotating to their partner’s visual
perspective. To do so requires the participants to rotate from
their own spatial frame of reference, located at ground position 0°, to the partner’s perspective that is located at position
90°, 180°, or 270°. This rotation should be increasingly difficult the further the rotation is from the personal frame of
reference (Michelon & Zacks, 2006; Miller & Johnson-Laird,
1976; Presson & Montello, 1994; Shepard & Metzler, 1971;
Zacks, Vettel, & Michelon, 2003). Thus, when the partner is
rotated 180° (directly across from the participant) there will
be greater processing difficulty than when the partner is rotated 90° (at a ground position of 90° or 270°). Because this
behavior is expected to occur just with other-centric
responders, processing times at each of the ground positions
should also be much higher when compared to egocentric
responders.
We used a linear mixed-effects model4 to evaluate response time differences across degree of rotation (withinsubjects factor; three levels: 0° vs. 90° vs. 180°) and responder type (between-subjects factor; two levels: consistent
egocentric vs. consistent other-centric). It is important to
note that the rotated position at 90° also includes trials at
270° because both positions require equivalent degrees of
rotation. To simultaneously account for random variance
due to participants and items, ‘‘participant’’ was added as
a random effect, as well as folder orientation (e.g., horizontal, diagonal, and vertical), and verbal directions
2.3. Results
To ensure that participants were committed to the task,
we identified suspicious behavior like random guessing
4
Models were built using the PASW (SPSS) MIXED procedure, with
denominator degrees of freedom estimated using Satterthwaite’s approximation (default).
Please cite this article in press as: Duran, N. D., et al. Listeners invest in an assumed other’s perspective despite cognitive cost. Cognition
(2011), doi:10.1016/j.cognition.2011.06.009
6
N.D. Duran et al. / Cognition xxx (2011) xxx–xxx
Fig. 2. The critical trial response times and standard errors of egocentric and other-centric responders at 90 and 180 degrees of partner rotation, and
shared-perspective response times at 0 degrees of rotation. For each increase in rotation, other-centric responders take much longer than egocentric
responders. Other-centric responders are also slower than egocentric responders at each degree of rotation (Section 2.3.2), including shared-perspective
response times at 0 degrees of rotation and 90 degrees of rotation (Section 2.3.4). Furthermore, other-centric critical and shared-perspective response times
at 90 degrees of rotation are nearly identical (Section 2.3.4). Note: Statistical significance is coded as: ⁄p < .05, ⁄⁄p < .01, ⁄⁄⁄p < .001.
(‘‘e.g., ‘‘right,’’ ‘‘left,’’ ‘‘front,’’ and ‘‘back’’). As an additional
precaution that would control for practice effects, models
with and without trial number were compared using a
log-likelihood ratio test as recommended by Baayen (2008)
and Quene and van den Bergh (2008). The addition of trial
number resulted in a model with a better overall fit, and
was therefore included as another random effect.5
The overall model revealed a statistically significant
main effect for degree of rotation, F(2, 1424.03) = 48.62,
p < .001, a significant main effect for responder type
F(1, 66.75) = 7.72, p = .007, and a significant interaction between the two factors, F(2, 2066.31) = 2066.31, p = .001.
Given the significant interaction, follow-up simple main
effects were conducted across degree of rotation for egocentric and other-centric responders. For other-centric
responders, there was an overall significant effect,
F(2, 1562.86) = 54.02, p < .001, resulting from increased
cognitive processing across degree of rotation (see Fig. 2).
Planned comparisons show significant difference between
0° and 90° (mean difference = 172 ms6), p < .01, Cohen’s
d = .17; and between 90° and 180° (mean difference = 164.66 ms), p < .01, d = .27. For egocentric responders,
although no significant increase was expected, these participants also appear to consider some aspect of the partner’s
5
These random effect variables are included in all subsequent mixedeffects analyses. The specific model used here was chosen to give a
conservative unbiased evaluation of the dependent variables, following
established parameter specification techniques (Baayen, 2008; Baayen,
Davidson, & Bates, 2008; Quene & van den Bergh, 2008).
6
All reported and plotted means are the estimated marginal means that
have been adjusted by linear mixed-effects models to remove random
variance due to participant, item, and practice effects. The estimated
marginal means are those used by the model to derive inferential statistics,
and represent a better, unbiased account of outcome response patterns.
changing location, F(1, 1792.42) = 9.12, p < .001, but only
when location shifted from 0° to 90° (mean difference = 123 ms), p < .01, Cohen’s d = .05. Yet, it is clear that
other-centric responders are engaged in greater cognitive
work, as the average linear increase of response time for
other-centric responders was much higher than that of egocentric responders. To demonstrate this overall increase, an
additional mixed-effects model was computed with response time regressed on degree of rotation, now coded as
a continuous factor (e.g., ‘‘1’’, ‘‘2’’, and ‘‘3’’). The slope of
the linear regression line for other-centric responders,
b = 170.93, is much steeper than that of egocentric responders, b = 75.67, F(2, 1637.83) = 65.69, p < .001.
The other-centric response times at each of the three
ground positions (0°, 90°, and 180°) were also significantly
higher than those of egocentric responders. A series of
analyses based on the overall model confirmed that at 0
degrees of rotation, other-centric responders were
183 ms slower, F(1, 78.36) = 3.57, p = .06 (marginal),
d = 0.37; at 90 degrees of rotation, other-centric responders were 232 ms slower, F(1, 92.07) = 5.29, p = .02,
d = 0.40; and at 180 degrees of rotation, other-centric
responders were 361 ms slower, F(1, 79.06) = 13.81,
p < .001, d = 0.50. Taken together with the previous results,
even though egocentric responders appear to be affected at
some level by the locational changes of the recorded
speaker, other-centric responders are taking on the additional processing costs of mental rotation.
2.3.3. Greater processing complexity and competition for
other-centric responders
As participants use their mouse cursor to select a
folder, fine-grained changes in the decision movement
Please cite this article in press as: Duran, N. D., et al. Listeners invest in an assumed other’s perspective despite cognitive cost. Cognition
(2011), doi:10.1016/j.cognition.2011.06.009
7
N.D. Duran et al. / Cognition xxx (2011) xxx–xxx
Table 1
Study 1: Means and standard errors for directional shifts, maximum deviation, and acceleration components at 0, 90, and 180 degrees of rotation.
Rotation
0
90
180
Directional shifts
Maximum deviation
Acceleration components
Ego
Other
Ego
Other
Ego
Other
1.23 (.06)
1.32 (.07)
1.36 (.07)
1.47 (.05)
1.65 (.07)
2.07 (.07)
12.16 (1.58)
11.12 (1.46)
8.45 (.77)
18.83 (2.27)
25.64 (2.16)
23.15 (1.54)
8.78 (.23)
9.79 (.34)
9.70 (.29)
10.29 (.23)
11.22 (.30)
12.17 (.29)
are captured. These changes reflect processing complexity
and competition as a decision is made. Here, we consider
three trajectory measures of complexity and competition:
directional shifts in motion, maximum deviation, and acceleration components. Directional shifts occur when the
mouse cursor switches direction along the x- or y-axis,
depending on the orientation of the folders. For example,
as a participant moves from the bottom of the screen to a
folder, they might change their mind and move towards
the other folder. This change will be reflected in a flip
of direction along the axis in which folders are arranged
(x-axis for diagonal and horizontal; y-axis for vertical).
We can also measure the strength of the initial bias that
brought about the directional flip. Because the visually
co-present folders in the horizontal and diagonal configurations are laid out side-by-side along the x-axis, and
the initial location of the cursor at the bottom of the
screen is set equidistantly between the two folders,
(x, y = 0, 0), we can measure the maximum deviation
away from 0 that the trajectory is ‘‘pulled’’ (also see Spivey & Dale, 2006). Here, ‘‘pull’’ is toward the competitor
folder, as would be the case for other-centric responders
that move towards the ‘‘egocentric’’ folder while making
a final selection. Presumably, the further responders
travel towards the egocentric folder en route to their
final selection, the stronger the bias to respond egocentrically (or other-centrically in the case of egocentric
responders).
Alternatively, uncertainty and competition in selecting
a folder can occur without directional change along an axis.
In this scenario, a participant makes an initial confident response towards one folder, but slows down, and then
speeds up, as they deliberate about their final response.
The acceleration component measures the number of such
response hesitancies (Dale & Duran, in press). All three
measures supplement reaction time by providing further
evidence of the increased cognitive effort associated with
being other-centric.
2.3.3.1. Directional shifts. As with the reaction time data, a
mixed-effects analysis was conducted with degree of rotation and responder type as fixed effects. Both main effects
were statistically significant (both at the p = .001 level), as
was the interaction term F(1, 1938.59) = 8.55, p < .001.
Based on the means reported in Table 1, the directional
shifts for other-centric responders appear to increase with
degrees of rotation. This pattern was validated with a simple effects analysis, F(2, 1418.71) = 31.97, p < .001, with
significant increases from 0° to 90° (mean difference = 0.18
shifts), p = .04, d = .15 and from 90° to 180° (mean difference = 0.42 shifts), p < .001, d = .32. For egocentric respond-
ers, there were no statistically significant increases.
Further comparisons also show that the other-centric
responders had more shifts than egocentric responders at
each of the three ground positions: 0° (mean difference = 0.24 shifts), F(1, 103.19) = 3.03, p = .08 (marginal),
d = .27; 90° (mean difference = 0.33 shifts), F(1154.83) =
4.83, p = .03, Cohen’s d = .36; and 180° (mean difference = 0.71 shifts), F(1, 106.38) = 26.43, p < .001, Cohen’s
d = .49.
2.3.3.2. Maximum deviation. The main effect for responder
type was significant, p < .001, as well as a significant interaction between responder type and degree of rotation,
F(2, 1693.31) = 3.41, p = .03. For other-centric responders,
there was greater movement toward a competitor folder
between 0 and 90 degrees of rotation (mean difference =
6.81 pixels), p = .004, d = .17. For egocentric responders,
there were no equivalent movements. The comparisons
at each of the ground positions also show that othercentric responders deviate more toward the competitor
folder
at
0°
(mean
difference = 6.67
pixels),
F(1, 153.06) = 4.31, p = .04, Cohen’s d = .17; at 90° (mean
difference = 14.52 pixels), F(1, 218.63) = 16.89, p < .001,
d = .48; and at 180° (mean difference = 14.71 pixels),
F(1, 83.19) = 10.83, p < .001, d = .62.
2.3.3.3. Acceleration components. Both main effects were
statistically significant (degree of rotation, p < .001; responder type, p = .01), as was the interaction term,
F(2, 1937.42) = 3.11, p = .04. Follow-up analysis on the
interaction shows an increase for other-centric responders
across each degree of rotation, F(2, 1740.46) = 4.89,
p = .008; with the planned comparison between 0° and
90° being significant (mean difference = 0.93 components),
p = .004, d = .10, and the comparison between 90° and 180°
also significant (mean difference = .95 components),
p = .004, Cohen’s d = .18. We also found a significant increase of acceleration components for egocentric responders, F(2, 1569.07) = 21.59, p < .001, but only for the planned
comparison between 0° and 90°, mean difference = 1.00
components, p = .01, d = .12. Finally, other-centric responders exhibited more acceleration components at each of the
three ground positions: 0° (mean difference = 1.51 components), F(1, 82.09) = 4.06, p = .05, d = .37; 90° (mean difference = 1.43 components), F(1, 100.54) = 3.32, p = .07
(marginal), d = .32; and 180° (mean difference = 2.47 components), F(1, 83.19) = 10.83, p = .001, d = .43.
Overall, the trajectory measurements provide a more
detailed evaluation of the decision dynamics that occur
during the time required to initiate and finalize a response
movement. For each measure, the decision to be other-
Please cite this article in press as: Duran, N. D., et al. Listeners invest in an assumed other’s perspective despite cognitive cost. Cognition
(2011), doi:10.1016/j.cognition.2011.06.009
8
N.D. Duran et al. / Cognition xxx (2011) xxx–xxx
centric is marked by greater instability, and that the activation of the competitor response, e.g., the egocentric option,
is strongest for other-centric responders. Moreover, the
processing difficulty associated with mentally rotating to
a partner’s visual perspective is amplified the further
respondents must travel in the rotation. This latter finding
generally holds for other-centric response behavior, with
the exception of maximum deviation from ground positions 90° to 180° (although the increase from 0° to 180°
was statistically significant, p = .01). And like the overall
response time analysis, there was some evidence that the
partner’s changing location added some complexity to egocentric responding, but only with acceleration components
between 0 and 90 degrees of rotation. There were no
equivalent effects with directional shifts and maximum
deviation.
2.3.4. Perspective-taking during shared trials
In the previous response time analysis (Section 2.3.2
above), the 0-degree rotated position was included as a
baseline in which participant and their partner shared
the same view of the table (and consequently, visual perspectives were aligned). At 0° there is no need to mentally
rotate; thus, other-centric response times may be similar
to that of egocentric responders. However, as evident by
the 183 ms difference, other-centric responders are somewhat slower. This result suggests other-centric responders
are continuing to consider their partner’s perspective, even
when it is unnecessary to do so. As further evidence, we
compared the shared-perspective trials of egocentric and
other-centric responses at 90 degrees of rotation. If
other-centric responders are mentally rotating and egocentric responders are not, then other-centric responders
will again be slower. Indeed, this result was confirmed,
(mean difference = 207 ms); F(1, 90.44) = 5.24, p = .02,
d = .29 (see Fig. 2). Finally, we assessed just other-centric
responses during shared-perspective and critical trials
at 90 degrees of rotation. Unlike the critical trials,
the shared-perspective trials present a situation where
the perspectives of participant and partner converge on
the same folder. These trials may be easier because there
is no conflict between perspectives (‘‘your right’’ or ‘‘my
right’’), thus there is no need for other-centric participants
to inhibit activation from their own perspective (‘‘my
right’’). However, the response times for critical and
shared-perspective trials were nearly identical, (shared:
M = 1756 ms,
SE = 44 ms;
critical:
M = 1792 ms,
SE = 38 ms); p > .10, suggesting that other-centric responders are continuing to mentally rotate, even when the visual
context does not place any demands on them to do so.
2.4. Brief discussion
Given an ambiguous spatial perspective task, participants exhibit an almost equivalent preference in taking
an egocentric or other-centric perspective. This commitment to any one perspective is a consistent mode of
responding that persists over an extended interaction with
an assumed social partner. Furthermore, we show that
other-centric responders are spontaneously choosing what
is a more cognitively challenging behavior. To take another’s perspective, responders are mentally rotating to
the spatial location of their partner. In doing so, they are
quite literally taking on the visual perspective of their partner. Not only does this process take longer the greater the
required rotation, but it is also marked by signatures of
processing complexity. Even so, more than half of the participants in this study chose to accommodate the perspective of their partner – a partner that participants know to
be a mere simulation.
3. Study 2: Perspective-taking modulated by
attributional cues
To respond other-centrically, participants appear to be
engaging in a pattern of behavior that optimizes the understanding of the other. Although the ‘‘other’’ in the current
context has no need or ability to understand, participants
still adhere to principles of collaboration, an apparent default for social interaction. Thus, when a partner is unable
to contribute equally to the collaboration, the more adept
partner expends greater effort to do what the other cannot
(Branigan, Pickering, Pearson, & McLean, 2010; Brennan,
1990). Of course, the question still remains: why then do
a large proportion of participants still act egocentrically?
One possibility is that the constraints in Study 1 are simply
insufficient. These constraints lack the force to trigger
attributions that are necessary for other-centric responding (Clark & Schaefer, 1989; Clark & Wilkes-Gibbs, 1986;
Wilkes-Gibbs & Clark, 1992). Such ineffectiveness might
be due to responders who have a high criterion for establishing a collaboration or noticing when a collaboration becomes unbalanced. Either way, by introducing a potentially
more relevant constraint, or attributional cue, the rate of
other-centric responding should dramatically increase.
However, if responders are ignoring the other as irrelevant,
and acting solely to minimize their own cognitive effort,
then the additional cue should have little effect on the
overall pattern of perspective-taking behavior.
In Study 2, we chose an additional attributional cue that
would be relevant only if participants were treating the
simulated speaker as a potential collaborator. As such, we
simply informed participants that their partner does not
know where they are seated at the table. Now, when the
participant hears a verbal direction like, ‘‘Give me the
folder on the right,’’ the interpretation must take into account that the partner’s knowledge state precludes the
possibility of ‘‘on the right’’ meaning the participant’s
‘‘right’’. The partner can only give directions from her
own perspective; thus, the participant should respond
other-centrically. In addition, using the same cognitive
measures, we consider whether this additional cue to be
other-centric makes an other-centric response easier, or
continues to be the more difficult option.
3.1. Participants
We recruited 96 participants using Amazon’s Mechanical Turk. Each was compensated with a small monetary
Please cite this article in press as: Duran, N. D., et al. Listeners invest in an assumed other’s perspective despite cognitive cost. Cognition
(2011), doi:10.1016/j.cognition.2011.06.009
N.D. Duran et al. / Cognition xxx (2011) xxx–xxx
payment (40 cents). The participants’ IP addresses were
also different from those in Study 1.
9
responders increased by 22%, {2(1) = 6.82, p < .01, whereas
egocentric responders decreased by 16%, {2(1) = 4.21,
p = .04.
3.2. Method
The only difference from Study 1 is a slight change in
the initial instructions. We now include the statement:
‘‘Your partner DOES NOT know where you are seated.’’
All other aspects of the experiment were identical.
3.3. Results
We removed 15 participants, or 15.62% overall, who appeared to be randomly guessing on shared-perspective trials, or took an excessive amount of time to answer (greater
than 10 s). We also removed 61 trials, or 2.62% overall, that
were three standard deviations above the response time
mean for selecting an initial folder (i.e., 3634 ms).
3.3.1. Patterns of egocentric and other-centric responders
If participants are considering what their partner
knows, it is expected that the introduction of a more perspicuous attributional cue will shift the distribution of consistent responders toward other-centric. Indeed, there was
evidence of this effect. Based on the proportion of response
type over critical trials, 18 (22%) participants consistently
responded egocentrically on more than 70% of the trials,
and 60 (74%) participants consistently responded othercentrically. This difference between other-centric and egocentric responders was significant, {2(1) = 22.62, p < .001.
Only 3 (3.7%) participants were mixed responders (see
Fig. 3). Furthermore, not only was there a difference of
other-centric and egocentric responding within Study 2,
but when compared against Study 1, other-centric
3.3.2. Greater processing costs in taking another’s perspective
A linear mixed-effects model wth response time as the
dependent variable revealed a significant interaction between rotation and responder type, F(2, 2193.62) = 7.19,
p = .001. Starting with a closer examination of othercentric responders, the linear increase across 0°, 90°, and
180° was statistically significant, F(2, 1061.13) = 44.19,
p < .001. Planned comparisons show a marginally significant increase between 0° and 90° (mean difference = 59 ms), p = .07, d = .10; and between 90° and 180°
(mean difference = 202 ms), p < .001, d = .32 (see Fig. 4).
For egocentric responders, there was an overall marginally
significant increase, F(2, 2019.48) = 2.42, p = .09, but was
confined to the increase from 0° to 90°, mean difference = 120 ms, p = .04, d = .13. To evaluate whether this
egocentric increase was equivalent to the other-centric increase, we regressed reaction time on degree of rotation,
with degree of rotation coded as a continuous variable.
As expected, the fitted regression lines were significantly
different, with other-centric rising on average 132 ms
and egocentric only rising 47 ms, F(2, 1904.15) = 44.70,
p < .001.
Interestingly, in the between-subjects analysis at each
degree of rotation, other-centric responders took much
longer to respond than egocentric responders at two of
the three ground positions (Fig. 4). Follow-up comparisons
show statistically significant differences at 0° (mean difference = 203 ms), F(1, 88.01) = 2.91, p = .09 (marginal),
d = .28; and at 180° of rotation (mean difference = 380 ms),
F(1, 90.35) = 9.99, p = .002, d = .46.
3.3.3. Processing complexity and competition for other-centric
responders
Separate mixed-effects models were built with each of
the three trajectory measures as dependent variables. The
measures, and their descriptives, are displayed in Table 2.
3.3.3.1. Directional shifts. The interation for the overall
model was statistically significant, F(2, 2094.61) = 3.60,
p = .03. With this measure, other-centric responders had
more shifts across rotated positions, F(2, 2092.19) = 19.85,
p < .001, particularly between 0 and 90 degrees rotation,
mean difference = .341 shifts, p < .001, d = 28. And, as in
the previous analysis, other-centric responders had more
shifts than egocentric responders at each ground position:
at 0° (mean difference = .494 shifts), F(1, 108.16) = 8.81,
p = .004, d = .42; at 90° (mean difference = .449 shifts),
F(1, 150.56) = 6.13, p = .01, d = .36; and at 180° (mean difference = .796 shifts), F(1, 115.24) = 22.15, p < .001, d = .58.
Fig. 3. Distribution of egocentric and other-centric responders across the
three studies. For Study 2, there are a greater number of other-centric
responders and fewer egocentric responders when compared to Study 1.
For Study 3, this pattern is reversed, with a greater number of egocentric
responders and fewer other-centric responders when compared to Study
2 and Study 1.
3.3.3.2. Maximum deviation. Only a statistically significant
main effect for degree of rotation was found for this study,
F(2, 1827.93) = 4.19, p = .02, d = .25, showing a significant
increase of 6.73 deviations from 0 to 90 degrees of rotation.
3.3.3.3. Acceleration components. A statistically significant
main effect for degree of rotation, F(2, 2092.22) = 2.27,
Please cite this article in press as: Duran, N. D., et al. Listeners invest in an assumed other’s perspective despite cognitive cost. Cognition
(2011), doi:10.1016/j.cognition.2011.06.009
10
N.D. Duran et al. / Cognition xxx (2011) xxx–xxx
Fig. 4. The critical response times and standard errors of egocentric and other-centric responders at 90 and 180 degrees of partner rotation, and sharedperspective response times at 0 degrees of rotation. Between 0–90°, and 90–180°, other-centric responders take more time, whereas egocentric responders
increase from 0° to 90° (Section 3.3.2). However, other-centric responders are overall slower than egocentric responders (Section 3.3.2), including sharedperspective response times at 0 degrees of rotation (Section 3.3.4). Furthermore, other-centric critical and shared-perspective response times at 90 degrees
of rotation are nearly identical (Section 3.3.4).
Table 2
Study 2: Means and standard errors for directional shifts, maximum deviation, and acceleration components at 0, 90, and 180 degrees of rotation.
Rotation
0
90
180
Directional shifts
Maximum deviation
Acceleration components
Ego
Other
Ego
Other
Ego
Other
1.13 (.07)
1.22 (.09)
1.21 (.09)
1.63 (.05)
1.67 (.57)
2.01 (.56)
14.28 (3.66)
17.15 (4.65)
17.22 (4.39)
17.84 (1.46)
28.44 (2.47)
25.46 (1.59)
9.15 (.34)
9.77 (.43)
9.96 (.35)
11.24 (.23)
11.25 (.26)
12.72 (.25)
p < .001, revealed an increase from 90° to 180° (.32 components), d = .27. Also, a statistically significant main effect
for responder type, F(1, 72.44) = 6.72, p = .01, verified that
other-centric responders overall had 2.11 more components than egocentric responders, d = .48.
3.3.4. Perspective-taking in shared trials
The 0-degree rotated position represents shared perspectives where there is no need to mentally rotate. Despite this, other-centric responders still engage in greater
cognitive work, as they are 203 ms slower than egocentric
responders (see Section 3.3.2 above). When comparing the
shared-perspective trials of egocentric and other-centric
responses at 90 degrees of rotation, again, other-centric
responders are slower by 172 ms; however, this effect is
not statistically significant (Fig. 4). But importantly, the
other-centric response times during shared-perspective
and critical trials at 90 degrees of rotation did not vary.
Similar to Study 1, this result suggests that other-centric
responders are consistent across trial types, apparently engaged in a stable strategy of considering the perspective of
their partner.
3.4. Brief discussion
When the attributional cues in a communicative context are made more obvious, participants use this information to shape their perspective-taking choice. Here, a
participant is told that a partner does not know their location (and thus must infer that the partner is unable to take
their perspective). This information was sufficient to bias
nearly two-thirds of the participants to an other-centric
mode of responding. By doing so, participants are taking
on the visuospatial perspective of their partners in assessing how to optimally respond. Furthermore, this mode is
sustained throughout the trials, despite the finding that
this behavior is comparatively more difficult than egocentric responding, as it was in Study 1 (based on response
time and complexity measures).
4. Study 3: Perspective-taking under the assumption
that the direction-giver is ‘‘real.’’
Thus far, we have argued that participants adhere to
principles of collaboration when interacting with a simulated partner. Moreover, fundamental attributions about
Please cite this article in press as: Duran, N. D., et al. Listeners invest in an assumed other’s perspective despite cognitive cost. Cognition
(2011), doi:10.1016/j.cognition.2011.06.009
N.D. Duran et al. / Cognition xxx (2011) xxx–xxx
the speaker’s inability to collaborate tend to lead to greater
other-centric responding. In Study 3, we examine this
claim in greater detail by now asking: what if participants
were to interact with a seemingly real human partner? To
achieve this pretense, we use the same experimental stimuli as in Study 1, but add a short, interactive introduction
to convince participants that they are connected to a real
human being.
In this new scenario, participants have a potentially collaborative partner who can be held responsible for ensuring mutual understanding. And because the partner is
initiating the request, and is the one that wants the folder,
the burden of collaboration is likely shifted to the speaker
– at least to a greater extent than in Study 1 and 2. In these
previous studies, there was no way for the recorded speaker to know the participant’s perspective, by virtue that the
speaker was removed in space and time. And because they,
the participants, were the only ones able to adopt an othercentric perspective, they seemingly obliged. In this study
we remove the partner’s previous limitations, and expect
the new attributional information to lead to greater egocentric responding.
4.1. Participants
Eighty participants were recruited from Amazon’s
Mechanical Turk and were paid 75 cents for their participation. The increase in payment is due to an increase of
about 5 min needed to complete the introductory stage
of the study. The participants’ IP addresses were also different than those in Studies 1 and 2.
4.2. Method
The major addition from Study 1 was to lead participants into thinking that they were remotely connecting
to another Mechanical Turk ‘‘worker’’ or Mechanical Turk
‘‘requester’’ (counterbalanced across participants). To
establish this ruse, we began with slight modifications in
how the study was advertised on Mechanical Turk, as well
as in the wording of the initial instructions. Both were altered to provide a cover story that we (i.e., the ‘‘Requesters’’) were testing software that would allow users to
engage in real-time audio communication within a Flashbased environment. However, because we were in ‘‘the
early stages of development,’’ the audio could only be
transmitted one-way. Nevertheless, we were most interested in the ability to connect two users, and to have these
users perform a joint activity within a shared virtual space.
And identical to Study 1, the instructions went onto explain that the activity would require participants to move
simulated folders in response to a direction-giver’s verbal
requests. But unlike Study 1, before participants could proceed, they had to first engage in an extended process of
connecting to what would appear to be a ‘‘real-life’’
partner.
In this second step of the introductory stage, participants were directed to press a large button labeled: ‘‘CLICK
TO CONNECT.’’ Upon clicking, participants were taken to a
screen with the message ‘‘Connecting to another Turk user,
please wait a few seconds. . .’’ or ‘‘Connecting to a Reques-
11
ter, please wait a few seconds. . .’’ displayed near the top.
The ellipses in the message above flashed every 500 ms
to mimic dynamic updating as the software attempted to
‘‘connect’’ to an outside worker or requester. After approximately 5000 ms, the ellipses stopped blinking and a new
screen appeared with the word ‘‘CONNECTED’’ in the upper
right-hand corner, and with an empty text box in the middle of the screen. After a brief delay, the male voice in the
previous experimental phases of Studies 1 and 2 began to
speak in the same tone and volume as in the experimental
trials. This male partner reiterated information from the
written instructions, which was that he alone could transmit audio but was unable to receive audio from the participant. Next, to (ostensibly) ensure that there was indeed a
connection, the male partner then asked the participant to
type the code word ‘‘CAT’’ into the empty text box and
press the ENTER key. If an incorrect code word was given,
the partner repeated the code word with a slightly different intonation. Once the male partner ‘‘received’’ the correct code word, thereby indicating to the participant that
the partner could hear him or her, the partner stated that
the instructions allowed both of them to proceed. The participant then clicked a button to begin the identical experimental task employed in Studies 1 and 2.
At the completion of the experimental task, several follow-up questions were asked to ensure that the participants believed that they had been connected to a real-life
Turk worker or requester. The first and second questions
were open-ended responses, with the first question
addressing general comments and feedback about the
Mechanical Turk task, and the second question asking,
‘‘How did you feel about your interaction with your partner?.’’ The third and fourth questions were yes or no responses, asking, ‘‘Did you FEEL like you were connected
to an actual person?,’’ and ‘‘Did you BELIEVE you were connected to an actual person?.’’ Each question was presented
in isolation on separate screens.
4.3. Results
We discarded trials from participants who responded
that they did not feel/believe that they were connected
to a real person, or if a participant indicated in any way
that they thought their partner was simulated (e.g., ‘‘it
sounded like a recording,’’ ‘‘the person was a fake’’). We
achieved a 75% deception rate in this modified task, with
60 out of the 80 participants seemingly convinced that
the virtual confederate was indeed a real person.
There were no participants who appeared to be randomly guessing, but one participant did have several trials
that exceeded the maximum time limit of ten seconds. This
participant was removed. Furthermore, 60 trials, or 2.58%
overall, were removed because they exceeded three standard deviations above the response time mean for selecting an initial folder (i.e., 3892 ms).
4.3.1. Patterns of egocentric and other-centric responders
The introduction of a seemingly real partner should
change the collaborative attributions that can be made.
In the earlier studies, the partner was an ineffectual collaborator, forcing participants to take on the additional work
Please cite this article in press as: Duran, N. D., et al. Listeners invest in an assumed other’s perspective despite cognitive cost. Cognition
(2011), doi:10.1016/j.cognition.2011.06.009
12
N.D. Duran et al. / Cognition xxx (2011) xxx–xxx
of being other-centric. Now, the partner is capable of carrying the load, allowing participants to delegate work to the
partner. In this study, there were 38 (64%) egocentric
responders, 17 (29%) other-centric responders, and 4 (7%)
mixed responders (based on 70% or more of trials being
consistently ego- or other-centric). Thus, there is a higher
rate of egocentric responding, {2(1) = 8.02, p = .004, suggesting participants were indeed sensitive to the new attributional cues. In comparison, the 20 participants who
reported having a simulated partner, and therefore less
influenced by the attributional information, were also less
likely to respond egocentrically, with a response rate that
decreased by 22%.
Lower rates of egocentric responding were also evident
in Studies 1 and 2. Compared to the current result, there
was a 27% decrease of egocentric responding in Study 1,
{2(1) = 6.93, p = .008; and a 42% decrease for Study 2,
{2(1) = 20.55, p < .001. Conversely, in comparing rates of
other-centric responding across studies, there was a 24%
increase for Study 1, {2(1) = 6.87, p = .009; and a 45% increase for Study 2, {2(1) = 20.62, p < .001.
4.3.2. Greater processing costs in taking another’s perspective
A significant main effect was found for degree of rotation and responder type, both at the p < .001 level. Importantly, the interaction term was also significant,
F(2, 1594.32), p = .004 (see Fig. 5). Simple main effects for
degree of rotation at egocentric and other-centric responders showed statistically significant increases of response
time for both responder types across degree of rotation:
other-centric responders, F(2, 1594. 61) = 28.16, p < .001,
and egocentric responders, F(2, 1593.66) = 14.74, p < .001.
These increases occured from 0° to 90°, with other-centric
responders 208 ms slower, p < .001, d = .25; and egocentric
responders 78 ms slower, p = .03, d = .08. There were also
increases from 90° to 180°, with other-centric responders
167 ms slower, p = .004, d = .25; and egocentric responders
99 ms slower, p = .007, d = .18. Although both egocentric
and other-centric responders showed an upward trend
across degree of rotation, the cognitive costs of mental
rotation enacted a greater cost on other-centric responders. This is evident in the comparison of the fitted regression lines, whereby other-centric responders had a
steeper rise in response times, b = 195 ms, than that of
the egocentric response times, b = 88 ms, F(2, 1353.19) =
48.73, p < .001.
And lastly, simple main effects for responder type at
each ground position also show that other-centric
responders required more time to respond at 0° (mean difference = 339 ms), F(1, 63.15) = 9.51, p = .003, d = .64; at
90° (mean difference = 468 ms), p < .001, d = .76; and at
180° (mean difference = 537 ms), p < .001, d = .77.
4.3.3. Greater processing complexity and competition for
other-centric responders
4.3.3.1. Directional shifts. The main effect for degree of rotation was statistically significant, F(2, 1541.61) = 6.96,
p = .001, as was the main effect for responder type,
F(1, 51.51) = 4.82, p = .03. Thus, the increase from 0° to
90° (.08 shifts) and from 90 to 180 (.161 shifts) showed
increasing difficulty that is similar for egocentric and
other-centric responders; and, overall, the other-centric
responders still had more shifts then egocentric responders
(see Table 3). However, the interaction term in this model
was not statistically significant.
Fig. 5. The critical response times and standard errors of egocentric and other-centric responders at 90 and 180 degrees of partner rotation, and sharedperspective response times at 0 degrees of rotation. Between 0–90°, and 90–180°, other-centric responders take more time, whereas egocentric responders
increase from 0° to 90°. However, other-centric responders are generally slower than egocentric responders (Section 4.3.2), including shared-perspective
response times at 0 degrees of rotation (Section 4.3.4). Furthermore, other-centric critical and shared-perspective response times at 90 degrees of rotation
are nearly identical (Section 4.3.4).
Please cite this article in press as: Duran, N. D., et al. Listeners invest in an assumed other’s perspective despite cognitive cost. Cognition
(2011), doi:10.1016/j.cognition.2011.06.009
13
N.D. Duran et al. / Cognition xxx (2011) xxx–xxx
Table 3
Study 3: Means and standard errors for directional shifts, maximum deviation, and acceleration components at 0, 90, and 180 degrees of rotation.
Rotation
0
90
180
Directional shifts
Maximum deviation
Acceleration components
Ego
Other
Ego
Other
Ego
Other
1.14 (.05)
1.16 (.07)
1.29 (.06)
1.41 (.09)
1.54 (.12)
1.73 (.10)
9.08 (1.14)
9.02 (1.13)
9.73 (.93)
16.22 (2.25)
24.01 (3.13)
21.64 (2.25)
9.23 (.22)
9.71 (.32)
10.39 (.27)
11.56 (.47)
13.00 (.69)
12.71 (.59)
4.3.3.2. Maximum deviation. Both main effects and the
interaction were statistically significant. With further analysis of the interaction, F(2, 1339) = 3.34, p = .04, simple
main effects for degree of rotation at responder type show
that only other-centric was statistically significant,
F(2, 1330.17) = 4.88, p = .008, with the greatest deviation
occuring from 0 to 90 degrees of rotation (mean difference = 7.79 pixels), p = .003, d = .30. Simple main effects
for responder type at degree of rotation also show that
other-centric responders had greater deviation toward
the competitor response at 0° (mean difference = 7 pixels),
F(1, 87.31) = 4.97, p = .03, d = .30; at 90° (mean difference = 15 pixels), F(1, 117.61) = 18.80, p < .001, d = .66;
and
at
180°
(mean
difference = 12
pixels),
F(1, 95.14) = 13.22, p < .001, d = .56.
4.3.3.3. Acceleration components. Again, both main effects
were statistically significant, but the interaction term was
not. Interpreting the main effects, there is an overall increase from 0° to 90° (.96 components) and from 90° to
180° (.67 components), F(2, 1168.43) = 16.56, p < .001.
There is also a greater number of acceleration components
for other-centric responders (12.76 components) than egocentric responders (9.78 components), F(1, 51.30) = 7.38,
p = .009.
4.3.4. Perspective-taking during shared trials
As reported in Section 4.3.2 above, and shown in Fig. 5,
at 0 degrees of rotation the response times for othercentric responders were 339 ms slower than egocentric
responders. For shared trials at 90 degrees of rotation,
other-centric responders are again slower, now by
365 ms, F(1, 52.35) = 8.55, p = .005, d = .54. Furthermore,
there were no statistically significant differences between
shared and critical trials at 90 degrees of rotation. Taken
together, these results provide further evidence that
other-centric responders are continuing to mentally rotate
even when it is not necessary to do so.
4.4. Brief discussion
In Studies 1 and 2, participants interacted with a recorded speaker who was unable to share in the work of
forging mutual understanding. In the current study, the
same speaker now has the potential to be a viable contributor who shares the participant’s goals of collaboration. If
participants are sensitive to this information, and are willing to act on it, then previous demands that led to othercentricism are largely diminished. As such, the expected
result is a greater percentage of egocentric responders. Indeed, this was found. There was a significant increase of
egocentricism when compared to Studies 1 and 2. But like
these earlier studies, there were still some who chose to be
other-centric – nearly a third of the participants. Like the
egocentric participants in Study 1 who appeared to have
a stricter criterion for when they would take another’s perspective, the other-centric participants in this study might
also need more evidence that the speaker is acting with
them in mind (thereby allowing the participants to be egocentric). Interestingly, the cognitive ease associated with
taking an egocentric perspective does not seem to matter.
The other-centric responders still continued to expend
more cognitive effort, as shown in the mental rotation
analysis for response time, complexity, and competition.
Moreover, this effort was sustained over trials, even when
visual perspectives between participant and speaker were
shared.
5. Additional analysis: Right/left vs. back/front trials
In this paper, we have focused on spatial descriptions
that occurred in two referential systems, one where objects
are situated in terms of one’s own bodily coordinates, and
the other in which an external agent’s coordinates are
adopted. The descriptions were based on a common set
of referential terms, ‘‘right,’’ ‘‘left,’’ ‘‘front,’’ and ‘‘back,’’ that
could be used to distinguish folders relative to the viewpoint of either agent. For the right/left terms, a folder occured in conjunction with either an agent’s right or left sides,
thus intrinsic side was sufficient for identification. For
back/front trials, an alternative conceptualization is
needed, as folders were always to an agent’s front. In these
trials, participants are likely assigning the folder nearest an
agent (either themselves or their partner) as being the one
in ‘‘front,’’ or by imposing an intrinsic ‘‘front’’ or ‘‘back’’ to
the folder itself. Either way, different conceptual representations might lead to different processing costs, with previous research suggesting that back/front referential terms
are easiest to process (e.g., Franklin & Tversky, 1990).
Our goal in this section is to determine whether the
processing costs of right/left trials substantially vary from
back/front trials; and if so, how this might change our current interpretations of cognitive effort in perspectivetaking. To start, we combined the data from Studies 1 to
3 and plotted the response times of egocentric and othercentric participants’ right/left and back/front trials (which
divides the dataset into two equal parts). Based on visual
inspection alone (see Fig. 6), the bulk of the cognitive work
appears to be carried by the right/left trials. For these referential terms, there is considerable separation between
egocentric and other-centric response times, and when
compared to egocentric participants, the other-centric par-
Please cite this article in press as: Duran, N. D., et al. Listeners invest in an assumed other’s perspective despite cognitive cost. Cognition
(2011), doi:10.1016/j.cognition.2011.06.009
14
N.D. Duran et al. / Cognition xxx (2011) xxx–xxx
Fig. 6. The right/left and back/front critical response times of egocentric and other-centric responders at 90 and 180 degrees of partner rotation, and
shared-perspective response times at 0 degrees of rotation.
Table 4
For Studies 1–3, with back/front and left/right trials, the average increase of
response time across degree of rotation (linear slope) and response time
differences between egocentric and other-centric responders.
Table 5
For Studies 1–3, with just left/right trials, the average increase of response
time across degree of rotation (linear slope) and response time differences
between egocentric and other-centric responders.
All trials
Right/left trials
Study 1
Slope (b) Other
Slope (b) Ego
0 (Other–Ego)
90 (Other–Ego)
180 (Other–Ego)
Study 2
⁄⁄⁄
171 ms
76 ms ⁄⁄⁄
183 ms
232 ms ⁄
361 ms ⁄⁄⁄
Note: statistical significance is coded as:
132 ms
46 ms
203 ms
142 ms
380 ms
⁄
p < .05,
Study 3
⁄⁄⁄
⁄⁄
⁄⁄
p < .01,
Study 1
⁄⁄⁄
195 ms
87 ms ⁄⁄⁄
339 ms ⁄⁄
468 ms ⁄⁄⁄
537 ms ⁄⁄⁄
⁄⁄⁄
p < .001.
ticipants show a much steeper rise in response times
across degree of rotation. Conversely, the back/front trials
for egocentric and other-centric responders are essentially
the same, with minimal increases over degree of rotation.
The pattern in Fig. 6 does not alter our earlier claims
that there are greater cognitive costs in being other-centric, and that these costs are marked by time-consuming
visuospatial transformations. However, an update is
needed. When mental transformations do occur, they do
so between the intrinsic ‘‘right’’ and ‘‘left’’ sides of speaker
and participant. This finding also suggests that by combining right/left and back/front trials, we have inadvertently
weakened the critical effects reported in Studies 1–3. To
examine this possibility, we compare our previous findings
with a new round of analysis using just right/left trials.
Rather than reporting the entirety of the additional statistical testing, we simply report the response time differences and p values. For example, Table 4 shows a
summary of the results from Studies 1 to 3, including the
average increase of response times across degree of rotation (e.g., slope (b) Other), and the difference in response
times between egocentric and other-centric responders
(e.g., 0 (Other–Ego)). Asterisks by each response time indi-
Slope (b) Other
Slope (b) Ego
0 (Other–Ego)
90 (Other–Ego)
180 (Other–Ego)
Study 2
⁄⁄⁄
271 ms
128 ms⁄⁄⁄
228 ms⁄
438 ms⁄⁄
620 ms⁄⁄⁄
Note: statistical significance is coded as:
Study 3
⁄⁄⁄
257 ms
140 ms⁄⁄⁄
210 ms
318 ms⁄
703 ms⁄⁄⁄
⁄
p < .05,
⁄⁄
p < .01,
386 ms⁄⁄⁄
104 ms⁄⁄⁄
377 ms⁄⁄
724 ms⁄⁄⁄
990 ms⁄⁄⁄
⁄⁄⁄
p < .001.
cate the level of significance. Using the same analytical approach as before, Table 5 provides a snapshot of the results
when back/front trials are removed. By comparing these
two tables, it is clear that our earlier interpretations from
Studies 1 to 3 still hold. Moreover, an even stronger case
can be made given the response times in Table 5 (without
back/front trials) greatly exceed those shown in Table 4
(with back/front trials),7 and are still significant despite
the considerable loss of statistical power.
Given the improvement for response times, we now
have reason to believe that the use of ‘‘back’’ and ‘‘front’’
directions may have also obscured critical effects in the
trajectory measures. Because trajectory measures are
much more sensitive to subtle changes in response movements, any noise introduced by the back/front trials is
likely to be amplified. This sensitivity might explain why
the trajectory measures across the three studies were not
as robust as the response time results. By removing back/
front trials, we expect the trajectory measures to provide
7
These simple effects analyses are based on overall models with
statistically significant interaction terms (p < .05).
Please cite this article in press as: Duran, N. D., et al. Listeners invest in an assumed other’s perspective despite cognitive cost. Cognition
(2011), doi:10.1016/j.cognition.2011.06.009
15
N.D. Duran et al. / Cognition xxx (2011) xxx–xxx
Table 6
For select trajectory measures, the average increase for each measure across degree of rotation (linear slope) and differences between egocentric and othercentric responders (right/left trials only). The statistically significant interactions between rotation and responder type are also reported.
Right/left trials
Study 2
Interaction
Slope (b) Other)
Slope (b) Ego
0 (Other–Ego)
90 (Other–Ego)
180 (Other–Ego)
Study 3
Maximum deviation
Acceleration components
Directional shifts
Acceleration components
F(2,993.64) = 2.84, p = .05
5.57⁄⁄⁄
1.62
2.96
19.81⁄⁄
16.72⁄
F(2,988.63) = 5.49, p = .004
1.43⁄⁄⁄
0.09
2.05⁄
2.12⁄
4.37⁄⁄⁄
F(2,733.76) = 5.55, p = .004
0.35⁄⁄⁄
0.07
0.23
0.35
0.83⁄⁄⁄
F(2,731.50) = 11.76, p < .001
2.33⁄⁄⁄
0.27
2.49
4.92⁄⁄⁄
6.49⁄⁄⁄
Note: statistical significance is coded as:
⁄
p < .05,
⁄⁄
p < .01,
⁄⁄⁄
p < .001.
better support for increased complexity and competition in
other-centric response movements.
A new series of analysis was conducted for each of the
three trajectory measures for each of the three studies.
The general trends found in the earlier studies were again
replicated, albeit with increased or similar values for
other-centric responders relative to egocentric responders.
Because of the large number of additional analyses, and given that they are consistent with our previous conclusions,
we highlight only those results that had the largest gains.
In these earlier analyses, there were four (out of nine) trajectory measure models that showed no interaction between degree of rotation and responder type. In the new
analysis, the interaction term for each model is now statistically significant (see Table 6). A follow-up analysis across
degree of rotation reveals evidence of mental rotation for
just other-centric responders. And importantly, follow up
analysis comparing responder type at degree of rotation
(i.e., ground position) shows statistically significant differences between egocentric and other-centric responders at
several positions.
6. General discussion
In the introduction, we reviewed the extensive work on
the role of perspective-taking in spatial memory and other
forms of linguistic and navigational behavior. Yet, in a social context, few have looked at the listener’s natural tendencies to disambiguate perspectives in spatial
description. In the studies presented here, we created a
simple communicative context that required participants
to select one of two referents based on a simulated partner’s requests. The scenario is modeled after a hypothetical
situation where a person, standing next to a table, is asked
by someone else for one of two folders that are laid out on
the table. If the person making the request says, ‘‘Give me
the one on the right’’, and is directly across from the standing person (i.e., addressee), it is unclear if the addressee
should assume their own ‘‘right’’ (an egocentric interpretation) or the requester’s ‘‘right’’ (an other-centric
interpretation).
In Study 1, it is during these open-ended trials, with
loose constraints for how to disambiguate, that we evaluated participants’ spontaneous perspective-taking behavior. We found that more than half of the participants
consistently provided an other-centric response, although
this did not statistically differ from number of egocentric
responders. Even so, the other-centric perspective was
chosen despite a clear processing advantage for egocentric
responding, as evidenced by faster response times and less
complexity in egocentric response movements.
In Study 2, we sought to better grasp the driving force
that might influence participants to be other-centric. We
hypothesized that such responding was largely motivated
by attributions about the simulated partner’s ability (or
inability) to contribute to mutual understanding. However,
there is always the possibility that the artificial nature of
the task also places unintended task demands on a participant. To rule this out, we kept the same design as Study 1,
but added a single cue that was directed at influencing
partner-specific attributions. Here, the critical attributional
cue was a slight change to the instructions, informing participants that their partner does not know where they are
located. This information makes it impossible for the partner to consider any perspective other than his own. Following the goals of collaboration, participants who are
sensitive to this more explicit information are likely to take
on the increased burden of interpreting from the one perspective that is guaranteed to be understood, namely, the
simulated partner’s. Compared to Study 1, the result
should be an increased number of other-centric responders, and a continued cognitive cost of responding othercentrically. As expected, this predicted outcome was
confirmed.
Following the same rationale as in Study 2, we sought to
manipulate a single attributional cue that would now bias
response behavior toward egocentrism. By doing so, a
stronger case could be made about the role of partner-specific attributions in influencing perspective-taking behavior. In Study 3, the same design as Study 1 was used, but
this time we removed the constraint of having to interact
with a simulated partner. Participants now believed that
they were receiving directions from a real person – a person who might share responsibility in working toward mutual understanding. Moreover, because this ‘‘real’’ partner
was the one making the requests, and did not correct any
of the participant’s responses, we expected participants
to direct most of the responsibility toward the partner. If
so, participants are likely to assume the partner is acting
other-centrically, thereby choosing egocentric behavior
for themselves. Indeed, compared to Study 1, this predic-
Please cite this article in press as: Duran, N. D., et al. Listeners invest in an assumed other’s perspective despite cognitive cost. Cognition
(2011), doi:10.1016/j.cognition.2011.06.009
16
N.D. Duran et al. / Cognition xxx (2011) xxx–xxx
tion was supported. We also found a modest number of
other-centric responders who were apparently unswayed
by the new attributional cue.
In all three studies, the other-centric responders were
willing to invest in behavior that was cognitively challenging. In similar research using a spatial perspective-taking
task (Schober, 1993; Tversky & Hard, 2009), it is assumed
that the increased cognitive effort is the result of mentally
rotating to the visual perspective of another. However,
without direct evidence to support this claim, it is unclear
if participants are in fact embodying the participant’s point
of view. The experimental task presented here allows us to
test this supposition. If participants are mentally rotating,
then the greater the offset of one’s own visual perspective
from that of a partner, the longer it will take to rotate to
the partner’s perspective. And indeed, this was found for
three positions that were offset by an equivalent and
increasing magnitude of distance. Other-centric responders were fastest when there was no rotation (shared perspective), but were significantly slower when shared
visual perspective was offset by 90°, and even slower when
shared visual perspective was offset by an additional 90°.
The extent by which these responses were slower also exceeded any slowdowns exhibited by egocentric
responders.
These findings suggest that other-centric participants
are anchoring to an egocentric frame of reference and making adjustments relative to this frame, a process Zacks and
Michelon (2005) calls a visuospatial perspective transformation. However, an egocentric visual anchor is not necessarily isomorphic to a socially-motivated egocentric bias. That
is, we cannot say that participants are first employing and
inhibiting an egocentric bias before considering the perspective of their partner. Our data suggest another possibility. During trials at 0 degrees of rotation, the
participant’s visual perspective is equivalent to their partner’s perspective. These shared visual perspective trials require no mental rotation; thus, if the other-centric
responder has been inhibiting an egocentric bias in making
a rotation, they are now free from having to do so. As a
consequence, response times should be at least equivalent
to the egocentric responder. But this is not the case. Othercentric responders are still significantly slower in shared
trials. If a strong egocentric bias is not at play, what then
causes the slower response times? We argue that the increased time results from an other-centric responder who
is still considering what his partner knows, even when visual perspective is aligned. Contrary to Herrmann (1988),
Herrmann, Burkle, and Nirmaier (1987) early assertions
(as reported by Schober (1993)), participants will indeed
‘‘step into the shoes’’ of their partner even with a shared
vantage point. Once a conceptual perspective has been
adopted, participants are consistent in the way they apply
this perspective in interpreting spatial referents. In a sense,
participants are still metaphorically ‘‘rotating,’’ even when
visual perspectives are aligned.
These findings also have direct implications for the current debate about exactly how partner-specific information is processed during communication. Our claims
coincide with Brennan and colleagues’ ‘‘one-bit’’ theory
that simple yet significant attributional cues can have an
immediate effect on behavioral outcomes (e.g., Brennan &
Hanna, 2009; Galati & Brennan, 2010), particularly when
the cues are easily tracked and introduced early on in the
communicative context (Bortfeld & Brennan, 1997; Hanna
& Tanenhaus, 2004; Tversky & Hard, 2009). Participants
commit wholesale to the elicited perspective (egocentric
for Study 3 and other-centric for Studies 1 and 2). Moreover, the use of attributional cues appears to be consistent
with Clark and Wilkes-Gibbs’ (1986) principle of least collaborative effort, whereby the motivation to consider another’s perspective is rooted in a shared responsibility to
ensure mutual understanding.
Our results can also be contrasted with another account
of perspective-taking behavior that posits language users
will consider another’s perspective only when a primary
egocentric preference becomes unsustainable (Horton &
Keysar, 1996; Pickering & Garrod, 2004; Shintel & Keysar,
2009). In this view, when there is a communication breakdown, or a breakdown is likely to occur, speakers (and presumably listeners) will engage in repair strategies that
require a more costly other-centric orientation. In the current study, we strove to create a scenario where such repair strategies would be unnecessary. Because a partner’s
trial-to-trial directions signaled no obvious confusion, participants could respond egocentrically without penalty or
communicative mishap. Furthermore, in responding egocentrically, the task could be completed faster – a primary
argument for why an egocentric response should be preferred. And given that the social partner only simulates
the intentions and presence of an actual person, such task
demands could be considered biased toward egocentric
responding (see Gerrig, Brennan, & Ohaeri, 2000 for discussion). Yet, despite these contextual influences that arguably favor an egocentric orientation, participants were
still willing to act on the attributional needs of an assumed
social agent, revealing high rates of other-centric responding in production, as shown by Tversky and Hard (2009),
and even here where comprehension is involved.
It also appears that when participants do act egocentrically, particularly when the attributional cues support an
egocentric perspective (Study 3), there is some evidence
that participants are still tracking information about their
partner. As we reported earlier, response times do increase
as a partner’s position is offset from 0° to 90° and 90° to
180°. Although this increase is far from the required time
to make a visuospatial transformation, it does suggest a
low-level awareness of conflicting perspectives. This possibility is weakly supported with the egocentric responders
in Studies 1 and 2. But because response times in these
studies increase from 0 to 90, and are maintained between
90 and 180, it is more likely that responders are merely
noticing the locational shift of the recorded speaker from
their own vantage point. Thus, an attentional distraction
leads to the slight increase of response time, which indicates, at the very least, that egocentric responders are engaged in the task.
Another notable finding is that the high rate of unsolicited other-centric responding is comparable to previous
findings (e.g., Galati & Brennan, 2010; Horton & Gerrig,
2002; Schober, 1993). Yet, in these studies, there are production- and task-based constraints that may make an
Please cite this article in press as: Duran, N. D., et al. Listeners invest in an assumed other’s perspective despite cognitive cost. Cognition
(2011), doi:10.1016/j.cognition.2011.06.009
N.D. Duran et al. / Cognition xxx (2011) xxx–xxx
other-centric response more likely. For example, participants might be told ahead of time to consider the perspective of an addressee, or to design utterances in the service
of conversational pressures (e.g., to establish common
ground), or be allowed to negotiate and receive feedback
from their partner. Of course, in the current research, we
took a different tack by evaluating perspective-taking
behavior as simple comprehension. As with Tversky and
Hard’s (2009) paradigm, we imposed the most minimal
of task requirements for eliciting perspective choice – at
least for Study 1. When comparing these similarly matched
studies, 53% of our participants and 20% of Tversky and
Hard’s participants freely chose to be other-centric. Importantly, this difference in response rate also brings up a critical distinction between the two studies. Here,
perspective-taking behavior was allowed to stabilize over
a series of trials. As in previous research involving a real
or simulated social partner, perspective choice is sometimes gradually worked out in the early moments of interaction (Bard & Aylett, 2001; Horton & Keysar, 1996; Roche,
Dale, & Kreuz, 2010). Indeed, in our Study 1, 22 other-centric participants, or 51% overall, had between 1 and 3
short-lived ‘‘egocentric’’ responses in the initial trials. If
we had assessed perspective-taking behavior after a single
trial, these respondents would have been misclassified as
egocentric.
There are other differences that make our particular
task promising for further exploration. One of these is
tracking the temporal dynamics of social perspective-taking. In the present studies, we evaluated simple computer
mouse movements which revealed that other-centric
responding takes more time to execute, and that the executed movement is characterized by greater directional
shifts and decelerations. These latter findings suggest the
presence of a processing interference that arises when
adopting another’s frame of reference. Presumably, a major
source of interference is from one’s own activated frame of
reference (Brown-Schmidt, 2009). Although multipleframe activation and the primacy of a single activation
has received a great deal of attention in ambiguous scene
interpretation (Carlson-Radvansky & Irwin, 1994;
Carlson-Radvansky & Logan, 1997), it is less clear with
social perspective-taking. One issue that we examined in
greater detail is whether reference frames are activated
sequentially or in parallel (Taylor et al., 1999). If in parallel,
we expected to find competition effects during the
moment-by-moment changes during the response movement itself. These competition effects can be expressed in
paradigms like ours where a competitor and response target are visually co-present (Song & Nakayama, 2009; Spivey & Dale, 2006). Here, we provided a simple measure
of how much the movement is displaced toward a competitor en route to a target. For other-centric responders, we
found that movements towards a folder designating an
other-centric frame were shifted towards a folder designating an egocentric frame. This result suggests both
frames were simultaneously activated, but with the
other-centric response eventually winning out.
Greater evidence for competition effects, as well as processing costs, can also be had with situational factors that
are easily implemented in this task. For example, we can
17
evaluate perspective-taking behavior when there are
greater cognitive demands, as in introducing time constraints to respond (as in Bard et al., 2007). We might also
find that with stronger attributional cues that favor an
other-centric perspective, egocentric responders will have
significant difficulty in inhibiting an other-centric frame
of reference, perhaps even from the earliest stages of processing. And importantly, this basic task allows us to systematically identify pragmatically relevant cues that
might sway the activation and selection of egocentric or
other-centric perspectives. It also opens up the possibility
of exploring any individual differences that might interact
with these factors.
On a final theoretical note, a further possible interpretation of our results is that so-called ‘‘default’’ processes are
only one part of the explanatory story of adaptive abilities
during interaction. As Schober (1993) demonstrated, the
coordinative process of interaction permits rich patterns
of adaptation that emerge when one looks for them in
appropriately interactive contexts (see also Brennan &
Hanna, 2009; Brown-Schmidt, Gunlogson, & Tanenhaus,
2008). Yet, we have shown that even simulated social contexts can induce complete investment in more ‘‘difficult’’
response strategies. This is not simply a matter of monitoring, because the 0-degree trials in our studies still show
that participants have embraced the strategy of trialby-trial transformation for the benefit of an absent but
assumed partner. So ‘‘defaults’’ are transformed in
socially-embedded contexts, and the ‘‘tables are turned,’’
so to speak, because many participants have now organized their interpretive tendencies outside themselves,
into the other-centric frame. Our data suggest that a new
default, adapted for the benefit of the task, can become
that external frame. This suggests that theories of perspective-taking, and partner-specific adaptation in general,
need to focus attention on stable strategies in more taskspecific ways, rather than seeking generalizations that
may overreach, and miss the natural human capacity to
adapt cognitive processes in novel circumstances.
Acknowledgments
The research was supported by NSF Grant BCS-0720322
to the second author, and NSF Grant BCS-0826825 to the
second and third authors.
References
Amorim, M. A. (2003). ‘‘What is my avatar seeing?’’ The coordination of
‘‘out-of-body’’ and ‘‘embodied’’ perspectives for scene recognition
across views. Visual Cognition, 10, 157–199.
Baayen, R. H. (2008). Analyzing linguistic data: A practical introduction to
statistics in R. New York: Cambridge University Press.
Baayen, R. H., Davidson, D. J., & Bates, D. M. (2008). Mixed-effects
modeling with crossed random effects for subjects and items. Journal
of Memory and Language, 59, 390–412.
Bard, E. G., Anderson, A. H., Chen, Y., Nicholson, H. B. M., Havard, C., &
Dalzel-Job, S. (2007). Let’s you do that: Sharing the cognitive burdens
of dialogue. Journal of Memory and Language, 57, 616–641.
Bard, E. G., & Aylett, M. P. (2001). Referential form, word duration, and
modeling the listener in spoken dialogue. In J. Moore & K. Stenning
(Eds.), Proceedings of the 23rd Annual Meeting of the Cognitive Science
Society. Austin, TX: Cognitive Science Society.
Please cite this article in press as: Duran, N. D., et al. Listeners invest in an assumed other’s perspective despite cognitive cost. Cognition
(2011), doi:10.1016/j.cognition.2011.06.009
18
N.D. Duran et al. / Cognition xxx (2011) xxx–xxx
Black, J. B., Turner, T. J., & Bower, G. H. (1979). Point of view in narrative
comprehension, memory, and production. Journal of Verbal Learning
and Verbal Behavior, 18, 187–198.
Bortfeld, H., & Brennan, S. E. (1997). Use and acquisition of idiomatic
expressions in referring by native and non-native speakers. Discourse
Processes, 23, 119–147.
Branigan, H. P., Pickering, M. J., Pearson, J., & McLean, J. F. (2010).
Linguistic alignment between people and computers. Journal of
Pragmatics, 42, 2355–2368.
Brennan, S.E. (1990). Speaking and providing evidence for mutual
understanding.
Unpublished
doctoral
dissertation,
Stanford
University, Stanford, CA.
Brennan, S. E., & Hanna, J. E. (2009). Partner-specific adaptation in dialog.
Topics in Cognitive Science, 1, 274–291.
Brown-Schmidt, S. (2009). Partner-specific interpretation of maintained
referential precedents during interactive dialog. Journal of Memory
and Language, 61, 171–190.
Brown-Schmidt, S., Gunlogson, C., & Tanenhaus, M. K. (2008). Addressees
distinguish shared from private information when interpreting
questions during interactive conversation. Cognition, 107, 1122–1134.
Brunyé, T. T., Rapp, D. N., & Taylor, H. A. (2008). Representational
flexibility and specificity following spatial descriptions of real-world
environments. Cognition, 108, 418–443.
Carlson, L. A. (1999). Selecting a reference frame. Spatial Cognition and
Computation, 1, 365–379.
Carlson, L. A., & Covey, E. S. (2005). How far is near? Inferring distance from
spatial descriptions. Language and Cognitive Processes, 20, 617–631.
Carlson-Radvansky, L. A., & Irwin, D. E. (1994). Reference frame activation
during spatial term assignment. Journal of Memory and Language, 33,
646–671.
Carlson-Radvansky, L. A., & Logan, G. D. (1997). The influence of reference
frame selection on spatial template construction. Journal of Memory
and Language, 37, 411–437.
Clark, H. H. (1996). Using language. Cambridge: Cambridge University
Press.
Clark, H. H., & Krych, M. A. (2004). Speaking while monitoring addresses
for understanding. Journal of Memory and Language, 50, 62–81.
Clark, H. H., & Schaefer, E. F. (1989). Contributing to discourse. Cognitive
Science, 13, 259–294.
Clark, H. H., & Wilkes-Gibbs, D. (1986). Referring as a collaborative
process. Cognition, 22, 1–39.
Dale, R., & Duran, N. D. (in press). The cognitive dynamics of negated
sentence verification. Cognitive Science.
Dale, R., Duran, N. D., & Morehead, R. (submitted for publication).
Prediction in statistical learning, and implications for the explicit/
implicit divide.
Epley, N., Keysar, B., Van Boven, L., & Gilovich, T. (2004). Perspective
taking as egocentric anchoring and adjustment. Journal of Personality
and Social Psychology, 87, 327–339.
Fitneva, S. A., & Song, Y. (2009). The comprehension of ‘‘left’’ and ‘‘right’’ in
a referential communication task. In N. A. Taatgen & H. van Rijn (Eds.),
Proceedings of the 31st Annual Conference of the Cognitive Science
Society (pp. 2687–2691). Austin, TX: Cognitive Science Society.
Frank, M. C., & Gibson, E. (2011). Overcoming memory limitations in rule
learning. Language, Learning, & Development, 7, 130–148.
Franklin, N., & Tversky, B. (1990). Searching imagined environments.
Journal of Experimental Psychology: General, 119, 63–76.
Fussell, S. R., & Krauss, R. M. (1992). Coordination of knowledge in
communication: Effects of speakers’ assumptions about what others
know. Journal of Personality and Social Psychology, 62, 378–391.
Galati, A., & Brennan, S. E. (2010). Attenuating information in spoken
communication: For the speaker, or for the addressee? Journal of
Memory and Language, 62, 35–51.
Gerrig, R. H., Brennan, S. E., & Ohaeri, J. O. (2000). What can we conclude
from speakers behaving badly? Discourse Processes, 29, 173–178.
Hanna, J. E., & Tanenhaus, M. K. (2004). Pragmatic effects on reference
resolution in a collaborative task: Evidence from eye movements.
Cognitive Science, 28, 105–115.
Herrmann, T. (1988). Partnerbezogene Objektlokalisation – ein neues
sprachpsychologisches Forschungsthema. Bericht Nr. 25, Arbeiten der
Forschergruppe ‘‘Sprechen und Sprachverstehen im sozialen
Kontext’’, Heidelberg/Mannheim.
Herrmann, T., Burkle, B., & Nirmaier, H. (1987). Zur horerbezogenen
Raumreferenz: Horerposition und Lokalisationsaufwand. Bericht Nr. 12,
Arbeiten der Forschergruppe ‘‘Sprechen und Sprachverstehen im
sozialen Kontext’’, Heidelberg/Mannheim.
Horton, W. S., & Gerrig, R. J. (2002). Speaker’s experiences and audience
design: Knowing when and knowing how to adjust utterances to
addressees. Journal of Memory and Language, 47, 589–606.
Horton, W. S., & Gerrig, R. J. (2005). The impact of memory demands on
audience design during language production. Cognition, 96, 127–142.
Horton, W., & Keysar, B. (1996). When do speakers take into account
common ground? Cognition, 59, 91–117.
Isaacs, E. A., & Clark, H. H. (1987). References in conversation between
experts and novices. Journal of Experimental Psychology: General, 116,
26–37.
Kittur, A., Chi, H., & Suh, B. (2008). Crowdsourcing user studies with
Mechanical Turk. ACM Press. CHI 2008.
Lee, P. U., & Tversky, B. (2005). Interplay between visual and spatial: The
effect of landmark descriptions on comprehension of route/survey
spatial descriptions. Spatial Cognition & Computation, 5, 163–185.
Levelt, W. J. (1989). Speaking: From intention to articulation. Cambridge,
MA: MIT Press.
Mainwaring, S. D., Tversky, B., Ohgishi, M., & Schiano, D. J. (2003).
Descriptions of simple spatial scenes in English and Japanese. Spatial
Cognition & Computation: An Interdisciplinary Journal, 3, 3–42.
doi:10.1207/S15427633SCC0301_2.
McNamara, T. P., Sluzenski, J., & Rump, B. (2008). Human spatial memory
and navigation. In J. Byrne
(Ed.). Learning and memory: A
Comprehensive reference (Vol. 2, pp. 157–178). Oxford: Elsevier.
Michelon, P., & Zacks, J. M. (2006). Two kinds of visual perspective taking.
Perception & Psychophysics, 68, 327–337.
Miller, G. A., & Johnson-Laird, P. N. (1976). Language and perception.
Cambridge: Cambridge University Press.
Mou, W., Liu, X., & McNamara, T. P. (2009). Layout geometry in encoding
and retrieval of spatial memory. Journal of Experimental Psychology:
Human Perception and Performance, 35, 83–93. doi:10.1037/00961523.35.1.83.
Munro, R., Bethard, S., Kuperman, V., Tzuyin Lai, V., Melnick, R., & Potts, C.,
et al. (2010). Crowdsourcing and language studies: the new
generation of linguistic data. In Proceedings of the NAACL HLT 2010
Workshop on Creating Speech and Language Data with Amazon’s
Mechanical Turk, Los Angeles, CA.
Newman-Norlund, S. E., Noordzij, M. L., Newman-Norlund, R. D., Volman,
I. A. C., de Ruiter, J. P., Hagoort, P., et al. (2009). Recipient design in
tacit communication. Cognition, 111, 46–54.
Perkins, L., & Milroy, L. (1997). Sharing the communicative burden: A
conversation-analytic account of aphasic/non-aphasic interaction.
Multilingua, 16, 199–215.
Pickering, M. J., & Garrod, S. (2004). Toward a mechanistic psychology of
dialogue. Behavioral and Brain Sciences, 27, 169–226.
Presson, C. C., & Montello, D. R. (1994). Updating after rotational and
translational body movements: Coordinate structure of perspective
space. Perception, 23, 1447–1455.
Quene, H., & van den Bergh, H. (2008). Examples of mixed-effects
modeling with crossed random effects and with binomial data.
Journal of Memory and Language, 59, 413–425.
Roche, J. M., Dale, R., & Kreuz, R. J. (2010). The avoidance of ambiguity
during conversation: More than mere priming or mimicry? In S.
Ohlsson & R. Catrambone (Eds.), Proceedings of the 32nd Annual
Meeting of the Cognitive Science Society. Austin, TX: Cognitive Science
Society.
Samson, D., Apperly, I. A., Braithwaite, J. J., Andrews, B. J., & Bodley Scott, S.
E. (2010). Seeing it their way: Evidence for rapid and involuntary
computation of what other people see. Journal of Experimental
Psychology: Human Perception and Performance, 36, 1255–1266.
Schober, M. F. (1993). Spatial perspective-taking in conversation.
Cognition, 47, 1–24.
Schober, M. F. (1995). Speakers, addressees, and frames of reference.
Whose effort is minimized in conversations about locations?
Discourse Processes, 20, 219–247.
Schober, M. F. (1998). Different kinds of conversational perspectivetaking. In S. R. Fussell & R. J. Kreuz (Eds.), Social and cognitive
psychological
approaches
to
interpersonal
communication
(pp. 145–174). Mahwah, NJ: Lawrence Erlbaum.
Schober, M. F. (2009). Spatial dialogue between partners with
mismatched abilities. In K. R. Coventry, T. Tenbrink, & J. A. Bateman
(Eds.). Spatial language and dialogue (Vol. 1, pp. 23–39). Oxford:
Oxford University Press.
Shepard, R., & Metzler, J. (1971). Mental rotation of three dimensional
objects. Science, 171, 701–703.
Shintel, H., & Keysar, B. (2009). Less is more: A minimalist account of joint
action in communication. Topics in Cognitive Science, 1, 260–273.
Snow, R., O’Connor, B., Jurafsky, D., & Ng, A. Y. (2008). Cheap and fast – But
is it good? Evaluating non-expert annotations for natural language tasks.
Presented at the proceedings of the conference on empirical methods in
natural language processing. Stroudsburg, PA: Association for
Computational Linguistics.
Please cite this article in press as: Duran, N. D., et al. Listeners invest in an assumed other’s perspective despite cognitive cost. Cognition
(2011), doi:10.1016/j.cognition.2011.06.009
N.D. Duran et al. / Cognition xxx (2011) xxx–xxx
Song, J. H., & Nakayama, K. (2009). Hidden cognitive states revealed in
choice reaching tasks. Trends in Cognitive Sciences, 13(8), 360–366.
Sorokin, A., & Forsyth, D. (2008). Utility data annotation with Amazon
Mechanical Turk. In Presented at the IEEE Computer Society Conference
on Computer Vision and Pattern Recognition Workshops.
Spivey, M. J., & Dale, R. (2006). Continuous dynamics in real-time
cognition. Current Directions in Psychological Science, 15(5), 207–211.
Taylor, H. A., Naylor, S. J., Faust, R. R., & Holcomb, P. J. (1999). ‘‘Could you
hand me those keys on the right?’’ Disentangling spatial reference
frames using different methodologies. Spatial Cognition and
Computation, 1, 381–397. doi:10.1023/A:1010035613419.
Taylor, H. A., & Tversky, B. (1992). Spatial mental models derived from
survey and route descriptions. Journal of Memory and Language, 31,
261–292. doi:10.1016/0749-596X(92)90014-O.
19
Taylor, H. A., & Tversky, B. (1996). Perspective in spatial descriptions.
Journal of Memory and Language, 35, 371–391.
Tversky, B., & Hard, B. M. (2009). Embodied and disembodied cognition:
Spatial perspective-taking. Cognition, 110, 124–129.
Tversky, B., Lee, P., & Mainwaring, S. D. (1999). Why do speakers mix
perspectives? Spatial Cognition and Computation, 1, 399–412.
Wilkes-Gibbs, D., & Clark, H. H. (1992). Co-ordinating beliefs in
conversation. Journal of Memory and Language, 31, 183–194.
Zacks, J. M., & Michelon, P. (2005). Transformations of visuospatial
images. Behavioral and Cognitive Neuroscience Reviews, 4, 96–118.
Zacks, J. M., Vettel, J. M., & Michelon, P. (2003). Imagined viewer and
object rotations dissociated with event-related fMRI. Journal of
Cognitive Neuroscience, 15, 1002–1018.
Please cite this article in press as: Duran, N. D., et al. Listeners invest in an assumed other’s perspective despite cognitive cost. Cognition
(2011), doi:10.1016/j.cognition.2011.06.009