Final Reflections: Situation Awareness Models and Measures

573911
2015
EDMXXX10.1177/1555343415573911Journal of Cognitive Engineering and Decision MakingFinal Reflections
Special Issue
Final Reflections: Situation Awareness Models
and Measures
Mica R. Endsley, United States Air Force
A response is provided to the many authors who
commented on my initial paper for this special issue
on situation awareness (SA) addressing remaining questions and observations on the Endsley 1995 model of
SA. A discussion on historical perspectives on cognitive engineering and SA is included, along with future
research needs for the construct. New models on sensemaking, distributed SA, and situated SA are also discussed, with remaining comments on their differences
with the Endsley 1995 model of SA. Finally, a short
discussion of SA measurement approaches is provided,
addressing key issues raised in the commentaries.
Keywords: situation awareness, metrics, models, team
situation awareness
My initial intent in writing the introductory
paper to this special issue was to clear up some
misconceptions regarding my 1995 model of
situation awareness (SA) (Endsley, 1995c) that
have appeared in the recent literature (Endsley,
2015). Hopefully I was able to accomplish this
goal for those who are interested in what I have
intended to represent with the model.
As an unexpected benefit, the paper has
inspired a much broader discussion on SA from a
wide range of authors. Hoffman (2015) and Flach
(2015) each provided a fascinating historical perspective on much of the psychology literature
and movements going back across the last
century (and then some). This foundation provides an interesting lens with which to
Address correspondence to Mica R. Endsley, United States
Air Force, Pentagon, 4E130, Washington, DC 20330,
[email protected].
Author(s) Note: The author(s) of this article are U.S.
government employees and created the article within the
scope of their employment. As a work of the U.S. federal
government, the content of the article is in the public domain.
Journal of Cognitive Engineering and Decision Making
2015, Volume 9, Number 1, March 2015, pp. 101­–111
DOI: 10.1177/1555343415573911
consider the types of problems we are now contemplating in this modern and complex world,
and the models we are using to describe them. As
Flach points out, perhaps this history has created
a lens that has led some to misunderstand the SA
model as I intended it. In his characterization, the
psychology community has pulled away from
strictly linear “information processing models”
that were developed in laboratory environments
with artificial stimuli, towards a “triadic semiotic
frame” that considers the role of context and the
adaptive dynamics between the cognitive agent
and the environment. And because of that perhaps some people have misinterpreted the model,
although I believe my model to be quite consistent with such a situated view.
It is worth noting that in addition to the psychology history described in their articles, the
human factors community is multidisciplinary,
composed of many researchers whose training is
in engineering rather than primarily psychology.
As such, we arrive with a different lens. At the
foundation of the industrial engineering discipline is work by Frederick Taylor (from roughly
1870 to 1910) analyzing the work processes of
machinists. This approach was extended by
Frank and Lillian Gilbreth in the early 1900s,
who perfected the science of work analysis in
studying brick layers and laborers in manufacturing and clerical contexts. The experiences of
the Tavistock institute with coal miners during
World War II expanded this work into sociotechnical systems, recognizing the interaction
between people, technology, and organizational
structures and processes. (Although the Tavistock researchers were psychologists, this work
was embraced more by business and industrial
engineers involved in organizational and work
design than by experimental psychologists who
then trended towards behaviorism.)
Although the majority of this research foundation was concerned with physical tasks and
Downloaded from edm.sagepub.com by Yemao Man on February 25, 2015
102March 2015 - Journal of Cognitive Engineering and Decision Making
behaviors (which characterized much of the
prevailing work environments of the day), rather
than cognitive tasks, it creates an entirely different approach to the study of human behavior
than that which Flach, Hoffman, and others rail
against. That is, we come from a perspective that
has always considered work in context and has
always studied primarily experts performing
their tasks within their real-world constraints,
mainly for the purpose of improving that work
from the standpoint of efficiency or safety. What
the field lacked was a way of incorporating cognitive work within that framework, for which we
came to the field of psychology. To most people
outside of psychology, the idea that behavior
should be studied in context is taken as a given,
leading to some perplexity at the idea that “situated cognition” or “naturalistic” studies are new.
So when I proposed a model to explain this
phenomenon of SA in my work with pilots and
the design of advanced fighter aircraft, I confess
that I took for granted that the construct occurred
within an ecologically valid environment with
expert pilots, within a dynamic, ongoing process
over long periods of time. Although others were
running around at that time complaining that “we
didn’t know what SA was,” I argued we had
much we could say about it, based on the rich
history of research on various cognitive mechanisms and constructs that made up my model.
Although I fully understand rejecting an isolated,
laboratory-based, stimulus-response model of
cognition, I chose not to throw out the baby with
the bath water, but rather to build something
more substantial from past work on cognition
and information processing—extending it to deal
with complex, cognitive work in the real world
of aviation where my research was based. The
point of the model was to provide a more detailed
understanding of SA via the interactions of these
cognitive mechanisms, the experiences, preconceptions and goals of the individuals, and the
characteristics of their tools, tasks, and environments, extending far beyond strictly data-driven
linear information processing models in many
significant ways that have already been elaborated. Although undoubtedly not complete in
representing all the details of these various factors, it is my belief that it provides a good starting
point for that discussion, subsequent research,
and our ultimate job of improving the tools and
workplaces we design for people.
Thus, the Endsley 1995 SA Model reflects significant detail on the cognitive mechanisms and
processes that are at play in creating the ongoing
mental representation of SA, and also the ways in
which characteristics of the tools, environment,
and systems affect those processes. Although I
certainly agree that our research and our models
need to reflect the ways in which people interact
with and are affected by the complex and dynamic
worlds in which they operate, I also believe we
need to realize that human cognition is also a relevant part of that ecological sociotechnical system and not one that can get easily dismissed or
ignored by only focusing on context.
In terms of historical background, Klein
(2015) points out that he discussed the importance of SA in his 1986 paper (Klein, Calderwood, & Clinton-Cirocco, 1986) on recognition
primed decision making (RPD) in fire fighters. I
referenced his RPD model in the Endsley 1995
SA model in describing the link between SA and
decision making within that context. However,
my formulation of decision making was primarily influenced by the seminal work of Stuart
Dreyfus (1981), who examined how experts
made decisions in a number of domains including business management, chess, and science.
He found strong evidence that people do not
typically go through a detailed analytic comparison of alternatives but rather recognize known
situational patterns that they match to memory
to arrive at what to do. Klein et al.’s 1986 study
on fire fighters primarily confirmed this finding
in a new domain. The more detailed model of
RPD that Klein and his colleagues later created
(Klein, 1993) goes much farther in describing
this phenomenon than the original Dreyfus
paper. However, neither author provided any
detail on what SA is or how people form SA
itself, which was the purpose of the 1995 SA
model and which I think is highly compatible
with and complementary to these works.
I endeavored to provide a comprehensive
account of a quite complex construct, one that
rests on many other cognitive mechanisms and
behaviors, and that is heavily embedded in the
challenges and messiness of complex environments. Although Klein (2015) argues that I may
Downloaded from edm.sagepub.com by Yemao Man on February 25, 2015
Final Reflections
103
have changed or morphed my model over time,
the quotes provided from the original 1995
paper, in some cases augmented by quotes from
a 2000 paper that was written to further explain
parts of the model, show that these fundamental
aspects of the model have remained quite consistent, and significantly predate the articles with
which I took issue. Far from “resisting change”
as leveled by Stanton, Salmon, and Walker
(2015), I welcome research that is directed at
extending our knowledge of SA and Team SA.
Although there is plenty of room in academic
discussions to jostle over aspects of theories to
try to determine what may more accurately represent the phenomenon under discussion, it is my
belief that such discussions are most fruitful
when they discuss the real differences between
such models, rather than a (perhaps unintentional) misrepresentation of them. Thus, I can
only hope that these misunderstandings have
been rectified by this effort at clarification.
Although van Winsen and Dekker (2015)
remain strangely obtuse in some paradigm in
which boxes and arrows are somehow bad (but
circles and arrows are not?), I believe this is ultimately an artificial difference. Ecological design
and Neisser’s model actually represent many of
the same features as my model (no matter how
they are arranged on the page) (see Endsley,
2000). As Minotra and Burns (2015) and Hancock and Diaz (2002) point out, ecological
design and the SA model actually have much in
common (see also Endsley, 2004)). Van Winsen
and Dekker’s more extreme position seems to be
that SA should not be studied because it distracts
people from focusing on the task environment
(the situation). However, this is ultimately not a
terribly useful position as it provides no tools for
effectively designing that environment for the
cognitive work we are trying to aid.
Finally, as Flach (2015) points out, representations matter, so I will further address the final
questions raised on the model here. The arrows
in the Endsley 1995 model from the Task/System Factors and Individual Factors show influence on the process. Although I do show SA,
decision, and action as separate stages, they
occur in a loop acting on the environment, with
the changing state of the environment in turn
effecting those cognitive stages, often quite
rapidly. That the levels of SA were nested, and
not shown in a line with arrows, is because they
are inter-related levels and not linear stages. The
model provides the details on how the internal
and external factors all interact in affecting those
processes over time to create an ever-evolving
state of SA.
Future Research
This issue has also generated some highly
useful discussion on future research directions.
Wickens (2015) cites the need for more research
on projection (Level 3 SA) with which I heartily agree. He also discusses the interesting roles
of long-term memory (LTM), working memory (WM), and long-term working memory
(LTWM) and how they relate to SA. In that
my model (and subsequent research findings)
supports an integrated model of LTM and WM
(per Cowan, 1988) and Durso and Gronlund
(1999) also calls out an integrated model (per
Ericsson & Kintsch, 1995), it would seem that
more research would be warranted here as well,
perhaps addressing the concerns of Chiappe,
Strybel, and Vu (2015) as people move from
being novices to experts.
Vidulich and Tsang (2015) discuss the need
to consider both SA and workload when developing potential operator decision aids, which I
certainly agree with. Minotra and Burns (2015)
call out the need for more research on metacognitive issues of self-regulation and self-awareness,
SA in unstructured environments, and the differences between incorrect SA and low SA (meaning being wrong versus being uninformed).
Such meta-awareness is certainly important. For
example, not only the level of SA but also the
degree of confidence in that SA can affect how
people choose to act (e.g., conservatively or
boldly) with negative consequences if these are
misaligned (Endsley & Jones, 2012). I also echo
their support for Klein’s work on macrocognition that has sought to discover more detail on
issues such as problem detection, uncertainty
and risk management, and coordination in
teams. Although such research may not be considered as specific to SA, our research has certainly detailed how they are often integrally
related with SA, and how in many cases their
representation is a component of SA (Endsley &
Downloaded from edm.sagepub.com by Yemao Man on February 25, 2015
104March 2015 - Journal of Cognitive Engineering and Decision Making
Jones, 2012). I am certain that far more can be
learned about these processes that would be of
import in understanding both SA and human
cognition in complex settings.
Equally importantly, we find that human factors work on SA has had a positive influence on
the design of systems and development of training programs (as outlined by Jones, 2015), and it
is an integral part of the way in which pilots (and
many others) think about their worlds, as discussed by Byrne (2015). As Byrne points out,
operators do not care about the theoretical arguments of academics but rather what we can do to
help them. To me the most beneficial outcome of
the past 25 years of research has been the ability
to contribute to systems designs in aviation, air
traffic control, military command and control,
intelligence operations, power grid operations,
transportation, mining, oil exploration, and myriad other complex operations. That work, by
myself and many, many others, I believe was
greatly aided by the fundamental understanding
provided by the model.
I certainly disagree with van Winsen and
Dekker (2015), who seem to believe that the
work of the human factors community to understand SA has somehow led the operational community astray with this crazy construct. On the
contrary, pilots invented SA—it was a part of
their vocabulary and conceptualization of their
world long before any of us got involved in trying to describe it, measure it, or design for it.
They would be talking about it (and lamenting
not having it), whether we were doing anything
to help them with it or not (see Byrne, 2015).
Van Winsen and Dekker’s further claim that
somehow SA researchers are to blame for the
conviction of Captain Lilgert in the sinking of a
passenger ferry in British Columbia, leading to
the death of two passengers, also does not hold
water. Capt Lilgert’s conviction was based on a
finding of gross negligence, stating that he failed
to properly operate and navigate the ferry in dereliction of duty, including failing to slow during
heavy weather, failing to have assistance on
deck, failing to follow standard practices and
regulations, failure to communicate with other
vessels, and being distracted from his duties on
deck due to a personal conversation with a shipmate with whom he had a romantic relationship
(Supreme Court of British Columbia, 2013).
Although we could probably engage in an analysis of the quality of the on-board systems and
whether there were extenuating circumstances
that contributed to the accident (although the
Judge did not find any), that the Judge stated that
“situational awareness … is key to proper navigation” within the 16-page finding is irrelevant
to the facts of his conviction. Van Winsen and
Dekker’s claim is spurious at best.
Other Models
Although the key intent of my paper was to
correct misunderstandings of the Endsley 1995
model, I also reviewed several newer models presented by their authors as a way of compensating
for the supposed weakness of the model they
criticized. Each of those authors had the appropriate opportunity to respond to my assessments
of their model, to which I’ll offer a final response.
Sensemaking
Klein (2015) continues to argue that the data/
frame model of sensemaking is different than
the Endsley 1995 SA model in that it “posits
two parallel and simultaneous activities, using
frames to define what counts as data, and using
data to select, modify and construct frames.” As
I’ve shown earlier, however, this is not different than the way in which schema and mental
models define what counts as relevant data (and
directs attention accordingly), or the way in
which data detected initially will trigger active
schema for further data gathering and interpretation (with anchoring and confirmation biases
potentially coming into play), as depicted in the
1995 SA model. The 1995 SA model also provides a detailed discussion of how these mental
models are created, modified, and selected
between. And contrary to Klein’s assertion, “the
formation of what counts as data in the first
place” is explicitly called out in the model:
•• “People are active participants in determining
which elements of the environment will become a
part of their (Level 1) SA by directing their attention based on goals and objectives and on the basis
of long-term and working memory” (Endsley,
1995c, p .41).
Downloaded from edm.sagepub.com by Yemao Man on February 25, 2015
Final Reflections
105
•• “Advanced knowledge of the characteristics,
form, and location of information, for instance,
can significantly facilitate the perception of information” (Endsley, 1995c, p. 42).
•• “Working memory also plays an important role,
allowing one to modify attention deployment on
the basis of other information perceived or active
goals” (Endsley, 1995c, p. 42).
•• “A well-developed mental model provides (a)
knowledge of the relevant elements of the system
that can be used in directing attention and classifying information in the perception process …”
(Endsley, 1995c, pp. 43-44).
•• “For example, a pilot may perceive several aircraft
(considered to be important elements per the mental model) recognized as enemy fighter jets (based
on critical cues) that are approaching in a particular spatial arrangement (forming Level 1 SA)”
(Endsley, 1995c, p. 44).
however, find that the sensemaking model has
provided any new extension or explanation of
this construct. “Reframing” the same processes
that have previously been described in the SA
model in a different terminology only contributes to the potential confusion of readers that
these are somehow different processes. Klein’s
statement that I differentiate the state of SA from
the processes involved in achieving SA does not
amount to a justification for his claim that the
Endsley 1995 SA model does not address processes, which it most certainly does and quite
extensively. I certainly welcome Klein’s elaboration in this area of human cognition, as he
always has excellent insights to offer. My comments on the sensemaking model are to show
where the two models are similar and where
they are (or are not) different to improve the
clarity of discussion on this topic.
I believe this addresses what Klein refers to as
“determining what counts as data in the first
place,” or “constructing data.” Recognizing that
some cue is relevant and important occurs on
the basis of the mental model. Without one, this
would be quite difficult.
Klein also states that the data/frame model
was only intended to cover deliberate cases and
not the more automatic cases also covered by SA
theory, which is quite true. Nor is there a requirement to from his perspective. However, the fact
that it does not, while the 1995 SA model does,
leaves it at a disadvantage I believe, in that so
much of SA (or sensemaking) in real-world settings occurs with a combination of both processes in play. In fact, I find that a key problem
is whether someone realizes that the situation
they are in is novel and does not readily match to
existing or active schema. This needs to prompt
them into a more deliberate, thoughtful process
whereby they use more fundamental knowledge
from their mental models to puzzle out alternate
explanations of the information they have, or to
collect additional data to explain some discrepancy. The segue from one form of processing to
the other is critical and can be error prone, as
Klein knows from his work on RPD, and is in
need of further research attention.
I certainly respect Klein’s extensive work in
RPD and macrocognition. I do not, in this case,
Distributed SA (DSA)
I have already discussed the numerous inaccuracies that Stanton et al. (2015) have used to
bolster their claims for why a new model of SA
is needed, and will not further detail those here.
Also I have shown that DSA provides far less
explanation of SA in teams than other previous models and research on team and shared
SA (Endsley & Jones, 2001; Endsley & Robertson, 2000; Gorman, Cooke, Pederson, Connor, & DeJoode, 2005; Prince & Salas, 1993;
Prince, Salas, & Stout, 1995). In reviewing their
response, it primarily focuses on issues of SA
measurement in teams, specifically on the use
of the Situation Awareness Global Assessment
Technique (SAGAT), which I will address in
more detail below.
Before doing so, however, I would like to
address their contention that the only (and best)
approach is to examine observable actions and
communications between team members and
their tools due to the fact that “given the complexity of the socio-technical systems which
form the subject of much contemporary analysis, the study of information processing in the
minds of individuals has lost relevance” and
“one cannot ever know completely what is going
on in people’s minds” (Stanton et al., 2015).
This viewpoint strikes me as moving into a very
Skinneristic model, one that has long since been
Downloaded from edm.sagepub.com by Yemao Man on February 25, 2015
106March 2015 - Journal of Cognitive Engineering and Decision Making
rejected in psychology, and against which significant progress has been made.
Although there is certainly information to be
gained from analyzing team processes, communications, and behaviors (and which has been
done for a long time by many researchers in the
SA area), there are also distinct limitations in
relying solely on such observations to understand
the SA of individuals or teams. Specifically, such
measures do not provide information on why an
individual may not have attended to certain information, and what assessments they make from
the information they do attend to in order to form
comprehensions and projections (unless they talk
about it, which is highly variable across individuals and situations). A key issue about SA is that it
deals with how people understand and form
meaning from the information they observe, and
this is highly lacking in such observational data
(Endsley, 1996; Endsley & Jones, 2012). So
although process observations and communications analysis can certainly be useful in analyzing the team processes underlying SA, by themselves they are limited for this application.
Situated SA
Chiappe et al.’s (2015) discussion of situated
SA is in many respects not as divergent from
the Endsley 1995 SA Model as it might initially
appear. Many of their arguments that reject “SA
in the head,” however, are unfortunately weak.
Change blindness, for example, can be explained
as an attentional failure that is present regardless
of which cognitive model of SA one uses. Their
argument that people do not hold a veridical visual
representation in memory is also entirely consistent with the Endsley 1995 SA Model, which only
holds that data needed by the active mental model
are retained. That is, people are only retaining
that data from the environment that is needed for
the safe operation of their system per their mental
model (e.g., to manage the traffic in their sector
as an air traffic controller or to fly the aircraft
as a pilot). The use of “external scaffolds” to aid
memory and assist in task performance is also
not a point of disagreement between us. These
“strings around the finger” are well ensconced
in many environments, including the tipping of
flight strips by air traffic controllers and the use of
speed bugs by pilots, which they mention.
Chiappe et al.’s research examples showing
strong WM dependence are unfortunately with
students, not experts. Novices have consistently
been found to have very low levels of SA (Endsley, 2006) and are believed to be WM limited in
the Endsley 1995 SA Model. They do not yet have
the mental models and schema that would allow
them to retain or process data effectively in these
complex domains and must therefore rely heavily
on external data sources, and ultimately have very
low SA (and the performance to match it). Other
examples are taken from nonoperational tasks—
for example, looking information up on a computer or retaining peripheral information such as
phone numbers or aircraft callsigns. I cannot
remember phone numbers either and am happy to
write them down. But I still need to know how fast
I am driving to avoid a ticket. These are very different things. The operational tasks where SA is
critical (flight, driving, military operations, power
grid operations, …) often involve significant time
constraints for decision making (no time to look
up the answer when you are trying to land an airplane or avoid getting shot in battle). The need for
a working mental representation of the situation is
real-time and demanding.
There is no disagreement that people will not
load up any more information in memory than
they need to. The real question is, How much do
they need to? And which information? And when?
In our work with expert air traffic controllers, for
example, I have found that they definitely do not
retain callsigns of aircraft (Endsley & Rodgers,
1996). I will call this peripheral data. Instead their
internal model is primarily a temporal-spatial one.
They know a lot about aircraft (e.g., location, type,
flight path, flight level, ascending/descending,
which aircraft are traffic for which other aircraft),
but their central mental representation anchoring
this information is a temporal-spatial one (which
is why we do not do SAGAT queries based on
aircraft callsigns, but rather referenced to a map
of the controller’s sector).
The active working situation model that controllers use to understand the flight situation and
project potential conflicts is composed of the
information that is necessary for decisionmaking for that task. Although they have certain
forms of automation (e.g., alerts that occur if
two aircraft are within 5 miles of each other),
Downloaded from edm.sagepub.com by Yemao Man on February 25, 2015
Final Reflections
107
that system is only a back-up. Instead they
actively project potential conflicts and generally
move aircraft far, far in advance of the prescribed limits. This is an active mental process.
As different aircraft need clearances to ascend or
descend for various reasons, they must rapidly
assess how that change will play out over time to
make all the necessary changes across the many
aircraft under their purview. Controllers do
annotate flight progress strips with certain data
about the aircraft, which they are required to, as
discussed by Chiappe et al., however they consider this a “house-keeping task” and often do it
far after the fact in periods of low workload. And
pilots will tell you that setting the speed bugs in
no way relieves them from needing to know aircraft airspeed, which is critical to their flight. In
fact, we find pilots can report airspeed accurately in excess of 80% and generally have very
good knowledge of their flaps and slats settings.
Aids can certainly assist SA, but they do not
substitute for it.
When Chiappe et al. state that what is needed
is a “minimal internal representation,” I don’t
disagree. The key is in defining what that minimum representation is for these complex tasks.
As discussed above, it involves a lot of detailed
understanding of the changing physical dynamics of systems and environments that are needed
in real-time for operational performance. Irrelevant information (your teammate’s shoe size) is
not needed, nor is peripheral information (e.g.,
callsigns and phone numbers), which I will
define as data that are a part of the task, but not
central to decision making, and which are looked
up as needed. But the relatively poor performance of novice and student pilots, air traffic
controllers, and soldiers can be directly linked to
a poor ability to derive this minimal internal representation of critical situational information in
real-time dynamic worlds (Endsley, 2006). We
have addressed the question of what is needed
for SA (i.e., that information that is needed for
decision making) with detailed Goal Directed
Task Analyses for each domain and role. These
derive the goal-relevant, decision-needed information that is important for SA and that forms a
basis for measurement, design, and training
interventions (Endsley, 1993b; Endsley & Jones,
2012; Endsley & Rodgers, 1994).
This internal mental representation is also
important for creating comprehension and projection. It’s hard to keep those accurate if one
does not access the dynamically changing situation information and integrate it with other data
that generally must occur mentally. In early
work on SA, I wondered if the low-level data
were jettisoned once the comprehension and
projections were formed. We found that, contrary to that hypothesis, Level 1 SA data (e.g.,
airspeed, altitude, etc.) were reportable via
SAGAT (showing they were held mentally) and
are retained as far as 5 to 6 minutes after a
SAGAT freeze with no loss of accuracy (Endsley, 1990, 1995a). These data appear to be as
much of the mental situational model as any
derived assessments of comprehension and projection are. What is important to remember here
is that people in these environments are not just
looking at and memorizing information. Rather,
they are constantly updating a mental representation, setting up detailed scan patterns in many
cases, to make sure they are gathering the needed
data and keeping the model fresh to support
active decision tasks. Artificial tasks with student subjects are very unlikely to duplicate such
processes.
The other central question that underlies Chiappe et al.’s discussion is, does what is “out
there” count as SA? In most cases, we find that
people must actively manipulate their environments (e.g., displays that are in view, radar settings and directions, communications with others) to gather their SA. They are not passive
recipients of data from the displays in front of
them but are rather active participants in gathering the information they need (Endsley, 1995c).
Failures in this information-gathering process,
whether due to training, task loading, equipment
limitations, or poor display design, lead to lower
SA. And although transactive memory is certainly useful in distributed team settings, ultimately people still need to know the information
needed for their own job (Endsley, 1995c). It is
not sufficient that the copilot knows that the
plane is below the required minimum safe altitude if the pilot flying the plane does not.
For this reason, I ultimately believe we cannot assess SA just from what is “out there” but
rather at the point that people have successfully
Downloaded from edm.sagepub.com by Yemao Man on February 25, 2015
108March 2015 - Journal of Cognitive Engineering and Decision Making
accessed it to update their internal representation on which their ultimate performance will
depend. Chiappe et al. finally concede that this
must occur but want to “give credit” for being
able to look up information when needed. I think
fundamentally there is no easy way to assess
how good those processes, behaviors, and system designs are for aiding SA without taking
into account whether they have completed the
process of updating their mental representation
with the needed information.
SA Measurement
Much of the disagreement on SA models that
has been presented ultimately has boiled down
to a disagreement on the best way to measure
SA, which is actually somewhat peripheral
to the theoretical model discussions. A more
detailed comparison of the substantial literature
base on SA measures must be reserved for a
separate discussion; however, I will address
some of the comments that have been made in
this special issue.
Stanton et al. (2015) report that they found
SAGAT created unexpected results in a study of
command and control decision aids. It should be
noted, however, that they referred to a study that
actually examined an artificial task based on the
game of chess in which they tried to have distributed teams of nonexperts do mission planning. However, it is widely known that chess
novices have very poor mental representations
of the game, whereas chess experts have much
more detailed mental representations (Chase &
Simon, 1973). I find their counterintuitive results
probably have far more to do with the artificial
tasks these nonexperts were asked to perform
than with the measurement technique itself. We
have used SAGAT quite successfully, on the
other hand, with expert soldiers performing
actual mission planning and command and control tasks in numerous studies (Bolstad & Endsley, 2003; Strater, Faulkner, Hyatt, & Endsley,
2006).
Other concerns were directed at the SAGAT
technique in general, which uses short freezes in
a simulation to provide queries about SA, questioning whether it relies too heavily on memory
(Chiappe, Rorie, Moran, & Vu, 2012; Durso
et al., 1998; Salmon, Stanton, & Young, 2011;
Sarter & Woods, 1991), suggesting that the Situation Present Assessment Method (SPAM),
which asks questions in real time (Durso et al.,
1998), may be a better method (Chiappe et al.,
2015; Vidulich & Tsang, 2015). Although we
can debate various aspects of these approaches,
the fact is there is quite a bit of data that help to
address these questions.
SAGAT was designed for use in simulation
settings. SPAM has the advantage of being
administrable in many real-world settings where
freezing is not possible. However, it generally
allows for far fewer queries to be provided
within the same test. During a SAGAT freeze,
many more questions can be asked and answered,
as compared to SPAM, which can only ask one
question at a time, as it must occur concurrently
with task performance. This leads to significantly lower sensitivity to SA differences for
SPAM as compared to SAGAT, as has been
demonstrated in numerous studies (Endsley,
Sollenberger, & Stein, 2000; Jones & Endsley,
2004; Pierce, 2012).
SPAM and SAGAT differ in two main
respects. SAGAT pauses the simulation and
blanks the displays while queries are administered, whereas SPAM leaves them present and
does not pause the simulation. One of the most
detailed comparisons of these two features of the
techniques was conducted by Morgan, Chiappe,
Kraut, Strybel, and Vu (2012), who did a detailed
comparison of Visible/Not Visible and Paused/
Not Paused in an ATC simulation with student
subjects. Not surprisingly, SA queries were more
accurate (by 17%) when the information
remained visible. Reaction time (RT) was slower
when the information remained visible, however,
by almost 2 s, revealing a significant speed/
accuracy tradeoff. In general, people will look to
check before answering queries if the information is available. This is consistent with Jones
and Endsley (2004), who also found a speed/
accuracy tradeoff for SPAM real-time probe
measures. This strategy works for questions of a
specific nature (e.g., Will any conflicts be present
for AAL123?), but not as well for their more general questions (e.g., Will any conflicts be present
if no action is taken?), which were answered 1
second faster, also pointing toward the necessity
of internal memory representations.
Downloaded from edm.sagepub.com by Yemao Man on February 25, 2015
Final Reflections
109
The concern of whether SAGAT is too memory dependent seems to be largely unsupported.
When working with experts on realistic tasks,
they have access to mental models and schema
that support detailed internal representations of
key information needed for task performance
(Endsley, 1990, 1995a). Although people will
look up data if available (as shown with SPAM),
there is no indication that this provides a more
valid or reliable assessment of SA than does
SAGAT.
In fact, SPAM may interfere with concurrent
task performance due to the need to multitask.
Although numerous studies have not found any
interference effect of SAGAT on task performance (Bolstad & Endsley, 1990; Endsley,
1995a, 1995b; Hogg, Torralba, & Volden, 1993),
Pierce (2012) found that participants correctly
handled fewer aircraft during the SPAM trials
(approximately 6%), as compared to those without, and they interacted with aircraft significantly less frequently in the time period following a probe. Other studies have found an indication of higher workload in trials involving SPAM
(Endsley et al., 2000; Jones & Endsley, 2004;
Pierce, 2012).
In summary, claims of advantages of SPAM as
compared to SAGAT do not appear to be warranted by the available research. Due to the freeze,
SAGAT provides a much richer and more comprehensive assessment of SA by virtue of being
able to ask more queries than SPAM in a similar
length of time, thus providing greater reliability
and sensitivity. And there is no indication that
SPAM overcomes any memory reliance associated with SAGAT due to the availability of LTWM
by experts. Although people will often look up
information if available, the speed-accuracy tradeoffs associated with this do not result in any superior assessment of their SA. However, in situations
where freezing the action is not possible, SPAM
may provide a useful approach. Although it may
pose some intrusiveness to concurrent task performance, this may be warranted in order to obtain
useful data on SA in many cases.
Some measures combine aspects of workload
and SA into one metric, for example the Situation Awareness Rating Technique (SART)
(Taylor, 1990) and SPAM (Durso et al., 1998),
however, given the often independent nature of
these two constructs in less than overload situations (Endsley, 1993a; Vidulich & Tsang, 2015),
I feel it is important to keep these two separate.
It is quite easy to measure SA and workload concurrently, using specific measures for each,
which we have done in many of our studies.
Conclusions
Overall, I found this Special Issue to provide
a useful discussion of SA theory and measurement. I hope that it serves as a useful course
correction for those doing research in the field
and laid to rest any existing misunderstandings
about the Endsley 1995 SA Model. I am particularly thankful to the many researchers who
participated in this Special Issue and the many
kind things they had to say about the Endsley
1995 SA model. I am also indebted to those who
have offered their criticisms, as it has provided
the opportunity for a useful discussion.
A substantial body of research has been generated over the past 25 years on SA. Much of
this has confirmed or expanded upon the Endsley 1995 SA Model, significantly growing our
understanding of this important construct. More
importantly, that model has been used to drive a
wide body of work to develop extensive guidelines for improved training and system designs
to support SA (Endsley & Jones, 2012) that
have been used to create significantly improved
workstations in a wide range of aviation, command and control, military, intelligence, power
systems, healthcare, and the oil and gas industries. In my dealings with operators, pilots, physicians, controllers, and soldiers over this time
period, I have been overwhelmed with their
strong stated desire for systems to help support
their need for SA and their incredibly positive
response to the work we are doing in this field.
The corpus of theoretical and experimental
work on SA, and its associated metrics, provides a strong foundation for continuing to
address the real needs of people trying to
accomplish difficult jobs in complex and
demanding settings.
References
Bolstad, C. A., & Endsley, M. R. (1990). Single versus dual scale
range display investigation (NOR DOC 90-90). Hawthorne,
CA: Northrop Corporation.
Downloaded from edm.sagepub.com by Yemao Man on February 25, 2015
110March 2015 - Journal of Cognitive Engineering and Decision Making
Bolstad, C. A., & Endsley, M. R. (2003). Measuring shared and
team situation awareness in the army’s future objective force.
Paper presented at the Human Factors and Ergonomics Society
47th Annual Meeting, Santa Monica, CA.
Byrne, E. (2015). Commentary on Endsley’s “situation awareness
misconceptions and misunderstandings.” Journal of Cognitive
Engineering and Decision Making.
Chase, W. G., & Simon, H. A. (1973). Perceptions in chess. Cognitive Psychology, 4, 55–81.
Chiappe, D. L., Rorie, R. C., Moran, C. A., & Vu, K. P. L. (2012). A
situated approach to the acquisition of shared SA in team contexts. Theoretical Issues in Ergonomics Science, iFirst, 1–19.
Chiappe, D. L., Strybel, T. Z., & Vu, K. P. L. (2015). A situated
approach to the understanding of dynamic situations. Journal
of Cognitive Engineering and Decision Making, 9(1), 33–43.
Cowan, N. (1988). Evolving conceptions of memory storage,
selective attention, and their mutual constraints within the
human information processing system. Psychological Bulletin,
104(2), 163–191.
Dreyfus, S. E. (1981). Formal models vs. human situational understanding: Inherent limitations on the modeling of business
expertise (ORC 81-3). Berkeley: Operations Research Center,
University of California.
Durso, F. T., & Gronlund, S. D. (1999). Situation awareness. In
F. T. Durso, R. Nickerson, R. Schvaneveldt, S. Dumais,
S. Lindsay, & M. Chi (Eds.), Handbook of applied cognition
(pp. 284–314). New York: Wiley.
Durso, F. T., Hackworth, C. A., Truitt, T. R., Crutchfield, J.,
Nikolic, D., & Manning, C. A. (1998). Situation awareness as a
predictor of performance for en route air traffic controllers. Air
Traffic Control Quarterly, 6(1), 1–20.
Endsley, M. R. (1990). A methodology for the objective measurement of situation awareness. In Situational awareness in aerospace operations (AGARD-CP-478) (pp. 1/1–1/9). Neuilly Sur
Seine, France: NATO-AGARD.
Endsley, M. R. (1993a). Situation awareness and workload: Flip
sides of the same coin. Paper presented at the Seventh International Symposium on Aviation Psychology, Columbus, OH.
Endsley, M. R. (1993b). A survey of situation awareness requirements in air-to-air combat fighters. International Journal of
Aviation Psychology, 3(2), 157–168.
Endsley, M. R. (1995a). Direct measurement of situation awareness in simulations of dynamic systems: Validity and use of
SAGAT. In D. J. Garland & M. R. Endsley (Eds.), Experimental analysis and measurement of situation awareness (pp.
107–113). Daytona Beach, FL: Embry-Riddle University.
Endsley, M. R. (1995b). Measurement of situation awareness in
dynamic systems. Human Factors, 37(1), 65–84.
Endsley, M. R. (1995c). Toward a theory of situation awareness in
dynamic systems. Human Factors, 37(1), 32–64.
Endsley, M. R. (1996). Situation awareness measurement in test
and evaluation. In T. G. O’Brien & S. G. Charlton (Eds.),
Handbook of human factors testing & evaluation (pp. 159–
180). Mahwah, NJ: Lawrence Erlbaum.
Endsley, M. R. (2000). Theoretical underpinnings of situation
awareness: A critical review. In M. R. Endsley & D. J. Garland
(Eds.), Situation awareness analysis and measurement (pp.
3–32). Mahwah, NJ: LEA.
Endsley, M. R. (2004). Situation awareness: Progress and
directions. In S. Banbury & S. Tremblay (Eds.), A cognitive approach to situation awareness: Theory, measurement
and application (pp. 317–341). Aldershot, UK: Ashgate
Publishing.
Endsley, M. R. (2006). Expertise and situation awareness. In K. A.
Ericsson, N. Charness, P. J. Feltovich, & R. R. Hoffman (Eds.),
The Cambridge handbook of expertise and expert performance
(pp. 633–651). New York: Cambridge University Press.
Endsley, M. R. (2015). Situation awareness misconceptions and
misunderstandings. Journal of Cognitive Engineering and
Decision Making, 9(1), 4–32.
Endsley, M. R., & Jones, D. G. (2012). Designing for situation
awareness: An approach to human-centered design (2nd ed.).
London: Taylor & Francis.
Endsley, M. R., & Jones, W. M. (2001). A model of inter- and
intrateam situation awareness: Implications for design, training and measurement. In M. McNeese, E. Salas, & M. Endsley
(Eds.), New trends in cooperative activities: Understanding
system dynamics in complex environments (pp. 46–67). Santa
Monica, CA: Human Factors and Ergonomics Society.
Endsley, M. R., & Robertson, M. M. (2000). Situation awareness in
aircraft maintenance teams. International Journal of Industrial
Ergonomics, 26, 301–325.
Endsley, M. R., & Rodgers, M. D. (1994). Situation awareness
information requirements for en route air traffic control (DOT/
FAA/AM-94/27). Washington, DC: Federal Aviation Administration Office of Aviation Medicine.
Endsley, M. R., & Rodgers, M. D. (1996, September). Attention
distribution and situation awareness in air traffic control.
Paper presented at the 40th Annual Meeting of the Human Factors and Ergonomics Society, Santa Monica, CA.
Endsley, M. R., Sollenberger, R., & Stein, E. (2000). Situation
awareness: A comparison of measures. Paper presented at the
Human Performance, Situation Awareness and Automation:
User-Centered Design for the New Millennium, Savannah,
GA.
Ericsson, K. A., & Kintsch, W. (1995). Long term working memory. Psychological Review, 102, 211–245.
Flach, J. (2015). Situation awareness: Context matters! Journal of
Cognitive Engineering and Decision Making, 9(1), 59–72.
Gorman, J. C., Cooke, N. J., Pederson, H. K., Connor, O. O., &
DeJoode, J. A. (2005). Coordinated awareness of situation by
teams (CAST): Measuring team situation awareness of a communications glitch. Paper presented at the Human Factors and
Ergonomics Society 49th Annual Meeting, Santa Monica, CA.
Hancock, P. A., & Diaz, D. D. (2002). Ergonomics as a foundation
for a science of purpose. Theoretical Issues in Ergonomics Science, 3(2), 115–123.
Hoffman, R. R. (2015). Origins of situation awareness: Cautionary
tales from the history of concepts of attention. Journal of Cognitive Engineering and Decision Making, 9(1), 73–83.
Hogg, D. N., Torralba, B., & Volden, F. S. (1993). A situation
awareness methodology for the evaluation of process control
systems: Studies of feasibility and the implication of use (199303-05). Storefjell, Norway: OECD Halden Reactor Project.
Jones, D. G. (2015) A practical perspective on the utility of situation awareness. Journal of Cognitive Engineering and Decision Making, 9(1), 98–100.
Jones, D. G., & Endsley, M. R. (2004). Using real time probes for
measuring situation awareness. International Journal of Aviation Psychology, 14(4), 343–367.
Klein, G. A. (1993). A recognition primed decision (RPD) model
of rapid decision making. In G. A. Klein, J. Orasanu, R. Calderwood, & C. E. Zsambok (Eds.), Decision making in action:
Models and methods (pp. 138–147). Norwood, NJ: Ablex.
Klein, G. A. (2015). Whose fallacies? Journal of Cognitive Engineering and Decision Making, 9(1), 55–58.
Downloaded from edm.sagepub.com by Yemao Man on February 25, 2015
Final Reflections
111
Klein, G. A., Calderwood, R., & Clinton-Cirocco, A. (1986). Rapid
decision making on the fire ground. Paper presented at the
Human Factors Society 30th Annual Meeting, Santa Monica,
CA.
Minotra, D., & Burns, C. M. (2015). Finding common ground:
Situation awareness and cognitive work analysis. Journal of
Cognitive Engineering and Decision Making, 9(1), 87–89.
Morgan, C. A., Chiappe, D. L., Kraut, J., Strybel, T. Z., & Vu, K. P. L.
(2012). An investigation into the potential sources of interference in SA probe techniques. Paper presented at the Applied
Ergonomics Conference, San Francisco, CA.
Pierce, R. S. (2012). The effect of SPAM administration during a
dynamic simulation. Human Factors, 54(5), 838–848.
Prince, C., & Salas, E. (1993). Training and research for teamwork
in the military aircrew. In E. Wiener, B. Kannki, & R. Helmreich (Eds.), Cockpit resource management (pp. 337–365). San
Diego, CA: Academic Press.
Prince, C., Salas, E., & Stout, R. (1995). Situation awareness:
Team measures, training methods. In D. J. Garland & M. R.
Endsley (Eds.), Experimental analysis and measurement of
situation awareness. Daytona Beach, FL: Embry-Riddle University Press.
Salmon, P. M., Stanton, N. A., & Young, K. L. (2011). Situation
awareness on the road: Review, theoretical and methodological
issues and future directions. Theoretical Issues in Ergonomics
Science, 13(4), 472–492.
Sarter, N. B., & Woods, D. D. (1991). Situation awareness: A critical but ill-defined phenomenon. The International Journal of
Aviation Psychology, 1(1), 45–57.
Stanton, N. A., Salmon, P. M., & Walker, G. H. (2015). Let the
reader decide: A paradigm shift for situation awareness in
socio-technical systems. Journal of Cognitive Engineering and
Decision Making, 9(1), 44–50.
Strater, L. D., Faulkner, L. A., Hyatt, J. R., & Endsley, M. R.
(2006). Supporting situation awareness under data overload
in command and control visualizations. Paper presented at the
International Ergonomics Association 2006 Congress, Maastricht, the Netherlands.
Supreme Court of British Columbia. (2013). Regina v. KarlHeinz Arthur Lilgert. Vancouver, British Columbia, Canada:
Supreme Court of British Columbia.
Taylor, R. M. (1990). Situational awareness rating technique
(SART): The development of a tool for aircrew systems design.
In Situational awareness in aerospace operations (AGARDCP-478) (pp. 3/1–3/17). Neuilly Sur Seine, France: NATOAGARD.
van Winsen, R., & Dekker, S. W. A. (2015). SA Anno 1995: A commitment to the 17th century. Journal of Cognitive Engineering
and Decision Making, 9(1), 51–54.
Vidulich, M. A., & Tsang, P. S. (2015). The confluence of situation
awareness and mental workload for adaptable human-machine
systems. Journal of Cognitive Engineering and Decision Making,
9(1), 95–97.
Wickens, C. D. (2015). Situation awareness: Its applications value
and its fuzzy dichotomies. Journal of Cognitive Engineering
and Decision Making, 9(1), 90–94.
Mica R. Endsley is the Chief Scientist of the United
States Air Force. Prior to assuming this position, she
served as President and CEO of SA Technologies, and
served on the faculty at Texas Tech University and the
Massachusetts Institute of Technology. She received
her PhD from the University of Southern California in
Industrial and Systems Engineering.
Downloaded from edm.sagepub.com by Yemao Man on February 25, 2015