Trait and state predictors of error detection accuracy in a simulated

Personality and Individual Differences 39 (2005) 47–60
www.elsevier.com/locate/paid
Trait and state predictors of error detection accuracy in a
simulated quality control task
Kraig L. Schell *, Abbie Woodruff, G. Brandon Corbin, Ellen C. Melton
Department of Psychology, Angelo State University, P.O. Box 10907, ASU Station, San Angelo, TX 76909, United States
Received 14 April 2004; received in revised form 11 October 2004; accepted 7 December 2004
Available online 26 January 2005
Abstract
This study examined whether personality traits or mood and workload states are more predictive of
detection accuracy in a simulated quality control task mimicking prescription checking in a pharmacy.
Ninety-one undergraduate volunteers checked 80 simulated scripts for accuracy in a basic task environment
with error rates ranging from 26% to 38%. Four dimensions of perceived workload and three task-related
mood factors were assessed multiple times during the task, and several theoretically relevant personality
traits were measured before the simulation. Performance was measured using hit rate and false alarm ratios.
Results suggested that trait factors were more predictive of hit ratios in this study than were measures of
task-related mood and perceived workload. Implications and future research directions are discussed.
Ó 2004 Elsevier Ltd. All rights reserved.
Keywords: Pharmacy; Quality control; Traits; Performance; Error detection; States
1. Introduction
The state versus trait issue is an enduring one in psychology, paralleling in many ways basic
arguments such as the nature/nurture debate. In the context of quality control, the relative influence of traits and states on human performance could lead to better interventions designed to
*
Corresponding author. Tel.: +1 342 942 2069/243.
E-mail address: [email protected] (K.L. Schell).
0191-8869/$ - see front matter Ó 2004 Elsevier Ltd. All rights reserved.
doi:10.1016/j.paid.2004.12.003
48
K.L. Schell et al. / Personality and Individual Differences 39 (2005) 47–60
improve it (Grasha & OÕNeill, 1996, 1998). This could be useful in the pharmacy, where a misfilled
prescription could cause severe injury to a patient. However, pharmacy has been slow to recognize
the importance of the pharmacist and/or technician in the error control process, viewing them
implicitly as automatons and focusing attention instead on technological and workflow issues
(Cohen, 1999). In response to this, we have studied quality control tasks in both experimental
and field settings, outlining some potential factors that are relevant to detecting errors in a
real-world detection task (Grasha, Reilley, Schell, Tranum, & Filburn, 2000; Schell, Reilley, Grasha, & Tranum, 2003; Schell, Woodruff, Corbin, Bilsing, & Melton, 2002). It is an explicit goal of
our research to understand how individual differences affect pharmacy error control so that
improvement methods can be designed with those differences in mind.
In this research, we examined the relative contribution of task-related state factors and personality trait measures to error detection in a simulated prescription checking task. Our simulation
task has consistently demonstrated error rates and work speeds commensurate with those observed in pharmacies, despite using pharmacy-naı̈ve individuals in our studies (Grasha & Schell,
1999; Grasha et al., 2000). In the simulation, participants verify for correctness multiple sets of
‘‘scripts’’ and input their findings into a customized computer database. Since the scripts represent
common items such as beads and paper clips and no specialized medical knowledge is required,
these participants can learn the task quickly and perform effectively in most cases.
State and trait factors were included in this study based on previous literature suggesting a relationship to performance on these kinds of tasks. These factors are listed in Table 1. First, measures of anxiety such as trait anxiety, Neuroticism, social anxiety and inhibition motivation
have been related to performance in the simulated pharmacy environment (Grasha & Schell,
1999, 2001; Schell & Grasha, 2000; Schell et al., 2002), justifying further study. Second, research
shows that Type A tendencies were positively related to error detection accuracy (Grasha &
Schell, 1999; Schell et al., 2002), even though the structure of the task is not the stereotypical
one that such a person might choose. Third, extraversion was included because previous research
Table 1
Variables included in the proposed trait and state models of quality control performance
Dependent measure: overall hit ratio
Model T—Trait model
Model S—State model
Predictors
Trait anxiety
Inhibition motivation
Social anxiety
Neuroticism
Extraversion
Type A Trait
Locus of control
Predictors (workload)
Mental demand
Temporal demand
Effort
Frustration
Note: ParticipantsÕ false alarm ratios will be used in both models as a predictor.
Predictors (mood)
Engagement
Tension
Hedonic tone
Anger
Task-related motivation
K.L. Schell et al. / Personality and Individual Differences 39 (2005) 47–60
49
has linked it to work performance in other domains (Matthews & Amelang, 1993; Matthews, Davies, & Holley, 1990). Finally, locus of control was included since control beliefs are important in
moderating anxiety and increasing motivation (e.g., Shapiro, Schwartz, & Astin, 1996) and anxiety and motivation have been implicated as predictors of performance in our task environment
before (Schell & Grasha, 2000; Schell et al., 2002).
Task-related mood and perceived workload are important state factors because of their extensive use in the performance literature (Braarud, 2001; Hancock & Meshkati, 1988; Matthews, Davies, Westerman, & Stammers, 2000; Nygren, 1991). To capture changes in these states over time,
measurements were taken on multiple occasions as participants worked. Measures of energy,
tense arousal and hedonic tone were measured as task-related mood constructs. These factors
have been related to performance in a number of task situations, including driving (Matthews
& Desmond, 2001), mail sorting (Matthews, Jones, & Chamberlain, 1992), and our simulation
(Tranum, Matthews, & Grasha, 2001). Perceived workload has also been connected to human
performance in many ways (Becker, Warm, Dember, & Hancock, 1995; Bunce & Sisa, 2002; Carayon-Sainfort, 1992; Prinzel & Freeman, 1997; Smith & Bourke, 1992). Specific dimensions of perceived workload also have been connected to accurate error detection using our simulation
(Grasha & Schell, 2001; Reilley, Grasha, & Schafer, 2002; Schell & Grasha, 2000), supporting
their inclusion.
The measurement of performance in the pharmacy simulation task draws from signal detection
theory (e.g., Davies & Parasuraman, 1982), since the basic skill is the detection and overt identification of errors (targets) amongst correct orders (noise). We used hit ratios as our measure of
accuracy, and false alarm ratios as our measure of response bias. Further, we used false alarm ratios as predictors of hit ratios in this study instead of calculating d-prime and beta statistics for
three reasons. First, the nature of the problem in this study (medication errors) demanded that
hit ratios were more important than false alarm ratios, due to differences in their potential
real-world consequences. Second, we were interested more in group-level analysis of performance
ability on this task rather than any comparison of individuals. That is, our intent was not to determine between two people who was a better performer of the task, but instead to understand how
certain factors were generally related to human performance in this domain. Third, assumptions
of d-prime include normally distributed signal and noise distributions as well as consistent signal
strengths for targets and non-targets (Davies & Parasuraman, 1982; Wickens, 1992). Because the
stimuli in this task were so complex, signal strength was impossible to hold constant, and so the
signal and noise distributions were probably not normal. So, d-prime and beta, although technically calculable in this study, were not used.
Our predictions were based on previous work employing the quality control simulation and theoretical arguments concerning the connection between personality traits, states and performance
in work contexts (Barrick & Mount, 1991; Johnson, Schneider, & Oswald, 1997; Mount & Barrick, 1998). Our work with this particular task suggests that personality factors such as trait anxiety, extraversion and conscientiousness are related to performance when objective task demands
(time pressure, etc.) are relatively low (Schell & Reilley, 2004; Schell et al., 2002). But, meta-analyses have found more consistent relationships between certain traits (i.e., conscientiousness) and
rated work performance (Barrick & Mount, 1991; Conte & Jacobs, 2003; Tett, Jackson, & Rothstein, 1991). It is not clear why these relationships are not stronger when performance is measured
objectively rather than subjectively. It has been suggested that accounting for the characteristics of
50
K.L. Schell et al. / Personality and Individual Differences 39 (2005) 47–60
the task being studied may help answer this question (Hockey, 1984), and so we feel that personality must be studied in specific task contexts to understand how it relates to objective performance measures. Furthermore, state variables should affect performance since, for example,
stress and anxiety can lead to cognitive ruminations and other distractions which can then affect
performance either positively or negatively (Sarason, Sarason, & Pierce, 1990; Zeidner, 1998). So,
it is not clear whether state or trait factors should be more influential on performance in a quality
control task.
We expected that trait factors would account for more variance in hit ratios than would state
factors for two reasons. In previous studies using the simulation task, traits have shown stronger
correlations with performance than states (e.g., Grasha et al., 2000; Grasha & Schell, 1999; Schell
& Grasha, 2000). However, much of this research involved a prescription dispensing task instead
of a checking task. Since the dispensing task clearly requires different skills and behaviors than the
checking task used in this study, we needed to determine if traits were more predictive of pharmacy performance in general or if their predictive utility was restricted to dispensing only. Second,
the stimuli in a prescription checking task can be considered a variably-mapped set (Schneider &
Shiffrin, 1977), which forces a controlled processing mode for a longer period of time (Ackerman,
1987, 1988). Research has shown that abilities and personality traits are more predictive of performance during controlled processing, especially in early task performance (Ackerman & Kanfer,
1993; Ackerman, Kanfer, & Goff, 1995). Thus, traits were expected to show stronger relationships
to performance in our task.
To summarize, we expected that trait factors would be more predictive of performance on the
task than were state measures. In other words, it was expected that trait-based models would account for significantly more variance in hit ratios than would similarly constructed state-based models. Because the state variables (i.e., mood and workload) were measured both at mid-task and at
post-task points in time, separate models were constructed and tested for each measurement point.
2. Method
2.1. Participants
Ninety-one students enrolled in psychology courses at Angelo State University volunteered for
the research in exchange for course credit. In the sample, there were 27 men and 64 women with a
mean age of 20 (SD = 4).
2.2. Instruments
Avoidant motivational tendencies were measured using a subset of a self-report survey (Carver
& White, 1994) composed of seven items scaled from 1 (very true for me) to 4 (very false for me).
Trait anxiety was assessed using the trait section of the State–Trait Anxiety Instrument (Spielberger, 1983), consisting of 20 items scaled from 1 (almost never) to 4 (almost always). Extraversion
and Neuroticism were assessed using the Eysenck Personality Inventory (Eysenck, 1981), consisting of 57 questions to which respondents answered ‘‘yes’’ or ‘‘no’’. Finally, Type A tendencies and
social anxiety were assessed using the Grasha Holistic Stress Test (Grasha, 1990). The Type A and
K.L. Schell et al. / Personality and Individual Differences 39 (2005) 47–60
51
the social anxiety scales consisted of 22 questions and 10 questions, respectively, both scored on a
5-point scale from 1 (disagree) to 5 (agree).
The Dundee State Stress Questionnaire (DSSQ: Matthews, Jones, & Chamberlain, 1990, 1992)
was used to measure task-related mood, motivation and cognitions associated with stress, arousal
and fatigue. The mood section of the DSSQ consisted of 29 items scored from 1 (definitely) to 4
(definitely not). Respondents were presented with adjectives and asked whether they experienced
those feelings as they worked on the task. Energization accounted for 8 of the 29 items, the tension
mood scale of the DSSQ accounted for 8 of the 29 items, and the hedonic tone scale accounted for
8 of the 29 items. All of these scales had minimum and maximum ranges of 8–32. Finally, the
anger scale of the DSSQ accounted for the last five items, with minimum and maximum scores
of 5–20. A separate section of the DSSQ measured task-related motivation on eight items scored
from 0 to 9. Because of reverse scoring procedures, each question contained different verbal anchors. This scale carried a potential range of 0–72, with low scores indicating lower motivation.
The DSSQ has been used to study a variety of behaviors and has been shown to be valid and reliable across work domains (Matthews et al., 1990; Matthews & Desmond, 1998; Matthews et al.,
1992).
A modified version of the NASA Task Load Index (TLX: Hart & Staveland, 1988) was used to
measure perceived workload during the task. Six dimensions of workload are measured on the
NASA Task Load Index: mental demand, physical demand, temporal demand, perception of performance, effort, and frustration, of which four will be used in this study. The TLX is perhaps the
most respected measure of perceived workload currently is use (Nygren, 1991), but because of
psychometric problems with the recommended weighting procedure used to obtain a global workload score (Dickinson, Byblow, & Ryan, 1993), we treated the six dimensions of workload as separate variables.
2.3. Simulation environment
Two computer workstations were located in a medium sized room. Each workstation was composed of a table, chair and computer with a keyboard and mouse. A small digital clock was positioned so participants would be aware of the time of day. Lighting, temperature and noise levels
were held constant within comfort regions. The workstations faced away from one another to
avoid unintentional eye contact by participants.
Products consisted of a prescription bottle containing an amount of simulated drug materials
and an order card, individually packaged in plastic bags. Simulated drugs were made out of beads,
nuts, washer, and paper clips of several different colors, sizes and/or shapes, half of which were
marked to simulate sound-alike/look-alike drug errors. Each item class was divided into two sets,
half of which were marked with a black permanent marker and their names slightly altered to
indicate their designation. For example, ‘‘BEADS-D-GL’’ indicated a gold colored, donut-shaped
bead, while ‘‘BEEDS-D-GL’’ was the same bead with a black mark. Additional distinctions were
made using simulated dosages such as ‘‘1.0’’ and 10.0’’ for items where there was more than one
size, such as the nuts and washers. More information on the items has been included in earlier
reports (see Grasha & Schell, 2001; Schell et al., 2003).
All order cards contained the following information: customer name, address, product name,
product quantity, special instructions (a random phrase that was included to enhance realism,
52
K.L. Schell et al. / Personality and Individual Differences 39 (2005) 47–60
such as ‘‘Re-order every 90 days’’, etc.), telephone number and an order number. The order cards
were handwritten and made of standard card stock paper. The prescription labels used for comparison were typed in 12-point Times Roman font and contained the same basic information.
They were pasted on a separate sheet of paper for each order and placed in sequence in a small
desk calendar apparatus. The orders were grouped in eight bundles of 5, totaling 40 orders in each
of two sets. Thus, participants could verify a maximum of 80 orders during the simulation.
There were three possible types of errors: errors in information (spelling, omission or inclusion),
product mismatches, or item counts. It was possible to have more than one error on a particular
order. Two different sets of orders were used, each with their own inserted errors with error ratios
ranging from 26% to 38%. However, a constant ratio of 2:1 was maintained between errors of
incorrect information and product mismatch and counting errors in all sets, since the former type
of error is more common in actual pharmacies (Allan, Barker, Malloy, & Heller, 1995; Flynn &
Barker, 1999).
2.4. Procedure
Training consisted of an introduction to task materials, an overview of the task, and 10 supervised ‘‘practice trials’’. A ‘‘trial’’ involved the following steps: an order was selected and three
comparisons were performed. The first comparison involved the order card and the printed label,
the second comparison involved the order card and the item inside the bottle, and the last comparison involved the order card and the amount of the item in the bottle. On their computer workstation, participants answered three questions about each order, indicating whether (a) the item in
the bottle was the correct item, (b) the amount of the item was correct, and (c) the information on
the order card matched the information on the printed label. Next, the trial was completed by
entering the order number into the computer. Each set of orders contained two blocks of 40 orders each, which the participants were given 80 min to complete (40 min per block). This time
limit was chosen because it has been shown to be a comfortable amount of time for most participants to complete the set (Reilley et al., 2002). The DSSQ was administered at three points: before the simulation, after the first set of 40 orders, and after the second set of 40 orders. The TLX
was administered following each 40-order set. After all orders had been checked or time had
expired, participants were debriefed and released.
3. Results
The data were first analyzed descriptively in order to examine summary statistics over time for
the trait and state predictors. Means and standard deviations for these factors and for overall hit
ratio are shown in Table 2. Standard deviations for hit ratios were much higher than for false
alarm ratios. Also, the mean hit ratio of 65% is not surprising given the complexity of the stimuli
being judged, i.e., the potential presence of more than one kind of error. Mean scores on the
DSSQ and TLX scales were within expected ranges for a task of this type based on norms available in Matthews et al. (1990). Since our intent was not to induce stress in these participants, there
was no need for a manipulation check or a comparison with means observed in other domains.
Also, previous research has shown the pharmacy task to be moderate in workload as compared
K.L. Schell et al. / Personality and Individual Differences 39 (2005) 47–60
53
Table 2
Means and standard deviations for overall hit rate and false alarm rate
Variable
Mean
SD
Overall hit rate
Overall false alarm rate
65.17
92.48
22.41
6.82
Note: False alarm rates are calculated as the percentage of correct orders judged as correct. r (hit rate, false alarm
rate) = .284, p < .02.
to a simple card sorting task (Grasha & Schell, 1999), and these data are in line with that observation also.
Correlations among trait and state predictors are shown in Tables 3 and 4, and correlations
between hit ratios and these predictors are shown in Table 5. There were some significant relationships among the predictors, which was expected given the relatedness of the constructs. It was
noted that Frustration was the workload dimension most consistently correlated with DSSQ subscales. Also, correlations among DSSQ and TLX factors decreased after the task when compared
to mid-task statistics. Finally, though the tables do not show trait–state correlations, we can report that in almost all cases, traits and states were not significantly correlated. The one exception
was post-task temporal demand, which was correlated significantly with the anxiety trait measures
(trait and social anxiety, inhibition motivation and neuroticism).
Finally, changes in DSSQ and TLX scores over time were assessed using one-factor repeatedmeasures ANOVAs. These analyses revealed that participants reported less motivation following
the task than they reported in the middle of the task (F(1, 81) = 28.37, p < .001), but no other
DSSQ score changed significantly during the task. For TLX scores, however, mental demand,
effort and frustration all increased significantly comparing mid-task to post-task scores (all
pÕs < .05).
To examine the hypothesis in this study, data were checked for regression assumptions of normality using one-sample Kolmogorov–Smirnov tests. These tests were not significant in all cases,
even though false alarm ratios were somewhat negatively skewed. Thus, regression equations were
constructed using the following steps. First, bivariate correlations were calculated between all predictors and the overall hit ratio criterion; significant correlations between proposed predictors and
Table 3
Correlations among trait predictors of hit rate
Variable
1.
2.
3.
4.
5.
6.
7.
Trait anxiety
Inhibition motive
Social stress
Type A behavior
Locus of control
Extraversion
Neuroticism
1
2
47
52
36
34
22
81
3
29
43
14
35
55
4
26
17
34
49
5
07
28
42
6
10
36
Note: Decimals removed for readability. Bolded correlations are significant at p = .01.
7
17
54
Variable
1. Mental demand—Mid-task
2. Temporal demand—Mid-task
3. Effort—Mid-task
4. Frustration—Mid-task
5. Mental demand—Post-task
6. Temporal demand—Post-task
7. Effort—Post-task
8. Frustration—Post-task
9. Engagement—Mid-task
10. Tension—Mid-task
11. Hedonic tone—Mid-task
12. Anger—Mid-task
13. Motivation—Mid-task
14. Engagement—Post-task
15. Tension—Post-task
16. Hedonic tone—Post-task
17. Anger—Post-task
18. Motivation—Post-task
1
2
45
47
05
82
33
55
14
10
15
01
09
21
02
08
02
05
05
3
36
35
35
64
42
26
11
35
13
26
13
13
21
10
12
17
4
15
52
23
47
19
36
14
04
06
42
09
08
03
00
23
5
08
27
15
70
42
47
55
61
32
30
07
29
16
12
6
38
59
16
20
12
03
03
34
03
08
08
07
19
7
43
29
15
25
09
23
03
17
08
18
07
16
8
07
26
20
03
00
35
10
14
01
08
32
9
32
43
54
63
28
39
19
41
22
23
Note: Bolded correlations are significant at p = .01. Decimals removed for readability.
10
16
21
50
41
26
00
18
09
26
40
48
13
08
33
21
19
08
11
61
56
21
16
37
24
11
12
51
33
19
37
35
21
13
18
04
17
24
45
14
45
71
48
46
15
53
56
13
16
73
31
17
24
K.L. Schell et al. / Personality and Individual Differences 39 (2005) 47–60
Table 4
Correlations among state predictors of hit rate
K.L. Schell et al. / Personality and Individual Differences 39 (2005) 47–60
55
Table 5
Correlations between hit rate and the trait and state predictors
Predictor
Pre-task
Mid-task
Post-task
Trait anxiety
Inhibition motivation
Social anxiety
Neuroticism
Extraversion
Type A Trait
Locus of control
Mental demand
Temporal demand
Effort
Frustration
Energy
Tension
Hedonic tone
Anger
Motivation
.12
.03
.31*
.11
.18
.27**
.15
n/a
n/a
n/a
n/a
.12
.03
.06
.07
.30*
n/a
n/a
n/a
n/a
n/a
n/a
n/a
.02
.03
.27**
.12
.13
.21
.17
.25**
.30*
n/a
n/a
n/a
n/a
n/a
n/a
n/a
.04
.02
.05
.02
.01
.06
.05
.09
.03
Note: n/a = Variable not measured at this time.
p < .01.
**
p < .05.
*
the dependent measure were identified. Next, exploratory hierarchical models were built using the
residual correlation method, where the strongest bivariate predictor was entered into the model
and then other predictors were correlated with the residuals of that model. The building process
stopped when no other predictors correlated with the residuals of the final model. Relevant model
statistics are displayed in Table 6.
First, the set of trait predictors was regressed on the overall hit ratio criterion. From the correlation matrix, we selected social stress (r(72) = .31) to be entered first, followed by overall false
alarm ratios (r(78) = .28) and Type A tendencies (r(72) = .27). Further analysis of regression
Table 6
Regression equations for overall hit rates as predicted by trait and state factors
R2D
Predictors
2
2
Model T—Trait Factors (R = .55, R = .31, R , adj. = .26
Social stress
Overall false alarm rate
Type A Trait
Locus of controla
F(4, 64) = 7.07, p < .01)
.11
.08
.06
.06
Model S—Mid-task state factors (R = .40, R2 = .16, R2, adj. = .14, F(2,70) = 6.76, p < .01)
Motivation
.09
Overall false alarm rate
.07
a
Std. beta
.26
.30
.25
.25
.29
.27
High LoC scores indicate internal control beliefs. All predictors univariately significant at least at p < .05.
56
K.L. Schell et al. / Personality and Individual Differences 39 (2005) 47–60
residuals suggested that locus of control should also be included, even though it was not bivariately related to the overall hit ratio. Each of these four predictors was associated with a significant
change in R2 and produced the final model shown in Table 6. This model explains an adequate
proportion of the variance in overall hit ratio with an acceptable amount of shrinkage for an
exploratory model (R2 = .31, adjusted R2 = .26).
Next, sets of mid-task and post-task state predictors were regressed on the hit ratio criterion.
For the mid-task analysis, we selected mid-task motivation (r(73) = .30) to be entered first. The
only other variable included in the model shown in Table 4 was overall false alarm ratio—anger
and effort, even though they were correlated bivariately with hit ratio, did not significantly increase the variance accounted for in the model when they were added. R2 for this model was lower
than for the trait model (R2 = .16, adjusted R2 = .14). Also, when post-task state measures were
used in the model-building process, a regression model could not be calculated since no post-task
state variables were significantly related to overall hit rate.
4. Discussion
The purpose of this research was to explore whether traits or states are more predictive of error
detection in a quality control task. These data suggest that traits are more predictive of hit rates in
a quality control task than are state measures such as mood and workload. This is evidenced by
larger R2 values for the trait model versus the state model, as well as the larger number of traits
that are independently predictive of hit ratios.
Examining the predictors in the trait model (see Table 5), anxiety-related constructs are wellrepresented. This is consistent with previous research linking anxiety to task performance (Anton
& Klisch, 1995; Mahar, Henderson, & Deane, 1997). Social anxiety showed the strongest relationship to hit ratios, a result seen before using our simulation (Grasha et al., 2000). It may seem to
some that social anxiety should not be relevant to performance on this task because it was performed alone without performance feedback. Participants may have believed some sort of evaluation of them would be done later, which would connect these results with a large literature on
evaluation anxiety (see Sarason et al., 1990, for example). In the pharmacy, social anxiety can
come from a number of sources: customers, co-workers, supervisors, etc. The identification of
social anxiety as a predictor of error detection suggests that it should be studied further for its
impact on pharmacy performance.
The positive partial correlations between social anxiety and performance (as well as between
Type A tendencies and performance) could mean that participants thought the task was simple,
so that anxiety was facilitative (Zeidner, 1998). However, hit ratios and false alarm ratios were
negatively associated, suggesting a liberal criterion shift. Workload signatures on the task are
comparable to but slightly lower than those obtained in field studies of pharmacists (Grasha
et al., 2000), which would also support the liberal decision criterion explanation. In addition,
the regressions in Table 6 show that false alarm ratios explained a significant amount of the variance in hit ratios in both models. This suggests that a criterion shift may explain these data better
than an anxiety-facilitation argument. A study that induces anxiety before task performance
would add a needed perspective to this question.
K.L. Schell et al. / Personality and Individual Differences 39 (2005) 47–60
57
With respect to the state model, task-related motivation is an interesting factor. The motivation
scale used in this research taps into what Ackerman and Kanfer (1993) call ‘‘motivational
thoughts’’. Questions assessed, for instance, whether the participant thought the task was interesting, how much they wanted to succeed, and how worthwhile they thought the task was. Ackerman
and Kanfer (1993) reported that both positive and negative motivational thoughts were related to
performance in complex tasks, and the data in this study are consistent with this conclusion as far
as positive thoughts go.
It is intriguing that some participants reported the task to be interesting and worthwhile, even
though the task seems to be rather boring and trite. It is possible that motivational thoughts were
more positive because participants understood the societal importance of the topic under study.
Clearly, reducing medication errors is a worthwhile cause, and participantsÕ motivation may reveal a belief that their time was being used for a ‘‘greater good’’. It is also possible that motivated
individuals ‘‘manufactured’’ interest to make up for the taskÕs monotony. This would conform to
the self-regulation literature, which argues that people will maintain motivation to complete uninteresting tasks by altering them (Ryan & Deci, 2000; Sansone & Smith, 2000). For example, perhaps participants imagined the task as a kind of game, challenging themselves and setting
personal performance standards. Another explanation is that cognitive dissonance (Thibodeau
& Aronson, 1992; Elliot & Devine, 1994) may have played a role. This was a long task to complete
for course credit, so the ‘‘motivated’’ individuals may have reported more motivation to explain
why they spent over two hours doing a mundane task. The correlation between motivation and
performance was not significant post-task, which could suggest that the charade of motivation
had run out of energy by the end of the day.
Future research should address some of this studyÕs limitations. First, it is possible that other
traits not measured in this work are also important; more exploration is needed here. Second, the
weak correlations among state factors and performance may be an artifact of the type of performance studied. Future research should enhance the simulation and add other contextual variables. Finally, narrower measures of personality traits may yield clearer conclusions; specific
hypotheses in this area should be developed and tested.
Acknowledgments
This research was partially inspired by masterÕs theses completed by Abbie Woodruff, Brandon
Corbin and Ellen Melton and was partially supported by funds from an internal grant supplied
by Angelo State University. We acknowledge William Davidson, William Fuller, Rob Mowrer
and William Montgomery for their input on the original documents, Luz-Eugenia Cox-Fuenzalida for comments on earlier drafts, and our research team at ASU for their help in many
ways.
References
Ackerman, P. L. (1987). Individual differences in skill learning: an integration of psychometric and information
processing perspectives. Psychological Bulletin, 102(1), 3–27.
58
K.L. Schell et al. / Personality and Individual Differences 39 (2005) 47–60
Ackerman, P. L. (1988). Determinants of individual differences during skill acquisition: cognitive abilities and
information processing. Journal of Experimental Psychology: General, 117(3), 288–318.
Ackerman, P. L., & Kanfer, R. (1993). Integrating laboratory and field study for improving selection: development of a
battery for predicting air traffic controller success. Journal of Applied Psychology, 78(3), 413–432.
Ackerman, P. L., Kanfer, R., & Goff, M. (1995). Cognitive and noncognitive determinants and consequences of
complex skill acquisition. Journal of Experimental Psychology: Applied, 1(4), 270–304.
Allan, E. L., Barker, K. N., Malloy, M. J., & Heller, W. M. (1995). Dispensing errors and counseling in community
practice. American Pharmacy, 12, 25–33.
Anton, W. D., & Klisch, M. C. (1995). Perspectives on mathematics anxiety and test anxiety. In C. D. Spielberger & P.
R. Vagg (Eds.), Test anxiety: Theory assessment and treatment (pp. 93–106). Washington, DC: Taylor & Francis.
Barrick, M. R., & Mount, M. K. (1991). The Big Five personality dimensions and job performance: a meta-analysis.
Personnel Psychology, 44, 1–26.
Becker, A. B., Warm, J. S., Dember, W. N., & Hancock, P. A. (1995). Effects of jet engine noise and performance
feedback on perceived workload in a monitoring task. International Journal of Aviation Psychology, 5(1), 49–62.
Braarud, P. O. (2001). Subjective task complexity and subjective workload: criterion validity for complex team tasks.
International Journal of Cognitive Ergonomics, 5(3), 261–273.
Bunce, D., & Sisa, L. (2002). Age differences in perceived workload across a short vigil. Ergonomics, 45(13), 949–960.
Carayon-Sainfort, P. (1992). The use of computers in offices: impact on task characteristics and worker stress.
International Journal of Human–Computer Interaction, 4(3), 245–261.
Carver, C. S., & White, T. L. (1994). Behavioral inhibition, behavioral activation, and affective responses to impending
reward and punishment: the BIS/BAS scales. Journal of Personality & Social Psychology, 67, 319–333.
Cohen, M. R. (Ed.). (1999). Medication errors. Washington, DC: American Pharmaceutical Association.
Conte, J. M., & Jacobs, R. R. (2003). Validity evidence linking polychronicity and Big Five personality dimensions to
absence, lateness, and supervisory performance ratings. Human Performance, 16, 107–129.
Davies, D. R., & Parasuraman, R. (1982). The psychology of vigilance. London: Academic Press.
Dickinson, J., Byblow, W. D., & Ryan, L. A. (1993). Order effects and the weighting process in workload assessment.
Applied Ergonomics, 24, 357–361.
Elliot, A. J., & Devine, P. G. (1994). On the motivational nature of cognitive dissonance: dissonance as psychological
discomfort. Journal of Personality & Social Psychology, 67(3), 382–394.
Eysenck, H. J. (1981). A model for personality. Berlin: Springer.
Flynn, E. A., & Barker, K. N. (1999). Medication errors research. In M. Cohen (Ed.), Medication errors (pp. 6.1–6.30).
Washington, DC: American Pharmaceutical Association.
Grasha, A. F. (1990). Holistic stress test. Cincinnati, OH: Communication & Education Associates.
Grasha, A. F., & OÕNeill, M. (1996). Cognitive processes in medication errors. US Pharmacist, 9, 96–109.
Grasha, A. F., & OÕNeill, M., (1998). Human error and the pharmacy process. Paper presented at the National
Association of Chain Drug Stores Meeting, San Diego, CA.
Grasha, A. F., Reilley, S., Schell, K. L., Tranum, D., & Filburn, J., (2000). A cognitive systems perspective on human
performance in the pharmacy: implications for accuracy, effectiveness, and job satisfaction. Technical Report
062100-R. Cincinnati, OH: Cognitive-Systems Performance Laboratory, University of Cincinnati.
Grasha, A. F., & Schell, K. L., (1999). Psychosocial factors and time-accuracy functions in a simulated pharmacy.
Paper presented at the American Psychological Association annual conference, Boston, MA.
Grasha, A. F., & Schell, K. L. (2001). Psychosocial factors, workload and human error in a simulated pharmacy
dispensing task. Perceptual and Motor Skills, 92, 53–71.
Hancock, P. A., & Meshkati, N. (Eds.). (1988). Human mental workload. Amsterdam: North-Holland.
Hart, S. G., & Staveland, L. E. (1988). Development of NASA-TLX Task Load Index: results of empirical and
theoretical research. In P. A. Hancock & N. Meshkati (Eds.), Human mental workload (pp. 139–183). Amsterdam:
North-Holland.
Hockey, G. R. J. (1984). Varieties of attentional state: the effects of the environment. In R. Parasuraman & D. R.
Davies (Eds.), Varieties of attention (pp. 156–189). New York: Academic Press.
Johnson, J. W., Schneider, R. J., & Oswald, F. L. (1997). Toward a taxonomy of managerial performance profiles.
Human Performance, 10(3), 227–250.
K.L. Schell et al. / Personality and Individual Differences 39 (2005) 47–60
59
Mahar, D., Henderson, R., & Deane, F. (1997). The effects of computer anxiety, state anxiety, and computer experience
on usersÕ performance of computer based tasks. Personality & Individual Differences, 22(5), 683–692.
Matthews, G., & Amelang, M. (1993). Extraversion, arousal theory and performance: a study of individual differences
in the EEG. Personality & Individual Differences, 14, 347–363.
Matthews, G., Davies, D. R., & Holley, P. J. (1990). Extraversion, arousal and visual sustained attention: the role of
resource availability. Personality & Individual Differences, 11, 1159–1173.
Matthews, G., Davies, D. R., Westerman, S. J., & Stammers, R. B. (2000). Human performance: cognition, stress and
individual differences. Philadelphia, PA: Taylor & Francis.
Matthews, G., & Desmond, P. A. (1998). Personality and multiple dimensions of task-induced fatigue: a study of
simulated driving. Personality & Individual Differences, 25, 443–458.
Matthews, G., & Desmond, P. A. (2001). Stress and driving performance: implications for design and training. In P. A.
Hancock & P. A. Desmond (Eds.), Stress, workload and fatigue (pp. 211–231). Mahwah, NJ: Erlbaum.
Matthews, G., Jones, D. M., & Chamberlain, A. G. (1990). Refining the measurement of mood: the UWIST mood
adjective checklist. British Journal of Psychology, 81, 17–42.
Matthews, G., Jones, D. M., & Chamberlain, A. G. (1992). Predictors of individual differences in mail-coding skills and
their variation with ability level. Journal of Applied Psychology, 77, 406–418.
Mount, M. K., & Barrick, M. R. (1998). Five reasons why the Big Five article has been frequently cited. Personnel
Psychology, 51(4), 849–857.
Nygren, T. E. (1991). Psychometric properties of subjective workload measurement techniques: implications for their
use in the assessment of perceived mental workload. Human Factors, 33, 17–33.
Prinzel, L. J., & Freeman, F. G. (1997). Task-specific sex differences in vigilance performance: subjective workload and
boredom. Perceptual & Motor Skills, 85(3), 1195–1202.
Reilley, S., Grasha, A. F., & Schafer, J. (2002). Workload, error detection and experienced stress in a simulated
pharmacy verification task. Perceptual & Motor Skills, 95, 27–46.
Ryan, R. M., & Deci, E. L. (2000). When rewards compete with nature: the undermining of intrinsic motivation and
self-regulation. In C. Sansone & J. M. Harackiewicz (Eds.), Intrinsic and extrinsic motivation: The search for optimal
motivation and performance (pp. 13–54). San Diego, CA: Academic Press.
Sansone, C., & Smith, J. L. (2000). Interest and self-regulation: the relation between having to and wanting to. In C.
Sansone & J. M. Harackiewicz (Eds.), Intrinsic and extrinsic motivation: The search for optimal motivation and
performance (pp. 341–372). San Diego, CA: Academic Press.
Sarason, I. G., Sarason, B. R., & Pierce, G. R. (1990). Anxiety, cognitive interference and performance. Journal of
Social Behavior & Personality, 5(2), 1–18.
Schell, K. L., & Grasha, A. F. (2000). State anxiety, performance accuracy and work pace in a simulated pharmacy
dispensing task. Perceptual and Motor Skills, 90, 547–561.
Schell, K. L., & Reilley, S. P. (2004). The Big Five and quality control tasks: personality and error detection accuracy.
Psychological Reports, 94, 1301–1311.
Schell, K. L., Reilley, S., Grasha, A. F., & Tranum, D. (2003). Improving accuracy in simulated pharmacy assembly
tasks using workspace interventions to enhance the cognitive environment. Perceptual & Motor Skills, 96(3),
915–926.
Schell, K. L., Woodruff, A., Corbin, B., Bilsing, L., & Melton, E., (2002). A comprehensive investigation of error
detection effectiveness in a complex product verification task. Technical Report 0802-01. San Angelo, TX: Human
Performance Laboratory, Angelo State University.
Schneider, W., & Shiffrin, R. M. (1977). Controlled and automatic human information processing: I. Detection, search
and attention. Psychological Review, 84, 1–66.
Shapiro, D. H., Schwartz, C. E., & Astin, J. A. (1996). Controlling ourselves, controlling our world: psychologyÕs role in
understanding positive and negative consequences of seeking and gaining control. American Psychologist, 51,
1213–1230.
Smith, M., & Bourke, S. (1992). Teacher stress: Examining a model based on context, workload, and satisfaction.
Teaching & Teacher Education, 8(1), 31–46.
Spielberger, C. D. (1983). Manual for the state–trait anxiety inventory. Palo Alto, CA: Consulting Psychologists
Press.
60
K.L. Schell et al. / Personality and Individual Differences 39 (2005) 47–60
Tett, R. P., Jackson, D. N., & Rothstein, M. (1991). Personality measures as predictors of job performance: a metaanalytic review. Personnel Psychology, 44(4), 703–742.
Thibodeau, R., & Aronson, E. (1992). Taking a closer look: reasserting the role of the self-concept in dissonance theory.
Personality & Social Psychology Bulletin, 18(5), 591–602.
Tranum, D., Matthews, G., & Grasha, A. F., (2001). Skill acquisition and mood change in a pharmacy simulation.
Poster presented at the American Psychological Association annual conference, San Francisco, CA.
Wickens, C. D. (1992). Engineering psychology and human performance (2nd ed.). New York: Harper Collins.
Zeidner, M. (1998). Test anxiety: The state of the art. New York: Plenum.