anglim 2011 skill acquisition phd thesis

STRATEGIES IN SKILL ACQUISITION:
RECONCILING CONTINUOUS MODELS OF
THE LEARNING CURVE WITH
ABRUPT STRATEGY SHIFTS
JEROMY KYRAM ANGLIM
Submitted in total fulfilment of the
requirements of the degree of Doctor of Philosophy
March 2011
PSYCHOLOGICAL SCIENCES
THE UNIVERSITY OF MELBOURNE
ii
Abstract
How does task completion time change with practice and what processes
underlie this change? Despite over 100 years of scientific research (e.g.,
Bryan & Harter, 1899) no wholly satisfactory answer has yet emerged.
After analysing many skill acquisition datasets Newell and Rosenbloom
(1981) influentially declared that the relationship between practice and
task completion time was best represented by a power function labelling
the relationship the Power Law of Practice. Use of the term ‘law’ might
suggest that the case was closed, yet several recent findings have challenged the Power Law’s ‘legal’ status. First, Heathcote, Brown, and
Mewhort (2000) concluded that the Power Law of Practice was an artefact of aggregation. Across a large number of skill acquisition datasets
they showed that when analysed at the individual-level the exponential function tended to provide superior fit. Second, several researchers
have suggested that strategy shifts may even cause discontinuities in the
learning curve (e.g., Delaney, Reder, Staszewski, & Ritter, 1998; Haider
& Frensch, 2002; Rickard, 1997). In summary, findings suggest that the
power function is an analytical artefact and that learning may in some
instances involve discontinuities.
In response to these challenges, this thesis had three aims. The first
aim was to develop and test mathematical models of the relationship
between practice, strategy use and performance. The second aim was to
assess the role of individual differences, including prior experience, ability,
iii
iv
and personality in predicting strategy use and performance. The third
aim was to model the differential effects of instructed versus self-initiated
strategy shift on strategy use and performance. Collectively, the aims
were designed to provide a multifaceted explanation of the relationship
between practice, strategy use, and performance.
To achieve these aims three studies were conducted. In each study
participants completed a set of trials on a text editing task. On each trial
strategy sophistication and task completion time were measured. Final
sample sizes in the three studies were n1 = 63, n2 = 154, and n3 = 154.
Each study also measured a selection of individual difference variables
including prior experience, demographics, personality, and ability. Text
editing was chosen as the criterion task because strategy use is important to task performance and strategy use could readily be measured.
The text editing task was developed to enable trial-level measurement
of strategy use. Strategy sophistication was operationalised as the proportion of key presses used that were classified as sophisticated (e.g.,
using control and right cursor keys to move between words) as opposed
to simple (e.g., using just the right cursor to move between characters).
In Studies 1 and 2 all participants received the same instructions. In
Study 3 participants were randomly assigned to one of three conditions
with varying instructions. In a No Training condition practice preceded
without interruption, in a Training condition additional instructions were
presented halfway through practice, and in a Control condition a filler
task was presented halfway through practice. Aims 1 and 2 were assessed
by Study 1 and 2 and the No Training condition of Study 3. Aim 3 was
assessed by comparing the conditions in Study 3.
With regard to Aim 1 results from the three studies told a consistent story. Results reiterated the importance of analysing data at the
individual-level. While at the group-level, a three parameter power function provided superior fit, at the individual-level a three parameter ex-
v
ponential function was significantly better in two out of three studies.
Similarly, at the group-level, strategy sophistication was a continuously
increasing, monotonically decelerating function of practice, well modelled
by a three parameter Michaelis–Menten function. In contrast, at the
individual-level, the functional form of the relationship varied dramatically between individuals with a variety of often discontinuous functions
providing good fit.
Although abrupt strategy shifts did occur, meaningful discontinuities
in the relationship between practice and task completion time were rare.
Findings supported a model that explained how abrupt strategy shifts
can co-occur with continuous learning curves. These findings were that:
(a) strategy shifts were more likely to occur early in practice when other
learning was occurring; (b) trial-to-trial variance in task completion time
was often large relative to the benefits of the strategy shift; and (c)
strategy shifts often took several trials to be fully realised. These and
other factors combined to generally smooth out the discontinuous effects
of strategy shift on performance.
In relation to the second aim, concerning individual differences, ability and prior experience consistently emerged as moderate to strong predictors of task performance, whereas self-reported Big 5 personality was
unrelated to task performance. Similar but generally weaker relationships
were found between individual differences and strategy sophistication. A
model that proposed that the effect of ability and prior experience on
task performance was mediated by strategy sophistication was not supported. Findings were broadly consistent with cognitive correlates and
skill transfer models of individual differences.
In relation to the third aim, looking at differences between instructed
strategy shift and self-initiated strategy shift, hypotheses were partially
supported.
In summary, relative to self-initiated strategy shifts, in-
structed strategy shifts were more abrupt. Performance also tended to
vi
decline sharply immediately following the instructed strategy shift. After
additional practice, performance was similar to groups that had not received instructed strategy shift. The study highlighted how the dynamics
of instructed strategy shift differ from self-initiated strategy shift with
regards to discontinuities.
Taken together the results tell an interconnected story regarding the
relationship between practice, strategy use, and performance. This thesis
contributes to skill acquisition research through a unique combination of
features including trial-level measurement of strategy use, individuallevel modelling, and the use of nonlinear and discontinuous functions.
It is hoped that future research will build on this approach using other
samples, tasks, and contexts.
vii
Declaration
I, Jeromy Anglim, certify that
(i) this thesis comprises only my original work towards the PhD except
where indicated in the Preface*,
(ii) due acknowledgement has been made in the text to all other material used,
(iii) this thesis is less than 100,000 words in length, exclusive of tables,
maps, bibliographies and appendices.
Signature:
Date:
viii
Preface
The dataset presented as Study 1 of this thesis comes from a collaboration between Niloufar Mahdavi, Janice Langan-Fox (my own principal
supervisor), and myself. The study formed part of Niloufar Mahdavi’s
honours thesis who was supervised by Janice Langan-Fox. The overall
design in terms of measurement of abilities and use of a criterion text
editing task was based on a series of studies that Janice Langan-Fox had
completed with various research students in the preceding years. Niloufar Mahdavi’s use of the dataset was distinct from my use. I was involved
in some design decisions, most notably the measurement of strategy use,
the choice of text editing keys, and the passage of text to be edited.
I programmed the experimental task, designing how strategy use and
performance data were recorded and setting out the algorithms by which
strategy was extracted from key logs. All analyses presented in this thesis
were conducted from raw data and represent my own work.
Initial analyses of Study 1 were presented in the following conference
proceedings
• Anglim, J., Langan-Fox, J., & Mahdavi, N. (2005). Modeling the
Relationship between Strategies, Abilities and Skilled Performance.
CogSci 2005, 27th Annual Meeting of the Cognitive Science Society,
July 21–23 Stresa, Italy.
ix
Acknowledgements
I would like to thank:
• Janice Langan-Fox, my principal supervisor, for inspiring me to
pursue a PhD, keeping me on track, setting high standards, and
guiding me during the critical moments.
• Alex Wearing for his wisdom, support, and encouragement.
• Yoshi Kashima for his collegiality, suggestions, and encouragement.
• Richard Bell for his helpful suggestions and encouragement.
• Niloufar Mahdavi for kindly and professionally collaborating with
me.
• Philip Smith, Paul Dudgeon, Murray Aitkin, and Garry Robins for
their instruction and support.
• Phillip Ackerman and Ruth Kanfer for sharing some of their raw
data with me. Although this data was not incorporated into the
final thesis, it helped me develop ideas about modelling learning
curves at the individual-level.
• Richard Moulding for reading an early draft and providing useful
suggestions.
• The many individuals who developed the various open source
tools—R, LaTeX, Sweave, git, make, bibtex, and more—used in
the creation of this thesis.
• The participants for giving up their time to be involved in my
research.
• Corrine for her love and support, the way that she listened, her
gentle encouragement, and her practical advice.
x
• Jasmine for her love and support over the course of my life, helping
me get over the hurdles and continue my journey in psychology,
and reminding me from time to time of John C. King’s prophetic
words of encouragement.
Contents
1 Introduction
1.1
1
Research Problems and Aims . . . . . . . . . . . . . . .
1
1.1.1
Practice, Strategy, and Performance . . . . . . . .
2
1.1.2
Individual Differences . . . . . . . . . . . . . . . .
4
1.1.3
Instructed Strategy Shift . . . . . . . . . . . . . .
4
1.2
Scope and Study Design . . . . . . . . . . . . . . . . . .
5
1.3
Thesis Structure . . . . . . . . . . . . . . . . . . . . . . .
6
2 Practice, Strategy, and Performance
2.1
2.2
7
Introduction . . . . . . . . . . . . . . . . . . . . . . . . .
8
2.1.1
Overview . . . . . . . . . . . . . . . . . . . . . .
8
2.1.2
Practice and Performance . . . . . . . . . . . . .
9
2.1.3
Practice and Strategy . . . . . . . . . . . . . . . .
21
2.1.4
Strategy and Performance . . . . . . . . . . . . .
38
2.1.5
The Current Studies . . . . . . . . . . . . . . . .
46
Method . . . . . . . . . . . . . . . . . . . . . . . . . . .
48
2.2.1
Overview . . . . . . . . . . . . . . . . . . . . . .
48
2.2.2
Common Features . . . . . . . . . . . . . . . . . .
48
2.2.3
Study 1 . . . . . . . . . . . . . . . . . . . . . . .
51
2.2.4
Study 2 . . . . . . . . . . . . . . . . . . . . . . .
59
2.2.5
Study 3 . . . . . . . . . . . . . . . . . . . . . . .
67
2.2.6
Analysis Plan . . . . . . . . . . . . . . . . . . . .
77
xi
xii
CONTENTS
2.3
2.4
2.5
2.6
Study 1 Results . . . . . . . . . . . . . . . . . . . . . . .
79
2.3.1
Practice and Performance . . . . . . . . . . . . .
79
2.3.2
Practice and Strategy . . . . . . . . . . . . . . . .
86
2.3.3
Practice, Strategy, and Performance . . . . . . . .
92
Study 2 Results . . . . . . . . . . . . . . . . . . . . . . .
92
2.4.1
Practice and Performance . . . . . . . . . . . . .
92
2.4.2
Practice and Strategy . . . . . . . . . . . . . . . .
100
2.4.3
Practice, Strategy, and Performance . . . . . . . .
106
Study 3 Results . . . . . . . . . . . . . . . . . . . . . . .
106
2.5.1
Practice and Performance . . . . . . . . . . . . .
106
2.5.2
Practice and Strategy . . . . . . . . . . . . . . . .
112
2.5.3
Practice, Strategy, and Performance . . . . . . . .
117
Discussion . . . . . . . . . . . . . . . . . . . . . . . . . .
117
2.6.1
Overview . . . . . . . . . . . . . . . . . . . . . .
117
2.6.2
Practice and Performance . . . . . . . . . . . . .
117
2.6.3
Practice and Strategy . . . . . . . . . . . . . . . .
123
2.6.4
Practice, Strategy, and Performance . . . . . . . .
130
2.6.5
Conclusion . . . . . . . . . . . . . . . . . . . . . .
133
3 Individual Differences
3.1
3.2
135
Introduction . . . . . . . . . . . . . . . . . . . . . . . . .
136
3.1.1
Overview . . . . . . . . . . . . . . . . . . . . . .
136
3.1.2
Theories and Frameworks . . . . . . . . . . . . .
137
3.1.3
Predictors of Performance . . . . . . . . . . . . .
139
3.1.4
Predictors of Strategy Sophistication . . . . . . .
143
3.1.5
Strategy Sophistication and Performance . . . . .
145
3.1.6
The Current Studies . . . . . . . . . . . . . . . .
146
Method . . . . . . . . . . . . . . . . . . . . . . . . . . .
146
3.2.1
Participants . . . . . . . . . . . . . . . . . . . . .
146
3.2.2
Predictor Measures . . . . . . . . . . . . . . . . .
147
CONTENTS
3.3
3.4
3.5
3.6
xiii
3.2.3
Criterion Measures . . . . . . . . . . . . . . . . .
154
3.2.4
Analysis Plan . . . . . . . . . . . . . . . . . . . .
154
Study 1 Results . . . . . . . . . . . . . . . . . . . . . . .
155
3.3.1
Preliminary Analyses . . . . . . . . . . . . . . . .
155
3.3.2
Hypothesis Testing . . . . . . . . . . . . . . . . .
156
Study 2 Results . . . . . . . . . . . . . . . . . . . . . . .
159
3.4.1
Preliminary Analyses . . . . . . . . . . . . . . . .
159
3.4.2
Hypothesis Testing . . . . . . . . . . . . . . . . .
159
Study 3 Results . . . . . . . . . . . . . . . . . . . . . . .
162
3.5.1
Preliminary Analyses . . . . . . . . . . . . . . . .
162
3.5.2
Hypothesis Testing . . . . . . . . . . . . . . . . .
162
Discussion . . . . . . . . . . . . . . . . . . . . . . . . . .
164
3.6.1
Overview . . . . . . . . . . . . . . . . . . . . . .
164
3.6.2
Performance . . . . . . . . . . . . . . . . . . . . .
164
3.6.3
Mediational Model . . . . . . . . . . . . . . . . .
167
3.6.4
Conclusion . . . . . . . . . . . . . . . . . . . . . .
168
4 Instructed Strategy Shift
4.1
4.2
4.3
171
Introduction . . . . . . . . . . . . . . . . . . . . . . . . .
172
4.1.1
Overview . . . . . . . . . . . . . . . . . . . . . .
172
4.1.2
Strategy . . . . . . . . . . . . . . . . . . . . . . .
173
4.1.3
Performance . . . . . . . . . . . . . . . . . . . . .
176
4.1.4
The Current Study . . . . . . . . . . . . . . . . .
177
Method . . . . . . . . . . . . . . . . . . . . . . . . . . .
177
4.2.1
Design . . . . . . . . . . . . . . . . . . . . . . . .
177
4.2.2
Participants . . . . . . . . . . . . . . . . . . . . .
178
4.2.3
Mid-Practice Condition . . . . . . . . . . . . . . .
178
Results . . . . . . . . . . . . . . . . . . . . . . . . . . . .
180
4.3.1
Practice and Strategy . . . . . . . . . . . . . . . .
180
4.3.2
Practice and Performance . . . . . . . . . . . . .
185
xiv
CONTENTS
4.4
Discussion . . . . . . . . . . . . . . . . . . . . . . . . . .
192
4.4.1
Overview . . . . . . . . . . . . . . . . . . . . . .
192
4.4.2
Strategy . . . . . . . . . . . . . . . . . . . . . . .
192
4.4.3
Performance . . . . . . . . . . . . . . . . . . . . .
195
4.4.4
Comparison with Previous Research . . . . . . . .
197
4.4.5
Conclusion . . . . . . . . . . . . . . . . . . . . . .
200
5 General Discussion
203
5.1
Overview . . . . . . . . . . . . . . . . . . . . . . . . . . .
203
5.2
General Themes . . . . . . . . . . . . . . . . . . . . . . .
208
5.2.1
Biasing Effect of Group-Level Analyses . . . . . .
208
5.2.2
Trial-Level Strategy Measurement . . . . . . . . .
209
5.2.3
Individual Differences . . . . . . . . . . . . . . . .
210
5.2.4
Continuity and Discontinuity . . . . . . . . . . .
210
Limitations . . . . . . . . . . . . . . . . . . . . . . . . .
211
5.3.1
Overview . . . . . . . . . . . . . . . . . . . . . .
211
5.3.2
Sample . . . . . . . . . . . . . . . . . . . . . . . .
211
5.3.3
Task . . . . . . . . . . . . . . . . . . . . . . . . .
212
5.3.4
Context . . . . . . . . . . . . . . . . . . . . . . .
213
Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . .
214
5.3
5.4
References
215
A Study 1: Additional Materials
235
A.1 Overview . . . . . . . . . . . . . . . . . . . . . . . . . . .
235
A.2 Materials
. . . . . . . . . . . . . . . . . . . . . . . . . .
235
A.2.1 Study 1 Prior Experience Scale . . . . . . . . . .
235
A.3 Text Editing Task . . . . . . . . . . . . . . . . . . . . . .
238
A.3.1 Additional Screenshots of Computer Interface . .
238
A.3.2 Text Editing Instructions . . . . . . . . . . . . . .
239
A.3.3 Accuracy Calculation . . . . . . . . . . . . . . . .
240
CONTENTS
xv
A.4 Study 1 Data Cleaning . . . . . . . . . . . . . . . . . . .
B Study 2: Additional Materials
B.1 Materials
240
245
. . . . . . . . . . . . . . . . . . . . . . . . . .
245
B.1.1 IPIP Personality Test . . . . . . . . . . . . . . . .
245
B.1.2 Typing Test . . . . . . . . . . . . . . . . . . . . .
248
B.2 Text Editing Standardisation . . . . . . . . . . . . . . .
251
B.3 Text Editing Task Instructions . . . . . . . . . . . . . . .
251
B.4 Study 2 Data Cleaning . . . . . . . . . . . . . . . . . . .
255
C Study 3: Additional Materials
C.1 Materials
257
. . . . . . . . . . . . . . . . . . . . . . . . . .
257
C.1.1 Prior Experience Questionnaire . . . . . . . . . .
257
C.1.2 IPIP Big 5 Personality Measure . . . . . . . . . .
262
C.2 Text Editing Task . . . . . . . . . . . . . . . . . . . . . .
265
C.2.1 Text Editing Task Initial Instructions . . . . . . .
265
C.2.2 Additional Controls . . . . . . . . . . . . . . . . .
268
C.2.3 Training Condition Instructions . . . . . . . . . .
269
C.2.4 Text Editing Task Problems . . . . . . . . . . . .
272
xvi
CONTENTS
List of Tables
2.1
Main Design Differences between the Three Studies. . . .
2.2
Study 1 Original and Corrected Text for the Text Editing
49
Task . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
55
2.3
Study 1 List of Text Editing Task Requirements . . . . .
56
2.4
Study 1 List of Shortcut Keys Provided to Participants .
57
2.5
Study 1 Experimental Protocol . . . . . . . . . . . . . .
58
2.6
Study 2 Corrected Text for Text Editing Task . . . . . .
60
2.7
Study 2 Text Editing Trial Types . . . . . . . . . . . . .
61
2.8
Study 2 Text Editing Tips . . . . . . . . . . . . . . . . .
64
2.9
Study 2 Experimental Protocol . . . . . . . . . . . . . .
65
2.10 Study 3 Example of Editing Requirements . . . . . . . .
70
2.11 Study 3 Experimental Protocol . . . . . . . . . . . . . .
74
2.12 Study 3 Keys Practiced in Initial Training Trials . . . . .
75
2.13 Study 1 Fit Statistics for Models of the Effect of Trial on
Task Completion Time at the Group-Level . . . . . . . .
80
2.14 Study 1 Mean Model Fit Statistics for the Effect of Trial
on Task Completion Time at the Individual-Level . . . .
82
2.15 Study 1 Fit Statistics for Models of the Effect of Trial on
Strategy Sophistication at the Group-Level . . . . . . . .
87
2.16 Study 1 Mean Model Fit Statistics for the Effect of Trial
on Strategy Sophistication at the Individual-Level . . . .
xvii
87
xviii
LIST OF TABLES
2.17 Study 2 Fit Statistics for Models of the Effect of Block on
Task Completion Time at the Group-Level . . . . . . . .
92
2.18 Study 2 Mean Model Fit Statistics for the Effect of Block
on Task Completion Time at the Individual-Level . . . .
98
2.19 Study 2 Fit Statistics for Models of the Effect of Block on
Strategy Sophistication at the Group-Level. . . . . . . .
100
2.20 Study 2 Mean Model Fit Statistics for the Effect of Block
on Strategy Sophistication at the Individual-Level . . . .
101
2.21 Study 3 Fit Statistics for Models of the Effect of Block on
Task Completion Time at the Group-Level . . . . . . . .
106
2.22 Study 3 Mean Model Fit Statistics for the Effect of Block
on Trial Completion Time at the Individual-Level . . . .
110
2.23 Study 3 Fit Statistics for Models of the Effect of Block on
Strategy Sophistication at the Group-Level . . . . . . . .
112
2.24 Study 3 Mean Model Fit Statistics of the Effect of Block
on Strategy Sophistication at the Individual-Level . . . .
113
3.1
Summary of Ability Tests Used . . . . . . . . . . . . . .
151
3.2
Study 1 Factor Loadings and Intercorrelations among
Ability Tests . . . . . . . . . . . . . . . . . . . . . . . . .
3.3
156
Study 1 Descriptive Statistics and Intercorrelations for
Task Performance, Demographics, Ability, and Prior Experience Variables . . . . . . . . . . . . . . . . . . . . . .
3.4
158
Study 2 Descriptive Statistics and Intercorrelations for
Task Performance, Demographic, Ability, and Prior Experience Variables . . . . . . . . . . . . . . . . . . . . . .
3.5
161
Study 3 Descriptive Statistics and Intercorrelations for
Task Performance, Demographic, Prior Experience, and
Personality Variables . . . . . . . . . . . . . . . . . . . .
163
LIST OF TABLES
4.1
xix
Change in Task Completion Time Pre- and Post- Training
in the Training Condition . . . . . . . . . . . . . . . . .
187
Summary of Support for Hypotheses . . . . . . . . . . .
205
A.1 Study 1 Trial Type Classification for Validity Purposes. .
241
5.1
A.2 Study 1 Task Completion Time Multiplier Associated with
a Given Trial Accuracy out of 12. . . . . . . . . . . . . .
243
B.1 Study 2 IPIP Big 5 Personality Measure . . . . . . . . .
245
C.1 Study 3 Items Used in IPIP Big 5 Personality Measure .
264
C.2 Study 3 Text Editing Task Items . . . . . . . . . . . . .
272
xx
LIST OF TABLES
List of Figures
2.1
Study 1 text editing task screenshot in Active Mode. . .
53
2.2
Study 1 text editing task screenshot in Stop Mode. . . .
54
2.3
Study 2 text editing task screenshot . . . . . . . . . . . .
63
2.4
Study 3 text editing task screenshot. . . . . . . . . . . .
69
2.5
Study 3 text editing task screenshot at the end of a block.
72
2.6
Study 1 task completion time by trial at the group-level.
83
2.7
Study 1 task completion time by trial at the individual-level. 84
2.8
Study 1 examples of model fits of the relationship between
2.9
practice and task completion time at the individual-level.
85
Study 1 strategy sophistication by trial at the group-level.
89
2.10 Study 1 strategy sophistication by trial at the individuallevel. . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
90
2.11 Study 1 examples of model fits of the effect of trial on
strategy sophistication at the individual-level. . . . . . .
91
2.12 Study 2 task completion time by block at the group-level.
93
2.13 Study 2 task completion time by block at the individuallevel — Part 1 of 2. . . . . . . . . . . . . . . . . . . . . .
95
2.14 Study 2 task completion time by block at the individuallevel — Part 2 of 2. . . . . . . . . . . . . . . . . . . . . .
96
2.15 Study 2 examples of model fits of the relationship between
block and task completion time. . . . . . . . . . . . . . .
99
2.16 Study 2 strategy sophistication by trial at the group-level. 102
xxi
xxii
LIST OF FIGURES
2.17 Study 2 strategy sophistication by block at the individuallevel — Part 1 of 2. . . . . . . . . . . . . . . . . . . . . .
103
2.18 Study 2 strategy sophistication by block at the individuallevel — Part 2 of 2. . . . . . . . . . . . . . . . . . . . . .
104
2.19 Study 2 examples of model fits of the effect of block on
strategy sophistication at the individual-level. . . . . . .
105
2.20 Study 3 task completion time by block at the group-level. 107
2.21 Study 3 task completion time by block at the individuallevel. . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
108
2.22 Study 3 examples of model fits of the effect of block on
task completion time at the individual-level. . . . . . . .
111
2.23 Study 3 strategy sophistication by block at the group-level. 114
2.24 Study 3 strategy sophistication by block at the individuallevel. . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
115
2.25 Study 3 examples of model fits of the effect of trial on
strategy sophistication at the individual-level. . . . . . .
4.1
Strategy sophistication by block and condition at the
group-level. . . . . . . . . . . . . . . . . . . . . . . . . .
4.2
182
Strategy sophistication by block in the Training condition
at the individual-level. . . . . . . . . . . . . . . . . . . .
4.3
116
184
Task completion time by block and condition at the grouplevel. . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
188
4.4
Task completion time by block at the individual-level. . .
189
4.5
Scatter plot showing percentage increase in mean trial reaction time by increase in strategy sophistication from
blocks immediately before training (i.e., 12–15) with those
immediately after training (i.e., 16–18). . . . . . . . . . .
191
A.1 Study 1 text editing task dialog box displayed at the end
of a set of trials. . . . . . . . . . . . . . . . . . . . . . . .
238
LIST OF FIGURES
xxiii
A.2 Study 1 text editing task dialog box displayed at the end
of all trials on the text editing task. . . . . . . . . . . . .
238
xxiv
LIST OF FIGURES
Chapter 1
Introduction
Contents
1.1
1.1
Research Problems and Aims . . . . . . . . .
1
1.1.1
Practice, Strategy, and Performance . . . . .
2
1.1.2
Individual Differences . . . . . . . . . . . . .
4
1.1.3
Instructed Strategy Shift . . . . . . . . . . .
4
1.2
Scope and Study Design . . . . . . . . . . . .
5
1.3
Thesis Structure . . . . . . . . . . . . . . . . .
6
Research Problems and Aims
The functional form of the learning curve and the mechanisms that cause
it have been the subject of over 100 years of scientific research (e.g.,
Bahrick, Fitts, & Briggs, 1957; Bryan & Harter, 1899; Heathcote et
al., 2000; Lane, 1987; Newell & Rosenbloom, 1981; Snoddy, 1926). Despite this long history, several fundamental gaps in understanding remain.
This thesis aims to contribute empirical and theoretical knowledge with
the intent of closing gaps in understanding regarding three interrelated
topics: (a) the relationships between practice, strategy use, and performance, (b) individual differences in strategy use and performance, and
1
2
CHAPTER 1. INTRODUCTION
(c) the effect of instructed versus self-initiated strategy shifts on strategy
use and performance.
This chapter first introduces the motivating problem behind each of
these topics culminating in three main aims. This is followed by an
outline of the scope and method used to achieve these aims. Finally, the
structure of the thesis is set out.
1.1.1
Practice, Strategy, and Performance
In recent years researchers modelling the learning curve in psychology
have focused on three types of functions: power, exponential, and discontinuous. Newell and Rosenbloom (1981) analysed the relationship
between practice and task completion time in existing datasets across a
range of tasks, often at the group-level. They concluded that the power
function provided such a consistently good fit that the relationship should
be labelled the Power Law of Practice. Many influential researchers subsequently accepted the Power Law (Blessing & Anderson, 1996; F. J. Lee
& Anderson, 2001; Logan, 1992, 1988) often considering it a requirement
for any formal model of learning.
More recently, Heathcote et al. (2000) suggested that the conclusion
drawn by Newell and Rosenbloom (1981) was an artefact of the longknown (e.g., Estes, 1956; Lane, 1987), but often ignored, biasing effects
of group-level analysis. When Heathcote et al. (2000) modelled data at
the individual-level (i.e., the psychologically meaningful level) the exponential function tended to provide superior fit. A further challenge
came from Haider and Frensch (2002) who used simulation, theoretical
arguments, and data, albeit from only one study, to suggest that grouplevel analysis can and does smooth over discontinuities in individual-level
learning curves. Haider and Frensch (2002) argued that discontinuities
would be expected when individuals abruptly shift to a better strategy.
1.1. RESEARCH PROBLEMS AND AIMS
3
Interestingly, the tasks analysed by Heathcote et al. (2000) did not appear to allow for such strategy shifts.
A parallel line of research has examined the relationship between practice and strategy use, where strategy is typically defined as an optional
method of task completion. Much of this research has concerned either
children (e.g., Crowley & Siegler, 1999; Lemaire & Siegler, 1995; Robinson, 2001; Siegler, 1987, 1988a, 1996, 2006) or tasks involving algorithm–
retrieval strategy shift (e.g., Delaney et al., 1998; Logan, 1992, 1988;
Palmeri, 1999; Rickard, 1997, 1999). In contrast, only a few studies
have looked at the shift from simple to sophisticated strategy use in an
adult population. Fewer again have measured and modelled individuallevel strategy use with practice (for an exception, see John & Lallement,
1997). Nonetheless, a body of research has emerged showing that: (a)
most tasks have a wide range of possible strategies (e.g., Siegler & Crowley, 1996; Siegler & Shipley, 1995), (b) some strategies are more effective
than others (e.g., Delaney et al., 1998; F. J. Lee, Anderson, & Matessa,
1995; John & Lallement, 1997), (c) more effective strategies tend to be
used with practice (e.g., Delaney et al., 1998; John & Lallement, 1997),
and (d) strategy shift, although commonly gradual, is sometimes abrupt
(e.g., Delaney et al., 1998; John & Lallement, 1997; Yechiam, Erev, &
Parush, 2004).
An accurate model of the relationship between practice and performance is essential to both basic science and applied fields. However,
existing research has not yet determined the functional form of the relationship between practice and performance. Similarly more research is
required to characterise the functional form of the relationship between
practice and strategy use at the individual-level. Finally, existing models
have inadequately reconciled how abrupt strategy shifts can coexist with
continuous models of the learning curve. This led to the first aim of this
thesis:
4
CHAPTER 1. INTRODUCTION
Aim 1 Model the relationship between practice, strategy use, and performance at the individual-level.
1.1.2
Individual Differences
Another way of understanding the relationship between practice, strategy
use, and performance is from an individual differences perspective. Several researchers (e.g., Ackerman, 1987, 1988; Fleishman, 1972; Taatgen,
2001) have attempted to understand learning processes from an individual differences perspective. However, existing research has rarely looked
at individual differences in strategy use and performance simultaneously.
More data is needed on the relationship between predictors such as ability, personality, prior experience, age, and gender and criterion measures,
such as strategy use and task performance. This led to the second aim
of this thesis:
Aim 2 Assess the role of individual differences in predicting strategy sophistication and performance.
1.1.3
Instructed Strategy Shift
One way of understanding the effect of strategy shift on performance involves examining differences between self-initiated and instructed strategy shifts. Taken together several previous studies (e.g., Delaney et al.,
1998; Yechiam et al., 2004) suggest that instructed strategy shifts may
be more abrupt and are more likely to result in a drop in performance
than self-initiated strategy shifts. More research is needed to clarify these
findings. This led to the third and final aim of this thesis:
Aim 3 Model differences between self-initiated and instructed strategy
shifts on the relationship between practice, strategy use, and performance.
1.2. SCOPE AND STUDY DESIGN
1.2
5
Scope and Study Design
The above aims are interconnected. Measures of individual differences
help to explain the variability that results when practice, strategy use,
and performance are studied at the individual-level. The distinction
between self-initiated and instructed strategy shifts helps to explain the
conditions under which strategy shifts lead to discontinuities in the learning curve. While Aim 1 is given the greatest attention in this thesis, Aims
2 and 3 complement and extend Aim 1.
The scope of the aims in this thesis is narrowed in terms of sample,
task, and context. While generalisation is a relative concept, this thesis
aimed to be most relevant to understanding skill acquisition in relation
to (a) adults, (b) tasks of moderate complexity, i.e., more complex than
many tasks used in cognitive psychology but less complex than many realworld jobs or tasks studied in the expertise literature, (c) tasks with both
psychomotor and cognitive components, (d) tasks where task completion
time is the main measure of performance, and (e) tasks that allow for
a shift from simple to sophisticated strategy use (i.e., not algorithmretrieval shift). This focus captures many tasks common to educational,
work, and personal settings, and is particularly representative of tasks
involving interaction with computers.
To achieve these three aims within the defined scope, three studies
were conducted. Each study involved participants completing a battery
of individual difference measures, followed by a period of practice on a
criterion text editing task. A particular advantage of the text editing
task was that it enabled trial-level measurement of strategy use and task
completion time. The three studies involved moderately large sample
sizes in order to model variation in the relationships between practice,
strategy use, and performance at the individual-level, and obtain robust
estimates of predictor–criterion correlations. Data were analysed at the
6
CHAPTER 1. INTRODUCTION
group- and individual-levels. Graphical and model fitting approaches
were applied in order to compare the relative fit of various candidate
models of the relationship between practice and task completion time
and practice and strategy sophistication.
1.3
Thesis Structure
The remainder of this thesis is composed of three topic chapters and
a general discussion chapter. Chapters 2 to 4 are devoted to Aims 1
to 3 respectively. Each of these three topic chapters contain a literature
review, hypotheses, results, and a discussion related to the respective aim
of the chapter. The topic chapters are based on the same three studies
with the exception of Chapter 4 (Aim 3) which is based only on Study
3. Thus, to minimise redundancy, most of the method is described in
Chapter 2 with methodological details specific to Aims 2 and 3 described
in their respective chapters. Chapter 5 provides a general discussion
where links between the three topics are provided, and limitations and
possibilities for future research are discussed.
Chapter 2
Practice, Strategy, and
Performance
Contents
2.1
2.2
2.3
Introduction
. . . . . . . . . . . . . . . . . . .
8
2.1.1
Overview . . . . . . . . . . . . . . . . . . . .
8
2.1.2
Practice and Performance . . . . . . . . . . .
9
2.1.3
Practice and Strategy . . . . . . . . . . . . .
21
2.1.4
Strategy and Performance . . . . . . . . . . .
38
2.1.5
The Current Studies . . . . . . . . . . . . . .
46
Method . . . . . . . . . . . . . . . . . . . . . .
48
2.2.1
Overview . . . . . . . . . . . . . . . . . . . .
48
2.2.2
Common Features . . . . . . . . . . . . . . .
48
2.2.3
Study 1 . . . . . . . . . . . . . . . . . . . . .
51
2.2.4
Study 2 . . . . . . . . . . . . . . . . . . . . .
59
2.2.5
Study 3 . . . . . . . . . . . . . . . . . . . . .
67
2.2.6
Analysis Plan . . . . . . . . . . . . . . . . . .
77
Study 1 Results . . . . . . . . . . . . . . . . .
79
7
8
CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
2.4
2.5
2.6
2.1
2.1.1
2.3.1
Practice and Performance . . . . . . . . . . .
79
2.3.2
Practice and Strategy . . . . . . . . . . . . .
86
2.3.3
Practice, Strategy, and Performance . . . . .
92
Study 2 Results . . . . . . . . . . . . . . . . .
92
2.4.1
Practice and Performance . . . . . . . . . . .
92
2.4.2
Practice and Strategy . . . . . . . . . . . . . 100
2.4.3
Practice, Strategy, and Performance . . . . . 106
Study 3 Results . . . . . . . . . . . . . . . . . 106
2.5.1
Practice and Performance . . . . . . . . . . . 106
2.5.2
Practice and Strategy . . . . . . . . . . . . . 112
2.5.3
Practice, Strategy, and Performance . . . . . 117
Discussion . . . . . . . . . . . . . . . . . . . . . 117
2.6.1
Overview . . . . . . . . . . . . . . . . . . . . 117
2.6.2
Practice and Performance . . . . . . . . . . . 117
2.6.3
Practice and Strategy . . . . . . . . . . . . . 123
2.6.4
Practice, Strategy, and Performance . . . . . 130
2.6.5
Conclusion . . . . . . . . . . . . . . . . . . . 133
Introduction
Overview
Research on the relationship between practice and performance has a
long history (e.g., Bryan & Harter, 1899; McGeoch, 1927). Throughout this long history, a diverse range of tasks have been studied, including Morse Code transmission (Bryan & Harter, 1899), cigar rolling
(Crossman, 1959), inverted reading (Kolers, 1975), alphabet arithmetic
(Logan, 1988), numerosity judgements (Palmeri, 1997), playing computer
2.1. INTRODUCTION
9
games (Donchin, 1995), and controlling air traffic (F. J. Lee & Anderson,
2001). This history has also been recorded in various quantitative (e.g.,
Ackerman, 1987; Heathcote et al., 2000; Newell & Rosenbloom, 1981),
and narrative reviews (e.g., Fitts & Posner, 1967; Lane, 1987; LanganFox, Armstrong, Balvin, & Anglim, 2002; Proctor & Dutta, 1995; Rosenbaum, Carlson, & Gilmore, 2000; Speelman & Kirsner, 2005) as well as
more specific reviews of cognitive skill acquisition (VanLehn, 1996), motor skill acquisition (Adams, 1987), and the differences between cognitive
and psychomotor skill acquisition (Rosenbaum et al., 2000).
The following section reviews relevant literature. The review starts by
examining models of the relationship between practice and task completion time along with empirical evaluations. It then summarises research
on the relationship between practice and strategy use. Finally, the theory
and measurement of strategy relative effectiveness is discussed, particularly as it relates to practice. Hypotheses are proposed and justified
throughout.
2.1.2
Practice and Performance
Overview
Influential quantitative syntheses of learning curve datasets have been
conducted by Mazur and Hastie (1978), Newell and Rosenbloom (1981),
and Heathcote et al. (2000). A book length treatment of research on the
learning curve can be found in Lane (1987). Before discussing models of
the relationship between practice and performance, the terms ‘practice’
and ‘performance’ are first defined.
Practice can refer to many activities. Within the skill acquisition
literature, practice tends to refer to the effect of repeated task performance. Such a definition is distinct from the everyday use of the term
which often includes adaptive training and external instruction. In skill
10 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
acquisition studies, practice is commonly operationalised as either the
amount of time spent practicing the task or the number of practice trials, where little has been made in the literature of this distinction. The
focus on repetition leads to research designs where initial instructions are
provided but typically no additional instructions are given once practice
is initiated.
Task performance has been defined and measured in many different
ways. At an abstract level task performance is any evaluative attribute
of task execution. More typically, task performance is operationalised as
completion time, accuracy, quality, or attainment. Attainment measures
are merely a multiple of the inverse of completion time. As mentioned
in the introduction, this thesis focuses on tasks where performance is
operationalised as task completion time. This includes both reaction
time when responding to a stimuli, and also task completion time on
tasks involving multiple steps. Task completion time is applicable to
a large number of tasks where it is speed of performance which is the
main attribute that distinguishes levels of performance. Focus on task
completion time also provides a link with the general chronometric aims
of cognitive science.
Together, the relationship between practice and performance is known
as the learning curve, reflecting the seemingly lawful improvement in performance that occurs with practice. Such a relationship can and has been
modelled at both the individual-level and the group-level. Individual-level
modelling occurs when the raw data is the performance over practice for
an individual. Individual-level parameter estimation may be done separately for each individual (for a good example, see Heathcote et al.,
2000) or parameters can be estimated simultaneously using approaches
such as nonlinear multilevel modelling (for a review see Cudeck & Harring, 2007). Group-level modelling occurs when raw data is averaged
over participants, typically using the mean, for a given trial or block of
2.1. INTRODUCTION
11
trials.
The subsequent section focuses on models of the learning curve where,
in line with the scope of this thesis, practice is operationalised as amount
of task repetition, and performance is operationalised as task completion
time. After summarising proposed models of the learning curve, the
empirical evidence for these models is critically reviewed.
Quantitative Models
Learning curves have been studied from multiple perspectives. In addition to the extensive literature in psychology, researchers in industrial
engineering have frequently modelled learning curves in industrial settings (for a review see Lane, 1987). While researchers in industrial
engineering have developed various models of the learning curve, such
models have little relevance to understanding psychological mechanisms
of learning. Such research is analysed at the group-level and typically
confounds employee learning with other sources of improvement such as
technologogical innovation. Thus, while group-level modelling has many
important applications, such as when predictions are desired about the
future productivity of a group, it is the individual-level which is most
relevant for understanding psychological processes.
Also, some researchers have used polynomial models of the learning curve. However, such models have little relevance to the present
discussion. A polynomial of a sufficiently high degree can be used to
model any empirical function. Such models are sometimes adopted for
convenience of estimation within latent growth curve frameworks (e.g.,
Lang & Bliese, 2009; Morrison & Brantner, 1992; Voelkle, Wittmann, &
Ackerman, 2006). However, such models use linear approximations for
processes that can be more parsimoniously represented by a nonlinear
modelling framework. Polynomial models do not capture the asymptotic
12 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
behaviour of learning curves and make inaccurate predictions outside the
range of data (for discussion of the benefits of nonlinear modellings, see
Bates & Watts, 1988; Cudeck & Harring, 2007; Kelley, 2009). They also
require more parameters to obtain equivalent levels of fit relative to truly
nonlinear functions.
As described by Lane (1987) and others, many quantitative models
of the learning curve have been proposed. However, as stated in the
introduction, this thesis focuses mainly on power, exponential, and discontinuous functions.
When presenting functions in this thesis, notation is used that is
consistent with Huet, Bouvier, Poursat, and Jolivet (2004). The power
function (first proposed by Snoddy (1926), and rediscovered by De Jong
(1957)) in its three parameter form is
fP 3 (x, θ) = θ1 x−θ2 + θ3 ,
(2.1)
and the three parameter exponential function is
fE3 (x, θ) = θ1 exp(−θ2 (x − 1)) + θ3 ,
(2.2)
where f (x, θ) is the expected task completion time for trial x and the
subscripts P 3 and E3 denote the three parameter power and exponential functions respectively. The amount of practice is denoted by x and is
typically represented by an integer starting from 1 for the power function
and 0 for the exponential function. This thesis uses x − 1 in the exponential function (i.e., Equation 2.2) in order to model all learning curves
with trial number starting from 1. The asymptotic level of expected
performance is captured by θ3 , which is predicted to be approached as
amount of practice approaches infinity. Amount of learning is the difference between expected performance on the first trial (i.e., x = 1) and
2.1. INTRODUCTION
13
asymptotic performance (i.e., θ3 ), and is captured by θ1 . The shape of
the learning curve is determined by θ2 .
A two parameter power function can also be obtained by fixing the
asymptote θ3 to zero. While having fewer parameters is desirable, it
makes the implausible prediction that after infinite practice, task completion time would be zero. Thus, while the function often predicts the
observed data well, it makes unreasonable predictions outside the range
of the data, which is one of the key motivations for nonlinear modelling.
The power function sometimes includes a fourth parameter,
fP 4 (x, θ) = θ1 (x + θ4 )−θ2 + θ3 ,
(2.3)
to represent prior learning (introduced by Seibel, 1963). This parameter
has typically been fixed to zero by subsequent researchers (e.g., F. J. Lee
& Anderson, 2001). Also, Heathcote et al. (2000) have proposed a four
parameter function called the APEX function which incorporates aspects
of both the three parameter power and exponential functions,
fAP (x, θ) = θ1 exp(−θ2 x)x−θ4 + θ3 .
(2.4)
Further discussion of mathematical properties of the above functions can
be found in Newell and Rosenbloom (1981) and Heathcote et al. (2000).
The preceding functions are all monotonically decreasing and monotonically decelerating. This contrasts with a set of functions proposed by
several researchers that involve discontinuities. Discontinuous functions
can capture temporary plateaus in learning (e.g., Bryan & Harter, 1899)
and abrupt improvements in performance (e.g., Haider & Frensch, 2002;
Kolers & Duchnicky, 1985). Discontinuities can occur in performance
itself or in the rate of learning (i.e., the first derivative). While learning curve researchers in psychology have rarely formalised discontinuous
14 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
models, such models are often discussed in statistics and econometrics
(e.g., Zeileis, Leisch, Hornik, & Kleiber, 2002; Zeileis, Kleiber, Krmer, &
Hornik, 2003). Discontinuous models can be seen as piecewise functions
that have one or more breakpoints and where the number of segments is
equal to one more than the number of breakpoints. Each segment has
its own function. The functions for each segment may be of the same or
different functional type. The timing of the breakpoints is also typically
a parameter.
It would appear that psychological researchers have rarely formally
analysed discontinuities. One exception, Beem (1995), critical of informal methods (e.g., Kyllonen, Lohman, & Woltz, 1984), developed software for a limited form of breakpoint analysis and applied it to several
psychological datasets (Ippel & Beem, 1987; Luwel, Beem, Onghena, &
Verschaffel, 2001; Luwel, Verschaffel, Onghena, & De Corte, 2003; Verschaffel, De Corte, Lamote, & Dherdt, 1998). Seber and Wild (2003) discuss modelling discontinuities in nonlinear functions. Reviews of software
(Andersen, Carstensen, Hernndez-Garca, & Duarte, 2009), an outline of
the statistical theory (Chapter 9 in Seber & Wild, 2003), and a general
review (Hansen, 2001; Khodadadi & Asgharian, 2008), are also available,
along with several influential papers on the topic (Andrews, 1993; Brown,
Durbin, & Evans, 1975; Chu, Stinchcombe, & White, 1996; W. Kramer,
Ploberger, & Alt, 1988; Ploberger & Kramer, 1992). In the absence
of formalised discontinuous models of the learning curve, the following
paragraphs attempt to extract plausible discontinuous models for model
testing purposes.
The most basic form of a discontinuous learning model involves two
2.1. INTRODUCTION
15
constants where the constant changes following a breakpoint,
fCnCn (x, θ) =


θ1
if x ≤ θ3

θ2
if x > θ3
.
(2.5)
Such a model could represent an idealised version of insight learning.
Haider and Frensch (2002) discuss it implicitly as a hypothetical function potentially relevant to performance following abrupt strategy shift
at the individual-level. Given that existing research strongly suggests
that improvements in performance with practice are more gradual, the
model is unlikely to provide a good fit to data. However, it does provide
a baseline comparison for evaluating more sophisticated discontinuous
models.
More plausible discontinuous models combine a breakpoint with
monotonic improvement and deceleration typical of most learning curves.
Such models are suggested, but not formalised, in the work of Delaney et
al. (1998). In an attempt to formalise such discontinuous models whilst
also keeping the number of parameters to a reasonable level, the following
two models are proposed. The first model combines two two-parameter
power functions separated by a breakpoint
fP2BP2 (x, θ) =


θ1 x−θ2
if x ≤ θ5

θ3 x−θ4
if x > θ5
.
(2.6)
The second such model is a three parameter exponential model with a
different asymptotic parameter based on the breakpoint
fE3B (x, θ) =


θ1 exp(−θ2 x) + θ3
if x ≤ θ5

θ1 exp(−θ2 x) + θ4
if x > θ5
.
(2.7)
All three discontinuous models proposed include a single breakpoint as a
16 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
parameter. This is a simplification, given that multiple discontinuities are
possible, but arguably a useful one for facilitating parameter estimation.
All three discontinuous functions permit both abrupt improvements and
abrupt declines in performance. The P2BP2 and E3B functions can both
potentially capture, for example, the temporary drop in performance,
that might follow a strategy shift.
Empirical Evaluation
Power Functions Several early researchers proposed that the power
function (De Jong, 1957; Snoddy, 1926) was a good model of the relationship between practice and task completion time. Newell and Rosenbloom
(1981) influentially analysed many existing skill acquisition datasets
across various types of tasks (e.g., Crossman, 1959; Hirsch, 1952; Kolers, 1975; Neisser, 1963; Neves & Anderson, 1981; Seibel, 1963; Snoddy,
1926). The datasets varied in whether they were individual-level, grouplevel, or cross-sectional. Newell and Rosenbloom found that the two
parameter power function provided a reasonable fit to the obtained data
and that a four parameter power function was superior to a three parameter exponential function. They found the evidence in support of
the power function so compelling that they labelled the relationship the
Power Law of Practice.
Many subsequent researchers have accepted the Power Law of Practice and incorporated it into cognitive architectures and other models of
the learning process (for a review, see Ritter & Schooler, 2002). These
include Logan’s Instance Theory of Automaticity (Logan, 1992), Anderson and colleagues’ ACT-R (Anderson et al., 2004), Lee and Anderson’s
(2001) model of subtasks, and Delaney et al.’s (1998) model of learning
within strategies. Subsequent researchers have typically used the three
parameter function to model the learning curve, often at the group-level.
2.1. INTRODUCTION
17
Several reasons for not including the fourth parameter exist (see
Heathcote et al., 2000). First, the fourth parameter is meant to represent prior practice. Thus, it arguably should not be a parameter that
is fit to the data; it should be measured. However, measurement of prior
practice is often not possible. Second, many studies are designed with
novel tasks, such that prior practice should be zero. Third, a four parameter model may sometimes suffer from issues of estimation particularly in
noisy individual-level data. Fourth, the three parameter power function
often performs quite well at the group-level.
Exponential Function More recently several researchers (i.e., Heathcote et al., 2000; Haider & Frensch, 2002; Lacey, 2007; Myung, Kim,
& Pitt, 2000) have challenged the validity of the Power Law of Practice. These researchers argued that group-level analyses provide a biased
model of the individual-level, and that it is the individual-level which is
psychologically meaningful. The general bias of group-level analyses has
long been known in the learning curve literature (see Estes, 1956), but
it is often not mentioned by researchers performing group-level analyses.
The critique of group-level analysis however is consistent with the increasing interest in psychology in quantitative and theoretical models of
within-person change (e.g., Cudeck & Harring, 2007; Raudenbush, 2001;
Singer & Willett, 2003).
A major critique of group-level analyses was presented by Heathcote
et al. (2000) who analysed a larger number of studies than Newell and
Rosenbloom (1981).
In contrast to Newell and Rosenbloom (1981),
Heathcote and colleagues performed all analyses both at the group-level
and at the individual-level. Consistent with actual use in subsequent research, they compared the three parameter power function with the three
parameter exponential function. Consistent with Newell and Rosenbloom
(1981), Heathcote et al. found that at the group-level the power function
18 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
provided superior fit. However, at the individual-level, the exponential
function provided a better fit for a majority of individuals in a majority
of studies. Also, when four parameter models were included, the APEX
function provided better fit than the four parameter power function.
Heathcote et al. (2000) showed that the potential biasing effects of
group-level analyses applied to actual skill acquisition datasets. The
results suggest that the power law should be repealed and possibly replaced with an exponential law. Alternatively, the functional form of
the learning curve may vary between individuals with the exponential
function more often being superior to the power function. It should also
be noted that while Heathcote et al.’s analyses included a wide range of
task types (i.e., memory search, counting, mental arithmetic, alphabet
arithmetic, visual search, mental rotation, and simple motor learning),
they did not include more complex psychomotor tasks such as text editing or air traffic control. However, based on the research of Newell and
Rosenbloom (1981) and Heathcote et al. (2000), the following hypotheses
are proposed:
Hypothesis 2.1 The relationship between practice and task completion
time at the group-level is better explained by a three parameter power
function than a three parameter exponential function.
Hypothesis 2.2 The relationship between practice and task completion
time at the individual-level is better explained by a three parameter exponential function than a three parameter power function.
If both Hypothesis 2.1 and Hypothesis 2.2 are correct, then by implication, group-level models provide biased evidence of the relationship
between practice and performance at the individual-level.
Discontinuous Functions A second critique of the Power Law of
Practice comes from Haider and Frensch (2002). They argued that group-
2.1. INTRODUCTION
19
level analyses often disguised discontinuities in the learning curve at the
individual-level. If this were true, neither the power nor the exponential
function would be an appropriate model. Haider and Frensch showed
mathematically and by simulation that discontinuities in the learning
curve at the individual-level could completely disappear at the grouplevel if, as would be expected, the timing of the discontinuities varied
between individuals.
Haider and Frensch (2002) also presented empirical data from an alphabet string verification task where discontinuities consistently emerged
in individual-level learning curves. The task required participants to read
a string of letters such as “C D E F G [4] L” (p. 393) and indicate whether
after substituting the number for alphabetical letters the string was in
alphabetical order. The task was structured such that the initial letters
were always in alphabetical order. Thus, an effective strategy was for participants to ignore the initial letters and only process the letter–number–
letter triple (e.g., “G[4]L”). Graphs of individual-level data suggested
that in addition to participants getting faster with practice, most participants had a discontinuity whereby reaction time increased abruptly.
The timing of this discontinuity varied between participants. Haider and
Frensch argued that the discontinuity occurred as a result of participants
abruptly ceasing to process the initial redundant letters in the string.
Generalising these findings, Haider and Frensch (2002) proposed that
abrupt strategy shifts from less to more effective strategies would exist
on many tasks. They suggested that if a shift to a more effective strategy
was abrupt and particularly if it occurred later in practice, a discontinuity
in the learning curve would occur. They reasoned that the absence of
previously identified discontinuities was due to a reliance on group-level
analyses.
The validity of Haider and Frensch’s argument depends on the empirical frequency of discontinuities. Many researchers have analysed data
20 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
at the individual-level and have not found discontinuities (e.g., Kolers,
1975; Seibel, 1963). Similarly, Bryan and Harter (1897, 1899) found evidence for plateaus in individual-level learning curves for learning Morse
Code. However, subsequent researchers have generally failed to replicate
this finding and have suggested that plateaus are rare, often the result of
poor training, and not of fundamental theoretical interest (Keller, 1958;
Reed & Zinszer, 1943).
In summary, discontinuities sometimes occur at the individual-level.
However, prevalence rates of discontinuities, the conditions which increase prevalence, and the types of discontinuities that occur are less
clear. The infrequency of empirical reports of discontinuities may be
partially explained by group-level analyses and a lack of readily available
breakpoint modelling techniques. However, it is more likely that discontinuities occur infrequently and possibly only under special conditions.
In particular, tasks that permit an abrupt shift in strategy seem more
likely to show discontinuities. However, even then, there are several reasons to suspect that discontinuities will be rare. This led to the following
two hypotheses:
Hypothesis 2.3 Discontinuities do not occur in the relationship between
practice and task completion time at the group-level.
Hypothesis 2.4 Discontinuities occasionally occur in the relationship
between practice and task completion time at the individual-level.
If true, the above hypotheses in combination imply another biasing effect
of group-level analyses.
Conclusion
In summary, individual-level learning curves are the psychologically
meaningful level of analysis, and research suggests that the group-level
2.1. INTRODUCTION
21
provides a biased representation of the individual-level. When modelling
the individual-level, the best evidence is that a three parameter exponential function provides on average a superior fit in comparison to the
three parameter power function. This might lead us to call such a relationship the Exponential Law of Practice, but the best fitting function
seems to vary sufficiently between individuals that the term law may be
inappropriate. Also, more research is needed in order to ascertain the
conditions under which discontinuities are likely to occur. In particular,
this requires the application of quantitative modelling approaches that
are designed to identify discontinuities. It may be that tasks that permit abrupt strategy shifts from poorer to better strategies provide likely
candidates for discontinuities. Such tasks were absent from Heathcote et
al.’s (2000) review and, as previously mentioned, constitute the focus of
this thesis.
2.1.3
Practice and Strategy
Overview
The following section of the literature review focuses on the relationship
between practice and strategy use. As well as being inherently interesting, changes in strategy use are one reason to expect discontinuities
in the learning curve. Before discussing the effect of strategy shift on
performance, research on the relationship between practice and strategy
use is first critically reviewed.
Researchers have explored the relationship between practice and
strategy use from a range of perspectives including childhood learning
(Lemaire & Siegler, 1995; Siegler, 1988b, 2006; Siegler & Crowley, 1996,
1992; Siegler & Shipley, 1995; Siegler & Stern, 1998), expertise (e.g.,
Ericsson & Charness, 1994), time and motion (e.g., Gilbreth, 1911), cognitive architectures (e.g., productions in ACT-R, Anderson, Matessa, &
22 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
Legiere, 1997), algorithm–retrieval shifts (Delaney et al., 1998; Logan,
1988; Rickard, 2004), and human computer interaction (e.g., methods
in the GOMS approach, (Card, Moran, & Newell, 1983)). Reviews of
research examining strategy use can be be found in Siegler and Shipley
(1995) and Cary and Reder (2002).
Before reviewing the literature on strategy use it is important to define the term strategy. The term has been used in several different ways
even within cognitive psychology. In this thesis strategy refers to an optional means of task completion (e.g., Delaney et al., 1998; Lemaire &
Siegler, 1995; Rickard, 1997). An equivalent but more elaborate definition is a “method used for obtaining solutions in skill-learning tasks in
which different methods are possible” (Touron & Hertzog, 2004, p. 565).
The usage of the term strategy in this thesis is consistent with the usage of many other researchers (e.g., Delaney et al., 1998; John & Lallement, 1997; Lemaire & Siegler, 1995; Haider & Frensch, 2002; Siegler
& Lemaire, 1997; Siegler & Shipley, 1995; Yechiam et al., 2004). This
definition allows for both behavioural and cognitive strategies. Such
a definition does not require conscious awareness (see Reder & Ritter,
1992).
‘Strategy’ can be a nebulous term capturing any possible qualitative
distinction in task execution. As stated in the introduction, this thesis is
focused on tasks that permit a simple–sophisticated strategy shift. Such
a shift commonly (a) is facilitated by explicit processes of instruction,
deduction, or analogy, (b) is typically able to be executed accurately
once known, and (c) typically requires practice to be executed rapidly.
The following examples from the literature capture the above class of
strategy shifts: (a) one-at-a-time (simple) versus aggregated (sophisticated) creation of computer graphics (Bhavnani & John, 2000; Charman
& Howes, 2003), (b) simple versus sophisticated text editing commands
(Cook, Kay, Ryan, & Thomas, 1995), (c) mouse (simple) versus script-
2.1. INTRODUCTION
23
based (sophisticated) spreadsheet manipulation (Yechiam et al., 2004),
(d) lifting bricks from the ground (simple) versus using an adjustable
level (sophisticated) when laying bricks (Gilbreth, 1911); (e) one at a
time (simple) or sequential (simple) versus opportunistic (sophisticated)
strategies for allocating planes to runways on an air-traffic control simulation (John & Lallement, 1997); (f) processing all information (simple)
versus ignoring unnecessary information (sophisticated) on an alphabet
verification task (Haider & Frensch, 1999, 2002).
This focus contrasts with algorithm–retrieval strategy shifts, which
have been frequently researched in relation to learning curves. Examples of algorithm–retrieval strategy shifts include solving novel arithmetic problems by either computing or retrieving an answer (Blessing
& Anderson, 1996; Delaney et al., 1998; Rickard, 1997), alphabet arithmetic (Logan, 1988), retrieving an answer or using a plausibility strategy
for story recollection (Reder, 1988), dot counting strategies (Palmeri,
1997), and scan versus retrieval strategies on a noun–pair lookup task
(Ackerman & Woltz, 1994).
Algorithm-retrial shift differs from simple-sophisticated shift in several respects. First, participants tend to adopt a retrieval strategy with
practice in the absence of explicit instruction to retrieve, whereas many
participants do not adopt a sophisticated strategy by mere practice alone
(see work on insight, e.g., Luchins, 1942). In particular, explicit instruction can can dramatically increase the uptake of sophisticated strategies.
Second, use of retrieval strategy requires that an association has formed
in memory, whereas individuals are often able to perform a sophisticated
strategy if they are specifically told to use it. Finally, retrieval shifts
typically involve an immediate improvement in speed, whereas sophisticated strategies can involve a temporary reduction in performance as
performance of the new strategy is refined.
24 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
Theoretical Models
Many theoretical models exist that either directly concern strategy use
or have strategy use as a component. Some of these are general models
of human cognition such as ACT-R (Anderson et al., 2004), and others
concern particular tasks or domains. All such models have to deal with
three things: (a) how a strategy enters the repertoire, (b) how a strategy
is selected, and (c) how selection of strategies from a repertoire changes
over time.
In order for a strategy to be selected it is required that the strategy be known to the individual in some sense. Mechanisms can be
broadly distinguished based on whether strategies enter explicitly versus implicitly (for a theory and commentary, see Dienes & Perner, 1999).
Crowley, Shrager, and Siegler (1997) also discuss differences in strategy
shift processes using the terms metacognitive and associative for what
roughly corresponds to explicit and implicit processes respectively. Explicit mechanisms include analogy, deduction, and receiving new declarative knowledge, such as through written or verbal instruction (Reber,
1989). Implicit mechanisms are based on detecting correlations in the
task environment.
Strategy selection has been incorporated into several models of problem solving and skill acquisition (e.g., Lovett, 2005; Lovett & Anderson, 1996; J. W. Payne, Bettman, & Johnson, 1988; Schunn, Reder, &
Nhouyvanisvong, 1997; Siegler & Shipley, 1995). While each model has
its nuances, a general characterisation is as follows. An individual has
a repertoire of strategies each of which has an associated utility in a
context. The probability of a strategy being selected is monotonically
related to its perceived relative utility for the individual on the task in
the context. The utility is assumed to include both task-relevant goals
such as efficiency and accuracy, as well as costs associated with cognitive
2.1. INTRODUCTION
25
resource requirements. Gray, Sims, Fu, and Schoelles (2006) suggested
resource costs included time to access an option, motoric complexity, and
memory demands. This framework is at the heart of the ‘rational’ perspective proposed by Anderson (1990) and reflected in the ACT-R theory
(Anderson et al., 2004).
Models of strategy selection typically include a stochastic component.
This is captured in Siegler’s Overlapping Waves Theory (see Siegler,
2006) based on observations that children continue to use multiple strategies over long periods of time and that learning occurs as a gradual process of replacing simpler strategies with better and more sophisticated
strategies. It is also captured in the results of many probability matching studies whereby instead of participants always selecting the most
probable outcome, participants tend to choose options in proportion to
their probability of success (e.g., Lovett & Anderson, 1996). Several
explanations for this diversity include: (a) the broader adaptiveness of
maintaining a diverse strategy repertoire, (b) forgetting processes, and
(c) the complexity of the environment.
Strategy selection also changes with practice both in terms of strategies entering the repertoire and in the utilities associated with given
strategies. Lemaire and Siegler (1995, p. 83) proposed that there were
four sources of cognitive change: “(a) which strategies are used, (b) when
each strategy is used, (c) how each strategy is executed, (d) how strategies
are chosen.” Standard three phase theories of skill acquisition (Anderson,
1982; Fitts & Posner, 1967; Schneider & Shiffrin, 1977) suggest that
phase one involves a search and refinement of strategies on a task. In a
related sense, inadequate performance, which is more likely to occur early
in practice, may trigger a search for a better strategy. M. K. Singley and
Anderson (1989) suggested that the search for a better strategy is more
likely to occur when a strategy fails completely as opposed to when it is
merely suboptimal. Bhavnani and John (2000) note that there can be
26 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
cognitive costs in executing a strategy and costs in identifying a superior strategy. Despite these shortcomings, strategy selection is typically
assumed to be relatively adaptive given the individual’s knowledge and
resources.
Strategy selection theories also assume that perceived utilities for a
given strategy are reasonably accurate. In order for perceived utilities
to be accurate, a process is posited whereby feedback resulting from
strategy use is assumed to cause an update of perceived utility based on
effectiveness of the strategy in terms of performance outcomes and the
cognitive resource requirements. For example, Gray and Boehm-Davis
(2000) showed that strategy use changed in adaptive ways in response to
interface changes that made some strategies slightly slower. In another
example, Erev, Ert, and Yechiam (2008) within the context of decision
making under uncertainty propose a model whereby individuals first engage in a period of exploration where the utility of various choices are
evaluated followed by a period of exploitation where choices with greater
perceived utility tend to be sampled. The updating process also implies
the existence of an attribution processes whereby results from an action
are attributed to the strategy used.
Methodological Issues
Before discussing the empirical relationship between practice and strategy use it is worth discussing methodological issues related to measuring
and modelling strategy use. Purely cognitive strategies often require
probes to measure strategy use (e.g., Rickard, 1997; Touron, 2002). Two
issues often discussed in the literature are validity (i.e., can participants
accurately report their strategy use?) and reactance (i.e., does having
participants report their strategy use change their behaviour?). In other
cases, strategy use has been inferred from reaction time (e.g., Haider
2.1. INTRODUCTION
27
& Frensch, 2002). One of the benefits of studying strategies with behavioural consequences is that reactance is unlikely to occur. In particular, computerised tasks can be designed that permit the silent recording
of participant behaviour. An example of this can be seen in the use of
key logs to extract strategy use on an air traffic control simulation (John
& Lallement, 1997).
A third methodological issue relates to the level of analysis when
modelling the relationship between practice and strategy use. As with the
relationship between practice and performance, modelling at the grouplevel leads to a biased representation of the individual-level. If strategy
shifts are abrupt and occur at time points that vary across individuals,
a smooth function will tend to result at the group-level even though a
discontinuous model is more reasonable. While some researchers have
reported individual-level strategy use (e.g., Delaney et al., 1998; John
& Lallement, 1997) many researchers have only reported results at the
group-level (e.g. Touron & Hertzog, 2004). While the group-level can
provide a convenient overview of the general transition of a strategy, it
is the individual-level that indicates whether the shifts were abrupt or
gradual, and partial or complete.
Models
Overview In order to add rigour to the modelling of the relationship
between practice and strategy use, a set of models of the relationship are
proposed. All the models assume that strategy use, i.e., that which is
being modelled, has been defined on a zero to one scale. In the present
thesis, strategy use was operationalised as the proportion of keys classified as sophisticated, but in other cases, it might the probability of using
a given sophisticated strategy. While some researchers have discussed
mathematical models of the relationship, this has not always been the
28 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
case, and thus, potential models had to be developed based on qualitative
descriptions in the literature.
Assumptions of strategy shift immediately rule out linear, quadratic,
and higher-order polynomial functions. While a polynomial of sufficiently
high-order will provide good fit to observed data, as mentioned in relation
to practice and performance, such polynomial functions have specific
problems. First, they make inappropriate predictions outside the range
of the data. An inverted U-shaped quadratic function eventually predicts
that strategy sophistication will decline with practice. Both linear and
quadratic functions suggest that with sufficient practice, probability of
use of sophisticated strategy will be either less than zero or greater than
one. Second, they do not suggest plausible change mechanisms in the
same way that some other functions do.
More plausible functions are those that increase monotonically to an
asymptote. The remainder of this section presents several plausible functions, focusing on a Constant function, two monotonically increasing and
decelerating functions (Michaelis–Menten and Saturating Exponential ),
a sigmoidal function (Logistic), and two functions with discontinuities
(Constant–Constant and Constant–Saturating Exponential ).
Constant A simple model that can be used to describe individuals who
do not change strategy is the Constant model,
fCn (x, θ) = θ1 .
(2.8)
Such a function can be used to describe a range of individuals. These include constant low, medium, and high strategy sophistication individuals,
as well as individuals who are variable in their strategy sophistication but
the general probability of strategy sophistication over time is constant.
2.1. INTRODUCTION
29
Michaelis–Menten The Michaelis–Menten (Michaelis & Menten,
1913) function is a monotonically increasing and decelerating function
(for a discussion, see Chapter 3 of Bolker, 2008). Bolker (2008) reviewed
the history of the function noting how it was proposed by Michaelis and
Menten (1913) in relation to enzyme kinetics. Bolker (2008) also commented on other similar functions such as the Monod function, Holling
Type II, and the Beverton–Holt model. A three parameter version of
the Michaelis–Menten function allows for a non-zero starting point and
a non-zero asymptote,
fMM (x, θ) =
θ1 (x − 1)
+ θ3 ,
θ2 + (x − 1)
(2.9)
where x is trial number starting from 1, θ1 is the amount of change, θ2
is the number of trials it takes for half the change to occur, and θ3 is
strategy sophistication on trial 1. The function is positive, negatively
accelerated, and approaches an asymptote. In order for the values to be
meaningful, the following constraints need to be satisfied: θ1 ≥ 0; θ2 > 0;
θ3 ≥ 0; θ1 +θ3 ≤ 1. Cudeck and Harring (2007) stated that the Michaelis–
Menten function provided a clear set of interpretable parameters, using
Frensch, Lindenberger, and Kray (1999) to illustrate this point. Frensch
et al. (1999) used the function to model the relationship between rate
of stimulus change to monitoring accuracy on a continuous monitoring
task. However, following a literature search, no evidence was found of
the function being applied to modelling the relationship between practice
and strategy use.
Saturating Exponential Another monotonically increasing and accelerating function is the Saturating Exponential function. Bolker (2008)
suggested that the differences between it and the Michaelis–Menten func-
30 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
tion are subtle. The function is
fSE (x, θ) = θ1 (1 − exp(−θ2 (x − 1))) + θ3 ,
(2.10)
where θ1 is strategy sophistication on trial 1, θ1 +θ3 is asymptotic strategy
sophistication, and θ2 is a shape parameter.
Logistic A contrast to the above two functions is the logistic function,
which suggests a sigmoidal relationship between practice and probability
of strategy use. One parameterisation of the three parameter logistic
function is
fLg (x, θ) =
θ1
,
1 + exp(−(x − θ2 )/θ3 )
(2.11)
where θ1 represents asymptotic proportion of strategy use, θ2 represents
the trial of the inflection point (i.e., half the asymptote) and θ3 is a
scaling value. θ1 is often set to 1.0 where it is assumed that strategy use
will approach 100% with infinite practice.
Constant–Constant The Constant–Constant model represents an
abrupt onset and an abrupt shift in strategy use,
fCnCn (x, θ) =


θ1
if x ≤ θ3

θ2
if x > θ3
,
(2.12)
where there is a breakpoint at θ3 after which, strategy use changes from
θ1 to θ2 . This is consistent with an abrupt shift typically from not using
a strategy to using a strategy.
2.1. INTRODUCTION
31
Constant–Saturating Exponential A second model with a breakpoint is the Constant–Saturating Exponential,
fCnSE (x, θ) =


θ1
if x ≤ θ5

θ2 (1 − exp(−θ3 (x − 1))) + θ4
if x > θ5
.
(2.13)
This function is consistent with an initial period of low or otherwise
constant strategy use, followed by an abrupt onset of a strategy shift,
but where the actual shift is gradual. A similar simpler model might also
be explored where θ4 is constrained to be equal to θ1 . In some respects
it is similar to a logistic function, yet it is theoretically distinguished by
its abrupt onset.
Empirical Evaluation
The empirical evidence regarding the relationship between practice and
strategy use is diverse in terms of task, design, sample, and context. Also,
a lot of existing research has the previously mentioned methodological issues related both to inadequate strategy measurement and not reporting
individual-level analyses.
Before discussing simple–sophisticated strategy shifts, the algorithm–
retrieval strategy shift literature is briefly described. Reflecting the fundamental importance of algorithm–retrieval shift, many studies have examined this shift (e.g., Delaney et al., 1998; Logan, 1992, 1988; Palmeri,
1999; Rickard, 1997, 1999, 2004). Single digit multiplication provides a
prototypical example. With practice, individuals transition from solving
problems, like four times four, using general algorithms, such as repeated
summing, to directly retrieving answers from memory. Common findings
from such studies include: (a) at the group-level, retrieval use tends to
increase following either an exponential or logistic function, (b) retrieval
shift is largely item specific, (c) within an item there is some, but not a
32 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
lot, of shift back and forth between algorithm and retrieval, (d) strategy
shift is less abrupt when operating over more naturalistic time frames,
(e) often a small subset of individuals rarely if ever use retrieval, (f) individuals are relatively adaptive in their selection of algorithm or retrieval
strategies.
Simple-sophisticated shift differs in several ways from algorithm–
retrieval shift. Retrieval strategies require that the stimulus-response
pair has been memorised which takes at least one trial, but typically
many more to occur; sophisticated strategies often only require awareness of the strategy, and can often be used on the first trial. Transition to
retrieval strategies typically occur naturally with practice; sophisticated
strategies are often not suggested by the simpler strategy; for example,
using the mouse does not suggest the existence of a script (Yechiam et al.,
2004). Retrieval strategies are almost always faster than the algorithm
strategy; sophisticated strategies can be slower than the simpler strategy
when first adopted; for example, learning more sophisticated text editing
short cut keys can be awkward at first.
While the research on individual-level strategy shifts from simple to
sophisticated strategies is sparse, several relevant findings exist. John
and Lallement (1997) examined strategy use on an air traffic control
simulation. They found that around half the participants did not shift
strategy, which is consistent with the Constant model. Other participants adopted a different strategy after several trials, although John and
Lallement (1997) did not specify exactly when the transition occurred.
They labelled 39 of the 50 observed strategy shifts as gradual and the
remainder as abrupt. These two levels of abruptness are consistent with
Constant–Constant and Constant–Saturated Exponential models. If the
onset of the shift occurred in the first trial, then Michaelis–Menten or
Saturating Exponential may provide better fit.
Another line of research has examined how people use computer soft-
2.1. INTRODUCTION
33
ware. A common observation is that users often plateau at strategies
that are asymptotically suboptimal (for a summary of this literature,
see Bhavnani & John, 2000; J. M. Carroll & Rosson, 1987; Charman &
Howes, 2003). Classic studies in time-and-motion found that industrial
workers rarely discovered the most efficient strategies for performing their
tasks (Gilbreth, 1911). Charman and Howes (2003) found that about half
the participants in their study shifted to a more sophisticated graphics
drawing strategy, although results did not indicate how abrupt the shift
was nor the timing of the shift. Charman and Howes (2003) suggested
that strategy shift may be greater in the lab than in the real-world because participants are focused more on low-level task goals whereas real
world workers are more concerned with high-level project goals. Furthermore, any discussion of optimality raises the question of what is being
optimised, and the various costs and benefits of searching for optimal
strategies.
Similarly, Yechiam et al. (2004) examined the use of a mouse versus
a script-based strategy, along with a much slower keyboard-based strategy, on a spreadsheet manipulation task. Half of the participants were
allowed to use the mouse strategy from the start, while the other half
were required to use the script strategy initially. The script was difficult
to learn, but ultimately quicker than the other strategies. They found
that participants were generally unlikely to switch to the script strategy when it was introduced half-way through practice. Also, although
Yechiam et al. (2004) did not report individual-level results, the pattern
of results suggested that the shift to the using the script-based strategy
tended to occur on the first or second trial of its availability or not at all.
It would also seem that the shift to the script-based strategy was abrupt.
34 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
Individual-Level Predictions
In summary, the literature on simple–sophisticated strategy shift is
sparse. Results are somewhat contingent on task, context, and participant properties. Studies have also often failed to report results in such
a way to enable an understanding of individual-level patterns of shift in
terms of timing and abruptness. As with the relationship between practice and task completion time, it is the individual-level that is relevant
for understanding how individuals learn. In light of the above literature,
the following hypotheses are proposed.
Hypothesis 2.5 For individuals that shift strategy, some shifts are gradual and others are abrupt.
Abrupt shifts will be indicated by support for the Constant–Constant
model. Gradual shifts will be reflected by support for Michaelis–Menten,
Saturating Exponential, Logistic, and Constant–Saturating Exponential
models. In general strategy shift is likely to be more abrupt where explicit instructions are given, generalisation is straight forward, and the
sophisticated strategy is immediately and always far more effective than
the simpler strategy.
Hypothesis 2.6 Many individuals will be characterised by only partial
strategy sophistication.
This will be indicated by participants who use a mix of both simple and
sophisticated strategies.
In contrast to algorithm–retrieval shifts which often take time to develop and can follow various continuous functions, simple–sophisticated
strategy shift is hypothesised to have the following pattern:
Hypothesis 2.7 The probability of a strategy shift occurring decreases
monotonically with practice.
2.1. INTRODUCTION
35
Given the tendencies for inertia, it is likely that some individuals never
discover sophisticated strategies.
Hypothesis 2.8 For some individuals, the relationship between practice
and strategy sophistication involves near continuous use of a simple strategy.
This will be indicated by support for the Constant model with a θ1 (i.e.,
the mean proportion of strategy use) at a low level.
It is also clear that some individuals either know a sophisticated strategy from the start or learn it on the very first trial.
Hypothesis 2.9 For some individuals, the relationship between practice
and strategy sophistication involves near continuous use of a sophisticated
strategy.
This will be indicated by support for the Constant model with a θ1 (i.e.,
the mean proportion of strategy use) at a high level.
Group-Level Predictions
While not the primary focus of this thesis, it is still interesting to consider the relationship between practice and strategy sophistication at the
group-level. Several functions have been proposed in the literature. At
other times, results have been presented graphically that suggest a functional form without defining one. The following discussion is limited to
functions where the dependent variable is the proportion of instances of
use of a sophisticated strategy, or for comparison purposes, a retrieval
strategy.
It is generally assumed that use of a sophisticated strategy increases
with practice based on the adaptive nature of strategy use. Thus, the
following questions remain: First, is sophisticated strategy use zero on
the first trial? Second, does sophisticated strategy use asymptote at 100
36 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
percent sophisticated strategy use or something less than that? Third,
although the assumption of a monotonically increasing function seems
almost certain, is the function monotonically decelerating, or does it
follow a sigmoidal function? Fourth, related to the third question, what
is a functional form of the relationship? Finally, how does this functional
relationship vary as a function of task, sample, and context?
For several reasons, a limited set of the previously mentioned functions seem plausible for modelling the relationship between practice and
strategy sophistication at the group-level.
These are the Michaelis–
Menten, Saturating Exponential, and Logistic. The two discontinuity
functions seem implausible at the group-level because the timing of any
discontinuity should vary between individuals, and such discontinuity
models are assumed to hold only for some participants. The Constant
model is inconsistent with ample evidence that strategy use does change
from simple to sophisticated strategies at least for some participants.
At least three functions have been proposed to model the group-level
relationship between practice and strategy use. First, Rickard (1997) applied a logistic function to model algorithm–retrieval shift. The logistic
function assumes that the rate of adoption of a strategy initially accelerates before decelerating. This is a plausible model of algorithm–retrieval
shift because it may take several trials for a memory trace to become
established. However, in the case of simple–sophisticated strategy shift,
monotonic deceleration seems more plausible. Search for a sophisticated
strategy may be maximal when an individual is first orienting to a task
at the start of practice. Initial task instructions may also trigger insights
that affect early trials.
Touron and Hertzog (2004) used a one parameter version of the Saturating Exponential function to model the relationship between practice
and probability of using a retrieval strategy on an algorithm–retrieval
task at the group-level. The one-parameter version was achieved by im-
2.1. INTRODUCTION
37
plicitly constraining θ1 to 1.0 and θ3 to 0.0, based on the assumption that
initial probability of retrieval would be zero, and after infinite practice
would be one. While such constraints may be reasonable in the context of algorithm–retrieval shift, they seem inappropriate in relation to
simple–sophisticated strategy shift. It is likely that initial strategy sophistication will often be above zero and asymptotic levels will be below
one.
Based on the literature and hypotheses mentioned earlier, various expectations result regarding the relationship between practice and strategy
sophistication at the group-level. These individual-level assumptions are
that (a) some individuals use the sophisticated strategy from the first
trial, (b) some individuals never use the sophisticated strategy even after
extensive practice, (c) the probability of shift decreases with practice,
and (d) individuals vary in whether the shift is gradual or abrupt. Based
on these assumptions it would be expected that the group-level curve
relating practice to strategy sophistication should (a) start at a non-zero
level, (b) approach an asymptotic level less than one, (c) and be a negatively accelerated function, whereby the greatest change happens in the
early period of practice.
Although they did not use a mathematical model, F. J. Lee et al.
(1995) presented data broadly consistent with the above propositions.
They graphed the relationship between practice and use of a more efficient runway landing strategy, called the Hold 1 strategy, on the Kanfer–
Ackerman Air Traffic Control Task. On trial one, around 28% of participants used the sophisticated strategy. The proportion of participants
using the Hold 1 strategy increased monotonically with practice with the
rate of increase monotonically decelerating with practice. The proportion
of participants using the Hold 1 strategy on the final trial was around 48
percent.
Based on the above theoretical considerations both the Michaelis–
38 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
Menten and Saturating Exponential functions should provide a good fit
at the group-level. This should be superior to the Logistic function. It
also seems worthwhile to compare the fits of the above three functions
on a task that allows for simple–sophisticated strategy shift to compare
relative fit and also check the assumptions made above regarding starting
and asymptotic levels of strategy use. Specifically, the following hypothesis is proposed.
Hypothesis 2.10 The relationship between practice and strategy sophistication at the group-level is effectively modelled by a three parameter
Michaelis-Menten function.
In combination, as with practice and performance, these hypotheses assert that the relationship between practice and strategy sophistication
at the group-level is a biased representation of the individual-level. The
group-level relationship is the sum of individual curves, and these individual curves are predicted to have different functional forms and varying
parameters within curves. If this is the case, then even if the group-level
relationship corresponds to the functional form for some individuals, it
would be inappropriate for others.
2.1.4
Strategy and Performance
Overview
This section examines the relationship between strategy use and performance. First the concept of strategy relative effectiveness is defined and
some of the challenges of measuring it are discussed. Then models and
empirical data on the effect of strategy shift on task completion time
are presented. This section aims to highlight reasons why discontinuities
would or would not be expected in the learning curve given that strategy
shifts are sometimes abrupt.
2.1. INTRODUCTION
39
Relative Effectiveness
In general, strategy relative effectiveness is the degree to which one strategy is more or less effective than an alternative. In theory strategy relative effectiveness may be defined relative to many different criteria including accuracy, speed, quality, or minimisation of cognitive load. However,
for the purpose of this thesis, strategy relative effectiveness will generally
be operationalised as the difference in expected task completion time for
two strategies.
Clearly, strategy relative effectiveness depends on many factors; some
of the most prominent include: (a) general properties of the strategies,
(b) individual differences in general, (c) individual differences in how
well each strategy has been learnt, (d) features of the task and context,
and (e) amount of practice on the task and on each strategy. Despite
the effect of these factors, researchers have often sought to estimate the
absolute relative effectiveness of two strategies. Theoretical and empirical
approaches can be contrasted.
Theoretical estimates can be derived from various models of task performance. Such models are typically grounded in broader frameworks of
human action such as GOMS (Card et al., 1983), CPM-GOMS (Gray,
John, & Atwood, 1993; Gray & Boehm-Davis, 2000), cognitive architectures (for a review see John, 2003), and industrial engineering approaches
(Konz, 1987). Task analytic approaches are based on assumptions of the
nature of information processing and validated using empirical data.
While theoretical estimates may be useful, they are only as accurate
as the theory that is used to generate them. Such theories are validated
using empirical data, and they generate predictions that are capable of
empirical testing. They are also less developed in areas of specific interest
to this thesis around learning and individual differences. Strategy relative
effectiveness at the time of a strategy shift and during the learning process
40 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
is critical to understanding continuity in the learning curve. Individual
differences also lead to variability in performance, which is often only
incorporated in heuristic ways such as the way that GOMS uses typing
speed as a measure of individual differences or the way that it assumes
that the skill has been learnt.
Empirical Estimates
Empirical estimates are calculated by obtaining a sample of performance
times using the strategies of interest. The difference in task completion
time for the two samples constitutes a measure of strategy relative effectiveness. However, as discussed extensively by Siegler and Lemaire
(1997), empirical estimates of strategy relative effectiveness are often
biased. Biases arise because observations are typically not sampled randomly. For example, participants with great skill in general may be more
likely to use better strategies, or sophisticated strategies may be more
likely to be used with practice after other learning has already occurred.
As such the estimate of strategy relative effectiveness is biased due to
the correlation between strategy use and another important predictors
of task performance.
To highlight the problems of bias in measurement of strategy relative
effectiveness, three examples from the literature are provided. First,
F. J. Lee et al. (1995) compared trial performance on the Kanfer–
Ackerman Air Traffic Control Task where a strategy either was or was
not used. They treated the data from the 18 trials and 58 participants
as 1044 observations and reported the correlation between use of the superior plane landing strategy and task performance. Such an estimate is
biased because some individuals used the strategy more than others and
the use of the strategy increased with practice.
Second, John and Lallement (1997), also on the Kanfer–Ackerman Air
2.1. INTRODUCTION
41
Traffic Control task, compared trial performance across several classified
strategies on the final trial. This approach dealt with the confounding
effect of practice. However, it did not deal with the confounding effect of
individual differences. It is quite plausible that participants who used the
superior strategy would also be more skilled in other ways. In addition,
by only examining the final trial it was not possible to estimate relative
effectiveness at the time of the strategy shifts.
Third, Touron, Hoyer, and Cerella (2004) presented a graph of average
reaction time for algorithm and retrieval shift over practice. However,
the individuals that adopt retrieval early are quite different to those that
adopt it later. Each of these examples highlights the challenge of getting
an estimate of the relative effectiveness of a strategy for a particular
individual at a particular instance in time.
In order to get unbiased estimates of strategy relative effectiveness
Siegler and Lemaire (1997) proposed the Choice / No-Choice design (for
another exmaple, see, Walsh & Anderson, 2009). The Choice / No-Choice
design involves measuring strategy use and performance in multiple conditions. In the choice condition participants are allowed to choose the
strategy that they wish to use. In the no-choice conditions—one for each
strategy—participants are required to use a designated strategy. Comparing performance between strategies in different no-choice conditions
is meant to yield an unbiased estimate of strategy relative effectiveness.
While the Choice / No-Choice design overcomes many problems with
estimation of strategy relative effectiveness, the design is not suited to
certain situations. First, the design can not be used to estimate relative
effectiveness at the time of a strategy shift. It assumes that learning
effects are not dominating estimates. Second, it can not be used when
it is strategy relative effectiveness at the time of self-initiated strategy
shift that is of interest. In such cases, the no-choice condition both informs the participant of the existence of a given strategy and instructs
42 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
the individual to use the strategy. If, as is to be expected, self-initiated
strategy shifts have particular dynamics related to discovery and exploration, such timings of strategy shift would not be observed with the
Choice / No-Choice design.
In summary, there is at present no standard method for estimating
the relative effectiveness of an old and new strategy at the time of a selfinitiated strategy shift. Looking at individual-level differences may be
a good approach, particularly where the task is constant over the onset
point.
Models of Performance following Strategy Shift
Despite these difficulties, several researchers have proposed models of,
and empirically estimated, strategy relative effectiveness within the context of the learning curve. Such models should address both the relative
effectiveness immediately following a strategy shift, and also how strategy
use improves with practice.
Perhaps the earliest of such models was proposed by Crossman (1959).
He proposed that the learning curve resulted from changes in probabilities of strategy use over time. Strategies were assumed to result in a fixed
performance level. Probability of strategy use was assumed to move towards faster strategies and the speed of this transition in probabilities
was greater when the relative effectiveness was also greater. The problem with Crossman’s model, however, was that first, the model confuses
strategy use, a process, with performance. Strategies can be performed
at various degrees of effectiveness both by individuals and over time. Second, Crossman did not measure strategy use and thus did not validate the
model assumptions about shifting probabilities of strategy use. Subsequent research (e.g., John & Lallement, 1997) measuring strategy use has
shown that the model makes incorrect predictions about the relationship
2.1. INTRODUCTION
43
between practice and strategy use.
A second set of models are concerned with algorithm–retrieval shift
(Delaney et al., 1998; Logan, 1988; Palmeri, 1999; Rickard, 1999). While
this thesis focuses on simple–sophisticated strategy shifts, models of
algorithm–retrieval shift still provide useful insights. In general, studies have shown that both algorithm and retrieval strategies get faster
with practice. In these tasks the retrieval strategy is typically substantially quicker than the algorithm even from early trials of use. Because
strategy shift is typically gradual over a set of items the learning curve
is assumed to be, and typically is, continuous.
Delaney et al. (1998), in particular, proposed the idea that the learning curve is best represented by power functions within strategies. The
model takes strategy use as given and makes a prediction. Although they
did not express it as such, the model can be expressed as follows:
−(θ +θ z )
fs2 (x, z; θ) = (θ1 + θ2 zi )xi 3 4 i


θ1 x−θ3 if zi = 0
i
,
=

θ2 x−θ4 if zi = 1,
i
(2.14)
(2.15)
where fs2 (x, z; θ) is expected task completion time for the ith trial, given
strategy use zi (0 indicating no strategy use and 1 indicating strategy
use), trial number xi and the parameter values θ1 , θ2 , θ3 , θ4 . Delaney et
al. (1998) found that this model predicted reaction time over and above
a standard two parameter power function.
There are several reasons to critique the model proposed by Delaney
et al. (1998). First, in algorithm–retrieval studies the main skill that is
acquired is the ability to retrieve. It is not possible to retrieve without
practice. Thus, by incorporating strategy use as a variable in prediction,
it is unfair to compare predictions with models that only incorporate
amount of practice. Second, it is meaningless to talk about predicted
44 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
performance time using the retrieval strategy on early trials when an individual is unable to retrieve. In such a situation, if the relevant stimulus–
response pair is not represented in memory, an individual will not be able
to retrieve the correct answer. Third, it is unclear why practice with retrieval would increase algorithmic reaction time as implied by the model.
Fourth, the two parameter power function makes the implausible prediction that task completion time will approach zero with infinite practice
both for the algorithm and the retrieval strategy.
While substantial research exists on modelling algorithm–retrieval
shift, there are several reasons to assume that different models are required for tasks involving simple–sophisticated strategy shifts. In addition to the possibility of abrupt shifts, evidence suggests that performance may temporarily decline immediately after shifting to a sophisticated strategy (e.g., Study 3 in Delaney et al., 1998).
The general dynamics of simple–sophisticated strategy shift can be
seen in many domains of life. The researcher shifts from using a word processor to LATEX and from a menu driven statistics package to one based on
writing code. The student decides whether to shift from hunt-and-peck to
touch typing (see Yechiam, Erev, Yehene, & Gopher, 2003). These are everyday examples where a temporary drop in performance follows a strategy shift before ultimately superior performance is attained. Yechiam
et al. (2004) uses the concept of escalation of commitment (Staw, 1981)
to explain why an individual may be reluctant to temporarily drop performance for uncertain future benefits. A related idea is that of local
minima, whereby strategy use is seen as an optimisation problem where
individuals can get caught in local minima.
Several studies are relevant to understanding relative effectiveness at
the time of simple–sophisticated strategy shift. However, each one tends
to be missing an element to make the desired inferences. Haider and Frensch (1999, 2002) present data from an alphabet string verification task.
2.1. INTRODUCTION
45
The individual-level learning curves show discontinuities. The timing of
these discontinuities varies between individuals. If Haider and Frensch’s
interpretation is correct, the difference between pre- and post- shift reflects the relative effectiveness of learning to ignore the first few letters
in the verification task. However, because there is no direct measure of
strategy use, the results are open to alternative interpretations.
Other studies have an appropriate design, but lack the necessary
measurements or analyses. John and Lallement (1997) obtained triallevel strategy use and performance data but only compared performance
across strategies on the final trial. Even though the reaction time data
made it fairly clear, Yechiam et al. (2004) did not measure strategy use
on each trial. In summary, more research is needed with trial-level measurement of strategy use and performance on tasks that have a simple–
sophisticated strategy shift in combination with individual-level analysis.
Reconciling Abrupt Strategy Shift with Continuous Learning
Curves
The immediately preceding section has summarised models and empirical
evidence regarding the influence of strategy shift on the learning curve.
If it is true that discontinuities in learning curves are rare yet abrupt
strategy shifts are rather common, then some combination of factors
must prevent strategy shifts from causing discontinuities. Strategy use
then becomes a partial mediator along with other learning processes of
the effect of practice on task completion time. Potential factors include:
(a) the time to learn the strategy, (b) the occurrence of gradual strategy
shifts, (c) the process of generalising the use of a strategy to variations
in task features and context, and (d) the greater probability of the shift
occurring early in practice.
It was previously proposed that discontinuities in the learning curve
46 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
at the individual-level are rare (i.e., Hypothesis 2.4). First, this can
be reconciled with abrupt strategy shifts by considering that strategy
shifts are only sometimes abrupt. Furthermore, abruptness can mean
abrupt onset would be less likely to lead to an abrupt performance shift.
Second, as asserted previously some individuals do not shift strategy
either because they always use the sophisticated strategy (i.e., Hypothesis
2.8) they never use the sophisticated strategy (i.e., Hypothesis 2.9), or
changes in strategy use from trial to trial do not reflect a systematic shift
(i.e., Hypothesis 2.6). Third, as asserted previously strategy shift is often
gradual (i.e., Hypothesis 2.5).
Even when strategy shifts do occur and are identified, there are reasons to expect a lack of discontinuity in task completion times. There
are several reasons for this. First, based on general principles of summing random variables, trial-to-trial variance in performance should be
greater at the individual-level than at the group-level. In many instances
this variance will be large in comparison to strategy relative effectiveness
both immediately following the shift and once the new strategy has been
learnt. Second, as asserted previously strategy shifts are more likely
to occur at the start of practice (i.e., Hypothesis 2.7) when the rate of
learning potentially due to other factors is at its maximum (i.e., this
is consistent with Hypothesis 2.2) thereby increasing the trial-to-trial
variance at the time of shift. Third, although it is difficult to measure
empirically, it is assumed that following a strategy shift, strategy specific
learning typically takes place. This means that the full strategy relative
effectiveness is not realised until a period of practice has passed.
2.1.5
The Current Studies
In order to test the above hypotheses, three studies were conducted.
For the purposes of the analyses in this chapter each study examined
2.1. INTRODUCTION
47
a single group of participants performing a keyboard text editing task
over a period of practice. Performance was measured as the time to
complete text editing changes. Other researchers have used text editing
to study other aspects of skill acquisition (e.g., Armstrong, 2000; Card,
Moran, & Newell, 1980; Cook et al., 1995; Harvey & Rousseau, 1995;
S. J. Payne, Squibb, & Howes, 1990; Robertson & Black, 1983; M. Singley
& Anderson, 1985, 1987).
Keyboard text editing was chosen as the task in this thesis for several
reasons. First, text editing permits a variety of strategies. Such strategies
can readily be classified as simple or sophisticated. Second, sophisticated
strategies should be more asymptotically efficient, but may take time to
acquire. Third, key logs permit automated trial-level measurement of
strategies.
The three studies were designed to test the hypotheses under varying conditions, including degree of initial text editing training and the
structure of the text editing requirements. Studies 1 to 3 involved a progressive increase in the amount of instruction provided to participants on
sophisticated text editing strategies. Study 1 involved a consistent set
of editing requirements across trials, whereas Study 2 and 3 varied the
editing requirements. More details of variation across the three studies
are described in the method.
The main expectations for the three studies are reflected in the hypotheses presented in this chapter. However, in addition, Study 1 trialto-trial variance in both performance and strategy sophistication at the
individual-level was expected to be less than in the other two studies
because Study 1 involved both more edits per trial and consistent edits
across trials. Use of sophisticated strategies was expected to be greatest
in Study 3 and least in Study 1 based on the corresponding levels of
strategy instruction.
48 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
2.2
2.2.1
Method
Overview
This section describes the method used in the three studies reported in
this thesis. The three studies share many common features, but also
differ in several respects. In order to describe the method of the three
studies in a clear and concise way, common features are first presented
followed by a description of the unique aspects of each study. Additional
details are presented in Appendices A, B, and C.
2.2.2
Common Features
Overview
Each study involved participants completing a selection of individual
difference measures, then performing a series of practice trials on a keyboard based text editing task lasting, depending on the study, between
30 and 60 minutes. On each trial of the text editing task key presses
were recorded. Key presses allowed for measurement of strategy use and
task completion time.
This chapter examines the relationship between practice, strategy sophistication, and task completion time. Methodological aspects related
to individual differences is reserved until Chapter 3. Also, Study 3 had
three conditions, one of which involved a continuous block of practice on
the text editing task (i.e., the No Training Condition) and was broadly
equivalent to Study 1 and Study 2. In contrast, the other two conditions
(i.e., Control and Training) in Study 3 involved various mid-Practice manipulations. Only the No Training condition is described and analysed in
this chapter. The other two conditions pertain to the effect of instructed
strategy shifts and are described and analysed in Chapter 4.
The main differences between the studies are summarised in Table 2.1.
2.2. METHOD
49
Table 2.1: Main Design Differences between the Three Studies.
Feature
Study 1
Study 2
Study 3
Sample size
63
154
Sample
source
Experiment
Duration
Personal networks
Undergraduates
154 (61 in No
Training
Condition)
Undergraduates
150 minutes
120 minutes
100 minutes
54 Trials
30 Minutes
None
None
Constant
Varying
30 blocks each lasting at least 60 seconds
Training, No Training, and Control
Varying
Amount
Practice
of
Experimental
manipulation
Trial variation
Edits per trial
Edits shown
Edit types
Six
On paper
Delete,
Replace,
Cut and Paste,
One
On screen
Delete,
Replace,
Cut and Paste,
Insert
Initial strat- Given list of keys Given list, demonegy instruc- but not told to use stration of how to
tions
use
Key list
On paper
Permanently
on
screen
One
On screen
Delete
Practice Trials
Typing Test
None
None
Anglim
Typing
Test
Two items
Anglim
Test
14 items
One Practice Trial
(no time limit)
10 Thumbs
Theoretical training and practice
using each key
Available at end of
block
Typing
Prior Experi- 17 items
ence
Personality
None
100 Item IPIP
50 Item IPIP
Test
Ability Tests ERV, IT, CC, NC, ERV, IT, CC, NC, None
NS, CS, RT1v1, NS, RT1v2, RT4v2
RT2, RT4v1
Note. The following initials for ability tests were used ERV: Extended
Range Vocabulary Test (Ekstrom, French, Harman, & Dermen, 1976),
IT: Inference Test (Ekstrom et al., 1976), CC: Cube Comparison Test
(Ekstrom et al., 1976). NC: Number Comparison Test (Ekstrom et al.,
1976), NS: Number Sort (Ekstrom et al., 1976), CS: Clerical Speed and
accuracy (Ekstrom et al., 1976); RT1: Simple Reaction Time, RT2: 2Choice Reaction Time, RT4: 4-Choice Reaction Time.
50 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
Variation in individual difference measures, task instructions, and task
features allowed for an exploration of the generality of the propositions
put forward in this thesis.
While a text editing task was used in each study, several design elements varied between the studies, including (a) initial instructions, (b)
accessibility of the text editing keys, (c) editing requirements on each
trial, (d) duration of practice and number of trials, (e) aspects of the
participant interface (performance feedback; position and size of the editing text box), and (f) method of ending trials (trial-time out; ended by
accurate completion versus participant initiated ending). This section focuses on the features of the text editing tasks that were common across
studies. These include: (a) the key press information that was recorded,
(b) general aspects of performance measurement, and (c) measurement
of strategy sophistication.
Key Press Measurement
At the key press level, the following variables were measured for every
key press: (a) block number (where appropriate), (b) trial number, (c)
raw key press number: reset to zero at the start of each trial, (d) raw
delay: milliseconds since the previous key press or, in the case of the
first key press, since the start of the trial, (e) the key pressed, and (f)
whether modifiers Ctrl, Shift, and Alt were held down at the time of
the key press. The raw key press data file contained two types of key
presses: key presses that involved the participant striking the key with
their finger and key presses that resulted from holding a key down and
letting the keyboard repeatedly fire. When Ctrl, Shift, and Alt were
pressed repeatedly, the second and subsequent presses were removed,
because it is equivalent to a single key press. Raw key press numbers
were adjusted so that after removal of repeated modifier keys, the new
2.2. METHOD
51
key press numbers were in the format 1, 2, 3, ... without integer gaps.
Likewise, adjusted delay was constructed from raw delay by aggregating
the total time associated with an instance of repeated firing of modifier
keys and adding this time to the next physical key press in the key log.
Strategy Sophistication
A measure of strategy sophistication was extracted from the key logs
for each participant on each trial. All key presses were classified as either sophisticated, simple, or neutral. The following keys were classified as sophisticated: (a) Ctrl+Left, (b) Ctrl+Right, (c) Ctrl+Down,
(d) Ctrl+End, (e) End, (f) Ctrl+Home, (g) Home, (h) Ctrl+Up, (i)
Shift+Down, (j) Shift+End, (k) Shift+Home, (l) Ctrl+Shift+Left,
(m) Ctrl+Shift+Right, (n) Shift+Up, (o) Ctrl+Backspace, and (p)
Ctrl+Del. The following keys were classified as simple: (a) Left, (b)
Right, (c) Shift+Left, (d) Shift+Right, (e) Backspace, and (f) Del.
Any other keys were classified as neutral. Strategy sophistication at the
trial-level was measured as
SO
,
SO + SI
(2.16)
where SO and SI are the respective trial counts of sophisticated and
simple key presses. Block-level strategy sophistication was measured as
mean trial-level strategy sophistication for accurate trials in the block.
2.2.3
Study 1
Participants
An initial sample of 116 adults participated in Study 1. However, 14
(12%) participants were excluded because of corrupt or missing data,
and 39 (34%) were excluded because their data was deemed invalid. An
52 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
analysis of reasons for invalid data are presented in Appendix A.4. The
size of the final retained sample for analysis was 63. Of the retained
sample 44 were female and 19 were male. Median age was 22 (mean =
23, min = 18, max = 39).
Participants were recruited through personal networks. All participants were initially screened to ensure that they had (a) competence in
English as defined by (i) the ability to carry out a conversation in English, (ii) having lived in Australia for at least eight years, (iii) and had
been in an English speaking educational system; (b) experience using a
word processor on average once per week; (c) normal or corrected to normal vision; and (d) no severe physiological problems that would impair
movement of the hands and wrists.
Text Editing Task
The Study 1 text editing task was programmed in Visual Basic 6. The
task was programmed specifically for the study and adapted from Anglim
(2000b). Calls to the Windows API were used to improve accuracy of
temporal measurement. In contrast to earlier versions of the text editing
task used in our research group, this version was the first to log key
presses of participants. The timing and type of all key presses were
recorded. Participants used standard 104 key IBM/Windows keyboards
and 17 inch CRT monitors.
Screenshots of the user interface are shown in Figure 2.1 and Figure 2.2. The display included an area for editing text. Below this was
feedback and instructions on the keys to press to start and stop a trial.
A counter displayed the number of seconds that had passed on the trial.
The current trial number, the response time for the previous trial, and
the accuracy for the previous trial were all displayed. Additional dialogue boxes displayed at the end of blocks and the experiment are shown
2.2. METHOD
53
in Appendix A.3.1.
Figure 2.1: Study 1 text editing task screenshot in Active Mode.
Each trial of the task required participants to use the keyboard to
make a set of text editing changes. The initial text included three long
sentences. The task required six editing changes to be made. These
involved deleting words, cutting and pasting words, selecting passages of
text, and inserting characters. Table 2.2 shows the original and corrected
text. Table 2.3 sets out the required changes. Participants received a
piece of paper that indicated the required changes and a piece of paper
with a list of text editing shortcut keys (see Table 2.4).
In Study 1 the set of text editing changes were identical for each trial.
54 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
Figure 2.2: Study 1 text editing task screenshot in Stop Mode.
Consistent editing changes meant that participants were more likely to
achieve automaticity. It also minimised the time spent by participants
reading the editing requirements on subsequent trials.
Each trial started when the participant pressed F2. This activated the
text editor and started the timer. Trials ended either after the passage
of one minute or when the participant pressed F3 to indicate that the
trial was complete. At the end of each trial, feedback was displayed on
the time to complete the previous trial and the number of correct edits
out of 12.
For each trial, task completion time and accuracy for each edit were
2.2. METHOD
55
Table 2.2: Study 1 Original and Corrected Text for the Text Editing
Task
Version
Text
Original
We don’t know of no languages or people
without names, no cultures in which some
manner of distinction between self and other,
we and they are not made. Self knowledge,
always a construction no matter how much it
feels like a discovery is never altogether
separable from claims to be known in specific
ways by others (Calhoun, 1984).
An anthropological account of selfhood and
identity.
Corrected
We know of no people without names, no
languages or cultures in which some manner
of distinction between self and other, we and
they are not made. Self-knowledge, always a
construction no matter how much it feels like
a discovery is never altogether separable from
claims to be known in specific ways by others
(Calhoun, 1994).
An anthropological account of identity and
selfhood.
recorded. Information about the nature and timing of key presses was
also recorded. Performance was measured as the time to complete the
edits. Accuracy was out of 12 based of 12 textual checks of the final text
as set out in Appendiex A.3.3. Appendix A.4 discusses how trials with
imperfect accuracy were processed. Measurement of strategy sophistication was derived from the key logs as described in Section 2.2.2.
Procedure
Table 2.5 shows the sequence and duration of experimental tasks. Participants first completed the individual difference measures (analysed and
described further in Chapter 3). Then after a break they completed the
56 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
Table 2.3: Study 1 List of Text Editing Task Requirements
Edit Description
1.
2.
3.
4.
5.
6.
Delete “don’t”
Cut “languages or” and Paste after the second “no”
Insert “-” then backspace between “Self” and “knowledge”
Delete “8” and insert “9”
Cut “selfhood” and paste after “and”
Cut “identity” and paste before “and”
text editing task.
Prior to starting the text editing task, instructions were read out
verbatim as set out in Appendix A.3.2. In brief the instructions provided:
(a) information on why text editing is a useful real-world skill; (b) an
overview of the task; (c) an overview of the sequence of trials and breaks;
(d) information about the list of short-cut keys; (e) reiteration not to use
the mouse; and (f) a request to start the task. Participants were given
a list of text editing short-cut keys as shown in Table 2.4. Participants
were expected to read this list while they completed an initial practice
trial.
The text editing task involved one practice trial and 54 performance
trials. The practice trial had no time limit and was the same text as the
main task. Performance was only recorded on the performance trials.
Participants were encouraged to take short breaks after trials 18 and 36.
2.2. METHOD
57
Table 2.4: Study 1 List of Shortcut Keys Provided to Participants
Key Combination
Action
Cursor keys
Moves cursor
Control-X
Control-C
Control-V
Control-Z
Cut
Copy
Paste
Undo
Shift
Shift & Control & Left or Right
Control & Left or Right
Select text
Select word
Move between words
Backspace
Delete
Control & Delete
Delete letter to the left of cursor
Delete letter to the right of cursor
Delete word after the cursor
Home
Shift & Home
Move cursor to start of line
Select text between where the
cursor was and the start of the
line
Move cursor to start of document
Select text between where the
cursor was and the start of the
document
Move cursor to end of line
Select text between where the
cursor was and the end of the line
Move cursor to end of document
Select text between where the
cursor was and the end of the document
Control & Home
Shift & Control & Home
End
Shift & End
Control & End
Shift & Control & End
F2
F3
Start Trial
End Trial
58 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
Table 2.5: Study 1 Experimental Protocol
Task
Duration
Individual Differences
Introduction
Demographics
Study 1 Prior Knowledge Questionnaire
10 Thumbs Typing Test
Inference Test
Extended Range Vocabulary Test
Cube Comparison Text
Number Comparison Test
Number Sort Test
Clerical Speed and Accuracy
Simple Reaction Time V1.0
2-Choice Reaction Time V1.0
4-Choice Reaction Time V1.0
BREAK
2
2
3
1
1
1
1
1
1
1
1
1
1
5
Text Editing Task
Task Instructions
Practice Trial
Trials 1 to 18
BREAK
Trials 19 to 36
BREAK
Trials 37 to 54
5 min
2 min
approx 15 min
5 min
approx 15 min
5 min
approx 15 min
Debrief
min
min
min
min
min
min
min
min
min
min
min
min
min
min
instructions;
instructions;
instructions;
instructions;
instructions;
instructions;
instructions;
instructions;
instructions;
instructions;
2 min task
12 min task
12 min task
6 min task
3 min task
3 min task
6 min task
4 min
5 min
5 min
2.2. METHOD
2.2.4
59
Study 2
Participants
Participants in Study 2 were drawn from students in a third year undergraduate psychology subject at an Australian university who consented
to allow their involvement to be used for research purposes. The final
retained sample of participants completing the text editing task was 154,
drawn from an initial sample of 193 (i.e., 79.8% were retained). Reasons
for removal of cases are presented in Appendix B.4. Of the retained sample 113 were female (73.9%) and 40 were male (26.1%) with one unknown.
Median age was 21 (mean = 21.1, min = 18, max = 32).
Text Editing Task
The Study 2 text editing task was programmed using Visual Basic.Net.
The design of the study and the nature of the task differed in several
respects from Study 1.
Each trial involved making one of the following four types of edits:
Delete, Replace, Insert, or Cut and Paste. Each trial had different editing
requirements. Each trial was based on initial text taken from a passage
from Alice’s Adventures in Wonderland (L. Carroll, 1865). The corrected
text is shown in Table 2.6. Table 2.7 shows the four trial types with
information on the instructions and the details of the trial. A database
of 400 items was created each with a different editing requirement. For
each of the four trial types there were 100 items.
60 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
Table 2.6: Study 2 Corrected Text for Text Editing Task
Corrected Text
Alice was beginning to get very tired of sitting by her
sister on the bank, and of having nothing to do: once
or twice she had peeped into the book her sister was
reading, but it had no pictures or conversations in it,
‘and what is the use of a book,’ thought Alice ‘without
pictures or conversation?’
So she was considering in her own mind (as well as she
could, for the hot day made her feel very sleepy and
stupid), whether the pleasure of making a daisy-chain
would be worth the trouble of getting up and picking
the daisies, when suddenly a White Rabbit with pink
eyes ran close by her.
There was nothing so very remarkable in that; nor did
Alice think it so very much out of the way to hear
the Rabbit say to itself, ‘Oh dear! Oh dear! I shall
be late!’ (when she thought it over afterwards, it
occurred to her that she ought to have wondered at
this, but at the time it all seemed quite natural);
but when the Rabbit actually took a watch out of its
waistcoat-pocket, and looked at it, and then hurried
on, Alice started to her feet, for it flashed across
her mind that she had never before seen a rabbit with
either a waistcoat-pocket, or a watch to take out of
it, and burning with curiosity, she ran across the
field after it, and fortunately was just in time to see
it pop down a large rabbit-hole under the hedge.
In another moment down went Alice after it, never once
considering how in the world she was to get out again.
The rabbit-hole went straight on like a tunnel for some
way, and then dipped suddenly down, so suddenly that
Alice had not a moment to think about stopping herself
before she found herself falling down a very deep well.
Note. Text is taken from Alice’s Adventures in Wonderland (L. Carroll,
1865).
2.2. METHOD
61
Table 2.7: Study 2 Text Editing Trial Types
Trial
Type
Instructions Details of Task
Delete
Delete red
text
Insert
After blue,
type
Replace
Delete red
text and
replace
with
Cut and Cut
red
Paste
text and
paste after
blue
Item Construction
Text appeared in One or more words
red with a red strike was inserted (5 to 40
through
characters in length);
these words needed to be
deleted; position varied
throughout base text.
Text was highlighted One word was deleted
in blue; the word spec- from the base text and
ified was to be typed needed to be inserted
after this point
Text appeared in One or more words were
red with a red strike inserted (3 to 20 charthrough;
this was acters) and these needed
to be deleted and a to be replaced typically
single text was to be with one word
typed in its place
Red text appeared on A string of text was inthe screen as well as dicated to be cut and
text that was high- pasted into an insertion
lighted in blue
point. The text to be
cut ranged from 12 to 344
characters.
62 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
The editing requirements were incorporated into the computer display. This removed the need for participants to look at a separate piece
of paper to read the editing requirements. This modification made it
easier to have different editing requirements on each trial, yet still allow
participants to obtain a level of automaticity. It also made the editing
task more similar to how individuals edit when making changes that they
have mentally identified as opposed to responding to a commented draft
with requested changes. Colour was used to indicate the type of editing
change required.
A screenshot of the task interface is shown in Figure 2.3. At the top
of the screen were instructions for the trial indicating the trial type. In
the case of Insertion and Replace trials, the content of the insertion and
replacement were displayed here. Below this was a tip about editing text
which changed on each trial. Below this taking up the majority of the
display and aligned to the bottom left of the screen was the text editing
box. In the upper right was information on the trial number, the seconds
remaining on the current trial, and the minutes of task time remaining
in the experiment. Below this was a list of text editing keys. The mouse
was disabled.
Each trial commenced with an initial three second delay where the
instructions for the trial were displayed and the text editing box was
visible but shaded in grey. After this delay the text editing box became
editable, as was evident by its background turning white. This triggered
the onset of trial completion time measurement. The cursor started in
the top-left position of the text box. During the trial the time remaining
on the trial counted down from 40 seconds. The trial ended automatically
either when the participant successfully made the editing change or when
40 seconds had passed since the commencement of the trial.
The text editing tips were included to encourage participants to adopt
new strategies. The 14 tips are presented in Table 2.8. The tip for each
2.2. METHOD
63
Figure 2.3: Study 2 text editing task screenshot
trial was randomly selected from the set of tips.
Trial-level performance was measured using a standardised form of
task completion time. Before standardisation occurred several cases and
trials were excluded for a range of reasons, details of which are presented
in Appendix B.4. Adjusted task completion times were based only on
retained data. Z-scores of task completion times for each trial type were
generated. These standardised times were then rescaled to the grand
mean and standard deviation of all retained trials. This is expressed
mathematically in Appendix B.2.
Models at the individual-level used trial number as a predictor. Trial 1
was deemed to start on the second actual trial of the experiment. Trials
that were not retained still contributed to the trial number. For example,
if trial 4 was not retained for an individual, a participant’s trial sequence
would be 1, 2, 3, 5. That is to say, trial number 5 was not adjusted to
64 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
Table 2.8: Study 2 Text Editing Tips
Text Editing Tip
1
2
3
4
5
6
7
8
9
10
11
12
13
14
Control+down moves the cursor to the start of the next paragraph;
Control+up moves the cursor to the start of the previous paragraph
Control+z can be used to undo a change
Home Key moves the cursor to the start of the current line; End
Key moves the cursor to the end of the current line
Control+Home moves the cursor to the beginning of the file; Control+End moves the cursor to the end of the file
Control+Shift+Left selects the word to the left; Control+Shift+Right selects the word to the right
Control+Left moves the cursor one word to the left; Control+Right
moves the cursor one word to the right
Control+Delete deletes the word in front of the cursor, or the remainder of the word in front of the cursor if the cursor is half-way
through a word
Control+backspace deletes the word behind the cursor
Shift+Down and shift+Up select whole lines of text
One way to delete text involves selecting text and then pressing
delete
Shift+down selects the line below; Shift+up selects the line above
Control+Shift+Down and Control+Shift+Up selects from the cursor to the end start of the previous or next paragraph
When editing, place your right hand on the cursor keys. Index
on Left and use to press Delete and Backsapace; middle on Up or
Down and use to press Home and End; Ring finger on Right.
When editing place your left hand near left corner of keyboard.
Little finger on left control; Ring finger on shift; Middle finger on
z; and index finger use to press x, c or v.
be trial 4.
The study used a fixed duration for all participants. As such the
number of trials varied between participants. In order to calculate grouplevel task completion time and strategy sophistication, trial numbers
were rescaled within each individual as a percentage of the total trials
for that individual that were completed. This percentage completion
measure of trial number was then broken up into 30 blocks. Performance
time and strategy sophistication at the block-level was the mean of the
2.2. METHOD
65
corresponding trial-level measures.
Procedure
Table 2.9 sets out the experimental protocol for Study 2. Participants
were given a copy of the personality test to complete in their own time
prior to the main experimental session. In the main experimental session participants completed the three general ability tests and the two
perceptual speed tests. They were then given a 5 minute break.
Table 2.9: Study 2 Experimental Protocol
Task
Duration
One Week Prior to Main Experimental Session
100 Item IPIP Personality Test
Approx 10 min
Main Experimental Session
Ability Testing
Inference Test
Extended Range Vocabulary Test
Cube Comparison Test
Number Comparison Test
Number Sort Test
Text Editing Task
Instructions and Training
Trials
One week after the Main Experimental Session
Simple Reaction Time V2.0
4-Choice Reaction Time V2.0
Anglim Typing Test
Prior Experience Questionnaire
1
1
1
1
1
min
min
min
min
min
instructions;
instructions;
instructions;
instructions;
instructions;
12 min task
12 min task
6 min task
3 min task
3 min task
10 min
30 min
1
1
3
4
min instructions; 4 min task
min instructions; 5 min task
min
min
Participants were then given verbal text editing instruction made up
of four parts: (a) motivating introduction, (b) overview of hand positioning, (c) follow-along demonstration of text editing keys, and (d) task
specific instructions. The instructions aimed to motivate participants
and give an initial exposure to each text editing key. Details of the in-
66 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
structions are provided in Appendix B.3. Participants then completed as
many trials as they could in 30 minutes. Participants completed the psychomotor ability, prior experience, and typing test one week later. This
chapter only analyses strategy sophistication and task completion time
on the text editing task. Description and analysis of individual difference
measures are presented in Chapter 3.
2.2. METHOD
2.2.5
67
Study 3
Participants
Study 3 participants consisted of 154 adults recruited from a third year
undergraduate psychology subject at an Australian university who consented to allow their data to be used for research purposes. All participants had a typing speed above 15 words per minute and completed the
main performance task adequately. Participants included 109 females,
44 males, and 1 unknown. Median age was 21 (mean = 22.5, min = 19,
max = 44.5). Participants were randomly allocated to one of three conditions. For the purposes of this chapter, only the No Training Condition is
described and analysed. Discussion and analysis of the other conditions
is presented in Chapter 4. The No Training Condition analysed in this
chapter had 61 participants.
Text Editing Task
The text editing task in Study 3 was similar to Study 2. As with Study 2,
the task was programmed in Visual Basic.Net. The program was run on
Dell Optiplex GX520 computers with Intel Pentium 4, 3.00 GHz CPUs
and 504 MB ram, running Windows XP Pro SP3. The displays were
15 in. LCD displays with 60 hz refresh rates set at a screen resolution of
1200 width by 1024 height. A screenshot of the main display is shown in
Figure 2.4
Each trial required the participant to use the keyboard to delete a
single continuous set of words marked in red. This required the participant to navigate the cursor to the location of the highlighted words and
delete the words.
Each trial involved one randomly selected editing item drawn from a
set of 500 items. Each item required the deletion of one to ten consecutive
words. The number of words to be deleted on the trial was labelled the
68 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
deletion length. The set of editing items was generated by taking an
initial five-paragraph passage of text and adding deletion text after an
insertion point. The deletion text consisted of words drawn from the
500 most common words in the English language (as sourced from, Fry,
Kress, & Fountoukidis, 1993). Fifty insertion points were selected so that
the deletion text was in different positions on different trials. Thus, the
500 items were generated by crossing the 50 insertion points with the ten
deletion lengths. An example of the text to be edited with deletion text
included is shown in Table 2.10. The complete list of the editing items
is shown in Appendix C.2.4.
2.2. METHOD
69
Figure 2.4: Text editing task during a representative trial. The task
goal was presented to participants as “Remove red text”. Information
about the block number, trial number, and seconds past (Trial Time)
are displayed in the top right of the display. Performance feedback is
displayed on the middle-right of the screen. This feedback includes the
number of seconds to complete the previous trial and the average time in
seconds to complete all preceding blocks. In the space between top-right
and middle-right, messages were displayed when certain events occurs.
These included when there was no user activity for longer than five seconds and when the participant made an error related to having too many
or too few spaces.
70 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
Table 2.10: Study 3 Example of Editing Requirements
Text to be Edited
Alice was beginning to get very tired of sitting by her
sister on the bank, and of having nothing to do: once
or twice she had peeped into the book her sister was
reading, but it had no pictures or conversations in it,
’and what is the use of a book,’ thought Alice ’without
pictures or conversation?’
So she was considering in her own mind (as well as she
could, for the hot day made her feel very sleepy and
stupid), whether the pleasure of making a daisy-chain
would be worth the trouble of getting up and picking
the daisies, when suddenly a White Rabbit with pink
eyes ran close by her.
There was nothing so very remarkable in that; nor did
Alice think it so very much out of the way to hear
the Rabbit say to itself, ’Oh dear! Oh dear! I shall
be late!’ (when she thought it over afterwards, it
occurred to her that she ought to have wondered at
this, but at the time it all seemed quite natural);
but when the Rabbit actually took a watch out of its
waistcoat-pocket, and looked at it, and then hurried
on, Alice started to her feet, for it flashed across
her mind that she had never before seen a rabbit with
either a waistcoat-pocket, or a watch to take out of
it, and burning with curiosity, she ran across the
field after it, and fortunately was just in time to see
it pop down a large rabbit-hole under the hedge.
food war that sometimes ten from up sing problem In
another moment down went Alice after it, never once
considering how in the world she was to get out again.
The rabbit-hole went straight on like a tunnel for some
way, and then dipped suddenly down, so suddenly that
Alice had not a moment to think about stopping herself
before she found herself falling down a very deep well.
2.2. METHOD
71
Before each trial, there was a one second delay after which the editing
box became active and the trial commenced. At the start of each trial
the cursor was located at the top-left position of the text box. Trials
ended either automatically when an edit was successfully completed or
after 30 seconds had passed since trial commencement. At the end of each
trial, task completion time for the trial that had just ended was displayed.
There were also some additional messages displayed when participants
engaged in off-task behaviour or made errors (see Appendix C.2.2).
Trials were grouped into blocks. After the passage of 60 seconds on
a block, the ending of the active trial triggered the end of the block.
Within a block, the end of each trial and the beginning of the next trial
were separated by one second where no edits could be made. Block-level
performance was the mean trial completion time in the block excluding inaccurate trials. This system was designed to ensure that the total
length of the experiment was similar for each participant. This blocking
structure was designed to: (a) get reliable block performance measurement, (b) have a sufficient number of blocks to model learning curves, and
(c) enable trial-level modelling if desired without introducing block-level
artefacts.
At the end of each block, participants were given feedback on the
number of blocks out of 30 that they had completed and the average task
completion time for accurate trials for the block that had just ended.
Participants could then choose to press ENTER to continue to the next
block or press F1 to show a list of text editing keys. Figure 2.5 shows
an example screenshot of the end-of-block feedback screen after pressing
F1.
72 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
Figure 2.5: Study 3 text editing task screenshot at the end of a block.
The display also shows the list of text editing keys displayed after a
participant presses F1 at the end of a block.
2.2. METHOD
73
The main measure of task performance was time to complete a trial.
Inaccurate trials (i.e., those that timed out after 30 seconds) were excluded. In order to control for item effects, raw trial completion times
were converted to z-scores within each item. Z-score-scaled trial completion times were then rescaled to the typical time for a trial throughout the
experiment (M = 6.2 sec and SD = 3.2 sec). Block completion time for
each individual was calculated as the mean of rescaled trial completion
times for trials in the given block.
Procedure
Table 2.11 sets out the procedure for Study 3. The personality test was
completed in a supervised setting one week prior to the main experimental session and administered on computer using Inquisit v.3.0.
Introductory Instructions Initial instructions were read out to participants. The verbatim text of the instructions is presented in Appendix C.2.1. The following summarises these instructions. First, participants were instructed to move to seats so that there was a gap between
each participant. Participants were then informed of the nature of the
task (keyboard text editing) and were given information about the value
of the skill in educational and work domains in order to motivate them
to learn. Participants were also told that the quality of the experimental results depended on them trying hard. Participants were then given
an overview of the experiment and the time involved in each component.
Participants were given a series of rules that they were requested to follow
to ensure experimental conditions were maintained. Participants then
completed the typing test, demographics, and prior experience measures
before commencing the text editing task.
74 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
Table 2.11: Study 3 Experimental Protocol
Task
Approximate Duration
Two Weeks Prior to Main Experimental Session
PLS and Consent
5 min
One Week Prior to Main Experimental Session
50 Item IPIP Personality Test
10 min
Initial Section
Introductory Instructions
Anglim Typing Test
Demographics Questionnaire
Prior Experience Questionnaire
10
3
1
1
Text Editing Task
Instructions
Initial Training
Practice Trials Blocks 1 to 15
Mid-Practice Condition
If Training: Mid-Practice Training
If Control: 4 Choice RT Task
If No Training: Continue to practice
Practice Trials Blocks 16 to 30
Participant Debriefing
min
min
min
min
4 min
4 min
25 min
5
5
0
25
min
min
min
min
Note. Exact timing of most sections varied based on participant behaviour.
Initial Text Editing Instructions Instructions for the text editing
task were administered on computer. The exact wording of these instructions is presented in Appendix C.2.1. In summary, participants were told:
(a) of the importance of reading the instructions carefully; (b) the nature
of the task and the real-world value of text editing; (c) to not use the
mouse and place it behind the computer; (d) about the instructions and
structure of the task including number of blocks, relationship between
trials and blocks, nature of trial pausing, and what to do if an error
is made; (e) about the rules to follow in order to ensure experimental
rigour; (f) about proper finger placement for effective text editing; and
2.2. METHOD
75
(g) about what to do on the initial training trials.
Initial Training Participants then completed initial training trials on
keyboard text editing. Each practice trial introduced one or more text
editing key combinations as listed in Table 2.12. Each trial displayed a
practice paragraph where participants were required to apply the introduced text editing keys. Participants could only proceed to the next trial
once they had pressed each of the keys a specified number of times. This
ensured that participants did not skip the practice period.
Table 2.12: Study 3 Keys Practiced in Initial Training Trials
Trial Keys Practiced
1.
2.
3.
4.
5.
6.
7.
8.
9.
10.
11.
Up, Down, Left, Right
Ctrl+Left, Ctrl+Right
Ctrl+Up, Ctrl+Down
Home, End
Ctrl+Home, Ctrl+End
Shift+Up, Shift+Down, Shift+Left, Shift+Right
Ctrl+Shift+Left, Ctrl+Shift+Right
Shift+Home, Shift+End
Backspace, Del
Ctrl+Backspace, Ctrl+Del
Ctrl+X, Ctrl+C, Ctrl+V, Ctrl+Z
Once participants completed the training trials, the following message
was displayed:
Key instructions have ended. You can bring up the list of text
editing keys at the end of each block. On the next screen the
main task begins. Your task is to use text editing keys to
remove the text marked in red as quickly as possible. Press
Enter To Continue.
Main Task Participants in the condition presented in this chapter (i.e.,
the No Training condition) then completed 30 blocks of trials of the text
76 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
editing task without interruption. Details about the other conditions are
described in the method section of Chapter 4.
2.2. METHOD
2.2.6
77
Analysis Plan
The following section sets out the quantitative methods used to test the
proposed hypotheses. The general orientation combined model fitting
with extensive use of graphics. Models were broadly evaluated in terms:
(a) fit to the data, (b) parsimony as indicated by fewer free parameters,
and (c) theoretically meaningfulness as indicated by plausible predictions outside the range of the data. All analyses were performed using
R 2.11.1 (R Development Core Team, 2010). In addition to base R
the following R packages were used: (a) psych (Revelle, 2010) for some
descriptive statistics and test scoring, (b) lattice (Sarkar, 2010) for
generating trellis plots of individual-level learning curves. (c) Sweave
(Leisch, 2002) for integrating tables, figures, and results text into the
thesis in a reproducible manner.
All functions proposed in the introduction for modelling practice and
strategy use and practice and task completion time were modelled using
nonlinear least squares regression (for theoretical treatment, see Bates
& Watts, 1988; Seber & Wild, 2003) using the nls function (Bates &
Watts, 1988) in R (for discussion of use of nls, see Huet et al., 2004; Ritz
& Streibig, 2008).
Estimation was performed using the nls function in R (Bates & Watts,
1988) using the Gauss-Newton algorithm. For each model, several different starting strategies were adopted, which involved (a) estimating the
full model, (b) estimating a set of models with one parameter fixed for
each model but varying over the set of models, (c) fixing one or more parameters that would otherwise be free to vary. The resulting model with
the smallest residual sums of squares was retained as the actual model for
subsequent model summary purposes. These alternative strategies were
particularly important because (a) individual-level data was often noisy,
(b) model estimation was desired even for models that were clearly not
78 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
representative of the data generating process, and (c) some of the models had a relatively large number of parameters relative to the number of
data points.
The models estimated and reported in this thesis are all designed
to incorporate parameters that lead to plausible predictions outside the
range of the data and reflect reasonably plausible psychological processes.
This raises a range of issues regarding how possible values of parameters
should be constrained to enforce the above properties. While other parameter constraints might be considered, in this thesis the only constraint
applied to parameter estimates was that parameters representing breakpoints were forced to occur after the fifth block/trial (note Study 1 uses
trials, and Study 2 and 3 uses blocks of trials), and before the fifth last
block/trial. Such breakpoint parameters occur in CnSE, CnCn, P2BP2,
and E3B models. This meant for example that the CnSE model was
required to predict constant strategy sophistication at least in the first
five blocks, whereas without this constraint, it could discard the constant component and perfectly replicate the SE model. This constraint
was employed to increase the chance that the breakpoint would have a
psychologically meaningful interpretation, and was not just used to give
additional freedom to model fitting.
When comparing models, smaller AIC and root mean squared error
(RMSE or just se) values were used to indicate model superiority. Where
two main models had the same number of parameters, R2 was also used
as a basis for comparison. At the individual-level, paired-samples t-tests
were used to compare the RMSEs for selected model comparisons: (a)
three parameter power with three parameter exponential; (b) four parameter power with APEX; and (c) APEX with three parameter exponential
with breakpoint. Also, a chi-square test was used to assess whether the
number of fits at the individual-level supporting one model compared to
another was greater than chance.
2.3. STUDY 1 RESULTS
79
The subsequent results section is at first instance organised around
studies. Within each study results for practice and performance are
presented followed by results for practice and strategy sophistication.
Within each of these sections group-level, and then individual-level results are presented. Within each of these subsections, graphical and
model fitting approaches are presented.
2.3
2.3.1
Study 1 Results
Practice and Performance
Group-level
Figure 2.6 shows the group-level relationship between practice and task
completion time. Despite a few bumps the plot shows a monotonically
decreasing (consistent with Hypothesis 2.3) and decelerating function
that approaches an asymptote towards the end of practice.
Table 2.13 shows the model fits statistics for group-level data. Consistent with Hypothesis 2.1, the three parameter power function provided
better fit than the three parameter exponential function as evident in the
larger R2 and smaller AIC.
To determine whether results were broadly robust to the particular
accuracy adjustment performed, several additional analyses were performed. Models were fit to raw reaction time, and the same general
pattern persisted when comparing three parameter power with three parameter exponential models with power being superior at the group-level
and exponential being superior at the individual-level. While there are
reasons to believe accuracy adjusted reaction times are a more valid measure of performance on the task, core results were robust whether this
adjustment was applied or not.
80 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
Table 2.13: Study 1 Fit Statistics for Models of the Effect of Trial on
Task Completion Time at the Group-Level
1.
2.
3.
4.
5.
6.
7.
8.
Model
ka
se
R2
AIC
Cn
P3
E3
APEX
P4
CnCn
P2BP2
E3B
1
3
3
4
4
3
5
5
6.039
0.730
1.384
0.738
0.737
3.379
0.693
1.114
.000
.986
.949
.986
.986
.699
.988
.969
350.44
124.24
193.27
126.22
126.07
289.65
120.44
171.66
Note. Model name abbreviations were previously defined in Section 2.1.2
a
k = Number of parameters.
Individual-level
Description and Plots Figure 2.7 shows the relationship between
practice and task completion time for each participant. As expected,
trial-to-trial variation in performance is relatively small but still greater
than the group-level. All participants improved with practice. For most
participants the rate of improvement appears to be monotonically decelerating. However, some participants seem to approach a personal asymptote earlier than others, and there are a few cases where improvement is
almost linear (e.g., cases 14, 33).
Figure 2.8 shows examples of cases for each of the modelled functions.
Cases were chosen that generally fit well for the function relative to other
individuals.
Power versus Exponential Function Table 2.14 summarises
individual-level model fit information for the relationship between practice and task completion time. Supporting Hypothesis 2.2, the exponential function provided substantially superior fit in comparison to the
power function (rmseP 3 = 5.24, rmseE3 = 5.10). Using a paired samples
t-test, this was a statistically significant difference, t(62) = 2.46, p = .02.
2.3. STUDY 1 RESULTS
81
In terms of individual cases, the exponential function beat the power
function in 39 cases, compared to only 24 wins for the power function.
This was almost statistically significant when compared to a
chi-square goodness of fit test assuming 50% wins for both functions,
χ2 (df = 1) = 3.57, p = .06.
APEX versus P4 Function Table 2.14 summarises individual model
fit information for three parameter power and exponential functions.
There was little difference in the fits of the four parameter power
and the APEX function, rmseAP EX = 5.19, rmseP 4 = 5.19. Using a
paired samples t-test, this was not a statistically significant difference,
t(62) = 0.144, p = .89. In terms of individual cases, the APEX function
beat the power function in 33 cases, compared to 30 wins for the power
function. This was not a statistically significant difference when compared to a chi-square goodness of fit test assuming 50% wins for both
functions, χ2 (df = 1) = 0.143, p = .71.
E3B versus APEX Function The three parameter exponential break
function provided superior fit in comparison to the APEX function
(rmseE3B = 4.73, rmseAP EX = 5.19). Using a paired samples t-test,
this was a statistically significant difference, t(62) = 6.12, p < .001. In
terms of individual cases, the E3B function beat the APEX function in
58 cases, compared to 5 wins for the APEX function. This was statistically significant difference when compared to a chi-square goodness of fit
test assuming 50% wins for both functions, χ2 (df = 1) = 44.59, p < .001.
82 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
Table 2.14: Study 1 Mean Model Fit Statistics for the Effect of Trial on
Task Completion Time at the Individual-Level
1.
2.
3.
4.
5.
6.
7.
8.
Model
ka
se
R2
AIC
Cn
P3
E3
APEX
P4
CnCn
P2BP2
E3B
1
3
3
4
4
3
5
5
8.207
5.239
5.100
5.194
5.186
5.754
4.754
4.730
.000
.562
.579
.595
.581
.484
.646
.647
370.30
323.53
321.17
323.10
323.14
335.63
315.74
315.16
Note. Number of individual learning curves = 63.
a
k = Number of parameters.
2.3. STUDY 1 RESULTS
Task Completion Time (sec)
60
83
●
50
●
●
●
40
●
●
●
●●●
●
30
●●●●●
●
●
●
● ● ●●●
●●●●●● ● ●
●
● ● ●●●●●●● ●●●●●
●
●●●● ●●
20
10
0
0
10
20
30
40
50
Trial
Figure 2.6: Study 1 task completion time by trial at the group-level.
84 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
0 20 40
92
93
0 20 40
95
0 20 40
96
97
98
99
●
●●
●
● ●
●
●
●
●
●● ● ●●
●
●●
●
● ● ●● ●●
●
●
●
●
●●●●
●●●
●
● ●● ●
●
●
●
● ●●
●
●
●
●
●
●●
●
●
●●
●
●
●●●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●● ●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●● ●●
●
●
●
●
●●● ●●● ●●
●
●
●
●
●
●●
●
●●
●● ●●
●●●
●● ●
●●
●●●● ●
●●
●
●
●●
●●
●
●
●
●● ●
●●
●
●
●● ● ●
●
●●
●
●
●
● ●●
●●
●●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●●
●●●●
●●
●● ●
●
●
●●
●
●
●
●
●
●
●●●
●
●
●
●
●
●●
●●
●●●●● ●
●●
●
●●
●
●
●
●
●●
●
● ● ● ●●●
●
●
●
●●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●●●
●●●
●
●
●
●●
●
●
74
60
20
61
Task Completion Time (sec)
76
77
79
81
●
● ●
●
●●
●
●
●
●
●
●
●●●● ●
● ●
●
●
●
●
●
●●●
●
●●●●
●
●
●● ● ● ●● ●●
● ●
●
●
●
●
●
●
●
●
● ●●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●●●●
●
●
● ●●●
●● ● ● ● ● ●
●
●
●
●●
●
●
●●
●
● ●●
●●
●
●
●
●●
●
● ●
●●●
●● ●●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●
●●
●
●●
●●●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●●
●●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●● ●●
●
●●●
● ●●
●
●
●
●
●
●
●
●
●
●
●
● ●●●
●
●
●
●
●
●●
●
●
●
62
63
65
60
20
82
89
91
67
68
7
●●
●
● ●
●
●
●●
●
●●
●
●
●
●
●
●
● ●● ● ●●
●
●
●
●
●
●
●
●● ●
●
●●●
●
●●
●
●
●
●
●
●
●●
●
●
●
●●●
●●
●
●
●
●
● ●
●
●
●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●●●
● ● ●
●
●
●
●
●● ● ●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
66
●
●
●
●
●
●
●
●
●
● ●
●
●●
●
●
●●●
●
●●
●
●
●
●
●●
●●●
●
●
●
●
● ●●
●
●
● ●
●
●
●
●● ●
●
●● ● ● ●
●
●
●
●●
●
●●
●
●● ●●
●
●
●●●
●●●●
●
●
●
●
●●
●
●●●
●
●
●●
●
●
●
●
●●
●
●●
●●●
●●●
●
●
●
●
●
●●●●●
●
●
●
●
●
●●
●
●
●
● ● ● ●
●
●
●
●●
●
●●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●●
●
● ●●●
●
● ●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
● ●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●●●
●
●
●
●●●
●
●
●
●● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●●●●●
●
●●
●●
●
●●
●●
●
●
●
●
●
● ●●
●
●
●
●
●
●●
●
●
●
●
●●
●
●●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●●
●●●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
5
60
20
50
52
53
58
●
●
● ●
●
●
●
● ●
●●●
●
●
●
●
●
●●●
●●
●
●●
●
●●
●● ●● ●
●●
●●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●●●●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●●●
● ●
●
●●
●
●
●
●
●●
●
●●
●●●
●
●
●●
● ●●●
●
●●
●
●
●
●
●
●
●
●●
●●
●●●●
●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●●
●
●
●
●●
●
● ●
●
●●
●
●
●
●
●
●
●●
●●
●●
●
●
●
●
●
●
●
●
●●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
4
40
43
44
46
59
6
60
20
60
●
●
●
● ●● ● ●● ●● ●●● ●
●
●
●●
●●●●● ● ●
●
●
●
●
●●
●
●
●●●
●
●
●
●●
●
●
●
●● ●
●
●●
●
●
●
●●●●●
●●
●
●
●
●●
●
●
● ●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●●
● ●
●
●
●
●
●
●
●●
●
●
●
●
●
●●
●●
●
●
●●●
●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
47
48
49
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●●
● ●●
●
●
●
● ●●●
●
●
●
● ●
●
●●●● ● ●● ●●
●
●●
●
●●
●
● ●● ●
●
●
●
●
●
●
●
●
●
●
●●
●●●
●●●
●
●
●●●
●
●
●●
●
●●
●
●
●●
●●●
●
●
●
●●
●● ●●
●
●
●●
●
●●
●●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●● ●●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●● ●
●●
●●●
● ● ● ●●●●●
●●
●●
●
●
●
●●●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
● ● ●
●
●
●
●
●
●● ●●
●
●●
●
●●●●●
● ●
●
●●●●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●●
●●●
●
●●
●
●
●
●● ●●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●●
●
●
●
●
●●
●●
●
●
●
●
●
●●●
●
●●
●
●
●●
●●●
●●
●
●●
●
●
●
●●
●
●
●
●
●
●●
●●
●
●●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
22
60
20
27
3
32
33
35
36
38
●● ● ●
●●
●
●
●
●
●●●● ●●
●
● ●● ●
●
●●
●
●●●●
● ●
●
●
●●
●
●
●
●
● ●● ● ● ●
●● ●
●
●
●
●
●●
●
●
●●●
●
●
●
●
● ●●●
●
●●
●●●● ●
●●
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●
●●●
●● ●●
●
●
●
●
●
●●●
●
●
●
●
●●
●
●● ●●
●
●●
●●
●
●
●
● ●●
●●
●
●
●
●
●
●
●
●
●
● ●● ●
●
●
●
● ●●● ● ●● ●
● ●
●
●
●●
●
●
●
●
●●●●
●
●
●
●
●
●
●
●●● ●● ●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●●●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●●●●
●
●
●
●●
●●●●
●●
●
●●
●
●
●
● ●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●●
● ●●
●
●●
●●
●●
●●
●
●
●
●●
110
114
12
13
14
15
2
21
●
●
●
●●
●
●●
●
●●● ●
●●
●
●
●
●●
● ●
●●
●
●●
●
● ●
●●
●● ●
●●
●
●●
●
●
●
●●●
●
●●
●
●●● ●
●
●
● ●●
●●
●
●● ●
●●
●
●●●
●
●●
●● ●
●
●
●●
●●
●●
●
●
●
●
●
●
●
●
●●●
●
●● ●
●
●
●
●●
●
●
●●
●●●
● ●
●●
●
●
●
●●
●●
●
●
●●●●●
●
●
●
●
●
●
●
●
●
●●●
●
●●
●
●
●
●
●●
●
●●●
●
●●
●
●
●
●
●
●
●
●
●● ●
●
●
●
●
●
●
●●●●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●●
●
●
●● ●
● ●●
●●
●
●
●● ● ● ●
●
●●●●
●●● ● ● ●●
●●●
●
●
●●
●
● ●
●
●
●●
●
●●
●
●●
●
●●
●
●
●
●
●
●●●
●●●●●
●
●
●
●
●
●●
●●●
●
●●●
●
●●●
●
● ●
●●
●
●
●
●
●
● ●
●●●
●●●
●
●
●
●
●
●
● ●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●●
●
●●
●●
●
●
●
●
●
●
●●
● ●●●●●
●
●
●●
●
●
●
●
●
●●
●
●●●
●
●
●
●●
●●●●
●●●●●
●
●
●
●
●
●
60
20
60
20
●
●
●
100
101
103
104
105
108
109
11
●●
●
●● ●● ●
●
●●
●
● ●
●
●●
●●●●
● ●
●
●
●
●
●●
●
●●●●●●● ●
●
●●
●
●●●
●●●●
●
●●
●●
●●●●
● ●●
●
● ●
●
●
●
●●
●●
●●●
●●● ● ●
● ● ●
●●
●●
●●
●
●
●● ●
●
●
●●
●
●
●
●●
●●
●●
●
●●●
●●
●●
●
●
●
●
●●
●
●
●
●●●●
●
●
●
●
●●●
●
●
●
●
●
●●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●
●●
●●●
●
●
●● ●
●
●●●
● ●●●●
●
●
●●
●
●●●
●●
● ●
●
●
●●●
●
●
●●●
● ●●●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●● ●
●
●
●
●
●
●
●●●
●
●●●
●●
●
●
●
●
●●
●●
●
●
●● ●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●●
●
●
●
●●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●●●
●
●
●
●
●
●
●●
●●
●
●●●●●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ● ●
●
●●
●
●
●
●●
0 20 40
0 20 40
0 20 40
0 20 40
Trial
Figure 2.7: Study 1 task completion time by trial at the individual-level.
60
20
2.3. STUDY 1 RESULTS
30
●
0
30
20
10
40
40
30
20
●●
●
● ●
●●●●
● ●
●
●●
●● ● ●
●● ●● ●
●
● ●
● ● ●●
●
●
●
●●
●●●
●
●
●
●
●● ●
●
● ●●
●
●
●●
●
10
0
0
20
40
50
40
30
20
●●
●
● ●
●●●●
● ●
●
●●
●● ● ●
●● ●● ●
●
● ●
● ● ●●
●
●
●
●●
●●●
●
●
●
●
●● ●
●
● ●●
●
●
●●
●
10
0
0
20
40
0
20
40
Trial
Trial
Trial
Trial
P4: ID=67
CnCn: ID=63
P2BP2: ID=67
E3B: ID=14
●
●
●
●
●
●
●
●
●
●●
●● ●
●●
●
●
●
● ●
●
●
●●●
●●
●
●
●●● ●●●
●
●●
●●
●
●
●●●
● ●●
●
●
10
0
Task Completion Time (sec)
Task Completion Time (sec)
20
●
40
20
●
●
●
●
●
●
●
●●
●● ●
●●
●
●
●
● ●
●
●
●●●
●●
●
●
●●● ●●●
●
●●
●●
●
●
●●●
● ●●
●
●
0
0
50
30
●
50
50
40
30
●
20
10
●
● ●●
●●● ●
● ●●
●●● ●
●● ● ● ●
●
●●●●
●
●
●●●●
● ●●●●●●
●●
● ●
●●●●●●●
●
●
●
0
0
20
40
Trial
50
●
●
40
30
20
●
●
●
●
●
●
●
●
●●
●● ●
●●
●
●
●
● ●
●
●
●●●
●●
●
●
●●● ●●●
●
●●
●●
●
●
●●●
● ●●
●
●
10
0
0
20
40
Trial
Task Completion Time (sec)
10
●
● ●●
●●● ●
● ●●
●●● ●
●● ● ● ●
●
●●●●
●
●
●●●●
● ●●●●●●
●●
● ●
●●●●●●●
●
●
●
●
40
Task Completion Time (sec)
20
●
APEX: ID=14
Task Completion Time (sec)
40
50
E3: ID=14
Task Completion Time (sec)
50
P3: ID=67
Task Completion Time (sec)
Task Completion Time (sec)
Cn: ID=63
85
50
40
30
20
●●
●
● ●
●●●●
● ●
●
●●
●● ● ●
●● ●● ●
●
● ●
● ● ●●
●
●
●
●●
●●●
●
●
●
●
●● ●
●
● ●●
●
●
●●
●
10
0
0
20
40
0
Trial
Figure 2.8: Study 1 examples of model fits of the relationship between
practice and task completion time at the individual-level.
20
40
Trial
86 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
2.3.2
Practice and Strategy
Group-level
Figure 2.9 shows the relationship between practice and strategy sophistication at the group-level. The relationship is approximately linearly
increasing with a slight deceleration of the rate of change over the course
of practice. Model fits are shown in Table 2.15. Consistent with Hypothesis 2.10 the Michaelis–Menten function provides a good fit. The almost
linear increase in strategy sophistication is somewhat surprising given
that task completion time rapidly approached an asymptote. In contrast, the strategy sophistication curve suggests that substantially more
improvement would have occurred had the period of practice been longer.
While a linear model would provide reasonable fit, it is not reported because of the previously discussed problems associated with polynomial
models, such as their failure to predict effectively outside the range of
the data. It is also worth remembering that the lower r-squared observed
for the CnSE model relative to the SE model is due to the parameter
constraint on the CnSE model that required the breakpoint to occur at
least five trials into practice.
At the group-level many of the models provided similar levels of
fit. However, the Michaelis–Menten and Saturated Exponential function seemed the most reasonable. There was no obvious constant at the
start ruling out the Constant–Saturated Exponential model, and there
was no obvious S-shape, ruling out the Logistic model.
Individual-level
Figure 2.10 shows the relationship between practice and strategy sophistication at the individual-level. Table 2.16 summarises the support for
the proposed models at the individual-level.
Consistent with Hypothesis 2.9, several participants never used so-
2.3. STUDY 1 RESULTS
87
Table 2.15: Study 1 Fit Statistics for Models of the Effect of Trial on
Strategy Sophistication at the Group-Level
Model
1.
2.
3.
4.
5.
6.
Cn
MM
SE
Lg
CnCn
CnSE
ka
se
R2
AIC
1 0.060 .000 -147.58
3 0.009 .980 -354.14
3 0.009 .980 -354.66
3 0.009 .980 -355.50
3 0.029 .771 -223.24
5 0.009 .980 -352.07
Note. Model name abbreviations were previously defined in Section 2.1.3
a
k = Number of parameters.
Table 2.16: Study 1 Mean Model Fit Statistics for the Effect of Trial on
Strategy Sophistication at the Individual-Level
Model
1.
2.
3.
4.
5.
6.
a
Cn
MM
SE
Lg
CnCn
CnSE
k = Number of parameters.
ka
se
R2
AIC
1 0.088 .000 -225.87
3 0.060 .299 -249.46
3 0.056 .364 -253.04
3 0.052 .308 -254.70
3 0.048 .435 -264.03
5 0.052 .358 -207.07
88 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
phisticated strategies (e.g., cases 93, 96, 97). Consistent with Hypothesis
2.8 some participants almost always used sophisticated strategies (e.g.,
cases 11, 110). Some participants shifted to sophisticated strategies fairly
quickly in the early trials (e.g., cases 3, 58, 63). Consistent with Hypothesis 2.5 the speed of transition varied between participants. Constant–
Constant suggests an abrupt onset and shift. Constant–Saturating Exponential suggests an abrupt onset but a more gradual shift. Consistent
with Hypothesis 2.6 several participants only partially adopted sophisticated strategies (e.g., cases 35, 46, 68). Figure 2.11 illustrates for each
function one participant with the model fit overlayed.
Broadly consistent with Hypothesis 2.7, many of the participants
who increased in their strategy sophistication throughout practice started
their increase in the first trial (e.g., cases 7, 22, 79, 105). However, there
was also a large number of participants where the onset of a strategy
shift occurred halfway through practice. A small number of participants
appear to have more than one strategy onset point (e.g., cases 43, 49).
Also a couple of participants briefly used more sophisticated strategies
before reverting back to simpler strategies. Participant 74 provides a
particularly clear example of this.
2.3. STUDY 1 RESULTS
89
1.0
Strategy Sophistication
0.8
●
● ● ●
0.6
● ● ●
● ●
● ● ●
● ●
●
●
● ● ● ● ●
● ● ● ●
●
●
●
●
0.4
●
0.2
0.0
0
5
10
15
20
25
Block
Figure 2.9: Study 1 strategy sophistication by trial at the group-level.
30
90 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
0 20 40
92
93
0 20 40
95
0 20 40
96
97
98
99
●
●●
●
●●
●
●
●
●
●
●
●
●
●●●
●●
●
●
●
●
●●●
●●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●●● ●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●● ●●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
74
Strategy Sophistication
0.8
0.4
0.0
76
●
●●
●●●●
●●
●
●●●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●●●●
●
●● ● ●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
61
62
77
79
81
82
89
0.8
0.4
0.0
91
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●●
●●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●●●
●
●
●
●●
●
●
●
●●
●
●
●●
●
●●●
●
●
●●
●
●●
●
●●●
●
●●●
● ●●
● ●●
●
●●
●
● ●
●●
●
● ● ●
●
●●● ●
●
●
●● ●
●
●
●
●
●
●
●
●●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
63
65
66
67
68
7
●●●
●●
●●
●●
●
●
●
●
●●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●●●
●
●
●
●
●
●●
●
●
●●●
●
●
●
●
●●
● ●
●●
●
●
● ●●●
●●●●●
●
●●
●●
●●
●
●●●
●
●●●●
●●●
●●
●
●
●
●●●●
● ●
●
●●●
●
●
●●●
●
●
●
●
●●
●
●
●
●
●
●
●●●●●
●●
●●
●●
●
●●
●
●
● ●●
●●●●●
●
●
●●
●
●
●
●●
●
●
●●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●●●●
●
●
●
●
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●●● ●
●
●●●
●●
●
●
●●●
●
●●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●● ●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
5
0.8
0.4
0.0
50
52
53
58
●●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●● ● ●
●
●
●
●
4
40
43
●
●●
●●
●
●
●●
●
●
●●
●●
●
●
●●
●
●●
●
●
●
●
●●
●
●
●
●●
●●
●
●
●
●●
●
●
●
●
●●●●●
●
●●● ●
44
46
59
6
60
●●●
●●
●
●●●
●
●
●
●
●●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
47
48
49
● ●●●●
●●
●
●●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●● ●●
● ●
●
●
●
●
●
●
● ●●
●●
●
● ●●
●
●●●
●●
●
●●
●●●●●●
● ●●
●
●
●●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●●●●●●●●
●
●
●
● ● ●●
●
●
●
●
●
●●●
●
●●
●
●
●
●
●●
●
●
●
●
●
●●
●●
●
●
●
●
●●
●
●●
●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●●
●●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
22
0.8
0.4
0.0
27
3
32
33
35
36
38
0.8
0.4
0.0
●
●●
●●
●
●
●●●
●
● ●●
●●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●●
●
●
●
●●●
●
●
●●
●
●
●●
●●
●
●● ●
●
●
●
●●
●●
●
●●
●
●
●
●●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●●● ●●
●
●● ●●
●
●●
●
●
●
●
●
●●
●●
●
●●
●
●
●
●●
●
●
●●
●●●
●● ●●
●
●●
●●●
●
●
●
●●
●●
●
●● ●
●●●
●
●●
●● ●
●●
●
● ●●
●
●
●
●●
●●
●●●
●
●
●
●
●
●●●
●●
●
●
●● ●●
●
●●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●●
●
●
●●●
●
●
●
●
●
●●
●●●
●●
●
●●●●
●●
●●●
●
●
●●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
110
114
12
13
14
15
2
21
●●
●
●
●
●●●●
●
●●
●
●●●
●●●
●
●● ●
●
●●●
●●
● ●
●●
●
●
●
●●●●●●● ●
●
● ●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●●
●●
●
●
●●●●●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
● ●●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●
●●
●
●
●
●●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
100
0.8
0.4
0.0
0.8
0.4
0.0
101
103
104
105
108
109
11
●
●●
● ●●
●●
●●●
●●
●
●●
●●
●●●
●●
●
●
●
●
●
●●
●
●●●
●
●
●
●●●●
●
●
●●
●
●
●
●
●
●●
●
●
●●
●
●
●●●
●
●
●
●●
●
●
●●●●●
●●
●
●
●
●
●
●●●
●●●●●●●●●
●●
●
●
●●
● ●●
●●
●
●
●●●
●●
●
●
●
●● ●
●●● ●
●
●●
●
●
●
●
●
●
●
●●●
●
●●
●
●
●
●
●
●
●
●
●●
●●
●
●●●●● ●
●●
●●● ● ●
●
●●●
●●
●●
●●
●●
●●●
●
●
● ●●●●
●
●
●
●
●
●●
●
●●● ●
● ●●●
●●
●●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
0 20 40
0 20 40
0 20 40
0 20 40
Trial
Figure 2.10: Study 1 strategy sophistication by trial at the individuallevel.
0.8
0.4
0.0
2.3. STUDY 1 RESULTS
91
MM: ID=79
1.0
0.8
0.8
0.6
0.4
0.2
SE: ID=22
1.0
●
●
●
●
● ●●●● ●●● ●●● ●●●● ●●●●●●●●●●
● ● ● ● ● ● ●
● ● ●
●●
● ●
●
●
0.6
●
0.4
●●
●
0.2
0.6
●
● ●
●
●
● ●●●●●● ●●●● ●●●●●●● ●●● ●●●
●●
●●● ●
●
●
●
● ●
●
●
●
●
●
●
●
0.8
Sophistication
1.0
Sophistication
Sophistication
Cn: ID=93
● ●
●
●
0.4
●
●
●
0.2
●●
●
● ●
●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●
0
10
30
0.0
50
0
30
50
0
50
Lg: ID=82
CnCn: ID=104
CnSE: ID=101
0.6
●
●
●
●
0.4
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●● ●
●
●
●
● ●● ● ●● ●
● ●● ● ● ● ●● ● ●●●
●
● ●
●
●
0.6
●
● ●
● ●
0.4
●
●
0.2
●
●●●●●●●●●
10
1.0
0.8
Sophistication
●●
●
●
●●
● ●
●
● ●
● ●●●
●
●
●
●●●● ●
●●
●
0
30
Trial
1.0
0.2
10
Trial
0.8
Sophistication
10
Trial
1.0
0.0
0.0
0.0
30
Trial
50
0.8
Sophistication
0.0
0.4
0.2
●●●●●●●●●●●●●●
●●
0
10
0.0
30
Trial
50
●●
0.6
●
●●
●
●●
●
●●
●●
●
●
●
● ● ●
●
●
● ●●
●● ●
●
●
●
●
●●●
●●●●●●●●●●●●●●●●●● ●
0
10
30
●
50
Trial
Figure 2.11: Study 1 examples of model fits of the effect of trial on
strategy sophistication at the individual-level.
92 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
2.3.3
Practice, Strategy, and Performance
Cases with standard deviations above 0.10 were selected for correlation
analysis. Of the 63 cases, 23 satisfied this criteria. Within individual
correlations between log task completion time and strategy sophistication
were calculated for each of these participants separately. Correlations
tended to be large and negative (M = -0.55, SD = 0.22, min = -0.86,
max = -0.07).
2.4
2.4.1
Study 2 Results
Practice and Performance
Group-level
Figure 2.12 shows the group-level relationship between practice and task
completion time. Besides a few minor deviations, the figure shows data
representative of a monotonically decreasing and decelerating function
typical of group-level learning curves.
Table 2.17 shows the model fits statistics for group-level data. Consistent with Hypothesis 2.1 the three parameter power function provided
superior fit relative to the three power exponential function.
Table 2.17: Study 2 Fit Statistics for Models of the Effect of Block on
Task Completion Time at the Group-Level
1.
2.
3.
4.
5.
6.
7.
8.
a
Model
ka
se
R2
AIC
Cn
P3
E3
APEX
P4
CnCn
P2BP2
E3B
1
3
3
4
4
3
5
5
1.281
0.159
0.281
0.162
0.161
0.777
0.171
0.236
.000
.986
.955
.986
.986
.657
.985
.971
102.98
-20.52
13.72
-18.25
-18.85
74.87
-14.19
4.91
k = Number of parameters.
2.4. STUDY 2 RESULTS
93
14
●
Task Completion Time (sec)
12
●
10
●
● ●
8
● ●
● ●
● ●
●
●
●
●
● ● ●
● ●
● ●
●
● ● ● ● ● ●
●
6
4
2
0
0
5
10
15
20
25
Block
Figure 2.12: Study 2 task completion time by block at the group-level.
30
94 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
Individual-level
Description and plots Figures 2.13 and 2.14 show the relationship
between practice and task completion time at the individual-level. Trialto-trial variation was substantially larger than Study 1. This can be attributed to trials with only a single edit and the variable editing content.
Learning curves varied between individuals in several respects including
(a) initial performance, (b) final performance, (c) shape of the learning
curve, (d) the frequency and timing of outliers, and (e) the consistency
of performance from trial to trial. Figure 2.15 shows example function
fits for individual participants.
2.4. STUDY 2 RESULTS
05 1525
94
15
5
95
96
97
99
87
88
89
9
●
●
●
● ●●
●
●●
●●●
●
● ●● ●
●
●
● ● ● ●● ● ●
● ●
●
●
●
●
● ● ●●
●● ●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●●●
●
●
●
●● ● ●
●
●
●●
●
●
●
●
●●
● ●● ●
●
●● ●●● ● ● ●●
●
●
●●
●●
● ●● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
● ● ●●●
●
● ●●
●
●
● ●
●
●
90
91
92
93
●
●
●
●
● ●
●
●● ● ●●
●●
●
● ●●●
●●
● ●
● ● ●
● ●
● ●● ●
●
●
●
●
●●
●
●● ● ●
●
●●
●
●
●● ● ●
● ●
●●
●
●●●
●
●
●●
●
●●
●●● ● ●●
●●●
●
●
●● ●
●
●●● ●● ●
●
●
●●●●
●
●●
●
●●
●●
●
●
● ● ● ● ●●
● ● ● ●
●●
●
●●
● ●●
● ●● ● ● ●
●
●● ●
●●● ●
●●
●
●● ●
●
●
●
●● ●
●
●
●
●●●
●
●
●●
●
●
●
●
●●
●●
●
●●
●● ●● ●
●
●
●
●
●
●●
●
●
●
●
●●●
●
●
●
●
● ●●●
●
●
●
●
● ●
●
●
●●
● ●●
●
●
●
●●
●●
●
●
●
●
●●
●● ●
●
●
●●
●
●
●●
●
● ●●●
●
●●●
●●
●
●
● ●●●
●
●
●
●●
● ●●
●
Task Completion Time (sec)
05 1525
●
86
15
5
95
76
77
78
79
8
82
83
84
85
64
66
68
69
7
70
71
72
73
●
●
● ●
●
●
●
●
●
●
●
●
●
●
● ●
●
● ●●
●
●
●
●●
●
● ●
●● ●
● ●
● ●●
● ●
●
●
●●
●
●
●● ● ● ●
●
●
●
●●●
●
●
●
●●
●
●
●
● ●●
●
●
●
●●●
●
●
●
●
●●
●
●
●●●
●
●● ● ●●
●●
●
●
●●● ●
●
●
●
●● ●
●● ●●
●
●
●
●
●
●●
●
●
●
●
●
●●●
●
●
●
●● ●
●
●●
●●
●
●
●●
●
●●
●●●
●
●
●●●
●●
●●
●
●
●
●
●
●
●
●
●●
●
●● ●
●
●
●
●
●
●
●●
●
●
●
●
●
●●●
● ● ●
●
●
● ●●
●
●
●
●●
●
●
●
●●●
●
●
●●
●
●
●
●
●
●
●
●
●●
●
●
●
●●
●
●
●
●
●● ●●
●
●
●
●
●
●
●
●
●
●
●
●● ●
●
●
●
●
●●
●
●
●
●●●●
● ●● ●
●
●
●
●
●●
●
●
●
●
●●
●●
●
● ● ●
●● ● ● ●
● ●●
●
●
●
●● ●
●●● ● ●
● ● ● ● ●●
●
●
●
● ●
●●
●●
●●
●
●
● ●●
●
●
●●
●
●
●
● ●
●●
●
●●
●
● ●
●
●● ● ●
●
●
●● ●
●●
●
● ●● ●
●●
●
●●
●
●
●●
●●
●
●
●
●
●
●● ●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
● ●●
●
●●
● ● ● ● ●●
●
●
●
●
●●●
●
●
●●
● ●●
●
●
●● ●
●
●
●
●●
●
●●
●
●●
●
●
●
●
●
●
●●●
●
●
● ●
●
●
●
● ●●
●
●
●
●●
●
●
●●
15
5
52
53
54
55
57
58
59
6
62
44
45
46
47
48
49
5
50
51
3
31
32
35
36
39
4
40
15
5
43
● ●
●
●
●
●
●
●
●●
●
●
● ●
●●
●●
●
● ●
●
●
● ●
●
●
●●
●● ●
●
●● ● ● ●
● ●● ●
●
●
●●
●●
●●●
●
●●
●
●●
●
●
●
●● ●
●
●●
●●●
●
●●
●●
●●
●●●
●
●
●
●●
●●
●● ● ●
● ●●
●
●●
●
●
●
●
●
●
●●●
●
●
●
●●
●●●
●●
●
●
●
●
●●
●●
●●
●
●
● ●● ●●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●●
●●
●
● ●●●
●●●
●
●
●
●
●● ●
●
●
●
●
●●
●
●● ● ●
●●
●
●
●
●●●
●
●
●
●
●
●●
●●
●
●●
●
●●
●
●●●
●●● ●
●
●
●
●
●●
●
●
●
●● ●
●
●
●
●●●
●
●
●
● ●
●
●
●●
● ● ● ●●
●
●
●
●
●
●
●
●
● ●
●
●
●●
●●
●
●● ●
●
●
●
●
●
● ●
● ● ● ●●
●●●
●
●
●
●
●
● ●
●
●● ● ● ●
●● ●●● ●
●
●
● ●
●
●●
●
●
● ●●● ●
● ●● ●●
●
● ●●
●●
●
●●
●●
●
● ●● ● ●
●
●
●●
● ●
●●
●● ● ●
●
●●●
●●
●● ● ●
●
●
●●
●●
●
●
●●
●●
●●
●●
●●●●
● ● ● ● ●●
●● ●
●
●
●
●● ● ●
●
●●
●●
●●● ●
●
● ● ●
●
●●
●
●
●
●●
●●
●
●
●●
●
●
●●
●●
●
●
●
●
●
●
●
●
●●
●
● ●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●●●●
●●
●●
●
●● ● ●
●
●●
●
●
● ● ●●
●
●
●● ●
●
●
●● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
2
1
15
5
15
5
●
●
●
●
●
●●
●
●●
●● ●
●
● ●● ●
●
●
●● ●
●●
●
●●●
●
●
●● ●
●●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●●●
●
●
●
●
●
●
●●
●
●
●
●● ● ●
●
●
● ●● ●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ● ● ●●
●
●●
●
●
●
●
●
●●
●●
● ●●
●
●●
● ● ●
● ●●
●
●
●
●
●
●
●● ●
●
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
● ●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●●
●
●
●●
●
●
●
●
●
●
●●
●
●
●
●● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
● ●
●
● ●●
●●
●
●
●●
●
●
●● ●
●
●
●●●
●
●
●
●●
●
● ●●● ● ●
●
●● ●
● ●
●
●●
●● ●
●
●
●●
●
●● ● ● ● ●
●
● ● ●
●●
● ●
● ● ● ●● ●
●
●
●
●●
●●
●● ● ●
●●
●
●●
●● ● ●
●
●
●●
●●
●
●
●●
● ●● ●
●
●
●
●
●●
●
●
●
●
●●
●
●●
●●
●
●
●● ●
●
●
●
●● ●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
● ●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●●
●
●
● ●
●
●●●
●
●
● ● ● ●●
●
●
●●
●
●
●
● ●●
●●
● ●●
●
●●
●●●
●
●
●
●
●
●
● ●●
●
●
●
●
●
●
●
●●
●
●●
●●●
●
●
●
●● ●
●
●
● ●
●
●
●
●
●
15
5
15
5
●
21
10
22
100
23
101
24
11
25
13
26
14
27
15
29
18
●
●
●
●
●
●
●
● ●
●
● ●●
● ● ●● ●●
●●
●
●
●●
●●
●
● ● ● ●●
● ●
●●
●
●
● ● ●
●
●
●
● ●
●
●●
●●● ●● ●
● ●●
● ● ●●
●
● ●
●
●
● ●●
●●
●●
●
●
●
●●
●● ●
●●●
●
●●●
●●
●● ●
● ●
● ●
●
●●
●
●●
●●
●
● ●●
● ●●●
●
●
●
●●
●
●
●
●
●●
●
●●
●
●
●
● ● ●
●
●
●●
●
●
● ● ●●
●
●
●
●●
●
●
●
●
●●
●
●
● ●●●
●● ●
●● ● ●
● ●●
●
●
●●●
●●●
●
●●●●
●
●
●
●
●●●
●
●
●
●
●
●● ●
●
●
●
●
●
●
●
●
●● ●
●
●
●●
●●
●
●
●
●
●
●
●●
●
●
●●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
05 1525
05 1525
05 1525
05 1525
05 1525
Block
Figure 2.13: Study 2 task completion time by block at the individual-level
— Part 1 of 2.
15
5
96 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
05 1525
15
5
●
05 1525
209
210
211
212
213
197
199
200
202
203
●
●
● ●
●
●
●●●
●
● ●
●
●●
●
●●
●●● ● ● ●
●
● ●●
●
● ● ● ●●
●● ● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●● ●●
●
●
●
● ●● ●● ● ●●●
●●
●
●
● ● ● ● ● ●●
●●
● ●● ● ●
●
●
●●●
●●
●
●
●
● ●● ●
●
●●
●
● ●
●
●●
●
●
● ●● ●
● ● ● ●●
●
●●●
●●
●
●●
●●
●
●
●●
●
● ●●
●
●●
● ●●
204
205
206
208
Task Completion Time (sec)
●
●
●
●
●
●●
●●
●
●
● ●
● ●
●● ●
●
● ● ●
●
● ● ● ●●●
●● ●
●●
●
●●
●
●●
●
● ● ●● ●
●
● ●
●
● ●
●
● ● ● ●● ●
●
●
●
● ●●
●●
●
●● ●
●
● ●
●●
●● ●●
● ●●●● ●
●●
●
●
●●● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ● ●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●● ●●
● ●●
●
●
●
●
●●● ●
●●
●●●
●●
●
●●
●
●●
● ●●
●●
● ●
●●●
●
●
●●
●
●●
●●
● ●
●
●
● ●
●
●
●
●
●
●
●● ●
●
●
●
● ● ● ●● ●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●● ● ●
●
●
●
●
●
●●
●
●
●
●●
●
●●
●
●
●●
●
15
5
181
183
184
185
188
190
192
195
196
170
171
172
173
174
177
178
179
180
●
●
●
●
●
●
●
● ●
● ●
●
●
● ●●
●● ●
● ●
●
●
●
●
●●
●● ● ●
●
●
●
●
●● ●
●
●
●●
●
●
● ● ●
●●
●
●
●●
●
●
●
● ●● ●
●
●
●
●
●
●
●
●●
●
●
●
● ●
● ●●
●
●
●
●
●
● ●
●
●● ●
●●
● ●●●●
●
● ●● ● ● ● ●
●
●●
●●●
●●
●
●
●
●●●●
●
●●
●
●
●●● ●
●
●
●
●● ●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●●
●● ● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
● ● ●●
● ● ●● ● ●●
●●
●
●
●
●●
●
●
●
●
●●
●
●
● ●
●●
●
●
●
● ●● ●
●
●●●
●
●●●
● ●
●●● ● ● ● ●●
●●
●
●● ●
●
●●
●●
●●
●●
● ●●●
●●
●
●
●
●●
●●●
●
● ●
●
●
●
●●
●
● ● ●
●
●
●●
●
● ● ●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●●●
●
●
●
●
●
●
●●
●
●
●
●
●
● ●
●●
●●
●
●
●
●
●
● ●●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●●
●●
●
●
●●
●
●
●
●
●●
●
●
●
●●
●● ●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ● ●
15
5
157
158
159
161
162
163
167
168
169
142
143
144
145
146
147
148
149
150
130
132
133
134
136
138
139
140
141
118
120
122
123
124
126
128
129
15
5
●
●
●
●
●
●
●
●●
●
●
●
● ●
●
●
●
●
●
●
● ●●
● ● ●● ●
●●
●
● ●● ●●
●●
● ●●
●
●●
●
●
●
●
● ●●● ●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●●● ●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
● ●●
●
●●
● ●●
●
●
●
●
●●
●●
●
● ●●
● ●● ●
●●
●
●
●
●
●
● ● ●●
●●
● ●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●●
●
●
●
●
●
●
● ●
●●
●●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●●
● ●
●
●
●
●●●
●
●
●
● ●●
●
●
●
●
●● ●
●
●
● ●●
●●
●
●
●●
●
●
●●
●
●●
● ● ●●
●
●
● ●
●
●
●
117
●
●
●
●
●
● ●
●
●
● ● ● ● ●●
●
●
●
●●
●●●●
● ●
●
●
●
● ● ●
●●
●
●
● ● ● ●
● ● ●●
●● ●
●
●
●
●
●
●
●
●● ●●●
● ●●
●●
●●
●●
●● ●
●
● ● ●●
●
●●
●
●●●
●●
●
●
●
●
●●
●
●●● ●
●
●
●
●
● ●●
●
●
●
●●
●
●● ● ● ●
●●
●●
●
●
●
●
●● ●
●
●
●
●
●●
●
●
●
●
●
●●
●
●
●●●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●●
●
●●●
●
●●●
●
●
●
●
●
●
●●
●
●
●
●
●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
● ●●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●● ●
●● ●
● ●
●● ●
●
●●
●●
●● ● ●
●●●
●
● ●
●●●
● ●
● ●● ●
●
●●
● ● ● ●●
● ●
15
5
15
5
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●● ●
● ●
●
● ●
●
●● ●
● ●● ● ● ●
●
●●●
●●●
● ● ●● ●
●● ● ●
●
●●●
●
●
●
●
●
●
●●
●
●
●
●● ●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●●
●
●
●
●
●●●●●●●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●●
●● ●
●
●●
●●
●●
●
●
●●
●
● ●● ●
● ● ●
●
●
●
●
●
●●
●
●
●● ●●
●
●
●●
●
●
●●
●
●
●● ● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
● ●
● ●
●●● ●
●●●
●
●
●●
●
●
●●
●●
● ● ●●
●● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ● ● ●
●
●
●
●
●
●
●
●
● ●●●
●●
● ●●●
●
●
●
●●
●●
●
●
●
●●
●●
●●
●
●
● ●● ●●
●●
●
● ●●
●●
●
● ●● ●
●●
● ●●
●
●
●
●
●
●
●
● ●●● ●
●●
●●
●
● ● ●
●
●●
●
●●
●
●●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●●
●
● ●●
● ●● ●
●● ●●
●● ●
●
●
●●
●●
●
●
●
●
●●
●●
●
●
●
●
●
●
●●
●
●
● ●
●
●
●
●● ●
●
●
●
●
●
●●
●●●
●
●
●
●●
●
●
●
●
●
●
●
●
●●
● ●●●
●●
●●
●●●
●
●
●
●
●
● ●
●●●
●
●
●
●●●●
●
● ●●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●●
●
●
● ●● ●
15
5
15
5
102
103
104
109
110
111
112
113
116
●
●
●
●
●
●
●
●
● ●●
●
●
● ●●
●●
●●
● ●
● ●
●
●
●
●●
● ●
●
● ● ●● ●
●
●● ●
● ●●
●
● ●● ●
● ●
●
●
●
●
●
● ●●
●
● ●●● ● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●●
● ● ●
●●
● ●
●
●●
● ●●
● ● ● ●●
● ●●
● ●
●●
●
●
●
●●
● ● ●
●●
●
● ● ●●●
●●
●
●●
●
●● ● ●
● ●
● ●
●●● ● ●
●
●●
●
● ●
●
●●
●●●
●
●
●
●
●
● ● ●● ●
●
●●
●
●●●
●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●●
●
● ●
●
●
●
●●
●
●
●
●
●
●
●
●
●●
●●●
●
●
●●
●●●
●
●
●
●
●
●
●
●●● ●●
●
●
●
●
● ●
●
●●
●
●●
● ●
●
● ●●
●
●●
●●
●●
●
●●
●
●
05 1525
05 1525
05 1525
05 1525
05 1525
Block
Figure 2.14: Study 2 task completion time by block at the individual-level
— Part 2 of 2.
Power versus Exponential Function Table 2.18 summarises
individual-level model for the relationship between practice and task
completion time. Supporting Hypothesis 2.2, the exponential function
provides superior fit in comparison to the power function (rmseP 3 =
1.75, rmseE3 = 1.74). Using a paired samples t-test, this was a statistically significant difference, t(153) = 2.93, p = .004. In terms of individual
15
5
2.4. STUDY 2 RESULTS
97
cases, the exponential function beat the power function in 96 cases, compared to only 58 wins for the power function. This was significantly
different using a chi-square goodness of fit test assuming 50% wins for
both functions, χ2 (df = 1) = 9.38, p = .002.
APEX versus P4 Function The APEX function had superior fit to
the four parameter power function (rmseAP EX = 1.73, rmseP 4 = 1.78).
Using a paired samples t-test, this was a statistically significant difference, t(153) = 4.73, p < .001. In terms of individual cases, the APEX
function beat the power function in 110 cases, compared to 44 wins for
the power function. This was a statistically significant difference when
compared to a chi-square goodness of fit test assuming 50% wins for both
functions, χ2 (df = 1) = 28.29, p < .001.
E3B versus APEX Function The three parameter exponential break
function provided superior fit in comparison to the APEX function,
(rmseE3B = 1.66, rmseAP EX = 1.73). Using a paired samples t-test,
this was a statistically significant difference, t(153) = 6.89, p < .001. In
terms of individual cases, the E3B function beat the APEX function in
133 cases, compared to 21 wins for the APEX function. This was a statistically significant difference when compared to a chi-square goodness of fit
test assuming 50% wins for both functions, χ2 (df = 1) = 81.45, p < .001.
98 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
Table 2.18: Study 2 Mean Model Fit Statistics for the Effect of Block on
Task Completion Time at the Individual-Level
1.
2.
3.
4.
5.
6.
7.
8.
Model
ka
se
R2
AIC
Cn
P3
E3
APEX
P4
CnCn
P2BP2
E3B
1
3
3
4
4
3
5
5
2.224
1.755
1.738
1.734
1.776
1.856
1.652
1.659
.000
.385
.397
.424
.397
.331
.501
.489
129.55
117.06
116.68
117.32
118.68
120.84
115.36
115.53
Note. Number of individual learning curves = 154.
a
k = Number of parameters.
2.4. STUDY 2 RESULTS
P3: ID=90
15
10
●
0
●
●
●●
●
●
●● ●●
● ●
●
● ●● ●
●
● ●●
●
●
● ● ●
5
●
20
15
●
●
●
10
●
● ●●
● ●●● ●
●●
●
●●●
●
●
●
●
●
●●●
●
●
5
0
0
●
●
15
●
●
10
●●
● ●
● ●
●● ● ●● ●
● ● ●●
●
● ● ●●
●● ●
●
5
0
10 20 30
5
15 25
0
10 20 30
Block
Block
Block
Block
P4: ID=99
CnCn: ID=29
P2BP2: ID=10
E3B: ID=7
●
●
●
●
10
●
●
●
●
● ● ● ●●
●●
● ● ●●
●● ● ●
●●
●●●
5
0
15
20
●
●
● ●
●
●
●●
●
● ● ●● ●●
●
●●
●
● ●
●
●
●
●
●
●
●
10
5
0
0
10 20 30
Block
20
●
15
●
●
10
5
●
●
● ●
●
●
●
●
●● ●
●●● ●● ●●
●● ● ●
●
●
0
0
10 20 30
Block
Task Completion Time (sec)
20
●
Task Completion Time (sec)
Task Completion Time (sec)
●
10
10 20 30
●
15
●●
0
0
20
15
Task Completion Time (sec)
5
●●● ● ●
● ●
●●● ●●●●●●●●●●●●●●●
●
● ●●
APEX: ID=82
20
Task Completion Time (sec)
20
Task Completion Time (sec)
Task Completion Time (sec)
20
E3: ID=97
Task Completion Time (sec)
Cn: ID=59
99
●●
●
15
●
●
●
●●
●
●
●● ●
●
●
●
● ● ●
●● ●
●● ●
●
●
●
●
●
10
5
0
0
10 20 30
0
Block
Figure 2.15: Study 2 examples of model fits of the relationship between
block and task completion time.
10 20 30
Block
100 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
2.4.2
Practice and Strategy
Group-Level
Figure 2.16 shows the relationship between practice and strategy sophistication at the group-level. Both the initial and final levels of strategy
sophistication was much greater than in Study 1. Table 2.19 provides a
table of model fit statistics. Consistent with Hypothesis 2.10 a MichaelisMenten function provided a good fit to the data at the group-level. The
data did not appear to exhibit an S–shape.
Table 2.19: Study 2 Fit Statistics for Models of the Effect of Block on
Strategy Sophistication at the Group-Level.
Model
1.
2.
3.
4.
5.
6.
a
Cn
MM
SE
Lg
CnCn
CnSE
ka
se
R2
AIC
1 0.077 .000 -65.95
3 0.014 .971 -168.00
3 0.014 .969 -166.64
3 0.015 .965 -163.12
3 0.038 .773 -106.41
5 0.025 .910 -130.24
k = Number of parameters.
Individual-Level
Figure 2.17 and 2.18 shows the relationship between practice and strategy
sophistication at the individual-level. Table 2.20 summarises the fit at
the indiviudal-level. The individual-level strategy sophistication is both
greater and more noisy than Study 1.
Consistent with Hypothesis 2.9 several participants consistently had
strategy sophistication around zero (e.g., cases 54, 85). Consistent with
Hypothesis 2.8 some participants had consistently high levels of strategy
sophistication (e.g., cases 132, 149, 206). Some shifted to sophisticated
strategies fairly quickly in the early trials (e.g., cases 112, 120). Consistent with Hypothesis 2.5 the speed of transition varied. Constant–
2.4. STUDY 2 RESULTS
101
Table 2.20: Study 2 Mean Model Fit Statistics for the Effect of Block on
Strategy Sophistication at the Individual-Level
Model
1.
2.
3.
4.
5.
6.
a
Cn
MM
SE
Lg
CnCn
CnSE
ka
se
R2
AIC
1 0.196 .000 -14.88
3 0.171 .235 -20.86
3 0.169 .248 -21.08
3 0.170 .234 -20.66
3 0.163 .287 -22.77
5 0.177 .227 -8.52
k = Number of parameters.
Constant suggests an abrupt onset and shift. Constant–Saturating Exponential suggests and abrupt onset but a more gradual shift. Consistent
with Hypothesis 2.6 several individuals only partially adopted sophisticated strategies (e.g., cases 122, 123, 167). Figure 2.19 illustrates for each
function one participant where that function fit relatively well. Consistent with Hypothesis 2.7, the onset of the strategy shifts was greatest in
the first block. In broad terms, the frequency of shifts decreased after
this.
102 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
1.0
Strategy Sophistication
0.8
●
0.6
●
●
●
0.4
● ●
●
●
● ● ●
●
● ●
●
●
● ●
●
●
●
● ● ● ● ●
●
●
●
●
0.2
0.0
0
5
10
15
20
25
Block
Figure 2.16: Study 2 strategy sophistication by trial at the group-level.
30
2.4. STUDY 2 RESULTS
05 1525
94
0.8
0.4
0.0
103
05 1525
95
96
97
99
●
●●
●
●●
●
●●●
●●●
●
●
●●
●
●●
● ●●●
●●
●
●
●●
● ●●●●
● ●
●●● ●
●● ●
●●●●
●●
●●
●
●
●●
● ●●●
●
●●
●
●
●
●●
●
● ●
● ●
●
● ●
●
●
● ●●●● ●● ●
●
● ●●
● ●
●
●
●●
●
●
●
●
●
●
● ● ●●
● ●
● ● ●
●
●● ●
●●
●
●●●
●●
●
●
●●
●●
●●
●
●●
●
●
●
● ●●●●
●
●● ●
●●
86
87
88
89
9
90
91
92
93
●
● ●
●
●● ●
●●
●●●
●
●
●
● ●●● ●● ●
● ●
●● ● ●
●
●●
●
●
●●
●
●●
●●●●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●●
●●
●
●●
●●●
●
● ●●●
●● ● ● ●
●●●●
●
●● ●● ●
●●
●●
●
●
●●●
●●●
●
●●●
● ● ●● ●●
●
●●●● ● ●●
●
●
●
●● ●
●
●● ●
●●●●●● ●●●●
●●
●
●●●
●
●
●
●
●
● ● ●●● ● ● ●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●●● ●
●
●
●
●
●●● ●● ●●
●
●
● ● ●●
●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●●●●
●●
●
●
●
●
● ● ●●
● ●●
●
●●
●● ●
●●●● ● ● ●
Strategy Sophistication
76
0.8
0.4
0.0
77
78
79
8
82
83
84
85
●●
●●●●
●
●●●●
●
● ●● ●
●●●
●●● ●●
●●
●
●
●●●
●●●●●● ● ●●
●
● ●●
●
●●
●
●●
●●
●
●
●
●
●
●
●●●
●●●●
●●
●●
●
● ●●
●
●
●
●●
●●
●●
●●
●●●
●●●●
●●●
●
●●●
●
●
●
●
●●●●
●●● ●●●
●●●
●●●
● ●
●●●
●●
● ●● ●●
●● ●●
●
●●● ●
●●●
●
●
● ●●●
●●
●
● ●●
●
●
●●
● ●
●
●●
●●
●●
● ● ●●
●
●● ●●
●
●
● ●
● ●
● ●
●
●
●
●
●●●●
●
● ●
●
●● ●
●
●
● ●●
●
●
●
●●● ●
● ●● ●
●
●
●
●
●
●
●
●
●
● ●●●●
●
● ●● ●●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
64
66
68
69
7
70
71
72
73
●●●
●
●
●●●
●
●●
●
●
●
●●●
● ●●●●●
●
●
●
●
●●●●●
● ● ●
●
● ● ●
●●
●
●●
●●
●
●
●
●
● ●●
● ●●
●
●●
●
●●●
●
●
●
●
● ●
●
●●
● ● ●●
●●
● ●
●
●
●●●
● ●●
●
●
●●
●
●●
● ●●
●●
●
●
● ●●
●●
●
●
●
● ●
●
●
●● ●● ●
●
● ● ●
●
●●● ●●
● ●
● ● ●●
●
●
●●
●
●
●
●
● ●
●
●● ●
●●
●●
●
●
● ●
●
●
●
●
●
● ● ●●●
●
●
●
●
●
●
●
●
●
● ●
● ●●
●
● ●●
●
●
● ● ●
●●
●
●
●
● ●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
● ●●
●●
●●
●
● ● ●●●●●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●●
●
●●●●
52
0.8
0.4
0.0
53
54
55
57
58
59
6
62
44
3
45
46
31
47
48
49
5
50
51
● ● ●●
●●●
●
●●●
●
●●
●●●●● ●
●
●● ●●●● ●●●
● ●●
●
●
●
●
●
●
● ●
● ●●
●
●
● ●●
●●●●
●● ●●●
●
●
●●● ● ●
●
●
●
●●●
●
●
● ●●
●●
●
●●
● ●●
●● ● ●
●●●
●
● ●●●● ● ● ●●
●
●●
●
●
●●
●● ● ●
●
●● ● ●● ●● ● ● ● ●
●
● ●
●●
● ●●
● ●
● ● ● ●● ● ●●
●
● ● ●
●
●●●● ●
●
●
●
●
●
●
●
●
●● ●●●
●
●
●●●●
●
●● ●
●●
●●●
●
●●
●
●
●
●●●
●
●
●●
●
●●●
●●
●
●
●
●●●
●
32
35
36
39
4
40
43
0.8
0.4
0.0
●
●
●
●
●● ●
●●●●●
●●
●● ●
●●●●
●
● ●
●
●● ●
●
●
●
● ●●●● ●
●
●
●
● ●●
●
●
●
● ●●
●
●
●●●
●●●
●
●●
●● ●
●●●●
● ●
● ●
●●●●
●●
● ●
● ●●● ● ●●●
●●
●
●●● ● ●
●
●●
● ●●
● ●
● ●
●●
●
●●
● ●● ● ●●
●
●
● ●● ●
●●
●●●
●
● ●
●● ● ●● ●●
●● ● ●
●
●
●●
●●● ● ● ● ●● ● ●●
● ●
● ●
● ●
●●
●●
●
●● ● ● ● ● ● ●
●● ●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●●
●● ●● ●●
● ●●●
●● ●●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●●●
● ●
●
●
●●●●
●●
● ●●
●
●●
●
● ●
●●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
2
21
22
23
24
25
26
27
29
●
●
●
●
●●●●
●
● ● ●●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●● ●
●
●
●
●●
●●
●
●
●●
●
●
●
●
●
●●●
●
●
●●
●
●
●
●
●●●●
●●
●
●
●
●
●
● ●●
●
●● ●●
●●●●●
● ● ●●
●●
●
●●
●●
●● ●
●
● ●
●
●●
●
●● ●
●
●●
● ●●
●
● ●●
●
●
●●● ●● ●●
● ●●●● ●●
●●
●
●
●●
● ● ●●
●
● ●
●
●●
● ●● ● ● ●●
●●●● ● ●
● ●
● ● ●●● ●
●
●
●
●● ●
● ●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●● ● ● ●
●●
●
●
●
●
● ●●
●● ●
●
●●
● ●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
● ●
●
●
●
1
0.8
0.4
0.0
0.8
0.4
0.0
●
●
●
●
●●● ● ●
●
●●
●
●
●●
●
● ● ●
●
●
●
●●●●
●
●
●
● ●
●● ●
●
●
●
●●
●
●
●●
●
●
●●●●
●
● ●
●●●
●●
●
●
●●●
●
●●●
● ●
●
●● ●
●● ●
●
●● ●●
●
●
●●
● ●●
●●
●●
● ●
●
● ●●
●
●
●
●
●● ●
●●
●
●● ●
● ●
●
● ●
●●
●
●●
●●
● ●
●● ●
●●●
●●● ●
●
●
●
●
●● ●
●● ● ●
●
●● ●●●●●●● ●●
● ●●
● ● ●
●
●
●
●
●
●
●
●
●
●
●
● ● ●●
●●
●
●
●
●
●
●● ●
● ● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
● ●●
●●
●
●
●
●●●
●
●
●●
●
●
●●●
●
● ●
●● ●
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●● ●
●●
●
●
●●
●●
●●
●
●●●● ●●●
●● ●
●
●
●
●
●
●●
●
●
● ●● ●
● ●●● ●
●●
●●
● ●
●
●
●
●●
●●
● ● ●●●●
●
●
0.8
0.4
0.0
0.8
0.4
0.0
10
100
101
11
13
14
15
18
● ●●●
●●
● ●●● ●
●●●●
●
●●● ●
●
●● ●
●●
●
●
●●
●●
●●
●
●
●●●
●
●
●● ●●●●
●
●
●●●● ● ● ●
●
●●●
●
●
●●
●
● ●●
●
●●
●
●
●
●
●● ● ●● ● ●
●
●
●
● ●● ●● ●●
●
●
●
●
●
●
●
●●
●
●●
●●
● ●
●● ●
●●
●● ●
●
●
●●
● ●●
●●●●● ●
●●●●●
●
● ●
●
●
●●
●
●●●
●●
● ●●
●●
● ●
●●
●● ●●●
●
●
● ●●●
●
●● ●
●●● ●
● ●●
● ●
●●●●
●●●
●● ●●
●●
●
●●
●
●
●
● ●●●
● ●
●
●
●
●●
●
●● ●● ● ●
●
●
● ●
●
●
●
●
●● ●
● ● ●● ●
●
●● ● ● ●
● ● ●●
●
●
●
●
●
●
●●●
●
● ● ●●
● ●●
●
●
●●
05 1525
05 1525
05 1525
05 1525
05 1525
Block
Figure 2.17: Study 2 strategy sophistication by block at the individuallevel — Part 1 of 2.
0.8
0.4
0.0
104 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
05 1525
209
0.8
0.4
0.0
210
05 1525
211
212
213
●
●
●
●●●
●● ● ●
●●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●●
● ●●
●
●● ●●●
●●● ●
●
●
●●●●
●●
●
●
●
●●● ● ●
●
●● ●
●●●●
●
●
●
●●
●
● ●
● ●
●
●
●
● ● ●
●
●
●
●●
●
●●● ●
● ● ● ●
●● ●
●● ●●●●●
●●
●
●
● ●●●●
●
●
●
●
●
●
●
● ●● ●
● ●●●
●
●
●
● ●●●●●●●
●
●
●
●●
●
●
●
197
199
200
202
203
204
205
206
208
● ●● ●
●●
●
●
● ●● ●●●●
●
●● ●
●
●
●
●●
●
●●●●
●
●●
●●●●
●
●
●
●
●●
●●
●
●
●
●
●
● ●
●
●● ● ●
●●● ● ●●●● ●
●
● ●
●
●
●
●
● ●● ●●●
●● ●●
●●
● ●●●●
●
● ●●●
●●
● ●
● ●
●
●●
●● ●●
●
● ●● ●●
●
●● ●
●
●●
●●
● ● ●
● ●●
●● ●
●
●●●
● ● ●
● ●
● ●
●● ●●● ●
● ●● ●
●
●
● ● ●● ●●
●●
●● ●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●●
● ●●
●● ●●
●●●
● ●●● ●
●
● ●
●
●
●
●
● ●
●
●
● ●
●
●●
●
●
●●
●
●
●
●
●●
●● ●
●
●
●
●
●
●
●
●
●
●
●● ●●
●
●
●●
●
●
●●
●
●
●
●●
● ●
● ●
●●
●●
●
Strategy Sophisitication
181
0.8
0.4
0.0
184
185
188
190
192
195
196
●
●
●
●
●●●●●●●
●
●
●●●
●
●● ●
●
●
●●
● ● ●●●●
●● ●
●● ●
● ●
●
●●●
●
●
●● ●●●● ●
●
●
●
● ●
● ●
●
●
●● ●
●
●●●
●
●●●
●●
● ● ●●● ●
● ●
●●
●
●●●
●
●
● ● ●●●●
●
●● ●
● ●●●
● ●
●
●●
●
●●
●●
● ●●●●
●● ●
●●●
●
●● ●●
●
● ● ●●● ●
●●
●●
●
●
●
●
●●●
●●
● ●●
●
●
●●
●●●
●
●●
●●
●
●
●●
●
● ●
●
●
●● ●●
●●
● ●●
● ●
● ●
●
●
●
●
●
●● ●●
●●
●●●
●
●●●
●
●● ●●
●
●
●
●
●●
●
●●●●
●
● ●
●
●
●
● ●●● ●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●●●
●
●
●● ●
●●
● ●
●
●
170
●●
●●
●●
●
●●●
●
●
●
●
●●
●●
●●
●●
●●
●●
●
●
●
●
157
0.8
0.4
0.0
183
171
172
173
174
177
178
179
180
●
● ●●●●
●
●
●
●
● ●
●
●● ●
●
●
●●
●
●
●
●●
●
●●●
● ●●
●
●
●●● ●●
●
●
●
●● ●●●
●●
●
●
●
●
●
●●●●
●
●
●
●
● ●● ●●●
● ●●
● ●● ●
●
●
●
●
●
●●
● ●●●●●● ●●
●●●
●
● ●●
●
●
●
●●●
●
●●
●● ●●
●
●●● ●●●●
●●●●
●● ●
●
● ●●
●
●
●
●●● ●
●
● ●
● ● ●
●● ●
●
● ● ●
●
●●●
●
● ●●
●
● ●
●●
●
●● ● ●
●
●●
●● ●
●
●
●
● ●
●
● ●● ●● ●●
● ●
●
●●
●
●
● ● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●● ●
●
●
●
●
●
●
● ●
●
●
●
158
159
161
162
163
167
168
169
142
130
143
144
145
146
147
148
149
150
132
133
134
136
138
139
140
141
0.8
0.4
0.0
●●
●●●
●●●●●
●●
●
●
●●●
●●●●
●●
●●
●
●
●● ●
●
●
●
●●
●
●
●
●●●●
●
●
●● ●
●
●●
●
●
●●
●●
●●●
●●
●●
●
●
●●
●●●●●●
●● ● ●
● ●
●●
● ●●● ● ●
● ●
●
●
●
●
●
● ●●
●●
●●
● ●●●● ●●●
●●●●
●
●●
●
●
●
● ●●●
●
●
●●
●●● ●
● ● ●●●●
●
●
●●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●● ●
●
●
●
●
●●
● ●● ●●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●● ●●●
● ●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●●●
●●
●
●
●
●●
●
●
●
●
●
●
●
●
●●●● ●
●●
117
118
120
● ●●
●
●●
●●
●
●
●●
●●
●
●
●
● ●
●●
●
●
●●
●●
●
● ●●●●
●●
●●●●●
●●●●
●
●
● ●●
●●
●
●●●●
● ●●
● ●●●
●
●
●
●
●
● ●●
● ● ●
●
●●●
●● ●
● ● ●●
●●● ●
102
0.8
0.4
0.0
0.8
0.4
0.0
● ●● ● ●
●●
●● ●●
●
● ● ●
●● ● ●
●
●●
●●
●●●
●
● ●● ● ●●
●
●
●
●
●
●
●
●
●
●
●● ●
●
●●
●
●
●
●● ●
● ●●
●
●
●
●
●●
●
●● ● ●
●●
●●●
●●
● ●
●●
●●
● ●●●
●●●●
●
● ●●● ●
●●
●●●
●
●
● ●
●
●● ●
●
●●
●
●●
● ●● ●
● ●●● ● ●
●●●●
●
● ●
●●●
●●
●
●
●
●
● ●
●
●●
●
●
●●●● ● ●
●
●
●
●
●●
● ●● ● ●●●●●●●●
● ●
●
●
● ●●
● ● ●
● ●
● ●●●
●
●●● ●●
●
●
●●●●
●
●
● ●●
●
● ●
● ●●● ●●
●
●
●
●
●
●●
●
●
●
●
●
●
●●●
●
● ● ● ●
● ●
●
●
●●
●●
●
●●
●●
●
●● ●
●
●
●
●
●●●
●
●
●
● ●●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●●
●
●●
● ●●
●
●
●
●
●
●
●●
●
●
●
● ● ●●●●
●●
●
● ●
●
●
●
●●
●
● ●
●●
●
●●
●●
●
●
●●●
●
●●●●
●●
●●
●
● ●● ●
●
●
●●●
●● ●●
● ●● ●
● ● ●
●●●
●● ●● ●●●
●●●
●●●
●
● ●
● ●●●
●●
●●
●●
●●
● ●
●
●●
●
●
● ●●
● ●
● ●●
●
●
●
●●● ● ● ●●●
●
●
● ●
●
●
●● ●
●●
● ●
●
●
●
●● ●●
●
● ●●
●●
● ● ●
●
●
●
●
●●● ●
●
●
●
●
●
●
●
●
● ●
●
● ● ● ●
●● ●
●
● ●
● ●
●●
●
●
●●
●
●
●
●
● ●●●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
● ●●
●
●
●
●●
●●
●
●
●
●●●
●
● ● ●
● ● ●
0.8
0.4
0.0
0.8
0.4
0.0
103
104
122
123
124
126
128
129
●
●
●●● ●
●
●
●
● ●●
●
●
●●
●
●
● ●● ●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●●●
●
●●
●
●
●
●
●
●
●
●
● ●
●●
●●
●
●●
●
●●
●
●●
●●●
●●●
● ● ●●
●
●●
●● ●● ● ●
●
●
●●
●●
●● ●●
● ●
● ●● ●
●●● ●
● ● ●● ●
●
●
● ●
● ●●
●
● ●
●
●
● ● ●● ●
● ●● ●●●
●● ● ●
●● ● ●
●●
●● ●
●● ●
●
●
●
●
●
●
●
● ●
●
● ● ●
● ●●
●
●
● ●● ● ●●
●
109
110
111
112
113
116
●●
● ●●
●●
●
●●
●●
● ●
●
●●
●● ● ●●
●●
●●
●
●●
●
●
●
●●
●● ●
●●
●●
●
●
●●
●● ●
●● ●●
● ● ●●●
● ●
●●
●●
●
●
●
●●
●
●
●●
●
●
●
● ●
●●
●●
●
●
●
●
●●●●●●
●
●● ●
●●
●●
●
●●●●
●
● ●
●●
●● ● ●
●
● ●●
●● ● ●● ●
●
● ● ●● ● ●● ●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
● ● ● ●●●●
●
●●● ●●
●● ● ● ●
●
●● ●
●●
●
●
●●
●
●
●
●
●●●
●
●
●
●●
●●
●
●
●
●
●
●●
●
●
●
●● ● ●●
●
●
●●●
●
●
●●●●
●
●
●
●
●
●●
●
●
●
●● ●
●
●
●
●
●
●
●
●
●
●
●
●●
● ●
● ●●●
●
● ●
● ●●
●
●
●
●
●
●
● ● ●
●●●
●
05 1525
05 1525
05 1525
05 1525
05 1525
Block
Figure 2.18: Study 2 strategy sophistication by block at the individuallevel — Part 2 of 2.
0.8
0.4
0.0
2.4. STUDY 2 RESULTS
105
Sophistication
0.8
●
●
●● ●
● ● ● ● ●● ● ●
●
●
●
● ●●●
●
●●● ●
●
●
●
●
●
0.6
0.4
1.0
●
0.8
0.6
●
●
●
●
●
●
●
SE: ID=130
1.0
●
●
●
● ●
●●
●
● ●
●
●
●
●
●
Sophistication
1.0
MM: ID=174
●
●
●
●
●
●
0.4
0.2
0.0
5 10
20
30
0.6
●
0.4
●
●
0.0
5 10
30
0
1.0
20
30
1.0
●● ●
●
●
●●
● ●●
●
●●
●
●●●
● ●●
●
● ●
0.6
●
●●
●
0.4
●●●
● ●●
●●
●
●
●
0.2
●
●
●
0.0
20
Block
30
0
●
●
●
●●
Block
30
●
●
● ●
●
●
●
●
●
●
●
0.0
20
●
●●
● ●
●
0.4
0.2
5 10
●
●
●
●
●
●
●
0.6
●●
●
5 10
●
0.8
Sophistication
● ●
0
5 10
CnSE: ID=113
0.8
●
●
CnCn: ID=103
0.6
0.0
20
●
Lg: ID=103
Sophistication
Sophistication
●
●●
Block
0.8
●
● ●
●
●
●
Block
●● ●
●
●
●●
● ●●
0.2
● ●
●
Block
1.0
0.4
●
0.2
●
0
●
●●
●
0.0
0
●
●
●
●
●
0.2
●
●
●
0.8
Sophistication
Cn: ID=149
0
5 10
20
Block
Figure 2.19: Study 2 examples of model fits of the effect of block on
strategy sophistication at the individual-level.
30
106 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
2.4.3
Practice, Strategy, and Performance
Within-individual correlations between strategy sophistication and log
trial completion times were calculated for participants with a standard
deviation in strategy sophistication greater than 0.20 (129 out 154 participants were above the cut-off). The typical correlation was negative
and small to medium (M= −0.23, SD = 0.18, range: −0.74 to 0.25).
2.5
2.5.1
Study 3 Results
Practice and Performance
Group-Level
Figure 2.20 shows the relationship between practice and performance
at the group-level. Table 2.21 shows the fit information for the grouplevel learning curve. Consistent with Hypothesis 2.1 the three parameter
power function had superior fit in comparison to the three parameter
exponential function.
Table 2.21: Study 3 Fit Statistics for Models of the Effect of Block on
Task Completion Time at the Group-Level
1.
2.
3.
4.
5.
6.
7.
8.
a
Model
ka
se
R2
AIC
Cn
P3
E3
APEX
P4
CnCn
P2BP2
E3B
1
3
3
4
4
3
5
5
1.250
0.141
0.219
0.144
0.144
0.694
0.140
0.191
.000
.988
.971
.988
.988
.713
.989
.980
101.50
-27.41
-1.12
-25.43
-25.53
68.10
-26.12
-7.68
k = Number of parameters.
2.5. STUDY 3 RESULTS
107
12
●
10
Task Completion Time (sec)
●
●
● ●
8
●
●
● ●
●
●
●
6
●
● ● ● ●
● ●
●
● ● ●
● ● ●
● ●
● ●
4
2
0
0
5
10
15
20
25
Block
Figure 2.20: Study 3 task completion time by block at the group-level.
30
108 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
Individual-level
Overview Figure 2.21 shows the relationship between practice and
task completion time at the individual-level. While participants tended
to get quicker over time, large individual differences existed in average
performance, amount of improvement, shape of improvement, and trialto-trial variability.
05 15 25
●
84
05 15 25
86
90
93
96
●
●
●
●
●●
●●●
●●●●● ● ●●
●
●
●
●
●● ●●●
●●●
●●● ● ● ●●●
●●● ●●
●●
●●●●●●● ● ●
●
●● ● ●●●●●●●●●
●●●●●●●●●●●●●●●●● ● ● ●●● ●●●●●●●●●●●●●●●● ●●●●●●●●●●●●●●●●●●●●●●●●● ● ●● ●●●●●●●●●●●●●●●●
●
72
10
5
● ●●
●●●●●●●●●●●● ●●●●●●●
● ●● ●●●●●
Task Completion Time (sec)
44
●
75
76
● ●●●●
●● ●
●
●
● ●●●●●● ●● ●●●●
●● ● ●●●
●●
●●●●●●●●●●●●●●●●●●●●● ●
● ●● ● ●●
48
49
77
78
●
●
●●● ● ●●
●●
●
●●●●●● ●●●● ● ●
●●●●●●●●●●● ●
● ●●●●●●
●
●●●●●●●●●●●●●●
● ●●
50
57
10
5
●●
79
80
81
●
●
●
● ●●
●●●●●●●●
●●
●●
● ● ●●●●●●●●●●●●● ●●●●
●● ●●●●● ●●●● ●●
●
●● ● ●●●●● ●●●●●●●●●●●●●● ● ●
● ● ●●●●●●●●●●●
58
●
6
67
●
●
●●
●
●
●
●
●●
●
● ● ●●●
●
●●
●●●● ● ●●
●
●●●
●● ● ● ●
●
●●●●●●
●
●
●
●
●
●●
● ● ●● ●● ●●●● ●●●● ●●●●● ●
●●●●●●● ●●
●●●●●●●●●●●●● ●● ●●●●●●●
●●●●●●● ●●●●●●●●● ● ●●●
●
●
●● ● ●
●
●
●
●
●
●
●
●
●
●
●
● ● ●●●●●●●●●
●●●●●●●●● ●● ●●●
●●●●●●●●●●●●●●●●●●
●●
●●●
●●● ●● ●●●●●●●●●●●●
●
●●●
●●●●●●●●●●●● ●●●●●●
●
●●●●
●
● ●
●● ●●●●●
●
●
10
5
18
201
27
39
4
40
43
●
●
●●
●
●
●●
●
●
●●●
●
●●● ●●
●
●●●●
●●
●
●
● ●
● ●● ●●●●●●●●● ●
●●
●●
●●●●●●●●●●●●●● ●●●●●●●●● ●●● ●●●●● ●●●
● ●● ●●●●●●●●●●●●●
●●● ● ●●●●●●●●●● ●
●●● ●●●● ●●
● ● ●●● ●●●●●●● ● ●●●●●●●● ●●●●●● ●
●● ● ● ●
●
●●●●● ●● ●
●
●
●
●
●
●
●
●
●
●
●
● ● ●●● ●●●●●
●
●
●●●●●●●●●●●●●●●●● ● ●●
●●●●● ●● ●●●●
●●● ●●●●●●●●●●●
● ●●●●●●
●
152
156
●
●● ●
●
●
●●
●● ●● ● ● ● ●
●●
●●●●●● ●● ●●●● ●
●
●●●●
●
●●●●●●●●●●●●●●●●●●●●●●●
●
●
●●●●
●● ●
●
●●
●
●● ●● ●●
●
● ● ●● ●●●●
●
●
● ● ●●
●●
●● ●● ● ●●
● ● ●●●
●●●●● ●●●●●●●●●● ●● ●●● ●●●●●●●● ●
●
●
●●●●●● ●●
●
●● ●●
● ● ●●●●●●●●●●●●●●● ● ●
●
●
●
● ● ●●●●
●● ●●●● ●●●●●●●●●●●● ●●●●●●●
●
●
●●●●● ●●
●●●●●●●●●●●●●●●●●●●●●●●
159
161
167
168
17
173
140
141
142
143
146
148
15
150
12
120
121
127
129
13
133
137
●
●
●
● ●
●
●●●● ●●●●●● ●●● ●●●● ●
●
●
●
●● ●● ●●●●
● ●
●●
●●●●● ●●
●●●●
●
●●●●●●●● ● ●● ●●●●●
●
● ● ● ●●
●●●
● ●●●
● ●●
●
● ●●●●●●● ●●● ●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●●●●●●●●●●●●●●●●
●● ●●●● ●● ●●●●●●●●●●●●●●●●●●●●●●●● ●●● ●●●●●●●● ●
●●●●●●●●●●●●●●●●●●● ●●●●●●●●●●●●●●●●●●●●●●●●
●
●● ● ●●●●●●●●●●●●●●
●●
1
10
5
10
5
●
●
● ●●
●●
●●
● ●●●●●● ● ●●●●●
● ●
● ●● ●
●●
●●● ● ●
●●
●●● ●
●● ●● ● ●
●
●
●
●
●
●
●
●
●
●
● ●● ●●●●●●●●● ● ● ●● ●● ●●● ●● ●
●●
●●●● ●
●
● ●
● ●●●●●●●●●●●●●●●● ●●●
●●
●
●●●●●●●●●●●●●●●●●●●●●●●●●
●● ● ● ●●● ●●●● ●●
●
●
● ●●●●●●●●●●●●●●●●●●●●●●●●●●● ●●●●●●●●●●●●●●●●●●●●●●●●●● ●●●●●●●●●●●●●●●●●●●●●●●●●●●
● ●●
●
●
10
5
28
10
5
10
101
102
●
108
11
116
117
●
● ●
●
●
●● ●
●
●
● ● ●●
●●
●
●
●
● ●
●
●
●
●●●●● ●
●●
●●● ● ● ●●●● ●●● ●●● ●●
● ●●
●●●●
●
●
●
●
●
●
●●●●●●●● ●●● ●●● ●
●
●● ●●●
●● ● ●●●●●● ●●●●●●●
●
●
●
●
● ● ●●● ● ●
● ●●●●●●
●●●●●●●●●●●●●●●
●● ●●● ●
●●●●●●●● ●● ●●
●
●● ● ●●● ● ●●●
●
●●●● ●●●●●●●● ●●●●●●
●
●
●
●
●
●● ● ●●●●● ●
● ●●●
●●●● ●●●●●●●
●●●●●●●●●●● ●
● ●● ●●
● ●●●
●●●●●●●●●
05 15 25
05 15 25
05 15 25
05 15 25
Block
Figure 2.21: Study 3 task completion time by block at the individuallevel.
10
5
2.5. STUDY 3 RESULTS
109
Power versus Exponential Function Table 2.22 summarises
individual-level model fit information for three parameter power and exponential functions. While the sample results were in a direction supporting Hypothesis 2.2 the exponential function fit was not significantly better than the fit for the power function (rmseP 3 = 1.02, rmseE3 = 1.01).
Using a paired samples t-test, this was not statistically significant different, t(60) = 1.14, p = .26. In terms of individual cases, the exponential function beat the power function in 29 cases, compared to 32
wins for the power function. This was not significantly different using
a chi-square goodness of fit test assuming 50% wins for both functions,
χ2 (df = 1) = 0.148, p = .70.
APEX versus P4 Function The APEX function provided somewhat
better fit than the four parameter power function in terms of mean error
averaged across participants, (rmseAP EX = 1.00, rmseP 4 = 1.02. Using a paired samples t-test, this was not quite a statistically significant
difference, t(60) = 1.81, p = .08. In contrast, the APEX function beat
the power function in only 24 cases, compared to 37 wins for the power
function. This also was not a statistically significant difference when
compared to a chi-square goodness of fit test assuming 50% wins for
both functions, χ2 (df = 1) = 2.77, p = .10. The reason for this seemingly contradictory result is that for the majority of participants, the
fit for the P4 and APEX models was almost identical, but with a slight
tendency for better fit for P4, yet for six cases the APEX function was
far superior.
E3B versus APEX Function The three parameter exponential break
function provided superior fit in comparison to the APEX function,
(rmseE3B = 0.932, rmseAP EX = 0.999). Using a paired samples t-test,
this was a statistically significant difference, t(60) = 6.66, p < .001. In
110 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
terms of individual cases, the E3B function beat the APEX function in
58 cases, compared to 3 wins for the APEX function. This was a statistically significant difference when compared to a chi-square goodness of fit
test assuming 50% wins for both functions, χ2 (df = 1) = 49.59, p < .001.
Table 2.22: Study 3 Mean Model Fit Statistics for the Effect of Block on
Trial Completion Time at the Individual-Level
1.
2.
3.
4.
5.
6.
7.
8.
Model
ka
se
R2
AIC
Cn
P3
E3
APEX
P4
CnCn
P2BP2
E3B
1
3
3
4
4
3
5
5
1.671
1.021
1.007
0.999
1.015
1.152
0.931
0.932
.000
.599
.606
.628
.614
.516
.689
.684
112.47
84.16
83.97
84.12
84.68
92.91
80.83
80.86
Note. Number of individual learning curves = 61.
a
k = Number of parameters.
2.5. STUDY 3 RESULTS
P3: ID=10
15
10
0
●
●●●
●●
●● ●
●●●● ●● ●●
●
● ●●
● ● ●● ●
● ●
5
10 20 30
20
15
●
10
5
●●
●
●
●
● ● ●
●
● ●
●
●
●● ● ●● ●●● ●
● ●
●●
●●●
0
0
10 20 30
15
10
●
●
●●●
5
●●●●●
●●●●●● ●●●●
●●● ● ●●●●●
●
0
0
10 20 30
0
10 20 30
Block
Block
Block
Block
P4: ID=57
CnCn: ID=127
P2BP2: ID=79
E3B: ID=96
●
●
●
10
●
●●
●
●
●●
●●●●● ●●●● ●
●● ● ●●
●●●●
●
5
0
20
15
●
10
●
●●●
●
● ●
●
● ●
● ●
● ●
● ● ● ●● ●●
●● ● ●●●●●
5
0
0
10 20 30
Block
15
20
●●
●
●
10
● ●
●
●
●
● ●
●
●
●
●● ●
●●● ● ●
● ● ●
●●●●
●
5
0
0
10 20 30
Block
Task Completion Time (sec)
20
Task Completion Time (sec)
20
Task Completion Time (sec)
●
0
0
15
10
●
Task Completion Time (sec)
5
●
●
● ●
●
● ●●●
● ●●● ●● ●● ●● ●●●
●
●●●●● ●●
15
APEX: ID=48
20
Task Completion Time (sec)
20
Task Completion Time (sec)
Task Completion Time (sec)
20
E3: ID=101
Task Completion Time (sec)
Cn: ID=167
111
15
10
5
●
●
●
●● ● ●
●
●●
● ●●
●
●
●●● ●
●● ● ● ●●●
●● ●
●
0
0
10 20 30
0
Block
Figure 2.22: Study 3 examples of model fits of the effect of block on task
completion time at the individual-level.
10 20 30
Block
112 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
2.5.2
Practice and Strategy
Group-level
Figure 2.23 shows the relationship between practice and strategy sophistication at the group-level. Table 2.23 shows the model fit information.
Supporting Hypothesis 2.10, a Michaelis-Menten function provided good
fit to the group-level data. Interestingly the Michaelis–Menten function
was clearly better than both the Saturating Exponential and the Logistic
functions. The level of strategy sophistication was greater than both the
previous two studies.
Table 2.23: Study 3 Fit Statistics for Models of the Effect of Block on
Strategy Sophistication at the Group-Level
Model
1.
2.
3.
4.
5.
6.
a
Cn
MM
SE
Lg
CnCn
CnSE
ka
se
R2
AIC
1 0.087 .000 -58.68
3 0.010 .987 -183.92
3 0.017 .966 -155.73
3 0.020 .949 -144.55
3 0.049 .696 -90.45
5 0.036 .850 -107.63
k = Number of parameters.
Individual-level
Figure 2.24 shows the relationship between practice and strategy sophistication at the individual-level. The figure reveals several distinct patterns
of strategy use and strategy change at the individual-level. Table 2.24
shows a summary of model fits for each of the modelled functions.
Consistent with Hypothesis 2.9 several participants consistently had
strategy sophistication around zero (e.g., cases 156, 168). Consistent
with Hypothesis 2.8 some participants had consistently high levels of
strategy sophistication (e.g., cases 80, 167). Some participants shifted to
sophisticated strategies fairly quickly in the early trials (e.g., cases 12,
2.5. STUDY 3 RESULTS
113
Table 2.24: Study 3 Mean Model Fit Statistics of the Effect of Block on
Strategy Sophistication at the Individual-Level
Model
1.
2.
3.
4.
5.
6.
a
Cn
MM
SE
Lg
CnCn
CnSE
ka
se
R2
AIC
1 0.125 .000 -55.66
3 0.071 .553 -85.89
3 0.068 .569 -86.67
3 0.072 .512 -83.58
3 0.078 .484 -77.13
5 0.085 .432 -64.27
k = Number of parameters.
44, 48). Consistent with Hypothesis 2.5 the speed of transition varied.
Consistent with Hypothesis 2.6 several individuals only partially adopted
sophisticated strategies (e.g., case 27). Figure 2.25 illustrates for each
function one participant where that function fit relatively well.
Consistent with Hypothesis 2.7, the onset of the strategy shifts was
greatest in early blocks. In broad terms, the frequency of shifts decreased
with practice.
114 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
1.0
Strategy Sophistication
0.8
●
● ● ●
0.6
● ● ●
● ●
● ● ●
● ●
●
●
● ● ● ● ●
● ● ● ●
●
●
●
●
0.4
●
0.2
0.0
0
5
10
15
20
25
Block
Figure 2.23: Study 3 strategy sophistication by block at the group-level.
30
2.5. STUDY 3 RESULTS
05 15 25
84
115
05 15 25
86
90
●●●●●●●●●●●●●●●●●●●●●● ●●●●●●●●●●●●●●●●●●●●●●●●●●●
●●●●● ●●●
●
●
●
72
●
●●●●●●●●●●●●●●●●●●●●●●
●
●●●● ●
●
76
48
Strategy Sophistication
18
79
80
81
57
28
159
6
67
39
4
40
161
167
168
17
●
● ●●
●●●● ●●●●●●●●●●●●●●●●● ● ●●● ●●●●● ●●●●●●●●●
●●● ● ● ● ●● ●● ●
●● ●
●
●
●
●●
●
141
58
43
0.8
0.4
0.0
●● ●●● ●● ●● ● ●●●●●●●●●●●●●●●●●●●●●
● ●●●●●●●●●●●●●●●
● ●●●●●●●●●●●●●●●●●●●●● ●●●●●●●●●●● ●●●●● ●●●
●
● ●●●●● ● ●●●●●●●●● ●
●● ●●
●●
●
●●●● ●●●●●● ●●●●
●
●
● ●● ● ● ●
●
●
● ●
● ●
● ●
●●●
●●●●●●●●● ● ●●●●●●●●●● ●
●●
●●
●
●
●
●●●
●●●
156
173
●● ●●● ●●●●●●●●●●●●●●
●
●●
●● ●● ●
●●
●
●●●●●
● ●
●●●●●●●●●●●●●●●●●●●●●●●●●●●●●● ●●●●●●●●●●●●●●●●●●●●●●●
●●●●●●●●●●●●●●●●●●●●●●●● ●
142
143
146
148
15
150
0.8
0.4
0.0
●●●●●●●●●●●●●●●●●●●●●● ●●●●●●●●●●●●●●●●●●●●●●●●●●●●●
●
●● ●
●
●●
●
●
●
●●●●●●●●●●●●●●●●●●●●●●● ●●●●●●●●●●●●●●●●●●●●●● ●●●●
● ●●● ●●●●●●●●●●●●●●
●● ●●●●●●●●●●●●●●●●●●●● ●●●●●●●●●●●●●●●●●●●●●●●● ●●●● ●●● ●● ●●●●●●●
●●●
● ●
●●● ● ●
● ● ● ●●●●●● ●
●
●●● ●●
● ● ● ● ●
●●
●
●●
●
●
●
●
●
●
●●
●●
●● ●
●
12
120
121
●●●●●●●●●●●●●●
●●●● ● ●
●
●
●● ●●●●
●●
●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●
1
0.8
0.4
0.0
50
27
●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●
0.8
0.4
0.0
78
0.8
0.4
0.0
● ●●●
●
●●
●
●
●●●●●●●●●●●●●●● ●●●●
●●●● ●●●●● ●●●●●● ●● ●●●●●●●●●●●●●●●●●●● ●●●
● ●●●●●●●●●●●●●●●●●●●●●
● ●●●●●●●●●●●●● ●●● ●●●●●●●●●●●●●●●●●●●●●
●● ● ● ●● ●
●●●
● ●●● ●
●●●
●
●●●● ● ●
● ●●
● ● ●
●●● ●●
●●
●
●
●
●
●
●●
●
●
●●●
●
●
●
201
●●●●● ●●●●●● ●●
● ● ● ●● ● ●●● ●
●●●
●
● ● ●●●●●●●● ●●
●
●
●●●●●●●● ●●● ●●
●● ●
140
77
49
●●●
152
●
●●●●●●●●●●●●●●●●●●●●●●●●●●●●●
●●●●●●●●●●●●●●●●●●●●●●●●●●●●●
●●●●●●●●●●●●●●●
●●●● ● ●●●●●●●●
●
●
● ● ●●●●●
●●●●●●●● ●●●●●●●
●
●●●●● ●
●
●● ●●● ●●● ●● ●●●
●
● ●● ● ● ● ●●
●
●
●
●
●
●
●
●●
●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●
●
●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●
44
● ●●●●●●●●●●●●●●●●●●●●●
●●●
●
●
●●●
● ●● ● ●●●●●●●
●●●●●● ●●●●●● ● ●
●
●●
●
75
●●●●●●●●●●●●●●●●●●● ●●●●●●●●●●●●●●●●●●●●●●●●●●
● ●● ● ● ●
●●
●●
●●●
●●
0.8
0.4
0.0
96
●
●
●
0.8
0.4
0.0
93
● ● ●●● ●●●●●● ●●●●●●●●●●●●●●●●●●●●●●●●●●●●
● ●● ● ● ●● ●● ●
●
●●●●● ●●
10
●
●
●
●●●●
● ●
● ●●●
●
●●●●●● ●●●●●●●
●●●
129
13
133
●
● ●●● ●●●●●
●●●●●●●●●●●●●●●●●●●●
●
●
● ●●●●●●● ●● ● ●●●●●
●● ● ●● ● ●●
●●
● ●●●●●●●●●●●●●●● ●● ● ● ●
●
●
●
●
●
05 15 25
102
108
● ●
●●●●●●●●●●
●
●●●●●●●●●●●●●●●●●
11
05 15 25
●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●
116
●●●●●●●●●● ●●●●●●●●●●●●● ●● ●●●●●●●●●●●● ●
●●●●
●●
●
●● ●●●● ● ●● ●●
●
●●●
137
●
●
●●●● ●●● ●●● ●●
● ●●●●● ●●●● ●●
●
●
●●
●●●●●●
101
● ● ●● ●● ●●
●
●●●●●●●●●●●●●● ●●●●●●●● ●●●● ●●●●● ● ●●●●●●●●●●●●●●●●●●●●●●●●●
●●
●
●●
●●
●●●●●●●●
●
●●●
●
●●
●
●
05 15 25
127
117
●●●●●●●●
●
● ●
●
● ●●●●● ●●●●●●
● ● ●● ●●
●
●●●●●●●●●●●●●●●●●●●●●●●●●●●●●
05 15 25
Block
Figure 2.24: Study 3 strategy sophistication by block at the individuallevel.
0.8
0.4
0.0
116 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
Cn: ID=168
MM: ID=173
1.0
1.0
●
●
●
Strategy Sophistication
0.8
0.6
0.4
0.2
●
●●●●●●●●●●●●●●●●●●●●●●●●●●●●●
0
5 10
Strategy Sophistication
1.0
20
0.2
0.0
0.4
0.2
0.0
0
5 10
20
●
30
●
●●
0
5 10
20
Block
Lg: ID=6
CnCn: ID=102
CnSE: ID=4
●
●
●
●
0.6
●
●
●
●
●
0.0
1.0
5 10
20
Block
30
30
1.0
●
0.8
0.6
●
0.4
●
● ●
● ● ●●●
●● ●
0.2
0.0
0
●
0.6
Block
●
0.2
0.4
●●
● ●●● ●●●
●●
●●
●
●●
● ●●
●
●
●
●
●
●
●
●
●
0.8
Block
●
0.4
0.6
30
● ● ●
● ●●
●
● ●
● ● ● ●● ●
●
● ●●
0.8
●●● ●
● ●
●●
●
●
● ●
●● ●
●
● ●
● ●
●
●
●
0.8
●
●●●●●●●●●●●●●●●●●
0
5 10
Strategy Sophistication
0.0
Strategy Sophistication
Strategy Sophistication
●
Strategy Sophistication
1.0
SE: ID=201
0.8
●●
●● ●
● ●●●● ● ●● ● ●●
●
●● ●
●
●●
0.6
●
●
0.4
●
●
●
●
0.2
0.0
20
Block
30
0
5 10
20
Block
Figure 2.25: Study 3 examples of model fits of the effect of trial on
strategy sophistication at the individual-level.
30
2.6. DISCUSSION
2.5.3
117
Practice, Strategy, and Performance
Within individual correlations between strategy sophistication and log
trial completion time over trials were obtained for participants in the No
Training Condition who had a standard deviation in strategy sophistication above 0.15 (45 out of 61 participants). The correlation tended to be
moderate and negative (M = -0.34, SD = 0.17, range: -0.58 to 0.44).
2.6
2.6.1
Discussion
Overview
This chapter aimed to model the relationship between practice, strategy, and performance. To achieve this aim, hypotheses were tested and
models were examined across three studies.
2.6.2
Practice and Performance
Group-Level
In all three studies, the relationship between practice and task completion
time at the group-level was better explained by a three parameter power
function than by a three parameter exponential function. The three
parameter power function also provided excellent fit at the group-level in
absolute terms explaining between 98% and 99% of variance. The degree
of superiority was greatest in Study 1. There was almost no difference
between four parameter power functions and APEX functions at the
group-level for all three studies. Functions with discontinuities generally
had worse fit than the continuous three parameter power function.
These results are consistent with the group-level findings of Newell
and Rosenbloom (1981) and Heathcote et al. (2000). Thus, they support a limited version of the Power Law of Practice that is confined to
118 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
group-level analysis. It is likely that the pooling of many noisy individual
learning curves with their varying functional forms, learning slopes, timing of outliers, and occasional variably timed mild discontinuities lead to
an averaged curve well explained by a three parameter power function.
While small deviations in group-level curves were obtained, these are
likely to be further smoothed over as sample sizes increase. Nonetheless,
this says little about the individual-level, which should be the level of
interest when developing a theory of psychological learning processes.
Individual-Level: Power versus Exponential
In studies one and two the relationship between practice and task completion time was better explained by a three parameter exponential function
than a three parameter power function at the individual-level. Thus, the
superior model at the individual-level differed from what applied at the
group-level. The size of the difference in average fit was relatively small.
However, given the limit on the amount of systematic variance to be
explained the observed differences are still noteworthy.
These results are consistent with those of Heathcote et al. (2000).
They support the repealing of the Power Law of Practice. Given that
it is the individual-level which is of fundamental importance for understanding psychological change and learning, the results further highlight
the problems associated with only examining the group-level relationship.
The general diversity in shapes and patterns in learning curves reiterated the importance of individual-level analyses. Graphical plots of
individual curves were also a powerful tool for understanding this variation. While this thesis focuses on modelling expected task completion
times, the plots showed clearly that other factors varied between individuals including frequency of outliers, trial-to-trial variability, and the
degree to which one model provided superior fit.
2.6. DISCUSSION
119
Nonetheless, power and exponential functions are both monotonically
decreasing and declerating in their functional form, and both approach
an asymptote. Both are consistent with a model of skill acquisition as the
progressive optimisation of an individual to a task and an environment.
Large improvements are attained initially before additional refinements
are made to smaller and smaller degrees and with less and less frequency
with practice. Thus, the differences between power and exponential models are not as profound as the differences that both power and exponential
have with discontinuous models.
Individual-Level: Discontinuities
The analysis of discontinuities yielded mixed results. Graphical inspection of learning curves suggested that meaningful discontinuities were
rare. There were only a couple of clear jump discontinuities where a
participant abruptly improved performance. However, there were other
cases where the learning curve looked to be made up of segments of varying learning rates, arguably separated by a point where the learning rate
slowed fairly abruptly.
The graphical observation that discontinuities were rare contrasted
with the model fits for tested discontinuity models. While the ConstantConstant model provided poor fit, the P2BP2 and E3B models both provided far superior fit than the monotonically decreasing and decelerating
three and four parameter functions. The superiority of E3B and P2B2B
functions applied both to biased measures of fit such as R-squared and
to parsimony adjusted measures such as AIC and RMSE.
There are several plausible reasons for the superior model fits of the
discontinuous functions. First, the distribution of residuals was both
skewed and often included outliers. Such outliers were often slow trials,
although once discontinuous models are under consideration, discrimi-
120 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
nating outliers from discontinuities can be challenging. Discontinuous
models were seemingly able to capture this noise more effectively than
the continuous monotonic functions. To reduce the effect of outliers on
model comparisons, future research could consider modelling outliers using some form of mixture model. Task completion time could be drawn
from a main model with some large probability as well as some outlier
generating process with a much smaller probability.
Second, the learning curves of many individuals had segment-like
qualities. Thus, even if the learning curves lacked jump discontinuities,
many learning curves seemed to contain discontinuities in the rate of
learning. This suggests that learning is a continuous process of accumulating skill that occurs in a stochastic manner. Thus, even if learning is
typically more rapid early in practice, learning may accelerate and decelerate over time. Discontinuous models are better able to capture such
shifts in the rate of learning, which differ from the smooth monotonically
decelerating functions.
Thus, it appears that discontinuous models have some merit in modelling individual-level learning curves. However, in terms of the text editing task studied in this thesis, the observed discontinuities were rarely
of the form discussed by Haider and Frensch (2002), i.e., abrupt improvement in performance. Despite the text editing task permitting
strategic insights, and despite these insights occurring, abrupt shifts in
performance were almost never observed. Thus, while group-level learning curves provide a biased representation of the learning curve at the
individual-level, the current study supports the idea that even on tasks
permitting discontinuous strategies, performance tends to remain cumulative.
Differences between the studies appear to be related to the structure
of the task. Study 1 involved a consistent set of editing requirements and
multiple edits per trial. Both these factors reduced the trial-to-trial vari-
2.6. DISCUSSION
121
ation in task completion. The clearest case of an observed jump discontinuity in performance occurred in this study. This is consistent with the
idea that such discontinuities depend on a meaningful simple and single
discontinuity allowing a jump to occur. Study 2 and Study 3 had varying
editing requirements across trials which increased the trial-to-trial variation in task completion time. Study 2 in particular had different types of
edits which led to the greatest trial to trial variation. Statistically, such
variation makes it harder to detect discontinuities. Theoretically, such
variation in task features may make it less likely that a single insight
can lead to an abrupt improvement in performance. Any insight that
does occur may need to be adapted to the dynamic task environment.
Furthermore, dynamic real-world environments may be more likely to
require a degree of generalisation.
Overall, the findings are consistent with the idea that discontinuous
improvements are a rare occurrence. Individual-level performance over
short periods of temporal aggregation and on a variable task is fairly
variable. The distribution of task completion time is often skewed and
involves outliers which can lead to the impression of discontinuities. However, there are instances, although rare where meaningful discontinuous
improvements do occur.
Implications
The above findings support the large scale analyses of learning curves
by Heathcote et al. (2000). They consolidate the critique of the proposed Power Law of Practice (Newell & Rosenbloom, 1981). Given that
the individual-level is the psychologically meaningful level of analysis,
if anything, an exponential law seems more appropriate. This also has
implications for researchers who have developed models which assume
that the learning curve is a power function such as Logan’s Instance
122 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
Model (Logan, 1988, 1992) and various cognitive architectures such as
Act-R (Anderson et al., 2004). However, given that both power and exponential functions are continuous and monotonically decelerating, the
theoretical adjustments may be relatively minor. The greater challenge
may be to capture the variability in learning curves and to model the
trial-to-trial variability.
The above findings also relate to researchers who have proposed that
learning curves at the individual-level contain discontinuities. Haider and
Frensch (2002) presented data on an alphabet string verification task and
found discontinuities in the learning curves of many of their participants.
They then showed analytically and by simulation that group-level curves
can smooth out discontinuities at the individual-level. They implied
that given the prevalence of strategy shifts, discontinuities in individual
learning curves are likely to be prevalent.
However, the results of the three studies suggested discontinuities
were rare. Group-level analyses did produce a biased representation of
the relationship and trial-to-trial variability, but discontinuities at the
individual-level were quite rare. As will be discussed shortly, abrupt
strategy shifts do not equate to performance discontinuities. Also, discontinuities that are observed are just as likely, if not more likely, to be
the result of some form of temporary drop in performance as opposed to
a performance improvement. Given that few researchers in the literature
have reported discontinuities, it may be that the dynamics in the study
by Haider and Frensch (2002) which led to the observed discontinuities
are unusual.
A major contribution of this thesis was the use of formal models of
discontinuities. Neither Haider and Frensch (2002) nor Heathcote and
colleagues (2000) tested such models. It would be interesting if future
researchers applied such models to a greater number of learning curves
at the individual-level.
2.6. DISCUSSION
123
Future research could further explore the role of parameter constraints
in modelling learning curves. The present thesis took a minimalist approach to parameter constraints, constraining only breakpoint parameters to ensure that breakpoints were meaningful. Further constratints
could be extended to ensure that predicted task completion time and
strategy sophistication are within plausible values beyond observed practice levels and toward asymptotic levels. Such constraints include requiring that asymptotic performance be above zero, and perhaps even above
some plausible minimum task completion time. Likewise, strategy sophistication models could have asymptotes constrained to be less than
or equal to one. Decisions about choosing constraints also raise challenges
about the purpose of modelling learning curves and the implications of
good fit achieved by psychologically implausible parameters.
Related to parameter constraints, future research could also explore
the issues related to model fit statistics. This thesis largely relied on
mean square error and AIC to compare models. However, once model
constraints are incorporated, or where the number of parameters differ,
then substantial debate exists about what fit statistic is most useful.
In particular, future research could explore BIC and cross validation
approaches. Models that capture outliers and non-normal residuals might
also provide a more accurate representation of the data and might assist
in evaluating the relative merits of competing models. The challenge is
for the model fit statistics to reward not just statistical fit, but also a
psychologically meaningful model.
2.6.3
Practice and Strategy
Group-Level
In all three studies, at the group-level the three parameter Michaelis–
Menten function provided excellent fit to the relationship between prac-
124 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
tice and strategy sophistication explaining over 97 percent of variance.
Four key aspects of the observed relationship captured by the function
were that: (a) strategy use showed some level of sophistication at the
start of practice, (b) strategy sophistication increased monotonically with
practice, (c) the rate of increase in strategy sophistication decreased
monotonically with practice, and (d) strategy sophistication appeared
to be approaching an upper level asymptote toward the end of practice
that was less than one.
The superiority of Michaelis–Menten varied across the three studies.
In Study 1, several functions including the Logistic and the Saturated
Exponential provided almost as good fit as the Michaelis–Menten. In
Study 2 the superiority of Michaelis–Menten was larger but still small,
and in Study 3 the superiority was relatively large. This superiority
also mirrored the relative ordering of key models, which is to say that
Study 2 and Study 3 emphasised more clearly which models were second,
third, and fourth best. A clear ordering of models emerged in Study 3
with Michaelis–Menten fitting best followed by Saturated Exponential,
Logistic, and then Constant–Saturated Exponential.
The increase in model differentiation across the studies mirrored the
size of strategy change in the three studies. Study 3 had the greatest
increase in strategy sophistication with practice, whereas Study 1 had
the smallest increase. The strategy sophistication curve in Study 3 was
prominently decelerating with rapid strategy change occurring in early
blocks. In contrast, Study 1 had lower levels of strategy sophistication
and lower levels of strategy change. The curve departed minimally from
linearity.
There are several explanations for these differences. First, Study 3
had the most comprehensive initial strategy training of the three studies.
Second, the repetitive nature of the deletion task in Study 3 relative to
Study 2 may also have facilitated use of sophisticated strategies. Sim-
2.6. DISCUSSION
125
plicity should reduce attentional demands, reduce the complexity of generalisation, and make strategies specific to the particular problem (e.g.,
deletion) more effective. Third, the sample of third year university students used in Studies 2 and 3 appeared to have more prior experience
than the mixed university and general population sample used in Study
1.
The above provides a model of how strategy change occurs at the
group-level. However, as suggested by theory and supported by results,
the group-level provides a misleading representation of the relationship
between practice and strategy sophistication at the individual-level.
Individual-Level
Functional form The main hypotheses regarding the relationship between practice and strategy sophistication at the individual-level were
supported in all three studies: (a) Some individuals never shifted either
because they started with low levels or high levels of strategy sophistication and persisted at that level throughout practice; (b) shifts varied in
whether they were abrupt or gradual; (c) probability of a strategy shift
decreased with practice; and (d) many participants only partially shifted
to the sophisticated strategy.
Results supported the argument put forward by Siegler and others
(e.g., Lemaire & Siegler, 1995; Siegler, 1987, 1988a) that the relationship
between practice and strategy use needs to be analysed at the individuallevel. Trajectories varied substantially between individuals. Also, the
individual trajectory tended not to resemble the group-level relationship.
For those individuals that changed, strategy use a somewhat abrupt onset
and a relatively rapid transition was a common pattern. This contrasts
with the gradual increase starting from the first trial as suggested by the
group-level model. While such a model did occur at the individual-level,
126 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
it was only suitable for a subset of participants.
Of those individuals that did shift, the degree to which strategy shift
was gradual or abrupt also varied between individuals. While an arbitrary cut-off could be given to classify a shift as either gradual or abrupt,
a more meaningful representation would describe the shift in terms of
the degree to which it was gradual. Some shifts took only two or three
trials to complete whereas others took ten or twenty to complete. Thus,
assessment of the degree to which shifts were abrupt is contingent on the
number of trials allowed in the transition period.
In addition, the concept of ‘shift’ varied in its degree of applicability.
First, even once a shift had occurred, individual trials with lower levels
of sophistication still occurred. In many cases the proportion of these
low-sophistication trials tended to decrease with practice. Thus, even
when the shift was largely abrupt, it still often took time for it to be
complete.
It is also inappropriate to describe the relationship between practice
and strategy sophistication in terms of a single functional form. The
variation in functional form was much greater than for the relationship
between practice and task completion. Each of the six models proposed
were effective in modelling the relationship between practice and strategy sophistication for some participants. The models captured various
combinations of gradual and abrupt change in strategy sophistication,
gradual and abrupt onset of strategy shift, immediate and delayed onset
of strategy shift, and whether a shift occurred or not.
Implications for Learning Curves In addition to understanding the
relationship between practice and strategy use for its own sake, this thesis
was motivated by a desire to resolve the tension between a discontinuous
relationship between practice and strategy use, and a continuous relationship between practice and performance. The results suggest both that
2.6. DISCUSSION
127
this tension is real and also that it is not as great as some authors have
implied (e.g., Haider & Frensch, 2002). Of the variety of functional forms
for the relationship between practice and strategy sophistication, many
were more discontinuous than individual-level learning curves. However,
the majority of strategy shifts do take a period of time to complete which
would tend to reduce the potential for discontinuous improvement in performance.
The tendency for strategy shifts to occur early in practice makes their
effect on performance more difficult to discern. In early trials performance is more variable, and a large amount of improvement is occurring
due to a range of learning processes. Isolating the unique contribution
of strategy shift is often not possible.
Adaptivity Another aspect of the observed relationship between practice and strategy sophistication is that very few individuals moved from
a sophisticated strategy to a simpler strategy. In the few cases where
this occurred, the change to the simpler strategy seemed to occur after
the participant tried the more sophisticated strategy for only a few trials.
There are several explanations for this overall pattern of results. First,
the strategies were sufficiently superior for most participants to persist
with them. Second, participants were adaptive in their strategy choice
and sensitive to small improvements in strategy effectiveness. Finally,
individuals able to use the strategies well were more likely to consider
the strategy and try it.
Comparison with Algorithm-Retrieval Shift The relationship between practice and strategy sophistication was also different to what is
typically found in studies looking at algorithm-retrieval shift (e.g., Delaney et al., 1998; Rickard, 1997, 1999). In algorithm-retrieval studies
most participants shift to the superior retrieval strategy, and the rela-
128 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
tionship between practice and probability of using the retrieval strategy
tends to be sigmoidal or exponential. The reason for the greater uptake
in retrieval studies is probably related to the uptake process of retrieval
responses which are facilitated by mere exposure to stimulus and correct
response. In contrast, in text editing, the simple strategy does not necessarily suggest the more sophisticated strategy. Greater practice may
increase the motivation to find a quicker strategy, but if the strategy is
not in the repertoire or its benefits are not known, then it may never be
adopted. This was seen in Study 1 where few hints were given regarding
sophisticated strategies and there was minimal uptake.
Differences in the timing of algorithm-retrieval and simplesophisticated strategy shifts may be related to how the strategy is acquired. Sophisticated editing strategies often enter through some form
of declarative representation. In addition to being able to execute the
strategy, it often takes time to learn to execute the strategy fluently and
quickly. In contrast, trying to execute a retrieval strategy is useless without a representation of the correct response in memory. Such a memory
representation typically takes many exposures, especially where the size
of the set of items is large.
The probability of a shift appears to decrease monotonically with
practice. This is consistent with the idea that when engaging in a repetitive activity, participants initially explore the environment and then settle into a satisficing routine (e.g., exploration and exploitation models,
Erev et al., 2008). Furthermore, practice using an adopted strategy often
leads to greater proficiency in the adopted strategy and, thus, a greater
performance and automaticity cost associated with switching strategies.
Text editing strategy shifts may also be similar to other types of
strategy shifts that involve an element of discovery (e.g., Luchins, 1942)
in that the options are not known by all individuals. The main difference
is that on typical insight based tasks, discovery is required to perform a
2.6. DISCUSSION
129
task at all, whereas in the case of text editing strategies, it merely results
in suboptimal performance. Furthermore, individuals may not be aware
that their performance is suboptimal, because they are unaware of the
benefits of the more effective strategy. Further explaining the monotonic
reduction in rate of strategy shift is the idea that individuals differ in
their readiness to shift strategies. For those ready to shift, the shift
occurs early; for those not ready to shift, the shift may not happen at
all.
Summary There were also differences at the individual-level between
the studies. First, there was the previously mentioned greater uses of
sophisticated strategies in Studies 2 and 3 relative to Study 1. Second,
the trial-to-trial variation in strategy use was much lower in Study 1 than
in Studies 2 and 3. Two factors that are likely to explain most of these
differences are first that the text editing requirements were constant in
Study 1 and variable in Studies 2 and 3. Constant editing requirements
should be more likely to induce a regular pattern of responding. Second,
trials in Study 1 involved multiple edits whereas Studies 2 and 3 involved
a single edit. Aggregation alone should serve to smooth out variability.
This finding is broadly consistent with Siegler’s (1987) Overlapping Wave
Theory, which suggests that strategies overlap in usage over time with
more sophisticated strategies gradually replacing simpler ones.
The overall pattern of results is qualitatively consistent with Lemaire
and Siegler’s (1995) proposal of four types of strategy change that occur
with practice: entry into repertoire, increased use of efficient strategies,
improved execution of strategies, and improved selection of strategies.
The findings are also broadly consistent with models of how individuals
keep track of the success of strategies, as captured in many models of
learning such as ACT-R.
Overall, this thesis aimed to improve models of the relationship be-
130 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
tween practice and strategy use by combining trial-level strategy measurement, graphing, and modelling of strategy sophistication at the
individual-level. Previous studies have rarely used such a combination,
particularly on tasks characterised by a simple–sophisticated strategy
shift. The models, particularly at the individual-level, suggest a more
nuanced account of the concept of abrupt versus gradual shift, and provide additional data that cognitive architectures need to consider. In particular, the results provide a strong warning against drawing inferences
about individual-level strategy shift processes from group-level data.
2.6.4
Practice, Strategy, and Performance
A number of findings emerged regarding the relationship between strategy use and performance and its potential effect on the learning curve.
First, strategy sophistication tended to increase with practice. Second,
the period of greatest increase in strategy sophistication was associated
with the period with the greatest change in task completion time. Third,
while strategy shift was often abrupt, discontinuities in performance were
rare. Fourth, at the level of intra-individual variation, strategy sophistication was correlated with task completion time.
Taken together, these findings suggest that increased strategy sophistication contributes to an eventual improvement in performance. Increasing strategy sophistication can be seen as partially mediating the effect
of practice on performance.
Given that participants shifted towards more sophisticated strategies
and rarely away, such individuals were presumably aware implicitly or
explicitly of the benefit in using the more sophisticated strategies. Similarly, it may be that participants who did not adopt more sophisticated
strategies were also those less likely to benefit from the sophisticated
strategies. However, it is also possible that participants were unaware
2.6. DISCUSSION
131
of the potential benefits of the strategies or had some other reason to
persist with simpler strategies.
Several pieces of evidence also suggest that the initiation of a strategy
shift is only the beginning of realising the performance benefit. First,
while the onset of a strategy shift was often abrupt, it often took several
trials for the generalisation of the strategy shift to be complete. Second,
the time for strategy sophistication to emerge following initiation suggests
that it takes time for participants to learn how to apply the strategy.
Third, a theory driven analysis of the benefits offered by some of the
more sophisticated strategies suggest that even once expertise is acquired,
improvements are modest relative to to other factors, such as learning
the task interface, learning the task, learning how to do text editing
in general, and sources of other randomness such as variation in the
type of trial, and random variation in effectiveness and distraction. In
combination, this leads to less abrupt performance benefits from strategy
shift.
While not detracting from the actual effect of strategy shift on performance, several factors appear to reduce the ability to detect the effect
of strategy shifts on performance. First, because strategy shifts tend to
occur earlier in practice, any performance benefit of a shift tend to occur
at a time when both other learning processes are occurring and also when
error variance is greatest. Second, in general, and particularly in Studies
2 and 3, the error variance tends to be fairly large at the individual-level.
These factors add noise to any estimation of the effect of strategy shift,
and thus, make it difficult to detect discontinuities in the learning curve.
Several differences between the studies are likely to influence the degree to which the effect of strategy shift results in a discontinuity in the
learning curve. First, Study 1 had a consistent set of edits, whereas
Studies 2 and 3 varied the editing requirements. Varying the editing
requirements appeared to lead to longer periods of strategy generalisa-
132 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
tion. Second, Study 1 and 2 were more complex than Study 3 in the
editing requirements that they entailed. This greater complexity meant
that adopting a sophisticated strategy involved making more changes,
which were less likely to occur in combination and more likely to occur in parts. The greater complexity also meant that a single strategic
change was less likely to result in abrupt improvement. Third, Studies 2 and 3 had greater error variance relative to the overall amount of
learning, which would make strategy shifts harder to detect against this
backdrop of noise. Fourth, the degree to which the prior knowledge of
participants, the task itself, and the instructions makes strategy shift
both possible and apparent varied between the studies. Studies 2 and
particularly 3 provided greater motivation and instruction on using the
shortcut keys. This manifested in a greater number of strategy shifts
with the consequent greater potential for a discontinuity to be observed.
These findings suggest modification to the thinking of several researchers on the relationship between practice, strategy use, and performance.
Crossman (1959) made an early attempt to explain the learning curve
in terms of changing probabilities of strategy use. Practice was assumed
to give rise to feedback of strategy effectiveness which led to an updating
process of strategy use probabilities. First, the present results show that
strategy use and performance are not synonymous. The findings show
that the effectiveness of a strategy varies over time, and between individuals. Second, a strategy may have zero probability of use at a given
moment if it is not in the repertoire. Thus, prior knowledge, instruction, motivation, and contextual affordances are likely to be important
influences of strategy use.
Similarly, while not treating performance and strategy use as synonymous, Haider and Frensch (2002) seem to argue for a greater correspondence between strategy shift and performance shift than is suggested
2.6. DISCUSSION
133
by the results in this thesis. In this thesis, relatively abrupt strategy
shifts sometimes occurred, yet abrupt performance shifts almost never
occurred. The theory proposed in this thesis provides a reconciliation of
this.
2.6.5
Conclusion
The results of the three studies refine models of the relationship between
practice, strategy use, and performance. They reinforce the importance
of the individual-level measurement and provide a means of reconciling
discontinuous effects of practice on strategy sophistication with continuous effects of practice on performance.
134 CHAPTER 2. PRACTICE, STRATEGY, AND PERFORMANCE
Chapter 3
Individual Differences
Contents
3.1
3.2
3.3
3.4
Introduction
. . . . . . . . . . . . . . . . . . . 136
3.1.1
Overview . . . . . . . . . . . . . . . . . . . . 136
3.1.2
Theories and Frameworks . . . . . . . . . . . 137
3.1.3
Predictors of Performance . . . . . . . . . . . 139
3.1.4
Predictors of Strategy Sophistication . . . . . 143
3.1.5
Strategy Sophistication and Performance . . 145
3.1.6
The Current Studies . . . . . . . . . . . . . . 146
Method . . . . . . . . . . . . . . . . . . . . . . 146
3.2.1
Participants . . . . . . . . . . . . . . . . . . . 146
3.2.2
Predictor Measures . . . . . . . . . . . . . . . 147
3.2.3
Criterion Measures . . . . . . . . . . . . . . . 154
3.2.4
Analysis Plan . . . . . . . . . . . . . . . . . . 154
Study 1 Results . . . . . . . . . . . . . . . . . 155
3.3.1
Preliminary Analyses
. . . . . . . . . . . . . 155
3.3.2
Hypothesis Testing . . . . . . . . . . . . . . . 156
Study 2 Results . . . . . . . . . . . . . . . . . 159
135
136
CHAPTER 3. INDIVIDUAL DIFFERENCES
3.5
3.6
3.1
3.1.1
3.4.1
Preliminary Analyses
. . . . . . . . . . . . . 159
3.4.2
Hypothesis Testing . . . . . . . . . . . . . . . 159
Study 3 Results . . . . . . . . . . . . . . . . . 162
3.5.1
Preliminary Analyses
. . . . . . . . . . . . . 162
3.5.2
Hypothesis Testing . . . . . . . . . . . . . . . 162
Discussion . . . . . . . . . . . . . . . . . . . . . 164
3.6.1
Overview . . . . . . . . . . . . . . . . . . . . 164
3.6.2
Performance
3.6.3
Mediational Model . . . . . . . . . . . . . . . 167
3.6.4
Conclusion . . . . . . . . . . . . . . . . . . . 168
. . . . . . . . . . . . . . . . . . 164
Introduction
Overview
The previous chapter showed how individuals differ substantially in strategy sophistication and task performance. It showed how group-level analyses lead to a biased representation of the individual-level. It showed how
no one functional form can describe the relationships between practice,
strategy, and performance.
If individuals differ in their learning processes, a natural question is
what predicts this variability. Such an approach fits into the broader
research enterprise of building links between psychometrics, individual
differences, and cognitive psychology. Such an enterprise is captured in
the work of researchers such as Ackerman (e.g., Ackerman, 1992, 1990,
1989, 1988, 1987; Ackerman, Beier, & Boyle, 2002; Ackerman, Kanfer, &
Goff, 1995) and many others (e.g., Ecker, Lewandowsky, Oberauer, Chee,
& Ecker, 2010; M. D. Lee & Webb, 2005; Lovett, Reder, & Lebiere, 1997).
3.1. INTRODUCTION
137
This chapter builds on this research tradition by examining relationships between predictor and criterion measures on the text editing task,
and thereby addressing Aim 2 of this thesis. Specifically, connections
between four types of predictors and two criterion measures are examined. The four types of predictors are (a) ability, (b) prior experience, (c)
personality, and (d) demographics. The two types of criterion measures
are (a) task performance, and (b) strategy sophistication.
Many studies already exist that look at such predictor–criterion relationships. Whilst more modest than the previous chapter, the present
chapter still aims to make at least two unique empirical contributions.
First, it examines predictors of strategy sophistication, a variable rarely
studied by previous researchers. Second, a model of strategy mediating
the effect of predictors on task completion time is tested. This chapter
also complements the previous chapter by focusing on individual differences at the trait-level.
To achieve these aims, the subsequent literature review is organised
as follows. First, general theories and frameworks for predicting task performance are presented. Second, empirical evidence is discussed regarding the prediction of performance from demographics, prior experience,
ability, and personality. Third, prediction of strategy sophistication is
discussed.
3.1.2
Theories and Frameworks
Predicting performance has a long and contentious history in psychology (for a discussion, see Neisser et al., 1996). In addition to improving
decision making in contexts such as employee selection (for a review,
see P. Sackett & Lievens, 2008) and allocation of educational opportunities (for a review, see P. Sackett, Schmitt, Ellingson, & Kabin, 2001),
researchers have also been interested in using individual differences to
138
CHAPTER 3. INDIVIDUAL DIFFERENCES
understand information processing models of cognition (e.g., Ackerman,
1987, 1988; Adams, 1987; Fleishman, 1972; Keil & Cortina, 2001). The
challenge for research on individual differences is to explain why observed
relationships vary as a function of factors such as type of predictor, type
of criterion, context, and sample characteristics.
Various general frameworks have been proposed. First, meta-analyses
and other validity generalisation studies (e.g., Burke, 1984; Schmidt &
Hunter, 1977) have revealed the degree of robustness of factors such as
intelligence in predicting factors like job performance. However, theory
and empirical findings in meta-analyses typically indicate that the true
effect size varies between studies. This leads to the challenge of identifying key moderators of this effect.
Second, many but not all (see Patrick, 1992) individual difference researchers endorse a version of the cognitive correlates perspective whereby
the size of correlation between predictor and performance is assumed to
be related to the importance of that predictor for task performance (for
a discussion, see Proctor & Dutta, 1995). The cognitive correlates approach applies both to general enabling abilities and models of specific
transfer, such as Identical Elements Theory (Thorndike & Woodworth,
1901) and modern day variants (e.g., A. Kramer, Strayer, & Buckely,
1990; MacKay, 1982; M. Singley & Anderson, 1985; M. K. Singley &
Anderson, 1989). These ideas regarding ability–performance correlations
have been extensively explored in studies by Fleishman (Fleishman &
Fruchter, 1960; Fleishman, 1972) and Ackerman (Ackerman, 1987, 1988,
1989, 1990, 1992; Ackerman & Woltz, 1994; Beier & Ackerman, 2005;
Kanfer & Ackerman, 1989).
Third, there is a general principle that correlations between predictors
and outcomes are stronger if they correspond in terms specificity. Thus,
stable traits such as intelligence and personality are assumed to be better
predictors of long term measures of performance such as job performance.
3.1. INTRODUCTION
139
In contrast, performance on a specific occasion is assumed to be better
predicted by state variables close in time to the performance occasion.
As with temporal congruence, there is a similar notion with regards to
the generality of the domain, such that domain specific predictors should
predict domain specific outcomes, and domain general predictors should
predict domain general outcomes. The implication of this for experimental tasks is that relevant prior experience and related skills should be
better predictors of task performance than general abilities.
While these general frameworks have value, it is important to be
critical of causal interpretations of predictor–criterion correlations. A
predictor may be an enabling capacity or it might just be a sign. And if
it is a sign, it may just be epiphenomenal or it may have led to the third
variable. It is important to consider alternative causal interpretations
particularly regarding differences in prior experience of participants and
how this might be related to other predictors.
A second issue that arises in relation to testing theories of predictor–
criterion relationships is that the hypothesis testing orientation is not
always suited. The issue is often not whether there is a relationship,
but rather the relative size of the relationship. To deal with this issue,
proposed hypotheses are qualified with the terms, small, medium, and
large corresponding to approximate expectations of correlations around
0.10, 0.30, and 0.50 respectively. These adjectives corresponds to Jacob
Cohen’s (1992) influential rules of thumb for correlations in social science
research.
3.1.3
Predictors of Performance
Demographics
Age and gender have all been extensively studied as predictors of performance. While researchers have often showed differences between younger
140
CHAPTER 3. INDIVIDUAL DIFFERENCES
and older adults (e.g., Hertzog, Cooper, & Fisk, 1996; Rogers, Hertzog, &
Fisk, 2000), differences within younger and middle-aged adults are often
expected to be minimal. When differences do arise, it is interesting to
consider the role of declining abilities with age as opposed to differences
in prior experience often related to cohort effects. Likewise, some studies
find gender differences (for a discussion, see Ackerman et al., 1995), and
similar questions arise over differences in prior experience related to the
kinds of tasks used in skill acquisition experiments. In particular, males
often perform better on computer based tasks common in skill acquisition
experiments (e.g., Ackerman et al., 1995). Thus, it was expected that
on the text editing task, males would be slightly better than females and
younger adults would be slightly better than older adults.
Prior Experience
Prior experience is generally a good predictor of task performance.
McDaniel, Schmidt, and Hunter (1988) provides a discussion of experience predicting job performance. In the context of experimental designs,
despite tasks often being selected for their novelty, prior experience is
still relevant. For example, when the task involves using computers, experience with video games is often a good predictor. Rabbitt, Banerji,
and Szymanski (1989) obtained a correlation with task performance of
.33 for self-rated and .40 for experimenter rated video game experience.
This links with theories of transfer, which often show somewhat weak
correlations with other specific skills. However, task analytic approaches
to selecting predictors of job performance are often used. In particular, where the period of practice is short relative to a life of potentially
relevant prior experience, this prediction may be greater.
The present studies used a keyboard based text editing. Based on
task analysis, many prior experience measures should be relevant such as
3.1. INTRODUCTION
141
amount of experience with keyboards, computers, word processors, and
other text manipulation tools. Experience with keyboard-based video
games may also be relevant.
Thus, it was hypothesised that
Hypothesis 3.1 Typing speed is strongly positively correlated with text
editing performance.
Hypothesis 3.2 Prior experience is strongly positively correlated with
text editing performance.
Ability
Before discussing the prediction of ability, it is worth reviewing what
is meant by the term ‘ability’. In contrast to skill, and following on
from Fleishman (1972), ability is treated as a (a) general, (b) relatively resistant to change, (c) enabling capacity. Intelligence as g is
the best known and most studied ability, yet many other representations have emerged. Several multi-faceted and often hierarchical representations of cognitive ability have been proposed (e.g., J. B. Carroll,
1993). Other researchers have focused on non-cognitive abilities (e.g.,
Fleishman, 1972). Ackerman (1988) adapted the multidimensional scaling analysis of Marshalek, Lohman, and Snow (1983) and divided ability
into general cognitive, perceptual speed, and psychomotor ability.
Empirical evidence generally shows strong relationships between ability measures and task performance. Meta-analyses typically show strong
correlations between intelligence and job performance. For example,
Hunter and Schmidt (1998) report a mean meta-analytic correlation between general mental ability and job performance of r = .63. Other researchers such as Ackerman (1988) and Fleishman (1972) have focused on
changes in ability–performance correlations over time (for a meta analysis, see Keil & Cortina, 2001). In general the literature has shown logical
142
CHAPTER 3. INDIVIDUAL DIFFERENCES
connections between particular abilities and the apparent demands of
tasks (see Farina & Wheaton, 1971).
Thus, it was hypothesised that:
Hypothesis 3.3 Ability is moderately positively correlated with text editing task performance.
In this thesis, ability is operationalised in ways consistent with Ackerman
(1988) in terms of general cognitive, perceptual speed, and psychomotor
abilities. Differential prediction of the three classes of abilities was of
interest, but no specific hypotheses were proposed. The meta-analysis by
Keil and Cortina (2001) broadly suggested stronger correlations with task
performance for cognitive and perceptual speed than for psychomotor
ability. However, Ackerman (1990) has shown evidence of prediction for
all three types of ability on an air-traffic control simulation, which mirrors
some general features of text editing.
Personality
Personality traits have received increased attention over the last 20 years
as potential predictors of performance. This can be attributed to two
major factors. First, the Big 5 model of extraversion, neuroticism, conscientiousness, agreeableness, and openness (or intellectance) has received
increased acceptance and has emerged as a useful means of synthesising
the diverse set of non-ability based individual difference constructs (see
Digman, 1990; L. Goldberg, 1993). Second, building on the Big 5, influential meta-analyses have tended to show weak but positive (negative for
neuroticism) correlations between the Big 5 scales and job performance
with conscientiousness in particular emerging as a consistent predictor
(see Barrick & Mount, 1991; Tett, Jackson, & Rothstein, 1991).
Some attempts have also been made to integrate ability and personality perspectives (e.g., Ackerman et al., 1995) within the training context.
3.1. INTRODUCTION
143
In general, theories often suggest that personality is associated with what
an individual ‘will do’, whereas ability is concerned more with what they
‘can do’. If performance requires effort and ability, personality is assumed
to be a positive predictor of discretionary allocation of effort.
However, there are several reasons to expect self-report measures of
personality to be unrelated to task performance in laboratory settings.
First, even in applied settings, correlations tend to be small (see Barrick & Mount, 1991; Tett et al., 1991). Second, laboratory experiments
minimise the role of discretionary effort. The nature of the experimental
environment is typically short enough and sufficiently monitored to align
more with what P. R. Sackett, Zedeck, and Fogli (1988) call a maximum
performance situation, as opposed to a typical performance situation.
Third, self-reported measures of style and approach are often relatively
poor indicators of competencies. Competencies rather than motivational
tendencies are likely to be better predictors of performance, particularly
in settings where discretionary effort is largely controlled. Thus, it was
hypothesised that:
Hypothesis 3.4 Big 5 personality factors are unrelated to text editing
task performance.
3.1.4
Predictors of Strategy Sophistication
While there is a lot of evidence of what predicts performance, there is
much less evidence of what predicts strategy sophistication. A starting
position is that the same things that predict task performance also predict
strategy sophistication.
Prior Experience
Prior experience represents a more domain specific pathway of acquiring
sophisticated strategies. With practice, people often adopt more sophis-
144
CHAPTER 3. INDIVIDUAL DIFFERENCES
ticated strategies, although it is not uncommon for this to asymptote at
suboptimal levels. This would suggest that variables like typing speed
and prior experience with text editing would be associated with greater
text editing strategy sophistication.
Ability
Kyllonen et al. (1984) provides an early example of studying the relationship between ability and strategy use. Kyllonen et al. (1984) had 30 participants complete various ability tests followed by a task that required
the construction of designated shapes from assorted pieces. Participants
were classified in terms of the shape construction strategies that they
adopted. While results suggested that ability was related to strategy
use, the sample size was too small to draw clear conclusions.
In another article, Schunn and Reder (2001) report three studies that
looked at correlations between a battery of ability tests and adaptive
strategic use of runway landing strategies on a simplified air-traffic control task. Results generally showed that approximately ten percent of
measured ability tests were significantly correlated with operationalised
strategy adaptivity. However, the choice of analyses was problematic.
Running significance tests for each test in a large battery, particularly
with small sample sizes, leads to problematic inferences. For example, if
the true population correlation with task performance of all tests was .20,
sample correlations would vary from test to test. By chance alone, some
tests would be statistically significant and others would not. Furthermore, given the large correlations between individual tests, it is unlikely
that any one test is a pure measure of an individual cognitive component. Factor analytic approaches that reduce individual tests into general
measures are likely to yield more parsimonious and plausible mappings
between predictors and strategy sophistication. They also reduce the
3.1. INTRODUCTION
145
problems associated with multiple significance tests because fewer tests
need to be run. In summary, Schunn and Reder (2001) do suggest correlations exist between practice and strategy adaptivity, but the nature
of differential predictions based on ability classes remains unclear.
Ackerman (1988, p. 295) theorised that the “efficacy of the initial
productions formulated in Phase 1 of skill acquisition [are] a function of
general-broad content abilities”. This builds on ideas of Anderson and
colleagues (e.g., Anderson, 2000, 1993, 1990, 1987) who propose that an
important component of skill acquisition is the process of knowledge entering declaratively before it becomes proceduralised. This would suggest
a correlation between general ability and strategy sophistication.
It also seems likely that instruction could moderate the relationship
between predictors and strategy sophistication. Considering that strategy sophistication can be manipulated with instruction or practice, the
relationship between ability and strategy sophistication may vary. Effective top-down reasoning is a defining feature of cognitive ability. However, instruction or prior knowledge can presumably facilitate the discovery of such insights. Thus, it may be that across the three studies
reported in this thesis, as the degree of instruction increases, the correlation between ability and strategy sophistication may decline.
3.1.5
Strategy Sophistication and Performance
In considering models of the relationship between practice, strategy, and
performance, it is necessary to consider the correlation between strategy
sophistication and performance. As a matter of interest, sophisticated
strategies are generally chosen for study because they are superior in some
sense. Thus, it is expected that strategy sophistication will correlate
positively with performance. However, the size of this correlation should
depend on task and strategy characteristics.
146
CHAPTER 3. INDIVIDUAL DIFFERENCES
In general, studies with larger correlations between strategy sophistication and performance are likely to have a more similar pattern of
correlations with other predictors. Researchers looking at age differences
on algorithm-retrieval tasks have found that older adults are less likely
to use a retrieval strategy and are slower on both retrieval and algorithm
strategies (e.g., Touron et al., 2004). Given this, it seems plausible that
strategy sophistication partially mediates the effect of predictors on task
performance. Ability may lead to greater insight into strategy superiority, and this represents one mechanism by which ability measures produce
correlations with performance.
In summary it was hypothesised that:
Hypothesis 3.5 Strategy use partially mediates the relationship between
ability and prior experience on task performance.
3.1.6
The Current Studies
The results presented in this chapter are based on the same three studies presented in the previous chapter. Each of these studies measured
demographics and prior experience. Studies 1 and 2 measured ability,
and Studies 2 and 3 measured Big 5 personality. All studies measured
strategy sophistication and task completion time on a text editing task.
Performance was aggregated over practice to form an overall measure.
3.2
3.2.1
Method
Participants
Participants for Study 1 were the same as reported in Chapter 2. For
Study 2 measures were collected at different times points and resulted in
some missing data among the retained sample: 131 (85.1%) completed
the personality test, 143 (92.9%) completed the paper ability tests, 149
3.2. METHOD
147
(96.8%) completed the computerised ability, typing, and prior experience
measures. By analysing performance only on the performance blocks
prior to the mid-practice training, Study 3 analyses were based on the
whole sample (n = 154) across the three conditions, but only a subset of
participants (n = 104) completed the personality test.
3.2.2
Predictor Measures
Overview
The following sections describe the individual difference measures in the
three studies organised under the categories of demographics, prior experience, typing speed, ability, and personality. Each study used a different
but overlapping set of measures as outlined in Table 2.5 on page 58, Table 2.9 on page 65, and Table 2.11 on page 74. In brief, Study 1 did
not have personality measures, Study 2 measured both personality and
ability, and Study 3 did not measure ability.
Demographics
All studies measured age and gender.
Prior Experience
Study 1 Prior Experience Scale The measure contained 17 items.
Each item was one of three types. Five items assessed self-reported experience and physiological readiness, five items measured self-reported
competence in word processing tasks, and seven items tested participant
knowledge of text editing shortcut keys. Each response option was assigned a weight related to the degree to which it was theorised to reflect
prior text editing experience. Items and weights are shown in appendix
A.2.1. To calculate a measure of prior experience, a raw score was first
calculated as the sum of the weights for the selected response options.
148
CHAPTER 3. INDIVIDUAL DIFFERENCES
The raw score was converted to a scale score such that the minimum
possible score was zero and the maximum possible score was 100. This
can be expressed in mathematical notation as
P E raw =
k
X
P E i,
i=1
and
P E scaled = 100 ×
P E raw − min (P E raw )
,
max (P E raw ) − min (P E raw )
where P E i is the weight for the i = 1, ..., k th prior experience question,
and max (P E raw ) and min (P Eraw ) refer to the maximum and minimum
possible scores respectively.
Study 2 Prior Experience Scale A quick two-item measure of prior
experience was used that showed good predictive validity (see correlations in Table 3.4). Question 1 (Q1 ) was “when typing do you look at
the keys?” and Question 2 (Q2 ) was “prior to the training in the previous class, how often did you use the SHIFT key to select text?” Both
questions had response options: 1= “Never”, 2 = “Almost Never”, 3 =
“Sometimes”, 4 = “Usually”, 5 = “Always”. Prior experience was calculated to be
Q2 +6−Q1
.
2
A third question, “On average, how many hours
per week do you spend using a computer?”, was considered for inclusion
but showed no predictive validity and hence was excluded.
Study 3 Prior Experience Scale The scale included 14 questions.
Each question was closed-ended and each response option was assigned
a non-negative integer weight. Larger weights were assigned to response
options that were theorised to be associated with better performance
on the text editing task. Question wording, response option wording,
weights, and the rationale for the weights are shown in Appendix C.1.1.
Items were generally good predictors of task performance. The weights
3.2. METHOD
149
were developed based on knowledge of the factors that influence individual differences in text editing. This knowledge in turn was based on (a)
observations of participants in previous text editing experiments, (b) task
analysis, and (c) theories of transfer. The method of calculating prior
experience was the same as that used for the Study 1: Prior Experience
Measure.
Typing Speed
Ten Thumbs Typing Test The typing test was locally developed (see
Armstrong, 2000). Participants were required to type random grammatically correct sentences for two minutes. If participant typed an incorrect
letter the program would not allow the cursor to proceed until the correct
letter had been entered. The measure of typing speed was average words
typed per minute.
Anglim Typing Test Studies 2 and 3 used an improved measure of
typing speed. The typing test was developed and reported in Anglim and
Waters (2007) where it obtained a high internal consistency reliability
with Cronbach’s alpha equal to 0.98. The typing test involved three
trials each of one minute duration. Each trial involved typing a specified
passage of text. The passages of text were taken from Wikipedia and
were selected because they used common English words and standard
punctuation. Participants were instructed that their score would be the
number of words that they typed in one minute, minus the number of
words they typed incorrectly. There were no explicit constraints on what
the participants typed and no feedback was provided as to whether they
had typed the text correctly or incorrectly. After the passage of one
minute the trial automatically ended. Participants pressed Enter to
start the next trial. The program was implemented in Inquisit version
3.2. See Appendix B.1.2 for further details.
150
CHAPTER 3. INDIVIDUAL DIFFERENCES
An algorithm was devised to check the accuracy of the text entered
by participants. The final score was the number of characters typed multiplied by the accuracy percentage. The metric was converted from characters per minutes to words per minute using the standard conversion of
five characters, including spaces, representing one word. A participant’s
final score was the mean of the three trials.
The task has been used in several additional studies unrelated to
this thesis and has shown to have high internal consistency reliability in
university samples (i.e., Cronbach’s alpha above 0.90, Anglim & Waters,
2007). The improvements over the test used in Study 1 were: (a) the
ability to estimate reliability, (b) a longer period of measurement, (c)
increased logging of participant responses, (d) more precise control over
trial duration, (e) removal of the confusion that occasionally resulted
when participants typed the wrong letter of a word.
Ability Tests
Study 1 included nine different ability tests, two psychomotor tests were
updated in Study 2. Table 3.1 summarises these tests. Selected tests
corresponded to appropriate points on Ackerman’s (1988) multidimensional scaling of 31 reference tests based on data from Allison (1960).
Consistent with Ackerman’s (1988) terminology, the nine tests were classified into one of three ability classes: (a) general cognitive ability, (b)
perceptual speed ability, or (c) psychomotor ability. Factor analysis in
Armstrong and Langan-Fox (2000) supported the three factor structure.
Each test was scored as set out in Table 3.1. A measure for each of the
three ability classes was obtained by converting constituent tests into
z-scores and summing the z-scores. This sum was then converted to a
z-score. For purposes of mediational analyses, an overall ability measure
was formed by summing the z-scores of the three ability classes.
Inference
Test
Cube
Comparison
Test
IT
CC
Ekstrom et General
al. (1976)
Ekstrom et General
al. (1976)
Ekstrom et General
al. (1976)
Extended
Range Vocabulary
Test
ERV
Ability
Measured
Reference
Abbreviation Name
Two parts; 24 items
per part; 6 minutes
per part; choose which
of five words is most
similar in meaning to
a target word
Two parts; 10 items
per part; 6 minutes
per part;
choose
which of five statement repersenta a
logical inference
Two parts; 21 items
per part; timed test
with 3 minutes per
part; choose whether
two cubes are the
same or different
Description
S1a
X
X
X
S2a
Table continues on next page
Percentage of total X
possible correct after
subtracting one mark
for each item incorrect
Percentage correct af- X
ter subtracting a quarter mark for each item
incorrect
Percentage correct af- X
ter subtracting a quarter mark for each item
incorrect
Scoring
Table 3.1: Summary of Ability Tests Used
3.2. METHOD
151
Clerical
Speed and
Accuracy
Simple
Reaction
Time
2-Choice
Reaction
Time
CS
RT1
RT2
NS
Description
Two parts; 90 seconds per part; indicate
pairs of numbers that
are different
Ekstrom et Perceptual Two parts; 90 secal. (1976)
Speed
onds per part; indicate
which of five numbers
is largest (part 1) or
smallest (part 2)
Ekstrom et Perceptual Two parts 3 minutes
al. (1976)
Speed
per part; read a manual to find a marked
item and mark the
marked item in the answer sheet
Anglim
Psychomotor 40 trials; respond by
(2000a)
pressing F5 to target
as quickly as possible
Anglim
Psychomotor 40 trials; respond by
(2000a)
pressing F5 of F6 corresponding to two response options
Ekstrom et Perceptual
al. (1976)
Speed
Number
Comparison
Test
Number
Sort
NC
Ability
Measured
Reference
Abbreviation Name
S1a
X
X
X
X
S2a
Table continues on next page
Mean reaction time
for correct trials excluding outlier trials
Mean reaction time
for correct trials excluding outlier trials
Total count of items X
correctly
marked
minus
incorrectly
marked
Total count of items X
correctly
marked
minus
incorrectly
marked
Total count of items X
correctly
marked
minus
incorrectly
marked
Scoring
152
CHAPTER 3. INDIVIDUAL DIFFERENCES
a
Anglim
(2000a)
4-Choice
Reaction
Time
Simple
Reaction
Time
Version 2
4-Choice
Reaction
Time
Version 2
RT4
RT1V2
Description
Mean reaction time
for correct trials excluding outlier trials
Mean reaction time
for correct trials excluding outlier trials
Scoring
Psychomotor 40 trials; respond by Mean reaction time
pressing F5, F6, F7, for correct trials exor F8 corresponding to cluding outlier trials
four response options
Psychomotor 40 trials; respond by
pressing F5, F6, F7,
or F8 corresponding to
four response options
Psychomotor 40 trials; respond by
pressing F5 to target
as quickly as possible
Ability
Measured
X indicates that the test was included in the study.
RT4V2
Reference
Abbreviation Name
X
S1a
X
X
S2a
3.2. METHOD
153
154
CHAPTER 3. INDIVIDUAL DIFFERENCES
Personality
IPIP Personality Test The International Personality Item Pool
(IPIP ) is a self-report measure that includes a measure of the Big 5 factors of extraversion, conscientiousness, neuroticism, agreeableness, and
openness/intellectance (L. R. Goldberg et al., 2006). Study 2 used a
100 item version, and Study 3 used a 50 item version. Each factor had
an equal number of items, and each scale had a similar numbers of positively and negatively worded items. Both versions have shown good
internal consistency reliability (L. R. Goldberg, 1999). Each item asked
the participant to indicate the degree to which the given statement described himself or herself on a 5-point scale from 1 = very inaccurate, to
5 = very accurate. Scale scores were computed as the mean score after
reversing negatively worded items, where a reversed item was 6−x, and x
is the item score. For details of the instructions given see Appendix B.1.1.
For a complete list of items and assigned factors, see Appendix B.1.1 for
the 100 item version, and Appendix C.1.2 for the 50 item version.
3.2.3
Criterion Measures
Task completion time and strategy sophistication were all calculated as
the mean level over the period of practice in the study. In Study 3, the
mean was based only on the first 15 blocks in order to use a consistent
and complete sample.
3.2.4
Analysis Plan
Correlations were used to examine the strength of relationship between
predictor and criterion measures. Several separate mediational models
were conducted to test whether the effect of several predictors on task
performance was mediated by strategy sophistication. Separate models were conducted for typing speed, prior experience, and a composite
3.3. STUDY 1 RESULTS
155
ability measure. The Sobel Test (Sobel, 1982) was used to evaluate the
significance of the indirect effect.
3.3
3.3.1
Study 1 Results
Preliminary Analyses
Split-half reliability estimates of general ability tests (Inference = 0.78;
Vocabulary = 0.62; Cube Comparison = 0.65) and perceptual speed tests
(Number Comparison = 0.62; Number Sort = 0.63; Clerical Speed =
0.89) varied between reasonable and very good. An initial factor analysis was performed on the nine ability tests to assess whether it was
appropriate to combine the tests into three ability classes. Psychomotor
tests were reversed such that higher scores meant more of the ability. A
maximum likelihood factor analysis with Promax rotation was performed
extracting three factors. Factor loadings and correlations between factors
are shown in Table 3.2. The first three factors accounted for 38.1, 15.7,
and 9.0 percent of variance respectively. Given the reasonable factor analytic support and the theoretical rationale, the three ability composites
were created as set out in the Method.
The ability tests showed reasonable consistency with the proposed
factor structure with three main deviations. First, the cube comparison test did not correlate highly with the other cognitive measures, and
showed some cross-loadings with perceptual speed tests. Possibly, the
requirements for spatial rotation and speeded completion were distinct
from the Inference and Vocabulary tests with their associated greater requirement for verbal knowledge and reasoning ability. Second, 2–Choice
and 4–Choice RT were more closely related to each other than they were
with Simple RT. Third, Number Sort and Number Comparison were more
correlated with each other than they were with Clerical Speed and Accu-
156
CHAPTER 3. INDIVIDUAL DIFFERENCES
racy. These latter two points are consistent with similarities in information processing requirements of the tests. Despite these deviations from
a clean factor structure, the three ability measures were still constructed
as the z-score of the sum of z-scores of constituent tests. Although not
reported, multidimensional scaling and hierarchical cluster analyses were
also performed on the reversed correlation matrices between ability tests
and told a similar story as the factor analysis.
Table 3.2: Study 1 Factor Loadings and Intercorrelations among Ability
Tests
Ability test
Ia
IIb
IIIc
Inference
.08 −.03
.99
Vocabulary
−.16
.03
.61
Cube Comparison
.09
.18
.29
Number Comparison
−.07
.87 −.01
Number Sort
−.07
.75
.09
Clerical Speed and Accuracy
.20
.65 −.18
Simple RT
.77 −.05
.15
2-Choice RT
.95 −.02 −.02
4-choice RT
.88
.01
.02
Factor intercorrelations
I. Psychomotor
II. Perceptual Speed
.45
III. General
.22
.44
1.
2.
3.
4.
5.
6.
7.
8.
9.
Note. Loadings > |.30| are shown in bold.
a
Psychomotor Ability; b Perceptual Speed Ability;
3.3.2
c
General Ability.
Hypothesis Testing
Table 3.3 shows descriptive statistics and intercorrelations for predictor
and criterion variables. Typing speed (supporting Hypothesis 3.1) and
prior experience (supporting Hypothesis 3.2) were both strong predictors
of task completion time. Moderate to strong prediction was obtained
between the three ability tests and task performance, broadly supporting
Hypothesis 3.3. The trend in the data was consistent with expectations
3.3. STUDY 1 RESULTS
157
that younger adults and males would have faster task completion times,
although the correlations were not statistically significant.
Task RT (s)
Strategy Sophistication (0 - 1)
Malea
Age (years)
Typing Speed (wpm)
Prior Experience (0 - 100)
General Ability (z-score)
Perceptual Speed Ability (z-score)
Pscyhomotor Ability (z-score)
a
Note. N = 63; r > |0.25| is significant at α = .05.
Female = 0 and Male = 1.
1.
2.
3.
4.
5.
6.
7.
8.
9.
Variable
SD
1
2
3
4
5
6
7
8
9
32.81 8.71
—
0.22 0.25 −.47
—
0.30 0.46 −.18
.04
—
23.05 4.43
.16 −.16
.30
—
33.30 10.65 −.53
.14 −.19 −.07 —
66.08 12.78 −.53
.21
.08 −.01 .49 —
0.28 0.98 −.29
.11 −.06 −.01 .23 .06 —
0.24 0.89 −.41 −.04
.10 −.10 .22 .02 .26 —
0.30 0.73 −.50
.36
.28 −.31 .13 .11 .04 .31 —
M
Table 3.3: Study 1 Descriptive Statistics and Intercorrelations for Task Performance, Demographics, Ability, and Prior Experience Variables
158
CHAPTER 3. INDIVIDUAL DIFFERENCES
3.4. STUDY 2 RESULTS
159
Three mediation analyses were performed assessing whether the effects of ability, typing speed, and prior experience, respectively, on
task performance were mediated by strategy sophistication. The standardised indirect effects were all non-significant: ability (indirect effect
= −0.905, z = −1.50, ns), typing speed (indirect effect = −0.0458, z =
−1.05, ns), prior experience (indirect effect = −0.054, z = −1.53, ns).
Strategy sophistication was strongly related to task performance. Psychomotor ability was the only statistically significant predictor of strategy
sophistication, although prior experience was approaching significance.
Failing to support Hypothesis 3.5, results were broadly consistent with
the idea that strategy sophistication and various forms of aptitude provide independent contributions to performance. Results also indicate
that ability and prior experience predict task performance better than
strategy sophistication.
3.4
3.4.1
Study 2 Results
Preliminary Analyses
Split-half reliabilities for the five ability tests were as follows: Inference
= 0.69; Vocabulary = 0.49; Cube Comparison = 0.61; Number Comparison = 0.77; Number Sort = 0.36. Cronbach’s alpha reliability for the five
personality factors were all reasonably good: extraversion = 0.93; agreeableness = 0.84; conscientiousness = 0.88; emotional stability = 0.93;
openness = 0.91. Typing speed had a Cronbach’s alpha of 0.96.
3.4.2
Hypothesis Testing
Table 3.4 shows descriptive statistics and correlations for predictors and
criterion measures. Typing speed (supporting Hypothesis 3.1) and prior
experience (supporting Hypothesis 3.2) were both strong predictors of
160
CHAPTER 3. INDIVIDUAL DIFFERENCES
task performance. Moderate to strong predictions were obtained between the three ability tests and task performance, broadly supporting
Hypothesis 3.3. Males performed better on the task, although no relationship was observed with age, perhaps reflecting the minimal variation
in age in the sample.
Consistent with Hypothesis 3.4 there was minimal evidence of a correlation between personality and task performance. Openness was the
one exception to this. One interpretation of this significant correlation is
that it is a Type I error resulting partially from running five significance
tests. Another interpretation is that the IPIP measure of openness is
related to self-perceived intelligence. This latter interpretation is supported by the relatively strong correlation between openness and general
ability.
1.
2.
3.
4.
5.
6.
7.
8.
9.
10.
11.
12.
13.
14.
7.63
0.59
0.26
21.14
53.40
3.02
0.11
0.05
0.04
3.43
3.96
3.42
3.09
3.68
M
1.76
0.26
0.44
1.60
15.33
0.96
0.97
1.02
1.06
0.64
0.41
0.55
0.71
0.54
SD
—
−.18
−.32
−.02
−.46
−.51
−.45
−.34
−.24
.08
−.03
−.05
−.13
−.20
1
—
.05
.09
.25
.14
.17
.17
.28
.07
.21
.04
.14
.06
2
—
.17
−.08
.16
.18
.04
.16
−.03
−.09
−.16
.20
.19
3
—
−.03
.12
.05
−.17
−.10
−.02
.09
.05
.07
.12
4
—
.52
.29
.22
.12
.09
.07
.13
.03
.15
5
—
.15
.11
.09
.00
−.05
−.05
.14
.19
6
—
.15
.28
.03
.16
−.05
.02
.49
7
—
.11
.00
.12
.11
−.03
−.01
8
—
−.18
−.05
−.16
.11
−.04
9
—
.43
.24
.23
.43
10
—
.20
.06
.49
11
—
.02
.18
12
—
.18
13
—
14
Note. Correlations were calculated based on pairwise deletion of missing data. Personality variables are variables 10 to 14.
Correlations not involving a personality variable had N = 154; if r > |0.16|, then p < .05. Correlations involving a personality
variable had N = 131; if r > |0.18|, then p < .05.
b
The variable ‘Male’ is coded such that 1 is Male and 0 is Female.
Task RT (s)
Strategy Sophistication (0 - 1)
Malea
Age (years)
Typing Speed (wpm)
Prior Experience (0 - 100)
General Ability (z-score)
Perceptual Speed Ability (z-score)
Psychomotor Ability (z-score)
Extraversion (1 - 5)
Agreeableness (1 - 5)
Conscientiousness (1 - 5)
Emotional Stability (1 - 5)
Openess (1 - 5)
Variable
Table 3.4: Study 2 Descriptive Statistics and Intercorrelations for Task Performance, Demographic, Ability, and Prior Experience Variables
3.4. STUDY 2 RESULTS
161
162
CHAPTER 3. INDIVIDUAL DIFFERENCES
Predictors of psychomotor ability, typing speed, and surprisingly,
agreeableness, correlated with strategy sophistication. Strategy sophistication correlated to a small to moderate extent with task performance.
Three mediation analyses were performed assessing whether the effect of
ability, typing speed, and prior experience, respectively, on task performance were mediated by strategy sophistication. The standardised indirect effects were all non-significant: ability (indirect effect = −0.0100, z =
−0.220, ns), typing speed (indirect effect = −0.002, z = −0.875, ns), and
prior experience (indirect effect = −0.0288, z = −1.15, ns). Thus, Hypothesis 3.5 was not supported.
3.5
3.5.1
Study 3 Results
Preliminary Analyses
Cronbach’s Alpha reliabilities for the personality scales were good: extraversion = 0.92, agreeableness = 0.78, conscientiousness = 0.85, emotional stability = 0.9, openness = 0.84. Typing speed had a Cronbach’s
alpha of 0.96.
3.5.2
Hypothesis Testing
Table 3.5 shows descriptive statistics and intercorrelations between predictor and criterion measures. Although correlations were slightly less
than expected, typing speed (supporting Hypothesis 3.1) and prior experience (supporting Hypothesis 3.2) were both moderately strong predictors of task performance. Consistent with expectations, males and
younger adults performed better on the task. Consistent with Hypothesis 3.4, there was minimal evidence of a correlation between personality
and task performance. Surprisingly conscientiousness had a small negative correlation with task performance.
1.
2.
3.
4.
5.
6.
7.
8.
9.
10.
11.
M
SD
1
2
3
5
6
7
8
9
10
11
—
−.20
—
−.11
.35
—
−.14 −.05 −.08
—
−.00 −.02 −.04
.17 —
.17 −.03 −.23 −.01 .07 —
.18
.13
.16
.26 .09 .19 —
.04
.04
.16
.02 .32 .11 .07 —
4
Note. Correlations were calculated based on pairwise deletion of missing data. Personality variables are variables 7 to 11.
Correlations not involving a personality variable had N = 149; if r > |0.17|, then p < .05. Correlations involving a personality
variable had N = 104; if r > |0.20|, then p < .05.
a
The variable ‘Male’ is coded such that 1 is Male and 0 is Female.
Task RT (s)
7.51 1.94
—
Strategy Sophistication (0 - 1) 0.60 0.30 −.34
—
a
Male
0.29 0.45 −.37
.27
—
Age (years)
22.46 5.05
.31
.06
.05
Typing Speed (wpm)
51.73 15.82 −.41
.29
.06
Prior Experience (0 - 100)
46.37 12.28 −.46
.42
.44
Extraversion (1 - 5)
3.32 0.92
.04 −.07 −.04
Agreeableness (1 - 5)
4.19 0.49
.03 −.04 −.17
Conscientiousness (1 - 5)
3.48 0.74
.23 −.14 −.07
Emotional Stability (1 - 5)
2.97 0.88 −.16
.10
.19
Openness (1 - 5)
3.81 0.65 −.09
.23
.16
Variable
Table 3.5: Study 3 Descriptive Statistics and Intercorrelations for Task Performance, Demographic, Prior Experience, and
Personality Variables
3.5. STUDY 3 RESULTS
163
164
CHAPTER 3. INDIVIDUAL DIFFERENCES
Two mediation analyses were performed assessing whether the effects
of typing speed and prior experience, respectively, on task performance
were mediated by strategy sophistication. Typing speed was a moderate
predictor and prior experience was a strong predictor of strategy sophistication. Strategy sophistication correlated to a small to moderate extent
with task performance. The standardised indirect effects were all nonsignificant: typing speed (indirect effect = −0.00793, z = −2.30, p < .05),
prior experience (indirect effect = −0.0110, z = −1.93, ns). Thus, Hypothesis 3.5 was not supported.
3.6
3.6.1
Discussion
Overview
This chapter examined the relationship between predictor and criterion
measures on the text editing task. Overall, with the exception of the mediational hypothesis, hypotheses and expectations were supported. The
discussion is organised in terms of predictors of task performance, and
the mediational model of predictors, strategy sophistication, and performance.
3.6.2
Performance
Demographics
The general pattern of results were consistent with expectations. Males
and younger adults generally performed better at the text editing task.
Gender differences were observed in all three studies, albeit only as
a trend towards significance in Study 1. In Study 3, males had greater
prior experience, a strong predictor of performance, yet in the other two
studies, gender showed minimal correlations with other predictors. It
3.6. DISCUSSION
165
may be that males were more competitive, or had greater prior experience
with computers, computer games, and text editing. Study 3 provided
some support for attributing differences to known relevant sources of
prior experience. However, in Studies 1 and 2 other possible predictors
such as typing speed did not correlate with gender.
The negative correlation between age and performance was small. It
emerged as a trend in Study 1 and a small significant effect in Study 3.
Study 2 lacked meaningful variation in age to test the relationship.
Equally, the amount of variation in age was also fairly low in Studies
1 an 3. In broad terms the findings are consistent with both cognitive
aging and cohort effects related to generational increases in use of computers. However, given the relatively small variance in age across the
studies and the convenience sampling, the interpretation of the observed
age related differences should be treated with caution.
Ability
Ability was a moderate to strong predictor of task performance. In
Study 1 psychomotor ability had the strongest correlation followed by
perceptual speed and then general ability. In Study 2 the order of the
size of correlations was reversed. Given the large sample size in Study 2
it should be given more weight. Random sampling is one plausible explanation for the differences between the results. It is also possible that
abilities and prior experience may have differed between Study 1 with its
community sample and Study 2 with its third year university sample.
It is also worth considering the degree to which the ability measures
chosen provide a basis to generalise to other potential ability measures.
Other researchers might have preferred that other particular ability measures were employed such as an operation-span test to measure multitasking or a working memory measure. The three classes of ability tests
166
CHAPTER 3. INDIVIDUAL DIFFERENCES
were chosen in order to mirror the framework often adopted by Phillip
Ackerman. The list of possible ability tests is quite large. Expanding
the pool of tests would have been empirically interesting, but the multiplicity of predictors can create challenges to interpretation. It can also
be argued that interpretation of the cognitive correlates approach should
be treated with caution. Just as correlation is not necessarily causation, the relative size of a correlation does not necessarily correspond to
relative size of causation. Correlation with external variables, degrees
of test validity and reliability, and the general intercorrelated nature of
ability testing should temper causal interpretation of ability-performance
correlations.
Prior Experience
Prior experience was a strong predictor of task performance in all three
studies both when it was operationalised as typing speed and as selfreported experience. This is consistent with cognitive correlates, transfer,
and correlation specificity perspectives.
Personality
In both Studies 2 and 3 where personality was measured, self-reported
measures of personality rarely predicted task performance. A small positive correlation with openness in Study 2 and a small negative correlation
with conscientiousness in Study 3 with task performance was observed.
It seems plausible that when openness measures include items related
to self-perceived intelligence, that correlations could emerge. However,
given that ten significant tests were performed and the lack of an a priori
reason to expect such correlations, a reasonable interpretation is that
population correlations are close to zero.
In summary findings are consistent with personality being unrelated
3.6. DISCUSSION
167
to task performance on an experimental text editing task. They are also
consistent with the notion that performance on the text editing task is a
skill influenced more by ‘can do’ than by ‘will do’ factors. In particular,
conscientiousness, which has been found by meta-analyses to be related
to workplace performance was unrelated to task performance. A related
interpretation is that the correlation is particularly small, perhaps less
than r = .10. Even if this were the case, personality is arguably of
minimal importance.
Skill acquisition experiments may better relate to what Paul Sackett
(P. Sackett, Zedeck, & Fogli, 1988) calls ‘maximal performance’ rather
than ‘typical performance’. Maximal performance is believed to occur
over short periods of time where performance is being monitored. Skill
acquisition experiments are highly monitored environments where participants are asked, and may often feel obliged, to apply maximal effort.
3.6.3
Mediational Model
It was hypothesised that the effect of ability, prior experience, and typing
speed on task performance would be mediated by strategy sophistication.
In all three studies, and for all three predictor variables, the mediation
model was not supported. Prior experience was a reasonable predictor
of strategy sophistication particularly in Study 3. Likewise typing speed
showed some mixed evidence of a relationship with strategy sophistication. Strategy sophistication also correlated with task performance, although the correlation was relatively small given the potential efficiency
gains that can result from the more sophisticated strategy.
The relationship between individual differences in strategy sophistication and performance in Study 2 was less than in Study 1. While random
sampling might explain some of this difference, another intriguing possibility is that increased instruction of sophisticated strategies led to a
168
CHAPTER 3. INDIVIDUAL DIFFERENCES
drop in the observed correlation. In Studies 2 and 3 more participants
may have been using the sophisticated strategies for the first time. In
contrast, participants in Study 1 received minimal instruction to use the
sophisticated keys. As such, use of the keys may have been more of a
proxy for prior experience in text editing general.
The correlation between prior experience and strategy sophistication
is consistent with the idea that using good strategies is a relatively domain specific adaptation that may often evolve slowly over time in natural
environments.
Schunn and Reder (2001) proposed a theory that suggested that ability would correlate with adaptive strategy selection. In some respects,
strategy sophistication captures the concept of adaptive strategy selection on the text editing task. If so, present results provide only mixed
support for Schunn and Reder’s theory. Psychomotor ability was the better predictor of strategy sophistication across the two studies that measured ability, although some small correlations were observed in Study 2
with general cognitive ability. It seems that factors related to prior experience, including prior experience itself, typing speed, and possible proxy
measures, such as gender, are better predictors of strategy adaptivity.
In this framework, strategy adaptivity is a domain specific adaptation,
where in adult samples, prior experience is likely to overwhelm effects of
more general adaptive abilities.
3.6.4
Conclusion
In broad terms the findings are consistent with several general principles
regarding predictor–criterion relationships. First, the cognitive correlates
perspective was supported with overlap between prior experience, typing
speed, and text editing reflected in observed correlations. Second, the
fact that prior experience and typing speed were generally stronger pre-
3.6. DISCUSSION
169
dictors than ability is consistent with the specificity perspective. Third,
the lack of prediction by personality is consistent with the idea that task
performance is about skill and not self-reported dispositions.
However, caution should still be applied when applying a causal interpretation to the cognitive correlates perspective. Theory and the observed results suggest several alternative interpretations of observed correlations. In particular, correlations may be inflated due to the nature
of the sample or a third variable. In particular more plausible measures
of past experience and skill may correlate with ability based measures in
surprising ways.
170
CHAPTER 3. INDIVIDUAL DIFFERENCES
Chapter 4
Instructed Strategy Shift
Contents
4.1
4.2
4.3
4.4
Introduction
. . . . . . . . . . . . . . . . . . . 172
4.1.1
Overview . . . . . . . . . . . . . . . . . . . . 172
4.1.2
Strategy . . . . . . . . . . . . . . . . . . . . . 173
4.1.3
Performance
4.1.4
The Current Study . . . . . . . . . . . . . . . 177
. . . . . . . . . . . . . . . . . . 176
Method . . . . . . . . . . . . . . . . . . . . . . 177
4.2.1
Design . . . . . . . . . . . . . . . . . . . . . . 177
4.2.2
Participants . . . . . . . . . . . . . . . . . . . 178
4.2.3
Mid-Practice Condition . . . . . . . . . . . . 178
Results . . . . . . . . . . . . . . . . . . . . . . . 180
4.3.1
Practice and Strategy . . . . . . . . . . . . . 180
4.3.2
Practice and Performance . . . . . . . . . . . 185
Discussion . . . . . . . . . . . . . . . . . . . . . 192
4.4.1
Overview . . . . . . . . . . . . . . . . . . . . 192
4.4.2
Strategy . . . . . . . . . . . . . . . . . . . . . 192
4.4.3
Performance
. . . . . . . . . . . . . . . . . . 195
171
172
4.1
4.1.1
CHAPTER 4. INSTRUCTED STRATEGY SHIFT
4.4.4
Comparison with Previous Research . . . . . 197
4.4.5
Conclusion . . . . . . . . . . . . . . . . . . . 200
Introduction
Overview
Results presented in Chapter 2, showed that under self-directed conditions, individuals often initiate shifts from simple to sophisticated strategies. Such shifts are (a) often gradual but can be abrupt, (b) more likely
to occur early in practice than later, and (c) rarely lead to observable
discontinuities in performance. However, research suggests that different
dynamics operate when strategies are introduced by external forces, such
as instruction (e.g., Delaney et al., 1998; Yechiam et al., 2004). In such
situations, following external instruction, strategy shifts may be more
abrupt, and disruptions to task performance may occur. Such externally
introduced strategies may result in what may appear as a new learning
curve (e.g., Delaney et al., 1998).
This chapter addresses Aim 3 of the thesis. It examines the effect
of instructed strategy shifts on task completion time and contrasts this
with self-initiated strategy shifts. It examines both abruptness of strategy shifts and effects on task completion time following the introduction
of a new strategy after a period of practice. It examines both the immediate effect on task completion time and the effect following subsequent
practice. The literature review that follows is organised around the above
two topics first focusing on strategy use, and then discussing effects on
performance of instructed strategy shift.
4.1. INTRODUCTION
4.1.2
173
Strategy
As discussed in Chapter 2 most models of strategy use and performance
(e.g., Lovett, 2005; Lovett & Anderson, 1996; J. W. Payne et al., 1988;
Schunn et al., 1997; Siegler & Shipley, 1995) share qualitatively similar
features. In order for a strategy to be used, an individual must either
explicitly or implicitly know both of its existence and how to use it. Individuals are more likely to use a strategy if they implicitly or explicitly
believe that the strategy is superior to an alternative. Thus, strategy
change is often assumed to be caused by a corresponding change to information about strategy existence, execution, or relative effectiveness.
This is typically assumed to be stochastic, operating at the level of probabilities of strategy use. Information regarding strategy use can come
from many sources including both internal sources, through interaction
with the task environment and through external instruction.
In this chapter a distinction is made between self-initiated and instructed strategy shifts. The literature review focuses on how instructed
strategy shifts operate, and how they may have different strategy use
and performance dynamics. With regards to instructed strategy shifts,
various distinctions can be made. Important types of information are (a)
strategy existence, (b) how to execute a strategy, and (c) why to execute
a strategy (i.e., the relative effectiveness of the strategy). The instructor
can also vary the format of the instructions to be a command, request,
suggestion, or provide information only.
An important literature for understanding the process of strategy shift
and instruction is the concept of training needs. The first step before
implementing a training program often involves implementing a training
needs analysis (e.g., Moore & Dutton, 1978; Rossett, 1987). Training
needs analysis is grounded in rational principles of utility maximisation.
Training needs analysis generally recommends training for an individual
174
CHAPTER 4. INSTRUCTED STRATEGY SHIFT
where: (a) the individual does not currently have the skills taught, (b)
the individual is in a zone of development where they would be able to
learn the skills taught, (c) the skills taught are superior to any alternative
that the learner currently uses to achieve a relevant goal, (d) the learner
will have the opportunity to apply the skills taught, and (e) weighing the
costs of training in terms of time spent engaged in off task behaviour,
and in reduced effectiveness, and all the benefits of training in terms of
improved effectiveness in the future, the training provides a rational net
benefit. Such a perspective forms a backdrop for strategy instruction
decisions. These principles of training needs analysis have implications
for the expected effect of introducing instruction on strategy use and
performance.
It seems likely that self-initiated strategy shifts will lead to greater
improvement in performance. First, relatively automatic associative processes are likely to pull participants towards strategies that are effective in
the short term. Second, understanding that a more sophisticated strategy
exists may indicate a readiness to perform the strategy. Third, initiation
and persistence with a more sophisticated strategy is more likely when
the individual experiences the strategy as adaptive in the short term. In
contrast, instructed-strategy shift is likely to lead individuals who may
not be ready in the short term to adopt the strategy.
It should be noted that when participants are explicitly told to use a
particular strategy, they generally will use it (e.g., last study reported in,
Delaney et al., 1998). Delaney et al’s (1998) final study involved teaching
a single participant how to be an expert in mental calculation. When the
participant was first taught and told to use a more sophisticated strategy,
they temporarily dropped in performance before eventually improving
to levels superior to before the introduction of the more sophisticated
strategy.
When developing a theory of the effect of instructed versus self-
4.1. INTRODUCTION
175
initiated strategy shift, it is useful to consider the differences between implicit and explicit strategy use and strategy change. Several researchers
have suggested that when strategies enter explicitly they are more likely
to be abrupt (for a discussion, see Siegler & Stern, 1998). Instructed
strategy shift almost by definition has to enter explicitly. Thus, at a
basic level it is hypothesised that:
Hypothesis 4.1 The combination of a request to use a strategy and information on how to use a strategy increases strategy use.
It is also interesting to consider how features of a strategy influence the
effect of instruction on subsequent strategy use. In the case of algorithm–
retrieval strategy shifts, a learner can not immediately shift to retrieval
by mere instruction. Research has shown that explicit instruction can
increase the degree and speed of retrieval (Ackerman & Woltz, 1994),
but such an increase is predicated on having an accessible representation
in memory. With simple-sophisticated strategies it is generally possible
to perform the sophisticated strategy following instruction, but performance may decline while the more sophisticated strategy is being learnt.
In other cases, the sophisticated strategy may be too complex for the
individual to perform immediately and may require additional off task
behaviour to understand. Yechiam et al. (2004) provides and example
of where using a more sophisticated scripting strategy required reading
a manual and going through a process of trial and error in order to use
the scripting strategy effectively.
In summary, research suggests that instruction can change awareness
and motivation to implement a new strategy. Such instruction may often
lead to strategy selection operating explicitly. It may also override normal
strategy selection mechanisms that emphasise effectiveness of a strategy
in the short term.
176
4.1.3
CHAPTER 4. INSTRUCTED STRATEGY SHIFT
Performance
The previous chapters in this thesis showed that when individuals selfinitiate a strategy shift, there is typically minimal change to performance.
One explanation for this is that strategy selection is relatively adaptive.
Individuals are inclined to shift when the short term rewards suggest that
the strategy is effective. Thus, individuals who shift are more likely to
shift when they are ready. It may also be that being aware of a strategy
is related to a capacity to perform it at an adequate level. It is also
possible that individuals already know the strategy, and the observed
shift in strategy use is simply a change in perceptions about what is
appropriate in the circumstances.
Delaney et al. (1998) proposed the idea that learning curves should
be considered as learning curves within strategies. The final study in
Delaney et al. (1998) was a case study that examined performance on
complex multiplication. Phase one involved one strategy and phase two
introduced a new strategy that was designed to be superior to the first.
Several important findings emerged. First, performance on the first strategy improved in accordance with something approximating a power law.
Second, performance on the second strategy involved an initial decline
relative to the first strategy. Third, performance on the second strategy
showed a pattern consistent with the commencement of a new learning
curve.
Thus, learning a new strategy is like learning a new skill. In the case
of strategies introduced by others, it is likely that the learner will be
less prepared to use the strategy than those that naturally shift to the
strategy. The requirement to use the new strategy can override normal
strategy selection rules which limit the selection of slower strategies.
The amount of performance decline and the potential for a strategy
to ultimately surpass performance of the initial strategy are likely to
4.2. METHOD
177
be specific to aspects of individual, task, and context. Nonetheless, the
following hypotheses were formed as applicable to the transition from
simple to sophisticated strategies on the text editing task and also to
many other tasks permitting a simple–sophisticated strategy shift.
Hypothesis 4.2 When sophisticated strategies are externally introduced
later in practice, performance declines immediately after introduction.
Hypothesis 4.3 When sophisticated strategies are externally introduced
later in practice, performance eventually surpasses the level that would
have been attained had instruction not been received.
4.1.4
The Current Study
This chapter reports additional results from Study 3. In addition to the
group that practiced text editing without interruption, a second group
was given additional instruction and training half way through practice
on the text editing task. Chapter 2 only presented results for the No
Training group. This chapter examines the differences in strategy use and
performance between the Training and No Training groups over practice
following the introduction of additional training.
4.2
Method
4.2.1
Design
The study used a 3 × 30 mixed design. Participants were randomly allocated to one of three conditions (Training, No Training, and Control). All
participants completed 30 blocks of practice on the text editing task. After completing 15 blocks the experience of participants differed based on
condition. Participants in the Training condition were given additional
178
CHAPTER 4. INSTRUCTED STRATEGY SHIFT
instruction on sophisticated text editing strategies, as well as explicit instructions to use these strategies on subsequent trials. Participants in
the Control condition completed a four-choice reaction time task instead
of receiving mid-practice training. Participants in the No Training condition proceeded to block 16 immediately after block 15. More details
on the procedure was presented previously in Section 2.2.5.
4.2.2
Participants
Seventy six (49.4%) participants completed the Training condition, 17
(11.0%) completed the Control condition, and 61 (39.6%) completed the
No Training condition. The sample size for the Control condition was deliberately smaller than the other two groups. The purpose of the Control
condition was to ensure that the break alone was not sufficient to change
strategy use or performance. The important research questions related
to differences between the Training condition and the No Training condition as these conditions assessed most clearly the differences in the effect
of practice and strategy shift on the learning curve under instructed and
non-instructed settings.
4.2.3
Mid-Practice Condition
Training Condition
Participants in the Training condition following block 15 were given additional instructions on additional text editing strategies followed by an
opportunity to practice these strategies. This condition was included in
order to examine strategy use and performance following explicit instructions to use particular strategies in contrast to self-discovered strategy
shifts.
Training condition instructions were presented on the computer over
five screens. To maximise the chance that participants actually read
4.2. METHOD
179
the instructions a specified time was required to pass before participants
were permitted to advance through the instruction screens. The verbatim text of the instructions is presented in Appendix C.2.3. In brief the
instructions (a) outlined what the training involved; (b) presented theory
on text editing, describing text editing in terms of movement, selection,
and manipulation, and encouraging participants to consider long term
efficiency that results from reducing the number of key presses required
to perform a given editing task; (c) outlined strategies to speed up movement; (d) outlined strategies to speed up deletion; and (e) explained
Training condition trials.
Participants in the Training condition completed six training trials.
These were further composed into three sub-trials where participants
were asked to (a) attempt to minimise key presses, (b) follow a specific
strategy displayed on the screen, and (c) repeat the previously displayed
sequence without assistance. Each of the six training trials involved
different text. At the end of each sub-trial, participants were given feedback on the number of key presses they made. If they completed the
task in the minimum number of key presses, they were also given the
feedback “Well Done”. Otherwise they were given feedback, “Keep trying to reduce your key presses”. At the end of the six trials, participants
were presented with the following message: “The additional training has
ended. For remaining trials, try to use the strategies used in the training
trials.”
Control Condition
The Control condition consisted of a filler task after block 15 of the text
editing task. Participants were given the following message:
You are now having a break from the main text editing task
in order to restore your energy. The following task is a four-
180
CHAPTER 4. INSTRUCTED STRATEGY SHIFT
choice reaction time task. Place your left middle and index
fingers on the D and F and your right index and middle fingers
on J and K. Press the appropriate key in response to the red
light as quickly as possible. Press Enter To Continue.
The reaction time task went for two minutes and was not used for measurement purposes. The condition was included for comparison purposes
with the Training condition. It was designed to help attribute changes
in strategy use and performance in the Training condition to the instructions as opposed to the break from task performance.
No Training Condition
The No Training condition involved no break from the text editing task.
The condition was included in order to explore how strategies and performance change with practice in the absence of explicit instruction. In
this sense, it is similar to the design of Studies 1 and 2.
4.3
4.3.1
Results
Practice and Strategy
Group-level
Figure 4.1 shows the relationship between practice and strategy sophistication for each condition at the group-level. The No Training and
Control conditions showed decelerating functional relationships seemingly approaching an asymptote. In contrast, the Training condition had
an abrupt increase in strategy sophistication following the mid-practice
training. Supporting Hypothesis 4.1, between Blocks 15 and 16 the Training condition increased by 11.2 percentage points, whereas the other
conditions showed little change (No Training changed by -0.2 percentage
4.3. RESULTS
181
points; Control changed by -1.3 percentage points). This abrupt shift
in strategy sophistication suggests that the mid-practice training had at
least some of the intended effect of increasing strategy sophistication.
The data also highlights that many participants were already using sophisticated strategies prior to training and that training did not make all
participants sophisticated strategy users.
While the Control condition did not change immediately following the
Mid-Practice break, it is interesting to note that in the second block following the break, strategy sophistication was often more similar to the
Training condition than to the Control condition. Given the substantially smaller sample size in the Control group, this may be attributed
to random sampling. The Control condition also seemed to have slightly
higher levels of strategy sophistication prior to the break. When comparing blocks 12 to 15 with blocks 16 to 19, the Training condition increased
by 13.5 percentage points, the No Training condition by 2.4 percentage
points, and the Control condition by 7.2 percentage points. Pairwise
comparisons of these change scores between conditions using independent groups t-tests showed that only Training and No Training conditions were significantly different (p < .001). The unadjusted p-value for
Training versus Control was 0.07 and No Training versus Control was
0.07.
182
CHAPTER 4. INSTRUCTED STRATEGY SHIFT
1.0
Strategy Sophistication
0.8
0.6
●
●
● ● ● ● ● ●
● ● ●
● ● ● ●
● ●
●
● ● ● ● ●
●
● ● ● ●
●
● ● ● ●
● ● ●
● ●
● ● ●
●
●
● ● ● ●
● ●
●
●
●
●
●
● ●
0.4
● ●
●
●
●
Training
No Training
Control
0.2
0.0
0
5
10
15
20
25
Block Number
Figure 4.1: Strategy sophistication by block and condition at the grouplevel. Dotted vertical line indicates timing of mid-practice training in the
Training condition.
30
4.3. RESULTS
183
Individual-Level
Figure 4.2 shows the relationship between practice and strategy sophistication at the individual-level. Many of the same patterns observed
in the No Training condition (see Figure 2.24) can also be observed in
the Training condition. The main distinction is the change in strategy
sophistication observed after the Mid Practice training.
Many participants had already adopted a relatively sophisticated
strategy prior to the mid-practice training. In some cases participants
used a sophisticated strategy from the first block (e.g., cases 2, 16). Other
participants had attained a sophisticated level over the course of the first
15 blocks. In such cases, participants were unable to shift to a more
sophisticated strategy because they had no training need. Other participants were at an intermediate level of strategy sophistication at the time
of the mid-practice training and did not increase sophistication further
(e.g., cases 113, 176). It is unclear whether such participants were unaware of the potential increases in sophistication that could be attained,
or whether they chose not to adopt such approaches.
Participant 14 dabbled with more sophisticated strategies after the
Mid Practice Training and then reverted to simpler strategies. Inspection
of Figure 4.4, shows how the participant experienced a fairly abrupt decline in performance while trying to use the more sophisticated strategy.
One interpretation is that external instruction was insufficient to overcome the substantial drop in performance the participant experienced
following the adoption of the more sophisticated strategy.
Several participants permanently increased strategy sophistication
following the mid-practice training (e.g., cases 23, 24, 34, 35, 45, 46, 53,
71, etc.). In particular, many of these shifts were abrupt both in onset
and in the level of strategy sophistication. The overall pattern highlights
how external instruction can lead to more abrupt strategy shifts than
184
CHAPTER 4. INSTRUCTED STRATEGY SHIFT
when strategy shifts are self-initiated.
05 1525
174
0.8
0.4
0.0
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
176
●
●
●
●
●
●
●
●
●
●
151
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
155
179
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
● ●
●
●
●
158
162
166
169
170
171
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●●●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
● ●●
●
●
●
●
●● ●
●
● ●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●●●
●
●
●
●
●●●●● ●●●
●
●●
●
●
●
●
●
●●
●
●
●● ●
●●
●
●
●● ●●● ●
●
●
●
●●
●●
●●
●
● ●●
●
●●●
● ●●●●
●
●
●
●
●
● ●●
●●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●● ●
●
● ●● ●
●●
●
●●
●
●
●● ●●
●●
● ●
●
●
●
●
●●
●
130
Strategy Sophistication
178
●● ● ●
● ●
●
●
●●●●●
●
●●
●
●
●●
●●●
●●
●●●
149
●
●
●
●
●
●
● ●●●●
●●
●●●●
●
●
●●
●
0.8
0.4
0.0
05 1525
131
134
135
136
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●● ●
●
●
●●
●
●●
●
●●
●
●
●●
●●
●
●
●
●
●
●●
●
●
●
●●
●
●●
●
●
●
●●
●●
●
●
● ●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
112
113
114
118
122
138
139
145
147
●● ●
●
●
●●●
●
●
●●●
●●
●
●
●
●
●
●
●
●
●
●●
●●● ●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●●
●
●
●
●
●
●●
● ●● ●
●
●●●
●
●
●
●● ●
●●
●
●
●●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
123
125
126
128
●●
●
●
●
●
●
● ●
●●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
● ●●
●●
●●
●
●●●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●
●●
●
●
●
●
●
●●●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
● ● ●
●
●
●● ●
●
●
●
●●
●●
●
●
●
●●●
●
●
●
●●
●
●
●
●● ●
●●
●
●●●●
●
●
●
●
● ●
●
●●
●
●
●
● ●●● ● ●●
●
●
●● ●●●
●●●
● ●
●
●●
●
●
● ●●
●●
● ●
●
●
●
●●
● ●● ● ●●
●
●
●
●
●
●
●
●●
●
●●
●●●
●
●●
● ●
●
●
●
●
●●
●
●
●
●
●●
●●● ●●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
● ●
●
● ●●
●
●
●
●
●
●
91
0.8
0.4
0.0
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●●
92
97
99
100
103
104
110
111
64
68
42
●●●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●● ●
●
●
●
●●
●
●
●
71
73
74
83
85
88
89
45
46
47
52
53
59
60
62
26
29
31
2
7
14
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
05 1525
0.8
0.4
0.0
●●
●
●
●
●●
●
●
● ●
●
●
●
●●
●
●
●
●
●
●
●
●
● ●●
●●
●
●
●
●
●
●●
●
● ●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●●
●
●
●
●
● ●●
●
●
● ●●●
●●
●
●●
●
●
●●
●●
●
●●● ●
●● ●
●●●
●
●
●●
●
● ●●●
●
●
●●
●●● ●● ●
●●
●●
●●
●●
●●
●●
●
●●
● ●
●
●●
●●
● ●
●
●
●●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ● ●
●●●
●● ●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
32
34
35
37
38
41
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
● ●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
● ●●
●
●
●●
●
●
●
●
●
●
●
●
●●
●
●●
●
●●●
●● ●
●
●
●
●
●
●
●●
●
●●
●●
●
●●●
●●
● ●●
●
●
●
●
●
●
●
●●
●
●
●● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●●
●
● ●
●
●●●
●
●
●●
●
●●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
0.8
0.4
0.0
0.8
0.4
0.0
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
● ●●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
● ●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●● ●
●
●
●
●●●
●
●●
●●●
●
●●● ● ●
●●●●
●
●●
●
●●
●●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●●●
●●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
● ●●
●
●
●
● ●●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
●
● ●●
●
●
●
●
●
●●
●
●
●●
●
● ●
● ●
●
●●●
●●
●
●
●
●
●
●●●
●
●
●
●
●
●
●●●
●
●●
●
●
●
●●
●
●
●
●●
● ●
●●
●
●
●
●
●
●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●●●● ●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
0.8
0.4
0.0
0.8
0.4
0.0
●
●
●
●
●
●
●
●
●
05 1525
16
19
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
20
23
24
25
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●●
●●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
● ●● ●●●
●
●
●
● ●●
●
●●
●
●
●●
●
●●
●
●
●●
●●
●●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
05 1525
05 1525
05 1525
Block
Figure 4.2: Strategy sophistication by block in the Training condition at
the individual-level. Black points indicate blocks before the mid-practice
training and grey points after the mid-practice training.
0.8
0.4
0.0
4.3. RESULTS
4.3.2
185
Practice and Performance
Group-level
Figure 4.3 shows the relationship between practice and task completion
time by condition at the group-level. The functional form of the No
Training condition was well fit by a three parameter power function.
The Control condition was similar to the No Training condition. The
greater volatility in the Control condition can be explained in terms of
the smaller sample size. Importantly, the break from task performance in
the Control condition did not appear to have an effect on task completion
time.
Supporting Hypothesis 4.2, performance dropped substantially in the
Training condition following the mid-practice training. A significant
effect of condition was obtained at block 16, F (2, 151) = 12.01, p <
.001, η 2 = 0.137. Tukey’s Post Hoc Test showed a significant difference
between the Training and the other two conditions (p < .05). By the end
of practice in block 30, performance in the Training condition was similar
to block 15 (block 15 = 6.17 block 30 = 6.11, and slightly worse than the
other two conditions (Control block 30 = 5.00, No Training block 30 =
5.48), although not significantly so, F (2, 151) = 2.67, p = .07, η 2 = 0.034;
Tukey’s HSD was also non-significant for all pairwise comparisons at the
.05 level. Thus, Hypothesis 4.3, which proposed that performance would
be superior in the Training condition by the end of practice, was not
supported.
The drop in performance in the Training condition is consistent with
the change in strategy following the mid-practice training, which put
individuals on a new learning curve. The absence of a similar change
following Block 15 in the Control condition suggests that the drop in
performance in the Training condition was not simply due to the break
from the task.
186
CHAPTER 4. INSTRUCTED STRATEGY SHIFT
Individual level: Training Condition
Figure 4.4 shows the relationship between practice and performance in
the Training condition before and after the mid-practice training. The
figure highlights how the immediate and final effect of the mid-practice
training varied between individuals. Some participants had little initial
or final effect (e.g., cases 151, 155). Some participants had an initial
drop, which was sustained to the end of the experiment (e.g., cases 89,
158). Some participants dropped initially, but ultimately restored their
performance (e.g., case 145) or surpassed their prior performance (e.g.,
case 88). A small number of participants obtained immediate benefits
(e.g., case 45). Many participants were better at the end of practice
than they were prior to the mid-practice training, although it is not clear
whether this was due to the mid-practice training or to more general
learning processes that may have occurred anyway.
Table 4.1 presents some quantitative summaries of these effects. In
particular, the table shows that most participants in the Training condition were slower following the extra training. After four blocks, approximately half of the participants were at pre-training levels. By the
final trials performance was similar to before the mid-practice training
for participants on average. This is consistent with the idea that performance drops when there is a shift from a well-known to a novel strategy.
It may also be that the strategies taught in training were only minimally
more effective than those adopted naturally by participants.
4.3. RESULTS
187
Table 4.1: Change in Task Completion Time Pre- and Post- Training in
the Training Condition
d
c
b
% improvede
SD%∆
M%∆
Comparison
M∆a SD∆
12–15 vs 16–17 1.80
12–15 vs 18–20 0.90
12–15 vs 28–30 -0.17
a
2.16 29.10
1.91 14.30
1.26 -2.07
35.54
30.82
20.37
18.42
43.42
60.53
M∆ is mean raw increase in RT (secs).
SD∆ is SD of raw increase in RT (secs).
c
M%∆ is mean of percentage increase in RT.
d
SD%∆ is SD of percentage increase in RT.
e
“% improved” is percentage of the sample that improved after training.
b
188
CHAPTER 4. INSTRUCTED STRATEGY SHIFT
12
●
●
●
Task Completion Time (s)
●
Training
No Training
Control
10
●
●
●
8
●
● ●
●
● ●
●
●
●
●
● ●
● ●
●
● ●
●
6
●
●
●
● ● ●
●
●
●
●
●
● ● ●
●
● ●
● ●
●
●
●
●
●
● ●
● ● ●
●
● ●
● ●
● ●
4
0
5
10
15
20
25
30
Block
Figure 4.3: Task completion time by block and condition at the grouplevel. Dotted vertical line indicates the onset of mid-practice training.
4.3. RESULTS
189
05 1525
174
15
5
176
05 1525
178
179
●
● ●●
●●
●
●●●
●
●
●
●
●
●
●
●●●
●
●
●
●
●●●●
●
●
● ●● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
● ● ● ●●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
149
151
155
158
162
166
169
170
171
●
●
●
●
●
●
● ●
●●
●
●
●●
●
●
●
●
●
●
●
● ● ●●
●●
●
●●
●
●
●
●
●
●
●
●
● ●●
●
●
●●
● ●
●● ●
● ●
●
●●
●
●
●
●● ● ●
●
● ●●
●●
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
●
●
●●
●
●●
● ●●
●
●
●
●
●
●
● ●
●
●
●●
●
●
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●●
●
●●
●
●●●
●●
●
●
●●
●●
●
●
●
●●
● ●● ● ●
●
●
●
●
●●
●
●
●●
●
●
●
● ●●
● ●
●
●
●
●
●
●
●●
●
●
●●●● ● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
● ●
●
●
● ●●●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
Task Completion Time (sec)
130
15
5
131
134
135
136
138
139
145
147
●
●
●
●
● ●●
●
● ●●● ● ●
●
●
●
●●
●
●
●
● ●
●
●
●
●
●
●
● ●●●
●
●
●●
● ● ●
●●
●●
●
●●
● ●●
●●●
●
●●
●
● ● ●●
●●
●
●●
●●
●
●
● ●●
●●
●
●
●
●
●
●
●
●●
●●●●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●●
●
●●● ●
●
●
●
●
●●
●
●
●
●
●● ●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●●
● ●●●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
112
113
114
118
122
123
125
126
128
●
● ●●
●
●
●
●● ●
●●
●
●
●●
●
● ● ● ●●
●
● ●●
●●●
●●
●● ● ●
●●
●●
●
●
●● ●
●●
●● ● ● ●
●●
●
●●
●
●
●
●
●●
● ●
●●
●
●●●●
●
●
●●
●
●●
●
●
● ● ●●
●
●● ● ●
●
●
●●
●
●●
●
●
●●
●
●●
●
●●●
●●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
● ●● ● ●
● ● ●
●
●
●
●●
●● ●
●
●
●
●●
●
●●
●
●
●
●
●●
●●
●
● ●●
●
●
●●
●●
●
●
●
● ●
●●
●
●●
●
●
●
●
●
●
●
● ●●
● ●
●
●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
● ●●
●
●
●
●
●
●
●●
91
15
5
92
97
99
100
103
104
110
111
●
●
●
●
●
●
●●
● ●
●●
●
●
●● ● ●
●
●
●●●
●●●
●
●●
●
●● ● ●●
●
●●
●
●
●
●
●
● ●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
● ●●●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●● ●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
68
71
73
74
83
85
88
89
●
●
●
●
●
●
●
●●
●● ●
●
●
●
●
●
● ●● ● ●
●
●● ●
●
●
●
●
●
●
●●
●
●●
●
●
●●●
●
●
●
●
●●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●●
● ●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●●
●
●
●
●●
●
●
●
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●● ●
●
●
●
●●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
● ●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
42
45
46
47
52
53
59
60
15
5
62
●
●
●
●
●
●
●●●
●
●
●
● ●
●
●
●
● ●●●●
●
●●
●
●●●
●
●●
●● ● ●
●
●
●
● ●● ●
●●
●
●
●
●
●● ● ●
●
●
●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●
●
●●
●
● ●
●● ● ● ●
●
●
●
●
●
●
●
● ●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●●
● ●●
●
●
●
●
●
●
●
●
●●
●
●
●
● ●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●●
● ●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
● ●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●●
●●
●
●
●
●
●
●
26
29
31
32
34
35
37
38
41
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●●
●
●●
● ●
●●
● ●
●●
●
●
●
●●
● ●●
●● ●
●
●
●
●
●
●
●
●● ●
●●
●
●●
●
●
●
●
●●
●
●
●
●
●●
●
●
●
●
●
●●
●
●
●
●
●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●●●
●
●
●
●
●●
●
●
●
● ●
●
●●
●
●
●
●●●
●
●
●
●
●
●
●
●
●● ●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●
●●●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
● ● ●● ●●
●
●
●
●
●
●
●
●●
2
15
5
15
5
●
64
15
5
15
5
7
14
16
19
20
23
24
25
●●
●
●
●●
●
● ●
●
●
●
●
●
●
●●● ●
●●●
●●●
●
●●●
● ●
●
●
●
●● ● ● ●
●●
●
●
●●●
●
●●
●
●
●
●
●
●
●●
●●
●●
●●
●●
●●
●
●
●
●
●
●●
●
●
● ●
●
●
●●
●●
●
● ●
●
●
●
●●
●
●
●
●
●
●
●
●
●●
●
●●
●
●
● ●
●●
●
●● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
05 1525
05 1525
05 1525
05 1525
05 1525
Block
Figure 4.4: Task completion time by block at the individual-level. Blocks
before mid-practice training are shown in black, and blocks after are
shown in grey.
15
5
190
CHAPTER 4. INSTRUCTED STRATEGY SHIFT
Figure 4.5 examines whether participants in the Training condition
who changed their strategy following training also had the greatest
change to their task completion time. This appears to be the case. The
figure compares data from the four blocks before training with the four
blocks after training. The smoothed line of fit shows that participants
with minimal change in strategy use generally had minimal change to
their task completion time. In contrast, participants who changed strategies typically were slower following the change. Interestingly, participants
who reduced their strategy sophistication also appeared to decline in performance.
Percentage Increase in RT
4.3. RESULTS
191
No Training
150
100
50
●●●● ●
● ●
●
●
●
●
●
●
●
●●
●●
●
●
●
●
●●
●●
●
●
●
●
●
●
●●
●
●●●
●●
●
●● ●●
●
●
0
●
−50
−0.2
0.0
0.2
0.4
0.6
0.8
1.0
Percentage Increase in RT
Raw Increase in Strategy Sophistication
Training
150
●
●
●
●●
●
●
●●
●
●●
●●
●●
●●
●
●
●
●
●
●●● ●●●●
●
●
●
●
●●
●
●
●
●
●
●
●
●
●
●
●●●●
●
●
●●
●●●
●
●
●
●
●
● ●●
100
50
0
●
●
●
●
●
−50
−0.2
0.0
0.2
0.4
0.6
0.8
1.0
Raw Increase in Strategy Sophistication
Figure 4.5: Scatter plot showing percentage increase in mean trial reaction time by increase in strategy sophistication from blocks immediately
before training (i.e., 12–15) with those immediately after training (i.e.,
16–18). Top graph shows No Training condition and bottom graph shows
training condition. Loess smoother is overlayed. Dotted line indicates
no change.
192
4.4
4.4.1
CHAPTER 4. INSTRUCTED STRATEGY SHIFT
Discussion
Overview
This chapter aimed to improve understanding of the dynamics of strategy
change and performance following instructions to shift strategy. Previous chapters suggested that self-initiated strategies were often gradual
and typically led to no discernible discontinuities in performance. This
chapter showed that instructed strategy shifts were more abrupt, led to
initial performance decline, and often led to discernible and new learning
curves.
4.4.2
Strategy
As expected the mid-practice strategy instruction led to an increase in
strategy sophistication. Consistent with expectations, participants were
able, at least partially, to learn from the training and follow the instructions.
The actual size of the strategy change at the group-level was modest.
An examination of the individual-level suggested several explanations
for this. First, a large number of participants were already using a sophisticated strategy at the time of the additional instruction. Initial
instructions appear to have been fairly effective in encouraging use of sophisticated strategies. Individuals already using sophisticated strategies
were thus largely unable to increase sophistication. Second, for participants using a simple strategy at the time of the mid-practice training
the shift was not always complete. Participants may not have been able
to absorb all the ideas about the sophisticated strategy at once.
At the individual-level, patterns of shift following mid-practice training were diverse. Of the shifts that did occur, most were abrupt. Several plausible reasons for this abruptness can be proposed. First, the
4.4. DISCUSSION
193
mid-practice training included several opportunities to consolidate the
increased strategy sophistication. Thus, strategy change may have happened gradually but over the period of the mid-practice training rather
than on the measured performance trials. Second, participants were given
explicit instruction to use the more sophisticated strategy. Third, in the
Training condition prior to the mid-practice training, and in the other
conditions at all times, participants were requested to optimise task completion time. Following the mid-practice training participants were asked
to optimise strategy sophistication and then performance. This may have
led participants who received the training to persist in using sophisticated
strategies in spite of a drop in performance. Whether this persistence was
due to perceived long term benefits of text editing strategy sophistication or to general compliance with experimental instructions is difficult
to determine. Finally, most participants were relatively stable in their
strategy use once mid-practice training occurred, which means that the
effect of instruction was more discernible.
In one case, an individual abruptly shifted to the sophisticated strategy and then reverted to the simpler strategy, possibly because of performance difficulties. Examination of the participant’s performance data
revealed a rather large drop in performance while the participant used
the more sophisticated strategy. Arguably, such a situation mirrors a
common occurrence across training situations where an individual lacks
the prerequisites to implement a strategy effectively, or where an individual perceives the time to attain competence in the new strategy as too
long or requiring too much effort. In the language of local minima, the
participant is unwilling to traverse the distance to an ultimately superior
minimum. Alternatively, the particular individual may not perceive the
required investment to improve performance in the longer term to be
rational.
Many other individuals shifted incompletely with some individuals
194
CHAPTER 4. INSTRUCTED STRATEGY SHIFT
settling on a strategy that was only partially sophisticated. This may
be due to limits in amount of information individuals could absorb. In
a more iterative environment of feedback and training typical of personalised instruction or deep self-reflection, such plateaus of strategy sophistication could be removed.
The timing of the mid-practice training also limited the strategy shifts
that could be observed. Many participants were already using relatively
sophisticated strategies at the onset of the mid-practice training.
It is interesting to note the changes in the control group before and
after the mid-practice break. The change did not occur immediately
following the break. The increase that did eventually occur was not
quite statistically significantly different to the other conditions. Several
explanations are possible. One explanation is that it was just random
variation. Another is that the disruption encouraged a new orientation
to the task and eventual increase in strategy sophistication. Either way,
the absense of an immediate shift after the break and the fact that the
sample observed shift was smaller than that for the Training condition
suggests that the shift observed in the Training condition was a result of
the Training and instruction it contained and not merely the break from
task performance. Theory would suggest that the increase in strategy
sophistication in the training condition was the result of the explicit entry
into the repertoire and explicit request to use sophisticated strategies
incorporated into the Training Condition.
Overall the results reiterated the biasing effect of group-level analysis.
The group-level results combined participants who had abrupt increases
in their strategy sophistication with those that changed very little. At
the group-level, it appeared that explicit instruction resulted in only a
modest increase in strategy sophistication. However, at the individuallevel, many participants increased abruptly and substantially in their
strategy sophistication. After these participants were combined with
4.4. DISCUSSION
195
others who were already operating at a high level of sophistication, the
discernible effect was greatly reduced.
4.4.3
Performance
As hypothesised, performance declined following strategy instructions.
The Control group did not appear to decline in performance following
a break from task practice. Thus, it seems likely that the instruction
induced a change in strategy which disrupted established strategies, and
slowed task completion time.
As expected, performance at the group-level and often at the
individual-level exhibited characteristics of a new monotonically decelerating learning curve. The results are also consistent with the idea that
new strategies are similar to a new task. They can create a new learning curve. It may be that the processes in naturalistic studies minimise
this effect by strategy selection processes operating when the strategy is
relatively adaptive in the short term.
In contrast to expectations, performance did not surpass either performance prior to the introduction of the strategy or performance relative
to the No Training group. However, by the end of practice, performance
was relatively similar, yet strategy sophistication remained substantially
higher.
These findings have several implications for applied work. Externally introduced strategy shifts typically require longer to learn. It reinforces general management principles that encourage the avoidance of
micro-management. It highlights how the expert’s view of the relative
effectiveness of a sophisticated strategy differs from that of the novice.
Effective trainers need to be wary of imposing their preferred strategy on
to novices. The expert is in a position to appreciate the long term benefits of investing in the superior strategy, but may need to apply effort
196
CHAPTER 4. INSTRUCTED STRATEGY SHIFT
to be mindful of the extended period the novice will experience where
performance declines. Such strategy adjustment is also likely to result in
a drop in automaticity, which may reduce the potential for the novice to
focus on broader task goals.
Another reason for the drop in performance may be related to the
individuals that shifted. In particular, many participants had already
shifted to the more sophisticated strategies. Participants who had already shifted are likely to have either felt intrinsically ready to make the
shift, or may have had prior experience relevant to using the same or similar strategies. Thus, the participants who were continuing to use simple
strategies at block 15 were less competent in general at text editing, and
particularly less ready for the more sophisticated text editing keys.
Along these lines, it would be interesting for future research to repeat
the present study with different levels of initial strategy training. With
less initial training, strategy sophistication should be lower at the point
of the mid-practice training. As such, it may be that the pool of potential
shifters that remain would include more participants ready for strategy
shift. By comparing low and high initial training conditions following
mid-practice training, the relative effect of readiness for shift and the
mere fact of introducing a strategy shift could be compared in terms of
adoption of the instructions and disruptions to performance.
The findings support the idea that individuals are adaptive in selecting strategies at least given the short term levels of use operating in the
present study. In the case of text editing, more sophisticated strategies
reduce key presses but also often require more complex coordination or
more complex mental models of text editing. Learning to coordinate new
movements and acquire new mental models takes time, arguably longer
than the duration of practice in the present study.
4.4. DISCUSSION
4.4.4
197
Comparison with Previous Research
Yechiam et al
The study by Yechiam et al. (2004) suggested that individuals were reluctant to switch to a more sophisticated strategy if they had already
invested time in learning a reasonably effective simpler strategy. In the
present study many participants shifted to a more sophisticated strategy
following instruction and training to do so. There are several explanations of why the results of Yechiam et al. (2004) differ from the present
study. First, the duration of the present study was of a defined duration, whereas Yechiam et al’s study was ostensibly defined in terms of
completing a defined number of trials, regardless of how long it took
to complete. Thus, assuming participants wanted to finish the study
quickly, in Yechiam et al’s study participants had an incentive to avoid
strategies with unknown pay-offs whereas in the present study whether
participants were fast or slow, the period of practice was of constant
duration.
Second, a similar point regarding incentives could be made about the
real world relevance of the strategies used. In Yechiam et al’s study the
script based strategy was invented for the experimental task. It was not
a scripting language that participants might use in the future to improve
their productivity. In contrast, the sophisticated strategies in the present
study had the potential to improve the productivity of participants outside the experiment. This fact was highlighted to participants in the
initial instructions. Real-world relevance should increase the incentive to
invest in sophisticated strategies.
Third, the immediate cost of switching to the sophisticated strategy
differed between the studies. In Yechiam et al’s study, the sophisticated
strategy involved writing a computer script. Participants needed to read
a manual and experiment, with no certainty that they would ever get
198
CHAPTER 4. INSTRUCTED STRATEGY SHIFT
the script to work. Thus, the real strategic choice was whether to invest
time in trying to use the sophisticated strategy. The actual execution of
the script once understood was very simple. This is a common model for
script based strategies, which are hard to learn, but easy to implement.
This contrasts with the sophisticated text editing strategy. Presumably,
most participants were able to press the sophisticated text editing keys.
However, even once participants were aware of the sophisticated keys,
there was still substantial learning required in order to automate the
psychomotor routines associated with the key combinations and refine
the production rules associated with using the keys in a given context.
Performance losses associated with a lack of skill would typically result
in reduced speed, but the task goal could still be achieved. This made it
possible to dabble with the sophisticated text editing keys and consider
their relevance.
Fourth, the training differed between the two studies. In Yechiam
et al’s study participants were merely given a manual which they could
choose to study. In the present study participants were required to work
through a set of interactive instructions and exercises. They were also
explicitly told at the completion of training that they should try to continue to use the more sophisticated strategies. Most likely, richer training
and stronger instructions can override the tendency of people to persist
in using simpler strategies.
Finally, the studies differed in the potential for transfer across simple
and sophisticated strategies. In Yechiam et al’s study the simple mouse
based strategy had no overlapping elements with the script based strategy. In contrast, sophisticated text editing share many elements with
simpler text editing. This degree of transfer should ease the transition
between the two strategies.
In summary, the above discussion interpreted the findings of the
present study in terms of incentives, information, and task characteris-
4.4. DISCUSSION
199
tics. Incentives can flow from an instructor, from the task itself, or from
the potential for future benefits. Incentives reflect the degree to which
short term performance declines are required for longer term benefits.
Incentives also reflect whether longer term benefits even exist.
The central conclusion of Yechiam et al. (2004) was that instructors
need to be careful when introducing simple strategies first in training.
The argument was that individuals may be reluctant to learn more complicated strategies once they have invested and grown comfortable with
simpler strategies. While this argument seems reasonable, the present
study suggests a few qualifications. First, external instruction and training can override the tendency to use simpler strategies. It is the role of
training to provide reasons, and a pathway, to more sophisticated strategies. Second, for some individuals, who do not perform the task often,
it may be rational for them to maintain the simpler framework of thinking about the task. This removes the need for additional learning, a
temporary drop in performance, and the increased complexity and cognitive resource requirements that may be initially required to learn the
more sophisticated strategies. Third, task interfaces can often be used to
facilitate the conversion from simpler to more sophisticated strategies.
Delaney et al
Results of the present study were consistent with Delaney et al. (1998)
who found that external introduction of a new strategy can lead to a
temporary reduction in performance. The new strategy introduces new
task elements to learn and introduces a new learning curve.
The new learning curves are even more apparent when the strategy
is externally introduced. Self-initiated strategy shifts on the text editing task appear to be more adaptive. Participants appear less willing to
accept temporary drops in performance for potential future gains. Par-
200
CHAPTER 4. INSTRUCTED STRATEGY SHIFT
ticipants who initiated a strategy shift on their own seem to be more
ready to benefit from these more sophisticated strategies.
4.4.5
Conclusion
The results have interesting implications for the decision of whether to
instruct people to adopt a more sophisticated strategy when they are already comfortable with an existing simpler strategy. A temporary drop
in performance is often to be expected. Whether the acceptance of such
a temporary drop in performance is rational would depend on the details of the duration of the drop, the ultimate superiority of the more
sophisticated strategy, and how often the strategy would be used in the
future. It also helps to explain why it is sometimes better to teach more
sophisticated strategies from the start.
Overall, the results show that instructed strategy shifts yield discontinuities in performance. However, such discontinuities, at least in the case
of text editing, are typically the opposite to those proposed by Haider
and Frensch (2002). Rather than improvement in performance, strategy
shift when externally introduced can result in a short term drop in performance. In the literature there have been occasions where ultimately
superior strategies are seen as being synonymous with performance improvement (e.g., Crossman, 1959). The present study has shown empirically that process and performance are distinct, and that sometimes
the benefits of an ultimately superior strategy take a period of extended
practice to be realised.
It may be that casual observations help to explain the ongoing belief
in discontinuities following strategy shift. First, if an individual who is
skilled in both a simple and a sophisticated strategy chooses to compare
his or her own performance with the simple and sophisticated strategy,
the individual will notice that performance abruptly improves when he
4.4. DISCUSSION
201
or she uses the sophisticated strategy. Second, if an individual is skilled
in the simple strategy but not the sophisticated strategy and is told to
adopt the sophisticated strategy when they are not ready, they will notice an abrupt drop in performance. However, the strategy shift that
typically occurs as part of the learning curve is different to both of the
above scenarios. Typically, when an individual shifts strategy as part of
the learning process, shifts are likely to occur when they seem appropriate. While different dynamics are possible, it seems that self-initiated
strategies are more likely to occur when there are benefits in the short
term. Thus, empirically observed strategy shifts are self-initiated and
often lead to minimal observed changes to performance. The learner is
both ready to shift, but has not yet realised the full benefits of the shift.
In combination, these different conditions may help to explain the persistence of ideas of discontinuous effects on performance following strategy
shift.
202
CHAPTER 4. INSTRUCTED STRATEGY SHIFT
Chapter 5
General Discussion
Contents
5.1
Overview . . . . . . . . . . . . . . . . . . . . . 203
5.2
General Themes . . . . . . . . . . . . . . . . . 208
5.3
5.4
5.1
5.2.1
Biasing Effect of Group-Level Analyses . . . 208
5.2.2
Trial-Level Strategy Measurement . . . . . . 209
5.2.3
Individual Differences . . . . . . . . . . . . . 210
5.2.4
Continuity and Discontinuity . . . . . . . . . 210
Limitations . . . . . . . . . . . . . . . . . . . . 211
5.3.1
Overview . . . . . . . . . . . . . . . . . . . . 211
5.3.2
Sample . . . . . . . . . . . . . . . . . . . . . 211
5.3.3
Task . . . . . . . . . . . . . . . . . . . . . . . 212
5.3.4
Context . . . . . . . . . . . . . . . . . . . . . 213
Conclusion
. . . . . . . . . . . . . . . . . . . . 214
Overview
This thesis aimed to understand several fundamental aspects of the relationship between practice, strategy use, and performance. The first aim
203
204
CHAPTER 5. GENERAL DISCUSSION
was concerned with determining the functional form of the relationships
between practice and strategy use, and practice and performance. The
second aim examined the role of individual differences in explaining variation in strategy sophistication and performance. The third aim assessed
differences between self-initiated and instructed strategy shifts.
To achieve these aims, three studies were conducted. Each study
measured individual differences and strategy use and performance on a
text editing task. The methodology and analytic approach combined
individual-level analysis, consideration of multiple continuous and discontinuous models, and trial-level measurement of strategy sophistication and task completion time. Results confirmed and extended previous
research on the relationship between practice, strategy use, and performance. Specifically, several hypotheses were proposed and tested. Table
5.1 sets out these hypotheses indicating over the three studies whether
each one was supported. In summary, most hypotheses were supported.
2.7
2.6
2.5
2.4
Supported
Supported
Supported
Supported
Supported
Discontinuities do not occur in the relationship between
practice and task completion time at the group-level.
Discontinuities occasionally occur in the relationship
between practice and task completion time at the
individual-level.
For individuals that shift strategy, some shifts are gradual and others are abrupt.
Many individuals will be characterised by only partial
strategy sophistication.
The probability of a strategy shift occurring decreases
monotonically with practice.
2.3
2.2
The relationship between practice and task completion Supported
time at the group-level is better explained by a three
parameter power function than a three parameter exponential function.
The relationship between practice and task completion Supported
time at the individual-level is better explained by a three
parameter exponential function than a three parameter
power function.
2.1
Study 1
Hypothesis
Number
Table 5.1: Summary of Support for Hypotheses
Supported
Supported
Supported
Supported
Nonsignificant
trend
in
predicted
direction
Supported
Supported
Study 3
Table continues on next page
Supported
Supported
Supported
Supported
Supported
Supported
Supported
Study 2
5.1. OVERVIEW
205
For some individuals, the relationship between practice
and strategy sophistication involves near continuous use
of a simple strategy.
For some individuals, the relationship between practice
and strategy sophistication involves near continuous use
of a sophisticated strategy.
The relationship between practice and strategy sophistication at the group-level is effectively modelled by a
three parameter Michaelis-Menten function.
Typing speed is strongly positively correlated with text
editing performance.
Prior experience is strongly positively correlated with
text editing performance.
Ability is moderately positively correlated with text
editing task performance.
Big 5 personality factors are unrelated to text editing
task performance.
Strategy use partially mediates the relationship between
ability and prior experience on task performance.
The combination of a request to use a strategy and information on how to use a strategy increases strategy
use.
2.8
4.1
3.5
3.4
3.3
3.2
3.1
2.10
2.9
Hypothesis
Number
Not Tested
Rejected
Supported
Supported
Supported
Supported
Supported
Supported
Supported
Study 1
Partially Supported
Rejected
Supported
Supported
Supported
Supported
Supported
Supported
Supported
Study 3
Table continues on next page
Not Tested
Rejected
Supported
Supported
Supported
Supported
Supported
Supported
Supported
Study 2
206
CHAPTER 5. GENERAL DISCUSSION
4.3
When sophisticated strategies are externally introduced Not Tested
later in practice, performance declines immediately after
introduction.
When sophisticated strategies are externally introduced Not Tested
later in practice, performance eventually surpasses the
level that would have been attained had instruction not
been received.
4.2
Study 1
Hypothesis
Number
Not Tested
Not Tested
Study 2
Not
ported
Sup-
Supported
Study 3
5.1. OVERVIEW
207
208
CHAPTER 5. GENERAL DISCUSSION
The three previous chapters have discussed specific findings. It is not
the purpose of this chapter to repeat these. The remainder of this general
discussion chapter focuses on general themes, limitations, and concluding
comments. Suggestions for future research are also interspersed.
5.2
5.2.1
General Themes
Biasing Effect of Group-Level Analyses
Throughout this thesis it has been argued that it is the individual-level
that is relevant to understanding psychological change. While grouplevel data are easier to model due to both the need to only model one
dataset and the reduced error variance, group-level models are inappropriate for drawing inferences regarding the individual-level. The biasing
effect of group-level analysis was demonstrated on multiple occasions in
this thesis. Given the frequent use of group-level analyses, each of these
findings challenges important existing ideas in the research literature.
First, the exponential function generally provided superior fit in comparison to the power function at the individual-level, yet the reverse was
true at the group-level. This finding goes against Newell and Rosenbloom’s (1981) proposed Power Law of Practice. It supports the findings
of Heathcote et al. (2000), extending them by studying not only cognitive
tasks but also a task with cognitive and perceptual-motor components.
Second, the group-level smoothed over the few discontinuities that
did occur at the individual-level. The infrequency of discontinuities at
the individual-level goes against the suggestions by Haider and Frensch
(2002) that discontinuities were under-identified largely due to grouplevel analyses. However, it did take individual-level analyses and active
modelling of discontinuities in order to actually assess the presence of
discontinuities at the individual-level.
5.2. GENERAL THEMES
209
Third, as with task performance, similar biasing effects were observed
when modelling the relationship between practice and strategy sophistication at the group-level. Strategy sophistication often increased fairly
abruptly at the individual level while at the group-level it followed a
continuous, increasing, decelerating pattern.
Taken together, these findings have a number of important implications. Theories, measurement, and modelling of the relationships between practice, strategy use, and performance need to proceed at the
individual-level if psychologically meaningful conclusions are to be drawn.
In the present thesis a reconciliation of discontinuous strategy shifts and
continuous learning functions was provided that was only possible due to
the individual-level perspective adopted.
5.2.2
Trial-Level Strategy Measurement
Another theme in this thesis was the importance of trial-level measurement of strategy use at the individual-level. The use of text editing
enabled recording of key presses which facilitated extraction of strategy
measurement. This enabled actual measurement and modelling of the
relationship between practice and strategy use.
This contrasts with many previous studies which did not or were not
able to measure strategy for each individual on each trial. For example,
Haider and Frensch (2002) inferred strategy use from average performance in blocks and behaviour on a transfer task. While not all tasks
have strategies that manifest in observable behaviour, tasks that do have
observable strategies represent a good starting point for researching the
relationship between practice and strategy use.
210
5.2.3
CHAPTER 5. GENERAL DISCUSSION
Individual Differences
A third general theme was that of individual differences. On the criterion side, the thesis provided a more nuanced explanation of individual
differences in performance. On the predictor side, models that emphasise
prior experience, ability, and skill were supported.
Importantly, no one function can be used to describe relationships
between practice, strategy use, and performance. In addition to the common finding that parameters of functions vary between individuals, the
actual functional form differed between individuals. This variation was
even more prominent in the relationship between practice and strategy
sophistication than it was between practice and performance. These two
levels of randomness—between individuals and over time—are essential
to developing models of skill acquisition.
5.2.4
Continuity and Discontinuity
The theme of continuity and discontinuity has flowed throughout this thesis. Analyses highlighted the importance of adopting a formal modelling
procedure that assesses the degree of fit of discontinuous models. Such
analyses were also predicated on the previously mentioned individuallevel of measurement.
Measurement, modelling, and individual-level focus have not cooccurred in previous research when modelling the learning curves. There
are several possible explanations for this. First, many learning curves
appear to be relatively well-explained by continuous models. Second,
the literature on discontinuity analysis is relatively disconnected from
psychology largely existing in statistics and econometrics. Third, many
tasks are unlikely to have discontinuities. A contribution of this thesis has
been to highlight how individual-level discontinuities can be modelled.
Instructed strategy shift, not surprisingly, resulted in discontinuities
5.3. LIMITATIONS
211
to both strategy sophistication and performance in ways distinct from
self-initiated strategy shift. External interventions represent a discontinuity in the environment, which helps to explain the discontinuous effect
on the individual. In contrast, without external intervention, the environment tends to be fairly constant in a typical skill acquisition study. This
typically leads to big shifts occurring early and with reduced frequency
over time. In addition, natural processes of strategy selection appear
to interact with top-down processes leading to minimal immediate performance effects of self-initiated strategy shifts. Similarly self-initiated
strategy changes are more often gradual, possibly caused by both local
reinforcement processes and the potentially implicit method of strategy
entry into the repertoire.
5.3
5.3.1
Limitations
Overview
This section focuses on some of the limitations of the empirical research
presented in this thesis. For the most part the limitations focus on the
degree to which the findings can be generalised to other samples, tasks,
and contexts.
5.3.2
Sample
First, participants in the present research were mostly young adult university students. This sample can be contrasted with children (e.g.,
Siegler, 2006) and older adults (e.g., Hertzog, Touron, & Hines, 2007;
Rogers, Fisk, & Hertzog, 1994; Rogers et al., 2000), two groups that
have both received substantial research interest. Also, while the sample
varied in prior experience on the text editing task, most participants had
a reasonable level of prior experience.
212
CHAPTER 5. GENERAL DISCUSSION
Given the selective nature of the sample, it is unclear how well the
findings would generalise to children, older adults, and those with less
prior experience. One possibility is that the findings would largely be
the same except that performance would be poorer, use of sophisticated
strategy would be less, and achievement of minimal task requirements
would occur less often. Alternatively, quite different dynamics might
emerge. In particular, the inability to use a keyboard or having no prior
experience with text editing would create a wide range of new challenges
which this thesis has not aimed to address. It would be interesting if
future research were to adopt the same design and modelling paradigms
as used in this thesis and applied it to these other populations.
5.3.3
Task
The second limitation relates to the consistent use of keyboard based
text editing task in all three studies. The task was chosen to enable
generalisation of findings to other repetitive tasks that contain cognitive,
perceptual, and motor components along with the potential for a simple–
sophisticated strategy shift, such as air-traffic control tasks, and many
other computerised tasks. Nonetheless, future research could examine the
extent to which present findings generalise to other tasks. In particular,
it may be that the complexity of real-world tasks reduces further the
potential for discontinuities to emerge.
Future research could also take a broader perspective on strategy use.
While the present thesis operationalised strategy use based on key logs,
future research could examine several other strategic elements related
to the text editing. First, algorithm-retrieval shift could be examined.
This could include memorising editing changes, memorising aspects of
the edited text, and memorising shortcut keys. Second, eye gaze could
be studied, which could include the efficient allocation of visual atten-
5.3. LIMITATIONS
213
tion to aspects of the editing screen (for an example, see F. J. Lee &
Anderson, 2001). Finally, the learning orientation of participants could
be examined. For example, a learner may decide strategically to learn
and apply advanced shortcut keys even if it means a temporary decline
in performance. Another learner may not be willing to accept temporary
declines in performance.
5.3.4
Context
The studies presented in this thesis all involved a monitored laboratory
setting with performance feedback. While this is common for psychological studies of learning curves, it does raise issues regarding generalisation.
It may be that learning processes are simply sped up in an experimental skill acquisition study. However, in real-world settings discretionary
effort, breaks in practice, and the unstructured nature of instructions
are likely to change the dynamics of learning. The main reason for the
choice of the lab based experimental method was to focus on the particular learning processes of interest and thereby remove consideration
of issues such as discretionary effort, which are certainly important in
the real-world. Nonetheless, caution should be exercised in generalising
findings to work or other real-world settings. It may be, as Charman and
Howes (2003) suggested, that participants in lab settings switch to better
strategies more quickly because they are less concerned with higher-order
goals. Likewise, the lack of prediction by personality variables may be
related to the minimal capacity to use discretionary effort.
A second aspect of the setting is the duration and concentration of
practice. For the studies in this thesis, total practice time was short
relative to some real-world contexts. Also, in real-world settings, practice
is often less concentrated. Factors of motivation, fatigue, and forgetting
are likely to be more important. In particular in the real-world, learning
214
CHAPTER 5. GENERAL DISCUSSION
tasks such as typing or using complex software can occur over months and
years. It is likely that on these more complex tasks that discontinuities
are likely to occur with much less frequency.
5.4
Conclusion
The process by which humans get faster at completing a task with practice has been and remains an exciting area of research. This process
is part of the amazing capacity of humans to adapt to their environment. Describing the functional form of the relationship between practice and performance, and articulating the processes that underlie both
task performance and learning, remains an active interest to researchers
in psychology. This thesis aimed to contribute to this understanding in
several ways, by taking an individual-level perspective, by testing discontinuous models, by examining individual differences, and by comparing
self-initiated and instructed strategy shifts. A means of reconciling discontinuous models of strategy shift with continuous models of the learning curve was proposed. These theoretical, modelling, measurement, and
design ideas will hopefully be useful for future researchers looking at new
tasks, samples, and contexts.
References
Ackerman, P. L. (1987). Individual differences in skill learning: An integration of psychometric and information processing perspectives.
Psychological Bulletin, 102 , 3–27.
Ackerman, P. L. (1988). Determinants of individual differences during
skill acquisition: Cognitive abilities and information processing.
Journal of Experimental Psychology: General , 117 , 288–318.
Ackerman, P. L. (1989). Within-task intercorrelations of skilled performance: Implications for predicting individual differences? (a
comment on Henry & Hulin, 1987). Journal of Applied Psychology,
74 (2), 360–364.
Ackerman, P. L. (1990). A correlational analysis of skill specificity:
Learning, abilities, and individual differences. Journal of Experimental Psychology: Learning, Memory, and Cognition, 16 , 883–
901.
Ackerman, P. L. (1992). Predicting individual differences in complex skill
acquisition: Dynamics of ability determinants. Journal of Applied
Psychology, 77 , 598–614.
Ackerman, P. L., Beier, M. E., & Boyle, M. O. (2002). Individual
differences in working memory within a nomological network of
cognitve and perceptual speed abilities. Journal of Experimental
Psychology: General , 131(4), 567–589.
Ackerman, P. L., Kanfer, R., & Goff, M. (1995). Cognitive and noncog215
216
References
nitive determinants and consequences of complex skill acquisition.
Journal of Experimental Psychology: Applied , 1(4), 270–304.
Ackerman, P. L., & Woltz, D. J. (1994). Determinants of learning and
performance in an associative memory/substitution task: Task constraints, individual differences, volition, and motivation. Journal
of Educational Psychology, 86(4), 487–515.
Adams, J. A. (1987). Historical review and appraisal of research on the
learning, retention and transfer of human motor skills. Psychological Bulletin, 101 , 41–74.
Allison, R. B. (1960). Learning parameters and human abilities (Office
of Naval Research Techinical Report). Princeton, NJ: Educational
Testing Service.
Andersen, T., Carstensen, J., Hernndez-Garca, E., & Duarte, C. M.
(2009). Ecological thresholds and regime shifts: Approaches to
identification. Trends in Ecology & Evolution, 24 (1), 49–57.
Anderson, J. R. (1982). Acquisition of cognitive skill. Psychological
Review , 89 , 369–406.
Anderson, J. R. (1987). The architecture of cognition. Cambridge, MA:
Harvard University Press.
Anderson, J. R. (1990). The adaptive character of thought. NJ: Lawrence
Erlbaum.
Anderson, J. R. (1993). Rules of the mind. Hillsdale, NJ: Erlbaum.
Anderson, J. R. (2000). Cognitive psychology and its implications (5th
ed.). New York: Worth Publishers.
Anderson, J. R., Bothell, D., Byrne, M. D., Douglass, S., Lebiere, C., &
Qin, Y. (2004). An integrated theory of the mind. Psychological
Review , 111(4), 1036–1060.
Anderson, J. R., Matessa, M., & Legiere, C. (1997). ACT-R: A theory of
higher level cognition and its relation to visual attention. Human
Computer Interaction, 12 , 439–462.
References
217
Andrews, D. (1993). Tests for parameter instability and structural change
with unknown change point. Econometrica: Journal of the Econometric Society, 61 , 821–856.
Anglim, J. (2000a). Simple, 2-Choice, 4-Choice Reaction Time Test
[Computer software manual].
Anglim, J. (2000b). Text editing task for jeromy anglim’s honours thesis, supervised by janice langan-fox (1.0 ed.) [Computer software
manual].
Anglim, J., & Waters, L. (2007). Practical tips on how to conduct a
sophisticated online psychological experiment. In 43rd APS Annual
Conference, 23-27 September 2008. Hobart, Tasmania, Australia..
Armstrong, K. (2000). Cognitive appraisals, metacognition and emotion
during skill acquisition. Unpublished doctoral dissertation, University of Melbourne, School of Behavioural Science.
Bahrick, H. P., Fitts, P. M., & Briggs, G. E. (1957). Learning curves:
Facts or artifacts? Psychological Bulletin, 54 (3), 256–268.
Barrick, M., & Mount, M. (1991). The Big Five personality dimensions and job performance: A meta-analysis. Personnel Psychology,
44 (1), 1–26.
Bates, D. M., & Watts, D. G. (1988). Nonlinear regression analysis and
its applications. NY: John Wiley & Sons, Inc.
Beem, A. L. (1995). A program for fitting two-phase segmented-curve
models with an unknown change point, with an application to the
analysis of strategy shifts in a cognitive task. Behavior Research
Methods, Instruments and Computers, 27 (3), 392–399.
Beier, M. E., & Ackerman, P. L. (2005). Age, ability, and the role
of prior knowledge on the acquisition of new domain knowledge:
Promising results in a real-world learning environment. Psychology
and Aging, 20 , 341–355.
Bhavnani, S. K., & John, B. E. (2000). The strategic use of complex
218
References
computer systems. Human-Computer Interaction, 15 (2), 107–137.
Blessing, S. B., & Anderson, J. R. (1996). How people learn to skip
steps. Journal of Experimental Psychology-Learning Memory and
Cognition, 22 (3), 576–597.
Bolker, B. M. (2008). Ecological models and data in R. New Jersey:
Princeton Univ Press.
Brown, R., Durbin, J., & Evans, J. (1975). Techniques for testing the
constancy of regression relationships over time. Journal of the Royal
Statistical Society. Series B (Methodological), 37 (2), 149–192.
Bryan, W. L., & Harter, N. (1897). Studies in the physiology and psychology of the telegraphic language. New York: Macmillan.
Bryan, W. L., & Harter, N. (1899). Studies on the telegraphic language:
The acquisition of a hierarchy of habits. Psychological Review , 6 ,
345–375.
Burke, M. (1984). Validity generalization: A review and critique of the
correlation model. Personnel Psychology, 37 (1), 93–115.
Card, S. K., Moran, T. P., & Newell, A. (1980). The keystroke-level
model for user performance time with interactive systems. Communications of the ACM , 23 (7), 396–410.
Card, S. K., Moran, T. P., & Newell, A. (1983). The psychology of
human-computer interaction. Hillsdale, NJ: Erlbaum.
Carroll, J. B. (1993). Human cognitive abilities: A survey of factoranalytic studies. New York: Cambridge University Press.
Carroll, J. M., & Rosson, M. B. (1987). Paradox of the active user.
Cambridge, MA, USA: MIT Press.
Carroll, L. (1865). Alice’s adventures in wonderland. England: Macmillan.
Cary, M., & Reder, L. M. (2002). Metacognition in strategy selection:
Giving consciousness too much credit. In M. Izaute, P. Chambres,
& P. Marescaux (Eds.), Metacognition: Process, function, and use
References
219
(pp. 63–78). New York: Kluwer.
Charman, S. C., & Howes, A. (2003). The adaptive user: An investigation
into the cognitive and task constraints on the generation of new
methods. Journal of Experimental Psychology: Applied , 9 (4), 236–
248.
Chu, C., Stinchcombe, M., & White, H. (1996). Monitoring structural
change. Econometrica: Journal of the Econometric Society, 64 (5),
1045–1065.
Cohen, J. (1992). A power primer. Psychological bulletin, 112 (1), 155–
159.
Cook, R., Kay, J., Ryan, G., & Thomas, R. (1995). A toolkit for appraising the long-term usability of a text editor. Software Quality
Journal , 4 (2), 131–154.
Crossman, E. R. F. (1959). A theory of the acquisition of speed skill.
Ergonomics, 2 , 153–166.
Crowley, K., Shrager, J., & Siegler, R. (1997). Strategy discovery as
a competitive negotiation between metacognitive and associative
mechanisms. Developmental Review , 17 , 462–489.
Crowley, K., & Siegler, R. (1999). Explanation and generalization in
young children’s strategy learning. Child Development, 70 , 304–
316.
Cudeck, R., & Harring, J. R. (2007). Analysis of nonlinear patterns of
change with random coefficient models. Annual Review of Psychology, 58 , 615–637.
De Jong, J. R. (1957). The effects of increasing skill on cycle time and
its consequences for time standards. Ergonomics, 1 (1), 51–60.
Delaney, P. F., Reder, L. M., Staszewski, J. J., & Ritter, F. E. (1998).
The strategy-specific nature of improvement: The power law applies by strategy within task. Psychological Science, 9(1), 1–7.
Dienes, Z., & Perner, J. (1999). A theory of implicit and explicit knowl-
220
References
edge. Behavioral and Brain Sciences, 22 , 735–808.
Digman, J. (1990). Personality structure: Emergence of the five-factor
model. Annual Review of Psychology, 41 (1), 417–440.
Donchin, E. (1995). Video games as research tools: The space fortress
game. Behavior Research Methods Instruments and Computers,
27 , 217–217.
Ecker, U., Lewandowsky, S., Oberauer, K., Chee, A., & Ecker, U. (2010).
The components of working memory updating: An experimental
decomposition and individual differences. Journal of Experimental
Psychology: Learning, Memory and Cognition, 36 (1), 170–189.
Ekstrom, R. B., French, J. W., Harman, H. H., & Dermen, D. (1976).
Kit of factor-referenced cognitive tests. Princeton, NJ: Educational
Testing Services.
Erev, I., Ert, E., & Yechiam, E. (2008). Loss aversion, diminishing sensitivity, and the effect of experience on repeated decisions. Journal
of Behavioral Decision Making, 21 (5), 575–597.
Ericsson, K. A., & Charness, N. (1994). Expert performance: Its structure and acquisition. American Psychologist, 49 , 725–747.
Estes, W. K. (1956). The problem of inference from curves based on
group data. Psychological Review , 53 , 134–140.
Farina, A., & Wheaton, G. (1971). Development of a taxonomy of human
performance: The task characteristics approach to performance prediction. Defense Technical Information Center.
Fitts, P. M., & Posner, M. I. (1967). Human performance. Belmont,
CA: Brooks/Cole.
Fleishman, E. A.
(1972).
On the relation between abilities, learn-
ing, memory and human performance. American Psychologist, 27 ,
1017–1032.
Fleishman, E. A., & Fruchter, B. (1960). Factor structure and predictability of successive stages of learning morse code. Journal of
References
221
Applied Psychology, 44 (2), 97–101.
Frensch, P., Lindenberger, U., & Kray, J. (1999). Imposing structure on
an unstructured environment: Ontogenetic changes in the ability
to form rules of behavior under conditions of low environmental
predictability. In A. D. Friederici & R. Menzel (Eds.), Learning:
Rule extraction and representation (pp. 139–164). Berlin: Walter
De Gruyter Inc.
Fry, E. B., Kress, J. E., & Fountoukidis, D. L.
ing teachers book of lists (3rd ed.).
(1993).
Read-
Center for Applied Re-
search in Education. Available from http://www.duboislc.org/
EducationWatch/First100Words.html
Gilbreth, F. B. (1911). Motion study: A method for increasing the
efficiency of the workman. New York: Van Nostrand.
Goldberg, L. (1993). The structure of phenotypic personality traits.
American Psychologist, 48 (1), 26–34.
Goldberg, L. R. (1999). A broad-bandwidth, public-domain, personality inventory measuring the lower-level facets of several five-factor
models. In I. Mervielde, I. Deary, F. D. Fruyt, & F. Ostendorf
(Eds.), Personality psychology in europe (Vol. 7, pp. 7–28). Tilburg,
The Netherlands: Tilburg University Press.
Goldberg, L. R., Johnson, J. A., Eber, H. W., Hogan, R., Ashton, M. C.,
Cloninger, C. R., et al. (2006). The international personality item
pool and the future of public-domain personality measures. Journal
of Research in Personality, 40 , 84–96.
Gray, W. D., & Boehm-Davis, D. A. (2000). Milliseconds matter: An
introduction to microstrategies and to their use in describing and
predicting interactive behavior. Journal of Experimental Psychology: Applied , 6 (4), 322–335.
Gray, W. D., John, B. E., & Atwood, M. E. (1993). Project Ernestine:
Validating a GOMS analysis for predicting and explaining real-
222
References
world task performance. Human-Computer Interaction, 8 (3), 237–
309.
Gray, W. D., Sims, C. R., Fu, W. T., & Schoelles, M. J. (2006). The soft
constraints hypothesis: A rational analysis approach to resource
allocation for interactive behavior. Psychological Review , 113 , 461–
482.
Haider, H., & Frensch, P. A. (1999). Information reduction during skill
acquisition: The influence of task instruction. Journal of Experimental Psychology: Applied , 5(2), 129–151.
Haider, H., & Frensch, P. A. (2002). Why aggregated learning follows
the power law of practice when individual learning does not: Comment on Rickard (1997, 1999), Delaney et al. (1998), and Palmieri
(1999). Journal of Experimental Psychology: Learning, Memory,
and Cognition, 28(2), 392–406.
Hansen, B. (2001). The new econometrics of structural change: Dating
breaks in US labor productivity. Journal of Economic perspectives,
15 (4), 117–128.
Harvey, L., & Rousseau, R. (1995). Development of text-editing skill:
From semantic and syntactic mappings to procedures. HumanComputer Interaction, 10 (4), 345–400.
Heathcote, A., Brown, S., & Mewhort, D. J. K. (2000). The power law
repealed: The case for an exponential law of practice. Psychonomic
Bulletin and Review , 7(2), 185–207.
Hertzog, C., Cooper, B. P., & Fisk, A. D. (1996). Aging and individual differences in the development of skilled memory search performance. Psychology and Aging, 11 , 497–520.
Hertzog, C., Touron, D. R., & Hines, J. C.
(2007).
Does a time-
monitoring deficit influence older adults’ delayed retrieval shift during skill acquisition? Psychlogy and Aging, 22 (3), 607–624.
Hirsch, W. Z. (1952). Manufacturing progress functions. Review of
References
223
Economics and Statistics, 34 (2), 134–155.
Huet, S., Bouvier, A., Poursat, M. A., & Jolivet, E. (2004). Statistical
tools for nonlinear regression. New York: Springer.
Hunter, J., & Schmidt, F. (1998). The validity and utility of selection methods in personnel psychology: Practical and theoretical
implications of 85 years of research findings. Psychological bulletin,
124 (2), 262–274.
Ippel, M., & Beem, A. (1987). A theory of antagonistic strategies.
Learning and Instruction: European Research in an International
Context, 1 , 111–121.
John, B. E. (2003). Information processing and skilled behavior. In
J. M. Carroll (Ed.), HCI models, theories, and frameworks: Toward a multidisciplinary science (pp. 55–101). San Francisco, LA:
Morgan Kaufmann.
John, B. E., & Lallement, Y. (1997). Strategy use while learning to perform the Kanfer-Ackerman Air Traffic Controller Task. In Proceedings of the Nineteenth Annual Conference of the Cognitive Science
Society: August 7–10, 1997, Stanford University (pp. 337–342).
Hillsdale, NJ: Lawrence Erlbaum Associates.
Kanfer, R., & Ackerman, P. L. (1989). Motivation and cognitive abilities: An integrative/aptitude-treatment interaction approach to
skill acquisition. Journal of Applied Psychology, 74 , 657–690.
Keil, C. T., & Cortina, J. M. (2001). Degradation of validity over time:
A test and extension of Ackerman’s model. Psychological Bulletin,
127 , 673–697.
Keller, F. S. (1958). The phantom plateau. Journal of the Experimental
Analysis of Behavior , 1 (1), 1-13.
Kelley, K. (2009). The average rate of change for continuous time models.
Behavior Research Methods, 41 (2), 268–278.
Khodadadi, A., & Asgharian, M. (2008). Change-point problem and
224
References
regression: An annotated bibliography. COBRA Preprint Series,
44.
Kolers, P. A. (1975). Memorial consequences of automatized encoding.
Journal of Experimental Psychology: Human Learning and Memory, 1 (6), 689–701.
Kolers, P. A., & Duchnicky, R. (1985). Discontinuity in cognitive skill.
Journal of Experimental Psychology: Learning, Memory, and Cognition, 11 (4), 655–674.
Konz, S. A. (1987). Work design: Industrial ergonomics. Scottsdale,
AZ: Publishing Horizons.
Kramer, A., Strayer, D., & Buckely, J. (1990). Development and transfer of automatic processing. Journal of Experimental Psychology:
Human Perception and Performance, 16 (3), 505–522.
Kramer, W., Ploberger, W., & Alt, R. (1988). Testing for structural
change in dynamic models. Econometrica: Journal of the Econometric Society, 56 (6), 1355–1369.
Kyllonen, P., Lohman, D., & Woltz, D. (1984). Componential modeling
of alternative strategies for performing spatial tasks. Journal of
Educational Psychology, 76 (6), 1325–1345.
Lacey, S. (2007). Fundamental processes of skill acquisition for basic
choice-reaction tasks. Unpublished doctoral dissertation, University
of Michigan.
Lane, N. E. (1987). Skill acquisition rates and patterns: Issues and
training implications. New York: Springer-Verlag.
Lang, J. W. B., & Bliese, P. D. (2009). General mental ability and two
types of adaptation to unforeseen change: applying discontinuous
growth models to the task-change paradigm. Journal of Applied
Psychology, 94 (2), 411–428.
Langan-Fox, J., Armstrong, K., Balvin, N., & Anglim, J. (2002). Process
in skill acquisition: Motivation, interruptions, memory, affective
References
225
states and metacognition. Australian Psychologist, 37(2), 104–117.
Lee, F. J., & Anderson, J. R. (2001). Does learning a complex task have
to be complex?: A study in learning decomposition. Cognitive
Psychology, 42(3), 267–316.
Lee, F. J., Anderson, J. R., & Matessa, M. P. (1995). Components of
dynamic skill acquisition. In Proceedings of the Seventeenth Annual Conference of the Cognitive Science Society (pp. 506–511).
Hillsdale, NJ: Lawrence Erlbaum Associates.
Lee, M. D., & Webb, M. R. (2005). Modeling individual differences in
cognition. Psychonomic Bulletin and Review , 12 (4), 605–621.
Leisch, F. (2002). Sweave: Dynamic generation of statistical reports using literate data analysis. In W. Härdle & B. Rönz (Eds.), Compstat 2002 — proceedings in computational statistics (pp. 575–580).
Heidelberg, Germany: Physica Verlag.
Lemaire, P., & Siegler, R. S. (1995). Four aspects of strategic change:
Contributions to children’s learning of multiplication. Journal of
Experimental Psychology: General , 124 , 83–97.
Logan, G. D. (1988). Toward an instance theory of automatization.
Psychological Review , 95 (4), 492–527.
Logan, G. D. (1992). Shapes of reaction-time distributions and shapes
of learning curves: A test of the instance theory of automaticity.
Journal of Experimental Psychology, 18(5), 883–914.
Lovett, M. C. (2005). A strategy-based interpretation of stroop. Cognitive Science, 29 , 493–524.
Lovett, M. C., & Anderson, J. R. (1996). History of success and current context in problem solving: Combined influences on operator
selection. Cognitive Psychology, 31(2), 168–217.
Lovett, M. C., Reder, L. M., & Lebiere, C. (1997). Modeling individual
differences in a digit working memory task. In M. G. Shafto &
P. Langley (Eds.), Proceedings of the nineteenth annual conference
226
References
of the Cognitive Science Society (pp. 460–465). Mahwah, NJ.
Luchins, A. S. (1942). Mechanisms in problem solving: The effect of
Einstellung. Psychological Monographs, 54 , 1–110.
Luwel, K., Beem, A. L., Onghena, P., & Verschaffel, L. (2001). Using
segmented linear regression models with unknown change points
to analyze strategy shifts in cognitive tasks. Behavior Research
Methods, Instruments, and Computers, 33 , 470–478.
Luwel, K., Verschaffel, L., Onghena, P., & De Corte, E. (2003). Strategic
aspects of numerosity judgment: The effect of task characteristics.
Experimental Psychology, 51(1), 63–75.
MacKay, D. G. (1982). The problem flexibility, fluency, and speedaccuracy trade-off in skilled behavior. Psychological Review , 89 ,
483-506.
Marshalek, B., Lohman, D. F., & Snow, R. E. (1983). The complexity continuum in the radex and hierarchical models of intelligence.
Intelligence, 7 , 107–127.
Mazur, J. E., & Hastie, R. (1978). Learning as accumulation: A reexamination of the leanring curve. Psychological Bulletin, 85 (6),
1256–1274.
McDaniel, M. A., Schmidt, F. L., & Hunter, J. E. (1988). Job experience
correlates of performance. Journal of Applied Psychology, 73 , 327–
330.
McGeoch, J. A. (1927). The acquisition of skill. Psychological Bulletin,
24 (8), 437–466.
Michaelis, L., & Menten, M. (1913). Die kinetik der invertinwirkung.
Biochemische Zeitschrift, 49 , 333-369.
Moore, M., & Dutton, P. (1978). Training needs analysis: Review and
critique. Academy of Management Review , 3 , 532–545.
Morrison, R., & Brantner, T. (1992). What enhances or inhibits learning
a new job? A basic career issue. Journal of Applied Psychology,
References
227
77 (6), 926–940.
Myung, I. J., Kim, C., & Pitt, M. A. (2000). Toward an explanation
of the power law artifact: Insights from response surface analysis.
Memory and Cognition, 28 (5), 832–840.
Neisser, U. (1963). Decision time without reaction time: Experiments in
visual scanning. American Journal of Psychology, 76 (3), 376–385.
Neisser, U., Boodoo, G., Bouchard, T. J., Boykin, A. W., Brody, N.,
Ceci, S. J., et al. (1996). Intelligence: Knowns and unknowns.
American Psychologist, 51 , 77–101.
Neves, C. M., & Anderson, J. R. (1981). Knowledge compilation: Mechanisms for the automatization of cognitive skills. In J. R. Anderson
(Ed.), Cognitive skills and their acquisition (pp. 57–84). Hillsdale,
NJ: Erlbaum.
Newell, A., & Rosenbloom, P. S. (1981). Mechanisms of skill acquisition
and the law of practice. In J. R. Anderson (Ed.), Cognitive skills
and their acquisition (pp. 1–55). Hillsdale, NJ: Erlbaum.
Palmeri, T. J. (1997). Exemplar similarity and the development of automaticity. Journal of Experimental Psychology: Learning, Memory,
and Cognition, 23 (2), 324–354.
Palmeri, T. J. (1999). Theories of automaticity and the power law of
practice. Journal of Experimental Psychology: Learning, Memory,
and Cognition, 25(2), 543–551.
Patrick, J. (1992). Training: Research and practice. Academic Press
London.
Payne, J. W., Bettman, J. R., & Johnson, E. J. (1988). Adaptive strategy
selection in decision making. Journal of Experimental Psychology:
Learning, Memory, and Cognition, 14 , 534–552.
Payne, S. J., Squibb, H. R., & Howes, A. (1990). The nature of device
models: The yoked state space hypothesis and some experiments
with text editors. Human-Computer Interaction, 5 (4), 415–444.
228
References
Ploberger, W., & Kramer, W. (1992). The CUSUM test with OLS residuals. Econometrica: Journal of the Econometric Society, 60 (2),
271–285.
Proctor, R. W., & Dutta, A. (1995). Skill acquisition and human performance. Thousand Oaks, CA: Sage.
R Development Core Team. (2010). R: A language and environment for
statistical computing [Computer software manual]. Vienna, Austria. Available from http://www.R-project.org (ISBN 3-90005107-0)
Rabbitt, P., Banerji, N., & Szymanski, A. (1989). Space Fortress as
an IQ test? Predictions of learning and of practised performance
in a complex interactive video-game. Acta Psychologica, 71 (1-3),
243–257.
Raudenbush, S. W. (2001). Comparing personal trajectories and drawing causal inferences from longitudinal data. Annual Review of
Psychology, 52 , 501–525.
Reber, A. S. (1989). Implicit learning and tacit knowledge. Journal of
Experimental Psychology: General , 118 , 219–235.
Reder, L. M. (1988). Strategic control of retrieval strategies. In G. Bower
(Ed.), The psychology of learning and motivation (pp. 227–259).
New York: Academic Press.
Reder, L. M., & Ritter, F. E. (1992). What determines initial feeling
of knowing? familiarity with question terms, not with the answer.
Journal of Experimental Psychology. Learning, Memory, and Cognition, 18 (3), 435–451.
Reed, H. B., & Zinszer, H. A. (1943). The occurrence of plateaus in
telegraphy. Journal of Experimenta1 Psychology, 33 , 130–135.
Revelle, W. (2010). psych: Procedures for psychological, psychometric,
and personality research [Computer software manual]. Available
from http://CRAN.R-project.org/package=psych
(R package
References
229
version 1.0-88)
Rickard, T. C. (1997). Bending the power law: A CMPL theory of
strategy shifts and the automatization of cognitive skills. Journal
of Experimental Psychology: General , 126 , 288–310.
Rickard, T. C. (1999). A CMPL alternative account of practice effects in
numerosity judgments. Journal of Experimental Psychology: Learning, Memory, and Cognition, 25 , 532–542.
Rickard, T. C. (2004). Strategy execution in cognitive skill learning:
An item-level test of candidate models. Journal of Experimental
Psychology: Learning, Memory and Cognition, 30 (1), 65–82.
Ritter, F., & Schooler, L. (2002). The learning curve. In International
encyclopedia of the social and behavioral sciences (pp. 8602–8605).
Amsterdam: Pergamon.
Ritz, C., & Streibig, J. C. (2008). Nonlinear regression with R. New
York: Springer.
Robertson, S. P., & Black, J. B. (1983). Planning units in text editing behavior. In Proceedings of the SIGCHI conference on Human
Factors in Computing Systems (pp. 217–221). New York: ACM.
Robinson, K. M. (2001). The validity of verbal reports in children’s
subtraction. Journal of Educational Psychology, 93(1), 211–222.
Rogers, W. A., Fisk, A. D., & Hertzog, C. (1994). Do ability-performance
relationships differentiate age and practice effects in visual search?
Journal of Experimental Psychology: Learning, Memory and Cognition, 20 , 710–738.
Rogers, W. A., Hertzog, C., & Fisk, A. D. (2000). An individual differences analysis of ability and strategy influences: Age-related differences in associative learning. Journal of Experimental Psychology:
Learning, Memory and Cognition, 26 , 359–394.
Rosenbaum, D. A., Carlson, R. A., & Gilmore, R. O. (2000). Acquisition of intellectual and perceptual-motor skills. Annual Review of
230
References
Psychology, 52 , 453–470.
Rossett, A. (1987). Training needs assessment. New Jersey: Educational
Technology Publications.
Sackett, P., & Lievens, F. (2008). Personnel selection. Annual Review of
Psychology, 59 , 419-450.
Sackett, P., Schmitt, N., Ellingson, J., & Kabin, M. (2001). Highstakes testing in employment, credentialing, and higher education:
Prospects in a post-affirmative-action world. American Psychologist, 56 (4), 302–318.
Sackett, P., Zedeck, S., & Fogli, L. (1988). Relations between measures of typical and maximum job performance. Journal of Applied
Psychology, 73 (3), 482–486.
Sackett, P. R., Zedeck, S., & Fogli, L. (1988). Relations between measures of typical and maximum job performance. Journal of Applied
Psychology, 73 , 482–486.
Sarkar, D. (2010). lattice: Lattice graphics [Computer software manual].
Available from http://CRAN.R-project.org/package=lattice
(R package version 0.18-8)
Schmidt, F., & Hunter, J. (1977). Development of a general solution to
the problem of validity generalization. Journal of Applied Psychology, 62 (5), 529–540.
Schneider, W., & Shiffrin, R. M. (1977). Controlled and automatic
human information processing: I. detection, search, and attention.
Psychological Review , 84 , 127–190.
Schunn, C. D., & Reder, L. M. (2001). Another source of individual differences: Strategy adaptivity to changing rates of success. Journal
of Experimental Psychology: General , 130(1), 59–76.
Schunn, C. D., Reder, L. M., & Nhouyvanisvong, A. (1997). To calculate
or not to calculate: A source activation confusion model of problem
familiarity’s role in strategy selection. Journal of Experimental
References
231
Psychology: Learning, Memory, and Cognition, 23 (1), 3–29.
Seber, G. A. F., & Wild, C. J. (2003). Nonlinear regression (2nd ed.).
Hoboken, NJ: John Wiley & Sons.
Seibel, R. (1963). Discrimination reaction time for 1,023-alternative task.
Journal of Experimental Psychology, 66 , 215–226.
Siegler, R. S. (1987). The perils of averaging data over strategies: An
example from children’s addition. Journal of Experimental Psychology: General , 116 (3), 250–264.
Siegler, R. S. (1988a). Individual differences in strategy choices: Good
students, not-so-good students, and perfectionists. Child Development, 59 (4), 833–851.
Siegler, R. S. (1988b). Strategy choice procedures and the development of
multiplication skill. Journal of Experimental Psychology: General ,
17(3), 258–275.
Siegler, R. S. (1996). Emerging minds: The process of change in children’s
thinking. New York: Oxford University Press.
Siegler, R. S. (2006). Microgenetic analyses of learning. In D. Kuhn
& R. S. Siegler (Eds.), Handbook of child psychology (Vol. 2, pp.
464–510). Hoboken, NJ: Wiley.
Siegler, R. S., & Crowley, K. (1992). Microgenetic methods revisited.
American Psychologist, 47 , 1241–1243.
Siegler, R. S., & Crowley, K. (1996). The microgenetic method. In
L. Smith (Ed.), Critical readings on Piaget (pp. 606–620). London:
Routledge.
Siegler, R. S., & Lemaire, P. (1997). Older and younger adults’ strategy choices in multiplication: Testing predictions of ASCM using
the choice/no-choice method. Journal of Experimental Psychology:
General , 126 (1), 71–92.
Siegler, R. S., & Shipley, C. (1995). Variation, selection, and cognitive
change. In G. Halford & T. Simon (Eds.), Developing cognitive
232
References
competence: New approaches to process modeling (pp. 31–76). New
York: Academic Press.
Siegler, R. S., & Stern, E. (1998). Conscious and unconscious strategy discoveries: A microgenetic analysis. Journal of Experimental
Psychology: General , 127 , 377–397.
Singer, J. D., & Willett, J. B. (2003). Applied longitudinal data analysis: Modeling change and event occurrence. New York: Oxford
University Press.
Singley, M., & Anderson, J. (1985). The transfer of text-editing skill.
International Journal of Man-Machine Studies, 22 (4), 403–423.
Singley, M., & Anderson, J. (1987). A keystroke analysis of learning
and transfer in text editing. Human-Computer Interaction, 3 (3),
223–274.
Singley, M. K., & Anderson, J. R. (1989). The transfer of cognitive skill.
Cambridge, MA: Harvard University Press.
Snoddy, G. S. (1926). Learning and stability. Journal of Applied Psychology, 10 , 1–36.
Sobel, M. (1982). Asymptotic confidence intervals for indirect effects in
structural equation models. Sociological Methodology, 290–312.
Speelman, C. P., & Kirsner, K. (2005). Beyond the learning curve: The
construction of mind. Oxford, UK: Oxford University Press.
Staw, B. (1981). The escalation of commitment to a failing course of
action. Academy of Management Review , 6 , 577–587.
Taatgen, N. A. (2001). A model of individual differences in learning air
traffic control. In E. M. Altmann, A. Cleeremans, C. D. Schunn, &
W. D. Gray (Eds.), Fourth International Conference on Cognitive
Modeling (pp. 211–216). Mahwah, NJ: Lawrence Erlbaum.
Tett, R., Jackson, D., & Rothstein, M. (1991). Personality measures as
predictors of job performance: A meta-analytic review. Personnel
Psychology, 44 (4), 703–742.
References
233
Thorndike, E. L., & Woodworth, R. S. (1901). The influence of improvement in one mental function upon the efficiency of other functions.
Psychological Review , 8 , 247–261.
Touron, D. R. (2002). Does what we know affect how we learn? Adult
age differences in the effects of item knowledge and rule knowledge
on skill development (Doctoral dissertation, Syracuse University).
Dissertation Abstracts International: Section B: The Sciences and
Engineering, 62(9-B), 4250.
Touron, D. R., & Hertzog, C. (2004). Strategy shift affordance and
strategy choice in young and older adults. Memory and Cognition,
32 , 298–310.
Touron, D. R., Hoyer, W. J., & Cerella, J. (2004). Age-related differences
in the component processes of cognitive skill learning. Psychology
and Aging, 19(4), 565–580.
VanLehn, K. (1996). Cognitive skill acquisition. Annual review of psychology, 47 (1), 513–539.
Verschaffel, L., De Corte, E., Lamote, C., & Dherdt, N. (1998). The acquisition and use of an adaptive strategy for estimating numerosity.
European Journal of Psychology of Education, 13 (3), 347–370.
Voelkle, M. C., Wittmann, W. W., & Ackerman, P. L. (2006). Abilities
and skill acquisition: A latent growth curve approach. Learning
and Individual Differences, 16 , 303–319.
Walsh, M. M., & Anderson, J. R. (2009). The strategic nature of changing
your mind. Cognitive Psychology, 58 , 416–440.
Yechiam, E., Erev, I., & Parush, A. (2004). Easy first steps and their
implication to the use of a mouse-based and a script-based strategy.
Journal of Experimental Psychology: Applied , 10(2), 89–96.
Yechiam, E., Erev, I., Yehene, V., & Gopher, D. (2003). Melioration and
the transition from touch-typing training to everyday use. Human
Factors, 45 (4), 671–684.
234
References
Zeileis, A., Kleiber, C., Krmer, W., & Hornik, K. (2003). Testing and
dating of structural changes in practice. Computational Statistics
and Data Analysis, 44 (1-2), 109–123.
Zeileis, A., Leisch, F., Hornik, K., & Kleiber, C. (2002). strucchange:
An r package for testing for structural change in linear regression
models. Journal of Statistical Software, 7 (2), 1-38. Available from
http://www.jstatsoft.org/v07/i02/
Appendix A
Study 1: Additional Materials
A.1
Overview
This Appendix provides assorted additional material related to the introduction, method, and results of Study 1.
A.2
A.2.1
Materials
Study 1 Prior Experience Scale
Question text, response options, and assigned weight for each item on
the scale are shown below. Weights were used in the calculation of prior
experience. Weights were not displayed to participants.
Q1) How often do you use word processing programs,
for example Microsoft Word?
8 = Frequently (nearly ever day);
5 = Often (once or twice a week);
2 = Occasionally (once or twice a month);
0 = Rarely or Never
Q2) Would you say you are experienced
235
236
APPENDIX A. STUDY 1: ADDITIONAL MATERIALS
and/or skilled at using a keyboard?
5 = YES, very experienced and skilled;
3 = YES, moderately experienced and skilled;
0 = NO, not at all experienced or skilled
Q3) Do you feel eyestrain and have vision problems
from looking at a computer screen?
0 = Yes, usually after a few minutes;
1 = Yes, but only after a long period of time;
2 = NO, never
Q4) Do you ever have any recurring problems with your vision?
0 = Yes, my vision is substantially impaired;
3 = Yes, but my vision is corrected by visual aides;
4 = NO, never
Q5) Do you have any physiological (physical) problems that
impair your motor control (the movement) in your arms or wrists?
5 = NO;
2 = YES --- moderate impairment;
0 = YES --- severe impairment
For each of the following items participants were asked to rate their
level of competence and familiarity using the following scale: 3 = Highly
competent; 2 = Somewhat competent; 1 = Familiar with but not experienced; 0 = Never heard of it.
Q6) Short cut keys
Q7) Cut, copy, paste
Q8) Formatting paragraphs (e.g., using tabs, indenting)
Q9) Outline view or master document
A.2. MATERIALS
237
Q10) Macros
The following questions required participants to provide an answer to
each question. Each item was scored 3 = correct and 0 = incorrect.
Q11) What is the Windows short cut key for copy?
Q12) What is the Windows short cut key for paste?
Q13) What is the Windows short cut key for undo?
Q14) Which of the following keys is used to highlight
text with the keyboard?
i) Alt
(please circle)
ii) Control (Ctrl)
iii) Shift
Q15) Where does the cursor go if you press the ‘home’ key?
Q16) Where does the cursor go if you press
the ‘ctrl’ and ‘end’ keys?
Q17) What does holding ‘ctrl’ and ‘shift’
and pressing the right arrow button, accomplish?
Answers were scored correct if the following answers were provided:
Q11: Ctrl+X; Q12: Ctrl+V; Q13: Ctrl+Z; Q14: Shift; Q15: Start of
line; Q16: End of document; Q17: one of (a) Select word to the right or
(b) unselect word from left of selection.
238
A.3
A.3.1
APPENDIX A. STUDY 1: ADDITIONAL MATERIALS
Text Editing Task
Additional Screenshots of Computer Interface
Figure A.1: Study 1 text editing task dialog box displayed at the end of
a set of trials.
Figure A.2: Study 1 text editing task dialog box displayed at the end of
all trials on the text editing task.
A.3. TEXT EDITING TASK
A.3.2
239
Text Editing Instructions
The following instructions were given to participants to read:
(Adapted from Armstrong (2000))
The task you are now going to complete is a text-editing task. This
task is a relevant skill in many work settings. For example, text editing is
used in administration and office settings. It is also becoming increasingly
important in managerial roles where professional reports are commonly
written with the aid of a personal computer. Publishing is also an area
where this skill is vital. Text editing may also be a useful predictor of job
performance in many white-collar jobs. Today you will be given a chance
to practice this skill and will receive feedback about your performance
on this task.
The task stimuli will consist of a computer text-editing task. This
will entail making corrections in the text file from a marked manuscript
provided.
You will sit in front of a computer terminal with a keyboard for input
and a video display terminal for output. You are to make each of the
marked modifications in the text file to produce an updated file. Do not
change anything else in the document and complete the corrections as
quickly and accurately as possible. Do not spend much time checking
what you have completed.
You will be provided with feedback on your speed and accuracy after
each 1 minute trial. The first trial will be for practice and then you
will have 54 trials to complete divided into 18 trials of approximately 20
minutes, with breaks in between. When you are ready to start the trial
press F2 and when you have completed each trial press F3 to move on
to the next trial. Once you have reached the end of the 18 trials please
take a 5 minute break.
In order to make the changes to the file on the computer you will
need to use several different text-editing keys. An outline of all possible
text-editing functions is included on the following page.
PLEASE DO NOT USE THE MOUSE!
Once you have read the instructions and understood what is required
from you for this task, you will be directed to open the file on the computer named ‘Text Editing’ and then upon instruction you can commence
the task.
If you have any questions please do not hesitate to ask the researcher
240
A.3.3
APPENDIX A. STUDY 1: ADDITIONAL MATERIALS
Accuracy Calculation
Accuracy was measured as a score out of 12. One accuracy point was
awarded for the presence of each of the following strings.
(i) “We know”
(ii) “, no languages or”
(iii) “languages or cultures”
(iv) “Self-”
(v) “-knowledge”
(vi) “199”
(vii) “1994”
(viii) “and selfhood”
(ix) “selfhood.”
(x) “of identity”
(xi) “of identity and”
(xii) “of identity and selfhood”
A.4
Study 1 Data Cleaning
Study 1 revealed many of the challenges that can arise in experimental
skill acquisition research. Specifically, the aim of measurement was to
obtain an accurate measure of a participants’ performance on a given trial
assuming they were applying reasonable effort. Extensive exploratory
analysis of the resulting data revealed many problematic trials. These
problematic trials occurred more frequently for some participants than
A.4. STUDY 1 DATA CLEANING
241
others. Thus, from a research perspective some participants were more
problematic than others. The following section outlines how problematic
data was identified and how it was cleaned.
After a thorough analysis of key logs, task completion time, and accuracy data, the following procedure was adopted to clean the data. A
small number of participants had corrupted or missing key log data, who
were thus excluded from analysis. For participants with key log data,
each trial was classified into one of seven categories (see Table A.1). As
a percentage of all raw trials, 0.73% were Cheat, 61.31% were Accurate,
21.46% were Sloppy, 3.88% were Slow, 5.74% were Skip, and 4.72% were
Avoid. There were 6 participants who had one or more trials classified
as a cheat trial.
Table A.1: Study 1 Trial Type Classification for Validity Purposes.
Type
1.
Definition
Cheat
Pasted clipboard from
previous trial
2. Accurate Not cheat and Accuracy = 12
3. Sloppy
Not cheat and 9 <=
Accuracy < 12
4. Slow
Trial timed out and 5
<= Accuracy < 9
5. Sleepy
Trial timed out and
Accuracy < 5
6. Skip
Trial ended manually
and 5 <= Accuracy <
9
7. Avoid
Trial ended manually
and Accuracy < 5
Cheat
Valid
Adjust
Penalty
TRUE
FALSE
NA
100
FALSE
TRUE
FALSE
0
FALSE
TRUE
TRUE
1
FALSE
TRUE
TRUE
2
FALSE
FALSE
NA
4
FALSE
FALSE
NA
4
FALSE
FALSE
NA
8
Note. Accuracy is out of 12.
Each trial type was given a penalty rating based on the degree to
which the trial type reflected undesirable participant behaviour. These
penalty points were then summed for each participant over the 54 trials of
242
APPENDIX A. STUDY 1: ADDITIONAL MATERIALS
practice. Participants with more than 39.5 penalty points were removed
from the cleaned dataset. This specific threshold was chosen for two reasons. First, the distribution of penalty points appeared to be a mixture
distribution reflecting a main normally distributed group of participants
who were presumed to have adequate skill and be applying reasonable
effort, and a second skewed distribution derived from problematic participants. The chosen threshold discriminated these two distributions well.
Second, actual examination of sequences of participant trial codes after
the threshold started to appear problematic in an absolute sense, based
on the frequency and type of problematic trials.
In addition to using trial types for assigning penalty points, trials
types were also classified as either valid or invalid. Invalid trials were
removed from the cleaned dataset. The result was that most retained
participants still had one or two trials removed. As a percentage of all
trials for retained participants, none were Cheat, 81.28% were Accurate,
15.29% were Sloppy, 1.47% were Slow, 1.23% were Skip, and 0.47% were
Avoid.
Finally, trial completion time was adjusted for accuracy. Accuracy
ratings were based on 12 checks done on the task. Accuracy of 12 out of
12 (i.e., 100%) had a multiple of 1.0, which meant that the completion
time for the trial was unaltered. Table A.2 shows the multiples applied
for less than perfect accuracy. The multiple aimed to approximate the
time it would have taken the participant to complete the trial with perfect
accuracy.
Fortunately, through careful data cleaning, meaningful data could be
extracted from the majority of cases in Study 1. However, the challenges
encountered in Study 1 provided several experimental design lessons that
were incorporated into Studies 2 and 3. Trials labelled as cheating in the
form of copy and pasting completed text from previous to subsequent
trials was prevented in study 2 and 3 by clearing the clipboard after
A.4. STUDY 1 DATA CLEANING
243
Table A.2: Study 1 Task Completion Time Multiplier Associated with a
Given Trial Accuracy out of 12.
Accuracy
RT Multiplier
12
11
10
9
8
7
6
5
1.00
1.05
1.10
1.15
1.20
1.25
1.30
1.40
each trial. Sleepy trials were subsequently largely avoided through the
use of a system whereby the absence of a key press for five seconds or
more paused the timer. Thus, participants could not engage in avoidant
behaviour. Skipping was prevented in Study 2 and 3, by making end of
trials contingent on accurately completing the required edits rather than
allowing participants to end the trial themselves. Several issues with
accuracy were subsequently resolved by designing the text editing task
to only have one edit per trial and incorporating informative messages
related to common accuracy issues such as leaving extra spaces in the
edited text.
244
APPENDIX A. STUDY 1: ADDITIONAL MATERIALS
Appendix B
Study 2: Additional Materials
B.1
Materials
B.1.1
IPIP Personality Test
Big 5 personality was measured using the IPIP personality test
(L. R. Goldberg et al., 2006). The details of each item are shown in
Table B.1. The Reverse column indicates whether the item was negatively worded and needed to be reversed (i.e., 6 − x, where x is the score
on the item) before calculating total scores. The test aimed to measure
the Big 5 personality factors: Extraversion (EX), Conscientiousness (C),
Emotional Stability (ES), Openness (O), and Agreeableness (A).
Table B.1: Study 2 IPIP Big 5 Personality Measure
Item Text
1
2
3
4
5
6
7
I
I
I
I
I
I
I
Reverse
am the life of the party.
insult people.
am always prepared.
get stressed out easily.
have a rich vocabulary.
often feel uncomfortable around others.
am interested in people.
Table continues on next page
245
1
-1
1
-1
1
-1
1
Factor
EX
A
C
ES
O
EX
A
246
APPENDIX B. STUDY 2: ADDITIONAL MATERIALS
Item Text
8 I leave my belongings around.
9 I am relaxed most of the time.
10 I have difficulty understanding abstract ideas.
11 I feel comfortable around people.
12 I am not interested in other people’s problems.
13 I pay attention to details.
14 I worry about things.
15 I have a vivid imagination.
16 I keep in the background.
17 I sympathize with others’ feelings.
18 I make a mess of things.
19 I seldom feel blue.
20 I am not interested in abstract ideas.
21 I start conversations.
22 I feel little concern for others.
23 I get chores done right away.
24 I am easily disturbed.
25 I have excellent ideas.
26 I have little to say.
27 I have a soft heart.
28 I often forget to put things back in their proper
place.
29 I am not easily bothered by things.
30 I do not have a good imagination.
31 I talk to a lot of different people at parties.
32 I am not really interested in others.
33 I like order.
34 I get upset easily.
35 I am quick to understand things.
36 I don’t like to draw attention to myself.
37 I take time out for others.
38 I shirk my duties.
39 I rarely get irritated.
40 I try to avoid complex people.
41 I don’t mind being the center of attention.
42 I am hard to get to know.
43 I follow a schedule.
44 I change my mood a lot.
45 I use difficult words.
46 I am quiet around strangers.
47 I feel others’ emotions.
48 I neglect my duties.
Table continues on next page
Reverse
Factor
-1
1
-1
1
-1
1
-1
1
-1
1
-1
1
-1
1
-1
1
-1
1
-1
1
-1
C
ES
O
EX
A
C
ES
O
EX
A
C
ES
O
EX
A
C
ES
O
EX
A
C
1
-1
1
-1
1
-1
1
-1
1
-1
1
-1
1
-1
1
-1
1
-1
1
-1
ES
O
EX
A
C
ES
O
EX
A
C
ES
O
EX
A
C
ES
O
EX
A
C
B.1. MATERIALS
Item Text
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
I
seldom get mad.
have difficulty imagining things.
make friends easily.
am indifferent to the feelings of others.
am exacting in my work.
have frequent mood swings.
spend time reflecting on things.
find it difficult to approach others.
make people feel at ease.
waste my time.
get irritated easily.
avoid difficult reading material.
take charge.
inquire about others’ well-being.
do things according to a plan.
often feel blue.
am full of ideas.
don’t talk a lot.
know how to comfort others.
do things in a half-way manner.
get angry easily.
will not probe deeply into a subject.
know how to captivate people.
love children.
continue until everything is perfect.
panic easily.
carry the conversation to a higher level.
bottle up my feelings.
am on good terms with nearly everyone.
find it difficult to get down to work.
feel threatened easily.
catch on to things quickly.
feel at ease with people.
have a good word for everyone.
make plans and stick to them.
get overwhelmed by emotions.
can handle a lot of information.
am a very private person.
show my gratitude.
leave a mess in my room.
take offense easily.
am good at many things.
Table continues on next page
247
Reverse
1
-1
1
-1
1
-1
1
-1
1
-1
-1
-1
1
1
1
-1
1
-1
1
-1
-1
-1
1
1
1
-1
1
-1
1
-1
-1
1
1
1
1
-1
1
-1
1
-1
-1
1
Factor
ES
O
EX
A
C
ES
O
EX
A
C
ES
O
EX
A
C
ES
O
EX
A
C
ES
O
EX
A
C
ES
O
EX
A
C
ES
O
EX
A
C
ES
O
EX
A
C
ES
O
248
APPENDIX B. STUDY 2: ADDITIONAL MATERIALS
Item Text
91
92
93
94
95
96
97
98
99
100
I
I
I
I
I
I
I
I
I
I
B.1.2
Reverse
wait for others to lead the way.
think of others first.
love order and regularity.
get caught up in my problems.
love to read challenging material.
am skilled in handling social situations.
love to help others.
like to tidy up.
grumble about things.
love to think up new ways of doing things.
-1
1
1
-1
1
1
1
1
-1
1
Factor
EX
A
C
ES
O
EX
A
C
ES
O
Typing Test
The typing test consisted of three one minute trials. Before the task
started, the following initial instructions were displayed on the screen:
TYPING TEST: The following task assesses your typing
speed. The task involves three trials. Each trial involves typing as much of a paragraph of text as you can in one minute.
At the end of one minute the task will end. Your performance
will be the number of words that you accurately type in the
paragraph minus the number of words that you inaccurately
type. Try to type as quickly and accurately as you can.
On each trial, the following message was displayed at the bottom
of the screen: “YOU HAVE 60 SECONDS TO TYPE THE TEXT IN
GREEN IN THE BOX BELOW:”.
At the end of each trial, the screen changed to an all white background
with the following text: “Press [ENTER] when you are ready for the next
trial”
The typing test paragraphs were taken from articles on Wikipedia.
The paragraphs were selected, and subsequently cleaned, so that they
B.1. MATERIALS
contained only words, numbers, and common punctuation.
Paragraph 1: http://en.wikipedia.org/wiki/Psychology
Psychology describes and attempts to explain
consciousness, behaviour, and social interaction.
Empirical psychology is primarily devoted to
describing human experience and behaviour as it
actually occurs. Various schools of thought have
argued for a particular model to be used as a
guiding theory by which all, or the majority, of
human behaviour can be explained. The popularity
of these has waxed and waned over time. Some
psychologists may think of themselves as adherents
to a particular school of thought and reject the
others, although most consider each as an approach
to understanding the mind, and not necessarily
as mutually exclusive theories. Psychology
encompasses a vast domain, and includes many
different approaches to the study of mental
processes and behaviour.
Paragraph 2: http://en.wikipedia.org/wiki/Space
The concept of space has been of interest for
philosophers and scientists for much of human
history. The term is used somewhat differently
in different fields of study. Hence, it is
difficult to provide an uncontroversial and
clear definition outside of specific defined
contexts. Disagreement also exists on whether
space itself can be measured or is part of the
measuring system. Science considers space to
be a fundamental quantity. Thus an operational
definition is used in which the procedure of
measurement of space intervals and the units of
measurement are defined. The way in which space
is perceived is an area which psychologists first
began to study in the middle of the 19th century,
and it is now thought by those concerned with such
studies to be a distinct branch within psychology.
Paragraph 3: http://en.wikipedia.org/wiki/Peru
249
250
APPENDIX B. STUDY 2: ADDITIONAL MATERIALS
Peru is a representative democratic republic
divided into 25 regions. Its geography varies
from the arid plains of the Pacific coast to the
peaks of the Andes mountains and the tropical
forests of the Amazon Basin. Its main economic
activities include agriculture, fishing, mining,
and manufacturing of products such as textiles.
The main spoken language is Spanish, although
a significant number of Peruvians speak native
languages. This mixture of cultural traditions
has resulted in a wide diversity of expressions
in fields such as art, cuisine, literature, and
music. Peruvian economic policy has varied widely
over the past decades. Recent economic growth
has been fuelled by macroeconomic stability,
improved terms of trade, and rising investment
and consumption. Peru’s main exports are copper,
gold, zinc, textiles, and fish meal.
B.2. TEXT EDITING STANDARDISATION
B.2
251
Text Editing Standardisation
The standardisation of task completion time on the text editing task can
be expressed mathematically as follows.
zij =
yij − ȳj
sd(yj )
where yij is the raw task completion time for the ith observation for the
jth trialcode, ȳj is sample mean raw task completion time for the jth
trial code, sd(yj ) is the sample standard deviation for the jth trial code,
and thus, zij is a z-score of raw task completion time within trialcodes.
Then, adjusted task completion time, ŷij , was calculated as
ŷij = ȳ.. + sd(yij )zij
where ȳ.. is the grand mean of all observations, and sd(yij ) is the standard
deviation of all observations
B.3
Text Editing Task Instructions
1. Motivating Introduction: The following initial instructions were given
in an attempt to motivate participants to apply effort to the training
task.
I am making the assumption that most of you perform text
editing using the mouse. While the mouse can be used to
do text editing, this training will focus on performing text
editing exclusively with the keyboard. Keyboard-based text
editing is typically more efficient than using the mouse once
you get familiar using the keys. In particular moving from
typing to keyboard based text editing is quicker than moving
to the mouse.
252
APPENDIX B. STUDY 2: ADDITIONAL MATERIALS
We all spend a lot of time editing documents, writing emails,
and entering text using a keyboard. While some of you may
be familiar with some of the keys I will discuss, it is unlikely
that you are familiar with them all. Text editing is the kind
of task that can save you a minute a day, an hour a month,
and a day a year. You may gain a whole month of your
life in additional productivity, if you become efficient at text
editing. Also, by learning how to efficiently edit text, you can
focus more on the process of content creation.
2. Overview of Hand Positioning: The instructor then explained
One of the keys to efficient text editing is proper hand positioning. Just as touch typing involves positioning the hands
and fingers on the home keys, ASDF for the left hand and
JKL; on the right hand, there is also an efficient positioning
for hands and fingers for keyboard based text editing.
The instructor then held up the keyboard to the group of participants
with the key faced towards them so they could hear and see where they
should place their hands when performing text editing. Participants were
then told that
You should place your left little finger on the Control key
and your left ring finger on shift which will be used to for
modifying and selecting text; Place you left middle finger
over the Z-key, which can be used to undo text and your left
index finger can be used to press, X, C and V for cut, copy
and paste, respectively. We will go over each of these keys
later. For now, just focus on understanding where to place
your fingers.
The instructor then held up the index, middle and ring fingers of the
right hand and said,
B.3. TEXT EDITING TASK INSTRUCTIONS
253
Now, place your right hand index, middle and ring fingers on
the cursor keys. Thus, your index finger is on the Left arrow,
your middle finger is on the Up or Down arrow, and your ring
finger is on the Right arrow. You use your left index finger
to press Left and Backspace, and Delete. You use your
middle finger to press Up, Down, Home and End. You use your
ring finger to press Right and you can also press Page Up
and Page Down, although we wont be using the last two keys
much in the training.
3. Follow-Along Demonstration of Text Editing Keys: Participants
were then asked to open up a passage of text in Microsoft Word
TM
. It
was the same passage of text as was used in the main Text Editing Task.
Participants were told,
The idea is for you to practice the keys after I read them out so
that you can see what each key does. To start with I assume
you are all familiar with the idea that left-right moves the
cursor one character to the left and one character to the right
and that Up and Down moves the cursor up and down lines.
Home and End keys move the cursor to the beginning and
end of the line. Control Left and right moves the cursor
forward and back whole words at time. This is particularly
useful as often we want to edit whole words. Control up
and control down moves the cursor to the beginning of the
previous or next paragraph. Control Home and Control End
takes you the start and end of the document. Often we want
to edit text. To do this we typically first have select the text.
The main key for selecting text is the Shift key. If we hold
down the Shift key and move the cursor keys left and right
we can select characters. If we press Shift Up and Down we
254
APPENDIX B. STUDY 2: ADDITIONAL MATERIALS
can select lines of text. We can use the Control Shift and
Left or Right to select words. Once we’ve selected text we
can use Control X to cut and Control V to paste or Control
C to copy and Control V to paste. If we wish to undo a
change we’ve just made, we can press Control Z. I assume
that you are familiar with Delete and backspace. Delete
removes characters to the right of the cursor and backspace
removes characters to the left of the cursor. In addition,
Control Delete and Control Backspace deletes the word
in front or behind the cursor.
In between each of these sentences participants were given a few seconds to practice the key combinations that had just been presented.
They were also shown the effect of the key combinations on text using
an overhead projector.
4. Task Instructions: Participants were then told:
In the next period you will have the opportunity to practice
these short cut keys and acquire the skill of advanced text
editing. The task involves a series of trials, which requires
you to make a single change to a passage of text. Each trial
will end once you have correctly made the change. Each trial
automatically times out, if you are unable to complete the
trial in 40 seconds. In order to proceed, you need to make the
change exactly. In particular, when editing text, make sure
that you leave only a single space between words. Leaving
no space or leaving two spaces will not constitute a successful
edit.
Participants were then instructed to click on a link on their computer
to start the task.
B.4. STUDY 2 DATA CLEANING
B.4
255
Study 2 Data Cleaning
The aim of the initial data cleaning was to exclude trials that provided
invalid data and exclude participants that did not provide sufficient valid
trial data. The guiding principle was that a trial should be excluded if it
did not yield valid information about the speed with which a person could
complete the task. The first trial was removed because participants were
typically still getting acquainted with the task interface. The final trial
was removed because of interference caused by ending the experiment.
A small number of trials were labelled broken because problems arose in
determining accuracy.
In contrast to the previously mentioned types of invalid trials the
following two types were related to participant behaviour. First, delayed
trials were defined as those where no key was pressed for a period greater
than six seconds. This was typically caused by participants taking a
break or engaging in other off-task behaviour. Second, incomplete trials
were those that timed out after 40 seconds. Probable causes included
a lack of skill, making an irrecoverable error, distraction, and taking a
break.
This second set of invalid trials were used to allocate penalty points.
Each trial that fell into this second category was given a penalty point.
A participant’s penalty score was their percentage of trials with penalty
points. After examining the distribution of penalty points and the associated data with various cut-offs, cases were retained with penalty points
on fewer than 7.5% of trials. A small number of cases were also excluded
because they had less than 27 minutes of on-task performance time.
Finally, an examination of trial types showed that cut-and-paste trials were taking around twice as long to complete as the other three trial
types. Even with standardisation such trials would introduce substantial noise into analyses. For this reason, all cut-and-paste trials were
256
APPENDIX B. STUDY 2: ADDITIONAL MATERIALS
excluded. Note that in all the above exclusion cases except the first trial,
the excluded trial still counted towards the trial count for modelling and
graphing purposes.
Appendix C
Study 3: Additional Materials
C.1
C.1.1
Materials
Prior Experience Questionnaire
The questions and response options for the Prior Experience Questionnaire are shown below. The Prior Experience Weighting is shown in
brackets for each response option.
1. What is the computer Operating System that you use most often?
1. Apple Mac (0)
2. Windows (3)
3. Linux (5)
2. When typing do you look at the keys?
1. Always (1)
2. Usually (2)
3. Sometimes (3)
4. Almost never (4)
5. Never (5)
257
258
APPENDIX C. STUDY 3: ADDITIONAL MATERIALS
3. How often do you use the SHIFT key
to select text on the computer?
1. Never (0)
2. Almost never (1)
3. Sometimes (2)
4. Usually (4)
5. Always (6)
4. What is the status of your vision?
1. Severe visual impairment (0)
2. Mild visual impairment (2)
3. Normal or corrected to normal (3)
5. Do you have normal motor control in your hands?
1. Mild deficit of motor control in hands (0)
2. Yes, normal motor control (2)
6. When editing text how often do you use keys such as
control+c, x, and v for cut, copy, and paste?
1. Never (0)
2. Almost Never (1)
3. Sometimes (2)
4. Usually (3)
5. Always (4)
7. How comfortable do you think you would be in
using only the keyboard to edit text?
1. Very uncomfortable (0)
2. Somewhat uncomfortable (1)
3. Neither comfortable nor uncomfortable (2)
C.1. MATERIALS
4. Comfortable (3)
5. Very comfortable (4)
8. When you are editing text such as selecting,
copying, pasting, and so on,
what is the main device that you use?
1. Mouse (0)
2. Mouse and keyboard (2)
3. Keyboard (4)
9. How often do you play computer games?
1. Never (0)
2. Almost never (1)
3. Sometimes (2)
4. Often (3)
5. Very often (4)
10. How often do you play computer games where
you use a standard keyboard to play the game?
1. Never (0)
2. Almost never (1)
3. Sometimes (2)
4. Often (3)
5. Very Often (4)
11. Look down at the keyboard in front of you,
does it have the same key layout
as the keyboard you typically use?
1. No, it is different (0)
2. It looks similar (1)
259
260
APPENDIX C. STUDY 3: ADDITIONAL MATERIALS
3. It’s exactly the same (2)
12. Main computer
1. Laptop (0)
2. Desktop (1)
13. Would you describe yourself as a power-user of computers?
1. Not at all (0)
2. Probably Not (1)
3. Possibly (2)
4. Yes, definitely (3)
14. How often do you do computer programming?
1. Never (0)
2. Almost Never (1)
3. Sometimes (2)
4. Often (3)
5. Very Often (4)
The following provides a brief rationale for each item in the Prior
Experience Questionnaire.
1. The text editing task was conducted on a Microsoft Windows
operating system and the text editing keys were standard for Microsoft
Windows operating system. Thus, participants with more experience
using Microsoft Windows, should be better at the task. While Linux
users might be used to different text editing keys, they also tend to be
advanced computer users.
2. Touch-typing is generally associated with faster typing. Faster
typing leads to faster text editing. In addition, the Text Editing Task
displays the required changes on the screen. Thus, if participants do
C.1. MATERIALS
261
not need to move their eyes between the screen and the keyboard, they
should perform better.
3. Using the Shift key for text editing on Microsoft Windows is
indicative of a more serious adoption of the keyboard as a means of text
editing. This can be contrasted with a participant who would usually
use the mouse to select text and may or may not combine a few shortcut
keys such as Ctrl+X, Ctrl+C, and Ctrl+V.
4. Lack of vision impairment should make deciphering text on the
screen easier.
5. Text editing is a psychomotor task. In particular it requires rather
complex coordination of movement of the fingers.
6. Ctrl+X, Ctrl+C, and Ctrl+V are some of the most commonly use
shortcut keys. A participant who instead uses the mouse to activate
menus to perform these tasks is likely to be less familiar with shortcut
keys in general.
7. Self-reported comfort with using just the keyboard to edit text
should be associated with skill in doing so.
8. Participants who normally uses just the keyboard to edit text, are
likely to be better at using the keyboard.
9. Playing computer games may be indicative of greater exposure to
computers. The nature of the experiment is also such that it resembles a
computer game in some respects. There is a goal, a computer interface,
feedback of results, and the aim is to complete the task as quickly as
possible. Knowledge of these conditions, and skill in performing well in
such situations may be related to playing computer games.
10. Computer games vary in the degree to which they involve use of
the keyboard. Many PC games involve using the keyboard and often have
a rather large set of shortcut keys for performing actions. Experience and
skill with such games may improve text editing skills. In addition, such
experience is likely to be correlated with more experience with computers
262
APPENDIX C. STUDY 3: ADDITIONAL MATERIALS
in general.
11. Keyboards vary in design. In particular, laptop keyboards typically have a different layout for navigational keys such as Home, End,
Left, Right than do standard PC keyboards. If the keyboard that a
participant typically used is the same as the one in the experiment, this
may provide greater transfer of prior skill. Note: This item was not
associated with text editing performance. It may be that knowing the
difference between two keyboards is indicative of greater familiarity with
computers.
12. Because laptops typically have a different arrangement of navigational keys, and because the experiment used standard PC keyboards, it
was expected that the task would be easier for participants who did not
typically use a laptop.
13. Participants who see themselves as advanced computer users are
likely to be more experienced at text editing. Note: during the administration of the experiment, two or three participants, probably with
English as a second language, asked ”What is a power-user?”. Thus,
in future it may be better to reword ”power” to ”advanced”. However,
another perspective is that someone who does not know what a ”poweruser” is, is not a power-user.
14. Computer programming is a task performed by people who are
advanced computer users. It implies that the participant has many years
of experience using computers. Computer programming also encourages
the development of advanced text editing skills because of the way that
it creates a need to write and edit code efficiently.
C.1.2
IPIP Big 5 Personality Measure
Big 5 personality was measured using a 50 item version of the IPIP personality test (L. R. Goldberg et al., 2006). Items are shown in Table C.1.
C.1. MATERIALS
263
The Reverse column indicates whether the item needed to be reversed
(i.e., 6−x, where x is the score on the item) before calculating total scores.
The scale aimed to measure the Big 5 personality factors: Extraversion
(EX), Conscientiousness (C), Emotional Stability (ES), Openness (O),
and Agreeableness (A).
The initial instruction page had the following text:
On the following pages, there are phrases describing people’s
behaviours. Please use the rating scale below to describe how
accurately each statement describes you. Describe yourself as
you generally are now, not as you wish to be in the future.
Describe yourself as you honestly see yourself, in relation to
other people you know of the same sex as you are, and roughly
your same age. So that you can describe yourself in an honest
manner, your responses will be kept in absolute confidence.
Please read each statement carefully, and then circle the response most appropriate.
Each item was displayed on a separate page with the following text
displayed on the top of the screen: ”How accurately does this statement
describes you?”
The response options were displayed as:
1=Very Inaccurate
2=Moderately Inaccurate
3=Neither Inaccurate nor Accurate
4=Moderately Accurate
5=Very Accurate
264
APPENDIX C. STUDY 3: ADDITIONAL MATERIALS
Table C.1: Study 3 Items Used in IPIP Big 5 Personality
Measure
Item Text
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
I Am the life of the party
I Feel little concern for others
I Am always prepared
I Get stressed out easily
I Have a rich vocabulary
I Don’t talk a lot
I Am interested in people
I Leave my belongings around
I Am relaxed most of the time
I Have difficulty understanding abstract ideas
I Feel comfortable around people
I Insult people
I Pay attention to details
I Worry about things
I Have a vivid imagination
I Keep in the background
I Sympathize with others feelings
I Make a mess of things
I Seldom feel blue
I Am not interested in abstract ideas
I Start conversations
I Am not interested in other peoples problems
I Get chores done right away
I Am easily disturbed
I Have excellent ideas
I Have little to say
I Have a soft heart
I Often forget to put things back in their proper
place
I Get upset easily
I Do not have a good imagination
I Talk to a lot of different people at parties
I Am not really interested in others
I Like order
I Change my mood a lot
I Am quick to understand things
I Don’t like to draw attention to myself
I Take time out for others
I Shirk my duties
Table continues on next page
Reverse
Factor
1
-1
1
-1
1
-1
1
-1
1
-1
1
-1
1
-1
1
-1
1
-1
1
-1
1
-1
1
-1
1
-1
1
-1
EX
A
C
ES
O
EX
A
C
ES
O
EX
A
C
ES
O
EX
A
C
ES
O
EX
A
C
ES
O
EX
A
C
-1
-1
1
-1
1
-1
1
-1
1
-1
ES
O
EX
A
C
ES
O
EX
A
C
C.2. TEXT EDITING TASK
Item Text
39
40
41
42
43
44
45
46
47
48
49
50
I
I
I
I
I
I
I
I
I
I
I
I
C.2
C.2.1
265
Reverse
Have frequent mood swings
Use difficult words
Don’t mind being the center of attention
Feel others emotions
Follow a schedule
Get irritated easily
Spend time reflecting on things
Am quiet around strangers
Make people feel at ease
Am exacting in my work
Often feel blue
Am full of ideas
-1
1
1
1
1
-1
1
-1
1
1
-1
1
Factor
ES
O
EX
A
C
ES
O
EX
A
C
ES
O
Text Editing Task
Text Editing Task Initial Instructions
All instructions for the text editing task were administered via computer.
Participants were first required to read a series of instruction pages. The
participant could only navigate to the next instruction page after a minimum amount of time had passed. This minimum time was set as the
time it took to read aloud the passage at a moderate rate. This was done
to maximise the chance that participants read the instructions carefully.
The instructions are shown below:
“The following screens present important task instructions. Please
read these instructions slowly and carefully. Failure to understand these
instructions may make your results invalid. To ensure careful reading it
is not possible to go to the next instruction screen until the expected
reading time has passed.
You are about to complete an experimental task that lets you practice Windows keyboard-based text editing. Most people who perform
266
APPENDIX C. STUDY 3: ADDITIONAL MATERIALS
text editing use the mouse. Once the skill is acquired keyboard-based
text editing is more efficient than mouse-based text editing. There are
several reasons for this including: 1) it is quicker to move from typing to
keyboard-based editing than it is to move from typing to the mouse; 2)
keyboard-based text editing is more precise; 3) pressing a key to edit is
quicker than accessing menus, toolbars, and other mouse based editing
options. Given the amount of time people spend using computers becoming skilled at text editing can save you substantial time. Such a skill
can also help you focus on the broader goals of editing your text, such
as improving your writing.
The task uses only the keyboard to edit text. The mouse will not
work, and it should not be used. Please place the mouse behind your
computer now.
The experiment is made up of 30 blocks. Each block goes for 80
seconds. Each block contains multiple trials. Each trial requires you
to delete a passage of text marked in red as quickly as possible. Once
you have correctly made the deletion the trial automatically ends. Your
performance on each block is based on the number of trials you can
accurately complete in 80 seconds. If you stop performing the task for
more than 5 seconds, the Trial Timer will pause and will restart when
you recommence editing. If you have not successfully completed the edit
after 30 seconds of elapsed trial time, the trial will end. If you make a
mistake during a trial, you can press CONTROL+Z to undo the change.
This study aims to provide meaningful data for the Data Analysis
Task and class discussion. It may also be used for publication purposes to
advance knowledge in the area of skill acquisition. Thus, it is important
that a certain standard of experimental control exists during the exercise.
Specifically:
• Do not talk to other students until the task is complete
C.2. TEXT EDITING TASK
267
• Do not instruct other students on how to complete the task
• Do not look at how other students are completing the task
• Do not use your computer for other purposes such as the internet
• Turn off or put on silent your mobile phone
• If you have a question about the experiment, raise your hand and
your tutor will come round
• If you need to have a break during the task, feel free to stretch or
go for a short walk, but do not talk to other students
• If you do not want or are unable to continue to do the task, feel
free to leave the room and come back at 1 hour and 10 minutes
after class commenced (e.g., if your class started at 11:00AM come
back at 12:10PM)
Before the task commences you are going to be given some instructions about keyboard-based text editing. Before being introduced to each
of the keys, you will be introduced to proper hand positioning.
• Place your left little-finger on the LEFT CONTROL KEY and your
left ring-finger on the LEFT SHIFT KEY
• Place your left middle finger over the Z-key (undo) and your left
index finger can be used to press X (cut), C (copy), and V (paste)
• Place your right hand index, middle and ring fingers on the arrow
keys.
• Use your right index finger to press LEFT, BACKSPACE, and
DELETE.
• Use your right middle finger to press UP, DOWN, HOME, and
END.
268
APPENDIX C. STUDY 3: ADDITIONAL MATERIALS
• Use your right ring finger to press RIGHT
In the next section of the instructions you will be introduced to each
of the text editing keys. Each instruction screen sets out a couple of
text editing keys for you to briefly practice. On each screen try to get
a sense of what each key presented does. Look at the effect of the key
press on the text editor. Try to use the recommended hand position
mentioned previously. Once these key instructions are over the main task
will commence. You can proceed to the next screen of key instructions
once 12 seconds has passed and you have pressed EACH key introduced
FOUR times.”
C.2.2
Additional Controls
If a participant did not press a key for five seconds, trial time was paused
and the following message was displayed: “No user activity: timer has
stopped. If you stopped because you made an error, hold down Control+Z to Undo.” This message was incorporated into the text editing
software for several reasons: (a) Five seconds was long enough to reflect
actual task withdrawal behaviour as opposed to natural delays between
key presses. (b) The mechanism discouraged off-task behaviour, because
any off-task behaviour added to the total time to complete the study.
(c) Prevalence of off-task behaviour was not an aspect of performance
that this study aimed to model. Thus, trial pauses were used to minimise the influence of distractions, breaks, and other off-task behaviour
on performance measurement. (d) Participants who made an error from
which they could not recover often triggered the pause. The error message communicated how to recover from such an error using the Undo
functionality.
To prevent participants making the mistake of leaving two spaces
where there should be one, the following message was displayed: “You
C.2. TEXT EDITING TASK
269
need to remove additional spaces from your text”.
C.2.3
Training Condition Instructions
The following instructions were displayed after block 15 to participants
in the Training Condition.
You are now going to receive some additional training
in keyboard-based text editing. You will first receive some
general theory. Then, you will have some practice. To ensure
careful reading it is not possible to go to the next instruction
screen until the expected reading time has passed.
Text editing involves three core elements: movement, selection, and manipulation. Speed of text editing can be improved by reducing the time between key presses or by reducing the required number of key presses. Reducing key presses
often requires some thought and can involve using more sophisticated key combinations. However, with practice key
selection becomes automatic and pressing the keys becomes
a smooth action.
Movement involves pressing keys to move the cursor from
the current location to the target location. In general, the
task can often be broken down into steps: 1) get cursor
to paragraph [use control+down/up; or control+home/end];
2) get cursor to line [use up/down]; 3) get cursor to word
[use control+left/right; or home/end].
Sometimes it is
quicker to go passed the target location and then come back
(e.g.,control+end then control+left).
Deletion can be done by simply pressing delete or
backspace.
Faster options include control+delete, con-
trol+backspace, and select and delete. The select and delete
270
APPENDIX C. STUDY 3: ADDITIONAL MATERIALS
strategy involves selecting the text and then pressing delete.
This is a desirable deletion strategy as the number of words to
be deleted increases. It involves four steps. 1) Get to one end
of the text to be deleted; 2) hold down shift; 3) move cursor
to other end of text; 4) press delete. Steps 1 and 3 apply all
the principles of movement. Efficient movement depends on
minimising key presses.
On the following screens you will do 6 Training Trials
similar to the main task. Your goal is NOT speed. Your goal
is to minimise key presses by applying the theory presented
on the previous screens. Each Training Trial is made up of
three Sub-Trials:
Sub-trial A) This is your first attempt to minimise key
presses
Sub-trial B) You are presented with a sequence of keys
which will minimise key presses. Your task is to follow them
exactly.
Sub-trial C) The aim is to use the keys set out in Sub-trial
B without assistance.
To communicate the sequence of keys that you should
press, key sequences like this will be presented:
Down, Down, End, Control(Left, Left), Shift(Down, Control(Left, Left, Left, Left)), Delete
The order of the words indicates the sequence that you
should press the keys. Brackets are used to indicate that
you should continue to hold down the key preceding the
bracket. E.g., ‘Control(Left, Left)’ indicates that you should
hold down control while you press Left twice. If you have followed the sequence, the text should be successfully deleted.
As you press the key sequence make sure to also observe
C.2. TEXT EDITING TASK
the effect that the sequence is having on the text editor.
271
272
APPENDIX C. STUDY 3: ADDITIONAL MATERIALS
C.2.4
Text Editing Task Problems
Table C.2 shows the database of 500 text editing task problems used to
create the text editing trials in Study 3. The Words column indicates the
number of words. The Point column indicates the number of characters
into the text editing original text that the Deletion Text was inserted.
Table C.2: Study 3 Text Editing Task Items
ID
Words
Point
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
7
13
19
25
31
37
43
49
55
61
67
73
79
85
91
97
103
109
115
121
127
133
139
145
151
157
163
169
175
Deletion Text
boy
school
must
covered
song
city
such
a
now
few
known
paper
wood
night
my
color
really
kind
would
study
air
car
animal
plan
thousands
my
oh
example
something
too
Table continues on next page
C.2. TEXT EDITING TASK
ID
Words
Point
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
181
187
193
199
205
211
217
223
229
235
241
247
253
259
265
271
277
283
289
295
1
7
13
19
25
31
37
43
49
55
61
67
73
79
85
91
97
103
109
115
121
127
Deletion Text
oil
machine
war
stand
only
once
vowel
large
money
notice
pulled
street
street
horse
water
travel
country
heavy
their
ever
am fall
would once
men father
life without
products music
surface remember
five become
upon surface
animal she
room told
life king
went he
try became
change don’t
time was
when our
also until
than heavy
side check
body round
learn gave
seen numeral
Table continues on next page
273
274
APPENDIX C. STUDY 3: ADDITIONAL MATERIALS
ID
Words
Point
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
3
3
3
3
3
3
3
3
3
3
3
3
3
3
133
139
145
151
157
163
169
175
181
187
193
199
205
211
217
223
229
235
241
247
253
259
265
271
277
283
289
295
1
7
13
19
25
31
37
43
49
55
61
67
73
79
Deletion Text
questions important
animal am
be eat
year took
space understand
line heavy
play number
space up
those hear
run one
an these
we add
during building
write use
sea always
even grow
below ask
write him
both said
hear should
area after
game show
go began
hours space
war young
all special
follow music
their set
find no think
try step find
learn become not
morning reached play
light start off
light birds three
wind map carry
give didn’t read
about him black
world should measure
know too object
letter fall less
they way another
call school near
Table continues on next page
C.2. TEXT EDITING TASK
ID
Words
Point
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
4
4
4
4
4
4
85
91
97
103
109
115
121
127
133
139
145
151
157
163
169
175
181
187
193
199
205
211
217
223
229
235
241
247
253
259
265
271
277
283
289
295
1
7
13
19
25
31
Deletion Text
however common soon
together hours voice
large travel wait
got though city
got had no
just second reached
read surface cut
sing our music
house line covered
there friends America
inches many thing
wood reached saw
how ship noun
great noun problem
learn went which
voice bring mile
idea play up
step saw take
learn scientists hours
island my made
an rest grow
world car other
father an use
after how soon
long several fast
until didn’t shape
did several less
put with before
deep verb time
full ten eye
language whole family
field week object
cannot write different
circle base into
north plane take
America less mile
ask eye year thousands
her five a small
whole another came didn’t
fly remember step knew
early every got this
three vowel land unit
Table continues on next page
275
276
APPENDIX C. STUDY 3: ADDITIONAL MATERIALS
ID
Words
Point
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
37
43
49
55
61
67
73
79
85
91
97
103
109
115
121
127
133
139
145
151
157
163
169
175
181
187
193
199
205
211
217
223
229
235
241
247
253
259
265
271
277
283
Deletion Text
among of inches life
deep carry live understand
he write leave light
start work left stop
than himself family than
horse old into ball
king complete dry may
understand knew several wind
and another but been
not fish leave hand
filled must space stood
even men word gave
hand show them often
whole friends almost base
door got different went
today since gave yes
stand found vowel grow
men piece may the
it’s white side any
cut both come way
and horse don’t system
since white important green
we own another animal
any as measure along
war all cannot land
once life night four
cried system little her
point come room car
small follow three include
second deep waves it
pattern find front such
minutes follow shape there
material filled four room
equation move kind sing
four check after fire
over dry read your
away size plan run
farm eye well ran
near family are more
children him man me
money be morning became
saw keep air scientists
Table continues on next page
C.2. TEXT EDITING TASK
ID
Words
Point
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
4
4
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
289
295
1
7
13
19
25
31
37
43
49
55
61
67
73
79
85
91
97
103
109
115
121
127
133
139
145
151
157
163
169
175
181
187
193
199
205
211
217
223
229
235
Deletion Text
large by name my
they mean write cold
color feel white give covered
color complete use special too
many fire more told hand
mean almost language place read
song than less minutes your
over near mother down song
wheels from state how father
hand own but around mile
decided base up notice one
finally thing dog became father
body measure rest yet soon
first us explain all river
building vowel large money travel
work shown farm look covered
between green often about country
together sing spell write able
object for any you ago
example king tell fast of
made low make problem two
once remember best street understand
happened slowly six man eat
come gave stood products through
its found body material deep
white even wheels inches hot
of six of plan children
not pulled side never family
family quickly time left boat
warm not more north give
later think great country remember
explain am plan understand most
had will mile home stay
would animal begin would that
dry talk head hard be
plant always carefully animal found
material no most government upon
the may young stay usually
less five different upon left
these wait horse around turn
if away because night fire
check whole until something inches
Table continues on next page
277
278
APPENDIX C. STUDY 3: ADDITIONAL MATERIALS
ID
Words
Point
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
5
5
5
5
5
5
5
5
5
5
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
241
247
253
259
265
271
277
283
289
295
1
7
13
19
25
31
37
43
49
55
61
67
73
79
85
91
97
103
109
115
121
127
133
139
145
151
157
163
169
175
181
187
Deletion Text
live story piece by night
paper well home word both
strong go ground people any
story power grow only come
halt use would farm wait
until strong mark six English
your language hard small hot
answer hear though some while
class see ball second plan
during inside letter left page
surface sing no animal seen year
travel time still upon base room
face state building paper only head
those wheels side food too special
same her does fish big full
sure became one I’ll close base
people back kind call true a
new with far book young less
song sound say she only another
did filled oh friends its as
in fact too against war this
verb but inside home song land
day bring at birds plane system
space passed king quickly that class
its yet example which old show
should sometimes learn other clear ocean
today color give its new then
step four own not waves have
circle contain music pair hours rest
letter thing boat money room both
stars away are being much way
walk hold plane space number then
new did give three early toward
above decided her oil began pulled
waves inside inches river animal passed
gave part inside your so ball
became group may list girl contain
came keep number vowel read sea
kind ball products sea there add
grow sing oil old seen minutes
eat has object stars above when
map inches all were behind just
Table continues on next page
C.2. TEXT EDITING TASK
ID
Words
Point
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
193
199
205
211
217
223
229
235
241
247
253
259
265
271
277
283
289
295
1
7
13
19
25
31
37
43
49
55
61
67
73
79
85
91
97
103
109
115
121
127
133
139
279
Deletion Text
look ocean made look land listen
any town among during vowel ago
red me word day example back
known other old south machine and
figure came though sure listen food
children circle low a warm above
inside white year nothing door cannot
once write animal us father include
music heard verb together fact face
each name often they think people
must him explain through work around
mile ago car her water yes
plant covered explain important do building
boat thousands building last hundred they
another when king plane follow thing
answer watch is their letter stand
class try sound behind below room
first play special note America young
set time high top fish less inside
though form deep certain travel even English
man been follow number make made our
best ball become need fine person cut
done among week week little carry side
note always cut game war answer hand
whole these vowel got house mark there
say space great men room she around
way strong were it here friends size
order people river happened put pulled list
he himself another all took mountain oh
near full became great off knew oil
and power country it’s halt any began
ago once her front learn black close
reached many passed red understand other listen
today open have mile wait back mean
old study should machine been hundred money
across that other king have explain problem
told finally house early sun boy listen
told behind might state live ball reached
not same white minutes include day grow
questions miss fall made one write quickly
light time life form paper war follow
earth stand until week mark found correct
Table continues on next page
280
APPENDIX C. STUDY 3: ADDITIONAL MATERIALS
ID
Words
Point
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
145
151
157
163
169
175
181
187
193
199
205
211
217
223
229
235
241
247
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
7
7
7
7
7
7
7
7
8
8
8
8
8
8
8
8
8
8
8
253
259
265
271
277
283
289
295
1
7
13
19
25
31
37
43
49
55
61
362
363
364
8
8
8
67
73
79
Deletion Text
told week hear common top back read
questions mile put there less I’ll part
him order not whole figure saw farm
circle him by I’ll school air circle
again we might at look example learn
south high cut mountain look family is
do halt food group course morning spell
second voice covered ocean before never side
around song became important home tree an
boy family cold pulled circle south stand
plant air too understand full front game
yes scientists small ground cried vowel want
dry thought fly heavy don’t wood just
think mean whole fish plane sentence both
last note rock above could place slowly
off can stop last any I’ll feet
produce vowel sound write book next five
large few children sometimes mountain scientists
white
island able like out language any ground
sea cannot complete high us game verb
system toward heard been however city class
or vowel such heavy plant I north
begin hold explain do reached has him
cannot miss why his several idea course
correct inches down line was feet once
king both its go don’t bring went
heard has word fact correct until run check
number your king air hand black use every
verb deep true form leave come took better
watch always animal surface what look too base
note building being must below top today easy
fall seem road plan I miss measure over
plane river but order me mountain down find
she sound toward the special me start now
through north two where oh go knew road
men inside three eye color did where watch
hand problem them such carefully should among
time
known of answer river it’s way the face
has let oil all friends products do after
big spell way mark little here told around
Table continues on next page
C.2. TEXT EDITING TASK
ID
Words
365
366
367
368
369
370
371
372
373
374
375
376
377
8
8
8
8
8
8
8
8
8
8
8
8
8
378
379
380
381
382
8
8
8
8
8
383
384
385
386
387
388
8
8
8
8
8
8
389
390
391
392
8
8
8
8
393
394
395
396
397
398
399
400
401
8
8
8
8
8
8
8
8
9
Point
281
Deletion Text
85 did country group an plant inches became six
91 man carefully long with sing low birds look
97 might the always very long among less unit
103 body life south saw eye heavy stop came
109 full green front him true inches language check
115 also way their eye farm set table hear
121 south town little ever form sun notice first
127 line below fine if didn’t ball there during
133 often set against pattern every ball down numeral
139 round against keep measure if sound which mile
145 night equation these said enough built girl next
151 plan really ran other many any come farm
157 understand let water light well person people scientists
163 air rule far much best cold house I’ll
169 something base town contain farm some two course
175 night his such knew he young equation material
181 size five after study your own explain surface
187 stand together fine under language picture its
short
193 figure always check remember try man fast yet
199 though name were made sound has water right
205 front would call is rest pair among under
211 an food yes area of story near hold
217 stop were stood head heard in year part
223 easy close area products sure known however however
229 slowly children being great better miss night ran
235 size friends you ran turn oil there upon
241 your earth brought came might seen about well
247 almost known or thousands paper island plane English
253 toward during he piece space shape below is
259 car need plant fire week face power include
265 and always five saw never food way food
271 seem after before land move since problem are
277 today order she tell birds color almost play
283 began told too knew any true stood old
289 mark food car morning start able plane example
295 become his base across unit then wheels every
1 hot street example follow most turn live school
were
Table continues on next page
282
APPENDIX C. STUDY 3: ADDITIONAL MATERIALS
ID
Words
Point
402
403
404
405
406
407
408
409
410
411
9
9
9
9
9
9
9
9
9
9
7
13
19
25
31
37
43
49
55
61
412
413
414
415
9
9
9
9
67
73
79
85
416
417
9
9
91
97
418
419
420
421
422
9
9
9
9
9
103
109
115
121
127
423
424
9
9
133
139
425
9
145
426
427
428
9
9
9
151
157
163
429
9
169
430
9
175
431
432
433
434
9
9
9
9
181
187
193
199
Deletion Text
people into ten leave went the yet place work
true live below car because tree heavy cannot short
seen music answer town had ball before clear boy
yes man sentence start any rest whole side number
even men her thing as nothing list after sound
dog above mark look well both get there map
only seem food where mean and will these her
food oh area base seen while tell whole different
said he our listen complete point measure one fall
both road the ball school minutes letter upon
ocean
both every covered right war done often city ship
ground do week base remember they those call his
sing keep well heat reached cold list course into
noun king own being young then seem however
without
fall an change I me follow easy him may
paper across certain began problem long low stars
follow
our English white there were song pair has ocean
pulled my since those later week may live plant
oh top really need people run have find full
only space am around want walk door that minutes
well began how happened known street back building rest
until water voice if mean figure stand use nothing
became south should there open leave verb story
add
made carefully find ask because heard just run
road
much letter easy back yes see your known wait
three room south dog door before time town during
slowly keep once sometimes didn’t next last wait
several
him bring toward special being heavy less hear picture
become covered keep happened yes start such
check world
first miss set area front special special building ago
I’ll their deep didn’t our almost same tell watch
six slowly boat town wait near the king sun
told street call special an name step point near
Table continues on next page
C.2. TEXT EDITING TASK
ID
Words
Point
435
436
437
438
9
9
9
9
205
211
217
223
439
440
441
9
9
9
229
235
241
442
443
444
445
9
9
9
9
247
253
259
265
446
447
9
9
271
277
448
9
283
449
450
451
9
9
10
289
295
1
452
10
7
453
454
10
10
13
19
455
10
25
456
457
458
10
10
10
31
37
43
459
460
10
10
49
55
461
462
10
10
61
67
463
10
73
283
Deletion Text
halt building big rock easy hear verb waves been
if away here told what mountain sure what first
pair if night plane back land close but school
didn’t building close certain tell measure never told
the
must away when friends land on our ship plane
go against out hear idea another strong ocean fire
around book cold through door family usually by
only
your than book family group page went don’t mark
food war that sometimes ten from up sing problem
box plan family live must its other this material
story hard across clear important inches voice map
north
those top cannot food sun three across how also
began end equation numeral being note island dog
boy
table waves each usually word did close ocean
change
learn then heard fish a heat five music correct
warm they day front really write ground us or
grow form circle put tree about food English better
I’ll
those so you give short this back equation toward
had
street circle hold young will say noun front but way
look check I six problem machine write wind birds
passed
both air example products up when have me street
story
let stay quickly miss course line yes size class small
above or hard can so complete plant all one who
been father class contain yes around knew world
begin ship
line a only first clear nothing heat would also see
bring watch take wood knew their stars came there
fine
air very better hold told there wind mile hot whole
sure feel put some enough night want stood sure
listen
problem special city grow is questions to in next
yet
Table continues on next page
284
APPENDIX C. STUDY 3: ADDITIONAL MATERIALS
ID
Words
Point
464
10
79
465
10
85
466
10
91
467
10
97
468
10
103
469
10
109
470
10
115
471
10
121
472
10
127
473
10
133
474
10
139
475
10
145
476
10
151
477
10
157
478
10
163
479
10
169
480
10
175
481
10
181
482
10
187
483
10
193
484
485
10
10
199
205
Deletion Text
easy each table four little check feel inches stars
see
building just mile more farm heavy week ago very
mountain
our about mother side put island part order have
not
thousands like shape plant explain hundred then
add example walk
sure surface kind turn very turn ten ask deep notice
pair such door scientists week food street around
across against
live feel began often plan know power name change
mile
pair reached town also group place study war scientists long
fast contain word more talk dark able halt remember mean
government show just were cried men pair try
nothing idea
note may think south plan person today deep put
door
north there round head still carry how where over
made
language covered or and four boat across does try
earth
me remember five king that more cannot light was
travel
every take right from best short only travel waves
horse
first short during always light than field material
watch carry
list which many most such whole whole notice
word same
tree kind you few heard enough decided more field
heavy
part never complete use for big under mark very
wait
built yes well north knew head true the them number
will full room notice piece its he animal force went
fact form by tell front line life front help water
Table continues on next page
C.2. TEXT EDITING TASK
ID
Words
Point
486
10
211
487
10
217
488
10
223
489
10
229
490
10
235
491
10
241
492
10
247
493
10
253
494
10
259
495
10
265
496
497
498
10
10
10
271
277
283
499
10
289
500
10
295
285
Deletion Text
music though size easy mile and study early watch
head
just wind children right right me space notice object built
side far something equation five do too run happened mark
cannot something off listen fish said it follow yet
also
him dry strong pattern begin far stop sure heavy
through
cannot scientists Indian ship write those young
new material word
carry animal place sentence many begin cried are
questions would
birds children gave example cannot be many best
order something
room built plan wood hot any enough front easy
contain
America base he often because think sometimes
found explain yet
rest turn correct us feet in city fact it’s hand
children form fall plane year use work help like city
not boat need idea wait piece set list thousands
land
correct got done took start side short would need
explain
old numeral problem carefully figure all wind fire
kind note