Chapter 5: General discussion

Maaike Prangsma - Multimodal representations in collaborative history learning
5.
General discussion
We cannot – in the 21st century – imagine a school textbook not containing many
different types of illustrations. Modern schoolbooks are packed with pictures, tables,
graphs, and diagrams – in addition to texts. Visual or multimodal representations are
seen as being instrumental in encouraging deep learning. When explaining my research
to non-educationalists the most common response is: “Of course pictures make learning
easier and better!” However, this opinion was not always predominant, and pictures
were not always taken for granted as an ingredient of learning materials. The question,
which has been researched since the 1960s, is: Do these pictures, tables, graphs, et
cetera really do what we think they do and why? The main topic of this thesis is
multimodal representation used in learning. The aim was to determine whether using
multimodal representations in history learning tasks makes a difference for the learning
outcomes achieved and/or the learning process carried out. Using representations is
defined in this thesis as co-constructing representations in small groups.
This chapter is outlined as follows. First, the four research questions mentioned
in the introduction to this thesis are answered, followed by an answer to the main
research question. Next, methodological issues of the studies described in this thesis are
considered. Next, theoretical implications of the findings are discussed. Finally,
suggestions for future research are made.
5.1 Answering the research questions
What are the effects of the general type of co-constructed representation – textual,
multimodal, or integrated multimodal – on the acquisition of a chronologicalconceptual frame of reference?
Based on the literature on learning with multimodal representations (e.g., Ainsworth,
1999; Paivio, 1991; Mayer, 2001) the collaborative construction and completion of
multimodal representations should contribute more to history learning than the
collaborative completion and construction of textual representations. In addition,
multimodal representations may also facilitate the acquisition of a chronologicalconceptual frame of reference, which is important in history learning (e.g., Leinhardt,
Stainton, & Virji, 1994; Carretero, Jacott, Limón, López-Manjón, & León, 1994).
Consequently, integration of multimodal representations in a timeline should strengthen
such an effect. The individual learning results presented in Chapter 2 showed that the
collaborative completion and construction of multimodal representations did not
contribute more to learning than the collaborative completion and construction of
textual representations: Students who constructed and completed multimodal
representations did not outperform students who constructed textual representations,
although the means did point in the direction of the hypotheses. However, the results
also showed that working on multimodal representations integrated in a timeline did
lead to higher learning short-term outcomes than co-constructing textual
representations, although this significant difference disappeared in the long run. These
outcomes raised the next question, investigated in Chapter 3:
GENERAL DISCUSSION
63
Maaike Prangsma - Multimodal representations in collaborative history learning
What are the effects of the collaborative completion and construction of integrated
multimodal representations versus textual representations on the content of the student
dialogue?
Chapter 2 showed that there were some differences in learning outcomes between the
textual and timeline conditions. This raised the question – answered in Chapter 3 –
whether the content quality of the learning process might mediate this in the timeline
condition. Student dialogues were recorded during data collection for Study 1. The
utterances for Task 2 in the Text and Timeline conditions were coded as procedural,
social or content utterances. Content utterances were analysed further for level (core vs.
auxiliary) and concept use.
The results showed that there were some differences between the two conditions.
The multimodal timeline representation task elicited more utterances in general. This
was reflected in all main topics (procedural utterances, social utterances and content
utterances), although only the former two showed a significant difference. A closer
analysis of the content utterances showed that the multimodal timeline elicited more
auxiliary content utterances, i.e., it elicited more content utterances, but did this in less
direct terms; by referring to content without using full sentences with historical
concepts.
What is the effect of the type of multimodal representation – process diagram, network
chart, structure diagram, or cartogram – on the content of the student dialogue?
The preliminary exploratory analyses of pupil discourse discussed in Chapter 2
suggested substantial differences in dialogue quality between tasks within the timeline
condition, even though all four tasks were designed to elicit productive interaction. This
outcome was further explored in Chapter 3. Student dialogues were recorded during
data collection for Study 1. The utterances for all four tasks in the Timeline condition
were coded as procedural, social or content utterances. Content utterances were
analysed further for level (core vs. auxiliary), reference to visual elements, integration
of modes, and concept use.
The results showed that there were indeed some interesting differences between
the four multimodal representation task types. Task 1 – involving determining the
chronological order of pictures and writing the story they tell underneath the pictures –
elicited more utterances in general, more content utterances in particular, and more
references to visual elements on the task sheet as well as integration of visual elements
of the task sheet with information from the task text. In addition, this task elicited more
historical concepts and a wider range of the concepts available from the task was used.
Possible explanations might be found in differences between Task 1 and the other three
tasks. Task 2 – in the shape of a causal network – involved only selecting pictures, not
ordering them, and students were not asked to distinguish causes from effects. Task 3 –
a structure diagram – contained a single picture, so that discussion of selecting or
ordering pictures did not apply to this task, and questions focused mainly on the task
text. This task was perhaps too easy and the picture not really necessary to complete the
task successfully. Finally, the two cartograms in Task 4 did not contain very concrete
visual elements, and pupils seemed to have problems understanding the maps.
The references to visual elements on the task sheet as well as the number of
historical concepts used correlated positively with learning outcomes on the post-test,
indicating that it is worth encouraging learners to do both of these things.
64
CHAPTER 5
Maaike Prangsma - Multimodal representations in collaborative history learning
Further reflection on the studies described in Chapters 2 and 3 led to the idea that
perhaps one of the distinguishing factors between different types of multimodal
representations is the inclusion, exclusion or combination of abstract (schematic) and
concrete (depictive) representations. This led to the following question, investigated in
Chapter 4:
What are the effects of combining text and different types of visualisations – abstract
and concrete – in collaborative history tasks on learning historical phenomena and
concepts?
In a follow-up experiment that dealt with differences in learning results between
different types of multimodal representations – abstract, concrete or a combination of
abstract and concrete – the short term and long term learning outcomes for these three
multimodal conditions were compared to those for a textual condition. As learning
about a historical phenomenon requires understanding both abstract and concrete
aspects of this phenomenon, concepts, and relations, a combination of abstract and
concrete visualisations should have the greatest potential for enhancing learning.
Although working on abstract, concrete or a combination of abstract and concrete
representations did not result in significant differences in learning outcomes, students
who worked with concrete visualisations rated their task as easier and thought that they
had learnt more. It seems that combining text and different types of visualisations in
learning tasks does not necessarily enhance history learning. Possible explanations are a
lack of active use of the visualisations, elements of the real life setting (e.g., ability of
pupils to concentrate, absence of whole-class teaching), the way representations are
presented and relationships defined (i.e., the semiotics of the representations used) in
the domain of history, and the difficulty of multi-interpretable representation of
historical phenomena and concepts – that calls for a clear contextual frame of reference.
The central issue in this research is how multimodal representations can support
verbalisation and co-construction of meanings and relations, and thus contribute to the
attainment of domain specific conceptual knowledge. The studies in this thesis look at
learning tasks with multimodal representations, and at integrating different
representations in a timeline, and contrast these with learning tasks with textual
representations. In other words: How can learning activities with different multimodal
representations in relation to each other contribute to meaningful learning? The central
question addressed in this thesis was specified as follows:
How does making and connecting different types of multimodal representations
affect the collaborative learning process and the acquisition of a chronologicalconceptual frame of reference in 12 to 14-year-olds enrolled in pre-vocational
education?
Several conclusions can be drawn from these results. First, learning with multimodal
representations without integration in a timeline does not result in higher learning
outcomes than learning with textual representations – neither in the short term, nor in
the long run. Second, learning with multimodal representations integrated in a timeline
resulted in higher learning outcomes than learning with textual representations –
although the difference disappears in the long run. Third, compared to learning with
GENERAL DISCUSSION
65
Maaike Prangsma - Multimodal representations in collaborative history learning
textual representations, learning with multimodal tasks integrated in a timeline does
affect the collaborative learning process by eliciting more interaction in general and
more auxiliary content utterances in particular. Also, discussing content by referring to
visual elements and using more historical concepts are positively related to short-term
learning outcomes. Finally, learning with either concrete or abstract multimodal
representations, or with a combination of concrete and abstract multimodal
representations does not result in significantly higher learning outcomes than learning
with textual representations.
5.2 Methodological issues
5.2.1 Participants
All studies in this thesis involved 12 to 14-year-old participants in the first year of
pre-vocational secondary education (VMBO). A majority of Dutch pupils in secondary
school (some 60%) attend this level. In general, the language proficiency of pupils
attending such schools is lower than that of pupils in general secondary education, in
part due to the fact that a considerable proportion of these pupils (about 24%) are from
migrant families and do not have Dutch as a mother tongue.
However, because of their relatively low language proficiency, visualisations
might be particularly suitable for these learners. This also means that there may be a
differential effect for the use of visualisations. Previous research in the field of
representations for educational purposes has focused primarily on the undergraduate and
college preparatory levels. The existing lack of diversity in the choice of participants
was, of course, also an important reason to choose a different target population for the
research reported in this thesis. Rather than saying the findings reported here do not
necessarily generalise to other learner types, we can now say that perhaps the findings
of previous research do – at least to some extent – generalise to lower levels.
In addition, the practical nature of the tasks used in this research seems suitable
for this learner level, as pre-vocational secondary education is practice-oriented to suit
its pupils‟ needs. The evaluation questionnaire described in Chapter 4 showed that
pupils enjoyed working on the multimodal tasks, and that they felt they had learnt from
the tasks. At the same time, the pupils seemed to have some specific problems whilst
working on the multimodal representations that might be related to their skills and
experience. First, some of the participants in the studies described in Chapters 2 and 3
made some unexpected mistakes in colouring the maps in the post-test, for example by
colouring the North Sea as Islamic. This suggests that some of the participants lacked
basic map reading skills. It was beyond the scope of this research to investigate this
further. However, several studies have detected difficulties in working with and
interpreting maps (Postigo & Pozo, 1998; Scevak & Moore, 1998). Further research on
cartogram tasks may shed light on the question whether our target group – 12 to 14year-olds in pre-vocational secondary education – can recognise and interpret maps of
Europe. Second, the tasks might have been more effective if they had also been
reviewed in a whole-class discussion with the teacher, because these pupils are not used
to working independently for three consecutive lessons. Of course, this would probably
also have decreased differences between conditions.
There was considerable attrition in the experiment described in Chapter 2 and
Chapter 3. Although the experiment started out with 143 pupils, the final sample
includes only 85, an attrition rate of 40%. The high proportion of discarded dyads was
66
CHAPTER 5
Maaike Prangsma - Multimodal representations in collaborative history learning
mainly due to loss of concentration among pupils who were participating in the Muslim
religious festival Ramadan during the period the experiment took place. Many of these
pupils did not manage to finish all tasks, and several different pupils were absent from
each lesson. Due to the random assignment of students to the conditions, no effect of
this attrition on the results is expected.
5.2.2 Research design
The experiments described in this thesis were all carried out in ecologically valid
settings. Choosing for ecological validity has its pros and cons. The experiments were
closer to school learning situations than much previous research on multimodal
representations has been, but choosing for an ecologically valid setting also sets some
limitations on control and ethics. First, working with pupils and schools in an authentic
school setting implies that some things will go wrong, for example because people are
absent, rooms are not available, or pupils start walking around. Ethically, it is not
possible in a setting like this one to have a proper control group that only does the pretests, post-tests and retention tests or one that does not include essential learning
materials. The pupils in that condition would have to skip part of the curriculum or deal
with the topic much later, at an illogical point in the curriculum.
In addition, the experimenters encountered some technical and practical
problems with recording the pupil dialogues and organising sufficient space. First, out
of the 85 pupils left in the final sample of the first experiment, only 26 dyads (52 pupils)
delivered complete and usable recordings of all four tasks. Pupil discourse was recorded
using small portable tape recorders placed on the pupils‟ desks. Problems encountered
with these recorders concerned mainly technical malfunctioning and improper use of the
machines by pupils.
The second practical problem was organising sufficient space. The original idea
for the experiment described in Chapter 2 and Chapter 3 was to take dyads out of the
classroom one by one (two pupils at a time, or four at a time if a research assistant was
available) and give them three consecutive hours to finish the tasks. This would have
allowed for video recordings instead of only audio tape recordings and thus for the
capturing and analysis of full interaction. It would also have been possible to administer
the post-test immediately after the tasks for each dyad, instead of one to several days
after the task lessons. However, with three consecutive lessons for each dyad (out of
eight for a normal school day), about 24 pupils in each class, two classes per school, one
research assistant and two video cameras it would have taken six full days per school.
Unfortunately, it was not logistically possible for the schools to set aside space for such
an extended period of time.
No time was taken to prepare the participants for learning with multimodal
representations, whilst all participants were experienced users of textual representations.
The pilot study used for the studies described in Chapters 2 and 3 did not give reason to
include a training session. This may have given the pupils in the textual conditions an
advantage over those in the visual, timeline, concrete, abstract, and combined
conditions. In hindsight, some pupils did seem to have problems with the cartogram task
(see Chapter 3), and it might have been a good idea to build up the multimodal
representations instead of introducing all elements at once, in particular in the
experiment described in Chapter 4, where the abstract and concrete elements of the
representations were presented rather than assembled by the pupils. Building up the
representations might be especially important in the light of Nadolski, Kirschner, and
GENERAL DISCUSSION
67
Maaike Prangsma - Multimodal representations in collaborative history learning
Van Merriënboer (2005) who suggest that there might be different levels of construction
involved in complex learning tasks. This idea might be particularly helpful in improving
the cartogram task by introducing steps to help pupils understand the maps.
The task texts used for the studies reported on in Chapters 2 and 3 had different
numerical and topic characteristics, and these differences were not evened out between
the four tasks (Chapter 3), in part because the different topics required different texts.
Thus, it is possible to ascribe differences found between tasks to differences between
the accompanying texts. Recently research has been done on suitable text characteristics
focusing specifically on the same type of learners as in this thesis (pre-vocational
secondary education, VMBO). Research by Land, Sanders and Van den Bergh (2005)
shows that formal texts and narrative texts can have very different effects: Whilst the
former improves comprehension, the latter increases enjoyment, and thus motivation.
Although the texts used for the four tasks were carefully written to ensure their internal
coherence, the texts were not aligned to each other regarding a number of text
characteristics, such as the number of words. Possibly, the textual condition performed
as well as the other two conditions because the texts by themselves were powerful
enough to result in sufficient understanding of the topic. Still, the design of the first
study might have been strengthened by removing the pictures from the task texts,
although these texts and accompanying pictures were the same for all conditions. This
might have enlarged the differences between the textual and multimodal conditions.
In addition, it should also be noted that one condition was missing from the study
in Chapter 2: textual representations integrated in a timeline. The conclusion that the
timeline combined with multimodal representations is the catalyst for integration of
textual and visual information must be drawn cautiously: Possibly, differences between
the conditions are attributable to the timeline per se, and not to the combination of
timeline and multimodal representations.
In the experimental study with abstract and concrete visualisations (Chapter 4)
no differences between conditions were found. However, the test in that experiment
only measured factual recognition and recall. Van Meter and Garner (2005) showed that
benefits of tasks in which students construct visual representations (in their case
student-generated drawings) are revealed especially in higher-order assessments, such
as the test items used in the first experiment (Chapter 2).
5.2.3 Domain and topics
The tasks in the studies in Chapters 2 and 3 were presented in a fixed order for all dyads
in all conditions. It may seem a little odd to start a section on domain and topics by
saying that the tasks were not ordered randomly, as in a solid research design they
should be. However, this thesis deals with history learning, and chronology is an
inevitable ingredient of any topic in the domain of history – probably even its main
characteristic. As was noted on several occasions in this thesis, pupils have problems
developing a consistent chronological frame of reference. Consequently, the
chronological order of the topics used for the different tasks described in this thesis was
kept intact.
The next question is: Would the results have been different if different topics had
been chosen for the task content? This question can be looked at from two perspectives,
one comparing different conditions, and another comparing different tasks. For the first
point of view, it is important to realise that a topic can have a certain level of
„imaginability‟ or concreteness, but also that different texts about the same topic can
68
CHAPTER 5
Maaike Prangsma - Multimodal representations in collaborative history learning
have different levels of „imaginability‟ or concreteness. For example, an explanation of
the fall of the Roman Empire can be quite abstract, whereas an explanation of the retreat
of the Roman administrators from Western Europe which is a subtopic of the former can
be more easily visualised as actual people leaving a part of the world that still exists on
today‟s maps. Also, Vikings, for example, seem to tickle many children‟s imagination,
and once they know what they look like (the Vikings are part of the primary school
history curriculum in The Netherlands) they will likely not need extra pictures to make
their imagination run wild. Following this line of thought, a study comparing textual
and multimodal tasks about abstract topics should lead to larger differences than a study
comparing textual and multimodal tasks about concrete topics. However, the topics in
the studies discussed in this thesis all incorporated both abstract and concrete aspects:
not the abstractness or concreteness of the topics was varied, but the abstractness or
concreteness of the representation tasks (see Chapter 4).
The second point of view takes us back to Chapter 3, where different types of
multimodal representations were compared. There, the problem was that the four tasks
also had different topics, and different types of topics (e.g., developments, structure, and
causal relations). It might have been interesting to vary only one of these characteristics
at a time across four tasks. The four tasks would then have had a) the same
representation type and topic type, but a different topic (e.g., four developments, each
shown in a process diagram), b) the same representation type and topic, but a different
topic type, or c) the same topic and topic type, but a different representation type.
However, although it is possible to represent a topic from the viewpoint of different
topic types (situation b), for example by showing the development of manorialism, and
the structure of manorialism, the contents covered would not be the same, and no single
representation type would be suitable for representing both topic types. Similarly, using
different representations to represent the same topic types can easily result in illmatched topic-representation combinations. In other words, it is not always possible to
vary the factors of representation type and topic type, as they are often inherently linked
to each other.
5.3 Theoretical implications
Although Dual Coding Theory (Paivio, 1991) and the Cognitive Theory of Multimedia
Learning (Mayer, 2001) and other previous research on learning with multimodal
representations offer valuable insights, they do have their limitations, most notably in
(1) their generalisability from the technical or natural science domains to other domains,
(2) their tendency to focus on presented representations rather than construction of
representations by learners, and (3) their focus on individual settings and learning
outcomes rather than on collaborative settings and learning processes. The results of the
studies described in this thesis are discussed in the light of these aspects below.
5.3.1 Multimodal representations in different domains
Because of the specificity of the domain of history, generalisability of the results of this
thesis to other domains or topics may be limited. However, this is also one of the main
reasons why the domain of history was chosen: Most studies on learning with
multimodal representations have been limited to the natural sciences. Consequently, the
research presented in this thesis shows that perhaps learning with multimodal
representations per se is not as beneficial as some of the research on science learning
GENERAL DISCUSSION
69
Maaike Prangsma - Multimodal representations in collaborative history learning
suggests, although there does seem to be a bright future for learning by integrating
multiple multimodal representations.
The introduction to this thesis described several differences between the domains
of history and science. It was noted that representations in the natural sciences mainly
show „How things work‟, suiting the needs of learning in that domain. At the same time,
these representations also often show „What things look like‟. History learning on the
other hand requires building a chronological frame of reference. Knowing „What things
look like‟ certainly plays a role in forming such a framework, but what holds it together
is knowledge of „What things mean‟ and „Why things happened when they happened‟.
For example, after reading this thesis one may be able to interpret Figure 5.1b, but
Figure 5.1a can be interpreted more easily.
Figure 5.1a. Operation of a bicycle
pump (from Tro, 2003).
Figure 5.1b. Timeline of the transition between
Antiquity and the Early Middle Ages.
On the other hand, the task in the first study that most closely resembled a „How things
work‟-approach (i.e., Task 1 with its process diagram) also seemed to elicit the most
productive dialogue. Perhaps multimodal representations are most effective for topics
dealing with step-by-step processes such as the fall of the Roman Empire in Task 1.
The scale of the represented phenomena might also play a role here, both in
terms of length of time, and in terms of the size of objects or locations. Perhaps it is
more difficult to understand a timeline covering several ages with multiple events and
phenomena, than understanding how a bicycle pump fills a bicycle tire with air in a few
seconds, because the latter can be felt, relived in its entirety, making it easier also to
visualise. In addition, the scale of the overall objects is very different, as Figure 5.1a
and Figure 5.1b show: A bicycle pump can be held in one hand, whilst a village or
Empire cannot. Within the history domain this aspect of scale could be investigated
further by comparing, for example, learning about European versus local issues.
As was noted earlier in Chapters 2 and 4, the grammar – or semiotics – of
visualisations in the domain of history has no unambiguous principles or rules as there
are in science. A domain like science, even though it deals with complex phenomena
like waves, offers a structured visual grammar linked to basic governing principles –
such as frequency and wavelength –that remain the same, whether the wave runs
through water, air or solid matter. In history, however, even seemingly simple symbols
like arrows can be interpreted in different ways, for example as indicating dynamic
relations such as causality, or merely as temporal relations, or as static relations
describing the structure of phenomena (O‟Donnell et al., 2002). For example, the arrow
70
CHAPTER 5
Maaike Prangsma - Multimodal representations in collaborative history learning
shown in Figure 5.2, with a picture of a woman giving a chicken to a man, could be
interpreted in several ways: Why does she give him a chicken? Did it escape? Is she
paying him? Is he her husband, her boss, or a tax collector? Does she get anything in
return? The truth in the context of the Early Middle Ages is that the woman is a serf,
who gives part of her crop to the lord in exchange for protection. However, even when
the context is known pictures and schemas are often still open to multiple
interpretations. This makes it hard for students to learn to understand the form of
representation – the importance of which is underlined by Ainsworth (2006): There are
no fixed formats in history, for example for visualising time, war or manorialism.
Figure 5.2. What does the arrow mean?
5.3.2 Activity continuum: presented vs. construction
As was noted in Chapter 1, some studies on collaborative learning with multimodal
representations in small groups have shown that new domain knowledge can be built by
articulating or representing ideas and by making connections in the shape of concrete,
visual structures in concept maps (Van Boxtel, Van der Linden & Kanselaar, 2000;
Roth & Roychoudhury, 1994). By comparing interaction processes and learning
outcomes of learning activities with different types of representations we can gain
insight into the possibilities, and even more so the limitations of each of the
representational forms in (collaborative) history learning. The different types used in the
studies in Chapters 2 and 3 were: making a timeline, completing a storyboard (process
diagram), completing a causal map (network chart), describing a historical image
(structure diagram), and completing a cartogram. A comparison of these different
multimodal representation tasks in Chapter 3 showed that they do elicit different
dialogue, and some of the characteristics of the four different tasks might well be linked
to their place on the activity continuum. The task that seemed to encourage the most
content utterances involved the activity of ordering or sequencing pictorial information,
whilst less physical activity was involved in the tasks that elicited significantly fewer
content utterances. Although no dialogue analysis was done for the study in Chapter 4,
perhaps the absence of significant differences between conditions in that study is
attributable to the lack of physical activities – thus minimizing differences between the
conditions (i.e., textual, concrete, abstract, and combined). At the other end of the
continuum, research by Van Boxtel and Van Rijn (2006) suggests that letting pupils
GENERAL DISCUSSION
71
Maaike Prangsma - Multimodal representations in collaborative history learning
draw their own pictures – in their study drawings of the fall of the Roman Empire –
leads to better learning outcomes than presenting pictures to the pupils.
5.3.3 Facilitating collaborative learning with multimodal
representations
At the start of this research project, it was thought that multimodal representations
would have a facilitating role in collaborative learning, by providing a common referent
that makes communication easier, and by providing a shared frame of reference.
Dialogue analyses of the Timeline condition in Chapter 3 showed that some of the
content utterances did indeed include references to visual elements of the multimodal
representations. Thus, multimodal representations do seem to facilitate discussion of
content. One of the differences found between the textual and integrated multimodal
conditions that might explain the difference found in learning outcomes was, however,
not an increase in explicit talking about the frame of reference, but rather an increase in
implicit reference to content.
As was noted in Chapter 3, verbal deixis (e.g., through use of demonstrative
pronouns such as “this”, “that one”) in the dialogue suggested that physical deixis (i.e.,
extra-linguistic behaviour such as pointing) may have played an important role in the
communication between dyad partners. Pupils working individually would not point or
refer to anything, simply because there would be no one to point things out for or to talk
to. Still, the deictic properties of multimodal versus textual representations might be
clarified to some extent by videotaping dialogues for collaborative groups, versus eye
movement tracking for individuals. This would be an interesting topic for future
research.
5.3.4 Integrating multiple multimodal representations
As was noted in the introductory chapter, acquiring a chronological frame of reference
calls for visualisation and conceptualisation of specific periods in history and of
relationships between aspects of these periods. Previous research has shown that
students have trouble developing a consistent „chain of events and developments‟, and
as a result they cannot properly remember the chronological overview or reuse it,
causing confusion of phenomena and concepts (Beck & McKeown, 1994; Leinhardt,
Stainton & Virji, 1994). The results of the analysis of learning outcomes reported in
Chapter 2 show that integrating multiple multimodal representations in a timeline does
encourage the development of a chronological frame of reference. Dialogue analysis
confirmed the idea that visualisation and conceptualisation are important, as there was a
positive correlation between these aspects of the dialogue and learning outcomes for
learners working with multimodal timeline tasks. However, integration did not lead to
higher learning outcomes in the long run. Possibly, the chronological order of events is
harder to remember in the long term than other types of relations, and a timeline
difficult to reconstruct after a while so that it no longer functions as an anchor for
remembering.
5.3.5 Artificial laboratory studies vs. ecologically valid field studies
The research reported on in this thesis was done in a classroom setting in pre-vocational
secondary education, where the tasks were incorporated in the normal history
curriculum. Pupils were graded for their work and worked for an entire lesson during
72
CHAPTER 5
Maaike Prangsma - Multimodal representations in collaborative history learning
each session. Learning with multimodal representations in the context of the Cognitive
Theory of Multimedia Learning has been primarily tested in artificial, experimental,
laboratory-like settings, where participants were – sometimes paid – volunteers, who
did not receive grades for their learning outcomes, and where distraction by other
participants or surroundings was eliminated as much as possible. In addition, the
learning assignments were usually quite short, lasting from 30 seconds to 20 minutes.
For determining the value of theories on learning with multimodal representations for
use in education and in learning materials could benefit from an increase in the amount
of ecologically valid research, as there is apparently a gap between traditional
laboratory research and ecologically valid classroom-based research. As a result, it is
not clear whether learning with multimodal representations has equivalent positive
effects on learning outcomes in the context of the classroom where there is more
interference from factors such as motivation, or class atmosphere. The studies in this
thesis are an attempt to bridge this gap, and Table 5.1 gives an overview of some of the
major differences between Mayer‟s research on which the Cognitive Theory of
Multimedia Learning is based (Mayer, 2001), and the studies described in this thesis.
This gap may be bridged further by replicating better controlled laboratory research in
more ecologically valid settings, varying specific factors very carefully (e.g., learner
characteristics, domain characteristics, use of collaboration, types of representations,
integration of representations) one at a time. The research reported in this thesis also
suggests that in realistic educational settings, additional requirements may play a role in
the effectiveness of learning with multimodal representations. For example, multimodal
tasks might be more effective when they are embedded in a series of lessons that
include whole-class instruction and take into account the optimum attention span of the
pupils.
Table 5.1. Comparison between Mayer‟s work and the studies in this thesis.
Aspects
Mayer
This thesis
setting
lab
school
domain
physics
history
age
18-20
12-14
education level higher education psychology students
pre-vocational secondary education
location
US
NL
time
5 minutes
30 minutes
embedding
no embedding
in curriculum
group size
individual
collaborating dyads
activities
study the item
complete
no integration of multiple
integration
integration of multiple representations
representations
5.4 Future research
It would be interesting to investigate how text can support correct interpretation of
representations in different domains, as Ainsworth (2006) claims that text guides the
interpretative process in picture comprehension. In history a picture without text will
often result in a guessing game. Text is required there. Again, this might be very
different for domains or topics that deal with „How things work‟. Future research could
investigate what text characteristics are suitable for a task in which visual and textual
elements need to be integrated.
GENERAL DISCUSSION
73
Maaike Prangsma - Multimodal representations in collaborative history learning
Although no explicit attempt was made to falsify the Cognitive Theory of
Multimedia Learning in this thesis, the results suggest that prudence is in order in
generalising the principles that lie at the foundation of CTML to their use and effects in
actual learning situations in home, study and school settings. Many other studies
referring to Mayer‟s work – whether they are replication studies or indirect validations
such as the studies in this thesis – have raised the same doubts. The modality effect, for
example, could not be confirmed by a number of studies (De Westelinck, Valcke,
De Craene, & Kirschner, 2005; Jeung & Chandler, 1997; Kalyuga, Chandler, &
Sweller, 2000; Tabbers, 2002; Tabbers, 2006). It would be interesting to see how well
CTML holds when one element is changed at a time, for example, the context, the
domain, or learner type.
The research was done in pre-vocational schools with students who do not have
much prior knowledge of the topics chosen for the tasks. The effects of using
multimodal representations were often weaker than expected. In future research
variation in student characteristics – such as prior knowledge, level of experience in the
construction of multimodal representations in the domain of history, and collaboration
skills – is necessary to investigate the prerequisites for using this type of material in
pre-vocational education.
The results in this thesis suggest that it is worth taking a closer look at the role of
deictic properties of multimodal representations in collaborative learning, both with
respect to verbal deixis and physical deixis (Suthers, Girardeau, & Hundhausen, 2003).
A similar approach is also suggested by Jewitt, Kress, Ogborn, and Tsatsarelis (2001),
who make a plea for a multimodal approach to interaction research where not just the
linguistic interaction is analysed, but also the visual and physical interaction.
Finally, there is a dearth of research on how different types of representations
(such as the eleven categories defined in Lohse, Biolsi, Walker, and Rueter, 1994) work
differentially by eliciting differences in learning processes. In addition, more research is
needed on how these different types of representations work together. From a domainspecific point of view, this could lead to further research on understanding and
remembering temporal relations through construction of timelines, including through
integration of multiple (multimodal) representations.
74
CHAPTER 5