Baltic hard bottom mesocosms unplugged: replicability, repeatability

Journal of Experimental Marine Biology and Ecology,
240 (1999) 229–258
L
Baltic hard bottom mesocosms unplugged: replicability,
repeatability and ecological realism examined by
non-parametric multivariate techniques
Patrik Kraufvelin*
˚
Department of Biology, Abo
Akademi University, Artillerigatan 6, FIN-20520 Turku, Finland
Received 19 March 1998; received in revised form 16 April 1999; accepted 1 May 1999
Abstract
The general utility of large-scale artificial ecosystems for ecological and ecotoxicological
research is evaluated by a case study of the replicability, repeatability and ecological realism of a
Baltic Sea hard bottom littoral mesocosm (called BHB-mesocosm). The structure (species
abundance and biomass listings) of the macrofauna community associated with bladder-wrack,
Fucus vesiculosus L., in nine control mesocosms run during 4 separate years, is investigated by
descriptive and analytical multivariate statistical techniques.
Non-metric multidimensional scaling (NMDS) is used to visualise differences in community
structure between mesocosm controls run during the same year (replicability), differences among
mesocosms run during different years (repeatability) and differences between mesocosms and the
field mother system (ecological realism). Analytically the community structure of parallel
mesocosms is shown to differ significantly by use of one-way analysis of similarities (ANOSIM)
tests, which demonstrates a poor replicability. Despite this high degree of variability between
parallel mesocosms, the null hypothesis of no differences among years can in turn be rejected by a
two-way nested ANOSIM for both abundance and biomass data, which is evidence of a poor
repeatability. If it is assumed that the field samples taken in connection with the transplantation are
representative of the initial situation in the mesocosms, the divergence of the mesocosms from the
mother systems was considerable in all years. When all 4 years are used as replicates, i.e.
independent observations on dissimilarities between the mother system and the mesocosms after 5
months of enclosure, in a two-way crossed ANOSIM, significant differences for both macrofauna
abundance and biomass data are revealed, which is evidence of a poor ecological realism.
The similarity percentage breakdown procedure (SIMPER) establishes the species principally
responsible for these differences. Considering abundance data Theodoxus fluviatilis is always an
important discriminator between groups, mostly accompanied by Mytilus edulis and either Idotea
spp. (replicability) or Gammarus spp. (realism). For biomass data Mytilus and Idotea are the most
important overall discriminators, followed by Lymnaea spp. (repeatability and realism) and
*Tel.: 1358-2-215-4052; fax: 1358-2-215-4748.
E-mail address: [email protected] (P. Kraufvelin)
0022-0981 / 99 / $ – see front matter  1999 Elsevier Science B.V. All rights reserved.
PII: S0022-0981( 99 )00061-1
230
P. Kraufvelin / J. Exp. Mar. Biol. Ecol. 240 (1999) 229 – 258
Cerastoderma glaucum (realism). Finally the factors and processes restricting mesocosm performance are outlined and their consequences are briefly discussed. It is concluded that the degrees
of replicability, repeatability and ecological realism are too low for straightforward use of these
and probably most other mesocosms in predictive risk assessment or in extrapolation of results to
natural ecosystems.  1999 Elsevier Science B.V. All rights reserved.
Keywords: Baltic Sea; Community structure; Ecological realism; Mesocosm; Multivariate
statistics; Repeatability; Replicability; Rocky shore macrofauna
1. Introduction
It is a disquieting fact that so much of the research in marine mesocosms just seems to
have been driven by the need to investigate the fates and effects of pollutants with the
consequence that basic underlying information on the principles that govern the
functioning of these experimental systems, and of the natural systems of which they are
living models, have been given inadequate attention (Pilson, 1990). Replicability,
repeatability and realism are all important aspects of experimental ecosystems, but
nevertheless almost unknown when it comes to larger, more complex systems.
Replicability is defined as the degree of similarity of spatially replicated experimental
units that are meant to represent the same conditions by definition and design, i.e. in this
paper mesocosm controls run during the same year. The term repeatability is used to
describe the similarity of responses in independent systems that are observed at different
points of time within the same research facility and by use of the same scientific
methods, i.e. in this study mesocosm controls run at the same place but during different
years. The ecological realism or accuracy of the mesocosm is the degree of similarity
between the artificial system and the natural ecosystem mimicked. All definitions
originate from Giesy and Allred (1985).
The economically and logistically optimal implementation of replicability and
repeatability is more or less diametrically opposed to the essential concept of test system
realism and a central problem of ecological experimentation (Kuiper et al., 1983;
Hurlbert, 1984; de Lafontaine and Leggett, 1987). This is mainly due to the fact that all
these aspects of mesocosm similarity are related to scale (Gamble, 1990; Cairns and
McCormick, 1991; SETAC-Europe, 1991; Landis et al., 1997). Ecological realism tends
to increase gradually with increasing spatial and temporal scale of a study, while our
abilities to replicate and repeat an experiment logically decrease more rapidly, because
of the restrictions all artificial designs impose on the included ecosystem parts
(Carpenter, 1996). Therefore a high degree of replicability and repeatability is generally
sacrificed on behalf of ecological realism, which mostly is stated to be the major reason
for carrying out a mesocosm study in the first place. Although it has been stressed that
the value of a model ecosystem resides in its ability to mimic some real system, not
itself (Perez, 1995), all these three aspects of internal and external mesocosm similarity
are closely interlinked when it comes to interpretation and validation of results. Since all
three cannot be met with the same intensity simultaneously and by use of the same
P. Kraufvelin / J. Exp. Mar. Biol. Ecol. 240 (1999) 229 – 258
231
methods, the scientist is often forced to emphasise one or the other. The typical trade-off
from increased ecological realism gained with a larger mesocosm size is less experimental control, often expressed as less information on individual processes, less isolation of
cause and effect as well as less ability to define accurately the densities of contained
biota, which ultimately makes it harder to interpret results (Steele, 1979; Stephenson et
al., 1984; Crossland and La Point, 1992).
Only a few case studies are available, where the replicability of aquatic mesocosms
(i.e. artificial ecosystems bigger than 1 m 3 ) have been thoroughly described (Takahashi
et al., 1975; Pilson et al., 1980; Brazner et al. 1989; Heimbach et al. 1992; Rosenzweig
and Buikema, 1994; Jenkins and Buikema, 1998; Kraufvelin, 1998). The ecological
realism of aquatic mesocosms has on the other hand been discussed more frequently
(e.g. Gearing, 1989; Lalli, 1990; Adey and Loveland, 1991; Clark and Cripe, 1993;
Kennedy et al., 1995), although most comparisons between mesocosms and the field
have not been very ambitious or innovative (Pilson, 1990; Perez, 1995). Precise data on
mesocosm repeatability, finally, have never been presented at all. This last finding is
least to say surprising, especially taking into account the large number of aquatic
mesocosm test systems currently in operation world-wide and the fact that also some
insight in repeatability is needed if mesocosms are to be used for prediction of real
effects in natural systems (Crane, 1997). In the recent paper by Kraufvelin (1998) some
problems with the replicability of BHB-mesocosm were pointed out. Coefficients of
variation (CVs) were presented for a large number of variables, many of which
previously have been used as test endpoints in the mesocosm in question (e.g. Landner et
al., 1989; Lehtinen and Tana, 1992; Lehtinen et al., 1993, 1994, 1995, 1996, 1998; Tana
et al. 1994). These CVs were generally so high that they would probably have prevented
any detection of significant differences between controls and treatments, if only real
replicates not ‘simple pseudoreplicates’, as defined by Hurlbert (1984), had been used.
Partly in order to accentuate the problem with poor replicability and partly since many
large-scale community studies, that more effectively could be analysed by other means,
still are stuck with univariate methods, the paper by Kraufvelin (1998) was only
concerned with the univariate one-way ANOVA. A more proper (eco)logical approach to
analyse this data would have been to carry out some kind of multivariate statistical
analysis parallel to the univariate ones. This would increase the understanding of the
behaviour of the bladder-wrack macrofauna communities in several replicated and
repeated control mesocosms. Comparisons with simultaneous measurements in the field
would further strengthen the overall picture of the performance of BHB-mesocosms and
other large-scale experimental ecosystems.
In this paper I therefore first present the replicability, repeatability and ecological
realism of BHB-mesocosms visually by ordination. Then I discriminate the parallel
mesocosms (replicability), different years (repeatability) and the mesocosms and mother
system (ecological realism) analytically by various ANOSIM tests. This is first done by
using subsampled bladder-wrack plants (replicability) and then the mesocosms themselves as replicates (repeatability and realism). Note that the former tests do not imply
pseudoreplication as long as they are just considered along the line ‘is mesocosm A
different from mesocosm B’ and no issues of causality (e.g. confounding with
treatments) are incorporated (Hurlbert, 1984). Further objectives include to search for
232
P. Kraufvelin / J. Exp. Mar. Biol. Ecol. 240 (1999) 229 – 258
the causes behind the observed patterns in macrofauna community structure and to
demonstrate some often overlooked problems when working with living communities.
This will be accomplished by examination of bladder-wrack macrofauna communities
both in the mesocosms and in the field mother system and by pin-pointing the species
and processes basically responsible for observed differences between studied groups.
This is possible thanks to the low number of species present in the mesocosms and in the
Baltic bladder-wrack zone in general, at least compared to fully marine environments
(Haage, 1975; Wallentinus, 1991; Kautsky et al., 1992). Finally I discuss the consequences of these findings for an effective use of large-scale mesocosms in ecological and
ecotoxicological research.
2. Materials and methods
Since a thorough description of the mesocosms 1989–1992, the field sampling,
transplantation to mesocosms, mesocosm sampling, and analytical methods already has
been given in Kraufvelin (1998), only information of immediate relevance for this paper
will be provided below. The BHB-mesocosms were situated at a field station belonging
to the Finnish Environmental Research Group in Nagu, Archipelago Sea, SW Finland
(608 139 N, 228 069 E). The mesocosms consisted of 11–15 (depending on year) circular
outdoor basins (volume 8 m 3 ) equipped with a flow-through system of brackish water
(salinity 6‰), which was pumped unfiltered to the mesocosms from a nearby bay (8 m
of depth). The mean flow rate into each mesocosm was 168 l h 21 . Most mesocosms
received different kinds of or dilution of pulp mill effluents, just one mesocosm for each
treatment, but two to three replicated mesocosms each year served as controls. These
latter unpolluted systems are the ones to be examined closer in this paper and they will
be labelled A and B (1989); C, D and E (1990); F and G (1991) and H and I (1992).
Bladder-wrack communities, consisting of brown alga (8 l per mesocosm) with
associated macroinvertebrates and periphyton, were usually transplanted to the mesocosms in June (1990: early July, 1992: late May) and the mesocosms were run for | 5
months until November. At the end of the experiments five (1990–1992) or six (1989)
subsamples, each consisting of one bladder-wrack specimen with associated algae and
macrofauna communities, were taken from each mesocosm, in order to estimate the
amount of organisms associated with the alga. Three to six samples were also taken from
the field each year. These field samples were taken both in connection with transplantation to mesocosms (to get a rough measure of the initial community structure in the
mesocosms) and in connection with termination of experiments. Two semi-sheltered
field locals, Havero¨ (1989, 1991–1992) and Utterholm (1991), were used. In 1991,
monthly samples were taken from Utterholm for examination of seasonal differences and
in 1996 a large number of field samples (64) were taken from Havero¨ to be used for
simulated transplantations to the mesocosms. The field samples from November are
labelled with the letter N and the year when they were gathered 89, 90, 91 and 92,
whereas field samples from early summer are labelled with the letter S followed by the
year (90, 91 and 92). Note that no start samples are available from 1989. The monthly
P. Kraufvelin / J. Exp. Mar. Biol. Ecol. 240 (1999) 229 – 258
233
field samples from 1991 are labelled in the following way: JN (samples from June), JL
(July), A (August), S (September), O (October) and N (samples from November).
At the sampling for transplantation a plastic bag was carefully folded over each Fucus
specimen including the stone to which the alga was attached (at the sampling for
laboratory analyses the stone was removed). By using this method most of the mobile
and agile animals associated with the bladder-wrack could not escape or loosen. At the
laboratory the volumes of algae were determined by the water displacement method.
After termination of experiments, animal abundances and biomasses (wet weight) were
related to the dry weight of bladder-wrack (after 24 h at 608C) for comparisons among
samples. Just eight species or taxa of macrofauna associated with the bladder-wrack are
included in the analyses, i.e. groups for which both abundance and biomass data were
registered all 4 experimental years. These species / taxa are: Gammarus spp. i.e. G.
˚ and G. zaddachi Sexton; Idotea spp., i.e. I. baltica (Pallas) and I.
oceanicus Segerstrale
chelipes (Pallas); Palaemon adspersus (Rathke); Theodoxus fluviatilis L.; Lymnaea spp.,
¨
i.e. L. peregra (Moller)
and L. stagnalis (L.); Mytilus edulis L.; Cerastoderma glaucum
(Poiret) and Gasterosteus aculeatus L. The real species lists were, however, much
longer. The following species or taxonomic groups were also found at least once in the
mesocosm bladder-wrack 1989–1992: Turbellaria, Acarina, Chironomidae, Tricoptera,
Zygoptera, Gerris sp., Corixa sp., Notonecta sp., Gyrinus sp., Dytiscus sp., Balanus
improvisus Darwin, Jaera albifrons Leach, Praunus inernis (Rathke), Electra crus¨
tulenta (Pallas), Hydrobia spp., Macoma balthica (L.), Nereis diversicolor (Muller),
Oligochaeta, Cottus gobio L., Pungitius pungitius (L.), Alburnus alburnus (L.), Phoxinus
phoxinus (L.), Myoxocephalus quadricornis (L.) and Zoarches viviparus (L.).
Multivariate statistical analyses of community structure were done using the PRIMER
program (Plymouth Routines In Multivariate Ecological Research). The methods of this
program have been described in detail by Clarke and Warwick (1994) and Carr (1996).
Non-parametric multivariate techniques were used as recommended for biological data
by Clarke (1993, 1999). Abundance data were square-root transformed and biomass data
fourth-root transformed in order to maintain roughly the same importance of less
dominant species for both types of data. The transformed data were put into triangular
matrices based on Bray–Curtis similarities. Ordination of samples was performed by
NMDS (Kruskal and Wish, 1978; Clarke and Green, 1988), a robust method that deals
with non-linearities by using ranks. Significance tests for differences among parallel
mesocosms (replicability) were carried out using one-way ANOSIM permutation tests,
whereas differences among years (repeatability) were examined with a two-way nested
ANOSIM and differences between mesocosms and the mother system (realism) by a
two-way crossed ANOSIM. The species contributing to Bray–Curtis dissimilarities
between parallel mesocosms, among years as well as between mesocosms and the field
sites were investigated using the similarity percentage breakdown procedure, SIMPER
(Clarke, 1993).
The simulated transplantation of field bladder-wrack from 1996 to the mesocosms was
repeated 10 times. At each simulation the 64 bladder-wrack samples were distributed to
four mesocosms in such a way that each mesocosm received 16 samples (standardised to
100 g DWT). Separate coefficients of variation (CVs) were calculated for all 10
234
P. Kraufvelin / J. Exp. Mar. Biol. Ecol. 240 (1999) 229 – 258
occasions and averaged to get a measure of the overall initial variability. The results
from the simulated transplantations were analysed using SPSS.
3. Results
3.1. Visualisation and analytical testing of mesocosm performance
Macrofauna raw data are initially presented to get a rough picture about the absolute
differences between the studied groups (Table 1): mesocosm abundance data (Table 1a),
mesocosm biomass data (Table 1b), field abundance data (Table 1c) and field biomass
data (Table 1d). NMDS-ordinations of Fucus macrofauna abundance (Fig. 1) and
biomass data (Fig. 2) using the five to six subsamples available from all nine control
mesocosms (labelled with the letters A–I) all 4 years, demonstrate that parallel
mesocosms are fairly different (the degree of replicability is poor). One-way ANOSIM
tests based on the same similarity matrices and with subsamples used as replicates
confirm that these differences are statistically significant (before correcting for multiple
testing) all 4 years for both abundance and biomass data (Table 2).
From the figures (Figs. 1 and 2) it is also obvious that there are differences among
different years, since 1989 mesocosm samples are almost exclusively found in the lower
middle or at the bottom, 1990 samples to the upper left, 1991 samples more or less in
the middle, whereas 1992 samples may be found most to the right of the two ordination
maps. The suspected differences among years could be confirmed for both abundance
and biomass data with a two-way nested ANOSIM using the mesocosms as replicates
with subsamples nested in the analysis. The uncorrected probability values (P-values) for
global tests of equal community structure among years are 0.030 for abundance and
0.019 for biomass data (Table 3).
Presentations of the initial start communities and the mesocosm and field mother
system communities in November in the same NMDS-ordinations demonstrate that the
mesocosms have diverged considerably not only from each other (poor replicability and
poor repeatability) but also from the field during the experimental periods both
considering abundance (Fig. 3) and biomass data (Fig. 4). In these figures a poor
replicability is shown by the angles and different lengths of the arrows connecting the
mesocosms with the respective original start community (transplantation data from field
samples in early summer). A poor repeatability is visualised by the different characteristics of the three ‘arrow complexes’ with 1990 to the left, 1991 in the middle and 1992 to
the right. A poor ecological realism is finally indicated by the ‘mesocosm’ and ‘field’
arrows spreading out in almost all directions, although they are originating from a
theoretically similar community structure each year at the start of experiments. When
just mesocosm samples from November and pooled November field samples are
presented in the same NMDS-ordination, the actual dissimilarities between mesocosms
and the field become even more evident for both abundance (Fig. 5) and biomass data
(Fig. 6). These differences in community structure are established as statistically
significant, with uncorrected P-values at 0.009 for both abundance and biomass data,
using two-way crossed ANOSIM (Table 4).
P. Kraufvelin / J. Exp. Mar. Biol. Ecol. 240 (1999) 229 – 258
235
Table 1
Distribution of macrofauna in samples from control mesocosms and the field 1989–1992 (values per 100 g
Fucus DWT): (a) abundance in mesocosms, (b) biomass (g WWT) in mesocosms, (c) abundance in the field,
(d) biomass (g WWT) in the field
(a) Abundance in mesocosms
Cerastoderma
Gammarus spp.
Gasterosteus
Idotea spp.
Lymnaea spp.
Mytilus
Palaemon
Theodoxus
A
B
C
51.86
6.18
22.05
0.46
200.49
3.89
8.85
0.83
8.30
0.38
42.17
0.22
230.64
0.11
8.41
0.24
4.55
1.43
32.06
0.23
464.05
Total
346.96
295.28
(b) Biomass in mesocosms
Cerastoderma
Gammarus spp.
Gasterosteus
Idotea spp.
Lymnaea spp.
Mytilus
Palaemon
Theodoxus
A
B
Total
(c) Abundance in the field
Cerastoderma
Gammarus spp.
Idotea spp.
Mytilus
Palaemon
Theodoxus
Total
(d) Biomass in the field
Cerastoderma
Gammarus spp.
Idotea spp.
Mytilus
Palaemon
Theodoxus
Total
0.46
65.46
–
0.092
3.846
D
–
–
33.79
0.77
387.55
65.70
–
171.83
F
16.19
9.02
–
37.77
24.28
80.47
–
277.13
511.08
435.67
245.39
444.86
490.81
628.10
463.35
C
D
E
–
F
G
H
I
3.455
1.531
6.126
0.203
6.452
0.118
0.379
0.231
0.574
0.128
10.732
0.137
6.349
0.014
0.511
0.051
0.361
0.070
9.077
0.126
9.062
21.706
18.648
19.272
–
E
0.45
10.50
0.26
2.35
0.057
0.804
0.083
0.157
–
–
6.96
–
0.90
–
0.128
22.063
0.69
33.30
205.35
0.46
210.81
–
0.358
0.258
4.123
5.380
0.065
1.020
18.850
0.294
7.659
20.194
24.523
28.504
15.509
–
3.101
0.260
0.076
0.146
4.078
5.336
–
2.205
1.795
14.155
–
12.568
0.625
7.770
2.20
6.04
91.51
0.20
383.00
H
150.38
6.54
0.52
6.10
70.32
184.77
–
209.47
4.37
3.49
0.529
0.460
0.435
–
G
–
N89
0.65
26.77
4.85
37.40
–
2.01
S90
–
179.73
36.18
78.32
–
187.35
N90
–
14.19
1.02
58.61
–
8.58
S91
–
57.29
6.60
44.71
3.70
4.05
N91
2.07
86.53
1.38
100.53
–
7.92
S92
1.28
302.20
8.56
151.94
–
25.12
N92
1.50
42.75
1.67
111.55
–
2.27
71.68
481.59
82.40
116.35
198.42
489.10
159.75
N89
0.008
0.346
0.240
4.058
–
0.010
S90
–
1.029
1.115
10.436
–
5.300
N90
–
0.432
0.015
10.136
–
0.141
S91
–
0.182
0.240
7.728
2.532
0.088
N91
0.015
2.039
0.199
7.189
–
0.177
S92
0.034
1.963
0.337
4.684
–
–
N92
0.025
0.374
0.233
4.036
–
0.025
4.660
17.880
10.724
10.770
9.619
7.418
4.693
I
2.93
9.81
–
0.149
0.748
–
3.393
0.061
5.902
21.610
0.356
5.142
16.389
33.970
–
The P-values that have been corrected for multiple comparisons, using Holm’s (1979)
sequential Bonferroni, are still significant for the test of repeatability (Table 3, P 5 0.038
for both abundance and biomass) and the test of ecological realism (Table 4, P 5 0.018
for both). For the replicability test (Table 2) only one out of eight ‘significant’ P-values
(P 5 0.002 for abundance comparison between A and B 1989) is still significant after
236
P. Kraufvelin / J. Exp. Mar. Biol. Ecol. 240 (1999) 229 – 258
Fig. 1. NMDS-ordination of square-root transformed Fucus macrofauna abundance data in control mesocosms
1989–1992 (stress 0.18).
correction, if a is set at 0.05. Since a P-value of 0.008 (for abundance comparisons
between F and G as well as H and I and biomass comparisons between H and I) is the
minimum attainable significance level with 126 distinct permutations, while a sequential
Bonferroni would demand a P-value , 0.004 for statistical significance, such corrections may be too conservative and have a limited usefulness (Clarke, 1999). Due to the
increased risks of Type II errors (to conclude no difference when one exists) comprised
by these adjustments and the fact that Type II errors are at least equally serious as Type I
errors (to conclude a difference when none exists) in environmental sciences (Peterman,
1990; Fairweather, 1991; Mapstone, 1996), also the uncorrected P-values will be
retained and considered in the discussion together with the corrected P-values.
P. Kraufvelin / J. Exp. Mar. Biol. Ecol. 240 (1999) 229 – 258
237
Fig. 2. NMDS-ordination of fourth-root transformed Fucus macrofauna biomass data in control mesocosms
1989–1992 (stress 0.21).
3.2. Species responsible for the observed differences
3.2.1. Mesocosm replicability
The contributions of individual species to Bray–Curtis dissimilarities between parallel
mesocosms (replicability) were determined by similarity percentage analyses (SIMPER)
on square-root transformed abundance (Table 5a) and fourth-root transformed biomass
data (Table 5b). The highest average dissimilarities are found for A and B 1989
(di 532.80 for abundance and di 531.70 for biomass data), F and G 1991 (di 528.28 for
abundance data) and H and I 1992 (di 530.60 for abundance data), whereas C and D
1990 were the most similar of all parallel controls (di 519.45 for abundance data), which
238
P. Kraufvelin / J. Exp. Mar. Biol. Ecol. 240 (1999) 229 – 258
Table 2
One-way ANOSIM test of differences in community structure (square-root transformed abundance and
fourth-root transformed biomass data) between parallel controls 1989–1992 a
Controls
compared
Statistical
value
Possible
permutations
Significant
statistics
Significance
level
Adjusted
sign. level
Abundance
A–B 1989
C–D 1990
C–E 1990
D–E 1990
F–G 1991
H–I 1992
0.511
20.002
0.612
0.244
0.740
0.664
462
126
126
126
126
126
1
55
2
11
1
1
0.002
0.437
0.016
0.087
0.008
0.008
**
0.024 *
0.714
0.198
0.348
0.088
0.088
Biomass
A–B 1989
C–D 1990
C–E 1990
D–E 1990
F–G 1991
H–I 1992
0.492
0.072
0.432
0.064
0.304
0.628
462
126
126
126
126
126
4
30
4
33
4
1
0.009
0.238
0.032
0.262
0.032
0.008
**
*
**
**
*
*
**
0.088
0.714
0.198
0.714
0.198
0.088
a
The right column shows P-values corrected by a sequential Bonferroni. The symbol * means that
0.01,P#0.05 and ** means that P#0.01.
also can be anticipated from the P-values in ANOSIM tests in Table 2. Gammarus is an
important discriminator between mesocosms 1989 (both considering abundance and
biomass) due to a mass occurrence in A, but otherwise this species does not contribute to
much of the dissimilarities. Idotea is a central discriminator between A and B 1989, C
and E 1990 as well as F and G 1991 and consistently more dominant in former
mesocosms each year. Palaemon adspersus is of some importance regarding biomass
comparisons between D (highest biomass) and the two other controls 1990 (C and E).
Theodoxus fluviatilis is important concerning abundance comparisons between all
parallel mesocosms, although the differences are remarkable only in 1990 and 1991.
Lymnaea is mainly important for biomass data, especially in discriminating A and B
1989, as well as C and the other two control mesocosms 1990 (D and E), with
consistently higher biomass in former mesocosms each year. Cerastoderma is most
important during later years, i.e. in discriminating F and G 1991, and especially H and I
Table 3
Results of a global two-way nested ANOSIM on differences in macrofauna community structure (abundance
and biomass) among years 1989–1992 (repeatability) on square-root transformed abundance and fourth-root
transformed biomass data a
Variable
Global R
Permutations
Number of
sign. statistics
Uncorrected
P-value
Corrected
P-value
Abundance
Biomass
0.444
0.533
1260
1260
38
24
0.030 *
0.019 *
0.038 *
0.038 *
a
All possible permutations (1260) were used. The right column shows P-values corrected by a sequential
Bonferroni. The symbol * means that 0.01,P#0.05.
P. Kraufvelin / J. Exp. Mar. Biol. Ecol. 240 (1999) 229 – 258
239
Fig. 3. NMDS-ordination of mesocosm and field samples at the start and at the end of experiments for Fucus
macrofauna square-root transformed abundance data (stress 0.10). Arrows connect samples from the same year
and visualise the divergence of mesocosm and field communities. Note that no field samples from the start
were available in 1989.
1992 due to a mass occurrence in H. Mytilus edulis, finally, is a central discriminator
between all parallel mesocosms considering both types of data and has consistently
higher abundances and biomasses in latter pools every year, i.e. in B, E, G and I. See
also Table 1a and b for the exact numerical values.
3.2.2. Mesocosm repeatability
The two-way nested ANOSIM could only demonstrate global differences, i.e. that
there are overall differences in community structure 1989–1992, regarding the repeatability of BHB-mesocosms. A closer look at the years, two at a time with SIMPER
240
P. Kraufvelin / J. Exp. Mar. Biol. Ecol. 240 (1999) 229 – 258
Fig. 4. NMDS-ordination of mesocosm and field samples at the start and at the end of experiments for Fucus
macrofauna fourth-root transformed biomass data (stress 0.09). Arrows connect samples from the same year
and visualise the divergence of mesocosm and field communities. Note that no field samples from the start
were available in 1989.
analyses (Table 6a and b), reveals that the most different pair of years are 1989 and 1992
(41.39% Bray–Curtis dissimilarity for abundance and 33.04% dissimilarity for biomass
data) and 1990 and 1992 (40.37% dissimilarity for abundance and 34.24% dissimilarity
for biomass data), which also can be anticipated from the NMDS ordinations. The
molluscs Lymnaea, Cerastoderma and Mytilus edulis, which all are found at considerably higher abundances in 1992, are the most important abundance discriminators
between 1992 and 1989–1990. Also Theodoxus fluviatilis is an important discriminator
for abundance data. Considering biomass data Lymnaea is especially important
P. Kraufvelin / J. Exp. Mar. Biol. Ecol. 240 (1999) 229 – 258
241
Fig. 5. NMDS-ordination of mesocosm and field samples at the end of the experiment on Fucus macrofauna
square-root transformed abundance data (stress 0.07).
comparing 1992 with 1989 and 1990, and Mytilus edulis when just the earlier years are
compared with one another. The exact numerical values are given in Table 1a and b.
3.2.3. Ecological realism of the mesocosm
The highest overall dissimilarities in this paper are not surprisingly found for the
comparisons of macrofauna community structure between the mesocosms and the field
mother system, i.e. 58.54% for abundance data (Table 7a) and 44.73% for biomass data
(Table 7b). These differences also proved to be statistically significant using a two-way
crossed ANOSIM. SIMPER analyses demonstrate that Theodoxus fluviatilis L., whose
normal autumn migrations to deeper waters are prevented in the mesocosms (thus the
242
P. Kraufvelin / J. Exp. Mar. Biol. Ecol. 240 (1999) 229 – 258
Fig. 6. NMDS-ordination of mesocosm and field samples at the end of the experiment on Fucus macrofauna
fourth-root transformed biomass data (stress 0.08).
Table 4
Results of global tests from a two-way crossed ANOSIM on differences in macrofauna community structure
between mesocosms and the mother system at the end of experimental periods 1989–1992 a
Variable
Global R
Permutations
Number of
sign. statistics
Uncorrected
P-value
Corrected
P-value
Abundance
Biomass
1.000
0.772
108
108
1
1
0.009 **
0.009 **
0.018 *
0.018 *
a
All possible permutations (108) were used. The right column shows P-values corrected by a sequential
Bonferroni. The symbol * means that 0.01,P#0.05 and ** means that P#0.01.
P. Kraufvelin / J. Exp. Mar. Biol. Ecol. 240 (1999) 229 – 258
243
Table 5
Each species’ contribution (di ) to the average dissimilarities between parallel mesocosms: (a) square-root
transformed abundance data and (b) fourth-root transformed biomass data
Species
di 89AB
di 90CD
di 90CE
(a) Abundance
Cerastoderma
Gammarus
Gasterosteus
Idotea
Lymnaea
Mytilus
Palaemon
Theodoxus
Total d %
2.16
7.67
0.76
6.83
2.62
5.53
0.71
6.53
32.80
0.60
2.37
0.66
2.12
1.63
3.92
1.00
7.15
19.45
0.27
2.11
0.57
2.70
1.81
4.10
0.36
15.20
27.12
(b) Biomass
Cerastoderma
Gammarus
Gasterosteus
Idotea
Lymnaea
Mytilus
Palaemon
Theodoxus
Total d %
3.55
4.62
2.06
5.05
5.06
7.27
2.65
1.44
31.70
1.76
2.13
2.61
3.74
3.84
4.79
4.12
0.66
23.65
0.93
1.76
2.22
5.32
4.23
3.50
1.60
3.46
23.03
di 90DE
0.55
2.94
0.38
2.03
–
6.55
1.17
12.74
26.36
–
2.92
1.55
4.06
–
6.63
4.46
3.11
24.17
di 91FG
3.47
2.29
di 92HI
6.76
3.59
5.43
0.22
6.53
28.28
11.19
1.21
0.52
2.12
3.01
5.97
0.42
6.16
30.60
4.00
2.87
–
7.77
3.42
5.05
–
1.96
26.41
5.05
2.38
1.86
2.94
1.42
4.36
2.59
2.16
22.76
–
high abundance values), by far is the most important discriminator. Gammarus and
Mytilus edulis, which contribute to the abundance dissimilarities with more than 10%
each, are other important discriminators. Lymnaea, Mytilus edulis, Idotea and Cerastoderma are in addition to Theodoxus central contributors to biomass dissimilarities. See
also Table 1a–d for the exact numerical values.
3.3. Spatial and temporal variability in the field mother system
The field mother system was investigated in order to get basic information about the
natural spatial and temporal variability of bladder-wrack macrofauna and an idea of
possible consequences for mesocosm experiments. The spatial variability between
neighbouring bladder-wrack plants, differences between sampling locations (Havero¨
1990 and 1992, Utterholm 1991) and differences between years have partly been
reported previously (Kraufvelin, 1998, Table 5 on p. 259). The year-to-year differences
and differences between mother systems are recapitulated in this paper together with
some rough information about seasonal differences, i.e. early summer compared to late
autumn (Table 1). The seasonal variability is, however, better visualised by monthly
samples from Utterholm 1991 plotted out in an NMDS ordination scheme (Figs. 7 and
8), where a continuous temporal change during the year is evident. Although there is a
high degree of spatial variability between replicates, it may be noted that the samples
most similar to the June samples are the ones from November, i.e. the samples most
244
P. Kraufvelin / J. Exp. Mar. Biol. Ecol. 240 (1999) 229 – 258
Table 6
Each species’ contribution (di ) to: (a) the average abundance dissimilarity (data square-root transformed) and
(b) the average biomass dissimilarity (data fourth-root transformed) between every pair of years 1989–1992
Species
di 89–90
di 89–91
di 89–92
di 90–91
di 90–92
di 91–92
(a) Abundance
Cerastoderma
Gammarus
Gasterosteus
Idotea
Lymnaea
Mytilus
Palaemon
Theodoxus
Total d %
1.67
5.05
0.62
6.07
1.79
5.23
0.72
9.36
30.51
2.96
4.95
0.35
4.99
3.97
6.16
0.47
7.31
31.16
7.19
3.81
0.52
4.73
7.75
10.46
0.52
6.40
41.39
3.54
2.28
0.26
4.37
4.74
5.66
0.47
7.44
28.76
8.32
1.66
0.43
1.73
9.09
9.71
0.54
8.89
40.37
6.22
1.63
0.26
3.53
4.42
6.98
0.32
7.01
30.38
(b) Biomass
Cerastoderma
Gammarus
Gasterosteus
Idotea
Lymnaea
Mytilus
Palaemon
Theodoxus
Total d %
3.07
3.13
1.89
5.76
3.79
6.17
2.81
1.87
28.49
3.56
3.88
1.05
5.07
5.85
5.79
2.06
1.59
28.85
4.90
2.99
1.69
5.24
8.43
5.30
2.33
2.16
33.04
4.69
2.84
1.06
4.92
7.32
4.67
2.20
1.83
29.53
7.09
2.07
1.71
3.10
11.07
4.05
2.53
2.62
34.24
4.02
2.45
0.98
4.27
4.16
3.96
1.81
2.27
23.92
distant in time. The samples from July are on the other hand the ones that differ most
from the June samples. This last fact has immediate consequence for transplantation to
mesocosms. The time lag in transplantation present both in 1989 and 1992 is thus an
important explanation for differences between control A and B as well as H and I, i.e.
the parallel mesocosms that differed most from each other.
The results from simulated transplantations to BHB-mesocosms are presented in
Tables 8 and 9. An estimate of the theoretical amount of organisms in the mesocosm
bladder-wrack zone at the start of experiments is received, if the values per 100 g DWT
(|500 ml of volume) in Table 8 are multiplied by 16 (corresponding to the total
bladder-wrack volume of 8 l). Judging from the field data from 1996 some numerically
important taxa (at least initially) like Chironomidae (about 16 500 individuals per
mesocosm), Balanus improvisus (3500), Jaera albifrons (450) and Hydrobia sp. (200)
have most likely been overlooked at the analyses of mesocosm fauna. The initial
variability is estimated for 48 variables by CVs averaged from 10 simulated transplantations (Table 9). These CVs are also compared with a number of real end CVs from the
mesocosms 1989–1992 for evaluation of how much the initial situation determines the
final mesocosm replicability. A closer look at these comparisons indicates that most of
the final variability for Cerastoderma, Mytilus, Palaemon and Theodoxus is determined
already by differences in the transplanted bladder-wrack. The same seems to be true for
Lymnaea and Gasterosteus, but as we will see later, this is not the case. Regarding
P. Kraufvelin / J. Exp. Mar. Biol. Ecol. 240 (1999) 229 – 258
245
Table 7
Each species’ contribution (di ) to the average dissimilarities between mesocosms and the mother system at the
end of experimental periods 1989–1992: (a) square-root transformed abundance data (total d %558.54) and
(b) fourth-root transformed biomass data (total d %544.73)
(a) Abundance
Species
Cerastoderma
Gammarus
Idotea
Lymnaea
Mytilus
Palaemon
Theodoxus
Average
abundance
field
Average
abundance
mesocosm
Average
term
di
Ratio
Percent
Cumulative
percent
0.88
37.90
2.41
0.00
69.10
0.00
5.37
19.11
15.31
13.48
15.24
81.99
0.26
278.85
3.88
6.02
4.58
4.60
8.68
0.45
30.00
0.72
1.23
1.05
0.96
1.36
0.45
2.54
6.63
10.29
7.82
7.86
14.83
0.77
51.25
98.67
76.37
92.04
84.22
66.08
99.44
51.25
Average
biomass
field
0.01
0.73
0.16
0.00
6.63
0.00
0.09
Average
biomass
mesocosm
0.47
0.91
0.85
1.58
12.48
0.19
6.16
4.48
3.35
5.89
6.99
6.64
1.84
14.46
1.06
1.09
1.28
1.09
1.34
0.47
2.66
10.02
7.49
13.16
15.62
14.83
4.12
32.33
85.96
93.45
75.94
47.94
62.78
97.57
32.33
(b) Biomass
Cerastoderma
Gammarus
Idotea
Lymnaea
Mytilus
Palaemon
Theodoxus
Gammarus and Idotea the initial differences seem to be less important as long as no
time lags in transplantation are present.
4. Discussion
4.1. Examining mesocosm performance
4.1.1. Replicability, repeatability and realism
The degree of replicability and repeatability of a mesocosm depends on the
complexity of the system, the endpoints that are measured and the laboratory and field
conditions (seasonality and long-term changes), which determine the development of
systems and the variability over time (Smith, 1995; Crane, 1997; Kraufvelin, 1998).
Replicability and repeatability are both important aspects of mesocosm similarity,
although impossible to control for large-scale artificial ecosystems due to logistical and
economical limitations. Replicability is especially important in order to test whether
deviations from controls observed in treatments are statistically significant by using a
specific number of replicates at specific levels of acceptable statistical error postulated
(Kraufvelin, 1998). The importance of repeatability is in turn most evident when it
246
P. Kraufvelin / J. Exp. Mar. Biol. Ecol. 240 (1999) 229 – 258
Fig. 7. NMDS-ordination of Fucus macrofauna field samples from 1991 after square-root transformation of
abundance data (stress 0.13).
comes to interpretation of results and in dealing with the ecological relevance of results,
i.e. in making predictions for the real-world or in controlling the reliability of results
(Kennedy et al. 1995; Crane, 1997). If the repeatability of a mesocosm is poor, we will
lack important generality of the results (Lamont, 1995) and experimenters cannot tell if
observed deviations are real or due to error introduced by time, space, experimental
method, chance (Crane, 1997) or just a natural quality of the dynamics in the (mother)
ecosystem selected for the test. Thus the possibilities to make real-world predictions of
the results will be restricted (Underwood and Peterson, 1988; Rosenzweig and Buikema,
1994). This is rather ironic, since the search for high ecological realism has led many
scientists to a point where the arguments for replicability and repeatability are sacrificed
(cf. Landner et al., 1989).
P. Kraufvelin / J. Exp. Mar. Biol. Ecol. 240 (1999) 229 – 258
247
Fig. 8. NMDS-ordination of Fucus macrofauna field samples from 1991 after fourth-root transformation of
biomass data (stress 0.13).
The third aspect of mesocosm similarity, the similarity with the mimicked mother
system (ecological or environmental realism), is important for practically the same
reasons as repeatability. If you cannot even repeat a partial ecosystem in containers, how
can you then expect the results to be universally applicable? It may be argued to infinity
about which one of these three aspects that is most important, but even with profound
knowledge of all these aspects of similarity for the experimental ecosystem in question it
may be impossible to carry out reliable predictive risk assessments or extrapolate results
to real ecosystems. According to Perez (1995) accuracy or fidelity with the field should
be first priority, precision, while important, is a secondary consideration. A high degree
of ecological realism is indeed crucial for a study using experimental ecosystems and
aiming at realistic predictive assessments for the real world (not just a study focused on
248
P. Kraufvelin / J. Exp. Mar. Biol. Ecol. 240 (1999) 229 – 258
Table 8
Mean abundance and biomass6S.D. of all animal species present in the 64 bladder-wrack samples from
Havero¨ 1996. The values are calculated per 100 g Fucus DWT. Multiply by 16 to get a theoretical start value
for mesocosms if these samples had been used for transplantation
Species
Number of
samples present
Mean
abundance
6S.D.
Mean
biomass g
6S.D.
Acarina
Balanus improvisus
Cerastoderma sp.
Chironomidae
Cottus gobio
Gammarus spp.
Gasterosteus aculeatus
Hydrobia sp.
Idotea baltica
Idotea chelipes
Jaera albifrons
Lymnaea peregra
Macoma baltica
Mytilus edulis
Nereis diversicolor
Palaemon adspersus
Praunus inernis
Pungitius pungitius
Theodoxus fluviatilis
Tricoptera
Zygoptera
1
43
7
64
1
64
2
35
35
48
48
16
1
38
1
12
13
1
51
2
3
0.06
226.08
0.14
1030.94
0.02
237.66
0.03
14.38
2.26
4.54
28.75
0.28
0.02
68.10
0.02
0.19
0.23
0.02
66.28
0.03
0.05
0.50
371.15
0.54
766.34
0.13
207.39
0.18
23.08
3.51
5.12
35.40
0.56
0.13
143.14
0.13
0.47
0.59
0.13
72.72
0.18
0.21
,0.01
11.78
,0.01
0.32
0.01
0.70
0.01
0.10
0.29
0.10
0.03
0.07
,0.01
2.68
,0.01
0.18
,0.01
,0.01
1.88
,0.01
,0.01
,0.01
20.17
0.01
0.51
0.04
0.57
0.07
0.16
0.46
0.11
0.04
0.15
,0.01
4.69
,0.01
0.53
,0.01
0.03
1.83
,0.01
0.01
what might happen in other mesocosms receiving the same treatment). Without any
information about the repeatability, however, experimenters lack an objective base for
judging the relevance of the results and may come to decisions that lead to environmental (or economical) damage (Crane, 1997). When in turn the replicability is poor only
extreme differences can be detected (Kraufvelin, 1998).
During all these 4 experimental years parallel mesocosms had a very different
macrofauna community structure, i.e. a low degree of replicability, which in this paper
has been shown by both descriptive and analytical multivariate statistical methods and
previously by the generally high mean CVs for a number of variables (Kraufvelin,
1998). This fact has immediate relevance for the ‘pseudoreplicated’ investigations
carried out in these mesocosms using the same control dataset as in this paper (Lehtinen
and Tana, 1992; Lehtinen et al., 1993, 1994; Tana et al. 1994), since if already the
controls can be shown to differ significantly, then how could treatment effects ever be
distinguished from basin effects (‘tankiness’)? In the light of the restricted replicability it
is not surprising that problems with the repeatability will occur as well. In this material
repeatability was poorer than the replicability, i.e. mesocosms run during different years
differed more in community structure than parallel mesocosms. This is by the way a
fundamental demand for the test of repeatability to be significant, since the ANOSIM
procedure here makes use of the observed differences among years (repeated mesocosms) contrasted with differences between replicates within years (individual meso-
P. Kraufvelin / J. Exp. Mar. Biol. Ecol. 240 (1999) 229 – 258
249
Table 9
Estimated mean start CVs and 95% confidence intervals for a number of variables from 10 randomised
transplantations of 64 bladder-wrack specimens to four mesocosms (field data 1996) and real end CVs from
1989–92
Variable
Simulated start
Real results at the end
Mean CV
95% CI
Population variables
Acarina, Cottus, Macoma, Nereis, Pungitius,
abundance and biomass
Balanus abundance
Balanus biomass
Cerastoderma abundance
Cerastoderma biomass
Chironomidae abundance
Chironomidae biomass
Gammarus abundance
Gammarus biomass
Gasterosteus abundance
Gasterosteus biomass
Hydrobia abundance
Hydrobia biomass
Idotea baltica abundance
Idotea baltica biomass
Idotea chelipes abundance
Idotea chelipes biomass
Jaera abundance
Jaera biomass
Lymnaea abundance
Lymnaea biomass
Mytilus abundance
Mytilus biomass
Palaemon abundance
Palaemon biomass
Praunus abundance
Praunus biomass
Theodoxus abundance
Theodoxus biomass
Tricoptera abundance
Tricoptera biomass
Zygoptera abundance
Zygoptera biomass
200.0
44.6
45.0
104.2
96.0
18.0
39.2
21.8
17.1
149.6
150.0
34.8
39.2
31.4
33.2
26.8
30.1
26.9
28.3
64.1
73.1
47.8
41.7
64.3
69.2
57.8
64.0
22.1
21.9
132.8
194.0
97.4
97.3
32.2–57.1
33.0–57.0
91.4–117.1
77.2–114.8
14.8–21.1
30.2–48.1
14.4–29.1
10.6–23.6
118.6–180.6
119.2–180.8
25.8–43.9
29.5–48.9
22.5–40.2
25.7–40.7
15.6–38.0
19.0–41.2
18.7–35.1
20.7–35.9
40.2–88.0
54.1–92.0
34.9–60.7
34.0–49.4
53.2–75.4
52.4–86.0
39.0–76.5
34.0–94.0
16.8–27.4
16.3–27.6
107.5–158.1
191.7–196.3
74.2–120.5
74.3–120.3
Community variables all species
Total abundance
Total biomass
Number of species
Margalefs´ species richness, abundance data
Shannon–Wiener diversity, abundance data
Pielous´ evenness, abundance data
9.4
32.4
12.6
13.8
12.5
12.5
6.5–12.3
23.7–41.1
9.9–15.4
10.6–17.0
8.3–16.7
9.3–15.7
a
b
Idotea baltica and Idotea chelipes were pooled 1989–1992.
Mesocosm values are calculated just for the eight mesocosm species / taxa.
Mean CV
95% CI
112.4
80.9
89.8–135.0
14.4–147.4
54.8
64.4
91.3
89.6
15.9–93.7
27.6–101.2
25.7–156.9
29.4–149.8
104.3 a
76.1 a
104.3 a
76.1 a
79.4–129.2
26.3–125.9
79.4–129.2
26.3–125.9
108.3
96.0
26.4
42.9
113.7
110.1
56.7–159.9
16.9–175.1
5.8–47.0
14.0–71.8
71.0–156.4
57.3–162.9
19.4
22.7
0.8–38.0
7.4–38.0
18.5 b
18.6 b
6.6–30.4
23.2–40.9
b
2.8–27.8
23.7–39.5
24.9–54.5
15.4
31.6 b
39.7 b
250
P. Kraufvelin / J. Exp. Mar. Biol. Ecol. 240 (1999) 229 – 258
cosms) in computing the test statistic, global R. The main problem with poor mesocosm
repeatability lies in the interpretation and application of results. If it, as with the
BHB-mesocosms, seems to be impossible to repeat even a set of controls during
subsequent years using the same or a neighbouring mother system, the same methodology, the same research facility, the same staff and the same equipment, then how can
possible important experimental results ever be controlled by repetition or reproduction
of tests, if necessary? If in addition the mesocosms can be demonstrated to diverge this
strongly from the field, then how could a reliable extrapolation to the real-world ever be
accomplished? The factors restricting replicability and repeatability of BHB-mesocosms
have already been discussed in depth by Kraufvelin (1998), whereas the factors
restricting ecological realism as well as the species principally responsible for the
observed differences will be outlined in this paper.
4.1.2. Statistical considerations
There are a number of important reasons for choosing multivariate statistical analyses
instead of multiple univariate ones (Scheiner, 1993; Clarke and Warwick, 1994; Clarke,
1999). Firstly, ecological questions are often multivariate, involving interactions among
response variables such as complexities of dependencies among species. Differences
among groups are seldom a feature of just one response variable alone, but more often of
an entire suite of variables. An examination of ‘all’ of the data at once may reflect the
nature of ecological structures more accurately (Landis et al., 1997; Kedwards et al.,
1999a,b). Moreover a multivariate approach provides a single global test of significance,
whereas multiple univariate tests inflate the a -value, i.e. increases the probability of
making a Type I error in hypothesis testing, which has to be corrected for. Additional
benefits of multivariate techniques are their ability to summarise large datasets (Van den
Brink and Ter Braak, 1998, 1999; Maund et al., 1999) and for some methods their
conceptual simplicity, which facilitates their use and understanding by environmental
managers and regulators (Clarke, 1999). Non-parametric techniques were chosen in this
paper instead of parametric, since the samples could not be transformed to approximate
(multivariate) normality due to the large number of zeros (Clarke and Warwick, 1994).
The relative importance for the scientific objectives of Type I and Type II errors,
respectively, should always be considered when a study is planned. Then the appropriate
methods for tests of the hypothesis can be selected with emphasis on the type of error to
be careful with. If the risk of Type I errors is emphasised, a correction for multiple
testing should be carried out whenever several tests are performed simultaneously (Day
and Quinn, 1989; Rice, 1989; Sokal and Rohlf, 1995; Underwood, 1997). Such
corrections will affect the minimum detectable differences (MDDs) and minimum
number of replicates (MNRs) dramatically, calling for even larger effect sizes or more
replicates for successful detection of effects.
An investigation of mesocosm performance by non-parametric multivariate methods is
entirely different from the univariate one-way ANOVA approach in Kraufvelin (1998).
ANOSIM is a multivariate distribution free method, comparable to the univariate
Mann–Whitney–Wilcoxon or Kruskall–Wallis tests. This means that the rank similarities in community structure determine if there will be significant differences between
controls and treatments in the permutation tests. The exact degree of variability between
P. Kraufvelin / J. Exp. Mar. Biol. Ecol. 240 (1999) 229 – 258
251
replicates is not of primary importance and it is much more difficult to calculate for
example the minimum number of replicates (MNRs) needed for successful use of these
methods based on variability measures and effect size levels. Detection of differences
based on abundances and biomasses of several animal populations, as in this paper, may
of course be extra difficult to handle due to the large temporal variances of many
populations and the marked lack of concordance from one place to another (Osenberg et
al., 1996; Underwood, 1996).
4.2. What decreased mesocosm performance?
4.2.1. Mesocosm replicability
The main factors decreasing the replicability of BHB-mesocosms have already been
outlined by Kraufvelin (1998). Three major categories were distinguished: (1) initial
variability between transplanted bladder-wrack communities; (2) errors in experimental
design and execution, and (3) chance events or normal divergence of mesocosms.
Although an attempt to look at the initial variability in the mesocosms has been made in
this paper (Tables 8 and 9) much more research would be needed with regard to this
topic. We still have very vague ideas of the initial and inherent variability among
experimental ecosystems in general and how representative the initial conditions are to
those occurring in nature and how they shape the final results of an experiment (Boyle
and Fairchild, 1997). All eight species analysed were subjected to initial differences in
the transplanted communities (Table 9), but these differences may just have been
marginal for Lymnaea and Gasterosteus, which both were very rare in the field samples
taken in connection with transplantation (Kraufvelin, 1998). Most of the pond snails and
stickle-backs present at the end originated from separately transplanted individuals (in
known, equal numbers). Therefore the final differences for these groups are probably
caused by chance events affecting for example the reproductive success or the juvenile
survival.
The mass occurrences of Gammarus and Idotea in A compared to B 1989, as well as
Cerastoderma in H compared to I 1992, are in turn the most evident examples of errors
in experimental design and execution. Due to a time lag in transplantation of bladderwrack communities of several weeks, much more individuals of these three species
entered the former pools to which transplantation was accomplished later. The
seasonality of the bladder-wrack macrofauna community is known to be strong in the
¨
Baltic Sea (Haage, 1975; Fagerholm, 1978; Kolding, 1981; Anders and Moller,
1983;
Figs. 7 and 8). The close similarity of samples from June and November in Figs. 7 and 8
is probably due to the high degree of similarity in many important fauna structuring
variables (e.g. low water temperatures and a scarce epiphytic vegetation). Mobile snails
and juvenile crustaceans were almost absent in June and November, whereas these
taxonomic groups were rather dominant during the other months, including a rapid
change from June to July. These results imply that mesocosms meant to be parallel
should always be started simultaneously in order for them to contain a sufficiently
similar biological material for further meaningful analyses.
Idotea was in 1991 probably also affected by chance events like differences in
survival rate of the green alga, Cladophora sp., which provides essential food and
252
P. Kraufvelin / J. Exp. Mar. Biol. Ecol. 240 (1999) 229 – 258
shelter for juvenile crustaceans. In F, where Cladophora was present for a longer period,
Idotea was markedly more abundant than in G, where the green alga died after just a
few weeks. Kraufvelin (1998) listed a number of other factors with the possibility to
cause divergence of mesocosms. Out of these factors the random occurrence of
Spirogyra sp. in three of the controls, B 1989 and D and E 1990, and the possible
presence of additional fish predators may have been of most importance for the
bladder-wrack macrofauna.
4.2.2. Mesocosm repeatability
Repeatability of mesocosms (year-to-year similarity) was poor partly due to the poor
replicability (1–3 above) and partly due to other reasons. The repetition of BHBmesocosms was also affected by: (4) variations between years and seasonality, which
prevents a transplantation of similar start communities during subsequent years; (5)
differences in weather conditions during the treatment periods during different years; (6)
the fact that bladder-wrack specimens were gathered from two different mother systems
(Kraufvelin, 1998). The last factor does not seem to contribute to much of the poor
repeatability, however, since 1991, the year when the bladder-wrack samples occasion¨ has the
ally were gathered from another mother system (Utterholm instead of Havero),
lowest mean total dissimilarities (i.e. lowest mean values of all pair-wise abundance or
biomass comparisons) in Table 3 of all individual years. For abundance data the mean
dissimilarities (n53) are: 1989534.35, 1990533.21, 1991530.10 and 1992537.38;
whereas for biomass data the mean dissimilarities are: 1989530.13, 1990530.75,
1991527.43 and 1992530.40. These differences are not crucial, but at least they
demonstrate that the mesocosms in 1991 were not at the extreme end in the between
year comparisons. In the ordination maps the samples from 1991 also cluster out in the
middle (Figs. 1 and 2). The temporal variations, both in the mother system before
transplantation and in the mesocosms after transplantation thus seem to be of much
greater importance in this material (together with the factors decreasing replicability).
A closer look at the individual species does not reveal very much about the causes of
poor repeatability, especially the possible impact of chance events is hard to evaluate in
this perspective. The initial variability between transplanted bladder-wrack communities,
natural or due to year to year differences, seems anyway to have been the major cause.
This is also indicated by the field samples taken in connection with transplantation to
mesocosms 1990–1992 (Kraufvelin, 1998), although according to Kautsky (1991)
fluctuations between years do not seem to be that pronounced in the Baltic Sea as for
example in the North Sea. Differences in the weather conditions during the experimental
period during different years may also have been important for certain species such as
Lymnaea and Theodoxus, whereas the differences for Gammarus (1989 compared to
other years) and Cerastoderma (1992 compared to other years) mainly were due to the
fact that one of the mesocosms each year had shown extreme values because of
transplantation errors.
4.2.3. Ecological realism of the mesocosm
The ecological realism was affected by a variety of abiotic and biotic factors (known
or unknown). It is impossible to simulate the abiotic environment exactly in an artificial
P. Kraufvelin / J. Exp. Mar. Biol. Ecol. 240 (1999) 229 – 258
253
ecosystem. Containerisation always means loss of natural refuges, isolation of an
environment can result in artefacts caused by the physical containment devices and the
reduced size cause other restrictions that limit our ability to relate mesocosm results to
natural systems (Perez, 1995; Carpenter, 1996; Chen et al., 1997; Schindler, 1998).
Abiotic factors reducing the ecological realism of mesocosms include: wall effects
(Strickland, 1967; Giesy and Odum, 1980; Lundgren, 1985), differences in water
exchange rate and more or less accompanying differences in water temperature
fluctuations (Notini et al., 1977), smaller turbulence than in nature (Steele et al., 1977;
Davies and Gamble, 1979; Alcaraz et al., 1988; Kotak and Robinson, 1991; Howarth et
al., 1993), as well as lack of large predators, differences in the habitat structure and
differences in spatial scaling (Schindler, 1998). The spatial scale affects the physical
models in two major ways: species inclusion or exclusion, and potential artefacts
associated with containerisation (Perez, 1995). An understanding of both fundamental
scaling effects and direct consequences of the enclosure is crucial to the comparative
analysis of processes among ecosystems, and to extrapolation of results from experimental to natural ecosystems (Petersen et al., 1997).
The importance of biotic factors, relative to the abiotic, for structuring the mesocosm
fauna is not easily established. According to Kautsky and Van der Maarel (1990) abiotic
factors such as light, bottom type and wave exposure determine the zonation pattern in
the Baltic Sea, while biotic interactions are said to be of less importance. The biotic part
of the artificial ecosystem will on the other hand always deviate from the mother system
because of the possible taxonomic and structural simplicity and the differences in the
physical and chemical environment causing differences in fundamental ecological
processes (Schindler, 1998). Differences between species and individuals in their
capability to adapt to an artificial ecosystem, i.e. whether organisms that thrive in
mesocosms are representative of those that do not (Lawton, 1995), further complicate
the picture.
In the field Theodoxus fluviatilis has normally moved down to deeper waters in
November (Skoog, 1971; Wallentinus, 1991), while such migrations never can be any
alternative in the mesocosms. Thus unrealistically high abundances and biomasses of
this species could be demonstrated in the mesocosms in autumn. The pond snail,
Lymnaea, is subjected to similar prevented migration as Theodoxus, but the similarity
with the field for this species suffers in addition from the fact that Lymnaea never
reaches such high abundances and biomasses in the mother system as the snail did in the
stagnant ‘pond-like’ mesocosms. Another evident difference between the field and the
mesocosms is that the reproduction of many species is hampered or prevented in the
mesocosms (e.g. Idotea, Mytilus and Gammarus). There are just a small amount of
juveniles of these species present in the mesocosms, probably because of the lack of the
green algal belt, which serves as a nursery ground by offering food and protection
(Jansson, 1967, 1970; Fagerholm, 1975; Haage, 1975; Kangas et al. 1982) and severe
nutrient limitation caused by the low water flow-rate. Mytilus-larvae, both those present
in the incoming water and those descended from the mesocosm adults, may also fail to
settle because of the low degree of water movements, which normally serves as a
stimulus to settling (Bakke, 1990) and the larvae may thus just be flushed out from the
systems.
254
P. Kraufvelin / J. Exp. Mar. Biol. Ecol. 240 (1999) 229 – 258
4.3. Concluding remarks
A thorough investigation of mesocosm performance is a challenging task, but due to
the large amount of resources necessary for its implementation, it is very hard to verify
mesocosm replicability, repeatability and realism in a satisfying way. Therefore it is easy
to understand that most data for such studies are more or less achieved as ‘by-products’
from other experiments with different scopes. According to Crane (1997) priorities for
further research into mesocosms and microcosms should be the extent to which results
can be repeated and reproduced, and the level of precision and accuracy of predictions
from model ecosystems to natural ecosystems. Without empirical evidence of the
repeatability and predictive ability of results from a model ecosystem study, a risk
assessor has no objective base for evaluation of the data and may come to decisions that
are potentially disastrous for the environment. Ten years ago Pilson (1990) stressed the
need for investigations of the ecological realism of mesocosms and pointed out that he
had not found a completely satisfactory comparison of the behaviour of any mesocosm
with that of the ecosystem from which it was derived. Still today his concern can only be
repeated.
A restricted degree of replicability is much easier dealt with than a poor repeatability
or a questionable realism. Of course replicates will never be and do not have to be
identical. Replicability just reflects the variability between whatever is being sampled
and the best action to prepare for future successful statistical analyses is simply to try to
increase the number of replicates. If the effects of the studied disturbances (also low
level) in turn are large and beyond any doubt, we do not have much of a problem. This
is definitely not the case for BHB-mesocosms, as will be proved by a follow-up paper on
pollutants tested 1989–1992 (Kraufvelin, unpublished data). Regarding repeatability and
realism experimenters have already lost so much in control compared to just dealing
with replicability, mostly due to the addition of inter-annual variability and inclusion of
natural field processes to the already present basin-to-basin variability and basin effects,
that the problems easily fall out of hand. Therefore more research into mesocosm
repeatability and realism is definitely needed, although such investigations ultimately
might show that currently popular model ecosystem approaches, experimental designs or
measurement endpoints are incapable of producing sufficiently accurate and precise
results for ecological risk assessors (Crane, 1997). Many measurement endpoints may
for example be so heavily dependent upon initial conditions, such as time of the start of
the treatment (Jenkins and Buikema, 1990; Winner et al., 1990), successional stage of
the system (Taub et al., 1991) or chance events (Kraufvelin, 1998) that experimental
results are unpredictable. At such occasions it would make little sense to proceed to the
stage of predictive validation, and these test systems, methods or endpoints should be
abandoned (Crane, 1997).
Acknowledgements
I am indebted to Anders Isaksson for his always critical comments on my manuscripts
¨
and for excellent statistical advice. I also thank Ismo Holopainen, Erkki Leppakoski,
and
P. Kraufvelin / J. Exp. Mar. Biol. Ecol. 240 (1999) 229 – 258
255
Paul Somerfield for valuable comments on the text and two anonymous referees, who
repeatedly reviewed the paper and helped me to improve it. Tove Holm assisted in the
field and carried out analyses of macrofauna samples in 1996, which is gratefully
acknowledged. The paper is a (late?) contribution to the late Graduate School
‘Integrated Aquatic Hazard Assessment’ established and abandoned by the Academy of
Finland and the Finnish Ministry of Education. Societas Pro Fauna et Flora Fennica
provided financial support for the field work in 1996.
References
Adey, W.H., Loveland, K., 1991. Dynamic Aquaria: Building Living Ecosystems, Academic Press, San Diego,
pp. 1–643.
´ C., Vaque, D., 1988. Effects of turbulence on the development of phytoplankton
Alcaraz, M., Saiz, E., Marrase,
biomass and copepod populations in marine microcosms. Mar. Ecol. Prog. Ser. 49, 117–125.
¨
Anders, K., Moller,
H., 1983. Seasonal fluctuations in macrobenthic fauna of the Fucus belt in Kiel Fjord
(western Baltic Sea). Helgol. Meeresunters. 36, 277–283.
Bakke, T., 1990. Benthic mesocosms: II. Basic research in hard-bottom benthic mesocosms. In: Lalli, C.M.
(Ed.), Enclosed Experimental Marine Ecosystems: A Review and Recommendations. Coastal and Estuarine
Studies, Vol. 37, Springer-Verlag, Heidelberg, pp. 122–135.
Boyle, T.P., Fairchild, J.F., 1997. The role of mesocosm studies in ecological risk analysis. Ecol. Appl. 7,
1099–1102.
Brazner, J.C., Heinis, L.J., Jensen, D.A., 1989. A littoral enclosure for replicated field experiments. Environ.
Toxicol. Chem. 8, 1209–1216.
Cairns, J., McCormick, P.V., 1991. The use of community- and ecosystem-level end points in environmental
hazard assessment: a scientific and regulatory evaluation. Environ. Auditor 2, 239–248.
Carpenter, S.R., 1996. Microcosm experiments have limited relevance for community and ecosystem ecology.
Ecology 77, 677–680.
Carr, M.R., 1996. In: PRIMER User Manual, Ver 4.0, Plymouth Marine Laboratory, Plymouth, UK, pp. 1–42.
Chen, C.-C., Petersen, J.E., Kemp, W.M., 1997. Spatial and temporal scaling of periphyton growth on walls of
estuarine mesocosms. Mar. Ecol. Prog. Ser. 155, 1–15.
Clark, J.R., Cripe, C.R., 1993. Marine and estuarine multi-species test systems. In: Calow, P. (Ed.), Handbook
of Ecotoxicology, Vol. I, Blackwell Scientific, Oxford, pp. 227–247.
Clarke, K.R., 1993. Non-parametric multivariate analyses of changes in community structure. Aust. J. Ecol. 18,
117–143.
Clarke, K.R., 1999. Nonmetric multivariate analysis in community-level ecotoxicology. Environ. Toxicol.
Chem. 18, 118–127.
Clarke, K.R., Green, R.H., 1988. Statistical design and analysis for a ‘biological effects’ study. Mar. Ecol.
Prog. Ser. 46, 213–226.
Clarke, K.R., Warwick, R.M., 1994. In: Changes in Marine Communities: an Approach To Statistical Analysis
and Interpretation, Plymouth Marine Laboratory, Plymouth, UK, pp. 1–144.
Crane, M., 1997. Research needs for predictive multispecies tests in aquatic toxicology. Hydrobiologia 346,
149–155.
Crossland, N.O., La Point, T.W., 1992. The design of mesocosm experiments. Environ. Toxicol. Chem. 11,
1–4.
Davies, J.M., Gamble, J.C., 1979. Experiments with large enclosed ecosystems. Phil. R. Soc. Lond. B 286,
523–544.
Day, R.W., Quinn, G.P., 1989. Comparison of treatments after an analysis of variance in ecology. Ecol.
Monogr. 59, 433–463.
˚
Fagerholm, H.P., 1975. The effects of ferry traffic on the rocky shore macrofauna in the southern Aland
archipelago: 1. The Cladophora zone. Merentutkimuslait. Julk. / Havsforskningsinst. Skr. 239, 331–337.
256
P. Kraufvelin / J. Exp. Mar. Biol. Ecol. 240 (1999) 229 – 258
Fagerholm, H.P., 1978. The effects of ferry traffic (artificial wave action) on the rocky shore macrofauna in the
˚
southern Aland
archipelago in the northern Baltic: 2. The Fucus zone (a quantitative study). Kieler
Meeresforsch. Sonderheft 4, 130–137.
Fairweather, P.G., 1991. Statistical power and design requirements for environmental monitoring. Aust. J. Mar.
Freshwater Res. 42, 555–567.
Gamble, J.C., 1990. Mesocosms: statistical and experimental design considerations. In: Lalli, C.M. (Ed.),
Enclosed Experimental Marine Systems: A Review and Recommendations. Coastal and Estuarine Studies,
Vol. 37, Springer-Verlag, Heidelberg, pp. 188–196.
Gearing, J.N., 1989. The role of aquatic microcosms in ecotoxicologic research as illustrated by large marine
systems. In: Levin, S.A., Harwell, M.A., Kelly, J.R., Kimball, K.D. (Eds.), Ecotoxicology: Problems and
Approaches, Springer-Verlag, Heidelberg, pp. 188–196.
Giesy, J.P., Odum, E.P., 1980. Microcosmology: introductory comments. In: Giesy, J.P. (Ed.), Microcosms in
Ecological Research. Symposium Series 52 (Conf-781101), US Department of Energy, Washington DC, pp.
1–13.
Giesy, J.P., Allred, P.M., 1985. Replicability of aquatic multispecies test systems. In: Cairns, J. (Ed.),
Multispecies Toxicity Testing, Pergamon Press, New York, pp. 18–24.
Haage, P., 1975. Quantitative investigations of the Baltic Fucus belt macrofauna. 2. Quantitative seasonal
fluctuations. In: Contr. Asko¨ Lab. No. 9, University of Stockholm, pp. 1–88.
Heimbach, F., Pflueger, W., Ratte, H.-T., 1992. Use of small artificial ponds for assessment of hazards to
aquatic ecosystems. Environ. Toxicol. Chem. 11, 27–34.
Holm, S., 1979. A simple sequentially rejective multiple test procedure. Scand. J. Stat. 6, 65–70.
Howarth, R.W., Butler, T., Lunde, K., Swaney, D., Chu, C.R., 1993. Turbulence and planktonic nitrogen
fixation — a mesocosm experiment. Limnol. Oceanogr. 38, 1696–1711.
Hurlbert, S.H., 1984. Pseudoreplication and the design of ecological field experiments. Ecol. Monogr. 54,
187–211.
¨
Jansson, A.-M., 1967. The food-web of the Cladophora-belt fauna. Helgolander
Wiss. Meeresunters. 15,
574–588.
Jansson, A.-M., 1970. Production studies in the Cladophora-belt. Thalassia Jugosl. 6, 143–155.
Jenkins, D.G., Buikema, A.L., 1990. Response of a winter plankton food web to simazine. Environ. Toxicol.
Chem. 9, 693–705.
Jenkins, D.G., Buikema, A.L., 1998. Do similar communities develop in similar sites? A test with zooplankton
structure and function. Ecol. Monogr. 68, 421–443.
˚ Salemaa, H., 1982. A general model of the decline
¨
Kangas, P., Autio, H., Hallfors,
G., Luther, H., Niemi, A.,
¨
of Fucus vesiculosus at Tvarminne,
south coast of Finland in 1977–81. Acta Bot. Fennica 118, 1–27.
Kautsky, H., 1991. Influence of eutrophication on the distribution of phytobenthic plant and animal
communities. Int. Rev. Ges. Hydrobiol. 76, 423–432.
Kautsky, H., Van der Maarel, E., 1990. Multivariate approaches to the variation in phytobenthic communities
and environmental vectors in the Baltic Sea. Mar. Ecol. Prog. Ser. 60, 169–184.
Kautsky, H., Kautsky, L., Kautsky, N., Kautsky, U., Lindblad, C., 1992. Studies on the Fucus vesiculosus
community in the Baltic Sea. Acta Phytogeogr. Suec. 78, 33–48.
Kedwards, T.J., Maund, S.J., Chapman, P.F., 1999a. Community level analysis of ecotoxicological field
studies: I. Biological monitoring. Environ. Toxicol. Chem. 18, 149–157.
Kedwards, T.J., Maund, S.J., Chapman, P.F., 1999b. Community level analysis of ecotoxicological field
studies: II. Replicated-design studies. Environ. Toxicol. Chem. 18, 158–166.
Kennedy, J.H., Johnson, Z.B., Wise, P.D., Johnson, P.C., 1995. Model aquatic ecosystems in ecotoxicological
research: considerations of design, implementation, and analysis. In: Hoffman, D.J., Rattner, B.A., Burton,
G.A., Cairns, J. (Eds.), Handbook of Ecotoxicology, Lewis Publishers, Boca Raton, FL, pp. 117–161.
Kolding, S., 1981. Habitat selection and life cycle characteristics of five species of the amphipod genus
Gammarus in the Baltic. Oikos 37, 173–178.
Kotak, B.G., Robinson, G.G.C., 1991. Artificially-induced water turbulence and the physical and biological
features within small enclosures. Arch. Hydrobiol. 122, 335–349.
Kraufvelin, P., 1998. Model ecosystem replicability challenged by the ‘soft’ reality of a hard bottom
mesocosm. J. Exp. Mar. Biol. Ecol. 222, 247–267.
Kruskal, J.B., Wish, M., 1978. Multidimensional Scaling, Sage Publications, Beverly Hills.
P. Kraufvelin / J. Exp. Mar. Biol. Ecol. 240 (1999) 229 – 258
257
Kuiper, J., Brockmann, U.H., van het Groenewoud, H., Hoorsman, G., Hammer, K.D., 1983. Influences of bag
dimensions on the development of enclosed plankton communities during POSER. Mar. Ecol. Prog. Ser. 14,
9–17.
de Lafontaine, Y., Leggett, W.G., 1987. Evaluation of in situ enclosures for larval fish studies. Can. J. Fish.
Aquat. Sci. 44, 54–65.
Lalli, C.M. (Ed.), 1990. Enclosed Experimental Marine Ecosystems: A Review and Recommendations. Coastal
and Estuarine Studies, Vol. 37, Springer-Verlag, Heidelberg, pp. 1–218.
Lamont, B.B., 1995. Testing the effect of ecosystem composition / structure on its functioning. Oikos 74,
283–295.
Landis, W.G., Matthews, R.A., Matthews, G.B., 1997. Design and analysis of multispecies toxicity tests for
pesticide registration. Ecol. Appl. 7, 1111–1116.
Landner, L., Blanck, H., Heyman, U., Lundgren, A., Notini, M., Rosemarin, A., Sundelin, B., 1989.
Community testing, microcosm and mesocosm experiments: ecotoxicological tools with high ecological
realism. In: Landner, L. (Ed.), Chemicals in the Aquatic Environment — Advanced Hazard Assessment,
Springer-Verlag, Heidelberg, pp. 216–254.
Lawton, J.H., 1995. Ecological experiments with model systems. Science 269, 328–331.
Lehtinen, K.-J., Tana, J., 1992. Effects in mesocosms exposed to effluents from bleached hardwood kraft pulp
mill. Publ. Fin. Natl. Board Water Environ. Adm. Series A 105, 1–146.
¨
¨
Lehtinen, K.-J., Tana, J., Mattsson, K., Hardig,
J., Karlsson, P., Grotell, C., Hemming, S., Engstrom,
C.,
Hemming, J., 1993. Ecological impact of pulp mill effluents. Publ. Fin. Natl. Board Water Environ. Adm.
Series A 133, 1–139.
¨
¨
¨
Lehtinen, K.-J., Tana, J., Mattsson, K., Grotell, C., Backstrom,
S., Engstrom,
C., Hemming, J., 1994. In:
Effects in Model Ecosystems of Total Mill Effluents, Bleach Plant Effluents and Subprocess Streams from
Production of ECF and TCF Pulp. MFG-report, Finnish Environmental Research Group, Esbo, pp. 1–82.
¨
Lehtinen, K.-J., Tana, J., Mattson, K., Engstrom,
C., Grotell, C., Vartiainen, T., 1995. In: Kalanviljelysta¨
¨ ¨
¨
¨
aiheutuvat ymparistovaikutukset
seka¨ ekologinen perusta niiden vahentamiseksi
(Environmental effects of
fish farming and the ecological foundation for their decrease), Vol. 17, Kala-ja Riistahallinnon Julk,
Helsinki, pp. 1–47, (in Finnish).
Lehtinen, K.-J., Mattsson, K., Tana, J., Grotell, C., 1996. Light energy as a cause of structural and functional
variations in brackish water littoral mesocosms used in ecotoxicological research. Boreal Environ. Res. 33,
71–83.
¨
Lehtinen, K.-J., Mattsson, K., Tana, J., Grotell, C., Engstrom,
C., 1998. Effects on ecosystem structure and
function of fish farming as simulated in littoral brackish water mesocosms. Aquaculture 165, 179–202.
Lundgren, A., 1985. Model ecosystems as a tool in freshwater and marine research. Arch. Hydrobiol. 70,
157–196.
Mapstone, B.D., 1996. Scalable decision criteria for environmental impact assessment: effect size, Type I and
Type II errors. In: Schmitt, R.J., Osenberg, C.W. (Eds.), Detecting Ecological Impacts: Concepts and
Applications in Coastal Habitats, Academic Press, San Diego, pp. 67–80.
Maund, S.J., Chapman, P.F., Kedwards, T.J., Tattersfield, L., Matthiessen, P., Warwick, R.M., Smith, E.P.,
1999. Application of multivariate statistics to ecotoxicological field studies. Environ. Toxicol. Chem. 18,
111–112.
˚ Grahn, O., 1977. An outdoor model simulating a Baltic Sea littoral
¨
Notini, M., Nagell, B., Hagstrom,
A,
ecosystem. Oikos 28, 2–9.
Osenberg, C.W., Schmitt, R.J., Holbrook, S.J., Abu-Saba, K.E., Flegal, A.R., 1996. Detection of environmental
impacts: natural variability, effect size, and power analysis. In: Schmitt, R.J., Osenberg, C.W. (Eds.),
Detecting Ecological Impacts: Concepts and Applications in Coastal Habitats, Academic Press, San Diego,
pp. 83–108.
Perez, K.A., 1995. Role and significance of scale to ecotoxicology. In: Cairns, J., Niederlehner, B.R. (Eds.),
Ecological Toxicity Testing — Scale, Complexity and Relevance, Lewis Publishers, CRC Press, Boca
Raton, pp. 49–72.
Peterman, R.M., 1990. Statistical power analysis can improve fisheries research and management. Can. J. Fish.
Aquat. Sci. 47, 2–15.
Petersen, J.E., Chen, C.-C., Kemp, W.M., 1997. Scaling aquatic primary productivity: experiments under
nutrient- and light-limited conditions. Ecology 78, 2326–2338.
258
P. Kraufvelin / J. Exp. Mar. Biol. Ecol. 240 (1999) 229 – 258
Pilson, M.E.Q., 1990. Application of mesocosms for solving problems in pollution research. In: Lalli, C.M.
(Ed.), Enclosed Experimental Marine Systems: A Review and Recommendations. Coastal and Estuarine
Studies, Vol. 37, Springer-Verlag, Heidelberg, pp. 155–168.
Pilson, M.E.Q., Oviatt, C.A., Nixon, S.W., 1980. Annual nutrient cycles in a marine microcosm. In: Giesy, J.P.
(Ed.), Microcosms in Ecological Research, Technical Information Center, Springfield, VA, pp. 753–778.
Rice, W.R., 1989. Analyzing tables of statistical tests. Evolution 43, 223–225.
Rosenzweig, M.S., Buikema, A.L., 1994. Phytoplankton colonization and seasonal succession in new
experimental ponds. Environ. Toxicol. Chem. 13, 599–605.
Scheiner, S.M., 1993. MANOVA: multiple response variables and multispecies interactions. In: Scheiner, S.M.,
Gurevitch, J. (Eds.), Design and Analysis of Ecological Experiments, Chapman and Hall, New York, pp.
94–112.
Schindler, D.W., 1998. Replication versus realism: the need for ecosystem-scale experiments. Ecosystems 1,
323–334.
SETAC-Europe, 1991. Guidance document on testing procedures for pesticides in freshwater mesocosms. In:
From the Workshop: A Meeting of Experts On Guidelines For Static Field Mesocosm Tests, 3–4 July 1991,
Monks Wood Experimental Station, Abbots Ripton, Huntingdon, UK, pp. 1–46.
Skoog, G., 1971. Variations in the distribution of Theodoxus fluviatilis on the stony localities in the northern
Baltic proper. Thalassia Jugosl. 7, 363–372.
Smith, E.P., 1995. Design and analysis of multispecies experiments. In: Cairns, J., Niederlehner, B.R. (Eds.),
Ecological Toxicity Testing — Scale, Complexity and Relevance, Lewis Publishers, CRC Press, Boca
Raton, pp. 73–95.
Sokal, R.R., Rohlf, F.J., 1995. Biometry, 3rd ed, W.H. Freeman, New York, pp. 1–887.
Steele, J.H., 1979. The uses of experimental ecosystems. Phil. R. Soc. Lond. B 286, 583–595.
Steele, J.H., Farmer, D.M., Henderson, E.W., 1977. Circulation and temperature structure in large marine
enclosures. J. Fish. Res. Board. Can. 34, 1095–1104.
Stephenson, G.L., Hamilton, P., Kaushik, N.K., Robinson, J.B., Solomon, K.R., 1984. Spatial distribution of
plankton in enclosures of three sizes. Can J. Fish. Aquat. Sci. 41, 1048–1054.
Strickland, J.D.H., 1967. Between beakers and bays. New Sci. 33, 276–278.
Takahashi, M., Thomas, W.H., Seibert, D.L.R., Beers, J., Koeller, P., Parsons, T.R., 1975. The replication of
biological events in enclosed water columns. Arch. Hydrobiol. 76, 5–23.
¨
Tana, J., Rosemarin, A., Lehtinen, K.-J., Hardig,
J., Grahn, O., Landner, L., 1994. Assessing impacts on Baltic
coastal ecosystems with mesocosm and fish biomarker tests: a comparison of new and old wood pulp
bleaching technologies. Sci. Total Environ. 145, 213–234.
Taub, F.B., Kindig, A.C., Meador, J.P., Swartzman, G.L., 1991. Effects of ‘seasonal succession’ and grazing
on copper toxicity in aquatic microcosms. Verh. Int. Ver. Limnol. 24, 2205–2214.
Underwood, A.J., 1996. On beyond BACI: sampling designs that might reliably detect environmental
disturbances. In: Schmitt, R.J., Osenberg, C.W. (Eds.), Detecting Ecological Impacts: Concepts and
Applications in Coastal Habitats, Academic Press, San Diego, pp. 151–175.
Underwood, A.J., 1997. Experiments in Ecology: Their Logical Design and Interpretation Using Analysis of
Variance, Cambridge University Press, Cambridge, UK, pp. 1–504.
Underwood, A.J., Peterson, C.H., 1988. Towards an ecological framework for investigating pollution. Mar.
Ecol. Prog. Ser. 46, 227–234.
Van den Brink, P.J., Ter Braak, C.J.F., 1998. Multivariate analysis of stress in experimental ecosystems by
Principal Response Curves and similarity analysis. Aquat. Ecol. 32, 163–178.
Van den Brink, P.J., Ter Braak, C.J.F., 1999. Principal response curves: analysis of time-dependent multivariate
responses of biological community to stress. Environ. Toxicol. Chem. 18, 138–148.
Wallentinus, I., 1991. The Baltic Sea Gradient. In: Niemhuist, D.H., Mathieson, A.C. (Eds.), Intertidal and
Littoral Ecosystems. Ecosystems of the World, Vol. 24, Elsevier Science, Amsterdam, pp. 83–108.
Winner, R.W., Owen, H.A., Moore, M.V., 1990. Seasonal variability in the sensitivity of freshwater lentic
communities to a chronic copper stress. Aquat. Toxicol. 17, 75–92.