spatial modelling of freshwater fish in semi

RIVER RESEARCH AND APPLICATIONS
River Res. Applic. 18: 123–136 (2002)
Published online in Wiley InterScience
(www.interscience.wiley.com). DOI: 10.1002/rra.638
SPATIAL MODELLING OF FRESHWATER FISH IN SEMI-ARID RIVER
SYSTEMS: A TOOL FOR CONSERVATION
A. F. FILIPE,a I. G. COWXb and M. J. COLLARES-PEREIRAa*
a
Centro de Biologia Ambiental/Departamento de Zoologia e Antropologia, Faculdade de Ciências, 1749-016 Lisboa, Portugal
b Hull International Fisheries Institute, University of Hull, HU6 7RX, UK
ABSTRACT
This paper examines the feasibility of using multivariate statistics to model fish species distribution and habitat requirements for intermittent streams in semi-arid regions, many of which are coming under increasing pressure from water
resource development schemes. The assessment was based on the geographical distribution of six endemic fish species in
the Guadiana river, a semi-arid river system in southern Iberia. Their presence was related to 20 environmental variables
linked to climate, geomorphology, riparian vegetation and location in the drainage basin. These variables were collected
in the field or from topographical maps to evaluate habitat suitability and to predict the presence of the species according
to season. Multivariate logistic regression in a geographic information system (GIS) environment was performed to identify regions with high probability of occurrence for each species. The variables that best explained the occurrence of
the species were the sample location in the drainage basin, the geomorphology and the riparian vegetation. The models
presented have a high predictive power and can be used in monitoring and predicting temporal changes caused by human
activities. This modelling approach can be used to predict the areas that need to be conserved to protect or rehabilitate
the endangered species. Armed with this information, managers can formulate conservation measures to prevent further
degradation of the stocks and possibly enhance the populations. Copyright  2002 John Wiley & Sons, Ltd.
KEY WORDS:
modelling distribution; landscape variables; freshwater fish; conservation; intermittent streams
INTRODUCTION
Intermittent river systems are characteristic of many semi-arid regions, and they are coming under increasing
pressure from water resource development schemes. As a result, there is growing concern about conservation
of the ecological integrity of the systems being targeted. Unfortunately, there is a paucity of information
about the ecology of these rivers, and the factors that regulate the distribution and abundance of the flora
and fauna (Davies et al., 1994). This information is crucial if the fish communities are to be managed from
a catchment-wide perspective.
The use of multivariate statistics to model species distribution and habitat requirements has increased in
the past twenty years with a wide variety of techniques. In particular, regression models have been used
widely to predict species distribution, abundance and habitat preferences (e.g. Walker, 1990; Pereira and
Itami, 1991; Bustamante, 1997; Monkkonen et al., 1997; Massolo and Meriggi, 1998; Brito et al., 1999;
Mladenoff et al., 1999). When linked to the geographic information system (GIS) environment it has been
applied for mapping ecological factors (Buckland and Elston, 1993), but studies on freshwater fish are still
very rare (e.g. Evans et al., 1998; Torgersen et al., 1999). As predicted by Margalef (1968), presence/absence
data allow the detection of macro-scale patterns in community ecological studies.
The aims of this study were to assess the feasibility of using multivariate logistic regression in a GIS
environment to quantify the macro-scale factors affecting fish habitat use and habitat suitability in semi-arid
river systems. The assessment was based on a case study in the Guadiana River in Portugal. It is a typical
intermittent system of southern Iberia, with high intra- and inter-annual flow variability, facing considerable
* Correspondence to: M. J. Collares-Pereira, Centro de Biologia Ambiental/Departamento de Zoologia e Antropologia, Faculdade de
Ciências, 1749-016 Lisboa, Portugal. E-mail: [email protected]
Copyright  2002 John Wiley & Sons, Ltd.
Received 20 September 2000
Revised 25 January 2001
Accepted 25 January 2001
124
A. F. FILIPE, I. G. COWX, AND M. J. COLLARES-PEREIRA
changes from water resource schemes and other human activities (Collares-Pereira et al., 1998, 2000). Many
of the native freshwater fish of this river system are endemic, and their conservation status is of great
concern. Consequently, there is an urgent need to assess the distribution of the endemic species and the
factors responsible for regulating their occurrence in different parts of the catchment. Furthermore, if the fish
communities are to be conserved, there is a need for information on macro-scale factors affecting distribution
in such river systems.
METHODS
Study area
The lower Guadiana River Basin is in southern Portugal. The area is delimited in the north by Portalegre
and in the south by the mouth of the main river, near Vila Real de Santo António. It has an area of 11 700 km2
(17% of the overall catchment area) and is located at latitude 37° to 39° 30 N and longitude 7° to 8° W. The
Guadiana River originates in the Ruidera Lagoons (Spain) at 1700 m and flows over 810 km to the Atlantic
Ocean, with 550 km in Spanish territory, 110 km along the border and 150 km in Portugal. Some 85 km
upstream from the mouth of the Guadiana there is a natural barrier which impedes migration of fish, apart
from eels (Anguilla anguilla (L.)), except at very high flows. The mean human population density for the
region is 28 inhabitants km−1 , but with a generally higher concentration in the north (INAG/COBA, 1995).
The study area has a north–south orientation, with variation in altitude from about 500 m down to sea
level. The geology is schist-derived, highly impermeable, and with little groundwater resources. The catchment
experiences a typical Mediterranean climate, i.e. long, warm summers with almost no rain, and mild, wet
winters (80% of total annual rainfall); the inter-annual variation in precipitation is large, with series of wet
and dry years. The hydrological regime of the rivers, especially of the smaller tributaries in the south, is
intermittent, with the rivers being reduced to permanent pools or drying up completely (Collares-Pereira
et al., 1998; Bernardo and Alves, 1999; Pires et al., 1999). This regime is mainly dependent on climatic
conditions and riparian vegetation, but an increasing demand for water resources in recent years has modified
flows both in Portugal and Spain.
Fish community
The Guadiana River Basin is considered to have the most diverse fish fauna in Portugal. From the 31
species listed, 19 are freshwater fish species. The lower part of the basin is dominated by two primary species:
Leuciscus alburnoides Steindachner 1866 complex and Barbus steindachneri Almaça, 1967. There are nine
other native species: Anaecypris hispanica (Steindachner, 1866), Chondrostoma willkommii Steindachner,
1866, C. lemmingii (Steindachner, 1866), Leuciscus pyrenaicus Günther, 1868, Barbus microcephalus Almaça,
1967, B. comizo Steindachner, 1865, B. sclateri Günther, 1868, Cobitis paludica (De Buen, 1930) and Salaria
fluviatilis (Asso, 1801)). There are also eight exotic species: Esox lucius L., 1758, Fundulus heteroclitus (L.,
1766), Lepomis gibbosus (L., 1758), Micropterus salmoides (Lacépède, 1802), Gambusia holbrookii Girard,
1859, Cichlasoma facetum (Jenyns, 1842), Cyprinus carpio L., 1758 and Carassius auratus (L., 1758) (Cowx
and Collares-Pereira, 2000).
Data collection
Sampling was performed between November 1997 and July 1998. A total of 306 samples were taken from
149 sites dispersed along the main River Guadiana and its tributaries: 20 sites were sampled on a bimonthly
basis, 41 every four months and the remainder only once. Each site was 60 m long and was UTM (Universal
Transverse Mercator) georeferenced (Figure 1). At each site the fish community structure was evaluated by
electric fishing (DC at 300/600 V and 4–6 A). Fish were identified to species level except for juveniles
(<10 cm) of the genus Barbus because they are difficult to discriminate in the field. All fish were returned
live to the river.
Copyright  2002 John Wiley & Sons, Ltd.
River Res. Applic. 18: 123–136 (2002)
125
MODELLING FISH DISTRIBUTION IN SEMI-ARID RIVERS
PORTALEGRE
A
B
1
N
Caia R.
Elyas
Guadiana R.
2
ÉVORA
3
SPAIN
Degebe R.
Ardila R.
PORTUGAL
Moura
BEJA
Chança R.
Mertola
4
Location
Water course
Site
10 km
5
6
V.R. Sto António
Figure 1. Location of Guadiana River Basin in the Iberian Peninsula (A) and location of sample sites in the different rivers of the
Portuguese sector (B). Reservoirs: 1, Caia; 2, Lucefecit; 3, Monte Novo; 4, Chança; 5, Odeleite; 6, Beliche
Copyright  2002 John Wiley & Sons, Ltd.
River Res. Applic. 18: 123–136 (2002)
126
A. F. FILIPE, I. G. COWX, AND M. J. COLLARES-PEREIRA
Table I. Environmental variables used in data analysis with indication of description and unit of measurement, classes
that were used, representative value and source
Variable
Description
Class
Value
Source
580–599
600–619
620–639
640–659
660–679
1
2
3
4
5
Mapa
1 : 25 000
Location in the drainage basin
EST
Distance to Greenwich
Meridian UTM (1 km)
DGU
Distance to the main river (km)
0–9
10–19
20–29
30–39
40–49
>50
1
2
3
4
5
6
Mapa
1 : 25 000
TRI
Distance between tributary
mouth and Guadiana mouth
(km)
0–49
50–99
100–149
150–199
>200
1
2
3
4
5
Mapa
1 : 250 000
ORD
Stream order
≤3
4
5
≥6
1
2
3
4
Mapb
1 : 100 000
Climate
INS
Average annual insolation (h)
< 2899
2900–2999
>3000
1
2
3
Mapc
(1931–1960)
PRE
Average annual precipitation
(mm)
400–499
500–599
>600
1
2
3
Mapc
(1931–1960)
TEM
Average annual temperature
( ° C)
15.0–16.4
16.5–17.4
>17.5
1
2
3
Mapc
(1931–1960)
FLO
Average annual run-off (mm)
50–99
100–149
>150
1
2
3
Mapc
(1931–1960)
ELE
Elevation (m)
0–49
50–99
100–149
150–199
200–249
>250
1
2
3
4
5
6
Mapa
1 : 25 000
ROC
Rock type (lithology)
Sedimentary rocks
Schist rocks
Volcanic rocks
1
2
3
Mapc
SLO
Longitudinal slope
0–0.1
0.2–0.3
0.4–0.5
>0.6
1
2
3
4
Mapa
1 : 25 000
Geomorphology
Copyright  2002 John Wiley & Sons, Ltd.
River Res. Applic. 18: 123–136 (2002)
127
MODELLING FISH DISTRIBUTION IN SEMI-ARID RIVERS
Table I. (Continued )
Variable
Description
Class
Value
Source
VAL
Valley steepness (number of contour
lines in a perpendicular line to the
site in 500 m)
0–2
3–5
6–8
9–11
12–18
1
2
3
4
5
Mapa
1 : 25 000
NFL
Number of afferent streams
<3
3–4
5–6
7–8
9–11
>11
1
2
3
4
5
6
Mapa
1 : 25 000
WID
Median width of river bed (m)
0–9
10–19
20–29
30–39
>40
1
2
3
4
5
Fieldwork
TRE
Arboreal cover (%)
0
1–24
25–49
50–100
1
2
3
4
Fieldwork
BUS
Bush cover (%)
0
1–24
25–49
50–100
1
2
3
4
Fieldwork
PDE
Population density in the municipality
(no. inhabitants km−2 )
0–29
20–49
>50
1
2
3
Mapc
(1931–1960)
DHO
Distance to nearest house (m)
0–199
200–399
400–599
600–800
1
2
3
4
Mapc
1 : 25 000
ROA
Type of nearest road
Unmetalled
Municipal
Highway
1
2
3
Mapd
DAM
Dam capacity >0.9 hm3 within
60 km of the site
Without dam
With upstream dam
With downstream dam
1
2
3
Map a
1 : 250 000
Riparian vegetation
Human impact
= Army Cartographic Institute.
= Portuguese Institute of Cartography and Cadraste.
c CAN = Environmental National Commission: Environment Atlas (1983).
d ACP = Portugal Automobile Club, 90th edition.
a IGE
b IPCC
The topographical and environmental variables were collected at each site or from maps (see Table I for
scale). For each site, twenty macro-scale variables, exhibiting no seasonal variation, and thus likely to have
predictive value in models of fish distribution, were considered (Table I): four relate to location in the drainage
basin, four characterize the climate, six describe the geomorphology, two relate to riparian vegetation and
four relate to human impact. All variables were converted into classes.
Copyright  2002 John Wiley & Sons, Ltd.
River Res. Applic. 18: 123–136 (2002)
128
A. F. FILIPE, I. G. COWX, AND M. J. COLLARES-PEREIRA
DATA ANALYSIS
Temporal distribution of fish species
This study concentrated on four highly threatened cyprinid taxa (Anaecypris hispanica, Barbus microcephalus, Chondrostoma willkommii and C. lemmingii ) and two more common Iberian species (Leuciscus
pyrenaicus and Cobitis paludica) which are characteristic of the fish community (SNPRCN, 1991; CollaresPereira et al., 1998, 1999, 2000). Due to the lack of information about migration patterns of these species,
temporal variation in occurrence was included in the analysis. This was based on two seasons, a wet and a dry
season, which were discriminated by average monthly precipitation and temperature at five meteorological
stations (Figure 2). The wet season was considered to be from November 1997 until March 1998 and the dry
season from March until the end of the sampling period. Jaccard’s similarity index was used to discriminate
seasonal variation in occurrence for each species. The similarity between the presence/absence of each species
in different time intervals was compared. This index was chosen because it does not count double-absences
and it smoothes the effect of rare species. The species which exhibited seasonal variation in distribution were
those with a Jaccard similarity index <0.6 (Fausch and Bramblett, 1991; Lohr and Fausch, 1997).
Model construction
Multivariate logistic regression was used to determine the effect of environmental factors on the presence/absence of each species and to calculate probability of occurrence because it is capable of using
categorical and non-normally distributed data and also continuous and/or normally distributed data (Hosmer
and Lemeshow, 1989). The aim of this technique is to find a parsimonious model within sound limits of
statistical and biological validity (Hosmer and Lemeshow, 1989; Trexler and Travis, 1993). The association
between the explanatory variables and interactions with the presence/absence of the species was tested using
the maximum likelihood method (Hosmer and Lemeshow, 1989).
Logistic regression is sensitive to extremely high correlations between variables that are supposed to be
independent (Trexler and Travis, 1993; Tabachnick and Fidell, 1996). Correlation between variables was
eliminated by retaining only the variable with the highest explanatory power for pairs of variables with
Kendall’s tau-b correlation coefficient (Siegel and Castellan, 1988) r > 0.70.
For each model, a data matrix was built based on a subset of all sites surveyed in the programme. The
selection of sites for the matrix was based on providing between 40 and 60% of sites where the target
species occurred out of the total number of sites selected. The samples eliminated were chosen randomly
(Table II). These subsets were used to confront analytically the environmental factors that influence the
300
Precipitation (mm)
150
Average monthly precipitation
Average monthly temperature
125
250
100
200
75
150
50
100
50
25
0
0
Oct
Nov
Dec
Jan
Feb
Mar
Apr
May
Jun
Temperature (ºC)
350
Jul
Figure 2. Gausen’s ombrometric diagram based on precipitation and air temperature of 1997 and 1998. When precipitation falls below
the temperature line this indicates the dry period
Copyright  2002 John Wiley & Sons, Ltd.
River Res. Applic. 18: 123–136 (2002)
129
MODELLING FISH DISTRIBUTION IN SEMI-ARID RIVERS
Table II. Total number of sites (‘Sites’) and number of sites where the species occurred (‘Present’) that
were used in each model
Species
Wet season
Sites
Present
Chondrostoma lemmingii
Chondrostoma willkommii
Anaecypris hispanica
Barbus microcephalus
Leuciscus pyrenaicus
Cobitis paludica
104
75
59
32
51
34
23
13
Dry season
Sites
Present
98
72
48
50
Both seasons combined
Sites
Present
42
31
19
20
110
100
63
75
presence/absence for both rare and common species. The linearity of the presence of a species with each
variable was checked using the Mantel–Haenszel test at a significance level of 0.05. The non-linear variables
were coded as categorical variables (dummy variables) and linear variables were treated as continuous (Hosmer
and Lemeshow, 1989).
In the multivariate analysis a stepwise backward selection procedure was applied to each selected variable
with a probability of entry of 0.15 and removal of 0.20 (Hosmer and Lemeshow, 1989; Tabachnick and Fidell,
1996). In this step-by-step selection, the addition and exclusion of variables was based on Wald’s test and
assessment of correlation was based on differences in the coefficients estimated when a variable is added to
the model and from partial correlation of the estimated coefficients for P < 0.001 (Zar, 1996). The interaction
terms were also modelled and those contributing significantly (G-test) to the model were retained (Hosmer
and Lemeshow, 1989). The type and degree of association of each variable with the presence of the species
was determined using the odds ratio (ψ).
To assess the fit of each model, the G-test and a classification table (Tabachnick and Fidell, 1996) was used.
The G-test examined the deviance of the model with the constant versus the final model and rejection was at
P < 0.05 significance based on a chi-squared distribution (Tabachnick and Fidell, 1996). For constructing the
classification table, the probability interval given by the logistic regression was transformed to a binary variable
(presence/absence), and the cut-off points (ranging between 0.4 and 0.5) that maximized the percentage of sites
correctly classified were chosen. All the statistical analyses were performed using SPSS v7.0 for Windows
package.
The probability of occurrence was calculated from the logistic regression models. Variable maps were built
in Corel PHOTOPAINT v3.0, after being treated in IDRISI v9.0 for Windows Geographic Information System
environment (Eastman, 1995), to obtain the probability of occurrence with a class amplitude of 0.1.
RESULTS
Distribution of fish species
The distribution of the most common species–Leuciscus pyrenaicus and Cobitis paludica –varied little
over the study period (Table III). The greatest differences were found in Anaecypris hispanica and Barbus
microcephalus, followed by Chondrostoma willkommii and C. lemmingii. These differences were associated
with the onset of the dry season in March when the flow declined (Figure 2).
The fish species examined were distributed throughout the study area and were found in all the larger
tributaries. Cobitis paludica, Leuciscus pyrenaicus, Chondrostoma lemmingii and C. willkommii were the
most frequently caught species (Table IV). The latter occurred mainly in the bigger tributaries in the north
of the study area. The most rare species was Anaecypris hispanica, which was not found in the main river
and only occasionally in some of the bigger tributaries, followed by Barbus microcephalus, which was most
abundant in the Ardila River and other larger tributaries. C. lemmingii was caught only once in the main
river during the dry period. All six species were found upstream of the dams on the Caia and Chança
Rivers. However, B. microcephalus and C. willkommii were not captured in the watercourses upstream of
Copyright  2002 John Wiley & Sons, Ltd.
River Res. Applic. 18: 123–136 (2002)
130
A. F. FILIPE, I. G. COWX, AND M. J. COLLARES-PEREIRA
Table III. Jaccard measures for the similarity for each species between time intervals and number of sites
used in the calculation
Species
Nov.–Feb./Mar.–Jul.
Nov.–Mar./Apr.–Jul.
Nov.–Apr./May–Jul.
0.600
0.571
0.407
0.400
0.694
0.686
58
0.556
0.556
0.357
0.250
0.660
0.731
58
0.667
0.593
0.500
0.438
0.738
0.791
50
Chondrostoma lemmingii
Chondrostoma willkommii
Anaecypris hispanica
Barbus microcephalus
Leuciscus pyrenaicus
Cobitis paludica
Number of sites used
Table IV. Percentage of occurrence of the species in the total of sampled sites
Species
Chondrostoma lemmingii
Chondrostoma willkommii
Anaecypris hispanica
Barbus microcephalus
Leuciscus pyrenaicus
Cobitis paludica
Number of sites
a Wet
b Dry
Percentage of occurrence
Nov.–Mar.a
Apr.–Julyb
Nov.–July
46.79
31.19
21.10
11.93
109
42.86
31.63
19.39
20.41
98
69.59
71.6
149
season.
season.
the Odeleite dam, and Anaecypris hispanica and B. microcephalus were absent from watercourses upstream
of Monte Novo dam on the Degebe River. Only L. pyrenaicus occurred in the watercourses upstream of the
Beliche and Lucefecit dams (Figure 1).
Probability of occurrence
In the first step of model construction, mean annual flow (FLO) was excluded because it was based
on records of precipitation and consequently had a high correlation with mean annual precipitation (PRE)
(Kendall’s tau-b correlation coefficient: r = 0.723, P < 0.001).
Fit and classification accuracy of the logistic regression models was high, indicating a strong predictive
power, despite being based on small samples (Table V). The average percentage of correct classification in
all models was high (76.1%) and the average percentage where a species was correctly classified as being
present was also high (76.2%). The lowest percentage of correct classification was for Anaecypris hispanica
in the dry season (70.8%).
The variables describing the distribution of Barbus microcephalus and Chondrostoma willkommii in the
logistic regression models were in contrast from those describing Anaecypris hispanica, C. lemmingii and
Leuciscus pyrenaicus (Table VI). The variables influencing the distribution of Cobitis paludica were again
different from the other species. The models showed that species occurrence was significantly related to
several variables of location in the drainage basin, especially stream order (ORD) and the distance to the
main river (DGU). These variables appeared in the models for both seasons in several species (Table VII).
No interaction terms contributed significantly to the models.
The maps showing probability of occurrence of B. microcephalus for the wet and dry seasons (Plate 1)
illustrate the capacity of these models to predict distributions in the study area. For this species, the main river
is an important area in both seasons (probability of occurrence >0.8). During the dry season, the probability
of occurrence decreased in the small tributaries, and the localities where the fish was not likely to be found
Copyright  2002 John Wiley & Sons, Ltd.
River Res. Applic. 18: 123–136 (2002)
PORTALEGRE
PORTALEGRE
N 10 km
ÉVORA
ÉVORA
BEJA
BEJA
[0.0-0.1]
[0.1-0.2]
[0.2-0.3]
[0.3-0.4]
[0.4-0.5]
[0.5-0.6]
[0.6-0.7]
[0.7-0.8]
[0.8-0.9]
[0.9-1.0]
A
B
Plate 1. Probability of occurrence of Barbus microcephalus in the lower Guadiana River Basin (Portugal): (A) wet season;
(B) dry season
Copyright  2002 John Wiley & Sons, Ltd.
River Res. Applic. 18: (2002)
131
MODELLING FISH DISTRIBUTION IN SEMI-ARID RIVERS
Table V. G-test and classification rate for all the logistic regression models
Species
Season
G-test
Cut-off-point
C. lemmingii
C. willkommii
A. hispanica
B microcephalus
65.36∗
24.28∗
16.55∗
27.40∗
18.88∗
13.31∗
16.22∗
27.53∗
42.16∗
31.90∗
Wet
Dry
Wet
Dry
Wet
Dry
Wet
Dry
L. pyrenaicus
C. paludica
Average
Classification rates
%CCT
%CCP
0.4
0.5
0.4
0.5
0.4
0.4
0.4
0.4
0.5
0.5
78.2
71.1
72.0
75.0
75.0
70.8
83.3
82.0
79.0
75.0
76.1
84.8
61.9
63.3
75.9
75.9
73.7
80.0
80.0
84.6
81.7
76.2
%CCA
72.7
78.2
77.8
74.4
74.4
69.0
84.2
83.0
70.0
63.9
74.8
% CCT, percentage of total correctly classified; % CCP, percentage of presences correctly classified; % CCA,
percentage of absences correctly classified.
∗ The models fit the data.
Table VI. Logistic regression models (variables coded as dummy are those where several variables
represent the various classes)
Variable
Chondrostoma lemmingii
Wet season
ELE
BUS
BUSclass2
BUSclass3
BUSclass4
SLO
INS
Constant
Dry season
DGU
TRE
TREclass2
TREclass3
TREclass4
Constant
Chondrostoma willkommii
Wet season
ORD
TRE
DAM
Constant
Dry season
INS
ROC
DGU
DGUclass2
SD (β)
Wald (P)
ψ
CI (ψ) 95%
1.223
0.330
3.397
1.974–5.850
−3.466
−1.878
−6.259
0.982
−1.525
−0.077
1.320
1.292
1.764
0.354
0.555
2.205
< 0.001
0.001
0.009
0.146
< 0.001
0.006
0.006
0.972
0.031
0.153
0.002
2.671
0.218
0.004–0.274
0.018–1.281
0.000–0.035
1.491–4.784
0.087–0.542
0.538
0.144
1.712
1.292–2.269
0.628
2.160
−0.324
−2.721
0.534
0.832
0.761
0.671
< 0.001
0.044
0.239
0.009
0.670
< 0.001
1.874
8.671
0.723
0.658–5.338
1.698–4.290
0.163–3.213
1.151
0.309
0.491
−4.197
0.380
0.252
0.499
1.150
0.002
0.220
0.325
< 0.001
3.160
1.362
1.635
1.500–6.657
0.831–2.234
0.615–4.348
1.3526
−1.5141
0.514
0.861
3.868
0.220
1.412–0.595
0.041–1.191
−2.800
1.376
0.008
0.079
0.143
0.042
0.061
0.004–0.902
β
(continued overleaf )
Copyright  2002 John Wiley & Sons, Ltd.
River Res. Applic. 18: 123–136 (2002)
132
A. F. FILIPE, I. G. COWX, AND M. J. COLLARES-PEREIRA
Table VI. (Continued )
Variable
DGUclass3
DGUclass4
DGUclass5
DGUclass6
ORD
Constant
Barbus microcephalus
Wet season
ORD
ELE
Constant
Dry season
SLO
TRI
TRIclass2
TRIclass3
TRIclass4
TRIclass5
ORD
Constant
Anaecypris hispanica
Wet season
DGU
SLO
Constant
Dry season
TRI
DGU
NFL
BUS
BUSclass2
BUSclass3
BUSclass4
Constant
Leuciscus pyrenaicus
ORD
ORDclass2
ORDclass3
ORDclass4
ELE
BUS
DAM
DAMclass2
DAMclass3
INS
Constant
Cobitis paludica
NFL
NFLclass2
NFLclass3
NFLclass4
NFLclass5
NFLclass6
SD (β)
Wald (P)
ψ
CI (ψ) 95%
−1.586
−0.973
−1.086
0.683
0.286
−1.040
1.051
1.071
1.127
1.101
0.328
2.565
0.131
0.364
0.335
0.535
0.384
0.685
0.205
0.378
0.337
1.980
1.331
0.026–1.605
0.046–3.085
0.037–3.073
1.980–0.229
0.699–2.534
1.821
−0.466
−2.518
0.728
0.472
2.509
0.012
0.323
0.315
6.176
0.627
1.481–25.748
0.249–1.582
−1.687
0.759
0.026
0.185
0.042–0.820
−1.083
3.091
1.327
−0.113
0.861
0.311
1.498
1.370
1.440
1.245
0.719
3.118
0.470
0.024
0.357
0.928
0.231
0.921
0.338
21.995
3.769
0.893
2.365
0.688
0.239
−3.867
0.187
0.301
1.133
< 0.001
0.425
0.001
1.990
1.271
1.379–2.872
0.705–2.289
−0.804
0.320
−0.391
0.391
0.201
0.259
0.447
1.377
0.677
0.208–0.963
0.929–2.041
0.407–1.124
−3.739
−2.683
−2.599
4.277
1.656
1.665
1.630
2.643
0.040
0.111
0.131
0.137
0.024
0.107
0.111
0.106
0.024
0.068
0.074
0.001–0.610
0.003–1.788
0.003–1.813
2.046
3.852
0.609
0.692
0.563
0.696
0.968
1.295
0.235
0.278
3.545
0.040
−0.322
−4.685
1.446
0.750
0.400
1.827
< 0.001
0.003
< 0.001
0.638
0.003
0.043
0.049
0.014
0.958
0.420
0.010
−0.570
0.943
−0.390
1.351
1.500
0.891
0.872
1.064
0.967
1.372
0.109
0.522
0.279
0.714
0.163
0.274
β
Copyright  2002 John Wiley & Sons, Ltd.
0.018–6.377
1.499–322.760
0.224–63.401
0.078–10.245
0.578–9.673
7.739
47.111
1.838
1.999
1.756
1.980–30.253
7.063–314.227
0.1451–23.275
1.260–3.169
1.018–3.030
34.635
1.040
0.724
2.037–588.915
0.239–4.522
0.330–1.587
0.565
2.568
0.667
3.859
4.482
0.099–3.240
0.465–14.176
0.084–5.451
0.579–25.712
0.305–65.943
River Res. Applic. 18: 123–136 (2002)
133
MODELLING FISH DISTRIBUTION IN SEMI-ARID RIVERS
Table VI. (Continued )
Variable
TRE
PRE
TRI
TRIclass2
TRIclass3
TRIclass4
TRIclass5
Constant
SD (β)
Wald (P)
−0.594
−0.889
0.297
0.441
2.660
0.262
0.220
1.668
2.341
1.195
0.671
0.993
0.934
1.197
0.045
0.040
0.078
0.026
0.695
0.825
0.074
0.051
β
ψ
CI (ψ) 95%
0.552
0.441
0.309–0.988
0.173–0.976
14.303
1.300
1.246
5.301
1.375–148.824
0.349–4.841
0.178–8.733
0.849–33.078
β, Estimated coefficients; SD (β), standard deviation of the estimated coefficients; Wald (P), p-value of Wald’s
test: ψ, odds ratio; CI (ψ) 95%, confidence interval of odds ratio 95%. Abbreviations for variables can be
found in Table I.
Table VII. Frequency of occurrence of the
variables in the models by species (number of
species for which an explanatory variable was
selected, max. = 5) and by model (total number
of times explanatory variable selected in two
seasonal models for each species, N = 10)
Variables
ORD
DGU
TRI
INS
ELE
SLO
TRE
BUS
NFL
DAM
PRE
ROC
TEM
WID
EST
VAL
PDE
DHO
ROA
Frequency
By species
By model
3
3
3
3
3
3
3
3
2
2
1
1
0
0
0
0
0
0
0
5
4
3
3
3
3
3
3
2
2
1
1
0
0
0
0
0
0
0
(probability of occurrence <0.5) increased. It is important to note that these probabilities apply to the river
channels only because the reservoirs were not sampled in this study.
The areas of species occurrence can be compared using the logistic regression models. Barbus microcephalus and Chondrostoma willkommii occurred in the rivers of high stream order (ORD). In the wet season
C. willkommii inhabits higher order rivers with high arboreal cover (TRE), and it may occur upstream of the
dams (DAM). In the dry season, it is found in areas with high annual insolation (INS) and low permeability
(SOI). Anaecypris hispanica and C. lemmingii have high probability of occurrence in small tributaries. During
the wet season, both species inhabit rivers with steep gradients (SLO), but the former inhabits areas far away
from the main river (DGU), and the latter inhabits areas with little arboreal cover (TRE), high altitude (ELE)
and low mean annual insolation (INS). In the dry season, both species exist in the tributaries, but the former
Copyright  2002 John Wiley & Sons, Ltd.
River Res. Applic. 18: 123–136 (2002)
134
A. F. FILIPE, I. G. COWX, AND M. J. COLLARES-PEREIRA
occurs in areas with low riparian bush cover (BUS) and the latter with median cover of riparian trees (TRE).
Leuciscus pyrenaicus was found mostly in middle-order streams (ORD), at high elevation (ELE), low mean
annual insolation (INS), high bush cover in the margins (BUS) and downstream of dams (DAM). The Cobitis
paludica model was unstable and the variables had less significance than in the other species models. The
species is associated with areas with lower mean annual precipitation (PRE), many inflow channels (NFL)
and a low percentage of tree cover (TRE). For this species, the higher probability of occurrence was in the
rivers in the south of the study area (TRI).
DISCUSSION
The logistic regression modelling allowed the identification of a combination of variables that determined
species’ distributions. The output suggests that the occurrence of species in semi-arid streams can also
be explained to a reasonable degree of precision by landscape patterns, particularly by geomorphological
variables, as was also found by Milner et al. (1993), Paller (1994), Poff and Allan (1995) and Taylor et al.
(1996) in studies on freshwater fish in temperate river systems. This is an important finding because many of
these characteristics are fixed attributes and are relatively simple to obtain.
Chondrostoma lemmingii, C. willkommii, Anaecypris hispanica and Barbus microcephalus all shifted their
distribution in March at the end of the wet season. These movements are probably two-fold: (i) migration of
mature fish upstream to reproduce, followed by (ii) downstream dispersion, especially of larger individuals,
to avoid the harsh drought conditions and being caught in the poor water quality conditions experienced in
the few remaining permanent pools (Collares-Pereira et al., 1999; Pires et al., 1999). Cobitis paludica and
Leuciscus pyrenaicus seem to occupy the same areas throughout the year, suggesting individuals probably
do not move for the reasons given earlier. This finding indicates that in studies of this nature it is important
to select a period that covers all seasons (Doncaster et al., 1996), thus accounting for seasonal changes in
distribution of fish in relation to the intermittent flow regime.
The models proved relatively strong and explained a reasonable number of the factors responsible for the
presence of the six species studied. However, their interpretation in ecological terms can be problematic. In
Cobitis paludica, for example, the inclusion of a large number of variables suggests that none had an overriding
effect to explain the distribution of the species. Barbus microcephalus and Chondrostoma willkommii are
characteristic of higher stream orders (ORD), and their presence is more likely in the wet season. These
higher order streams have greater probability of experiencing continuous flow, even during drought periods,
compared to lower order streams which frequently dry up to isolated pools, and this might explain the presence
of these larger species. In contrast, Anaecypris hispanica, C. lemmingii and Leuciscus pyrenaicus are smaller
species that have positive associations with distance to the main river (DGU).
Variables relating to location in the drainage basin, e.g. stream order (ORD) and distance to the main river
(DGU), play a major role in discriminating the presence of fish species. Also geomorphological variables, e.g.
elevation (ELE), longitudinal slope (SLO) and vegetation cover of the banks (TRE and BUS), are important,
as found in other studies on fish distribution in the Guadiana (Godinho et al., 1997; Godinho and Ferreira,
1998). Their importance is probably linked to the role these variables play in describing the hydrological
characteristics of the river, thus providing favourable or non-favourable habitats. In this respect, the presence
of dams appears to restrict the distribution of all species, except Leuciscus pyrenaicus. This is probably
because the rivers upstream of reservoirs often dry up during the dry period and recolonization is dependent
on the species surviving the lentic conditions in the reservoirs.
The main problem that exists in the models in their current form involves the reliability of absence
observations. In the case of Barbus microcephalus, according to the logistic regression model there was a
high probability of occurrence in the main river throughout the year. The relative absence of B. microcephalus
in the electric fishing samples from the main river was probably a result of the poor sampling efficiency in
this kind of habitat (Zalewski and Cowx, 1990), and not a reflection of their true absence. This is supported by
local fishermen, who confirmed the presence of this barbel in many sites along the main river. Consequently,
assessment of the occurrence of fish species should not rely on sampling methods that may not provide a true
Copyright  2002 John Wiley & Sons, Ltd.
River Res. Applic. 18: 123–136 (2002)
MODELLING FISH DISTRIBUTION IN SEMI-ARID RIVERS
135
reflection of the distribution range. Another factor that needs to be taken into account in these intermittent
streams is that the fish may be forced to occupy unfavourable habitats when the river recedes and they are
stranded in permanent pools. This could help explain some of the variability unaccounted for in the models.
Perhaps the greatest value of logistic regression modelling is that it is possible to identify where the fish
species would be expected to be found but are absent. This will allow those responsible for the conservation of
the species to assess the reasons for the absence of the species, e.g. whether it is due to a shift in hydrological
regime through river regulation, obstruction to free movement of fish, pollution, or change in landscape use.
Armed with this information, managers can formulate conservation measures to prevent further degradation
of the stocks and possibly enhance the populations.
It is likely that interpretation of these models is precluded by the absence of basic ecological data on the
target species, so parallel studies to address this problem are necessary. Unfortunately, this may be difficult
for many endangered fish species, especially in semi-arid streams where few biological data are available and
their status prevents any further detailed studies into aspects such as the diet and reproduction ecology being
carried out. Notwithstanding, every effort should be made in studies of this nature to provide as wide a range
of data on distribution and abundance as is feasible. This will improve the predictive ability of the model and
improve decisions made regarding conservation management.
In conclusion, the models presented have a high predictive power in semi-arid river systems, and they
can be used in monitoring and predicting temporal changes caused by human activities and shifts in climate.
Furthermore, this modelling approach can be used to predict the areas that need to be conserved to protect
or rehabilitate the endangered species of the catchment. However, further research is needed, on wider
temporal and also spatial scales, to improve the predictability of the models. This should also include temporal
information about land use and land cover changes.
ACKNOWLEDGEMENTS
The authors thank Patrı́cia Tiago, Filipe Ribeiro, Tiago Marques, Luı́s Moreira da Costa, José Rodrigues and
Leonor Rogado for their constant help in the field work. This research was undertaken in the European Union
Project LIFE-Nature ‘A conservation strategy for Anaecypris hispanica’ (contract B4-3200/97/280).
REFERENCES
Bernardo JM, Alves MH. 1999. New perspectives for ecological flow determination in semi-arid regions: a preliminary approach.
Regulated Rivers: Research and Management 15: 221–229.
Brito JC, Crespo EG, Paulo OS. 1999. Modeling wildlife distributions: Logistic Multiple Regression vs. Overlap Analysis. Ecography
22: 251–260.
Buckland ST, Elston DA. 1993. Empirical models for the spatial distribution of wildlife. Journal of Applied Ecology 30: 478–495.
Bustamante J. 1997. Predictive models for lesser kestrel Falco naumani: distribution, abundance and extinction in southern Spain.
Biological Conservation 80: 153–160.
Collares-Pereira MJ, Pires AM, Coelho MM, Cowx IG. 1998. Towards a conservation strategy for Anaecypris hispanica, the most
endangered non-migratory fish in Portuguese streams. In Stocking and Introduction of Fish, Cowx IG (ed.). Fishing News Books,
Blackwell Science: Oxford; 437–449.
Collares-Pereira MJ, Cowx IG, Rodrigues JA, Rogado L, Costa LM. 1999. The status of Anaecypris hispanica in Portugal: problems
of conserving a highly endangered Iberian species. Biological Conservation 88: 207–212.
Collares-Pereira MJ, Cowx IG, Ribeiro F, Rodrigues JA, Rogado L. 2000. Threats imposed by water development schemes on the
conservation of endangered fish species in the Guadiana river, Portugal. Fisheries Management and Ecology 6: 167–178.
Cowx IG, Collares-Pereira MJ. 2000. Conservation of endangered fish species in the face of water resource development schemes in
the Guadiana River, Portugal: harmony of the incompatible. In Management and Ecology of River Fisheries, Cowx IG (ed.). Fishing
News Books, Blackwell Science: Oxford; 428–438.
Davies BR, Thoms MC, Walker KF, O’Keeffe JH, Gore JA. 1994. Dryland rivers: their ecology, conservation and management. In The
Rivers Handbook, vol. 2, Calow P, Petts GE (eds). Blackwell Science: Oxford; 484–511.
Doncaster CP, Micol T, Jensen SP. 1996. Determining minimum habitat requirements in theory and practice Oikos 75: 335–339.
Eastman JR. 1995. IDRISI for Windows–User’s Guide version 1.0. Clark Labs For Cartografic Technology and Geographic Analysis,
Clark University: USA
Copyright  2002 John Wiley & Sons, Ltd.
River Res. Applic. 18: 123–136 (2002)
136
A. F. FILIPE, I. G. COWX, AND M. J. COLLARES-PEREIRA
Evans BJ, Gordon JV, Mavros WV, Perry EM. 1998. The use of Geographic Information Systems technology for salmon habitat
analysis. Report, Pacific Northwest Laboratory.
Fausch KD, Bramblett R. 1991. Disturbance and fish communities in intermittent tributaries of a Western Great Plains River. Copeia
3: 659–674.
Godinho FN, Ferreira MT. 1998. The relative influences of exotic species and environmental factors on an Iberian native fish community.
Environmental Biology of Fishes 51: 41–51.
Godinho FN, Ferreira MT, Cortes RV. 1997. Composition and spatial organization of fish assemblages in the lower Guadiana basin,
Southern Iberia. Ecology of Freshwater Fish 6: 134–143.
Hosmer DW Jr, Lemeshow S. 1989. Applied Logistic Regression. John Wiley and Sons: New York.
INAG/COBA. 1995. Recursos hı́dricos do rio Guadiana e sua utilização. Instituto Nacional da Água: Lisboa.
Lohr SC, Fausch KD. 1997. Multiscale analysis of natural variability in stream fish assemblages of a Western Great Plain as Watershed.
Copeia 4: 706–724.
Margalef R. 1968. Perspectives in Ecological Theory. University of Chicago Press: Chicago.
Massolo A, Meriggi A. 1998. Factors affecting habitat occupancy by wolves in northern Apennines (northern Italy): a model of habitat
suitability. Ecography 21: 97–107.
Milner NJ, Wyatt RJ, Scott MD. 1993. Variability in the distribution and abundance of stream salmonoids, and associated use of habitat
models. Journal of Fish Biology 43: (Suppl. A): 103–119.
Mladenoff DJ, Sickey TA, Wydeven AP. 1999. Predicting gray wolf landscape recolonization: logistic regression vs. new field data.
Ecology and Applications 9: 37–44.
Monkkonen M, Reunanen P, Nikula A, Inkeroinen J, Forsman J. 1997. Landscape characteristics associated with the occurrence of the
flying squirrel Pteromys volans in old-growth forests of northern Finland. Ecography 20: 634–642.
Paller MH. 1994. Relationships between fish assemblage structure and stream order in South Carolina Coastal Plain Streams. Transactions
of the American Fisheries Society 123: 150–161.
Pereira JM, Itami RM. 1991. GIS-based habitat modeling using Logistic Multiple Regression: A study of the Mt. Graham Red Squirrel.
Photogrammetric Engineering and Remote Sensing 57: 1475–1486.
Pires AM, Cowx IG, Coelho MM. 1999. Seasonal changes in fish community structure of intermittent streams in the middle reaches of
the Guadiana basin, Portugal. Journal Fish Biology 54: 235–249.
Poff NL, Allan JD. 1995. Functional organization of stream fish assemblages in relation to hydrological variability. Ecology 76: 606–627.
Siegel S, Castellan NJ. 1988. Nonparametric Statistics for the Behavioral Sciences (second edition). McGraw-Hill International Editions:
New York.
SNPRCN - Serviço Nacional de Parques, Reservas e Conservação da Natureza (ed.) 1991. Peixes Dulciaquı́colas e Migradores. In Livro
Vermelho dos Vertebrados. Secretaria de Estado dos Recursos Naturais: Lisbon.
Tabachnick BG, Fidell LS. 1996. Using Multivariate Statistics (third edition). Harper Collins College Publishers: New York.
Taylor CM, Winston MR, Matthews WJ. 1996. Temporal variation in tributary and mainstream fish assemblages in a Great Plains
Stream System. Copeia 2: 280–289.
Torgersen CE, Price DM, Li HW. 1999. Multiscale thermal refugia and stream habitat associations of Chinook salmon in Northeastern
Oregon. Ecology and Applications 9: 301–319.
Trexler JC, Travis J. 1993. Nontraditional regression analysis. Ecology 74: 1629–1637.
Walker PA. 1990. Modeling wildlife distributions using a geographic information system: kangaroos in relation to climate. Journal of
Biogeography 17: 279–289.
Zalewski M, Cowx IG. 1990. Factors affecting the efficiency of electric fishing. In Fishing with Electricity: Application in Freshwater
Fisheries Management, Cowx IG (ed.). Fishing News Books: Oxford; 89–111.
Zar JH. 1996. Biostatistical Analysis (third edition). Prentice Hall: New Jersey.
Copyright  2002 John Wiley & Sons, Ltd.
River Res. Applic. 18: 123–136 (2002)