Stefanski, L. A., Rubino, M., and Hess, G. (2003)Estimating Patch Occupancy when Patches are Incompletely Surveyed."

ESTIMATING PATCH OCCUPANCY WHEN PATCHES
ARE INCOMPLETELY SURVEYED
Leonard A. Stefanski, Matt Rubino and George Hess
∗
Abstract
- Institute of Statistics Mimeo Series #2543
In this paper we describe a method for statistical modeling of the probability of patch
occupancy by a species as a function of patch size and other covariates when patches may be
incompletely surveyed. The model is appropriate for study designs is which each patch: is
visited once; is surveyed at a set of access sites from which an observer can detect the species
within a measurable portion of the patch; and is surveyed until the species is detected or until
all access sites have been visited. The model explicitly accounts for incomplete patch
coverage, providing unbiased estimates of patch occupancy relative to the “naive” modeling
approach that ignores incomplete patch coverage. The statistical model is motivated by and
illustrated with data from a study of Barred owl habitat.
KEY WORDS: Barred owl habitat; biodiversity monitoring; detection probability; logistic
regression; metapopulation dynamics; negative binomial distribution; patch occupancy;
Poisson distribution; statistical modeling.
Institute of Statistics Mimeo Series No. 2543
April 2003
∗
Leonard A. Stefanski is Professor, Department of Statistics, North Carolina State University, Raleigh, NC 27695–
8203. Matt Rubino is Biologist / GIS Analyst, North Carolina Gap Analysis Project, North Carolina State University,
Raleigh, NC 27695-7617. George Hess is Associate Professor, Department of Forestry, North Carolina State University,
Raleigh, NC 27695–8002. Email addresses: [email protected], matt [email protected]. george [email protected].
Stefanski, Rubino, Hess
1
2
Introduction
Population estimates have long been based on counts of individual organisms. However, the
probability of detecting (counting) an individual organism, given that it is present, is almost
always less than one (e.g., Otis et al. 1978; Lancia et al. 1994). Uncorrected counts of individuals
therefore almost always underestimate the true size of a population. Researchers have developed a
variety of statistical modeling approaches to estimate population size when the detection
probability for individuals is less than one (e.g., Seber 1982; Nichols et al. 2000; Williams et al.
2002; Thompson, 1992, Ch 16). These models, with varying levels of sophistication, arrive at
population estimates by adjusting the counts (number of detections) to account for the probability
of detecting individual organisms. In the simplest forms, such as capture-recapture methods, the
count is divided by a single detection probability to provide a corrected population estimate (see
Seber 1982; Lancia et al. 1994). More complex models use continuous detectability functions that
estimate detectability as a function of distance from the observer (e.g., Buckland et al. 1980;
Lancia et al. 1994; Buckland et al. 2001; Williams et al. 2002).
Analogously, the probability of detecting a species in a habitat patch, given that at least one
individual of the species is present in the patch, is usually less than one. Thus, the proportion of
patches in which a species is detected underestimates the true proportion of occupied patches.
Researchers have begun to address the issue of estimating patch occupancy when detection
probability is less than one. For example, McKenzie et al. (2002) developed a method to estimate
patch occupancy that uses a sampling scheme in which patches are visited repeatedly during the
season in which the species is most detectable. Barbraud et al. (2003) developed a method to
estimate patch occupancy for bird colonies that uses a sampling scheme in which patches are
visited from multiple times each year over a number of years.
In this paper, we describe a method for estimating patch occupancy by a species when:
• each patch is visited only once;
• each patch is surveyed at a predefined set of access sites from which an observer can detect
Stefanski, Rubino, Hess
3
the species within a known portion of the patch;
• each patch is surveyed until the species is detected or until all access sites have been visited;
• patches may be surveyed incompletely, even if all access sites have been visited.
Under these conditions, observers might not detect the species in a patch for one of three reasons:
1. the species was not present in the patch;
2. the species was present in unsurveyed portions of the patch, and therefore was not detected;
3. the species was present in the portion of the patch surveyed, but was not detected during the
survey (we assume that this does not happen, i.e., that the species will be detected, if present
in the surveyed portion).
Our estimates of patch occupancy use information about the portions of the patch surveyed at
each access site in addition to the detection status information.
We developed this method as part of an ongoing project to create a wildlife conservation plan for
an approximately one million hectare suburbanizing region in the United States (Hess et al. 2000;
Hess and King 2001; Rubino and Hess 2003). The ability to quickly develop and test habitat
models for a number of species is central to our approach. The habitat models we have been
developing identify habitat patches predicted to be suitable for a species. Our objective in model
verification is to uncover major flaws quickly and inexpensively using a single survey and simple
presence/absence approaches. We are interested in identifying errors of commission (predicting a
species is present when it is not) and omission (not predicting a species is present when it is) in our
habitat model. Patch occupancy of habitat and non-habitat patches is a measure of these errors.
Our method will be of use to those interested in metapopulation dynamics and biodiversity
monitoring programs. Patch occupancy is a key measure in both of these endeavors, in which
patches are almost always incompletely surveyed and species detectability is almost always less
than one. The portion of patches occupied is a state variable in many metapopulation models (e.g.,
Stefanski, Rubino, Hess
4
Levins 1969; Hanski and Gilpin 1997). Change in the occupancy status of patches through time
can be used to estimate metapopulation extinction and recolonization rates (e.g., Barbraud et al.
2003, Mackenzie et al. 2003). In biodiversity monitoring programs, researchers often survey a
portion of an area and use that information to make inferences about a larger area (e.g., Yoccoz et
al. 2001). Clearly, getting the best estimate for patch occupancy is an important goal.
Patch occupancy has been used to define minimum patch sizes for wildlife habitat. For example,
Robbins et al. (1989) used logistic models of forest patch occupancy for 26 area-sensitive bird
species in the Middle Atlantic United States. They reported the patch size at which the probability
of species occurrence was at a maximum, as well as the patch size for which there was at least a
50% probability of finding a species. These patch sizes have been used widely to define minimum
habitat requirements for the 26 area-sensitive bird species documented. The method we present
can improve estimates of species presence upon which such recommendations are based.
2
Survey Protocol
In developing our statistical model, we made the following assumptions:
• Surveys for a species are conducted on a sample of discrete habitat patches of a range of sizes.
• Some of the patches surveyed are non-habitat and not expected to contain the species. These
are included to estimate errors of omission in the habitat model.
• Because of limited access to the patches, most patches are surveyed incompletely.
• Each patch is surveyed from a set of access sites from which the species can be detected.
Access sites are locations within or adjacent to the patch.
• It is possible to measure the total area of the patch.
• The areas of the portions of a patch surveyed from the access sites, and also the areas of
overlap surveyed from multiple sites can be measured.
Stefanski, Rubino, Hess
5
• Access sites may be visited in any order, and the order in which they are visited is recorded.
• The patch survey is considered complete once the species is detected, or after all access sites
have been visited.
We carried out such a survey to test a barred owl (Strix varia) habitat model (Rubino and Hess
2003). We visited a sample of patches delineated by the habitat model. To investigate errors of
omission in the habitat model, we also visited a number of non-habitat patches predicted by our
habitat model to be unsuitable for barred owls. Most patches were on private lands limiting
accessibility. Therefore access sites were established along public roads surrounding the patches
(Figure 1). We used a vocalization call-back technique in which surveyors broadcast a recording of
owl calls in order to elicit responses from territorial individuals (Rubino and Hess 2003). Because
barred owl vocalizations are audible for approximately 800 meters (Smith 1978; Bosakowski et al.
1987), access sites were approximately 1.6 km apart. This spacing was designed to minimize
overlapping sampling effort and maximize patch area coverage per unit effort.
A patch was surveyed until a response was heard, or until all access sites had been visited. Once a
barred owl was detected, sampling was discontinued and the patch was designated as positive for
owl presence. If there was no response at any of the access sites, the patch was scored negative for
barred owl presence. Listening coverage for a patch was almost always incomplete, especially for
larger patches, (Figure 2). For an incompletely surveyed patch, a negative score does not imply
that the patch was empty, only that no owls were present in the surveyed portion.
Rubino and Hess (2002) modeled the relationship between patch size and owl detection using
logistic regression based on data from 95 surveyed patches. This “naive” approach does not use
information about detection probability within patches to estimate the true proportion of patches
occupied. The methods presented here use information about the portion of each patch surveyed to
improve estimates of patch occupancy. Coverage data were available for 94 of the 95 patches Hess
and Rubino surveyed. Because our methods require coverage data, we used only the 94 patches
with coverage information to develop our model. To facilitate comparison, we reproduced Rubino
and Hess’ logistic regression model using data on 94 of the patches (Figure 2, solid curve).
Stefanski, Rubino, Hess
6
A global positioning system was used to obtain the coordinates of each access site during the
survey. We downloaded these coordinates into a geographic information system and quantified the
proportion of the survey patch measured from each access site. We accomplished this by overlaying
the coordinates on the habitat map and assuming an 800m detection radius for each access site.
The cumulative proportions of the patch surveyed at each access site, and the final status of the
patch (owl detected or not), served as the foundation for our statistical model of patch occupancy.
3
Methods: Statistical Modeling and Analysis
3.1
Single Patch Outcome Probabilities
The patch observation process is determined by a sequence of access sites and the portions of patch
covered from each site. Observation of a patch continues until species detection or all access sites
have been visited. For a given patch, let C1 < C2 < · · · < Cm denote the cumulative coverage
proportions afforded by the m access sites in the order that the sites are visited. For example, if 2%
of the patch is covered from the first access site, 3% from the second not covered by the first, and
6% from the third not covered from the first or second, then C1 = 0.02, C2 = 0.05, and C3 = 0.11.
The observation of a patch results in one of m + 1 possible outcomes, AP1 , · · · , APm , AP(0)
m .
• AP1 : The species is detected at the first access site and the site provided a coverage of C1 .
This outcome is summarized by (1, C1 ), where 1 indicates detection and C1 indicates the
proportion of patch surveyed.
• AP2 : The species is detected at the second access site which provided additional coverage of
C2 − C1 . The outcome is summarized by the vector (0, 1, C1 , C2 ). The first two elements (0, 1)
indicate no detection at the first site, detection at the second site. The second two elements
(C1 , C2 ) give the corresponding cumulative proportions of the patch surveyed.
• APj : The species is first detected at the jth access site which provided a coverage proportion
of Cj − Cj−1 not covered by the previous j − 1 sites. The outcome is summarized by the
7
Stefanski, Rubino, Hess
vector (0, . . . , 0, 1, C1 , . . . , Cj ). The j − 1 leading zeros indicate the failure to detect the
species at the first j − 1 sites; (C1 , . . . , Cj ) indicate the corresponding cumulative proportions
of the patch covered.
If all access sites are visited, there are two possible outcomes at the final site.
• APm : The species is first detected at the mth access site, summarized
(0, . . . , 0, 1, C1 , . . . , Cm ) where the 1 appears in the mth component.
• AP(0)
m : The species is not detected at any of the m sites, summarized (0, . . . , 0, C 1 , . . . , Cm ).
Under these assumptions, we derived the probability of each outcome. First, we determined
conditional probabilities given that there are k individuals of the species in the patch.
Unconditional probabilities are obtained by summing over an assumed distribution for the number
of individuals per patch. We refer to the successive new portions of patch surveyed at each access
site as subpatches. The mathematical assumptions used to develop our model are stated in this
section. Their reasonableness, and the likely consequences of violations, are discussed in detail in
Section 5
When the species does not occupy the patch (k = 0), the only possible outcome (assuming the
³
´
(0)
probability of a false detection is 0) is AP(0)
= 1.
m . Thus Pr (APj ) = 0, j = 1, . . . , m, and Pr APm
Probabilities when k > 0 are derived under the assumptions that individuals are uniformly and
independently distributed throughout the patch, and when present in a surveyed area, individuals
are detected with probability one. Event AP1 occurs if and only if there is at least one individual
in the first subpatch (with proportion C1 ) surveyed. This is calculated as the complement of the
probability that there are no individuals in the first subpatch, and under our assumptions is
Pr(AP1 ) = 1 − (1 − C1 )k . Here, 1 − C1 is the proportion of the patch not surveyed (i.e., all but the
first subpatch). Raising this the kth power gives the probability that all k individuals in the patch
are in the unsurveyed portion (thus (1 − C1 )k is the probability of no individuals in the first
subpatch); and subtraction from 1 results in the probability that not all individuals are in the
8
Stefanski, Rubino, Hess
unsurveyed portion, which is equivalent to having at least one individual in the surveyed portion
(the first subpatch).
Event AP2 occurs if and only if there are no individuals in the first subpatch and there is at least
one in the second subpatch. The probability Pr (AP2 ) can be factored as the conditional
probability of at least one individual in the second subpatch given no individuals in the first
subpatch, and the probability of no individuals in the first subpatch. The probability of no
individuals in the first subpatch was determined previously and is (1 − C1 )k .
The conditional probability of at least one individual in the second subpatch given no individuals
in the first subpatch is calculated using the same reasoning that was used to calculate Pr(AP 1 ).
The area surveyed from the first access site is “removed” from consideration and the remaining
area is regarded as a “new” patch, with the understanding that proportions must be normalized by
the factor 1 − C1 . The conditional probability of no individuals in the second subpatch given no
individuals in the first subpatch is thus {(1 − C2 )/(1 − C1 )}k , and hence the conditional
probability of at least one individual in the second subpatch given none in the first subpatch is
1 − {(1 − C2 )/(1 − C1 )}k . Multiplying by the probability of no individuals in the first subpatch,
(1 − C1 )k , results in Pr (AP2 ) = (1 − C1 )k − (1 − C2 )k . Similar reasoning shows that
Pr (APj ) = (1 − Cj−1 )k − (1 − Cj )k for j = 1, . . . , m. Finally, the probability of no individuals in
the total surveyed area is calculated as the probability that all k individuals are in the unsurveyed
³
´
area, and thus Pr AP(0)
= (1 − Cm )k . The probabilities for all cases can be written
m
Pr (APj ) = (1 − Cj−1 )k − (1 − Cj )k , j = 1, . . . , m,
³
´
Pr AP(0)
= (1 − Cm )k ,
m
k ≥ 0,
(1)
where by definition C0 = 0.
Assuming that the distribution of the number of individuals, K, in a patch is given by a
parametric mass function p(k; λ) = Pr(K = k; λ) depending on the parameter λ, unconditional
probabilities are obtained as
Pr (APj ) =
∞ n
X
k=0
o
(1 − Cj−1 )k − (1 − Cj )k p(k; λ),
j = 1, . . . , m
9
Stefanski, Rubino, Hess
³
Pr AP(0)
m
´
=
∞
X
(1 − Cm )k p(k; λ).
k=0
These probabilities can be written in terms of the probability generating function of K,
P(t, λ) = E(tK ), 0 ≤ t ≤ 1,
Pr (APj ) = P(1 − Cj−1 , λ) − P(1 − Cj , λ),
³
Pr AP(0)
m
´
j = 1, . . . , m,
= P(1 − Cm , λ).
Note that the probabilities depend only on whether an individual was observed (Y = 1) or not
(Y = 0), and on the last two cumulative proportions, denoted (CL−1 , CL ), leading to the concise
representation for the probability of the outcome (CL−1 , CL , Y ),
Pr (CL−1 , CL , Y ) = {P(1 − CL−1 , λ) − P(1 − CL , λ)}Y {P(1 − CL , λ)}1−Y
(
P(1 − CL−1 , λ)
−1
= P(1 − CL , λ)
P(1 − CL , λ)
)Y
for Y = 0, 1, and 0 ≤ CL−1 < CL ≤ 1.
Different choices for the distribution of K (the number of individuals per patch) result in different
versions of the models. We continue development of the model in parallel for the cases in which K
is Poisson(λ) and Negative Binomial(r, λ). The Poisson distribution is the most commonly used
model for count data and thus it is a natural choice; the Negative Binomial is included for
comparison and to illustrate the generality of the modeling approach. For the Poisson distribution,
P(t, λ) = exp{−λ(1 − t)} and
Pr (CL−1 , CL , Y ) = exp(−λCL ) {exp(λ (CL − CL−1 )) − 1}Y .
For the Negative Binomial distribution, P(t, λ) = {(1 − λ)/(1 − λt)} r and
Pr (CL−1 , CL , Y ) =
(
1−λ
1 − λ(1 − CL )
)r ("
1 − λ(1 − CL )
1 − λ(1 − CL−1 )
#r
−1
)Y
.
10
Stefanski, Rubino, Hess
3.2
Incorporating Predictor Variables
The number of individuals in a patch will depend on patch characteristics and such dependencies
are incorporated through the parameter λ. We consider patch size, although the modeling extends
readily to other predictor variables. For example, if the mean number of individuals is thought to
be proportional to patch size, then when K ∼ Poisson(λ), λ = βX is appropriate, where β is an
unknown regression parameter and X is patch size. More generally, λ = λ(X, β), some function of
patch size X and parameter vector β.
Logistic regression is a common model for relating patch occupancy to habitat characteristics such
as patch size (Rubino and Hess, 2003; Scott et al., 2002). In our modeling framework, patch
occupancy corresponds to K > 0, and thus the probability of patch occupancy is
Pr(K > 0) = 1 − Pr(K = 0) = 1 − P(0, λ(X, β)). It is possible to choose λ(X, β) so that the
induced model for patch occupancy is logistic. This facilitates comparison of logistic models fit
under the assumption of perfect detection (sometimes called “naive” models) to our models that
explicitly account for imperfect detection due to incomplete coverage. For the model with K
distributed Poisson,
λ(X, β) = ln {1 + exp(β0 + β1 X)}
(2)
induces a logistic model for patch occupancy, Pr (K ≥ 1) = F (β0 + β1 X) where F is the logistic
distribution function, F (t) = 1/ {1 + exp(−t)}. For the model with K distributed Negative
Binomial,
λ(X, β) = 1 − [1/ {1 + exp(β0 + β1 X)}]1/r
(3)
induces a logistic model for patch occupancy.
3.3
Likelihood Methods
When a sample of n patches are studied resulting in data {(Xi , CL−1,i , CL,i , Yi ) ,
the log-likelihood for β is L (β) =
Pn
i=1
i = 1, . . . , n},
Li (β) where, with λi = λ(Xi , β),
Li (β) = Yi ln {P(1 − CL−1,i , λi ) − P(1 − CL,i , λi )} − (1 − Yi ) ln {P(1 − CL,i , λi )} .
11
Stefanski, Rubino, Hess
b is found by numerical maximization of L(β). Inference is
The maximum likelihood estimator β
b The
generally based on the large-sample, normal approximation to the distribution of β.
b can be estimated by either the inverse sample information
asymptotic-distribution variance of β
³
matrix VbINFO = n−1 Ab−1 or the so-called sandwich variance matrix VbSAND = n−1 Ab−1 Bb Ab−1
n
1X
b L̇ (β)
b T,
b
B=
L̇i (β)
i
n i=1
³
1
Ab = −
n
X
n i=1
´
´T
, where
b
L̈i (β),
(4)
L̇i (β) = (∂/∂β) Li (β), and L̈i (β) = ∂/∂β T L̇i (β). The variance estimate VbINFO is preferred
when the assumed model is correct, whereas the alternative estimate VSAND is robust to model
departures (Stefanski and Boos, 2002).
3.4
Computation and Software
Fitting the proposed models to data involves standard statistical computations. We have written
programs in GAUSS (Aptech Systems) for doing the necessary calculations. These programs were
used to compute the estimates reported in the next section. The GAUSS code, and the used in the
following section, are available from the first author upon request.
4
Results: Statistical Modeling and Analysis
We illustrate our statistical models using data from the Barred owl study of Rubino and Hess
(2003). They presented a logistic regression model of observed patch occupancy, i.e., their analysis
did not account for incomplete patch coverage. Observed patch occupancy can differ from true
patch occupancy. Our models are designed to provide improved estimates of true patch occupancy
by explicitly accounting for incomplete coverage. We compare the standard logistic model (ignoring
incomplete coverage) to models from Section 3 In our models we use X = ln(patch size + 1) to
facilitate comparison to the results presented by Rubino and Hess (2003).
Examination of the data revealed some large patches with no observed occupancy (Y = 0) and
coverages much less than 100%. It is not unreasonable to expect that had these large patches been
surveyed completely, owls would have been detected in some or all of them. It follows that the
Stefanski, Rubino, Hess
12
effect of these points on the fitted naive logistic regression model is to bias downward the estimated
probability of patch occupancy for large patch sizes, which tend to have a smaller proportion of
their areas surveyed. Consequently, the logistic model of observed patch occupancy is biased low
for large patch sizes. The models in Section 3 explicitly account for incomplete coverage and should
result in estimates of patch occupancy parameters that are not biased due to incomplete coverage.
We estimated the occupancy curve as function of ln(patch size + 1) based on several choices for
the distribution of the number of individuals per patch, K (Table 1). The naive model is standard
logistic regression of observed patch occupancy, Y , on X = ln(patch size + 1) and coincides with
the results reported by Rubino and Hess (2003), with the previously noted exception that we used
data on only ninety-four patches. The Poisson and Negative Binomial (r = 1, 2, 4) are the models
described in Section 3 with λ(X, β) chosen so that the induced models for patch occupancy are
logistic. The adjusted-coverage models have intercepts (βb0 ) that are less than the intercept of the
logistic model, and slopes (βb1 ) that are greater than that of the logistic model. These differences
are consistent with the expectation that the coverage-adjusted models have higher estimated
occupancy probabilities for large patches than does the naive logistic model. This fact is illustrated
in Figure 2, which displays the estimated patch occupancy models for the naive, Poisson and
Negative Binomial (r = 1) models. All of these models have two parameters so that AIC =
−2(Log-likelihood)+2. The Negative Binomial models fit the barred owl data marginally better
than the Poisson model.
For additional comparison we fit a standard logistic model to data that were modified to represent
the data that might have been obtained with more complete patch coverage. There are five large
patches (patch size > 934ha) in the data set with no owl observed (Y = 0) and low coverage
proportions (< 0.56). Under the supposition that these patches were truly occupied by owls, and
that Y = 0 only because of the low coverage values, we set Y = 1 for these five patches and refit
the standard logistic regression model to the modified data. The estimates from this model appear
in Table 1 under the heading of a modified logistic model, and the corresponding occupancy curve
is shown in Figure 2. The similarity of the modified-data estimates to the coverage-adjusted model
Stefanski, Rubino, Hess
13
estimates is noteworthy.
5
Discussion
We have presented a statistical model to estimate patch occupancy when patches are incompletely
surveyed. Our motivation for developing this method was to improve our ability to validate a
habitat model being used for conservation planning. Compared to the naive occupancy model (i.e.,
proportion of patches in which we detected barred owls) the incomplete coverage model indicates
that our habitat model has fewer errors of omission at larger patch sizes.
For metapopulation modeling and environmental monitoring applications, our incomplete coverage
model provides a better estimate of patch occupancy than simply counting the number of patches
in which a species is detected. In metapopulation modeling, the proportion of occupied sites is a
state variable; in fact, it describes completely the state of the system in the simplest
metapopulation models. Naive approaches that do not account for detection probability less than
one underestimate the proportion of occupied sites. Similarly, monitoring programs require
inferences about areas that cannot be completely surveyed. Naive approaches would again result in
underestimates of the occurrence of species.
Where patch occupancy has been used to define minimum patch sizes for wildlife habitat (e.g.,
Robbins et al. 1989), the naive approach misrepresents the minimum patch size. For the barred
owl, and for all but the lowest occupancy probabilities, the naive approach overestimates the patch
size required for a given probability of occupancy (Figure 2). This has important implications for
conservation reserve planning initiatives — patches previously considered too small to serve as
conservation habitat might be adequate. If one were to choose a minimum patch size for barred owl
habitat based on our data, the incomplete coverage models suggest smaller patch sizes than does
the naive model. For example, for the patch size at which there is a 50% chance of finding a barred
owl (Robbins et al. 1989) our incomplete coverage model (Neg Bin (r = 1)) results in a smaller
estimated patch size (≈ 164ha) than does the naive model (≈ 288ha).
Stefanski, Rubino, Hess
14
Our model relies on a number of assumptions, all of which are likely to be violated to some extent
in applications. The assumptions are of three types: (1) assumptions on the dispersion of
individuals in a patch; (2) assumptions on the subpatch detection method; and (3) statistical
modeling assumptions. We review the key assumptions and give a qualitative assessment of the
effects of violations. Future work on the statistical model will study the impact of model violations
quantitatively, and describe extensions of the basic model that address certain model violations.
The key assumptions on the species under study is that individuals are independently and
uniformly distributed throughout the patch. This assumption is not necessarily violated for species
whose members occur in pairs, packs, flocks, or other family or social units. In such cases the
model applies with the understanding that “individual” means a family or social unit. For
example, for the barred owl data a detection is regarded as a detection of a nesting pair.
The uniformity assumption is likely to be reasonable when habitat quality does not vary
excessively throughout a patch. For the barred owl study, patches were identified in such a way
that ensured a fair degree of habitat uniformity throughout a patch.
The appropriateness of the independence assumption depends on whether individuals tend to
aggregate or be self-avoiding. Self-avoiding species will, on average, be more evenly dispersed than
would be expected under the independence assumption. This means that, relative to the case of
independence, not as much area would have to be surveyed (on average) before an individual is
encountered. Individuals will be more likely to be detected at one of the first few access sites
relative to the case when individuals are independently dispersed, and observed population density
will likely be higher than what would be predicted by a model that assumes independence. This
suggests that estimates of occupancy probabilities derived from a model that assumes
independence might be biased high. The opposite (occupancy probabilities biased low) will likely
be true of gregarious species. With regard to the barred owl study, mating pairs compete for a
limited resource (habitat patches) within the larger landscape matrix of the study area (Research
Triangle) and are therefore territorial. As a result, there is a tendency for pairs to avoid one
another. Thus pairs may be more evenly distributed throughout patches and co-occur less
Stefanski, Rubino, Hess
15
frequently than would be the case under independence assumption. The impact of this possible
violation of the assumptions is not known at this time. Future work is planned to investigate the
robustness of our model to violations of the independence assumption.
The assumptions on the subpatch survey detection method are threefold. The first two are that the
probability of a false detection is zero, and the probability of detection is one when an individual is
in a surveyed subpatch. Violations of the latter are more common and hence more problematic
than are violations of the former, and result in underestimation of occupancy probabilities. Our
basic model can be modified to handle both types of errors provided the error rates of the
detection method are known or can be estimated. The modifications are technically involved and
will be presented in a forthcoming paper. The required error rates can be estimated by multiple
surveys of the same access sites, or experimentally by repeatedly surveying subpatches that are
known to contain individuals (to estimate the probability of detection) or known not to contain
individuals (to estimate the probability of false detection). The third assumption is that subpatch
coverage areas can be determined with acceptable accuracy. Subpatch areas are accumulated over
access sites and divided by total patch area to obtain CL−1 and CL , the patch coverage proportions
required in our model. Thus errors in the subpatch areas are propagated to CL−1 and CL . For the
barred owl study, coverage areas were determined by GIS based on GPS-determined locations that
are accurate to within seven-to-ten meters, resulting in very accurate subpatch coverage area
measurements and thus errors in CL−1 and CL are not likely to be problematic. This may not be
the case in other applications and future work will investigate the effect of errors in C L−1 and CL .
The key modeling assumption relates to the distribution for the latent variable, K, the number of
individuals in a patch. There is very little information in a single data set on which to base the
choice. However, the models fit to the barred owl data suggest that the choice is not too critical.
The Negative Binomial distribution is heavier tailed and more dispersed than the Poisson
distribution, yet the parameter estimates and patch occupancy curves displayed in Table 1 and
Figure 2 are similar, indicating a reasonable measure of insensitivity to the model for K.
Stefanski, Rubino, Hess
6
16
Acknowledgements
We thank Jim Nichols for encouragement and advice in the initial stages of our work, and for
comments on the manuscript. The manuscript also benefitted from the thoughtful comments of
Cavell Brownie and Ken Pollock.
Literature Cited
Barbraud, Christophe, James D. Nichols, James E. Hines, And Heinz Hafner 2003. Estimating
Rates Of Local Extinction And Colonization In Colonial Species. Oikos, in press.
Burnham, K.P., D.R. Anderson, and J.L. Laake. 1980. Estimation of a density from line transect
sampling of biological populations. Wildlife Monographs 72.
S.T. Buckland, D.R. Anderson, K.P. Burnham, J.L Laake, D.L. Borchers and L. Thomas. 2001.
Introduction to Distance Sampling: Estimating Abundance of Biological Populations. Oxford
University Press, New York.
Hanski, I. A. and M. E. Gilpin (editors). 1997. Metapopulation biology: ecology, genetics, and
evolution. Academic Press, New York, New York, USA.
Hess, G. R., K. Dixon and M. Woltz. 2000. State of open space 2000: the status of the Triangle’s
green infrastructure. Triangle Land Conservancy, Raleigh, North Carolina, USA.
Hess, G. R. and T. J. King. 2002. Planning for wildlife in a suburbanizing landscape I: selecting
focal species using a Delphi survey approach. In Press. Landscape and Urban Planning 58(1):
25-40.
Lambeck, R. J. 1997. Focal species: a multi-species umbrella for nature conservation. Conservation
Biology 11(4):849-856.
Lambeck, R.J. 2002. Focal species and restoration ecology: Respose to Lindenmayer et al.
Conservation Biology, in press.
Lancia, R. A., J. D. Nichols, and K. H. Pollock. 1994. Estimating the number of animals in wildlife
populations. Pages 215-253 in T. Bookhout, editor. Research and management techniques for
wildlife and habitats. The Wildlfie Society, Bethesda, Maryland, USA.
Levins, R. 1969. Some demographic and genetic consequences of environmental heterogeneity for
biological control. Bulletin of the Entomological Society of America 15:237-240.
Lindenmayer, D.B., A.D. Manning, P.L. Smith, H.P. Possingham, J. Fischer, L. Oliver, and M.A.
Stefanski, Rubino, Hess
17
McCarthy. 2002. The focal-spsecies approach and landscape restoration: A critique.
Conservation Biology, in press.
MacKenzie, D.I. James D. Nichols, Gideon B. Lachman Sam Droege, J. Andrew Royle and
Catherine A. Langtimm. 2003; Estimating Site Occupancy Rates When Detection Probabilities
Are Less Than One. Ecology, 83:2248-2255.
Nichols, J. D., J. E. Hines, J. R. Sauer, F. W. Fallon, J. E. Fallon, and P. J. Heglund. 2000. A
double-observer approach for estimating detection probability and abundance from point
counts. Auk 117: 393-408.
Otis, D. L., K. P. Burnham, G. C. White and D. R. Anderson. 1978. Statistical inference from
capture data on closed animal populations. Wildlife Monographs 62.
Robbins, C.R., D.K. Dawson, and B.A. Dowell. 1989. Habitat area requirements of breeding forest
birds of the middle Atlantic states. Wildlife Monographs 103: 1-34.
Rubino, M. J. 2001. Identifying barred owl habitat in the North Carolina Piedmont: using GIS in
focal species conservation planning. Thesis, North Carolina State University, Raleigh, North
Carolina, USA.
Rubino, M.J. and G.R. Hess. 2003. Planning open spaces for wildlife II: Modeling and verifying
focal species habitat. Landscape and Urban Planning, in press.
Scott, J.M., P.J. Heglund, M. L. Morrison, M.G. Raphael, W.A. Wall, and F.B. Samson (editors).
2002. Predicting species occurrences: Issues of scale and accuracy. Island Press, Covello,
California, USA.
Seber, G. A. F. 1982. The estimation of animal abundance and related parameters. Second
Edition, MacMillan Press, New York, New York, USA.
Simberloff, D. 1998. Flagships, umbrellas, and keystones: is single-species management passe in the
landscape era? Biological Conservation 83(3):247-257.
Stefanski, L. A. and Boos, D. (2002). The Calculus of M-estimation. The American Statistician,
56:29-38.
Thompson, S. K. 1992. Sampling. Wiley, New York, New York, USA.
Williams, B. K., J. D. Nichols, and M. J. Conroy. 2002. Analysis and Management of Animal
Populations. Academic Press, San Diego, California, USA.
18
Stefanski, Rubino, Hess
I JLK
MLNPO QSRUTUVUWUXUV
I(JLKMLNPO QSRUTUWUTUYUZ
067168:9
'3)2 %54)2 . ! $ /
"!#$&%10) )2 /
- ) .+/
"! #$&%('*)+#$"! , $
"
! #$&%;0 / $2 2 ,=< !&$ >
?@@BAC! . /BD
E @F E F
<G+ $ /+/ "!#$&%
) 2 H 9
Figure 1: Portion of study area showing Patches Nos. 26746 and 27295 (gray), access sites along
public roads, survey listening areas, and subpatch coverages (lined). Access site 105015 provided
≈ 99% coverage of Patch No. 27295; the other access sites combined provided ≈ 75% of Patch No.
26746.
19
Stefanski, Rubino, Hess
βb0
βb1
−2(Log-likelihood)
Logistic (naive)
-3.40 (0.82)
0.60 (0.15)
97.34
Poisson
-4.07 (0.96)
0.84 (0.18)
207.27
Neg Bin (r = 1)
-3.83 (0.84)
0.75 (0.14)
198.72
Neg Bin (r = 2)
-4.05 (0.92)
0.82 (0.16)
201.61
Neg Bin (r = 4)
-4.09 (0.94)
0.83 (0.17)
204.10
Logistic (modified) -4.13 (0.97)
0.79 (0.17)
87.02
Model
Table 1: Parameter estimates, standard errors (in parentheses) and log-likelihoods for the naive
logistic model; several models of the distribution of the number of individuals per patches (K) from
Section 3; and the logistic model fit to modified data in which the five largest patches in which no
owls were detected were assumed to have owls present (but not detected because of low coverage).
Stefanski, Rubino, Hess
20
Figure 2: Top. Patch coverage fractions by transformed patch size (non-habitat patches excluded).
Bottom. Estimated probability of patch occupancy. Naive, logistic regression with no adjustment
for incomplete coverage; Poisson, coverage-adjusted Poisson model; NB(r = 1), coverage-adjusted
Negative Binomial model with r = 1; Modified Data, logistic regression with modified data.