Ecography 38: 311–320, 2015 doi: 10.1111/ecog.00722 © 2014 The Authors. Ecography © 2014 Nordic Society Oikos Subject Editor: Timothy Keitt. Accepted 25 June 2014 Novel spatial analysis methods reveal scale-dependent spread and infer limiting factors of invasion by Sahara mustard Yue M. Li, Katrina M. Dlugosch and Brian J. Enquist Y. M. Li ([email protected]), K. M. Dlugosch and B. J. Enquist, Dept of Ecology and Evolutionary Biology, Univ. of Arizona, Tucson, AZ 85721, USA. Multiple scale-dependent ecological processes influence species distributions. Uncovering these drivers of dynamic range boundaries can provide fundamental ecological insights and vital knowledge for species management. We develop a transferable methodology that uses widely available data and tools to determine critical scales in range expansion and to infer dominating scale-dependent forces that influence spread. We divide a focal geographic region into different sized square cells, representing different spatial scales. We then used herbarium records to determine the species’ occupancy of cells at each spatial scale. We calculated the growth in cell occupancy across scales to infer the scale dependent expansion rate. This is the first time such a ‘box-counting’ method is used to study range expansion. We coupled this multi-scale analysis with species distribution models to determine the range and spatial scales where suitable climate allows the species to spread, and where other factors may be influencing the expansion. We demonstrate our methodology by assessing the spread of invasive Sahara mustard in North America. We detect critical scales where its spread is limited (100–500 km) or unconstrained (5–50 km) by climatic variables. Using climate-based models to assess the similarity of climate envelopes in its native and invaded range, we find that the climate in the invaded range generally predicts the native distribution, suggesting that either there has been little local adaptation to climate occurring since introduction or the biological interaction experienced in the invaded range has not driven the species to occupy climatic conditions much different from its native range. Our novel method can be broadly utilized in other studies to generate critical insights into the scale dependency of different ecological drivers that influence the spread and distribution limits, as well as to help parameterizing predictions of future spread, and thus inform management decisions. Species’ distributions are shaped by a myriad of ecological factors that fall into three general categories: physical environment, biotic interactions and dispersal (Brown et al. 1996, Kirkpatrick and Barton 1997, McGill 2010). Disentangling these multiple drivers of range boundaries is difficult, and can be confounded by their inherent scale-dependency. Multiple ecological and evolutionary processes that operate on differing spatial and temporal scales work together to set the range boundaries of species (MacArthur 1972, Holt 2003, Gaston 2009, Holt and Barfield 2011), and consequently the distribution of species can show strong correlation with different ecological factors when analyzed at different scales (Wiens 1989, McGill 2010, Estrada-Villegas et al. 2011). For example, climatic variables such as annual precipitation show the strongest variation at large spatial scales, whereas habitat variables such as soil pH vary more strongly at finer scales. Consequently, climatic variables often appear to be the strongest factor explaining species ranges at the coarsest scale in space (Caughley et al. 1987, Root 1988, Davis and Shaw 2001). Within a coarse scale, a distribution can be patchy given finer scale variation in habitat structure, biotic interactions and dispersal limitation (GreigSmith 1979, Wethey 2002). Approaches that utilize multi- scale analyses are essential for revealing how the influence of specific ecological factors changes with scale (Wiens 1989). In this study, we present a novel methodology that can be broadly-applied to assess the scale-dependency of species’ distributions. Distribution changes – such as those that occur in response to species introductions or climate change – provide ecological experiments that can reveal the factors that set range boundaries at specific scales (Holt et al. 2005, Kinlan and Hastings 2005, Pysek and Hulme 2005, Ricklefs 2005, Sexton et al. 2009). Increasing number of studies have assessed the distribution of invasive species across multiple spatial scales. Most of them emphasize on how current extent of invasion changes with spatial scales (Foxcroft et al. 2009) and what ecological factors are associated with this scale dependency of existing invasion extent (Catford et al. 2009, Akasaka et al. 2012). But very few (Lonsdale 1993, Pysek et al. 2008) have examined how historical and current rate of range expansion change with spatial scales. It is important to reveal the temporal dynamics of scale dependent range expansion in order to infer historical and existing ecological factors that may influence distribution changes at specific scales. We suspect the lack of such study is largely due to methodological barrier. Here we illustrate the use of the 311 ‘box-counting’ method (Mandelbrot 1983 p. 19) to assess the expansion rate of an invading species along a continuum of spatial scales and across multiple decades. In particular, we ask how the rate of decadal expansion varies across those scales and whether there are critical scales at which the expansion rate changes dramatically. The box-counting method is a simple and powerful tool normally used in scaling analysis for detecting the fractal dimension of a system (Mandelbrot 1983, Morse et al. 1985, Ritchie 1998). Veldtman et al. (2010) used this method to assess how environmental factors affect the occupancy pattern of an invasive Acacia species. In this study, instead of taking a static view of invasive distribution, we extend its use to multi spatial scale analysis of the temporal dynamics of range expansion. To infer specific ecological factors driving scaledependent expansion, we demonstrate a focal analysis of the importance of climate variables. We use climate based species distribution models to predict the expansion potential of a focal species across spatial scales. Climatic variables are expected to restrict species distribution particularly at large spatial scales (Whittaker et al. 2001, Pearson and Dawson 2003, McGill 2010). Here we not only evaluate this hypothesis but also attempt to quantitatively determine the range of spatial scales where climate has the strongest influence. Next, we infer the influence of dispersal by evaluating the temporal change in the scale dependency of range expansion. One can infer strong contribution by long distance dispersal if high expansion rates prevail at large scales. Moreover, one can infer when this contribution matters the most to range expansion by finding periods during which large scale expansion is most prevalent. Our focal species is a winter annual plant Brassica tournefortii (Brassicaceae), commonly known as Sahara mustard. It is native to Eurasia and Africa and was unintentionally introduced to southern California in the 1920s. It has since invaded vast areas of the southwest (Minnich and Sanders 2000), mainly in the Sonoran and Mojave Deserts. This species grows not only on sand, sandy loam soil and rocky hillside in the Mojave and Sonoran Deserts but also in more mesic environments in California coasts. The distribution of Sahara mustard in its native range covers 0–2400 m elevations (Miller and Cope 1996) and a variety of landscapes such as desert floors, dunes, oases, desert mountains and steppes (Maire 1965, Townsend and Guest 1980, Zohary et al. 1980). Individual seeds germinate after major winter storms. Survived germinants have rapid phenology and can produce seed sets as quick as in less than two months (Marushia et al. 2012). Under favorable conditions, an individual plant can develop basal rosette leaves ∼1 m in diameter and stem more than 80 cm tall (unpubl.), and produce ⬎ 16 000 seeds (Trader et al. 2006). Its seed coat becomes mucilaginous in contact with water, which may aid dispersal by having wet seeds stick to objects capable of traveling long distance (Bangle et al. 2008). Sahara mustard is one of the very few self-compatible species in the Brassica genus (Hinata and Nishio 1980). Compared to congenic self-incompatible species, Sahara mustard produces smaller flowers with a smaller amount of pollen grains. Its anthers face the pistils and pollinate the protruding stigma. Its successful invasion may be a result of many of the aforementioned attributes: tolerance of a wide range of 312 environments, rapid phenology, high fecundity, aided seed dispersal, and self-fertilization. Its decades of expansion recorded in herbarium collections provide an opportunity to conduct a multi-scale analysis of its spread. Moreover, its threat to native species (Barrows et al. 2009) begs effective management based on sound understanding of the processes that limit its distribution. We demonstrate the use of our novel method by asking 1) how the expansion of Sahara mustard has varied temporally across spatial scales, 2) whether climate limits the expansion of Sahara mustard at specific spatial scales, and 3) how long distance dispersal affects the scale dependency of its expansion. Finally, we explore the potential for evolution and biological interaction to limit invasive spread. For introduced species in particular, whether or not a species can occupy novel environments has important implications for predicting spread (Wiens et al. 2010). We use species distribution models constructed from the climate of non-native range to predict the native range of invasive Sahara mustard. If the predicted native range based on non-native distributions consists new territories not observed in its native range, the difference may suggest that either an invasive species has adapted to novel climates during invasion or novel biological interactions in the invaded range has driven the species to occupy climatic conditions that it normally cannot occupy in the native range. If the predicted native range is only a subset of the observed native range, the result may suggest that either the species has not reached the climatic limits of invasion or novel biological interactions in the invaded range prevents the species from occupying all of its native climatic range. Methods Estimating the expansion rate of Sahara mustard across multiple spatial scales We utilized herbarium collection records of Sahara mustard to assess the species’ expansion over North America. Like many other species, there is no specific long term study to track the change in distribution of Sahara mustard. We show that herbarium collections are useful for revealing the distribution dynamics of poorly studied species despite the potential sampling biases associated with such collections. We searched a large number of online databases covering North America (Supplementary material Appendix 3, Table A1). Only the following four herbarium databases had records of Sahara mustard: the Southwest Environmental Information Network (SEINet), Consortium of California Herbaria (CCH), Global Biodiversity Information Facility (GBIF) and New Mexico Biodiversity Collections Consortium. Aggregating the herbarium records by decades, we reconstructed the distribution of the species in each decade since its first discovery in North America. We then used this information to study the scale dependency of range expansion. Next we assessed the robustness of the boxcounting method on a simulated diffusion growth model (Supplementary material Appendix 1) and applied the method to estimate the decadal expansion rate of Sahara mustard across 7 spatial scales (5, 10, 20, 50, 100, 200, and 500 km). We selected the rectangular region that contained all the herbarium records of Sahara mustard in North America (25° to 38.25°N, 99.33° to 122.79°W) and evenly divided this region into square cells of 5 ⫻ 5 km2 to 500 ⫻ 500 km2 size, corresponding to the 7 scales. We avoided scales smaller than 5 km because imprecise or inaccurate geographic coordinates of herbarium specimens can introduce large errors to small scale estimates. We summed the number of cells that had herbarium records in each decade. Where the number of occupied cells of size s in decade d is Ns, d, the expansion rate at spatial scale of s from decade d to d ⫹ 1 was calculated as rd ⫹ 1,s ⫽ ln(Nd ⫹ 1,s/Nd,s). Because only five records were reported prior to 1950 (Supplementary material Appendix 3, Table A2), we applied the method to the decadal intervals from year 1950 to 2009. Our simulated diffusion growth model suggests that as the spatial scale increases, the expansion rate calculated by the box-counting method becomes more sensitive to the placement of the grid. Fortunately, this sensitivity can be largely eased by averaging estimates from multiple placements (Supplementary material Appendix 1). In this analysis, we shifted the placement of the grid in 16 directions (π/8 radians apart) and three distances (600 m, 26 km, and 108 km) in each direction. We graphically report estimates from all those placements and averaged them to determine the expansion rate at each scale. To assess how small scale imprecision in geographic coordinates affects our estimations at large scales, we perturbed the original collection data by a normally distributed random error. We used errors with a mean of zero and a standard deviation of 1 km and applied the box-counting method to this perturbed data. Lastly, we assessed how variation in sampling efforts may affect our analysis. The intensity of floristic surveys at different spatial scales could vary in time, confounding our analysis (e.g. fewer herbarium records within a decade could be a result of less sampling effort). We quantified the variation in sampling efforts by performing the same expansion analysis on native species with presumably stable range (Delisle et al. 2003). We then subtracted this native ‘expansion’ rate (averaged over all grid placements) from the invasive expansion rate to correct for sampling bias. We obtained the herbarium records of three common and widely distributed native winter annual species (Plantago patagonica, Chaenactis stevioides and Lepidium lasiocarpum) from the same databases and within the same geographic boundary as Sahara mustard. Using their combined records, we reconstructed the decadal variation in herbarium sampling efforts (Supplementary material Appendix 2) and used it as a baseline to adjust the expansion rate of Sahara mustard. Using MaxEnt species distribution models to evaluate the range of Sahara mustard under suitable climatic conditions We predicted the range of Sahara mustard in North America using species distribution models (SDMs) based on the climatic conditions under which this species has been recorded. We built our SDMs using MaxEnt (ver. 3.3.3k), a program specifically developed for treating presence-only records (Phillips et al. 2006, Elith et al. 2010b). Our MaxEnt SDMs attempted to construct the species’ distribution under the maximum influence of climatic variables we chose and minimum influence of any other factors. The expanding range of a non-native species does not reflect a stable relationship with the invaded environments (Elith et al. 2010a). This lack of equilibrium presents challenges for modeling a potential distribution using data from the current distribution. To address this problem, we reduced the complexity of our model by choosing only a few climatic variables that are most biologically relevant and by using features and regularization parameters in MaxEnt that are more appropriate for modeling species with an unstable range (Supplementary material Appendix 4). Our models used four climatic variables from the Global Climate Database (WorldClim.org): mean temperature of the coldest quarter (TEMPCOLDQ), precipitation of the coldest quarter (RAINCOLDQ), annual precipitation (RAINYEAR), and mean temperature of the warmest quarter (TEMPWARMQ), all of which are biologically relevant to the life history of Sahara mustard (Table 1). Those variables were derived from climatic records between 1950 and 2000 (Hijmans et al. 2005), consistent with the period when most of the occurrences of Sahara mustard were recorded (1940–2010). The variables have a spatial resolution of 30 arcsecond (∼1 km2), which was achieved by interpolating data from a global network of weather stations. The interpolation took into account the elevational difference in space but ignored other local scale factors such as aspect (Hijmans et al. 2005). The background of our models was a polygon that consists of the majority of southwestern North America, from which we drew 10 000 random samples. We also allowed for further expansion by using an enlarged background containing all lower 48 states of the U.S. and the entire territory of Mexico. Models trained under this enlarged background did not give qualitatively different results (Supplementary material Appendix 4, Fig. A3 and A4). Therefore we only report results from models based on the standard background. Table 1. The four climatic variables used in the species distribution models. In line with modeling a range shifting species, we reduced the complexity of our models by choosing a very few climatic variables that are most biologically relevant to Sahara mustard. Climatic variable Abbreviation Biological relevance Mean temperature of the coldest quarter TEMPCOLDQ Precipitation of the coldest quarter RAINCOLDQ Annual precipitation RAINYEAR Mean temperature of the warmest quarter TEMPWARMQ The average temperature condition in the growing season of a winter annual plant The average resource (water) availability in the growing season of a winter annual plant The average of maximal potential resource (water) availability for an annual plant The high summer temperature required to break seed dormancy in desert winter annual plants (Baskin and Baskin 2006) 313 None of the climatic variables used were overly correlated with each other (Supplementary material Appendix 4, Table A4); therefore we included all four variables in each model. We used two datasets of spatially distinctive records to train our models. The first dataset included 414 occurrences drawn from the herbarium records. The second dataset included 2662 records drawn from both the herbaria and invasive plant surveys (hereafter combined records). Those surveys (Supplementary material Appendix 3, Table A1) were conducted in the core region of its invasion (southern California and Arizona) in the 2000s. The first dataset reflects records acquired by relatively equal sampling efforts across North America. The second dataset reflects records influenced by intensive sampling efforts in its core invaded area. Points close to the species’ continental range limit are expected to have a stronger influence on models based on the herbarium records than those on the combined records. We derived ensemble predictions from models based on both datasets to balance the influence of occurrences at range limit and in the range center. We tested all of our models through 10-fold cross validation and made predictions of presence or absence based on the logistic output of the models (Supplementary material Appendix 4). Comparing the climatic conditions of Sahara mustard in its native and invaded range to infer evolutionary constraints We used our developed SDMs to project the range of Sahara mustard over the continents to which it is native (Eurasia and northern Africa). We also surveyed the existing literature and searched the Global Biodiversity Information Facility (GBIF) database to estimate the recorded native range of Sahara mustard (Supplementary material Appendix 3, Table A3). We then compared the model-projected and the recorded native range to identify any key differences. The literature describes detailed regional distribution and habitat type within each country in the Middle East and Europe, but gives very coarse scale description in northern Africa and central Asia, often mentioning an area that covers multiple countries (Supplementary material Appendix 3, Table A3). We didn’t find any flora describing its distribution in Morocco, Tunisia, and Burkina Faso even though the GBIF database shows collections from those countries. We therefore highlighted the entire territory of these countries as potential native range of the species. Results Scale-dependent expansion of Sahara mustard After corrected the bias in sampling efforts, we found that in the 1960s the species experienced rapid expansion across all scales (Fig. 1c). However, estimates beyond the scale of 50 km (inclusive) should be taken with caution given their high sensitivity to grid placement (Fig. 1a). In contrast, range expansion in the 1970s saw a substantial slowdown at 314 5–200 km, but a dramatic surge at the 500 km scale reflecting range boundary expansion. This is the decade during which the species was first recorded in western Texas, central California and southern Utah. In the 1980s, the expansion rate increased at the 5–20 km scale but decreased at larger scales. Note, the increase at smaller scales is a result of correction for sampling efforts. Without the correction, the expansion rate was indicated to rather decrease at those smaller scales (compare Fig. 1a and c). In the 1990s, the species’ range experienced a contraction across most spatial scales. In the 2000s, the species almost stopped its expansion at the 100–500 km scales but sustained a high rate of expansion at the smaller scales. This is the decade in which the species rose to prominence as a significant concern in the southwest. The herbarium collections nearly quadrupled from 76 records in the 1990s to 279 in the 2000s (Supplementary material Appendix 3, Table A2). With the exception of the 1960s, estimations of expansion rate of Sahara mustard were generally insensitive to the placements of grids (Fig. 1a). As expected, sensitivity increases with the spatial scale of focus. But even at the largest scales (200 and 500 km), there is no severe fluctuation in estimations as those seen in our simulated model (Supplementary material Appendix 1, Fig. A1). Therefore, the box-counting method produced reliable scale-dependent expansion rate of Sahara mustard for the period between the 1970s and the 2000s. Estimations in the 1960s have wider fluctuation especially at scales larger than 50 km (inclusive), and thus are less reliable. This is probably due to lower number of sampling points. Small scale error introduced to the geographic coordinates has minor but noticeable effect on the estimated expansion rates across all spatial scales (Fig. 1b). However, its effect does not qualitatively change any pattern of scale dependency. Estimates of the expansion rate are also greatly affected by the variation in sampling efforts (comparing Fig. 1a and 1c). Predicted range of Sahara mustard in North America under its suitable climate The range boundaries of Sahara mustard predicted by our climate-based SDMs generally agreed with the recorded boundaries at the regional scale (Fig. 2), indicating little room for further regional scale expansion of this species except in the large area of the Central Valley, California. According to our models, Sahara mustard is likely to occur in areas with mean temperature in the coldest quarter between 3.9 and 17°C, mean temperature of the warmest quarter between 15.7 and 36°C, precipitation of the coldest quarter between 16 and 302 mm, and annual precipitation between 46 and 657 mm (Supplementary material Appendix 4, Fig. A5). All SDMs performed well according to their crossvalidation test scores (Supplementary material Appendix 4, Table A6). Models trained by herbarium records (more influenced by points at the range limit) predicted a more extended range (green areas in Fig. 2) that includes more areas in northern Mexico, Baja California, central and coastal California and a small region in northwestern Florida (not shown in figure). In comparison, models trained by Figure 1. Decadal expansion rates of Sahara mustard from the 1960s to the 2000s at spatial scales of 5–500 km, as estimated from (a) original herbarium data and (b) original data perturbed by a normally distributed random error with a standard deviation of 1 km. ‘⫹’ indicates each estimate from a single grid placement. Circles indicate a mean value averaged over estimates from all grid placements. (c) Expansion rate adjusted for variation in sampling efforts by subtracting the mean value by an estimate for native species. 315 combined records (more influenced by points in the core invaded area) predicted a more thorough coverage by the species over the Mojave Desert (yellow areas in Fig. 2). Agreement of climatic conditions between native and invaded range The recorded native range of Sahara mustard generally agrees with the projection based on the climatic conditions in its North American range (Fig. 3), suggesting that either this species has not adapted to novel climatic conditions during invasion or biological interactions in the invaded range has not driven the species to occupy new climatic conditions. The SDMs trained by herbarium records predicted a native range that generally extends more southward into the drier regions in Sahara, on the Arabian Peninsula and in central Asia (green areas in Fig. 3), whereas models trained by combined records in North America predicted more coverage in Turkmenistan (yellow areas in Fig. 3), which has drier and colder winters than regions further south. The models predict that Sahara mustard is likely to occur on its native continents in areas with mean temperature of the coldest quarter between –2.3 and 20.6°C, mean temperature of the warmest quarter between 14.8 and 36.7°C, precipitation of the coldest quarter between 0 and 449 mm, and annual precipitation between 0 and 782 mm (Supplementary material Appendix 4, Fig. A5). There are some mismatches between the model projected and the recorded native range. No records suggest the species’ presence in the Nejd region of Saudi Arabia and in Turkmenistan. But our models, even those more influenced by points in the core invaded range predicted its presence in Figure 2. Distribution of Sahara mustard in North America predicted by climate-based SDMs. One model used only herbarium records (red dots) and the other, herbarium and invasive plant survey records (blue triangles) combined. Each ensemble shows the area predicted by both models (peach) and by each model alone (green or yellow). An additional small area predicted by the model in Dixon County, Florida is not shown. 316 Figure 3. Projected distribution of Sahara mustard in its native range using SDMs based on the climate in its invaded range in North America. Striped areas represent its native range recorded in the literature and the GBIF database. those regions. None of the models predicted its presence in Burkina Faso, but the species has been found in this country according to GBIF. Discussion Climatic constraints on the spread of Sahara mustard and their spatial extent Our results indicate that the spread of Sahara mustard in North America is indeed spatially scale-dependent and that climate is limiting its spread at regional scales. The expansion of the species at scales beyond 100 km declined to a negligible level in the 2000s (Fig. 1), suggesting a cessation of further expansion of its range boundaries at those scales. The current boundaries of its regional distribution largely agree with those predicted by climate based SDMs (Fig. 2), supporting the role of climate in restricting the spread of the species at those large scales (100–500 km). This scale-dependent pattern in our study supports the common view that long-term climate shapes species distributions at relatively large scales. Although this is not a surprising finding, our method is unique in its ability of defining a quantitative extent of the scales of influence. Of course, we cannot rule out the possibility that the expansion is also limited by other unexamined factors that vary strongly at the 100–500 km scales. Our results also reveal that any constraints imposed by long-term climate vanish at smaller scales. Rapid expansion continued at 5–50 km scales in the 2000s with a rate generally higher than in the 1970s–1990s (Fig. 1). Our climate based SDMs indicate ample space for the species to further occupy inside its regional climatic envelope, suggesting that long-term climate is not limiting the species’ expansion at local to intermediate scales. The influence of dispersal on the spread of Sahara mustard Dispersal is generally expected to drive rates of expansion in invasive species (Kot et al. 1996, Phillips et al. 2010), and Sahara mustard in particular appears to have reached its climatic limits rapidly as a result of long-distance movement. At the 500 km scale, aside from the less reliable estimate in the 1960s, the expansion rate was consistently high in both the 1970s and the 1980s, supporting the view that long distance dispersal at the scale of hundreds of kilometers allowed the species to quickly reach its regional climatic envelope in merely a few decades. Moreover, expansion rates at larger scales were often greater than or equivalent to those at smaller scales between the 1960s and 1980s, suggesting long distance movement that did not rely on sequential population establishment along a shorter dispersal path. Those dispersal leaps could allow Sahara mustard to rapidly extend its range over spatially heterogeneous environments, skipping hostile territories and paving the way for future space filling within its range limit. Its ongoing rapid expansion at the 5–50 km scales also suggests low dispersal limitation at those smaller scales. Processes that drive the spatiotemporal dynamics of Sahara mustard expansion The expansion of an invasive species has three phases: a quiescent period between initial colonization and rapid range expansion (the lag phase), a period of rapid range expansion (the spreading phase), and a stage of little further expansion (the ceased phase) (Pysek and Hulme 2005). Scale dependency of invasive spread means that any of the three phases may occur simultaneously when observed at different spatial scales (Pysek and Hulme 2005, Arim et al. 2006). In our case, the expansion of Sahara mustard in the 1960s–1980s was in the spreading phase across all scales. In the 2000s, the species was in the spreading phase at the 5–50 km scales but in the ceased phase at larger scales. Climatic constraints can explain the ceased phase, while the availability of fine scale niche space allowed the species to further expand at smaller scales. Our results also showed that Sahara mustard had experienced two distinctive lag phases. The first lag phase occurred from the 1920s to the 1940s, during which there was a 14 yr gap between its first and second recorded occurrence (1927 and 1941). The second lag phase occurred in the 1990s, during which expansion was limited across all scales. An invasive species can experience multiple lag phases due to different causes (Wangen and Webster 2006). In our case, the first lag phase may reflect the time required by Sahara mustard to overcome the problems commonly faced by small founding populations such as loss of genetic diversity, Allee effects, and demographic stochasticity (Crooks and Soulé 1999, Sakai et al. 2001). But it might also reflect the time between the first introduction and a later introduction of a more invasive form of the species. There is some anecdotal evidence that the first introduced form of Sahara mustard, which can still be found near Mecca, California, is smaller and more gracile than those spreading in North America and is more specialized in disturbed soils (M. Dimmit pers. comm.). The second lag phase, during which the species was well established in North America, was most likely caused by factors other than those that hindered its early spread. A likely candidate is a period of less-favorable environmental conditions, such as low water availability (a key resource for desert annuals). We examined whether reduced cold (growing) season precipitation was associated with this lag period. Our analysis shows that the cold season precipitation averaged over the 1990s was in fact above the long-term mean in areas where Sahara mustard achieved low (but positive) local-scale expansion (Supplementary material Appendix 5), contradicting a simple explanation of low resource availability. Plasticity vs adaptation as a process for successful invasion An introduced species can become a widespread invader by either having low genetic diversity and a ‘general purpose’ genotype or rapidly adapting to its various new environments (Parker et al. 2003). A species with a general purpose genotype tolerates a wide range of environments and can grow in a multitude of climates and habitats (Baker 1965). Sahara mustard, which rapidly spread to cover a broad range of climatic and habitat conditions is a strong candidate for carrying such a plastic genotype. Sahara mustard reproduces primarily by self-fertilization, which is a strategy often associated with maintaining a general purpose genotype (Baker 1965), and limiting the capacity of new populations to rapidly adapt to local conditions (Barrett et al. 2008). Those ideas are supported by the general agreement between the climatic conditions over its native and invaded range, suggesting that the species’ strategies for coping with climatic variation are through well-preserved plasticity rather than novel adaptation. Therefore, its invasion may reflect niche conservatism (Wiens et al. 2010). Nevertheless, just how much self-fertilization limits Sahara mustard’s genetic variability is an open question. Its native populations in Iraq were found to retain the potential for outcrossing (Al-Shehbaz 1977). Used as female parents, Sahara mustard was also able to artificially hybridize with other Brassicaceae species (Hinata et al. 1975). These opportunities for genetic recombination leave open the possibility that traits may evolve in invading Sahara mustard, with potential impacts on its distribution. Such genetic changes are particularly likely if repeated introductions are made from different parts of the native range, introducing new adaptive variation into the invasion (Ellstrand and Schierenbeck 2000, Dlugosch and Hays 2008, Dlugosch and Parker 2008), though in this case Sahara mustard appears to already have invaded most habitats that reflect the breadth of climates occupied in its native range. Management implication for Sahara mustard and other invasive species The climatic limits on the regional expansion of Sahara mustard and the species’ continuing spread at 5–50 km scales suggest that control efforts should focus on containing ongoing local expansion. Our SDM prediction highlights areas where climate provides ample room for local-scale expansion of the species. Prevention, early detection and eradication should focus on those high risk areas to slow down its local expansion. Nevertheless, regional scale monitoring is needed 317 to detect any new shifts of its range boundaries under the changing global climate. The rapid spread of this species is fueled by effective long distance dispersal, which may be carried out by vehicle transportation. Early detection efforts should focus on suitable habitats with strong linkage, such as those accessible by roads. Strength and novelty of our approach Using widely available data from herbarium collections, the box-counting method successfully revealed the scale dependent spread of an invasive species, whose invasion history was not closely monitored. The method can be readily adopted to study the scale dependent expansion of many other species, for which the only data for interpreting range dynamics are from long term herbarium collections. Some studies have used the box-counting method to assess the spatial structure of species distribution across multiple spatial scales (Foxcroft et al. 2009, Veldtman et al. 2010, Akasaka et al. 2012), but all have focused only on the current distribution and neglected the temporal range dynamics. Our work presents the first case of applying the box-counting method to studying scale-dependent range expansion. Our analysis effectively uncovered the scale dependency of an invasive spread. Quantifying such scale dependency and its contributing factors not only fosters understanding of species distribution but is also essential for effective control of any biological invasion. By highlighting critical scales where an invasion is contained or unconstrained due to specific factors, one can direct the management efforts to the appropriate range and scales where invasion is to continue. However, so far few studies have examined expansion rate of invasive species using multi-scale approaches (Lonsdale 1993, Pysek et al. 2008). Those that did use the approach limited their investigation to a very few discrete scales (e.g. local to regional to continental scale in Pysek et al. 2008) with a large range of intermediate scales missing in-between. Failing to assess a continuous range of scales can result in missing critical scales where major change in the expansion rate takes place. Due to these limitations, we have poor knowledge of the prevalence and general pattern of the scale dependency of invasive expansion. Less do we know the factors that lead to such scale dependency (Pysek and Hulme 2005). The methods demonstrated in our study provide a powerful tool for filling this research gap in order to better inform the management of many invasive species. Caveats of our approach Herbarium collections provide inexpensive and widely available long-term data for inferring species distributions. However, using those data can be challenging because they were not collected with the intention to discover multi-scale distribution patterns. We have incorporated in our methods means to reduce the influence by potential sampling biases. Nevertheless, several sources of error may still influence this type of analysis. First, many of the herbarium collection records have imprecise or inaccurate geographic coordinates. We showed 318 that such errors at smaller scale could quantitatively impact estimates of expansion at larger scales, though they would be unlikely to alter patterns qualitatively. Second, various spatial sampling biases could affect our analysis. Our method used the ‘expansion’ rate of native species with assumed stable range to correct for difference in spatial sampling efforts between decades. However, if the nature of spatial sampling bias is different between invasive and native species, our method will not be able to adequately correct the bias specific to the invader. For instance, if herbarium collectors paid more attention to a rapidly expanding invasive species more recently but meanwhile consistent attention to the natives, our method will overestimate the expansion rate for the invasive. It is also possible that collectors might avoid collecting species in the same local area over time. This sampling bias does not necessarily affect the accuracy of our estimates because our method relies on measuring the change in total number of occupied cells in each time step rather than tracking the geographic movement of those cells. Making a first collection in one cell and the second in a different cell will not change the total number of occupied cells between the two time steps and thus will not change the estimated expansion rate at the scale represented by those cells. However, such method will affect our estimates if it forces multiple collections to concentrate in one cell at one time and to diverge to different cells at another time. Also, if collectors specifically avoid returning to the same places, those locations will not be recorded in the following decade. Therefore, there may be a smaller number of occupied cells than there should be, leading to an underestimate of the expansion rate. Our method would not be able to fully correct the bias, if such bias is different between the invasive and the native collections. Our method assumes equal sampling efforts over space. It is evident that few records of Sahara mustard exist in Mexico and southern Central Valley of California despite the prediction of high likelihood of its presence by our SDMs (Fig. 2). Scarcity of records in Mexico may be a direct result of lower collection effort or the lack of digitally available data from Mexican herbaria. Scarcity of records in southern Central Valley of California might be due to limited habitat. Most of the valley has been developed for industrial scale farming. Sahara mustard might lack habitat there and herbarium collectors might avoid this region due to the lack of natural plant habitat. A search for all specimens of Asteraceae and Brassicaceae recorded in California Consortium of Herbaria show that the number of records from the three counties in the valley (Kern, Merced and San Joaquin County) was orders of magnitude lower than those in adjacent counties with more abundant natural plant habitat. Any large disparity in sampling efforts between regions will result in biased estimates of expansion rate across the regions. In our case, we know very little how the historical and current distribution of Sahara mustard in Mexico and Central Valley, California would have affected the estimated expansion rate across North America. Fortunately, the majority of its predicted range was relatively well sampled by herbarium collectors (Fig. 2). Spatial sampling bias will also affect the prediction by MaxEnt SDMs, which assume equal sampling efforts over the entire area of interest (Phillips et al. 2009). If herbarium records are spatially biased towards better surveyed area for the species (e.g. areas where an invasive species is a serious concern), less surveyed areas will have less influence on the SDMs, resulting in higher omission errors in those areas. Indeed, our SDMs using southwestern North America as the background fail to predict the presence of Sahara mustard in some of its recorded eastern and southern range boundaries. Fortunately, our models using North America as the background do include those areas as potential range for Sahara mustard (Supplementary material Appendix 4, Fig. A3). The latter models use a larger background with a wider range of values in explanatory variables. Values from the recorded range boundaries become less extreme in models using this enlarged background. An alternative solution to minimize such problem caused by spatial sampling bias is using spatial filtering to reduce the number of occurrence records in oversampled areas (Kramer-Schadt et al. 2013). Finally, our analysis only indirectly infers the limiting factors that drive the scale dependency of invasive range expansion. Our methods look for correlation between certain ecological factors and observed scale dependency of range expansion. It provides useful insight for designing future studies that can directly test mechanisms that drive or limit range expansion through manipulative experiments conducted at various scales. Conclusions In this study we have demonstrated the validity of a novel methodology that can be broadly applied to uncover the scale dependency of invasive range expansion and its contributing ecological and evolutionary factors. Large online herbarium databases provide free and accessible data to which the box-counting method can be applied to quantify scale-dependent spread of many invasive plant species (after adjusting for variation in sampling efforts). Coupling this multi-scale analysis with MaxEnt SDMs, one can effectively determine on what scales the variables chosen in the models are limiting or contributing to the spread. Controlling efforts can then focus on the scales where the invasion is not constrained and areas where ecological factors are still favoring spread. Moreover, using SDMs based on variables in the invasive range to project its native distribution allows inference of the adaptation of an invasive species to novel environmental conditions. Such knowledge is essential for predicting the risk of further invasion powered by evolution. Acknowledgements – John Donoghue II, Irena Simova and Jin Wu guided us to cross the mysterious terrain of ArcGIS. Naia MoruetaHolme circled many MaxEnt pitfalls that might have trapped us. Cal-IPC and Cameron Barrows shared important data of Sahara mustard recorded in the Californian deserts. Mara MacKinnon and Kimberly Byers spent months finding the latitude and longitude coordinates of hundreds of herbarium records. We thank Cameron Barrows, Aaryn Olsson, and Peter Chesson, who inspired critical improvement on the manuscript. We also thank all the herbarium collectors for documenting the history of Sahara mustard invasion. Funding was proved to YML by the US Marine Corps Air Stations, Yuma and Luke Air Force. References Akasaka, M. et al. 2012. Invasive non-native species attributes and invasion extent: examining the importance of grain size. – J. Veg. Sci. 23: 33–40. Al-Shehbaz, I. A. 1977. Protogyny in the Cruciferae. – Syst. Bot. 2: 327–333. Arim, M. et al. 2006. Spread dynamics of invasive species. – Proc. Natl Acad. Sci. USA 103: 374–378. Baker, H. G. 1965. Characteristics and modes of origin of weeds. – In: Baker, H. G. and Stebbins, G. L. (eds), The genetics of colonizing species. Academic Press, pp. 147–169. Bangle, D. N. et al. 2008. Seed germination of the invasive plant Brassica tournefortii (Sahara mustard) in the Mojave Desert. – West. N. Am. Nat. 68: 334–342. Barrett, S. C. H. et al. 2008. Plant reproductive systems and evolution during biological invasion. – Mol. Ecol. 17: 373–383. Barrows, C. W. et al. 2009. Effects of an invasive plant on a desert sand dune landscape. – Biol. Invasions 11: 673–686. Baskin, C. C. and Baskin, J. M. 2006. The natural history of soil seed banks of arable land. – Weed Sci. 54: 549–557. Brown, J. H. et al. 1996. The geographic range: size, shape, boundaries, and internal structure. – Annu. Rev. Ecol. Syst. 27: 597–623. Catford, J. A. et al. 2009. Reducing redundancy in invasion ecology by integrating hypotheses into a single theoretical framework. – Divers. Distrib. 15: 22–40. Caughley, G. et al. 1987. Kangaroos and climate: an analysis of distribution. – J. Anim. Ecol. 56: 751–761. Crooks, J. A. and Soulé, M. E. 1999. Lag times in population explosions of invasive species: causes and implications. – In: Sandlund, O. T. et al. (eds), Invasive species and biodiversity management. Kluwer, pp. 103–125. Davis, M. B. and Shaw, R. G. 2001. Range shifts and adaptive responses to Quaternary climate change. – Science 292: 673–679. Delisle, F. et al. 2003. Reconstructing the spread of invasive plants: taking into account biases associated with herbarium specimens. – J. Biogeogr. 30: 1033–1042. Dlugosch, K. M. and Hays, C. G. 2008. Genotypes on the move: some things old and some things new shape the genetics of colonization during species invasions. – Mol. Ecol. 17: 4583–4585. Dlugosch, K. M. and Parker, I. M. 2008. Invading populations of an ornamental shrub show rapid life history evolution despite genetic bottlenecks. – Ecol. Lett. 11: 701–709. Elith, J. et al. 2010a. The art of modelling range-shifting species. – Methods Ecol. Evol. 1: 330–342. Elith, J. et al. 2010b. A statistical explanation of MaxEnt for ecologists. – Divers. Distrib. 17: 43–57. Ellstrand, N. C. and Schierenbeck, K. A. 2000. Hybridization as a stimulus for the evolution of invasiveness in plants? – Proc. Natl Acad. Sci. USA 97: 7043–7050. Estrada-Villegas, S. et al. 2011. Climate, habitat, and species interactions at different scales determine the structure of a Neotropical bat community. – Ecology 93: 1183–1193. Foxcroft, L. C. et al. 2009. Patterns of alien plant distribution at multiple spatial scales in a large national park: implications for ecology, management and monitoring. – Divers. Distrib. 15: 367–378. Gaston, K. J. 2009. Geographic range limits: achieving synthesis. – Proc. R. Soc. B 276: 1395–1406. Greig-Smith, P. 1979. Pattern in vegetation. – J. Ecol. 67: 755–779. Hijmans, R. J. et al. 2005. Very high resolution interpolated climate surfaces for global land areas. – Int. J. Climatol. 25: 1965–1978. Hinata, K. and Nishio, T. 1980. Self-incompatibility in Crucifers. – In: Tunoda, S. et al. (eds), Brassica crops and wild allies: 319 biology and breeding. Japan Scientific Societies Press, pp. 223–234. Hinata, K. et al. 1975. Interspecific crossability in the tribe Brassicaceae with special reference to the self-incompatibility. – Tohoku J. Agric. Res. 25: 58. Holt, R. D. 2003. On the evolutionary ecology of species’ ranges. – Evol. Ecol. Res. 5: 159–178. Holt, R. D. and Barfield, M. 2011. Theoretical perspectives on the statics and dynamics of species’ borders in patchy environments. – Am. Nat. 178: S6–S25. Holt, R. D. et al. 2005. Theories of niche conservatism and evolution. – In: Sax, D. F. et al. (eds), Species invasions: insights into ecology, evolution, and biogeography. Sinauer Associates, pp. 259–290. Kinlan, B. P. and Hastings, A. 2005. Rates of population spread and geographic range expansion: what exotic species tell us. – In: Sax, D. F. et al. (eds), Species invasions: insights into ecology, evolution, and biogeography. Sinauer Associates, pp. 381–419. Kirkpatrick, M. and Barton, N. H. 1997. Evolution of a species’ range. – Am. Nat. 150: 1–23. Kot, M. et al. 1996. Dispersal data and the spread of invading organisms. – Ecology 77: 2027–2042. Kramer-Schadt, S. et al. 2013. The importance of correcting for sampling bias in MaxEnt species distribution models. – Divers. Distrib. 19: 1366–1379. Lonsdale, W. M. 1993. Rates of spread of an invading species – Mimosa pigra in northern Australia. – J. Ecol. 81: 513–521. MacArthur, R. 1972. Geographical ecology: patterns in the distribution of species. – Harper and Row. Marie, R. 1965. Flore de L’Afrique du Nord. – Paul Lechevalier, Paris. Mandelbrot, B. B. 1983. The fractal geometry of nature. – W. H. Freeman. Marushia, R. G. et al. 2012. Phenology, growth, and fecundity as determinants of distribution in closely related nonnative taxa. – Invasive Plant Sci. Manage. 5: 217–229. McGill, B. J. 2010. Matters of scale. – Science 328: 575–576. Miller, A. G. and Cope, T. A. 1996. Flora of the Arabian Peninsula and Socotra. – Edinburgh Univ. Press. Minnich, R. A. and Sanders, A. C. 2000. Brassica tournefortii Gouan. – In: Bossard, C. C. et al. (eds), Invasive plants of California’s wildlands. Univ. of California Press, pp. 68–72. Morse, D. R. et al. 1985. Fractal dimension of vegetation and the distribution of arthropod body lengths. – Nature 314: 731–733. Parker, I. M. et al. 2003. An evolutionary approach to understanding the biology of invasions: local adaptation and generalpurpose genotypes in the weed Verbascum thapsus. – Conserv. Biol. 17: 59–72. Pearson, R. G. and Dawson, T. P. 2003. Predicting the impacts of climate change on the distribution of species: are bioclimate Supplementary material (Appendix ECOG-00722 at ⬍www. ecography.org/readers/appendix⬎). Appendix 1–5. 320 envelope models useful? – Global Ecol. Biogeogr. 12: 361–371. Phillips, S. J. et al. 2006. Maximum entropy modeling of species geographic distributions. – Ecol. Model. 190: 231–259. Phillips, S. J. et al. 2009. Sample selection bias and presence-only distribution models: implications for background and pseudoabsence data. – Ecol. Appl. 19: 181–197. Phillips, B. L. et al. 2010. Evolutionarily accelerated invasions: the rate of dispersal evolves upwards during the range advance of cane toads. – J. Evol. Biol. 23: 2595–2601. Pysek, P. and Hulme, P. E. 2005. Spatio-temporal dynamics of plant invasions: linking pattern to process. – Ecoscience 12: 302–315. Pysek, P. et al. 2008. Comparing the rate of invasion by Heracleum mantegazzianum at continental, regional, and local scales. – Divers. Distrib. 14: 355–363. Ricklefs, R. E. 2005. Taxon cycles: insights from invasive species. – In: Sax, D. F. et al. (eds), Species invasions: insights into ecology, evolution, and biogeography. Sinauer Associates, pp. 165–199. Ritchie, M. 1998. Scale-dependent foraging and patch choice in fractal environments. – Evol. Ecol. 12: 309–330. Root, T. 1988. Environmental factors associated with avian distributional boundaries. – J. Biogeogr. 15: 489–505. Sakai, A. K. et al. 2001. The population biology of invasive species. – Annu. Rev. Ecol. Syst. 32: 305–332. Sexton, J. P. et al. 2009. Evolution and ecology of species range limits. – Annu. Rev. Ecol. Evol. Syst. 40: 415–436. Trader, M. R. et al. 2006. Seed production by the non-native Brassica tournefortii (Sahara mustard) along desert roadsides. – Madrono 53: 313–320. Townsend, C. C. and Guest, E. 1980. Flora of Iraq. – Ministry of Agriculture and Agrarian Reform, Baghdad. Veldtman, R. et al. 2010. Using scale–area curves to quantify the distribution, abundance and range expansion potential of an invasive species. – Divers. Distrib. 16: 159–169. Wangen, S. R. and Webster, C. R. 2006. Potential for multiple lag phases during biotic invasions: reconstructing an invasion of the exotic tree Acer platanoides. – J. Appl. Ecol. 43: 258–268. Wethey, D. S. 2002. Biogeography, competition, and microclimate: the barnacle Chthamalus fragilis in New England. – Integr. Comp. Biol. 42: 872–880. Whittaker, R. J. et al. 2001. Scale and species richness: towards a general, hierarchical theory of species diversity. – J. Biogeogr. 28: 453–470. Wiens, J. A. 1989. Spatial scaling in ecology. – Funct. Ecol. 3: 385–397. Wiens, J. J. et al. 2010. Niche conservatism as an emerging principle in ecology and conservation biology. – Ecol. Lett. 13: 1310–1324. Zohary, M. et al. 1980. Conspectus Florae Orientalis: an annotated catalogue of the flora of the Middle East. – The Israel Academy of Sciences and Humanities, Jerusalem.
© Copyright 2026 Paperzz