International Journal of Artificial Life Research, 3(3), 41-63, July-September 2012 41 Box-Counting Dimension of Fractal Urban Form: Stability Issues and Measurement Design Shiguo Jiang, Department of Geography, The Ohio State University, Columbus, OH, USA Desheng Liu, Department of Geography and Department of Statistics, The Ohio State University, Columbus, OH, USA ABSTRACT The difficulty to obtain a stable estimate of fractal dimension for stochastic fractal (e.g., urban form) is an unsolved issue in fractal analysis. The widely used box-counting method has three main issues: 1) ambiguities in setting up a proper box cover of the object of interest; 2) problems of limited data points for box sizes; 3) difficulty in determining the scaling range. These issues lead to unreliable estimates of fractal dimensions for urban forms, and thus cast doubt on further analysis. This paper presents a detailed discussion of these issues in the case of Beijing City. The authors propose corresponding improved techniques with modified measurement design to address these issues: 1) rectangular grids and boxes setting up a proper box cover of the object; 2) pseudo-geometric sequence of box sizes providing adequate data points to study the properties of the dimension profile; 3) generalized sliding window method helping to determine the scaling range. The authors’ method is tested on a fractal image (the Vicsek prefractal) with known fractal dimension and then applied to real city data. The results show that a reliable estimate of box dimension for urban form can be obtained using their method. Keywords: Box-Counting Method, Fractal Dimension, Pseudo-Geometric Sequence, Scaling Range, Sliding Window Method, Urban Form 1. INTRODUCTION Fractal dimension is a useful landscape metric in that it can capture the irregularity and complexity of landscape patterns (Hargis et al., 1998; Herold et al., 2005; Imre & Bogaert, 2004; Pincheira-Ulbrich et al., 2009). The usefulness of fractal in characterizing urban form is shown by various researchers (Batty, 1985; Batty & Longley, 1994; Benguigui et al., 2000; Frankhauser, 1994; Thomas et al., 2008; Thomas et al., 2007). Mostly inspired by the studies on coast line (Mandelbrot, 1967; Richardson, 1961), early studies of urban form mainly focus on city boundaries (Batty & Longley, 1987; Batty & Longley, 1988; Longley & Batty, 1989a, 1989b). Later fractal urban form studies extend to urban surface, i.e., urban land use (Batty & Longley, 1994; Benguigui et al., 2000; Shen, 2002; Thomas et al., 2008; White & DOI: 10.4018/jalr.2012070104 Copyright © 2012, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 42 International Journal of Artificial Life Research, 3(3), 41-63, July-September 2012 Engelen, 1993). It seems to be widely accepted that urban boundary as well as urban surface are both fractals at least in certain stages (Batty & Longley, 1994; Batty & Xie, 1996; Benguigui et al., 2000). Fractal analysis of urban form relies heavily on the calculation of fractal dimension – the main scaling exponent to describe a fractal set. The fractal dimension of a deterministic fractal (e.g., the Vicsek fractal) can usually be estimated analytically. However, the dimension of a stochastic fractal (e.g., urban form) needs to be estimated numerically. Three numerical methods are popular in the research community: the perimeter-area relation method (Batty & Longley, 1988; Batty & Longley, 1994), the area-radius method (Frankhauser, 1994; White & Engelen, 1993), and the box-counting method (Benguigui et al., 2000; Lu & Tang, 2004; Shen, 2002). Due to its simple algorithm and equal effectiveness to point sets, linear features, areas, and volumes, the box-counting method enjoys a wide popularity across various disciplines such as physics (Lovejoy et al., 1987), earth sciences (Walsh and Watterson 1993), biology (Foroutan-pour et al., 1999), ecology (Halley et al., 2004), and urban studies (Benguigui et al., 2000; Feng & Chen, 2010; Lu & Tang, 2004; Shen, 2002; Verbovsek, 2009). In the case of urban studies, box-counting dimension is an indicator of compactness for the distribution of built-up areas. Despite its popularity in the research community, several issues of the boxcounting method remain unsolved. The first issue is concerned with the ambiguities in setting up a proper box cover of the object. This issue has two aspects. The first aspect is related to the shape of the grids and boxes in the box-counting method. Theoretically, the shape of the grid and box does not influence the estimate of the box dimension. However, in practice, the object does not have infinite detail, thus different covering schemes may lead to different estimates. For simplicity of practical calculation, the conventional boxcounting method is performed based on square boxes (Shen, 2002; Verbovsek, 2009). Although the square box is widely used, it is not necessarily the only caliber. The choosing of square boxes cannot always efficiently cover the object. The second aspect is more related to the analysis of urban form. Due to the scale-free characteristics of urban form, it is difficult to find the exact boundary of a city and there is no agreement on a theoretically correct definition of urban boundary (Benguigui et al., 2000; Berry et al., 1968). In urban studies, we often need to make a subjective decision about the city boundary. As we will illustrate later, the fractal dimension of the same city changes with the study area. Therefore, the box dimension of a city should be given along with the study area. We should be cautious in comparing the fractal dimension of different cities as the calculated dimension may not be comparable. The second issue in the box-counting method is the problem of dimension estimation due to the limited number of data points for regression (Pruess, 1995). The issue comes from the use of a dyadic sequence. Box size approaches zero quickly and thus provides only a few data points for regression. Take the unit box size as level one, and the box size changes as 1/2, 1/4… In the 10th level of box division, the box size is 1/29, and the total number of boxes is 218. This is the lower bound for box division in most literature, which provides 10 data points of box size (Benguigui et al., 2000; Grau et al., 2006; Lu & Tang, 2004). Besides the dyadic sequence, other choices have been proposed, such as the odd number sequence (Bisoi and Mishra 2001; Chen et al., 1993), modified arithmetic sequence (Buczkowski et al., 1998; Foroutan-pour et al., 1999), and modified dyadic sequence (Shen, 2002). The third issue is the difficulty in determining the scaling range of box sizes over which the fractal analysis is applied (Saucier & Muller, 1998). It is widely accepted that fractal properties only exist on a certain scaling range for stochastic fractals (Goodchild, 1980; Lam & Quattrochi, 1992; Roy et al., 2007). However, there is no generally agreed method to specify the upper and lower bounds of the scaling range (Foroutan-pour et al., 1999; Huang et al., 1994; Liebovitch & Toth, 1989). Most studies suggest Copyright © 2012, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. International Journal of Artificial Life Research, 3(3), 41-63, July-September 2012 43 that the largest box size used should be no larger than one half or one fourth of the shorter side of the image. When the box size is large, all the boxes are likely to be occupied, producing a slope of 2 on the scatter plot (Foroutan-pour et al., 1999; Shen, 2002). However, the empirical choice of this upper bound has no sound justification. As to choosing the lower bound of the box size, there are mainly five methods in the literature. 1. Use the image resolution as the lower bound without considering the scale break (Benguigui et al., 2000). 2. Eliminate or add the data points of the box size one by one to observe the best fit (Foroutan-pour et al., 1999). 3. Use a sliding window method to calculate the local fractal dimension (Brewer & Di Girolamo, 2006; Dubuc et al., 1989). For example, we can compute the fractal dimension for the first 5 values of box size, then for 2 to 6, and so forth. Thus we can find the lower bound that will result a stable estimation of fractal dimension. 4. Use the condition ds / dr → 0 to get the cutoff lower bound of the box size (Roy et al., 2007). Here, s is the standard deviation from a linear regression equation fitted to log(N) versus log(r) with data for r < rcutoff sequentially excluded. However, this method is not well justified because the best fit does not necessarily guarantee the best estimate (Huang et al., 1994). 5. Use configuration entropy (Andraud et al., 1997) to determine the lower bound of the box size in multifractal analysis (Dathe et al., 2006; Tarquis et al., 2006). The three issues discussed are common in numerical analysis of stochastic fractals and lead to an unstable estimate of fractal dimension. For example, the calculations of the same fractal set by different people often result in different fractal dimensions (Huang et al., 1994; Hunt, 1990; Li et al., 2009; Sarkar & Chaudhuri, 1994). Although various methods have been proposed to search for reliable calculations (Buczkowski et al., 1998; Foroutan-pour et al., 1999; Li et al., 2009; Sarkar & Chaudhuri, 1994; Walsh & Watterson, 1993), there still lacks a systematic exploration of the problems. In this paper, we thoroughly address these issues and provide our solutions. By using rectangular grids and boxes with efficient cover of the object, we propose a method to estimate the fractal dimension of urban form. We develop an algorithm to create a pseudo-geometric sequence which provides enough data points for the dimension estimation. We also develop a generalized sliding window method to determine the upper and lower bound of the scaling range (box size). Our method is first tested on a fractal image (the Vicsek prefractal) with known dimension. It is then applied in analyzing the fractal urban form of Beijing City. In most literature, only the point estimate of fractal dimension is given. However, empirical determination of a particular quantity is not error free (Taylor, 1997). The range of the measurement quantity should be given along with the point estimate. In this paper, we will compare the results of our method with those from traditional method in two aspects: the point estimate and the measurement range (95% confidence interval). The rest of this paper is organized as follows. Section 2 is a simple introduction of the data used in this study. Section 3 introduces the box-counting method and our modification. Section 4 presents the results of the calculation. Section 5 discusses the results as well as the problems in fractal urban analysis. Section 6 concludes the paper. For convenience, we will treat urbanized area, city, and urban form as interchangeable terms in this paper. 2. DATA 2.1. Prefractal Image with Known Dimension We estimate the fractal dimension of the Vicsek prefractal to test our method. The Vicsek fractal (also known as Vicsek snowflake or box fractal; Vicsek, 1983) is generated by decomposing a Copyright © 2012, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 44 International Journal of Artificial Life Research, 3(3), 41-63, July-September 2012 basic square into nine smaller squares in the 3-by-3 grid. The basic square is called the initiator (Figure 1(a)). Number the nine squares as 1, 2…9, from left to right and from top to bottom. Squares 2, 4, 6, 8 are removed, and squares 1, 3, 5, 7, 9 are left. The result of this procedure is also called the generator (Figure 1(b)), because it specifies a rule that is used to generate a new form. Repeat the process recursively for each of the five remaining sub-squares and the Vicsek fractal is obtained at the limit of this procedure. In practice, due to the limitation of computer power and resolution, the fractal object is usually generated using finite number of iterations, say the nth iteration, the result of which is called prefractal (Feder, 1989). For convenience, we denote the size of the squares in the last iteration (i.e., nth) as unit square whose size is 1 (unit one). In remote sensing literature, the unit square is generally referred to as image pixel. The size of the initiator is thus 3n. Figure 1 shows the result of three iterations. The size of the initiator is 33 = 27. Vicsek fractal is a typical deterministic fractal, which is created by a rule of some sort. Properties like the Hausdorff dimension of a deterministic fractal can be accounted accurately. The Hausdorff dimension of the Vicsek fractal is log 5 / log 3 ≈ 1.4649 . 2.2. Real Data of Beijing City As this paper aims at the measurement methods in fractal analysis, we only chose two Landsat-5 TM (WRS-2 Path 123 Row 32) images for demonstration purposes. The acquisition dates for the two images are 3 October 1984 and 29 October 1999. The two images are rectified to a common UTM coordinate system based on 1:50000 topographic maps of Beijing (provided by the National Bureau of Surveying and Mapping). The two images are well aligned with RMSE ≤ 15 meters. Other reference data include an administrative boundary map of Beijing and a district map of Beijing (both provided by the Beijing Municipal Government Planning Commission). Unsupervised classification is performed first, and is improved by intensive human editing aided by supplemental data and field investigation. The final map is validated using land use maps (prepared by the Beijing Municipal Government Planning Commission) and aerial photos with an accuracy of 95.5%. This highly reliable urban map product is used in our study (Figure 2). As discussed before, it is difficult to find the exact boundary of a city. In practice, we can set the study area based on administrative boundary, development of the city, and road system, etc. The ring road system of Beijing serves as a reference to boundary for the city in different periods. The old Beijing before the 1950s was confined in walls around the city. In the 1950~1960s, the walls were demolished and Ring Road 2 was built. With the expansion of the city, Ring Road 3 (completed in 1994), Ring Road 4 (2001), Ring Road 5 (2003), and Ring Road 6 (2009) have been built in the past fifty years. Three study areas are chosen based on ring road system, the administrative boundaries, and the development of the city (Figure 2). Although Ring Road 4, 5, and 6 were completed Figure 1. Three iterations of the Vicsek fractal: (a) the basic square and initiator; (b) the generator and 1st iteration; (c) 2nd iteration; (d) 3rd iteration Copyright © 2012, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. International Journal of Artificial Life Research, 3(3), 41-63, July-September 2012 45 Figure 2. Maps of Beijing City: (a) ring and radial road system: 2~6, ring roads; (b), (c) land cover map in 1984, 1999: 1, region inside the Ring Road 5; 2, urbanized area; 3, region inside Ring Road 6 after 1999, part of them had already been in use before the completion of the whole rings. Therefore, they provide a good reference limit to define the study area. Study Area 1: It is the main part of the region bounded by Ring Road 5. It includes four central districts (Dongcheng, Xicheng, Xuanwu, Chongwen) and the main parts of three inner suburban districts (Haidian, Chaoyang, Fengtai). This is the densely developed and closely connected area of Beijing before 1990s. Study area 1 has a size of 29*25 =725km2. Study Area 2: It is based on the traditional “Cheng Baqu” (i.e., eight main districts of Beijing), including four central districts and four inner suburban districts of Beijing: Dongcheng, Xicheng, Xuanwu, Chongwen, Haidian, Shijingshan, Chaoyang, Fengtai. We regard this study area as the urbanized area of Beijing. Study area 2 has a size of 40*39 =1560km2. Study Area 3: It is the region bounded by Ring Road 6. Four outer suburban cities (Mengtougou, Fangshan, Daxing, and Shunyi) are included in addition to study area 2. This is the metropolitan area of Beijing. Study area 3 has a size of 57*54 =3078km2. 3. BOX-COUNTING METHOD AND MODIFICATION 3.1. Traditional BoxCounting Method “Box-counting” is a sampling process to find the complexity, irregularity, and heterogeneity of the object of interest at different scales (Barnsley, 1988). It is a numerical method developed independently by many authors from a description by Mandelbrot (Mandelbrot, 1983; Pruess, 1995). Here we present an intuitive and descriptive definition of the box-counting method. For a precise technical definition, the reader is urged to read the book by Falconer (2003). The basic procedure is to systematically lay a series of grids composed of boxes in decreasing size over an image and then record data (counting) for each successive box size. The objective is to find fractal dimensions characterizing the spatial structure of the object. Box-counting dimension is the dimension calculated based on box-counting method. The procedures of box-counting method can be explained as follows (Figure 3(a)). First, use a square grid of only one box to cover the city image so that the urbanized area of the city is well covered by the grid. The side length of the box, r1, equals the side length of the basic grid, L. Count the number of distinct Copyright © 2012, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 46 International Journal of Artificial Life Research, 3(3), 41-63, July-September 2012 Figure 3. Box-counting methods with different covering grids: (a) square grid with square boxes; (b) rectangular gird with square boxes, L2 = kL1, k is integer; (c) rectangular grid with square boxes, L2 = αL1, α is rational number; (d) rectangular grid with rectangular boxes. (Note: Method (a)-(c) are commonly used in previous literature although they are not consistent with the minimum cover rule of box-counting dimension. In our paper, we used method (d) which provides minimum cover of the object. The box dividing is for demonstration purpose. It does not reach the maximum number of dividing or the pixel resolution level, therefore the boxes does not necessarily either filled completely or completely empty) occupied boxes N1 that can cover the city completely. Here, as we only have one box, N1 = 1. Second, divide the grid by 2×2 to get a grid composed of four boxes (quadrants) with the size r2 =L/2. Count the number of occupied boxes N2. Continue the division process for n iterations. Record the corresponding Nn. As the box size becomes smaller and smaller, the total area Nnrn2 of those occupied boxes should approach the actual urban area closely. The box-counting dimension is then estimated as: D = lim rn → 0 log N n log(1 / rn ) . (1) In practice, the dimension is estimated as the slope of the scatter plot of logNn against log(1/rn). Thus, we estimate the following linear equation, ln N (r ) + D ln(1 / r ) + b1 = 0. (2) Equation (1) defines the capacity dimension (Nayfeh, 1995; Peitgen, 1986; Weisstein, 2003). In practice, we usually start with the smallest box size and increase the box size by sequentially aggregating boxes together. In previous literature, three versions of grid cover and box division method are generally found as follows. 1. Use a square grid to cover the object (Shen 2002). The side length of the grid, L = 2n ε, where ε is the image resolution Copyright © 2012, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. International Journal of Artificial Life Research, 3(3), 41-63, July-September 2012 47 (Figure 3(a)). The smallest box size equals the size of the image resolution, ε. 2. Use a rectangular grid to cover the object, and the length and width of the grid are of integer ratio (Saa et al., 2007). Let the width of the grid, L1 = 2n ε, then the length of the grid L2= kL1, where k is an integer number (Figure 3(b)). 3. Use a rectangular grid to cover the object, and the length and width of the grid are of any rational number (Lu and Tang 2004). Let the width of the grid L1 = 2n ε, then the length of the grid L2 = αL1, where α is a rational number (Figure 3(c)). Although the methods in Figure 3 (a), (b), (c) are widely used, they violate the minimum cover rule of box-counting dimension (Falconer, 2003). The minimum cover rule means that when we measure the fractal dimension, we should use minimum number of open box to cover the object. In the case of box-counting dimension, we should use minimum number of rectangular boxes to cover the object. Strictly speaking, these methods do not provide estimate of box-counting dimension per se. 3.2. Improved Box-Counting Method with Modified Measurement Design Our improved box-counting method can be illustrated in Figure 4. Detailed explanation is introduced in the following three sub-sections 3.2.1-3.2.3. 3.2.1. Grid Dividing with Rectangular Boxes In order to overcome the minimum cover problem in traditional box-counting method, we use rectangular grid and boxes to cover the object (Figure 3(d)). The image preparation and processing is shown as follows. 1. If the source map of the object is of vector format, we can encompass it with a minimum rectangular grid, called basic grid (initiator). Suppose the short side length of the grid is L. Divide the grid into amax by amax rectangular boxes whose size is L/ amax. Here, amax is the number of boxes per row/column. In practice, amax is usually determined by the software threshold of record storage. In our processing, the Create Fishnet ArcToolbox in ESRI ArcGIS is used, which can be accessed through ArcToolbox → Data Management Tools → Feature Class → Create Fishnet (ESRI, 2010). The limit is arrived when the basic grid is divided into 211 × 211 boxes, i.e., amax = 211 = 2048. 2. If the source map of the object is of raster format, there are two equivalent methods to process the map. The first method is to convert the raster image into a vector image and then process it as in (1). The second method is to resample the raster image into amax by amax pixels of rectangular shape. 3.2.2. Pseudo-Geometric Sequence of Box Size The definition of box dimension does not necessarily require a dyadic sequence of box size. Dyadic sequence is a special case of the geometric sequence rn = r1b −n +1 with factor b = 2, where rn is the box size in the nth division, r1 is the largest box size. The use of a geometric sequence (sometimes it is referred to as logarithmic sequence) is well justified in that fractal is a hierarchy with cascade structure. The data points based on a geometric sequence are distributed evenly in the log-log plot. In general, we can use a geometric sequence of any factor, for example, b = 1.05, 1.10, 1.15 … 1.95, 2.00, 3.00, etc. When non-integer factors are used, rounding is applied to the box size, resulting in pseudo-geometric sequences. The general algorithm to derive the pseudo-geometric sequence is as follows. For consistency, we use a1 to represent amax. Suppose the geometric sequence has n elements thus producing n grids. Grid 1 is obtained by dividing the basic grid into a1 × a1 boxes (for vector map) or by resampling the Copyright © 2012, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 48 International Journal of Artificial Life Research, 3(3), 41-63, July-September 2012 Figure 4. Flow chart of the improved box-counting method image into a1 × a1 pixels (for raster data). Each box can be treated as a unit box, or an abstract pixel in general sense. The size of the unit box is, r1 = L , a1 (3) where L is the side length of the basic grid, a1 is the number of boxes per row/column, in our case, a1=2048. Grid i is obtained by grouping a1 × a1 unit boxes into ai × ai blocks. Each block is composed of K i × K i unit boxes which are then aggregated into one large box with size ri = Kir1 (Figure 5). Partial blocks at the boundary (if there are any) are padded with empty boxes (Figure 5 (c)). Ki and ai are determined by Equation (4) and (5), K i = round (b i −1 ) ≤ a1, (4) a ai = ceil 1 , K i (5) Copyright © 2012, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. International Journal of Artificial Life Research, 3(3), 41-63, July-September 2012 49 Figure 5. Generation of example grids by aggregating unit boxes at different levels: (a), grid 1 with unit boxes, a1 = 8, K1 = 1; (b) grid 2, a2 = 4, K2 = 2; (c) grid 3, partial blocks at boundary are padded with empty boxes (dashed boxes), a3 = 3, K3 = 3; (d) grid 4, a4 = 2, K4 = 4. (Note: For the meanings of notations ai, Ki, please refer to the notations below Equation (5).) where Ki is the number of unit boxes per row/column in a block to be aggregated at level i, ai is the number of blocks per row/column with size ri = Kir1, b is the exponent factor, e.g., 1.05, 1.1 … 2, 3 … i is the division level, i = 1, 2 ... n, round(A) function rounds A to the nearest integer, ceil(A) function returns the nearest integer that is greater than or equal to A. Note that both Ki and ai should be integer. The actual sequence is created as follows. First, the raw number of elements in the pseudo-geometric sequence, i.e., the raw number of box size is calculated as log(a ) 1 n = round log(b) (6) Second, calculate Ki using Equation (4). If Ki = Ki+1 =…= Ki+s, only Ki+s is accounted in order to remove the redundant values. The corresponding ai, ai+1 … ai+s-1 calculated from Equation (5) are also dropped. Update the value of n and re-code the sequences as K1, K2 … Kn and a1, ak … an. For example, if m pairs of elements for Ki and ai are removed, the value of n is reduced by m. Third, if aj = aj+1 =…= aj+t, only aj is accounted. The corresponding Kj, Kj+1 … Kj+t-1 are removed. Update the value of n again and re-code the sequences as K1, K2 … Kn and a1, ak … an. Copyright © 2012, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 50 International Journal of Artificial Life Research, 3(3), 41-63, July-September 2012 Fourth, reduce the over coverage. When partial blocks exist at the boundary of the grid as shown in Figure 5(c), the over cover rate (OCR) is defined in Equation (7), OCR = [Ki × ai – a1] / a1 × 100%. (7) If Ki > ceil (a1/ai), replace Ki with ceil (a1/ai). This algorithm minimizes the over cover rate. The above algorithm generates optimal cover of the object using a pseudo-geometric sequence with different factors. Table 1 provides an example of our pseudo-geometric sequence with 20 data points (b = 1.45). If we choose b = 1.05, we get 73 data points, the average over cover rate (AOCR) is 0.42%. 3.2.3. Generalized Sliding Window Method There is no theoretical method to determine the scaling range of stochastic fractals. Among the existing empirical methods, the sliding window method provides a better choice (Brewer & Di Girolamo, 2006; Dubuc et al., 1989). In previous research, the sliding window method was only used to find the lower bound of the scaling range, while taking half or one fourth of the shorter side of the image as the upper bound. It is natural to extend the algorithm to find the upper bound. In the following, we present a generalized sliding window method to find the scaling range. Let Oi be points on the scatter plot, i = 1, 2…n. O1 corresponding to the minimum box size. Let k be the size (length) of the window, k = 3, 4…n. The generalized sliding window method is composed of three ways of sliding windows (Figure 6). The scaling range can be determined by inspecting the the dimension profiles (Figure 7, Figure 9) and scatter plot (Figure 8, Figure 10). The generalized sliding window method includes three parts as follows. Through method (1) we found that the window size should be at least 20. We then identify the lower and upper bound of the scaling range through method (2) and (3). Method (1)-(3) are combined together to obtain the appropriate scaling range. 1. Window Size Fixed, Slide the Window Through Data Points: For window size k, using O1, O2…Ok to fit a straight line, we get dimension Dk,1 and coefficient of determination R2(k,1). Then use O2, O3… Ok+1 to get Dk,2 and R2(k,2). Continue the sliding window until the last k points, On-k+1, On-k+2…On, and the dimension, Dk,n-k+1, R2(k, Table 1. Example pseudo-geometric sequence for box size with 20 data points (b = 1.45) i Ki ai Ki × ai OCR (%) i Ki ai Ki × ai OCR (%) 1 2048 1 2048 0.00 11 41 50 2050 0.10 2 1024 2 2048 0.00 12 28 74 2072 1.17 3 683 3 2049 0.05 13 20 103 2060 0.59 4 512 4 2048 0.00 14 13 158 2054 0.29 5 342 6 2052 0.20 15 9 228 2052 0.20 6 256 8 2048 0.00 16 6 342 2052 0.20 7 171 12 2052 0.20 17 4 512 2048 0.00 8 121 17 2057 0.44 18 3 683 2049 0.05 9 86 24 2064 0.78 19 2 1024 2048 0.00 10 59 35 2065 0.83 20 1 2048 2048 0.00 Note: Box size ri = Kir1. For meanings of notations Ki, ai, ri, please refer to the explanation below Equation (5). Copyright © 2012, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. International Journal of Artificial Life Research, 3(3), 41-63, July-September 2012 51 Figure 6. Three ways of sliding windows: (a) Window size fixed, slide the window through data points; (b) Lower bound fixed, increase window size; (c) Upper bound fixed, increase window size n-k+1) (Figure 6(a)). The scatter plot of Dk,i shows the variability of the dimension with the scaling range of the length k, where i = 1, 2...n-k+1. Figure 7(a)-(b) shows the influence of window size on the estimation of box dimension and R2. As we can see, when the window size is small, the local slope (fractal dimension, Figure 7(a)) and the R2 (Figure 7(a)) vary greatly. In other words, the fractal dimension estimated using small window size is not stable and thus are not likely valid. As the window size increases to 20, the box dimension curve becomes smooth, and the R2 becomes stable. Therefore, we should choose a window size larger than 20 where stable estimation can be obtained. 2. Lower Bound Fixed, Increase Window Size: Start from point Oi, using Oi, Oi+1, Oi+2 to fit a straight line, we get a slope and dimension Di,1. Then use Oi, Oi+1, Oi+2, Oi+3 to get Di,2. Continue increasing window size until all Oi, Oi+1…On are used, and we get the dimension, Di,n-i+1(Figure 6(b)). Figure 7(c) is the dimension profile indicating the variability of dimension across scaling ranges with a fixed lower bound at point Oi. It shows that we can get a stable fractal dimension estimation when the lower bound is 4 and the window size is between 20~30. 3. Upper Bound Fixed, Increase Window Size: Start from point Oj, using Oj, Oj-1, Oj-2 to fit a straight line, we get a slope and dimension Dj,1. Then use Oj, Oj-1, Oj-2, Oj-3 to get Dj,2. Continue increasing window size until all Oj, Oj-1…O1 are used, and we get the dimension, Dj,n-j+1(Figure 6(c)). Figure 7(d) is the dimension profile indicating the variability of dimension across scaling ranges with a fixed upper bound at point Oj. It shows that we can get a stable fractal dimension estimation when the upper bound is 41 and the window size is between 20~30. 3.3. Confidence Interval of the Point Estimate The range of the confidence interval for an empirical estimate is a good index to measure the accuracy of estimation. In this paper, the fractal dimension is estimated through log-linear regression based on points in the scaling range. The confidence interval is defined as D ± tα/2δ. (8) where: D is the point estimate of the fractal dimension, tα/2δ is the margin of error, tα/2 is the t value providing an area of α / 2 in the upper tail of a t distribution with n - 2 degrees of freedom, δ is the standard error of the estimate. Copyright © 2012, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 52 International Journal of Artificial Life Research, 3(3), 41-63, July-September 2012 Figure 7. Dimension profile of the Vicsek prefractal (generated with 5 iterations). Note: (1) The upper two subplots indicate the influence of window sizes: (a) shows the influence on the estimated box dimension, (b) shows the influence on R2. r(min) is the lower bound of the window, and the number labels in the legend of (a) and (b) represent window size. (2) The lower two subplots indicate the influence of lower and upper bound on fractal dimension estimation: (c) shows the influence of lower bound, (d) shows the influence of upper bound.(3) We only show some sample curves here. (4) For better view, please refer to the online digital figure with colors In our method, the 95% confidence interval will be calculated, i.e., α = 0.05 . while the number of boxes (Ni) grows with the order of 5. The fractal dimension can be simply represented as 4. RESULTS 4.1. Results for the Vicsek Prefractal The fractal dimension of the Vicsek prefractal can be estimated using both analytical and numerical methods. The analytical method is based on the rule to generate the Vicsek fractal. The box sizes (ri) decrease with the order of 3, D =− log 5i −i log 3 = log 5 = 1.4649. log 3 In order to examine the method presented in this paper, we give two numerical estimates of the Vicsek prefractal using the box-counting method with different sequences of box sizes. Copyright © 2012, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. International Journal of Artificial Life Research, 3(3), 41-63, July-September 2012 53 Figure 8. Scatter plot for the Vicsek prefractal using two methods: (a) Our method, (b) Traditional method. Note: The Vicsek prefractal is generated with 5 iterations The measuring procedure consists of three steps. First, generate a Vicsek prefractal with n iterations. As defined in Section 2.1, the size of the squares in the nth iteration is 1 (unit one), and the size of the initiator is 3n. Second, generate grids whose box sizes follow a geometric sequence of factor b. In our experiment, two values are tested for factor b: 3 and 1.05. The exact fractal dimension of the Vicsek prefractal can be estimated using a geometric sequence of factor 3 since it is the common ratio of the sizes of the fractal copies in different orders. The other factor, 1.05 is chosen to test our method. When b = 3, box size ri = 3i. When b = 1.05, the box size is determined using the method in Section 3.3. Third, overlay the Vicsek prefractal with the grids, and calculate the number of occupied boxes Ni for each grid. The two sequences ri and Ni are used to estimate the fractal dimension of the Vicsek prefractal. Figure 8 compares the scatter plot for the Vicsek prefractal using two methods. Figure 8(a) shows the result from our method, where the pseudo-geometric sequence with factor b = 1.05 is used, and the scaling range is determined through the generalized sliding window method. Based on the analysis in Section 3.3 and referring to Figure 7, the lower bound of the scaling range is the 4th smallest box size while the upper bound is the 41st largest box size. Therefore, by removing the 40 largest box sizes, and 3 smallest box sizes, the remaining 30 points fitted a line with a slope of D = 1.4669. Figure 8(b) shows the result from the traditional method, where the geometric sequence of factor b = 2 is used, and the scaling range is determined through observing the scatter points. There are 12 points corresponding to box sizes r(i) = 20, 2-1…2-11. The points corresponding to the three largest box sizes (20, 2-1, 2-2) are removed since they fall into a line with the slope of 2. The point corresponding to the smallest box size (2-11) is also removed because it deviates from the other points. The remaining 8 points fitted a line with a slope of D = 1.5199. Table 2 compares the detailed parameters of the two estimates. There are mainly three findings. First, the point estimate of our method (D = 1.4669) is smaller than that of the traditional method (D = 1.5199), and our point estimate is also closer to the real fractal dimension of the Vicsek fractal (D = 1.4649). Second, the 95% confidence interval of our estimate, [1.4496, 1.4841], covers the real value, while that from the traditional method, [1.4936, 1.5463], does not. Third, the 95% confidence intervals of both methods do not overlap with each other, and the range of our estimate (0.0345) is smaller than that of the traditional method (0.0527). The experiment on the Vicsek prefractal shows that our method can provide a more accurate estimate of the real fractal dimension. Copyright © 2012, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 54 International Journal of Artificial Life Research, 3(3), 41-63, July-September 2012 Table 2. Comparing fractal dimension estimates of the Vicsek prefractal D R2 df δ Lower 95% Upper 95% Range Our method 1.4669 0.9990 28 0.0084 1.4496 1.4841 0.0345 Traditional method 1.5199 0.9997 6 0.0108 1.4936 1.5463 0.0527 Note: (1) The Vicsek prefractal is generated with 5 iterations. (2) The true dimension of the Vicsek fractal is 1.4649. (3) Degree of freedom (df) is defined as the number of data points involved in the least squares calculation minus 2. (4) δ is the standard error of the estimate. (5) Range is the difference of the Upper 95% minus Lower 95%. 4.2. Results for Beijing City The scaling range can be obtained by observing the dimension profiles (Figure 9). Similar to Figure 7, when the window size is small, the variability of the box dimension and R2 are both very high. Ideally, the window size should be greater than 20, i.e., at least 20 data points should be used to get a stable estimate of fractal dimension (Figure 9(a), (b)). Figure 9(c) is the dimension profile showing the variability of dimension across scaling ranges with a fixed lower bound. For the first several curves, i.e., curves 1-6, there is large variability of the estimated fractal dimension. Starting from curve 7, there roughly exists a plateau showing a stable dimension when window size is greater than 20. Thus the box size corresponding to the 7th smallest point might be the lower bound of the box size. Figure 9(d) is the dimension profile showing the variability of dimension across scaling ranges with a fixed upper bound at point Oj. Curve 1 is very smooth and there is a plateau of 2 for the first several window sizes as expected. All the curves are smooth if the window size is greater than 20. Starting from curve 31, there is roughly a plateau when window size is greater than 20. This plateau means that the dimension estimation is stable when the upper bound of the box size is the value corresponding to the 31th largest box size. Based on the scatter plot (Figure 10) and dimension profiles (Figure 9), the 7th smallest box size is chosen as the lower bound of the scaling range, and the 31st largest box size is the upper bound of the scaling range. Starting from the 7th sample point, the box dimension has less variability for the first 30 window sizes. Starting from the 31th largest sample point, the box dimension has less variability for the 20~30 window sizes. Figure 10 shows an example of regression on the scaling range (the middle scale range). Table 3 gives the OLS regression results for the three scale ranges as well as that for all the points. There is a significant difference between the dimension on scale range 2 and the dimension on scale range 1 or 3 because their 95% confidence intervals are well separated. The real dimension is the slope of the fitted straight line on scale range 2, i.e., D = 1.672. As expected, the slope of the fitted line on scale range 1 is close to 2 while the slope of the fitted line on scale range 3 is pulled down from 2 by the true dimension. The R2 for all data points is smaller than those on separate scale ranges, although they are all very high. We examine Beijing City in two years for method demonstration purpose. The box dimension of Beijing City is shown in Table 4. 5. DISCUSSION 5.1. Implications for the Beijing City We compared the results from our method with that from the traditional method (Table 4). Similar to the results for the Vicsek prefractal, three main findings can be found from the comparison. First, the point estimates of our method are smaller than those of the traditional method for all study areas. Second, the 95% confidence Copyright © 2012, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. International Journal of Artificial Life Research, 3(3), 41-63, July-September 2012 55 Figure 9. Dimension profile Beijing City (Note: Same as Figure) intervals of both methods do not overlap with each other except study area 1 in 1999. Third, the ranges of 95% confidence intervals from our methods are smaller than those from the traditional method. The box dimensions increase from 1984 to 1999 in all three study areas, which confirms previous research on fractal trend of urban growth (Batty and Longley 1994; Benguigui et al., 2000; Feng and Chen 2010). Beijing is an international city growing with a fast speed. Box dimension is an indicator of the distribution of built-up areas. High box dimension usually indicates high density of a built-up environment and less open space. As is shown in a research by Feng and Zhou (2003), the population of Beijing in different zones increased during 1982-2000 (Table 5). The box dimension decreases with the expansion of the study area. This is true for urban area, as cities usually develop from a center (or several centers) and expand outside. Copyright © 2012, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 56 International Journal of Artificial Life Research, 3(3), 41-63, July-September 2012 Figure 10. Example of the log linear regression on the scale ranges. Note: (1) three scale ranges are given here, denoted as scale range 1, 2, 3, where scale range 2 is the true scaling range. (2) The scatter plot is based on vector data, which are different from that based on raster data (i.e., pixels). When raster data (pixels) is used, the regression line of the data points in scale range 3 will approach zero Table 3. Regression results for three scale ranges Parameter/Statistic Scale Range 1 Scale Range 2 Scale Range 3 All Points D 1.977 1.672 1.804 1.802 R2 0.9999 1.0000* 0.9997 0.9977 δ 0.0044 0.0016 0.0132 0.0102 df 29 33 5 71 Lower 95% 1.968 1.668 1.768 1.782 Upper 95% 1.986 1.675 1.841 1.822 * The real value round to five digits is 0.99997. Note: (1) The calculation is based on vector data, which are different from that based on raster data (i.e., pixels). When raster data (pixels) is used, the slope of the regression line of the data points in scale range 3 will approach zero, not as large as the 1.804 shown here. (2) Only scale range 2 is the true scaling range on which D is the estimated fractal dimension. Copyright © 2012, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. International Journal of Artificial Life Research, 3(3), 41-63, July-September 2012 57 Table 4. Box dimension estimate of Beijing urban area using two methods Range Our method Traditional Method Parameter /Statistic Study Area 1 1984 Study Area 2 1999 1984 Study Area 3 1999 1984 1999 D 1.779 1.880 1.672 1.772 1.545 1.670 R2 1.0000* 1.0000** 1.0000*** 0.9999 0.9998 0.9998 δ 0.0023 0.0027 0.0016 0.0032 0.0037 0.0046 df 32 32 33 27 25 28 Lower 95% 1.7743 1.8745 1.6687 1.7654 1.5374 1.6606 Upper 95% 1.7837 1.8855 1.6753 1.7786 1.5526 1.6794 Range 0.0094 0.011 0.0066 0.0132 0.0152 0.0188 D 1.831 1.898 1.738 1.820 1.652 1.750 R2 0.9998 0.9999 0.9997 0.9998 0.9992 0.9996 δ 0.0085 0.0068 0.0120 0.0087 0.0175 0.0134 df 6 6 6 6 6 6 Lower 95% 1.8105 1.8813 1.7090 1.7985 1.6088 1.7170 Upper 95% 1.8522 1.9148 1.7678 1.8411 1.6944 1.7828 Range 0.0417 0.0335 0.0588 0.0426 0.0856 0.0658 Note: The values round to five digits are, *: 0.99996; **: 0.99995; ***: 0.99997. Range is the difference of the dimension in the upper 95% and lower 95% bounds of the estimates. Beijing is a very centralized city with economic activities densely concentrated in the core area. The change pattern of box dimensions for different study areas is expected. This change trend is also confirmed by research on other cities (Benguigui et al., 2000). 5.2. Difficulty in Determining the Scaling Range Theoretically, there should be three scale ranges on the scatter plot of the urban form data (Figure 10). Scale range 1: when the box size is very large, i.e., all the boxes are very likely to be occupied, and thus we get a regression line with the slope of 2. Scale range 3: When the box size is very small, we are just dividing inside pixels (which are vectorized as patches), and the slope of the regression line is also 2. Scale range 2: Between scale range 1 and scale range 3 is the true scaling range that should be used to estimate the fractal dimension. It should be noticed that, when raster data or point data are used, the scatter points in scale range 3 will approach to a line with the slope of 0. It is difficult to determine the scaling range. One may want to determine the scaling range by using the cutoff slope value of 2. However, we cannot do so in practice due to three reasons. First, there usually exists a transition region between two scale ranges. In the transition region between scale range 1 and scale range 2, the box dimension changes from 2 to the true dimension, while in the transition region between scale range 2 and scale range 3, the box dimension changes from the true dimension to 2. Second, the points on the scatter plot deviate from their theoretical values due to the existence Copyright © 2012, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 58 International Journal of Artificial Life Research, 3(3), 41-63, July-September 2012 Table 5. Population growth in different zones of Beijing during 1982-2000 (1982-1990) Population Change (Persons) Total Change Rate (%) (1990-2000) Change Rate Per Year (%) Population Change (Persons) Total Change Rate (%) Change Rate Per Year (%) Core city and inner suburbs 1067508 37.08 3.91 2177663 50.65 3.83 Outer suburbs 521236 13.12 1.55 572124 12.73 1.21 Metropolitan area 1446204 18.02 2.09 2710174 28.62 2.55 Source: Compiled from Feng and Zhou (2003). The exact extents of the three regions (Core city and inner suburbs, Outer suburbs, Metropolitan area) are not the same as our three study areas but they show similar increasing trends during 1982-2000 as the dimensions during 1984-1999. of over coverage of the object by boxes. The cutoff slope value is not exactly equal to 2. Third, due to the limit of computation power, the box sizes are not small enough. Most of the points on scale range 3 lie in the transition region. The slope of the regression line on scale range 3 is significantly influenced by the true dimension. It might have a value very close to the true dimension, which makes it difficult to separate scale range 2 and 3. 5.3. Measurement Design and Fractal Dimension The properties of scaling range and dimension profiles of urban form can be explored in our new measurement design: 1) setting up a proper box cover of the city; 2) using grids of sufficient box sizes; 3) determining the correct scaling range. The choosing of study area, image type (raster or vector), and box shape all have influences on building a proper box cover. The difference of dimensions as to the study areas poses a problem for comparison across cities or the same city in different years. Ideally, we should cover the target cities with study areas of the same size and shape. For example, Shen (2002) correctly used images of the same size (width by height in 1000 × 1000 pixels) in a study of 20 US cities. The strength of comparison study turns weak if images of different sizes are used. In a study of Hangzhou city, China, Feng and Chen (2010) compared the box dimensions calculated using different study areas. The extents of the study areas in the two compared years (1980 and 1996) are different. Their conclusions would have been stronger if they kept the study areas consistent in both years. When the same study area is used to calculate the fractal dimension of the same city from different years, the rural areas inside the study area should be removed in order to get accurate results. The problem of observing too few scales in most research raises issues of incorrect or inaccurate estimates for fractal dimension (Halley et al., 2004; Hamburger et al., 1996). Using pseudo-geometric sequence with small factors, we can get much more data points of box size than from the dyadic sequence. The generalized sliding window method provides new views of looking at the data, and can help the researchers to find proper scaling range. 5.4. How Confident the Analysis is? Compared to the traditional method, our method can result a more accurate estimate of fractal dimension. However, limitation still exists in determining the scaling range. In our method, the scaling range is determined by observing the dimension profiles (Figure 7 or Figure 9) and scatter plot (Figure 8 or Figure 10). Observing the existence of plateau in the dimension profile can be subjective, especially when shapes of the dimension profile for different objects are different. The subjectivity is reduced by com- Copyright © 2012, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. International Journal of Artificial Life Research, 3(3), 41-63, July-September 2012 59 paring it with the scatter plots. The existence of a transition region between scale ranges also makes a fuzzy boundary between scale ranges. The results may be improved with more data points by reducing the factor of the geometric sequence. There is a tradeoff between the factor and the noise (over coverage rate) introduced. For factors smaller than 1.05, the over coverage rate increases quickly, which influences the dimension profile and reduces the accuracy of dimension estimation. As to Beijing City in the two years we examined, b = 1.05 provide a good alternative to get reliable dimension estimation. When applied to other cities, different factors may be selected. Another problem is concerned with the length of the scaling range. Some researchers argue that for genuine fractal pattern, the scaling range should have a length larger than two orders of magnitude (Halley et al., 2004). This criterion is not generally conformed to in practice. The factor of magnitude generally refers to 10, thus the criterion requires a scaling range across 103. For natural fractals, the scaling range is generally small. As to Beijing City shown in Figure 10, the scaling range lies between 0.0034 ~ 0.0308, which is only one order of magnitude ( 0.0308 / 0.0034 ≈ 101 ). Two modifications can be made to the criterion. One is to extend the magnitude factor of 10 to any integer number. In Figure 10, the scaling range 2-8.20 ~ 2-5.02 is more than three orders of magnitude 2. Another alternative is to treat the length of the scaling range itself as a property of fractal pattern. A larger scaling range means a more mature fractal structure. The third problem is due to the regression method used in box-counting method. As is shown by numerous researchers (Batty & Longley, 1994; Feng & Chen, 2010; Halley et al., 2004; Shen, 2002), the goodness of fit (R2) for a straight line on the scatter plot is very high. Although the variability might be large, as shown in Figure 7 (b) and Figure 9(b), R2 is well above 0.9 or even greater than 0.95 on the selected scaling range. The standard error and confidence intervals are generally small. It is argued that irregular patterns of scatter points are not easy to be disguised on logarithmic axes (Halley et al., 2004). Here comes the critical question: does the unanimously good fit of the scatter plot guarantee a fractal pattern? In order to answer this difficult and fundamental question, we should look for the evidence of processes that can produce a natural fractal. Accurate estimation of fractal dimension is important in practice although we cannot completely eliminate the uncertainties in the computation process. Inaccurate calculation of fractal dimension may lead to misunderstanding of our study objects. As discussed in Section 1, due to the issues related to the box-counting method, different people often have different estimates of the dimension for the same fractal set (Huang et al., 1994; Hunt, 1990; Li et al., 2009; Sarkar & Chaudhuri, 1994). It makes no sense to compare the fractal dimension of different cities if the estimation is inaccurate or even wrong. The further interpretation based on comparison of fractal dimension would be misleading. 6. CONCLUSION In this paper, we explored three main issues of the conventional box-counting method in fractal urban analysis. Corresponding techniques with improved measurement designs were proposed to address these issues. First, we proposed a new algorithm to extract land use information using rectangular grids and boxes to measure the object. Rectangular boxes can cover the object more efficiently. As the urban boundary is generally fractal, different study areas of the same city can have different box dimensions. It is suggested that the fractal dimension of a city should be given along with the study area. Second, we developed an algorithm to generate a pseudo-geometric sequence for box size. The factor of the geometric sequence can be any value specified by the researchers. Our experiment on the urbanized area of Beijing shows that b = 1.05 is a reliable factor, which produces sufficient points on the scatter plot Copyright © 2012, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 60 International Journal of Artificial Life Research, 3(3), 41-63, July-September 2012 and introduces less noise (over coverage rate, OCR). When a different object is used, the readers may need to try different factors. Third, we introduced a generalized sliding window method to determine the scaling range. Sliding the window with fixed window size on different starting points show that the minimum window size should be no less than 20 in order for a stable estimate of box dimension. Sliding the window from the points for the smallest box size with changing window size can be used to determine the lower bound of the scaling range, while sliding the window from the points for the largest box size can help to find the upper bound of the scaling range. Integrating the generalized sliding window method with the scatter plot, we can get a proper scaling range. Our method is first experimented on the Vicsek prefractal with known dimension. It is then applied to Beijing City. It is found that the dimensions of the city in three study areas increased during 1984-1999, which is consistent with the population growth trend in 1982-2000. The dependence of the dimensions on the study areas suggests that the comparison among cities should be bounded in images of the same size and shape. Our measurement design provides an alternative method to explore the properties of dimension profile and scaling range. There is still room for better methods to determine an accurate scaling range. ACKNOWLEDGMENT The authors wish to thank Dr. Yanguang Chen from Peking University, Dr. Chandana Gangodagamage from Los Alamos National Lab, and Professor Yixing Zhou from Peking University for valuable discussions; Mrs Carolyn Pickard for editorial help. The research was supported by the International Affairs Grant to Shiguo Jiang from the Office of International Affairs at The Ohio State University. REFERENCES Andraud, C., Beghdadi, A., Haslund, E., Hilfer, R., Lafait, J., & Virgin, B. (1997). Local entropy characterization of correlated random microstructures. Physica A. Statistical Mechanics and Its Applications, 235, 307–318. doi:10.1016/S03784371(96)00354-8. Barnsley, M. (1988). Fractals everywhere. Boston, MA: Academic Press. Batty, M. (1985). Fractals-geometry between dimensions. New Scientist, 106, 31–35. Batty, M., & Longley, P. A. (1987). Urban shapes as fractals. Area, 19, 215–221. Batty, M., & Longley, P. A. (1988). The morphology of urban land use. Environment and Planning. B, Planning & Design, 15, 461–488. doi:10.1068/ b150461. Batty, M., & Longley, P. A. (1994). Fractal cities: A geometry of form and function. London, UK: Academic Press. Batty, M., & Xie, Y. (1996). Preliminary evidence for a theory of the fractal city. Environment & Planning A, 28, 1745–1762. doi:10.1068/a281745 PMID:12292847. Benguigui, L., Czamanski, D., Marinov, M., & Portugali, Y. (2000). When and where is a city fractal? Environment and Planning. B, Planning & Design, 27, 507–519. doi:10.1068/b2617. Berry, B. J. L., Goheen, P. J., & Goldstein, H. (1968). Metropolitan area definition: A re-evaluation of Concept and Statistical Practice. Washington, DC: US Bureau of the Census. Bisoi, A. K., & Mishra, J. (2001). On calculation of fractal dimension of images. Pattern Recognition Letters, 22, 631–637. doi:10.1016/S01678655(00)00132-X. Brewer, J., & Di Girolamo, L. (2006). Limitations of fractal dimension estimation algorithms with implications for cloud studies. Atmospheric Research, 82, 433–454. doi:10.1016/j.atmosres.2005.12.012. Buczkowski, S., Kyriacos, S., Nekka, F., & Cartilier, L. (1998). The modified box-counting method: Analysis of some characteristic parameters. Pattern Recognition, 31, 411–418. doi:10.1016/S00313203(97)00054-X. Copyright © 2012, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. International Journal of Artificial Life Research, 3(3), 41-63, July-September 2012 61 Chen, S. S., Keller, J. M., & Crownover, R. M. (1993). On the calculation of fractal features from images. IEEE Transactions on Pattern Analysis and Machine Intelligence, 15, 1087–1090. doi:10.1109/34.254066. Dathe, A., Tarquis, A. M., & Perrier, E. (2006). Multifractal analysis of the pore- and solid-phases in binary two-dimensional images of natural porous structures. Geoderma, 134, 318–326. doi:10.1016/j. geoderma.2006.03.024. Dubuc, B., Quiniou, J. F., Roquescarmes, C., Tricot, C., & Zucker, S. W. (1989). Evaluating the fractal dimension of profiles. Physical Review A., 39, 1500–1512. doi:10.1103/PhysRevA.39.1500 PMID:9901387. ESRI. (2010). ArcGIS Desktop: Release 10. Redlands, CA: Environmental Systems Research Institute. Falconer, K. J. (2003). Fractal geometry: Mathematical foundations and applications. Chichester, UK: Wiley. doi:10.1002/0470013850. Feder, J. (1989). Fractals. New York, NY: Plenum. Feng, J., & Chen, Y. (2010). Spatiotemporal evolution of urban form and land-use structure in Hangzhou, China: evidence from fractals. Environment and Planning. B, Planning & Design, 37, 838–856. doi:10.1068/b35078. Feng, J., & Zhou, Y. X. (2003). The latest development in demographic spatial distribution in Beijing in the 1990s. City Planning Review, 27, 55–63. Foroutan-pour, K., Dutilleul, P., & Smith, D. L. (1999). Advances in the implementation of the box-counting method of fractal dimension estimation. Applied Mathematics and Computation, 105, 195–210. doi:10.1016/S0096-3003(98)10096-6. Frankhauser, P. (1994). La fractalité des structures urbaines. Paris, France: Economica. Goodchild, M. F. (1980). Fractals and the accuracy of geographical measures. Journal of the International Association for Mathematical Geology, 12, 85–98. doi:10.1007/BF01035241. Grau, J., Mendez, V., Tarquis, A. M., Diaz, M. C., & Saa, A. (2006). Comparison of gliding box and boxcounting methods in soil image analysis. Geoderma, 134, 349–359. doi:10.1016/j.geoderma.2006.03.009. Halley, J. M., Hartley, S., Kallimanis, A. S., Kunin, W. E., Lennon, J. J., & Sgardelis, S. P. (2004). Uses and abuses of fractal methodology in ecology. Ecology Letters, 7, 254–271. doi:10.1111/j.14610248.2004.00568.x. Hamburger, D., Biham, O., & Avnir, D. (1996). Apparent fractality emerging from models of random distributions. Physical Review E: Statistical Physics, Plasmas, Fluids, and Related Interdisciplinary Topics, 53, 3342–3358. doi:10.1103/PhysRevE.53.3342 PMID:9964642. Hargis, C. D., Bissonette, J. A., & David, J. L. (1998). The behavior of landscape metrics commonly used in the study of habitat fragmentation. Landscape Ecology, 13, 167–186. doi:10.1023/A:1007965018633. Herold, M., Couclelis, H., & Clarke, K. C. (2005). The role of spatial metrics in the analysis and modeling of urban land use change. Computers, Environment and Urban Systems, 29, 369–399. doi:10.1016/j. compenvurbsys.2003.12.001. Huang, Q., Lorch, J. R., & Dubes, R. C. (1994). Can the fractal dimension of images be measured? Pattern Recognition, 27, 339–349. doi:10.1016/00313203(94)90112-0. Hunt, F. (1990). Error analysis and convergence of capacity dimension algorithms. SIAM Journal on Applied Mathematics, 50, 307–321. doi:10.1137/0150018. Imre, A. R., & Bogaert, J. (2004). The fractal dimension as a measure of the quality of habitats. Acta Biotheoretica, 52, 41–56. doi:10.1023/ B:ACBI.0000015911.56850.0f PMID:14963403. Lam, N. S. N., & Quattrochi, D. A. (1992). On the issues of scale, resolution, and fractal analysis in the mapping sciences. The Professional Geographer, 44, 88–98. doi:10.1111/j.0033-0124.1992.00088.x. Li, J., Du, Q., & Sun, C. X. (2009). An improved box-counting method for image fractal dimension estimation. Pattern Recognition, 42, 2460–2469. doi:10.1016/j.patcog.2009.03.001. Liebovitch, L. S., & Toth, T. (1989). A fast algorithm to determine fractal dimensions by box counting. Physics Letters. [Part A], 141, 386–390. doi:10.1016/0375-9601(89)90854-2. Longley, P. A., & Batty, M. (1989a). Fractal measurement and line generalization. Computers & Geosciences, 15, 167–183. doi:10.1016/00983004(89)90032-0. Longley, P.A., & Batty, M. (1989b). On the fractal measurement of geographical boundaries. Geographical Analysis, 21, 47–67. doi:10.1111/j.1538-4632.1989. tb00876.x. Copyright © 2012, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. 62 International Journal of Artificial Life Research, 3(3), 41-63, July-September 2012 Lovejoy, S., Schertzer, D., & Tsonis, A. A. (1987). Functional box-counting and multiple elliptical dimensions in rain. Science, 235, 1036–1038. doi:10.1126/science.235.4792.1036 PMID:17782251. Saucier, A., & Muller, J. (1998). Multifractal approach to textural analysis. In M. M. Novak (Ed.), Fractals and beyond, complexities in the sciences (pp. 161–171). London, UK: World Scientific Publishing Co.. Lu, Y. M., & Tang, J. M. (2004). Fractal dimension of a transportation network and its relationship with urban growth: A study of the Dallas-Fort Worth area. Environment and Planning. B, Planning & Design, 31, 895–911. doi:10.1068/b3163. Shen, G. Q. (2002). Fractal dimension and fractal growth of urbanized areas. International Journal of Geographical Information Science, 16, 419–437. doi:10.1080/13658810210137013. Mandelbrot, B. (1967). How long is the coast of Britain? Statistical self-similarity and fractional dimension. Science, 156, 636–638. doi:10.1126/ science.156.3775.636 PMID:17837158. Tarquis, A. M., McInnes, K. J., Key, J. R., Saa, A., Garcia, M. R., & Diaz, M. C. (2006). Multiscaling analysis in a structured clay soil using 2D images. Journal of Hydrology (Amsterdam), 322, 236–246. doi:10.1016/j.jhydrol.2005.03.005. Mandelbrot, B. (1983). The fractal geometry of nature (Revised and enlarged edition ed.). New York, NY: W.H. Freeman and Co. Taylor, J. (1997). An introduction to error analysis: The study of uncertainties in physical measurements. Mill Valley, CA: University Science Books. Nayfeh, A. H., & Balachandran, B. (1995). Applied nonlinear dynamics: Analytical, computational, and experimental methods. New York, NY: Wiley. doi:10.1002/9783527617548. Thomas, I., Frankhauser, P., & Biernacki, C. (2008). The morphology of built-up landscapes in Wallonia (Belgium): A classification using fractal indices. Landscape and Urban Planning, 84, 99–115. doi:10.1016/j.landurbplan.2007.07.002. Peitgen, H.-O., & Richter, D. H. (1986). The beauty of fractals: Images of complex dynamical systems. New York, NY: Springer-Verlag. Pincheira-Ulbrich, J., Rau, J. R., & Pena-Cortes, F. (2009). Patch size and shape and their relationship with tree and shrub species richness. Phyton-International Journal of Experimental Botany, 78, 121–128. Pruess, S. A. (1995). Some remarks on the numerical estimation of fractal dimension. In C. C. Barton, & P. R. L. Pointe (Eds.), Fractals in the earth sciences (pp. 65–75). New York, NY: Plenum Press. doi:10.1007/978-1-4899-1397-5_3. Richardson, L. F. (1961). The problem of contiguity: an appendix of Statistics of deadly quarrels. General Systems Yearbook, 139-187. Roy, A., Perfect, E., Dunne, W. M., & McKay, L. D. (2007). Fractal characterization of fracture networks: An improved box-counting technique. Journal of Geophysical Research. Solid Earth, 112, 9. Saa, A., Gasco, G., Grau, J. B., Anton, J. M., & Tarquis, A. M. (2007). Comparison of gliding box and box-counting methods in river network analysis. Nonlinear Processes in Geophysics, 14, 603–613. doi:10.5194/npg-14-603-2007. Sarkar, N., & Chaudhuri, B. B. (1994). An efficient differential box-counting approach to compute fractal dimension of image. IEEE Transactions on Systems, Man, and Cybernetics, 24, 115–120. doi:10.1109/21.259692. Thomas, I., Frankhauser, P., & De Keersmaecker, M.L. (2007). Fractal dimension versus density of builtup surfaces in the periphery of Brussels. Papers in Regional Science, 86, 287–308. doi:10.1111/j.14355957.2007.00122.x. Verbovsek, T. (2009). BCFD - a visual basic program for calculation of the fractal dimension of digitized geological image data using a box-counting technique. Geological Quarterly, 53, 241–248. Vicsek, T. (1983). Fractal models for diffusion controlled aggregation. Journal of Physics. A, Mathematical and General, 16, L647–L652. doi:10.1088/0305-4470/16/17/003. Walsh, J. J., & Watterson, J. (1993). Fractal analysis of fracture pattern using the standard box-counting technique: Valid and invalid methodologies. Journal of Structural Geology, 15, 1509–1512. doi:10.1016/0191-8141(93)90010-8. Weisstein, E. W. (2003). Concise encyclopedia of mathematics (2nd ed.). Boca Raton, FL: Chapman and Hall/CRC. White, R., & Engelen, G. (1993). Cellular automata and fractal urban form: A cellular modelling approach to the evolution of urban land-use patterns. Environment & Planning A, 25, 1175–1199. doi:10.1068/ a251175. Copyright © 2012, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited. International Journal of Artificial Life Research, 3(3), 41-63, July-September 2012 63 Shiguo Jiang is a PhD student (2007-) of Geography at The Ohio State University. He received B.S. degree in Urban and Regional Planning (2001), B. A. degree in Economics (2001), and M.S. degree in Urban Studies (2004) from Peking University, China. He was a research associate and urban planner at Beijing Tsinghua Urban Planning and Design Institute from 2004 to 2007. His research interests include remote sensing of urban and natural environment, spatial statistics, land use and land cover change, GIS and spatial modeling. Recently he has been involved with a project on classification confidence of remote sensing imagery. Desheng Liu is an Associate Professor of Geography and Statistics at The Ohio State University. He obtained his M.A. (2004) in Statistics and his M.S. (2003) and PhD (2006) in Environmental Science from the University of California at Berkeley. His research focuses on understanding forest ecosystem dynamics, land use and land cover change, and human-environment systems through the integrated use of geospatial information technologies (GIS, remote sensing, and GPS) and spatial statistical methods. Copyright © 2012, IGI Global. Copying or distributing in print or electronic forms without written permission of IGI Global is prohibited.
© Copyright 2026 Paperzz