480 IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, VOL. 7, NO. 2, FEBRUARY 2014 Extracting Structural Vegetation Components From Small-Footprint Waveform Lidar for Biomass Estimation in Savanna Ecosystems Joseph McGlinchy, Member, IEEE, Jan A. N. van Aardt, Barend Erasmus, Gregory P. Asner, Member, IEEE, Renaud Mathieu, Konrad Wessels, David Knapp, Ty Kennedy-Bowdoin, Harvey Rhody, Member, IEEE, John P. Kerekes, Senior Member, IEEE, Emmett J. Ientilucci, Member, IEEE, Jiaying Wu, Member, IEEE, Diane Sarrazin, and Kerry Cawse-Nicholson Abstract—Measurement of vegetation biomass accumulation is critical for ecosystem assessment and monitoring, but doing so typically involves extensive field data collection that yields relatively crude structural outputs, e.g., plot- or site-level metrics. This study assessed the utility of airborne light detection and ranging (lidar) waveform features to explain structural and biomass variation in a savanna ecosystem across a land-use gradient. The ability of aboveground waveform lidar features to model field-based woody and herbaceous biomass measurements was evaluated statistically by regression models using forward variable selection. Waveform features explained 76% of the variation in woody biomass in a regulated communal land use area (RMSE = 29.0 kg). The waveform features were also correlated to herbaceous measurements in the same land-use area, with increased correlations at higher biomass levels. These results indicate that small-footprint waveform lidar data potentially can be used as a single modality to describe heterogeneous woody cover in a savanna environment; however, further research is warranted during the full growing season to fully evaluate its performance. Index Terms—Biomass estimation, feature extraction, lidar, waveform. Manuscript received February 07, 2013; revised May 21, 2013; accepted June 25, 2013. Date of publication August 15, 2013; date of current version February 03, 2014. This work was funded through the Masters of Science program in Imaging Science at the Chester F. Carlson Center for Imaging Science at Rochester Institute of Technology, NY, USA. J. McGlinchy is with the Environmental Systems Research Institute, Redlands, CA 92373 USA (corresponding author, e-mail: [email protected]). J. A. N. van Aardt, H. Rhody, J. P. Kerekes, E. J. Ientilucci, and K. Cawse-Nicholson are with the Chester F. Carlson Center for Imaging Science, Rochester Institute of Technology, Rochester, NY 14623 USA (e-mail: [email protected]; [email protected]; [email protected]; [email protected]; [email protected]). G. P. Asner, D. Knapp, and T. Kennedy-Bowdoin are with the Carnegie Institution for Science, Stanford University, Stanford, CA 80309 USA (e-mail: [email protected]; [email protected]; [email protected]). B. Erasmus is with the School of Animal, Plant and Environmental Science, University of the Witwatersrand, Johannesburg, South Africa (e-mail: Barend. [email protected]). R. Mathieu is with the Council for Scientific and Industrial Research, Pretoria, South Africa (e-mail: [email protected]). K. Wessels is with the Council for Scientific and Industrial Research, Pretoria, South Africa, and also with the Centre for Geoinformation Science, Department Geography, Geoinformatics and Meteorology, University of Pretoria, Pretoria, South Africa (e-mail: [email protected]). J. Wu is with Apple, Inc., Cupertino, CA 95014 USA (e-mail: wjy@[email protected]). D. Sarazzin is with the Department of National Defence Canada, Ottawa, Ontario, Canada (e-mail: [email protected]). Digital Object Identifier 10.1109/JSTARS.2013.2274761 T I. INTRODUCTION HE NEED forsynoptic descriptionsofecosystem structure, specifically biomass or carbon estimation, has received much attention (e.g., [1]–[3]). Such efforts have implications for the management of carbon stocks [4], maintenance of structural biodiversity [2], and support for sustainable resource extraction [3], among other ecological applications. However, scalable assessment of savanna ecosystem structure, specifically the structure coupled to disparate vegetation components, e.g., trees vs. bush vs. grass structures, remains elusive. Typical inventory efforts rely on local-scale labor-intensive field surveys, which are constrained by their inability to draw regional inferences; an example includes transects covering only 1 ha [5], [6]. Researchers have circumvented this challenge for savanna ecosystems by using a novel assessment alternative, discrete return light detection and ranging (lidar) remote sensing [1], [2]. The technology for remote sensing of vegetation using lidar has seen considerable advancement with the advent of full waveform digitizing sensors. The intensity of the returning laser pulse is recorded as a function of the time between laser emittance, interaction with a target surface, and detection by the sensor. Waveform lidar sensors have the advantage of recording the backscattered energy at a very high sampling rate to derive information on waveform shape, typically on the order of nanoseconds. The combination of high temporal resolution detection and full backscatter digitization enables the extraction of structural information that is embedded within the waveform [7]. Various studies have shown that signal metrics, calculated from large footprint lidar waveforms (footprint diameters on the order of tens of meters), can be used to assess vegetation structure in forested environments. Lefsky et al. [8] have shown that waveform lidar measurements were able to explain 70% of the variance in stand basal area and 80% of the variance in aboveground biomass in a deciduous forest environment. In another study, Lefsky [1] also found high correlations between waveform-derived canopy heights and aboveground biomass in boreal, coniferous, and deciduous forested environments , 0.65, 0.87, respectively). Boudreau [9] developed ( a generic airborne lidar-based biomass equation from small-footprint lidar-derived mean and quadratic mean canopy heights, while Neuenschwander [10] demonstrated that features such as the canopy, ground, and the height of median energy from small-footprint lidar waveforms can be used to classify various land cover types with up to 85% accuracy. 1939-1404 © 2013 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information. MCGLINCHY et al.: EXTRACTING STRUCTURAL VEGETATION COMPONENTS FROM SMALL-FOOTPRINT WAVEFORM LIDAR 481 Fig. 1. (a) Map of South Africa and study area for this work. Our study focused on the protected Kruger National Park and the communal rangelands of Bushbuckridge (see localization of Land Uses 2, 7, and 8 in central map), along with site- (50 50 m, 36 plots) and plot-level field sampling strategy. Examples of herbaceous biomass plot samples for Land Use 2, 7, and 8 are shown in (b), (c), and (d), respectively. Drake [11] showed that measures such as tree height, crown volume, and biomass could be accurately predicted and modeled from the waveform height of median energy and upper quantile energy measurements, resulting in good correlation between waveform-derived features and estimated above-ground biomass ( up to 0.94) for large footprint systems. However, certain challenges remain in terms of land cover assessment through biomass estimation, specifically: (i) most previous work has dealt with large-footprint systems in forested environments, with results in the measured field data typically being an order of magnitude smaller in actual ground area than the laser footprint and (ii) a detailed breakdown of woody, herbaceous, and bare ground structural components along the laser trajectory, similar to the “end member” concept in an imaging spectroscopy context, is still lacking [12], [13]. This latter aspect has bearing on our ability to map land cover types in the structural (3D) domain, as opposed to the traditional spectral approaches. For example, woody and herbaceous vegetation often is highly intermixed in savanna ecosystems, making them difficult to separate in terms of carbon and system functional dynamics [14]. The objective of this study was thus to establish a method by which to extract structural components, e.g., woody and herbaceous, from small-footprint lidar waveforms as they relate to field biomass measurements. II. METHODOLOGY A. Study Area and System Specifications The area under study encompasses land within and surrounding the Kruger National Park (KNP) in South Africa, bounded by ( ” S; ” E) and ( ” S; ” E) (see Fig. 1(a)) and spanning an east-west land-use gradient. This gradient is defined by sampling in the Kruger National Park (state-owned conservation), Sabie Sands game reserve (private conservation), and Bushbuckridge (communal rangelands; high rural population density) areas. The topography is gently undulating with a slowly decreasing terrain height toward the East, with an average elevation of approximately 450 m. Vegetation communities are influenced largely by geomorphological and pedological processes at the landscape level. Dominant geology includes granite and gneiss with local intrusions of gabbro. Vegetation has a discontinuous overstory of woody plants, mostly in the 2–5 m height category, and an herbaceous layer dominated by C4 grasses [15]. The vegetation communities are classified as granite lowveld or gabbro grassy bushveld [16]. This study area enabled an assessment of the various waveform lidar features and their ability to describe structural parameters across a range of vegetation structures and land uses. The Carnegie Airborne Observatory (CAO) Alpha system was used for acquiring the lidar data. CAO-Alpha consisted of an integrated imaging spectrometer (hyperspectral sensor) and a small-footprint scanning-waveform lidar system, which also collects discrete return data. These systems are geometrically well integrated by using a common global positioning system/ inertial measurement unit (GPS/IMU) data stream, boresight alignment among sensor field-of-views, and co-mounting of instrumentation on a single, integrated platform. The waveform lidar system operated at 1064 nm wavelength with full waveform digitization at one nanosecond temporal/vertical resolu- 482 IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, VOL. 7, NO. 2, FEBRUARY 2014 Fig. 2. Clockwise from top left: Tree height distributions for sites in land-use 2, 7, and 8 These graphs illustrate the range and variability in tree height across the study area. tion; this effectively implies signal digitization for every 0.15 m for single, non-return direction of travel. The laser beam divergence was 0.56 mrad, corresponding to a 0.56 m ground footprint at an operational flying height of 1000 m above ground [17]. Field data were collected in association with the airborne data collection campaign during April-May 2008; this is the late autumn/early winter season, with many trees showing signs of leaf loss. This was highly variable, though, with certain species retaining more of their leaves at this time of year. The field data were collected from seven sites, distributed across two land use (LU) types: a protected area within KNP (LU2, four sites) and communal rangelands (LU7, three sites and LU8, two sites) located within the Bushbuckridge area (Fig. 1(a)). Each site consists of approximately 36 plot-level measurements of herbaceous biomass, tree height and diameter, species, and a qualitative assessment of cover (vegetation health, crusting, bare soil, herbaceous, and woody cover) [18]. The sites are 50 50 m in size and were laid out with 10 m spacing between plots, resulting in a grid-like pattern (see Fig. 1(a)). A Trimble (Trimble® Recon® Handheld with aerial backpack) or Leica (GS20 Professional Data Mapper with handheld aerial) differential GPS was used to collect accurate geographic coordinates for each plot, which were differentially corrected to sub-meter accuracy using the Nelspruit trigonometric base station one second data (http://www.trignet.co.za/). The woody biomass was sampled with variable-radius plots defined by the average between-tree distance at each site, i.e., larger diameter was used when average distance between trees increased; plot radii ranged between 4–5 m. Histograms of the tree heights measured in each land use are shown in Fig. 2. The herbaceous biomass was sampled within a 0.25 m quadrant at the plot center. Examples of the herbaceous biomass sampled in each land use are shown in Figs. 1(b)–1(d). Summary statistics of the woody and herbaceous biomass sampled in this study are provided in Table I. B. Feature Extraction For a waveform to accurately represent the scattering crosssection, the incoming waveform requires physics-based preprocessing: (i) deconvolution with the outgoing pulse and the lidar system response; (ii) tying ground interactions to the digital elevation model (DEM), i.e., for analysis of waveform interaction in terms of height-above-ground; and (iii) correction of interactions for off-nadir effects. The reader is referred to a detailed description of this processing chain by Wu et al. [19]. Our goal was to extract features related to the structural components in the lidar waveforms based on our knowledge of woody and herbaceous components present in the savanna environment (Fig. 3). As the light pulse propagates through the canopy to the ground, earlier interactions, i.e., those further from the ground, are more correlated to the woody vegetation (e.g., trees) and as the pulse approaches ground level, the MCGLINCHY et al.: EXTRACTING STRUCTURAL VEGETATION COMPONENTS FROM SMALL-FOOTPRINT WAVEFORM LIDAR 483 Fig. 3. Potential structural components in a lidar waveform. (left) an example of pulse interaction with potential canopy, low-lying structure, and ground. (right) lidar pulse interaction with various structural components embedded within the waveform. Picture is from the study area. TABLE I SUMMARY STATISTICS OF HERBACEOUS AND WOODY BIOMASS SAMPLED IN THIS STUDY process of removing the bare ground signal component was accomplished by defining a bare ground reference waveform and subtracting it from the waveform of interest after aligning the bare-ground points. The bare ground reference waveform was computed from samples of dirt roads identified on the hyperspectral images from the concurrent data collection, via the assumption that such roads represent hard, diffuse scatterers. The subtraction was done in such a manner so as to minimize the amount of information removed from the waveform. This was done to account for intensity variations between ground peaks due to varying levels of energy attenuation, e.g., in the case of a waveform with multiple aboveground interactions (reduced ground return intensity) compared to a single bare ground return. The amplitude of the ground reference waveform therefore was matched to each waveform before performing the subtraction by minimizing , defined as (1) interactions are more indicative of shrubby or herbaceous vegetation [19]. 1) Ground Removal: The CAO system’s relatively broad 16 ns outgoing pulse width allowed for a consistent ground level laser interaction on a per-pulse basis. Extracting this bare ground interaction from every waveform was hypothesized to leave only aboveground interactions remaining in each lidar waveform, including finer vegetation structure associated with the ground, such as grass/herbaceous components. This where is the waveform under consideration, is the bare ground (road) reference waveform, is the ground reference with the amplitude matched to , and is the window over which the subtraction is being performed (full-width-half-maximum (FWHM) of ). This process is illustrated in Fig. 4 for cases of low to high signal biomass present in the waveform. Although this removal of the per-pulse ground interaction could potentially remove information related to grassy content, we assume that the approach described above would limit such omissions to removal of data related to the bare ground components of the lidar return. This assumption is based on the use of a hard target, diffuse scattering signal, 484 IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, VOL. 7, NO. 2, FEBRUARY 2014 Fig. 4. Clockwise from top left: an illustration of the ground removal algorithm as applied to waveforms of low to medium to high biomass plots. The solid, bold signal represents the waveform after ground removal. The dashed line shows the portion of the signal that was removed by the algorithm. This resultant waveform is then analyzed for above-ground interactions. based on a ground return sample from gravel roads in the study area. Three structural metrics were extracted based on the amount of ground information removed from the waveform: (i) the ground ratio (roadRatio, e.g., the value of discussed above), (ii) the fraction of the waveform removed by the subtraction (ratioOut), and (iii) the amount of the ground pulse remaining after the ground removal (leftover), defined as the area of the ground pulse after ground removal divided by the area of the original ground pulse. These metrics enabled a detailed characterization of the ground-level laser interactions. 2) Aboveground Features: Waveform features extracted in this study were similar to those identified by Neuenschwander et al. [10] and the United States Geological Survey (USGS) [20] and included the height of median energy (HOME), total integrated energy, canopy energy, ground energy, and canopy ratio (CRR). HOME is defined as the height at which the integrated energy of the waveform is equal to half of the energy of the waveform above the ground return. The total energy of the lidar signal, , is defined in Equation 2 (discretized version of [21]) as (2) where is the received backscattered signal, is the total number of time bins, and the summation sign is used because of the discretized nature of the signal. Canopy ratio (CRR) is defined as the integral of the portion of the waveform return reflected from above ground divided by the integral of the entire waveform return [20]. An adaptation of the CRR metric (aCRR) was also introduced to focus on the upper canopy elements. This was achieved by considering the integral of the waveform up to and including the first major laser–target interaction, rather than the entire aboveground return, similar to CRRmin, as defined by Muss et al. [22]. The first major interaction was defined as the first valley following the first peak of a multiple-interaction waveform. A novel metric, related to the integrated energy of the signal, was derived by measuring the amount of time it takes for , the lidar signal for a single interaction, to increase from 10% to 90% of its total integrated energy. This measure was defined as duration and is indicative of the type of structural component present in the waveform, since a waveform containing multiple interactions will theoretically take longer to bridge the time from 10% to 90% of the total interaction. This metric is sensitive to the height above ground at which the laser-surface interaction occurs, and as such is hypothesized to be sensitive to larger biomass, as is found in taller trees. MCGLINCHY et al.: EXTRACTING STRUCTURAL VEGETATION COMPONENTS FROM SMALL-FOOTPRINT WAVEFORM LIDAR 485 C. Biomass Modeling The metrics extracted from the waveforms were used as inputs to a forward selection regression model in Statistical Analysis Software (SAS, v9.2) to identify which features best describe the field-measured biomass. This approach successively introduced variables into the model and retained them only if the statistical performance of the model was improved [25], [26]. For a woody biomass estimate, plots that contained no woody biomass or only small trees were not included in the modeling analysis. The total number of structural variables extracted for each plot was 18. The significance level for variable entry into the model was set to . As more variables enter the model, the goodness of fit was penalized via the adjusted fit metric [27]. The general equation of the regression model is defined by (4) Fig. 5. First and second derivative summary statistics calculated per waveform across a site in Land Use 8. Lighter areas represent larger values. One other metric used to describe canopy presence was calculated as the volume of canopy present in each plot. This was calculated as (3) where is the area defined by the spatial resolution of each waveform (e.g., , is the footprint of the laser pulse) and is defined as the height above ground of the first laser–canopy interaction. 3) Derivative Features: Previous work has identified stressrelated spectral features for vegetation assessment using hyperspectral derivative analysis, especially for near-infrared spectral regions [23]. We therefore hypothesized that derivatives of laser–target interactions in a vegetation environment are related to how the scattering surface reflects light at the laser wavelength (1064 nm). Summary statistics of the lidar waveform’s first and second derivative were computed as additional metrics related to the aboveground structure. These included the mean, median, mode, standard deviation, and range of the first and second derivative signals. Since the signals were not normalized for intensity, the derivative signals were normalized by the peak value to represent a percentage of the maximum change in slope on a per-waveform basis. Derivatives are also shape descriptors, and as such should be independent of amplitude [24]. Fig. 5 shows image representations of these derivative statistics for a site in Land Use 8; it is immediately clear that these measures are visually correlated to above-ground structure. It should be noted that these metrics are by no means exhaustive, but arguably present a robust set, as defined by previous studies and inclusive of novel approaches, for describing the woody and herbaceous biomass in a savanna ecosystem. The range of values for each feature extracted from each plot-level waveform are summarized relative to the minimum and maximum lidar-estimated volume measurements in Table II. where is the model coefficient (intercept), is the variable that has entered the model, is the variable coefficient, is the number of variables that enter the model, and denotes one of the field measurements to be modeled. The dependent, biomass variables that were included in the modeling procedure included the plot-level woody kilogram measurement (kg) and the plotcenter herbaceous measurement (g). Windows around the plot center GPS points were constructed to extract the appropriate waveforms for each plot commensurate with the plot sampling strategy. This was done to account for GPS errors that might be present in the plot center herbaceous measurements, since these measurements were taken at the waveform lidar footprint level. This ensured that the location where herbaceous measurements were made, actually contained features from the appropriate waveforms. Land usespecific sizes therefore were chosen individually for each site, thus ensuring that each window matched the specified field data measurements. The lidar waveforms within each window were aggregated to construct a plot level waveform, from which the features discussed above were extracted. It should be noted that the process of creating plot-level waveforms involved taking an arithmetic average of adjacent waveforms. This (i) effectively applied a linear averaging to potential nonlinear interactions and (ii) potentially smoothed interactions that occur at different locations within the composite window, resulting in a spatial averaging of the waveform lidar data. III. RESULTS AND DISCUSSION A. Woody Biomass Results Due to the variable nature of each land use, different features were required to explain woody biomass. In this case, the waveform features were able to explain 76% of the variation in woody biomass estimates for LU8, but performed poorly in LU2 and LU7, as seen in Table III. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, VOL. 7, NO. 2, FEBRUARY 2014 486 VALUES OF TABLE II EXTRACTED FEATURES CORRESPONDING TO MINIMUM AND MAXIMUM LIDAR-ESTIMATED PLOT VOLUME MEASUREMENTS (BOLD). INDICATES THE GLOBAL MINIMUM AND MAXIMUM VALUE FOR THIS FEATURE WAS DIFFERENT The model-predicted woody biomass values were plotted against the field measured woody biomass for LU8, as shown in Fig. 6. Analysis of the residuals for these two models showed a relatively even distribution around the zero-line, although residual values increased with larger biomass values. The major drivers of woody biomass are the physical height of a tree and its stem diameter-at-breast height (DBH) [28]. The allometric equations for the tree species in this study area describe woody biomass as a function of DBH only [29], and since lidar data are inherently height- and cover-based measurements, this could have contributed to poor residual distributions [30], along with typical increased residual variance (heteroscedasticity) at higher biomass levels [4]. The LU8 specific model was able to explain 76% of the variance in woody biomass measurements made at the plot level. This model was a function of the 10–90% energy duration of MCGLINCHY et al.: EXTRACTING STRUCTURAL VEGETATION COMPONENTS FROM SMALL-FOOTPRINT WAVEFORM LIDAR 487 TABLE III . DESCRIPTION: LUX, WOODY BIOMASS MODELING RESULTS WHERE ‘X’ RELATES TO LAND USE 2, 7 OR 8, ALONG WITH FEATURES SELECTED FOR EACH MODEL. SUBSCRIPT NUMBERS DENOTE FIRST OR SECOND DERIVATIVE TABLE IV HERBACEOUS BIOMASS MODELING RESULTS FOR LU8 AT INCREASING LOWER LIMITS AND DECREASING SAMPLE SIZES. THE SUBSCRIPT NUMBER ON THE VARIABLE DENOTES SECOND DERIVATIVE Fig. 6. Predicted vs. observed woody biomass values in LU8 ( , ). using land-use-specific models. Solid line is 1:1 line. the waveform and the range of values in the waveform’s first derivative. The presence of more distinct upper canopy elements in LU8 was evident in the variable selection for the LU8 woody biomass model. These results reflect the physical environment of the different land uses. The rangelands in LU8 are well-kept and regulated by the local village, so the vegetation density higher above ground is more prominent than in the other land uses. The poor modeling results in LU2 and LU7 can be partially explained by the uniformity in lower-level woody cover. Tree cover above 5 meters within KNP was the lowest of the entire study area [3], [31], and the majority of the tree cover in LU7 fell within the height distribution of LU2 (see Fig. 2) . The LU2 sites also were in the protected KNP area, where the pattern and density of the woody biomass in this land use is driven more by wildlife utilization than human interaction and preservation. LU7 and LU8, on the other hand, are comprised of communal rangelands and fields. The rangelands in LU7 are mostly dominated by shrub cover, whereas tree cover is more dominant in the cultivated fields [3]. LU7 is heavily utilized and unregulated with fewer large trees, but consists of a more uniform and dense shrubbery vegetation layer, hence the woody components are physically more variable and dominant at lower heights. The lack of model predictive ability for this land use area was attributed to these factors. In summary, we found that land use specific models are most appropriate for modeling woody biomass variation in the savanna ecosystem represented in the study area, given the features that were extracted from the waveforms. Table III summarizes the relative performance for each model and the land use generalization to which they were best applied. B. Herbacious Biomass Results Land uses 2 and 7 were determined to be unsuitable for this study due to the senescent state of the vegetation in these regions (representative photos shown in Fig. 1(b)–(d). Land use 8 still contained a large amount of green biomass at the time of the study, and waveform features accounted for a significant amount of herbaceous biomass variation in this site. In particular, selected waveform features accounted especially well for variation in higher herbaceous biomass values in this study area. Table II shows the effect of increasing the lower limit of the herbaceous biomass used for modeling purposes. The reader is reminded that in removing the ground response from the waveform signals, small biomass components such as grass may be removed as well, possibly resulting in poor correlation for lower biomass levels. We ascribed this improvement in modeling results to the following: As the lower herbaceous biomass limit was increased, more physical structure was present and the CRR and aCRR variables entered the regression models. If the waveform signal contains only one interaction other than the ground interaction, these two metrics are equal to each other. However, aCRR is the more explanatory feature when the waveform contains multiple interactions above ground, regardless of the height at which they occur. The herbaceous biomass modeling was expected to be challenging for a number of reasons: (i) the outgoing pulse width was set to 16 ns, which along with the vertical resolution of the sensor (1 ns = 0.15 m), led to merging of ground and herbaceous features and limited detection of the multiple scattering within the herbaceous elements, (ii) the majority of the metrics extracted from the waveforms, aside from the ground removal metrics, were directly related to the canopy elements (e.g., CRR, aCRR, duration, and VOL), even though the same features were available to the woody and herbaceous modeling, and (iii) the low and narrow range (26-60 grams, many with equivalent biomass measurements) of available measured herbaceous biomass values had a significant impact on the modeling results, effectively disabling a full range model fit [4], (iv) the herbaceous backscatter of the waveform is embedded in the ground pulse, i.e., since the bare ground component was removed from the ground pulse, information related to the herbaceous content may also have been removed, 488 IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, VOL. 7, NO. 2, FEBRUARY 2014 and (v) all field and remotely sensed data were collected in the fall season. During this time much of the herbaceous cover in the study area was in a senescent state, which does not produce a strong return in the near-infrared wavelength at which the lidar sensor operates (1064 nm) [32]. The limiting factors that determined how well the herbaceous biomass can be estimated in this environment therefore included not only the amount of biomass measured within the plot, but also the phenological stage of the vegetation. The results nonetheless were encouraging and hinted at the potential of this approach to model herbaceous biomass using small-footprint waveform lidar signals. IV. CONCLUSION This study assessed the utility of small-footprint waveform lidar data to model woody and herbaceous vegetation structural variation across a land use gradient in a South African savanna landscape. This was accomplished by creating plot-level composite waveforms linked to field measurements, thereby extracting waveform signal features corresponding to areas of aboveground structure, and incorporating these features into regression modeling in order to explain the variance in field biomass measurements. This incorporated shape and non-range specific features in biomass models of vegetation, and was able to use signal shape in generating waveform features to assess vegetation structure. Model performance was evaluated in terms of the model fit parameters ( , adjusted for multiple variable entry, and RMSE) for explaining the variance in biomass measurements within the individual land uses. Model fit values, i.e., the amount of biomass variance explained, were as high as in the case of the individual regulated communal area of land use 8 (LU8), attributed to the larger range of woody biomass measurements and regulated fuel-wood harvesting in this area. In the conservation land use area (LU2) and unregulated communal area (LU7), woody biomass measurements were less accurately modeled. Similarly, the selected waveform features were only able to properly explain the herbaceous variation in LU8, but consideration had to be given to how sensitive the modeling was to the observed range of biomass values and the phenological state of the vegetation. Through this effort, we have drawn the following specific conclusions: • The features extracted from the waveform lidar data were sensitive to the varying vegetation structure within each land use, as dictated by the different approaches to land utilization on a per-land use basis. This was the case even though the majority of the woody measurements fell into a narrow biomass range. • The presence of derivative features in these models showed that these shape metrics are helpful in describing the woody variation, without being directly related to a physical aboveground entity, such as canopy height, cover, etc. • The herbaceous modeling exhibited increased performance for LU8 when the plot observations contained larger biomass values; this was attributed to increased backscatter from physical herbaceous structure. We concluded that small-footprint waveform lidar has significant potential for quantifying complex above ground structure, but further research is warranted in terms of both processing and feature extraction. For example, the features extracted in this study are not exhaustive. It would be interesting to examine the ability to discern fine scale herbaceous measurements from waveform lidar, whether this is accomplished by collecting data with a narrower pulse width, during the peak growing season, or both. However, the confirmation of our ability to extract woody and herbaceous estimates, even if ideally for areas with more significant biomass in the latter case, hints at the potential signal content in waveform lidar signals over savanna vegetation. ACKNOWLEDGMENT The authors would like to thank Dr. Izak Smit (Kruger National Park) and Dr. Jolene Fisher (University of the Witwatersrand) for inputs during waveform lidar feature extraction and biomass estimation discussions. The airborne campaign was supported by the Andrew Mellon Foundation. The Carnegie Airborne Observatory is made possible by the W. M. Keck Foundation, Gordon and Betty Moore Foundation, John D. and Catherine T. MacArthur Foundation, Grantham Foundation, and William Hearst III. REFERENCES [1] M. Lefsky, D. Harding, W. Cohen, G. Parker, and H. Shugart, “Lidar remote sensing of above ground biomass in three biomes,” Global Ecology and Biogeography, vol. 11, pp. 393–399, 1999. [2] G. Asner, S. Levick, T. Kennedy-Bowdoin, D. Knapp, R. Emerson, J. Jacobson, M. Colgan, and R. Martin, “Large-scale impacts of herbivores on the structural diversity of African savannas,” Proc. Nat. Acad. Sci. USA, vol. 106, pp. 4947–4952, 2009. [3] K. Wessels, R. Mathieu, B. Erasmus, G. Asner, I. Smit, J. v. Aardt, R. Main, J. Fisher, W. Marais, T. Kennedy-Bowdoin, D. Knapp, R. Emerson, and J. Jacobson, “Impact of communal land use and conservation on woody vegetation structure in the lowveld savannas of South Africa,” Forest Ecology and Manage., vol. 261, no. 1, pp. 19–29, Jan. 2011. [4] J. v. Aardt, R. Wynne, and R. Oderwald, “Forest volume and biomass estimation using small-footprint lidar-distributional parameters on a per-segment basis,” Forest Sci., vol. 52, no. 6, pp. 636–649, 2006. [5] K. Neke, N. Owen-Smith, and E. Witkowski, “Comparative resprouting response of savanna woody plan species following harvesting: the value of persistence,” Forest Ecology and Manage., vol. 232, pp. 114–123, 2006. [6] S. Higgins, C. Shackleton, and E. Robinson, “Changes in woody community structure and composition under contrasting landuse systems in a semi-arid savanna, South Africa,” J. Biogeobraphy, vol. 26, pp. 619–627, 1999. [7] C. Mallet and F. Bretar, “Full-waveform topographic lidar: State-ofthe-art,” ISPRS J. Photogramm. Remote Sens., vol. 64, no. 1, pp. 1–16, 2009. [8] M. Lefsky, D. Harding, W. Cohen, G. Parker, and H. Shugart, “Surface lidar remote sensing of basal area and biomass in deciduous forests of eastern Maryland, USA,” Remote Sens. Environ., vol. 67, pp. 83–98, 1999. [9] J. Boudreau, R. Nelson, H. Margolis, A. Beaudoin, L. Guindon, and D. Kimes, “Regional aboveground forest biomass using airborne and spaceborne lidar in Quebec,” Remote Sens. Environ., vol. 112, pp. 3876–3890, 2008. MCGLINCHY et al.: EXTRACTING STRUCTURAL VEGETATION COMPONENTS FROM SMALL-FOOTPRINT WAVEFORM LIDAR [10] A. Neunschwander, L. Magruder, and M. Tyler, “Landcover classification of small-footprint, full-waveform lidar data,” J. Appl. Remote Sens., vol. 3, Aug. 2009. [11] J. Drake, R. Dubayah, R. Knox, D. Clark, and J. Blair, “Sensitivity of large-footprint lidar to canopy structure and biomass in a neotropical rainforest,” Remote Sens. Environ., vol. 81, pp. 378–392, 2002. [12] G. Asner, C. Borghi, and R. Ojeda, “Desertification in central argentina: Changes in ecosystem carbon and nitrogen from imaging spectroscopy,” Ecological Applicat., vol. 13, pp. 629–648, 2003. [13] N. Keshava and F. Mustard, “Spectral unmixing,” IEEE Signal Processing Mag., vol. 19, no. 1, pp. 44–57, Jan. 2002. [14] J. Beringer, L. Hutley, J. Hacker, B. Neininger, and U. Paw, “Patterns and processes of carbon, water and energy cycles across northern australian landscapes: From point to region,” Agricultural and Forest Meteorology, vol. 151, pp. 1409–1416, 2011. [15] F. Venter, R. Scholes, and H. Eckhardt, “The abiotic template and its associated vegetation pattern,” in The Kruger Experience: Ecology and Management of Savanna Hetergeneity. London, U.K.: Island Press, 2003, pp. 83–129. [16] L. Muncina, M. Rutheford, and L. Powrie, “The logic of the map: approaches and procedures,” in The Vegetation of South Africa, Lesotho and Swaziland, Strelitzia, 2006, pp. 14–18. [17] G. Asner, D. Knapp, M. Jones, T. Kennedy-Bowdoin, R. Martin, J. Boardman, and C. Field, “Carnegie airborne observatory: in-flight fusion of hyperspectral imaging and waveform light detection and ranging (WLIDAR) for three-dimensional studies of ecosystems,” J. Appl. Remote Sens., vol. 1, Sep. 2007. [18] D. Tongway and N. Hindley, “Landscape function analysis: a system for monitoring rangeland function,” African J. Range Forage Sci.. [19] J. Wu, J. v. Aardt, J. McGlinchy, and G. Asner, “A robust signal preprocessing chain for small-footprint waveform lidar,” IEEE Trans. Geosci. Remote Sens., vol. 50, no. 8, pp. 3242–3255, 2012. [20] USGS, 2009, Accessed 12–03-2010 [Online]. Available: http://ngom. usgs.gov/dsp/mapping/lidar_vegetation_metrics.html [21] B. Lathi, Linear Systems and Signals, 2nd ed. New York, NY, USA: Oxford Univ. Press, 2005, pp. 69–69. [22] J. D. Muss, D. J. Mladenoff, and P. A. Townsend, “A pseudo-waveform technique to assess forest structure using discrete lidar data,” Remote Sens. Environ., vol. 115, no. 3, pp. 824–835, Mar. 2011. [23] K. Smith, M. Steven, and J. Colls, “Use of hyperspectral derivative ratios in the red-edge region to identify plant stress responses to gas leaks,” Remote Sens. Environ., vol. 92, pp. 207–217, 2004. [24] J. Stewart, “How derivatives affect the shape of a graph,” Calculus: Early Transcendentals, pp. 293–303, 2003, Brooks/Cole—Thomson Learning. [25] SAS Inc., Published Online 2010, Accessed 11–01-2011 [Online]. Available: http://support.sas.com/documentation/cdl/en/statug/63033/ HTML/default/viewer.htm#statug_reg_sect007.htm [26] UCLA: Academic Technology Services, Statistical Consulting Group, published 2010, accessed 11–01-2011 [Online]. Available: http://www. ats.ucla.edu/stat/sas/library/SASReg_mf.htm [27] R. L. Ott, An Introduction to Statistical Methods and Data Analysis, 4th ed. Pacific Grove, CA, USA: Duxbury Press, 1993, pp. 648–664. [28] T. E. Avery and H. E. Burkhart, Forest Measurements, 4th ed. New York, NY, USA: McGraw-Hill, 1994. [29] A. Kirton, R. Scholes, and S. Archibald, “A method for calculating the variance and confidence intervals for biomass estimates obtained from allometric equations,” South African J. Sci., vol. 107, no. 5-6, pp. 86–95, 2011. [30] J. v. Aardt, “An object-oriented approach to forest volume and aboveground biomass modeling using small-footprint lidar data for segmentation, estimation, and classification,” Virginia Polytechnic Institute and State University, Blacksburg, VA, 2004. [31] J. Fisher, E. Witkowski, B. Erasmus, G. Asner, K. Wessels, and R. Mathieu, “Human-modified landscapes: Patterns of fine-scale woody vegetation structure in communal savanna rangelands,” Environmental Conservation, vol. 39, no. 1, pp. 72–82, 2010. [32] M. W. Straatsma, Hydrodynamic Roughness of Floodplain Vegetation: Airborne Parameterization and Field Validation. Utrecht, The Netherlands: Koninklijk Nederlands Aardrijkskund Genootschap, 2007. 489 Joseph McGlinchy received the M.S. degree in imaging science from the Rochester Institute of Technology, Rochester, NY, USA, in 2010, and the B.S. degree in electrical engineering from the University of Akron, Akron, OH, USA, in 2008. During his time at Rochester Institute of Technology, his research focused on feature extraction and algorithm development to facilitate land cover component determination and woody and herbaceous biomass modeling in South African savanna ecosystems using airborne waveform LiDAR and hyperspectral imagery. He is a currently a member of the 3-D and Imagery team at Environmental Systems Research Institute, Inc., and is interested in point cloud modeling and processing, feature extraction from remotely sensed 2-D and 3-D data, and 2-D and 3-D signal processing. Jan A. N. van Aardt received the B.Sc. and Hons. degrees in forestry from the University of Stellenbosch, Western Cape, South Africa, in 1996 and 1998, respectively, and the M.S. and Ph.D. degrees in forestry and remote sensing from the Virginia Polytechnic Institute and State University, Blacksburg, VA, USA, in 2000 and 2004, respectively. He is currently an Associate Professor within the Chester F. Carlson Center for Imaging Science, Rochester Institute of Technology (RIT), Rochester, NY, USA, and a member of the Digital Imaging and Remote Sensing group within the Center. This group focuses on a complete system approach to remote sensing applications based on system integration, processing workflow, and applied algorithm development. Before joining the faculty at RIT, he worked in the academic (Katholieke Universiteit Leuven, Leuven, Belgium) and private (Council for Scientific and Industrial Research, South Africa) sectors. His research interests include the application of imaging spectroscopy and light detection and ranging for spectral-structural characterization of natural systems (remote sensing of natural resources). Barend Erasmus is interested in the effects of global change on various aspects of biodiversity, and how these impacts translate to system level responses at different scales of observation. He prefers to collaborate across traditional discipline boundaries, and use multispectral and hypertemporal satellite imagery, aerial photography and airborne lidar to investigate long term changes in savanna woody vegetation structure to understand land cover dynamics across boundaries. The contrast between national parks and surrounding rural communities provide an interesting natural experiment to highlight systems-level behaviors. Gregory P. Asner is a faculty member of the Department of Global Ecology, Carnegie Institution for Science and the Department of Environmental Earth System Science, Stanford University. He explores interactions between species, environment, land use and climate change, with ongoing research and capacity building work focused on deforestation and forest degradation, biological diversity at large geographic scales, and terrestrial biogeochemistry. Dr. Asner develops new scientific approaches and technologies for investigation and conservation assessment of large geographic regions, including their carbon emissions, three-dimensional habitat, chemical functioning, and biological diversity. He leads the CLASlite forest monitoring project, Spectranomics biodiversity project, and the Carnegie Airborne Observatory. Renaud Mathieu received the Dipl.Ing. degree in agricultural sciences from the École Supérieure d’Agriculture de Purpan, Toulouse, France, in 1990, the M.Sc. degree in applied remote sensing from Cranfield University, Cranfield, U.K., in 1991, and the Ph.D. degree in geographic information sciences from Université Paris-Est Marne-la-Vallée, Paris, France. He is currently a Principal Researcher with the Council for Scientific and Industrial Research, Pretoria, South Africa, and leads the Earth Observation Research Group in Natural Resources and Environment. His research interests focus on the application of remote sensing technologies to soil and water conservation, natural resource and wildlife management, environment, and agriculture. 490 IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, VOL. 7, NO. 2, FEBRUARY 2014 Konrad Wessels is currently Chief Scientist and Research Group Leader for the Remote Sensing Research Unit of the CSIR-Meraka Institute. He is an extraordinary senior lecturer at University of Pretoria, Centre for Geoinformation Science. He was awarded a NASA Earth System Sciences fellowship and completed his Ph.D. in geography at the University of Maryland, specializing in remote sensing of land degradation. He spent his post-doc at NASA Goddard Space Flight Centre. David Knapp received the B.S. degree from The Pennsylvania State University in 1987 and the M.S. degree from the University of Illinois in 1991, both in geography with a specialization in remote sensing and GIS. He worked as a scientist for Hughes ITSS, contracted to NASA Goddard Space Flight Center where he participated in various remote sensing research projects including the Large Scale Biosphere-Atmosphere Experiment in Amazonia (LBA) and the Boreal Ecosystem-Atmosphere Study (BOREAS). He is currently an image processing specialist and develops software for processing satellite and aircraft imagery at the Carnegie Institution for Science Department of Global Ecology. Ty Kennedy-Bowdoin studies ecosystem dynamics using LiDAR at the Carnegie Institution for Science. He was the first Carnegie staff member to be fully dedicated to the newly formed Carnegie Airborne Observatory in 2007 and has been a key member of the team responsible for the development of three ground-breaking sensor systems incorporating co-aligned LiDAR and spectrometers to study carbon stocks, biodiversity, gap dynamics, and deforestation in Central and South America, Hawaii, and Southern Africa. Harvey Rhody received the B.S.E.E. from the University of Wisconsin, Madison, WI, USA, the M.S.E.E. from the University of Cincinnati, Cincinnati, OH, USA, and the Ph.D. in electrical engineering from Syracuse University, Syracuse, NY, USA. He is a Professor of imaging science in the Chester F. Carlson Center for Imaging Science at Rochester Institute of Technology. His research interests include imaging systems, remote sensing, imaging algorithms and image processing and he has conducted numerous projects designed in those areas. His current focus is in the area of 3D imaging, where he is developing courses and leading research projects in 3D sensing. He has spent nearly four decades at RIT as a faculty member and administrator, previously serving as head of the Department of Electrical Engineering and president of the RIT Research Corporation. John P. Kerekes (S’81–M’89–SM’00) received the B.S., M.S., and Ph.D. degrees in electrical engineering from Purdue University, West Lafayette, IN, USA, in 1983, 1986, and 1989. From 1983 to 1984, he was a Member of the Technical Staff with the Space and Communications Group, Hughes Aircraft Co., El Segundo, CA, where he performed circuit design for communications satellites. From 1986 to 1989, he was a Graduate Research Assistant, working with both the School of Electrical Engineering and the Laboratory for Applications of Remote Sensing at Purdue University. From 1989 to 2004, he was a Technical Staff Member with the Lincoln Laboratory, Massachusetts Institute of Technology, Lexington, MA. From 2004 to 2013 he was an Associate Professor in the Chester F. Carlson Center for Imaging Science, Rochester Institute of Technology, Rochester, NY, where he is now Professor. His research interests include the modeling and analysis of remote sensing system performance in pattern recognition and geophysical parameter retrieval applications. Dr. Kerekes is a member of Tau Beta Phi, Eta Kappa Nu, the American Geophysical Union, and the American Society for Photogrammetry and Remote Sensing. He is a Senior Member of the Optical Society of America and SPIE. From 1995 to 2004, he served as the Chair of the Boston Section Chapter of the IEEE Geoscience and Remote Sensing Society (GRSS), and from 2007 to 2010 he served as the founding Chair of the Western New York Chapter of GRSS. He was a Co-General Chair of IGARSS 2008 held in Boston, MA. Since 2010 he has been a member of the GRSS Administrative Committee (AdCom) and is currently serving as the Vice-President of Technical Activities of the GRSS. Emmett J. Ientilucci received the B.S., M.S., and Ph.D. degrees in imaging science from the Rochester Institute of Technology, Rochester, NY, USA, in 1996, 1999, and 2005, respectively. He is currently a Research Professor with the Digital Imaging and Remote Sensing Laboratory, Chester F. Carlson Center for Imaging Science, Rochester Institute of Technology. Prior to his faculty position, he was a Postdoctoral Research Fellow for the Intelligence Community. His research interests include physics-based signature modeling, hyperspectral image analysis, radiometry, hybrid detection methods, and atmospheric and radiative transfer modeling and scattering from small particles as it relates to bio-aerosols. He has taught courses and laboratories in radiometry, remote sensing, geometrical optics, measurement and analysis, photo science, dimensional metrology, and computer techniques for technicians, as well as served as Referee on eleven scientific journals. He is currently the vice chair for the Western New York Geoscience and Remote Sensing Society (GRSS) and has a manuscript prepared for his text book entitled Radiometry and Radiation Propagation. Dr. Ientilucci is also a member of the Society for Photographic Instrumentation Engineers. Jiaying Wu (M’09) received the B.S. degree in optical information science from the University of Shanghai for Science and Technology, Shanghai, China, in 2006 and the Ph.D. degree in Imaging Science from Rochester Institute of Technology, Rochester, NY, USA, in 2012. He is currently working at Apple Inc., Cupertino, CA, USA, as a color imaging scientist. His doctoral research involves studies of waveform LiDAR signal and 3D image processing in Digital Imaging and Remote Sensing (DIRS) group at RIT. From 2007 to 2008, he was a research assistant with the Real-Time Vision and Image Processing Laboratory, RIT. During this time, his research focused on the development of medial augmented reality algorithm based on 2-D/3-D image fusion to facilitate image-guided surgery. In 2008 summer, he was a research intern with Sharp Research Laboratories of America, Camas, WA, USA. The research topic involved LCD display color modeling, and his work was published in SPIE Electronic Imaging in 2009. He also spent a summer as an Image System Engineer with Microsoft Corporation, Redmond, WA, USA, in 2010, where he worked on video compression and quality assessment algorithms. His research interests include digital image processing, remote sensing, computer vision, and color science. Diane Sarrazin received the B.Sc. degree in physics and space science from the Royal Military College of Canada in 2005. She is an officer in the Canadian Forces currently working at Defence Research and Development Canada, Quebec. She worked as Deputy to the Senior Aircraft Maintenance Engineering Officer at 438 Tactical Helicopter Squadron from 2006 to 2008. She received her M.Sc. in Imaging Science with Remote Sensing application from Rochester Institute of Technology in 2010. She worked as Imaging Officer for the Canadian Open Skies office from 2010 to 2012 and published an article in the Canadian Journal of Remote Sensing entitled “Fusing small-footprint waveform LiDAR and hyperspectral data for canopy level species classification and herbaceous biomass modeling in savanna ecosystems” in 2011. Kerry Cawse-Nicholson received the Ph.D. degree in computational and applied mathematics from the University of the Witwatersrand, South Africa, in 2012, obtained while working at the Remote Sensing Research Unit, Meraka Institute, Council for Scientific and Industrial Research (CSIR), South Africa. She is currently pursuing a post-doc at Rochester Institute of Technology, USA in the field of waveform LiDAR. Her current research interests include image processing, hyperspectral imagery and LiDAR.
© Copyright 2026 Paperzz