biosystems engineering 106 (2010) 223–233 Available at www.sciencedirect.com journal homepage: www.elsevier.com/locate/issn/15375110 Research Paper Neural-network model for estimating leaf chlorophyll concentration in rice under stress from heavy metals using four spectral indices Meiling Liu a, Xiangnan Liu a,*, Mi Li a, Meihong Fang a, Wenxue Chi b a b School of Information Engineering, China University of Geosciences, Beijing 100083, China Meteorological Observation Center of CMA, Beijing 100081, China article info Heavy metal stress in soils results in subtle changes in leaf chlorophyll concentration, Article history: which are related to crop growth and crop yield. Accurate estimation of the chlorophyll Received 1 September 2009 concentration of a crop under heavy metal stress is essential for precision crop production. Received in revised form The objective of this paper is to create a back propagation (BP) neural-network model to 1 December 2009 estimate chlorophyll concentration in rice under heavy metal stress. Three experiment Accepted 14 December 2009 farms located in Changchun, Jilin Province, China with level II pollution, with level I Published online 15 May 2010 pollution and with safe level were selected, The assessment was based on the input parameters normalised difference vegetation index (NDVI), optimized soil-adjusted vegetation index (OSAVI), modified triangle vegetation index/modified chlorophyll absorption ratio index (MTVI/MCARI), MTVI/OSAVI and the output parameters of rice leaf chlorophyll concentration. The output parameters were sensitive to heavy metal stress. The result indicated that an optimum BP neural-network prediction model has 4-10-2-1 network architecture with gradient descent learning algorithm and an activation function including the sigmoid tangent function in the input layer, a hidden layer and sigmoid logistic functions in the output layer. The correlation coefficient (R2) between the measured chlorophyll concentration and the predicated chlorophyll concentration was 0.9014, and the root mean square error (RMSE) was 2.58. ª 2009 IAgrE. Published by Elsevier Ltd. All rights reserved. 1. Introduction With the development of industry, some pollutants have affected farms. Heavy metals from industrial pollutants have caused stress to crops adjacent to industrial areas. Heavy metal stress is characterised by the reduction of the enzymes involved in chlorophyll biosynthesis and induced photosynthetic inhibition (De Filipps & Pallaghy, 1994), membrane disruption (Droppa & Horvath, 1990), substitution of metal ions for the chlorophyll molecule and the reduction in leaf chlorophyll concentration (Kastori et al., 1998). Heavy metal stress results in crop growth restriction and crop yield loss. It is therefore essential and urgent that measures are made to detect crop heavy metal stress. Remote sensing is a valid alternative to traditional groundbased methods to detect plant stress, especially with the * Corresponding author. E-mail addresses: [email protected] (M. Liu), [email protected] (X. Liu), [email protected] (M. Li), [email protected] (M. Fang). 1537-5110/$ – see front matter ª 2009 IAgrE. Published by Elsevier Ltd. All rights reserved. doi:10.1016/j.biosystemseng.2009.12.008 224 biosystems engineering 106 (2010) 223–233 Nomenclature xi, xmin, xmax, xnorm real, minimum, maximum, normalised value of an input variable yai, ymin, ymax, ynorm, ymi real, minimum, maximum, indexed value of an output variable, measured output variable emergence of hyperspectral data. Some research has been conducted to establish a relationship between heavy metal content in abandoned mines and reflectance characteristics, biochemical composition, and pigment content of the plants respectively (Choe et al., 2008; Gan, Liu, & Zhou, 2004; Liu, Gan, & Wang, 2004). Studies that examined plant response to heavy metal-contaminated soil have generally focused on mines (Ellis & Eslick, 1997). Only a few studies have focused on farmland under heavy metal stress. Recently, a few researchers have demonstrated the correlation between crop spectral characteristics and crop chlorophyll content under copper or lead contamination stress (Chi, Liu, Liu, & Yang, 2006; Li, Yang, Zhang, Zhang, & Zhou, 2008; Ren, Zhuang, & Pan, 2008). However, these experiments were performed under laboratory conditions by adding copper, lead, chromium, or zinc, etc. The fact that most studies have been conducted under laboratory conditions, rather than under field conditions, is one of the primary stumbling blocks in applying hyperspectral remote sensing to precision agriculture. The level of the pollutants found in farmlands is much lower than the level found in abandoned mines or used in laboratory-based experiments. Alterations in plant biochemistry and cellular composition imposed by heavy metal stress are subtle, and the spectral differences derived from heavy metal stress are not obvious; hence using conventional methods utilising spectral characteristic analysis has presented a challenge with respect to detecting crop heavy metal stress. Therefore, priority should be given to selecting effective methods for detecting variations in biochemical composition. It is well known that artificial neural-network models have advantages in determining the input–output relationship for complex systems based on the strength of their interconnections presented in a set of sample data (Howard & Mark, 2000). Tumbo, Wagner, and Heinemann (2002) developed a back propagation (BP) neural-network model to estimate chlorophyll content based on spectrometry readings, and verified that the estimated values from this neuralnetwork model correlated with chlorophyll measurements reasonably. Wang and Thai (2003) created a generic algorithm model to select an optimal second-order polynomial model to assess crop nitrogen stress based on a ratio of leaf reflectance measured at 700 nm and 740 nm. Noh, Zhang, Shin, Han, and Feng (2006) used a BP neural-network model to assess crop nitrogen stress based on maize canopy reflectance in three channels (green, red, and near-infrared – NIR) of a multispectral image sensor. Significant progress has been made in this field. However, there has been little research on the variations in leaf chlorophyll concentration under various levels of heavy metal stress, RSME root mean square error between real output variable and measured output variable wjk, wij, h, a connection weights of hidden layer and output layer, input layer and hidden layer, learning rate and momentum coefficient especially for crops growing in field situations rather than under laboratory conditions. In this paper, we propose a neuralnetwork model to retrieve leaf chlorophyll concentration accurately from a rice crop grown under heavy metal stress using performance spectral indices from hyperspectral data. 2. Materials and methods 2.1. Field experiment design The city of Changchun, Jilin Province in China is an important industrial district and agricultural district. Some areas have been contaminated by industrial pollutants; among the most important of these are heavy metals. Suburban farms have soils with copper and cadmium at concentrations above those Fig. 1 – Location map for experimental sites in Changchun, Jilin Province, China. 225 biosystems engineering 106 (2010) 223–233 Table 1 – The information of the experiment sites. Geographical location Copper content (mg kg1) Cadmium content (mg kg1) Pollution level Soil quality standarda (mg kg1) 68.2 45.5 20.4 0.465 0.182 0.093 II I Safe II (50 Cu 400; 0.3 Cd 0.6) I (35 Cu < 50; 0.2 Cd < 0.3) Safe (Cu < 20.8; Cd < 0.097) 43 52.20 N, 125 10.20 E 43 54.60 N, 125 10.40 E 44 06.30 N, 125 10.20 E a Soil quality standard according to the Environment Monitoring Centre of China. considered to be normal for the area. Three experimental sites (43 510 34.800 N–43 510 37.000 N, 125 090 07.200 E–125 100 25.300 E) adjacent to the China First Automobile Factory (i.e., the contamination source) in Changchun were selected (Fig. 1). Heavy metal contamination stress levels in the soil of the three field experiments (labelled A, B, and C) varied. The soil and the stress rates were determined as safe level, level I pollution and level II pollution according to a soil sample analysis (Table 1). The site is within the temperate continental climate zone with a mean annual rainfall of 522–615 mm, where soils are dominated by black soils with sufficient organic matter (2–4%). The crop selected in this site was rice which is one of the most important crops in China. 2.2. Data preparation 2.2.1. Spectral data measurement The data collection was carried out during four days during a typical rice growth season: 8 July, 4 August, 29 August and 18 September 2008, which corresponded to the seeding, tillering, booting and mature growth stages of rice. All spectral measurements were taken under cloudless or near cloudless conditions between 10:00 and 14:00, using an ASD FieldSpec Pro spectrometer (Analytical Spectral Devices, Boulder, CO. USA). The spectrometer was fitted with 10 field of view fibre optics, operated in the 350–2500 nm spectral region with sampling intervals of 2 nm. A BaSO4 calibration panel was used for determining the black and baseline reflectance. A panel radiance measurement was taken before and after rice measurement using two scans each time. Rice radiance measurements were made at 20 sites in each plot and each site was scanned 10 times. These measurements were then averaged for the particular plot. The average reflectance curves for the three field experiments are shown in Fig. 2. 2.2.2. Corporation, NJ, USA). The chlorophyll content in rice leaves was measured at six randomly chosen times and the average value for each sample was calculated. The chlorophyll concentration was calculated from the SPAD-502 chlorophyll readings (Wood, Reeves,& Himelrick, 1993), using the following equation: y ¼ 0:996x 1:52 where y is the chlorophyll concentration and x is the SPAD-502 chlorophyll readings (mg cm2). 2.3. The effect of Cu and Cd contamination on rice Cd is a non-essential element that can be easily absorbed by plants. While Cu is an essential element, high doses can adversely affect plant growth (Pahlsson, 1989). The interaction between Cu and Cd is complex and has an effect on their individual functions. Huang, Hu, and Liu (2009) demonstrated that soil application of Cu greatly enhanced Cd accumulation, but the application of Cd had a negligible effect on Cu uptake by rice plants. Biochemical, physiological, and structural aspects affected by excess Cu and Cd in rice included following: (1) Cd and Cu can inhibit the activities of several antioxidative enzymes including superoxide dismutase (SOD), ascorbate peroxidase (APOD), and glutathione reductase (GR) (Fernandes & Henriques, 1991); (2) they can inhibit the stomatal opening (Barcelo & Poschenrieder, 1990); (3) they damage the photosynthetic apparatus (Chien & Kao, 2000; Krupa, 1988; Sidlecka & Baszynsky, 1993); (4) they are substituted for magnesium (Mg), the central atom of chlorophyll, and then reduce the chlorophyll content (Chien, Wang, Metal content and leaf chlorophyll measurement The spectral measurements were correlated with measurements of the chlorophyll content of the rice and heavy metal concentrations. The metal content in the soil was analysed by the Chinese Academy of Agricultural Sciences, Beijing, China. The soils samples were firstly digested by reverse aqua-regia (HNO3: HCl ¼ 3:1) (Stevens, 2002), then Cu was measured using a flame atomic absorption spectrophotometer (AAS) (Spectr AA 110/220, Varian, USA) and Cd with a graphite furnace AAS. Soil samples were also analysed for diethylenetriaminepentaacetic acid (DTPA) extractable heavy metals using the method of Lindsay and Norvell (1969) and the metal concentrations were determined by AAS. A simple method of determining the chlorophyll content is the portable Chlorophyll Meter SPAD-502 (Minolta Fig. 2 – Reflectance spectrum of rice under different levels of heavy metal pollution. 226 biosystems engineering 106 (2010) 223–233 values of Cd and Cu always occurred in rice with level II pollution, followed by level I pollution, and then safe level. A distinction was found between the uptakes of Cd and Cu at different growth stages of rice. The highest values of Cd and Cu uptake occurred in the tillering and mature growth stages respectively. 2.4. Fig. 3 – Chlorophyll concentration in rice leaves at different growth stages. Lin & Kao, 2001; Lagriffoul, Mocquot1, Mench1, & Vangronsveld, 1998; Larsson, Bordman, & Asp 1998). In summary, the main toxic effect of Cu and Cd is oxidative stress. Heavy metal toxicity in rice leaves was assessed by the decrease in chlorophyll and protein contents (Huang et al., 2009). An average value of chlorophyll concentration was obtained from four critical growth stages of rice. The result showed that Cd and Cu induced stress in rice had a close correlation with chlorophyll concentration in rice at different growth stages (Fig. 3). From Fig. 3, within seeding, tillering, booting and mature growth stages, the chlorophyll concentration of rice with safe level was higher than those with level I pollution, and higher than level II pollution. In addition, the greatest difference in chlorophyll concentration in rice with different pollution levels occurred at the mature growth stage. Therefore, the difference in chlorophyll concentration of rice proved credible and feasible method for estimating Cu and Cd contamination levels. Low values in the chlorophyll concentrations indicated that the rice was suffering from very serious contamination. The reduction of chlorophyll concentration in rice is caused by Cd and Cu preferentially accumulating in the chloroplasts as the rice grows. Cd and Cu levels in rice leaves with different growth stages are displayed in Fig. 4. As seen in Fig. 4, the highest Spectral indices used in the model As illustrated in Fig. 2, in the NIR region, the leaf reflectance of rice under heavy metal stress is somewhat lower than that of the safe level. A number of studies have demonstrated that the shifts in vegetation spectra of plant under heavy metal stress occur in both the visible and the NIR part of the spectrum (Kooistra et al., 2004). With the previous analysis results, the primary effect of Cu and Cd metal-induced stress on rice is the corresponding reduction in chlorophyll. Therefore, our goal was to select spectral indices sensitive to leaf chlorophyll concentration in rice. Strong relationships exist between chlorophyll concentrations and spectral indices formulated with some specific narrow spectral bands including 550 nm, 670 nm, 700 nm, 800 nm (Haboudane et al., 2002; ZarcoTejada, Miller, Noland, Mohammed, & Sampson, 2001). Therefore, several spectral indices formulated with the above values were selected in this study. 2.4.1. Normalised difference vegetation index (NDVI) The most well known and widely used vegetation index is the normalised difference vegetation index (NDVI) developed by Rouse, Haas, Schell, Deering, and Harlan (1974). It is based on the contrast between the maximum absorption in the red due to chlorophyll pigments and the maximum reflection in the infrared caused by leaf cellular structure. Using hyperspectral narrow wavebands, this index is quantified by the following equation: NDVI½670; 800 ¼ R800 R670 R800 þ R670 (1) where Rx is the reflectance at the given wavelength (nm). Our results showed that NDVI has a strong non-linear relationship with chlorophyll concentration (Fig. 5), the correlation coefficient (R2) being 0.6986. Gallagher, Pechmann, Bogden, Grabosky, and Weis (2008) demonstrated that NDVI can predict heavy metal stress and that it had a curvilinear relationship with total heavy metal load (R2 ¼ 0.85). Fig. 4 – Cu and Cd in rice leaves at different growth stages. biosystems engineering 106 (2010) 223–233 227 Fig. 5 – Relationship between different vegetation indices and chlorophyll concentration. 2.4.2. Modified triangle vegetation index (MTVI) The MTVI belongs to the triangle vegetation index family (Broge & Leblance, 2000). The general principle behind this index is to describe the radiative energy absorbed by the pigments as a function of the relative difference between red and NIR reflectance in conjunction with the magnitude of reflectance in the green region, where the light absorption by chlorophylls is relatively insignificant. The index is calculated as the area of the triangle defined by the green peak, the chlorophyll absorption minimum, and the NIR shoulder in spectral space. It based on the fact that chlorophyll absorption causes a decrease in red reflectance. Some researchers have tried to modify the triangle vegetation index by adjusting the coefficient and the waveband of equation. Guang and Liu (2009) developed the MTVI by as the followed equation: MTVI ¼ 1:5½1:2ðR712 R550 Þ 2:1ðR670 R550 Þ (2) The results showed that MTVI was insensitive to chlorophyll concentration of rice under heavy metal stress. This was attributed to the insignificant difference in the area of the triangle derived from green peak and the chlorophyll absorption maximum, especially in rice with level I pollution. 2.4.3. Modified chlorophyll absorption ratio index (MCARI) To reduce non-photosynthetic factors, Kim, Daughtry, Chappelle, McMurtrey, and Walthal (1994) developed the chlorophyll absorption ratio index (CARI) which measures the depth of chlorophyll absorption at 670 nm relative to the green reflectance peak at 550 nm and the reflectance at 700 nm. It uses bands corresponding to the minimum absorption of the photosynthetic pigments, centred at 550 nm and 700 nm, in conjunction with the chlorophyll a maximum absorption band, around 670 nm. Daughtry et al. (2000) developed the MCARI which can be simplified as the following equation: 228 biosystems engineering 106 (2010) 223–233 Table 2 – Hyperspectral vegetation indices used in the sensitivity analysis. Indices Wavebands (nm) NDVI[670,800] 670,800 OSAVI[670,800] MCARI[670,700] MTVI[550,712] MTVI/MCARI[550,712] 670,800 550,670,700 550,670,712 550,670,700,712 MTVI/OSAVI[550,800] 550,670,712,800 Formula R800 R670 R800 þ R670 OSAVI ¼ ð1 þ 0:5ÞðR800 R670 Þ=ðR800 þ R670 þ 0:5Þ MCARI ¼ ½ðR700 R670 Þ 0:2ðR700 R550 ÞðR700 =R670 Þ MTVI ¼ 1:5½1:2ðR712 R550 Þ 2:1ðR670 R550 Þ 1:5½1:2ðR712 R550 Þ 2:1ðR670 R550 Þ MTVI=MCARI ¼ ½ðR700 R670 Þ 0:2ðR700 R550 ÞðR700 =R670 Þ 1:5½1:2ðR712 R550 Þ 2:1ðR670 R550 Þ MTVI=OSAVI ¼ ð1 þ 0:5ÞðR800 R670 Þ=ðR800 þ R670 þ 0:5Þ NDVI½670; 800 ¼ MCARI ¼ ½ðR700 R670 Þ 0:2ðR700 R550 ÞðR700 =R670 Þ (3) Reference Rouse et al. 1974 Huete, 1988 Daughtry et al. 2000 Guang & Liu, 2009 _ In sensitivity analysis, bands at 550 nm, 670 nm, 700 nm, 712 nm and 800 nm were selected. Therefore, reflectance in these bands was selected to formulate vegetation indices which may have potential in estimating chlorophyll concentration. Six indices (Table 2) were tested in this study; the result indicated that NDVI, OSAVI, MTVI/MCARI and MTVI/ OSAVI were sensitive to chlorophyll content variations of rice under heavy metal stress. Our results showed that MCARI was also insensitive to chlorophyll concentration of rice under heavy metal stress; while other researchers have reported that MCARI was sensitive to chlorophyll concentration (Wu, Niu, Tang, & Huang, 2008). The reason is that the green peak position moved towards longer wavelengths as the heavy metal content increased. Daughtry et al. (2000) have pointed out that MCARI was still sensitive to non-photosynthetic element effects, mainly at low chlorophyll concentrations. 2.5. Neural-network model development 2.4.4. 2.5.1. Data pre-processing phase Optimized soil-adjusted vegetation index (OSAVI) Daughtry et al. (2000) proved that when MCARI is combined with a soil line vegetation index like OSAVI (Rondeaux, Steven, & Baret, 1996), the sensitivity to the underlying soil reflectance properties can be reduced. OSAVI belongs to the soil-adjusted vegetation index (SAVI; Huete, 1988) family and is defined by the following equation: OSAVI ¼ ð1 þ 0:5ÞðR800 R670 Þ=ðR800 þ R670 þ 0:5Þ (4) As shown in Fig. 5, rice leaf chlorophyll concentration was strongly correlated with OSAVI (R2 ¼ 0.6990). This agrees well with the results of Haboudane et al. (2002). 2.4.5. Integrating two vegetation indices Research has indicated that an integrated index modelling is more sensitive to chlorophyll content variations and more resistant to the variations of leaf area index (LAI) and solar zenith angle than a single vegetation index (Haboudane et al., 2002). Wu et al. (2008) proved that integrated indices transformed chlorophyll absorption ratio index (TCARI/OSAVI) [705,750] and MCARI/OSAVI [705,750] have better linearity with chlorophyll content together with high correlation: R2 of 0.8808 and 0.9406 respectively. To improve sensitivity of the vegetation index to subtle variations in rice leaf chlorophyll concentration under heavy metal stress, in this paper, two integrated indices are proposed. They be defined by the following equations: MTVI=MCARI ¼ 1:5½1:2ðR712 R550 Þ 2:1ðR670 R550 Þ ½ðR700 R670 Þ 0:2ðR700 R550 ÞðR700 =R670 Þ (5) MTVI=OSAVI ¼ 1:5½1:2ðR712 R550 Þ 2:1ðR670 R550 Þ ð1 þ 0:5ÞðR800 R670 Þ=ðR800 þ R670 þ 0:5Þ (6) Our results indicated that the two integrated indices MTVI/ MCARI, MTVI/OSAVI and leaf chlorophyll content were highly correlated with R2 of 0.7280 and 0.7198 respectively (Fig. 5). _ In this study, to avoid data saturation, the input variables in this model were normalised, based on their possible ranges using the following equation: xnorm ¼ xi xmin xmax xmin (7) where xi, xmin, xmax and xnorm are the real-valued input variable, the minimum input variable, maximum input variable and its normalised value respectively. The output from a neural-network model is an indexed value that corresponds to the input variable. To get the real-valued output, the indexed output value needs to be de-normalised according to the following equation: yai ¼ ymin þ ynorm ymax ymin (8) Where yai, ymin, ymax and ynorm are the real-valued output variable, the minimum and maximum values of the realvalued output, and the indexed output value from the neuralnetwork model, respectively. 2.5.2. Model building phase Artificial neural network (ANN) models have found wide applications, including prediction, classification,system modelling, signal processing, noise filtering (Pachepsky, Timlin, & Varallyay, 1996; Pal, Pal, Das, & Majumdar, 2003; Plate, Bert, Grace, & Band, 2000; Tumbo et al., 2002). BP neutral networks are popular neutral network architectures (Bishop, 1995; Cichocki & Unbehauen, 1993; Foody, 1995; Pachepsky et al., 1996; Pal et al., 2003; Plate et al., 2000; Tumbo et al., 2002). In this paper, a BP neural-network model was established to estimate the leaf chlorophyll concentration, which in turn, indicates the stress levels of rice growing under heavy metal stress. The input variables for the BP neural-network model were NDVI, OSAVI, MTVI/OSAVI and MTVI/MCARI, while the output variable of this model was chlorophyll concentration. biosystems engineering 106 (2010) 223–233 Fig. 6 – A simple BP neural-network example flow chart with 4-6-1 structure. The topological structure of this BP neural-network model consisted of four input neurons in the input layer, one output neuron in the output layer and one or two hidden layers. The flow chart of the 4-6-1 structure as an example is shown in Fig. 6. The model building phase aims to produce a robust ANN model that can accurately map outputs from inputs. A good ANN model mainly relies on the choice of an optimum neutral network architecture and network internal parameters (i.e., the number of hidden layers, number of neurons, type of scaling and activation functions, learning and momentum rates, and stopping criterion), and the selection of the training algorithm (Gautam, Panigrahi, & Franzen, 2006). In this study, different network architectures, activation functions and training algorithms were tested. The ideal 229 activation function develops models using a sigmoid tangent function that scales the input data in the open interval of 1–1, a hidden layer utilising the sigmoid tangent function, and an output layer utilising sigmoid logistic functions which scales the input data to lie within the open interval of 0–1. The typical gradient descent BP algorithm in learning algorithms proved to be suitable for this study. Maier and Dandy (2000) and Zealand, Burn, and Simonovic (1999) reported that over 80% of previous neural-network models used a gradient descent BP training algorithm, which is a supervised learning paradigm. By supervising learning, a desired response is available to guide the learning process. In a BP algorithm, the weights are initially assigned arbitrary small values. As training progresses, the root mean square error (RMSE) between the target output and the network output is calculated. If the network output does not produce the expected criterion, the training transfers to the BP. The error signal carries through a reverse calculation by a connecting pathway. The gradient descent BP algorithm utilises a learning rate and momentum coefficient to adjust weight values and to move towards a global minimum in the error surface. Fig. 7 shows the gradient descent BP learning algorithm in the neural-network architecture. By trying different numbers of hidden layers and different numbers of neurons in each hidden layer, the optimum model architectures were 4-6-1, 4-8-1, 4-10-2-1 and 4-10-3-1. Compared with one hidden layer, two hidden layers had higher accuracy in estimating chlorophyll concentration. Table 3 summarised the optimum model architecture and internal parameters utilising in the different models developed in this study. 2.5.3. Model evaluation phase Generally, RMSE and average accuracy have been used to measure the performance of neural-network models (Gautam Fig. 7 – A process flow chart of a gradient descent BP learning algorithm. 230 biosystems engineering 106 (2010) 223–233 Table 3 – Summary table showing optimum ANN model’ architecture and internal parameters. Model Optimum network Activation function Training algorithm 1 2 3 4 5 6 7 8 4-6-1 4-6-1 4-8-1 4-8-1 4-10-3-1 4-10-3-1 4-10-2-1 4-10-2-1 {tansig, {tansig, {tansig, {tansig, {tansig, {tansig, {tansig, {tansig, Traingdm Traingdm Traingdm Traingdm Traingdm Traingdm Traingdm Traingdm logsig} linear} logsig} linear} tansig, logsig } tansig, linear} tansig, logsig } tansig, linear} Learning rate Momentum coefficient R2 RMSE Pav 0.2 0.2 0.2 0.2 0.15 0.15 0.15 0.15 0.2 0.2 0.2 0.2 0.2 0.2 0.2 0.2 0.8040 0.7412 0.8014 0.7016 0.8483 0.7378 0.9014 0.7791 4.01 5.25 3.37 4.34 3.14 4.03 2.58 3.06 0.8516 0.8423 0.8664 0.8510 0.8662 0.8421 0.8922 0.8737 Notes: Tansig denotes sigmoid tangent function; logsig denotes sigmoid logistic functions; linear denotes linear function; traingdm denotes gradient descent BP algorithm. et al., 2006). In this study, the parameters were: (i)RMSE; (ii) average accuracy (Pav); (iii) R2. The three parameters were computed by: sffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi 2 PN i¼1 yai ymi RMSE ¼ N1 (9) Where yai, ymi, RMSE are the real-valued output variable, measured output variable, RMSE between real-valued output variable and measured output variable respectively, N is the sample number. The lower RMSE, the better the performance of the model. PN Pav ¼ i¼1 Pi N j yai ymi j and Pi ¼ 1 yai (10) Where Pav and Pi are the average accuracy and the accuracy of the ith result. Pav provides information on the accuracy that the model can yield using a given data set. The nearer the value approaches 1; the better is the performance of the model. PN i¼1 R2 ¼ PN i¼1 yai ya yai ya PN i¼1 2 PN ymi ym i¼1 ðymi ym 2 2 (11) Where R2, ym and ya are the correlation coefficient, average measured output variable and average real-valued output variable respectively. The R2 represents the correlation between predicted and measured variables. It is assumed that the predicted and measured variables follow a normal distribution. Its value ranges from 0 to 1. The higher the value of the correlation, the stronger is the indication of existing linear relations between the actual and predicted variables. 3. Result and discussion Due to the large variation of chlorophyll concentration in rice with different levels of pollution that occur in the mature growth stage, the best stage for monitoring heavy metal stress is at the mature growth stage of rice. Therefore, in this research, 138 training data sets and 69 test data sets from the mature growth stage of rice were obtained for different levels of heavy metal pollution. By trying different neutral network architectures, network internal parameters and training algorithms, satisfactory neural-network models were developed (Table 3); namely, the BP neural-network architectures were 4-6-1, 4-8-1, 4-10-2-1 and 4-10-3-1; the learning algorithm was a gradient descent BP Table 4 – Quantitative relationships of leaf chlorophyll concentration ( y) to individual spectral index (x) in rice. Spectral index NDVI OSAVI MTVI/OSAVI MTVI/MCARI Fig. 8 – Comparison of predicated chlorophyll concentration using an artificial neural-network model and multiple regression models. Regression equation R2 y ¼ 4.9705x 0.7648 y ¼ 5.6535x2 2.0623x þ 1.3419 y ¼ 0.5746e2.1683x y ¼ 2.8959ln(x) þ 3.7551 y ¼ 7.8019x 0.7146 y ¼ 14.493x2 3.4562x þ 1.3885 y ¼ 0.5859e3.4098x y ¼ 2.8214ln(x) þ 5.0382 y ¼ 0.6401x þ 4.871 y ¼ 0.1123x2 1.5414x þ 6.5817 y ¼ 6.699e0.2786x y ¼ 2.4517ln(x) þ 5.6369 y ¼ 1.2729x þ 4.8087 y ¼ 0.9391x2 5.3069x þ 8.9348 y ¼ 6.7585e0.5721x y ¼ 2.7379ln(x) þ 4.0951 0.6799 0.6986 0.6514 0.6441 0.6780 0.6990 0.6521 0.6384 0.6886 0.7198 0.6568 0.7179 0.6131 0.7280 0.6236 0.6794 biosystems engineering 106 (2010) 223–233 231 Fig. 9 – Measured and predicted chlorophyll concentration using BP neutral network using different network structures. algorithm; with a the learning rate of 0.2 or 0.15; the momentum coefficient was 0.2; the activation function was the sigmoid tangent function in the input layer and hidden layer and there were sigmoid logistic functions in the output layer. According to the three parameters for assessing the performance of the neural-network models, optimal network architecture should have the lowest RMSE, and the value of Pav and R2 close to 1. As seen in Table 3, the optimal network architecture was obtained when the neural-network model had ten neurons in the first hidden layer and two neurons in the second hidden layer. The R2 for the optimal structure BP network model was 0.9014; RMSE was 2.58 and Pav was 0.8922. To assess the performance of the neural-network model more thoroughly, a comparison between the neural-network model and a statistical regression model was made. The baseline model was a best-fit regression model obtained from four spectral indices analysis of the rice crop and estimated values of the optimal structure BP network model. Applying two methods to the results gave the following respectively: y ¼ 3:209 þ 1:993 NDVI 0:6390 OSAVI 0:363 ðMTVI=OSAVIÞ 0:220 ðMTVI=MCARIÞ 4. Where R2 ¼ 0.713, y is predicated chlorophyll concentration from four spectral indices using multiple regression models. yai ¼ 0:9873ymi 0:091 2 variable (measured chlorophyll concentration) using optimal structure BP network model. The quantitative relationships between leaf chlorophyll concentration and individual spectral indices in rice are summarised in Table 4. As shown in Fig. 8 and Table 4, the neural-network model showed stronger correlation than the statistical regression model. Before a neural-network model can be confidently applied to predict the leaf chlorophyll concentration to determine rice heavy metal stress levels, it is necessary to validate the model by checking the degree to which the model predicted leaf chlorophyll concentration matches the measured leaf chlorophyll concentration obtained from field tests. Fig. 9 shows that the predicted leaf chlorophyll concentration matched the measured leaf chlorophyll concentration well. As well as predicting that the rice leaf chlorophyll concentration increases with decreasing heavy metal content, the highest chlorophyll concentration values were predicted for the experiment site with safe level, followed by that of level I pollution and then level II pollution. (13) Where R ¼ 0.9014,yai, ymi are the real-valued output variable (predicated chlorophyll concentration), measured output Conclusion Hyperspectral data can provide an effective means for estimating leaf chlorophyll concentration variation of crops under heavy metal stress. Because of heavy metal stress, the biochemical component of the crop changes and crop leaf cellular structure is damaged, resulting in differences in 232 biosystems engineering 106 (2010) 223–233 spectral reflectance. Therefore, it is feasible to detect such variations in the biochemical composition of crops, specifically using leaf chlorophyll concentration in the crop through spectral reflectance. In this research, some spectral indices were formulated by the reflectance of two or more bands. In sensitivity analysis, it was found that NDVI, OSAVI, MTVI/ MCARI and MIVI/OSAVI are more sensitive to chlorophyll concentration than the other indices, so they were selected to estimate leaf concentration. The relationship between these indices and chlorophyll concentration was developed by the BP neural-network model. The input neurons in the model were NDVI, OSAVI, MTVI/MCARI and MIVI/OSAVI, and the output neuron from this model was chlorophyll concentration which was then used to assess rice heavy metal stress levels. After testing a training set, our results showed that the neuralnetwork model developed can accurately estimate chlorophyll concentration variation to assess heavy stress levels. When crops were contaminated by heavy metal, the leaf chlorophyll concentration in the crop decreased. The higher the level of heavy metal content, the lower the leaf chlorophyll concentration. This finding agrees with the results of Kastori et al. (1998). This research verified that the neural-network model developed here could provide an effective and faithful estimation of leaf chlorophyll variation based on the four performance spectral indices. In trials with the BP neuralnetwork model, the optimal model, with ten neurons in the first hidden layer and two neurons in the second hidden layer, performed better than a statistical model of multivariable spectral indices and a single spectral index. Based on the results obtained from this study, it can be concluded that using a neural-network model and hyperspectral data can provide sufficient information to efficiently detect crop biochemistry components and assess the level of pollutants in field operations. Acknowledgements This research was supported by the National Natural Science Foundation of China (40771155) and National High-tech R&D Program of China (863 Program) (2007AA12Z174). references Barcelo, J., & Poschenrieder, C. (1990). Plant water relations as affected by heavy metal stress: review. Journal of Plant Nutrition, 13, 1–37. Bishop, C. M. (1995). Neural networks for pattern recognition. Oxford, England: Clarendon Press. Broge, N. H., & Leblance, E. (2000). Comparing prediction power and stability of broadband and hyperspectral vegetation indices for estimation of green leaf area index and canopy chlorophyll density. Remote Sensing of Environment, 76, 156–172. Chi, G. Y., Liu, X. H., Liu, S. H., & Yang, Z. F. (2006). Studies of relationships between Cu pollution and spectral characteristics of TritiZnm Aestivum L. Spectroscopy and Spectral Analysis, 26(7), 1272–1276. Chien, H. F., & Kao, C. H. (2000). Accumulation of ammonium in rice leaves in response to excess cadmium. Plant Science, 156, 111–115. Chien, H. F., Wang, J. W., Lin, C. C., & Kao, C. H. (2001). Cadmium toxicity of rice leaves is mediated through lipid peroxidation. Plant Growth Regulation, 33, 205–213. Choe, E., Meer, F. V. D., Ruitenbeek, F. V., Werff, H. V. D., Smeth, B. D., & Kim, K. W. (2008). Mapping of heavy metal pollution in stream sediments using combined geochemistry, field spectroscopy, and hyperspectral remote sensing: a case study of the Rodalquilar mining area, SE Spain. Remote Sensing of Environment, 112, 3222–3233. Cichocki, A., & Unbehauen, R. (1993). Neural networks for optimization and signal processing. Chichester, NY, USA: John Wiley & Sons. Daughtry, C. S. T., Walthall, C. L., Kim, M. S., Brown de Colstoun, E., & McMurtrey, J. E., III (2000). Estimating corn leaf chlorophyll concentration from leaf and canopy reflectance. Remote Sensing of Environment, 74, 229–239. De Filipps, L. F., & Pallaghy, C. K. (1994). Heavy metals: sources and biological effects. In L. C. Rai, J. P. Gaur, & C. J. Soeder (Eds.), Advances in limnology series: algae and water pollution. Stuttgart, Germany: E. Scheizerbartsche Press. Droppa, M., & Horvath, G. (1990). The role of copper in photosynthesis. CRC Critical Reviews in Plant Science, 9, 111–123. Ellis, R. W., & Eslick, L. (1997). Variation and range of mercury uptake into plants at a mercury-contaminated abandoned mine site. Bulletin of Environmental Contamination and Toxicology, 59, 763–769. Fernandes, J. C., & Henriques, F. S. (1991). Biochemical, physiological, and structural effects of excess copper in plants. The Botanical Reivew, 57(3), 246–266. Foody, G. M. (1995). Using prior knowledge in artificial neural network classification with a minimal training set. International Journal of Remote Sensing, 16, 301–312. Gallagher, F. J., Pechmann, I., Bogden, J. D., Grabosky, J., & Weis, P. (2008). Soil metal concentrations and productivity of Betula populifolia (gray birch) as measured by field spectrometry and incremental annual growth in an abandoned urban Brownfield in New Jersey. Environmental Pollution, 156, 699–706. Gan, F. P., Liu, S. W., & Zhou, Q. (2004). Identification of mining pollution using Hyperion data at Dexing copper mine in Jiangxi province, China. Earth Science-Journal of China University of Geosciences, 29(1), 119–126. Gautam, R., Panigrahi, S., & Franzen, D. (2006). Neural network optimisation of remotely sensed maize leaf nitrogen with a genetic algorithm and linear programming using five performance parameters. Biosystems Engineering, 95(3), 359–370. Guang, L., & Liu, X. N. (2009). Two kinds of modified spectral indices for retrieval of crop canopy chlorophyll content. Advances in Earth Science, 29(5), 548–554. Haboudane, D., John, R., Millera, J. R., Tremblay, N., ZarcoTejada, P. J., & Dextraze, L. (2002). Integrated narrow-band vegetation indices for prediction of crop chlorophyll content for application to precision agriculture. Remote Sensing of Environment, 81, 416–426. Howard, D., & Mark, B. (2000). Neural network toolbox, version4. Natick, MA, USA: The MathWorks, Inc. Huang, Y. Z., Hu, Y., & Liu, Y. X. (2009). Combined toxicity of copper and cadmium to six rice genotypes (Oryza sativa L.). Journal of Environmental Sciences, 21, 647–653. Huete, A. R. (1988). A soil vegetation adjusted index (SAVI). Remote Sensing of Environment, 25, 295–309. Kastori, R., Plesnicar, M., Sakac, Z., Pancovic, D., & Arsenijevic Maksimovic, I. (1998). Effect of excess lead on sunflower growth and photosynthesis. Journal of Plant Nutrition, 21, 75–85. Kim M. S., Daughtry C. S. T., Chappelle E. W., McMurtrey III J. E., & Walthal C. L. (1994). The use of high spectral resolution bands biosystems engineering 106 (2010) 223–233 for estimating absorbed photosynthetically active radiation (Apar). In Proceedings of the sixth symposium on physical measurements and signatures in remote sensing (pp. 299–306). Val D’Isere, France, January 17–21. Kooistra, L., Salas, E. A. L., Clevers, J. G. P. W., Wehrens, R., Leuven, R. S. E. W., Nienhuis, P. H., et al. (2004). Exploring field vegetation reflectance as an indicator of soil contamination in river floodplains. Environmental Pollution, 127, 281–290. Krupa, Z. (1988). Cadmium-induced changes in the composition and structure of the light-harvesting complex II in radish cotyledons. Physiologia Plantarum, 73, 518–524. Lagriffoul, A., Mocquot1, B., Mench1, M., & Vangronsveld, J. (1998). Cadmium toxicity effects on growth, mineral and chlorophyll contents, and activities of stress related enzymes in young maize plants (Zea mays L.). Plant and Soil, 200, 241–250. Larsson, E. H., Bordman, J. F., & Asp, H. (1998). Influence of UV-B radiation and Cd2þ on chlorophyll fluorescence, growth and nutrient content in Brassica napus. Journal of Experimental Botany, 49, 1031–1039. Li, Q. T., Yang, F. J., Zhang, B., Zhang, X., & Zhou, G. Z. (2008). Biogeochemistry responses and spectral characteristics of Rhus Chinensis Mill under heavy metal stress. Journal of Remote Sensing, 12(2), 284–290. Lindsay, W. L., & Norvell, W. A. (1969). Development of a DTPA micronutrients soil test. Agronomy Abstracts, 6, 84. Liu, S. W., Gan, F. P., & Wang, R. S. (2004). The application of Hyperion data to extracting contamination information of vegetation in the DEXING copper mine Jiangxi province, China. Remote Sensing for Land & Resources, 59, 7–10. Maier, H. R., & Dandy, G. C. (2000). Neural networks for the prediction and forecasting of water resources variables: a review of modeling issues and application. Ecological Modeling, 15, 101–124. Noh, H., Zhang, Q., Shin, Han, S., & Feng, L. (2006). A neural network model of maize crop nitrogen stress assessment for a multi-spectral imaging sensor. Biosystems Engineering, 94(4), 477–485. Pachepsky, V., Timlin, A. D., & Varallyay, G. (1996). Artificial neural networks to estimate soil water retention from easily measurable data. Soil Science Society of America Journal, 60(3), 727–733. Pahlsson, A. M. B. (1989). Toxicity of heavy-metals (Zn, Cu, Cd, Pb) to vascular plants. Water Air and Soil Pollution, 47, 287–319. Pal, N. R., Pal, S., Das, J., & Majumdar, K. (2003). SOFM-MLP: a hybrid neural network for atmospheric temperature 233 prediction. IEEE Transactions on Geoscience and Remote Sensing, 41(12), 2783–2791. Plate, T., Bert, J., Grace, J., & Band, P. (2000). Visualizing the function computed by a feed forward neural network. Neural Computation, 12(6), 1355–1370. Ren, H. Y., Zhuang, D. F., & Pan, J. J. (2008). Canopy hyperspectral characteristics of paddy plants contaminated by lead. GeoInformation Science, 10(3). 314–319 (in Chinese). Rondeaux, G., Steven, M., & Baret, F. (1996). Optimization of soil adjusted vegetation indices. Remote Sensing of Environment, 55, 95–107. Rouse J. W., Haas R. H., Schell J. A., Deering D. W., & Harlan J. C. (1974). Monitoring the vernal advancements and retrogradation of natural vegetation. In NASA/GSFC, Final report (pp. 1–137). Greenbelt, MD, USA. Sidlecka, A., & Baszynsky, T. (1993). Inhibition of electron flow around photosystem I in chloroplasts of cadmium-treated maize plants is due to cadmium-induced iron deficiency. Physiologia Plantarum, 87, 199–202. Stevens, D. (2002). CSIRO land and water’s methods manual. ACIAR and CSIRO. pp. 15–16. Tumbo, S. D., Wagner, D. G., & Heinemann, P. H. (2002). Hyperspectral-based neural network for predicting chlorophyll status in corn. Transactions of the ASAE, 45(3), 825–832. Wang, T., & Thai, C. N. (2003). A genetic algorithm based tool for analysis and modeling of multi-spectral data. ASAE Paper No. 03-3126. MI, USA: St. Joseph. Wood, C. W., Reeves, D. W., & Himelrick, D. G. (1993). Relationship between chlorophyll meter readings and leaf chlorophyll concentration, N status, and crop yield: a review. Proceedings of the Agronomy Society of New Zealand, 23, 1–9. Wu, C. Y., Niu, Z., Tang, Q., & Huang, J. (2008). Estimating chlorophyll content from hyperspectral vegetation indices: modeling and validation. Agricultural and Forest Meteorology, 148, 1230–1241. Zarco-Tejada, P. J., Miller, J. R., Noland, T. L., Mohammed, G. H., & Sampson, P. H. (2001). Scaling-up and model inversion methods with narrow-band optical indices for chlorophyll content estimation inclosed forest canopies with hyperspectral data. IEEE Transactions on Geoscience and Remote Sensing, 39, 1491–1507. Zealand, C. M., Burn, D. H., & Simonovic, S. P. (1999). Short term stream flow forecasting using artificial neural networks. Journal of Hydrology, 214, 32–48.
© Copyright 2026 Paperzz