306 IEEE TRANSACTIONS ON POWER SYSTEMS, VOL. 24, NO. 1, FEBRUARY 2009 Day-Ahead Price Forecasting of Electricity Markets by Mutual Information Technique and Cascaded Neuro-Evolutionary Algorithm Nima Amjady and Farshid Keynia Abstract—In a competitive electricity market, price forecasts are important for market participants. However, electricity price is a complex signal due to its nonlinearity, nonstationarity, and time variant behavior. In spite of much research in this area, more accurate and robust price forecast methods are still required. In this paper, a combination of a feature selection technique and cascaded neuro-evolutionary algorithm (CNEA) is proposed for this purpose. The feature selection method is an improved version of the mutual information (MI) technique. The CNEA is composed of cascaded forecasters where each forecaster consists of a neural network (NN) and an evolutionary algorithm (EA). An iterative search procedure is also incorporated in our solution strategy to fine-tune the adjustable parameters of both the MI technique and CNEA. The price forecast accuracy of the proposed method is evaluated by means of real data from the Pennsylvania-New Jersey-Maryland (PJM) and Spanish electricity markets. The method is also compared with some of the most recent price forecast techniques. Index Terms—Cascaded neuro-evolutionary algorithm (CNEA), iterative search procedure, mutual information (MI), price forecast. I. INTRODUCTION W ITH the introduction of restructuring into the electric power industry, the pricing of electricity in the electricity markets has become very important [1]. Accurate day ahead price forecast in the spot market helps power suppliers to adjust their bidding strategies to achieve the maximum benefit. Similarly, consumers can derive a plan to maximize their purchased electricity from the pool, or use self production capability to protect themselves against high prices. On a short time scale, transmission bottlenecks may prevent a free exchange among different regions resulting in extreme price volatility or even price spikes in the electricity market, e.g., the price spikes of the Pennsylvania-New Jersey-Maryland (PJM) and California markets in 1999 and 2000, respectively [2], [3]. Besides, volatility in fuel price, load uncertainty, fluctuations in the hydroelectricity production, generation uncertainty (outages) and behavior of market participants also contribute to electricity price uncertainty [1]. Manuscript received January 25, 2008; revised August 26, 2008. First published December 09, 2008; current version published January 21, 2009. Paper no. TPWRS-00027-2008. The authors are with the Department of Electrical Engineering, Semnan University, Semnan, Iran (e-mail: [email protected]). Color versions of one or more of the figures in this paper are available online at http://ieeexplore.ieee.org. Digital Object Identifier 10.1109/TPWRS.2008.2006997 In a power market, the price of electricity is the most important signal to market participants and the most basic pricing concept is the market-clearing price (MCP) [1], [3]. Generally, when there is no transmission congestion, MCP is the only price for the entire system. However, when there is congestion, the zonal market clearing price (ZMCP) or the locational marginal price (LMP) could be employed. ZMCP may be different for various zones, but it is the same within a zone. LMP can be different for different buses. LMP is the sum of generation marginal cost, transmission congestion cost, and cost of marginal losses, although the cost of losses is usually ignored [1]. When there is no congestion, LMP is the same as MCP. When there is congestion, transmission line constraints are considered in order to balance supply and demand at each bus. The marginal cost of each bus is the LMP. The importance of electricity price forecasting on the one hand, and its complexity on the other hand, motivates many research works in the recent years. Stationary time series models such as auto-regressive (AR) [4], dynamic regression and transfer function [5], [6], auto-regressive integrated moving average (ARIMA) [7], and nonstationary time series models like generalized auto-regressive conditional heteroskedastic (GARCH) [8] have been proposed for this purpose. However most of the time series models are linear predictors, while electricity price is generally a nonlinear function of its input features. So, the behavior of the price signal may not be completely captured by the time series techniques [3]. To solve this problem, some other research works proposed neural networks (NNs) and fuzzy neural networks (FNNs) for electricity price forecast [2], [9]–[14]. NNs and FNNs have the capability of modeling the nonlinear input/output mapping functions. The appropriate selection of input features is a key factor for the success of FNN or NN based price forecast methods. Besides, efficiency of these methods is usually dependent on the correct tuning of their adjustable parameters, e.g., the number of hidden nodes of the NNs. Some other price forecast methods have been also proposed in recent years. In [15], combination of fuzzy inference system (FIS) and least-squares estimation (LSE) has been proposed for prediction of LMP. In [16], electricity price series is decomposed by wavelet transform with the aim of finding less volatile components and then each subseries is separately forecasted by the ARIMA technique. Combination of similar days (SDs) and NN techniques are proposed for LMP prediction in [17]. In [18], a mixed model has been proposed for the price forecast, where weekends and weekdays are modeled by separate ARIMA time series techniques. Weighted nearest 0885-8950/$25.00 © 2008 IEEE Authorized licensed use limited to: IEEE Xplore. Downloaded on March 12, 2009 at 06:58 from IEEE Xplore. Restrictions apply. AMJADY AND KEYNIA: DAY-AHEAD PRICE FORECASTING OF ELECTRICITY MARKETS neighbors (WNNs) method has been presented for electricity price forecast in [19]. In this paper a new prediction strategy is proposed for dayahead price forecasting of electricity markets. The contribution of the paper can be summarized as follows. 1) Presentation of a new feature selection technique based on MI method. The proposed technique can rank candidate inputs according to their information value for the prediction of electricity price signal. 2) Presentation of a new forecast method based on CNEA. The CNEA is composed of 24 cascaded forecasters in which each forecaster consists of an NN and EA. In the CNEA, the 24 hourly time series of electricity price are separately modeled. 3) The adjustable parameters of both the feature selection technique and CNEA are adaptively fine-tuned for each forecast horizon by an iterative search procedure. The paper is organized as follows. In Section II, the proposed feature selection technique is described. The CNEA is explained in Section III. In Section IV, the proposed search procedure for fine-tuning of the adjustable parameters is described. In Section V, the proposed approach has been applied to the PJM and Spanish electricity markets and the results are discussed. The proposed method has also been compared with some of the most recent price forecast techniques. A brief review of the paper and future research are described in Section VI. 307 Based on (1) and (2), the entropy is often considered a measure of uncertainty. As an example, let the random variable represent the presence of a disease D. If there is no uncertainty about or disdisease presence, i.e., , then the ease absence, i.e., equals zero. If however, there is high uncertainty entropy about the presence or absence of disease, i.e., , then the entropy equals 1. Generally, diseases are equally possible , if any one of achieves its maximum value . This value then represents the highest possible uncertainty about the random variable . of two discrete random variables The joint entropy and with a joint probability distribution is defined by (3) indicates the total entropy of random variables where and [21]. When certain variables are known and others are not, the remaining uncertainty is measured by the conditional entropy [22] II. PROPOSED FEATURE SELECTION TECHNIQUE Feature selection is a process commonly used in machine learning, wherein a subset of features available from data is selected for application of a learning algorithm. The best subset contains the least number of key features contributing to accuracy; while discarding the remaining unimportant features. This is an important stage of pre-processing and is one way of avoiding the curse of dimensionality. One of the most recently developed feature selection techniques is mutual information, which is based on the entropy concept. This technique has been frequently used for feature selection in classification problems such as pattern recognition, cancer classification, medical image processing, etc. [20]–[23]. Here, a new formulation of the mutual information technique is proposed for the feature selection of electricity price forecast. of a continuous random variable with The entropy is defined as follows [21], [22]: probability distribution (1) If is a discrete random variable with values and probabilities is defined as follows: then (4) The joint entropy and the conditional entropy have the following relation [22]: (5) This, known as the “chain-rule,” implies that the total entropy of random variables and is the entropy of plus the remaining entropy of for a given . This matter is graphically shown in Fig. 1 [21]. The information found commonly in two random variables and is of importance in our work, which between the two is defined as the mutual information variables (6) , respectively, (2) If the mutual information between two random variables is large, the two variables are closely related and vice versa. If the mutual information becomes zero, the two random variables are totally unrelated or the two variables are independent. Authorized licensed use limited to: IEEE Xplore. Downloaded on March 12, 2009 at 06:58 from IEEE Xplore. Restrictions apply. 308 IEEE TRANSACTIONS ON POWER SYSTEMS, VOL. 24, NO. 1, FEBRUARY 2009 Fig. 1. Representation of mutual information and different entropies. The mutual information and the entropy have the following relations, as shown in Fig. 1: (7) (8) (9) (10) (11) measures the a priori uncertainty of the random variable measures the conditional a posteriori uncertainty of after is observed [22]. According to (7), the mutual information measures how much the uncertainty of is reduced if has been observed. If and are indepenbased on (9). Consedent, then quently, their mutual information is zero, i.e., observing does not reduce the uncertainty of . If, however, the two random and becomes completely related, which occurs variables , then according to (11), mutual information bewhen . tween these two variables reaches to its maximum value . becomes known Now, suppose that the random variable resulting in its uncertainty becoming negligible. If the two variand are tightly related then the mutual information ables is high and vice versa. So by observing , the unsignificantly (insignificantly) certainty of random variable decreases. In the price forecast process, we assume that the (like the lagged candidate set of input features values of the price, load demand or available generation) are owning more mutual informaknown. The candidate input with the target variable is a better candition the uncertainty date as input feature, since by employing of reduces more than using the other candidate inputs. It is noted that for the feature selection of electricity price forecast, target variable becomes the next hour price. Hence, the mutual can assign a value to each candidate information to forecast the target variable . In other words, we input can rank the candidate inputs based on their mutual information with the target variable or their information value for the forecast process. However, the mutual information algorithm suffers from the curse of dimensionality for the price forecast of electricity markets. We explain this problem in the form of an example. For instance, considering the daily and weekly periodicity characteristics of the electricity price signal, its candidate set of input variables at least includes lagged values of price up to and lagged values 200 hours ago of load (as the most important price driver [1]) up to 200 , totally 400 candidates. hours ago , etc.), lagged This set contains short run trend ( values indicating daily representing weekly periodicity [24]. periodicity and The auto-regression part (the lagged values of the price signal) and only one exogenous variable, i.e., load are included in this candidate set [2]. More exogenous variables such as available generation can be also considered in the candidate set provided that their data be available in the corresponding electricity market [3]. In the next step, a training period (for preparation of training samples) should be selected. A long training period may cause inaccuracies because price characteristics vary with time. Besides, a longer results in more computation burden. On the other hand, a short training period may cause volatile estimates [16]. In [18], the effect of different training periods on the price forecast accuracy has been evaluated. In this paper, previous 50 days is used as a training period as recommended training samples. in [2], [16], which results in Based on the above explanations, 400 probability distribution functions for the candidate inputs and one probability distribution function for the target variable should be constructed where each probability distribution is based on 1200 sample data. This input data is required by the mutual information technique. Besides, joint probability distribution between each candidate input and the target variable should also be computed as shown in (6). So, implementation of the mutual information technique for the electricity price forecast requires high computation burden. Since the price signal has a time variant mapping function with respect to its input features, the feature selection process should be regularly repeated for each forecast interval (e.g., once every day) to avoid using irrelevant input variables. Hence, we changed the formulation of the mutual information technique for the price forecast process so that it can be implemented by a reasonable computation burden. As seen from (2), (3), (4), and (6), the basic definitions of the mutual information technique use logarithms in base 2 [14]–[17]. So, this fact motivates using binomial distributions for the inputs and output. For this purpose, at first, all the candidate inputs and target variable are linearly normalized in , denoted by “Normalization” in the data the range of preparation part of Fig. 2. Then, the median of each normalized variable is computed. Half of the values of the normalized variable are more than its median which are rounded to 1 and the other half are less than it which are rounded to 0. After this process, a binomial distribution is obtained for each candidate Authorized licensed use limited to: IEEE Xplore. Downloaded on March 12, 2009 at 06:58 from IEEE Xplore. Restrictions apply. AMJADY AND KEYNIA: DAY-AHEAD PRICE FORECASTING OF ELECTRICITY MARKETS 309 Fig. 2. Structure of the proposed forecast strategy. input and the target variable. The joint probability distribution of these binomial distributions can be obtained as shown in Table I. To find the joint distributions, an auxiliary variable corresponding to joint probability is defined as follows: TABLE I JOINT PROBABILITY DISTRIBUTION BETWEEN CANDIDATE INPUT Y AND TARGET VARIABLE X (12) The auxiliary variable can have four values ranging from 0 counts number of data points (out of all to 3. samples) in which . and are similarly defined. So, the four states of the joint probability can be easily computed as shown in Table I. Besides, the two and can be also determined as state probabilities follows: (13) (14) (15) (16) Based on the above individual and joint probabilities, the previous (6) can be written as shown in (17) at the bottom of the page. In the proposed feature selection technique, we compute and target mutual information between the candidate input variable , i.e., , by means of (17) and then rank the candidate inputs based on their mutual information. This formulation of the mutual information technique can be implemented with reasonable computation burden and acceptable accuracy. III. PROPOSED FORECAST METHOD For day ahead prediction, two approaches including the iterative forecasting and direct forecasting can be used [25], [26]. In iterative forecasting, a single forecaster with one output node is employed and the forecast values are iteratively used as inputs for the next forecasts. On the other hand, if the number of output nodes is equal to the length of the forecast horizon (here, 24 hours ahead), then the direct forecasting approach is used in which we forecast the future values directly from the forecaster outputs. In [26], it has been discussed that the first approach may generate more prediction errors and the second approach can introduce serious round off errors. To modify these imperfections, we apply the idea of cascaded forecasters as shown in the forecast part (CNEA) of Fig. 2. As seen, the CNEA block consists (17) Authorized licensed use limited to: IEEE Xplore. Downloaded on March 12, 2009 at 06:58 from IEEE Xplore. Restrictions apply. 310 IEEE TRANSACTIONS ON POWER SYSTEMS, VOL. 24, NO. 1, FEBRUARY 2009 of 24 consecutive forecasters, where each forecaster has a single output allocated to predict the price of one hour of the next day. In other words, the 24 hourly time series have been separately modeled instead of the complete time series of the price. The larger homogeneity of the 24 hourly time series in comparison with the complete one, as well as the fact that 24 onestep-ahead forecasts are calculated everyday instead of 24 individual forecasts with prediction horizons varying from 1 to 24 hours, will allow an improvement in the accuracy of the forecasts [18]. The structure of cascaded forecasters can be considered as a combination of direct and iterative forecasting, where price of each hour is directly predicted and each forecaster has only one output. Each of the 24 forecasters of the CNEA is composed of an NN part and EA part (Fig. 2). The NN part of all 24 forecasters has multilayer perceptron (MLP) structure and Levenberg–Marquardt (LM) learning algorithm, which is an efficient training mechanism for the prediction tasks. The LM algorithm trains a neural network 10 to 100 times faster than the usual gradient descent back propagation method [24]. This algorithm is an approximation of Newton’s method and computes the approximate Hessian matrix. Mathematical details of this learning algorithm can be found in [27]. According to Kolmogorov’s theorem, the MLP can solve a problem by using one hidden layer provided it has the proper number of neurons [28]. So, one hidden layer has been considered in the MLP structure of all 24 NNs. The MLP network is good at capturing global data trends [2], [10]. However, its forecast capability can be enhanced if we add the local search ability to it. An appropriate solution for this purpose is EA. The results of the training phase of each NN, i.e., its weights, are given to the corresponding EA part (Fig. 2). An MLP network usually searches the solution space in a special direction based on its learning algorithm (like the steepest descent). The EA searches around the final solution of the learning algorithm in various directions as much as possible to find a better solution. Since a local search around the solution of the NN is required, we use EA with momentum as the EA part of the forecaster, which has smoother search paths avoiding from sudden changes. In the EA with momentum, evolution from a generation to the next one is performed as follows: (18) (19) where is an adjustable parameter (weight) of the correindicates its change. The subscripts sponding NN and and represent two successive generations (parent and child, respectively) of the EA. is a small random number separately generated for each adjustable parameter. is the and is momentum constant. In our examinations selected in the range of (0, 0.1) for all generations of the EA. is the obtained value from At the beginning of the EA, the learning algorithm of the NN for the adjustable parameter and . In each cycle, the EA repeats (18) and (19) until the next generation of all adjustable parameters is obtained. Then the error function of the NN (its validation error) is evaluated for the new generation. If the child has less error than its parent, the parent is replaced by the child, otherwise the parent is restored and the next cycle of the EA is executed. So, at the end of the EA, the best examined solution among all generations will be selected. In the rare cases where the EA part cannot find a better solution, the NN’s one will be restored. We considered 100 generations for the EA in our solution strategy for day ahead price forecasting. IV. ITERATIVE SEARCH PROCEDURE The proposed forecast strategy has two main adjustable paand number of hidden rameters: number of input features . As with many other forenodes of the NNs denoted by cast methods, the efficiency of the proposed prediction strategy (including the feature selection technique and forecast method) is dependent on the appropriate adjustment of its parameters. It is noted that the MI based feature selection technique can only rank the candidate inputs, but the number of selected candidates is a degree of freedom and should be separately deteris considered for all 24 NNs of the forecast mined. The same method to decrease the number of adjustable parameters. Usually the adjustable parameters of load or price forecast methods are selected based on experience or heuristics. Here, an iterative search procedure is proposed, which can automatically adjust and (Fig. 2) with minimum reliance on the heuristics. This procedure can be summarized as the following step-by-step algorithm. . Select candidate inputs 1) Set an initial value for owning the highest MI values (absolute value) with the is selected equal target variable. The initial value of to [2]. 2) By the selected inputs, training samples can be constructed based on the data of the 50-day training period. Hence, for each of 24 forecasters, 50 training samples related to its respective hour are generated. Each forecaster can be trained (including the learning algorithm of the NN part and evolution of the EA part) by its own training samples. Then, validation errors of the forecasters are evaluated. is kept constant and is varied at a neighborhood 3) around its previously selected number. For each value of in the neighborhood, training phase of the forecasters is repeated and their validation error is evaluated. The value resulting in the least validation error is selected and of fixed. , is varied at a neighborhood 4) With the fixed value of around its previously selected number. For each value of in the neighborhood, training samples are constructed. Then training phase of the forecasters is repeated and their resulting in validation error is evaluated. The value of the least validation error is selected and fixed. and in the current cycle 5) If the obtained values for coincide with their previous values, the iterative search procedure is terminated. Otherwise go back to step 5. The validation set is a part of training samples which is removed from training phase and so becomes unseen for NN. Thus, the error of NN for validation set or validation error can be a measure of the NN’s error for forecast horizon. Validation set should be as similar as possible to forecast horizon so that Authorized licensed use limited to: IEEE Xplore. Downloaded on March 12, 2009 at 06:58 from IEEE Xplore. Restrictions apply. AMJADY AND KEYNIA: DAY-AHEAD PRICE FORECASTING OF ELECTRICITY MARKETS 311 the validation error can be a true representative of the prediction error. We considered the day before the forecast day as the validation set, since it includes both the short run trend and daily periodicity characteristics of the price signal [2]. The other alternatives, such as the same day in the previous week, can be also considered as the validation set. So, each of 24 forecasters training samples and validated is really trained by by one unseen validation sample (the last sample, related to the respective hour in the day before the forecast day). The mean absolute percentage error (MAPE) for 24 validation samples is calculated and considered as the validation error in this paper (20) and are actual and forecasted prices of where is number of samples. Here, . hour , respectively; The error of validation samples is also used for early stopping of the training phase of the NNs and avoiding from overfitting. For a better illustration, the sample results of the iterative search procedure for the PJM electricity market is shown in Fig. 3. The validation set is March 12, 2006. In the right and left vertical axes of this figure, the results of the MAPE with fixed (step 3 of the algorithm) and MAPE with fixed (step 4 of the algorithm) are shown. Two consecutive transitions (from left to right and right to left) represent one cycle of the procedure. As seen, from top to bottom, slope of lines decreases, until we reach to a horizontal line indicating the convergence point of the procedure where more cycles of the algorithm again reach the same point and so the procedure is terminated. It cannot be claimed that this procedure finds the global optimum of the adjustable parameters since it is based on local searches. It can only find appropriate values for these parameters. However, to enhance the efficiency of the procedure, we add the adaptive search capability to it. In our initial implementation of the iterative search of or procedure, a fixed radius of neighborhood (e.g., was considered in the local searches (steps 3 and 4 of the algorithm). This results in slow convergence since a different radius of neighborhood may be suitable for each cycle of the procedure. Besides, appropriate adjustment of these parameters depends on experience. So, we implemented the iterative search procedure with an adaptive neighborhood. It is explained in the is 20 while form of an example. Suppose the initial value of is fixed (step 3 of the algorithm). The procedure increases by one and evaluates the validation error. If results in a lower error than , the procedure proceeds ; otherwise the proceand evaluates the next value and stops searching values of . dure returns is returned The search is continued until the value of where after that the validation error begins to increase. Then, the step by step and similarly evaluate the procedure decreases until the validation validation error for error begins to increase and searching the values of is stopped (the value of with minimum validation found in the error is returned). Between the two values of and , reincrease and decrease directions ( spectively), i.e., two local minimums, the better one with the Fig. 3. Sample results of the iterative search procedure. least validation error is selected. If no better value can be found in the both directions, the procedure returns the previous value . The adaptive neighborhood is similarly executed of (step 4 of the algorithm). The adaptive in the local search of neighborhood increases the convergence rate of the procedure. Besides, it provides an unsymmetrical search capability in the two directions. We executed the iterative search procedure with an adaptive neighborhood for many validation sets in different electricity markets, but it always converges in a few iterations. For instance, in Fig. 3, it converges in five iterations. Table II from top to bottom represents obtained results for this figure. This table includes ten half cycles or five cycles. In each half cycle, the local or as described in the search varies only one parameter ( algorithm. The procedure terminates when and , i.e., their previous values are obtained which are indicated in the last two rows of Table II. The effect of adaptive neighborhoods can be seen in Table II. changes from 15 to 27 (80% inFor instance, in cycle 1, crease) indicated in the third and fourth rows, while in cycle changes from 24 to 25 (about 4% increase) indicated in 4, the ninth and tenth rows. Table II also represents the efficiency of the proposed procedure where validation error shrinks from 13.54% at the beginning of the procedure to 4.63% at the end of it (a three times decrease in the validation error). It is noted that in NN based price forecast methods, the prediction error usually has a nonlinear behavior with respect to the number of input nodes and number of hidden nodes. So, the correct tuning of these parameters is a difficult task, which is usually performed based on the experience and heuristics. For instance, in Fig. 4, the nonlinear behavior of the validation error is shown ( is kept constant) for the same with respect to validation set of Fig. 3. By considering both degrees of freedom and , we reach to Fig. 5, representing the complex behavior of the validation error with respect to the adjustable parameters. The proposed procedure can automatically and adaptively fine-tune the parameters with minimum reliance on the Authorized licensed use limited to: IEEE Xplore. Downloaded on March 12, 2009 at 06:58 from IEEE Xplore. Restrictions apply. 312 IEEE TRANSACTIONS ON POWER SYSTEMS, VOL. 24, NO. 1, FEBRUARY 2009 TABLE II EXECUTION OF THE ITERATIVE SEARCH PROCEDURE WITH THE ADAPTIVE NEIGHBORHOODS(FROM TOP TO BOTTOM) the optimum solution in the next one. So, when the procedure converges at the end of the iterations, the probability of not finding the optimum solution is low. V. NUMERICAL RESULTS Fig. 4. Curve of the validation error with respect to N (N N Fig. 5. Curve of the validation error with respect to and = 25 and = 30). of the validation error occurs for N N N = 25). (the minimum heuristics (only initial value of is required in the step 1 of the algorithm). This iterative search procedure can be easily extended for any number of adjustable parameters. For this purpose, in each step of each iteration, one parameter is varied while the others are kept constant. It is noted that if the proposed search procedure has only one cycle, then it possibly traps in a local minimum and misses the optimum solution. However, the search process of the proposed procedure is repeated in each iteration with the refined values of the adjustable parameters. Thus, if the procedure misses the optimum solution in one iteration, then it has the chance of finding The proposed method is examined for the day ahead electricity markets of mainland Spain and PJM. Besides, we compared the method with at least seven recently published price forecast techniques. Our forecast strategy used in all examinations of this paper can be briefly summarized as follows (Fig. 2). At first, the candidate inputs and target variable are linearly nor. The MI based feature selection malized in the range of technique selects the input variables. By the selected inputs, training and validation samples are constructed based on the data of 50 days ago. Each of 24 forecasters of the CNEA is trained by its respective 49 training samples and validated by one validation sample (the last sample). The adjustable parameand , are fine-tuned by the iterative search proters, i.e., cedure. The candidate inputs for the PJM electricity market include lagged values of price, load, and available generation and for the Spanish electricity market include lagged values of price and load (each variable up to 200 hours ago), which are publicly available data on their websites [29] and [30], respectively (no specific data has been used). The whole proposed price forecast strategy (including the feature selection technique, 24 forecasters of the CNEA and iterative search procedure) has price data up to the midnight of last night. In general, if the th forecaster requires the hourly prices of the forecast day, the predicted price values for these hours by its previous forecasters for the second forecaster is the preare used. For instance, dicted price by the first forecaster. It is noted that considering the congestion information can affect LMP forecast. Congestion occurs when a transmission line flow exceeds its limit. So line flow and line limit information together could reveal line flow congestion and its severity [1]. If we have a large number of lines, considering line flows and line limits highly increases the number of inputs of the price forecast method. Processing of this large number of inputs (even after feature selection) is a complex task and may even deteriorate the accuracy of the price forecast method since the forecasters (Fig. 2) should also learn the input/output mapping function between these added inputs and output. The other alternatives such as congestion index [1] and congestion price [29] have been also proposed to include the congestion information in the price forecast method. However, most of research works in the area do not consider the congestion information in the LMP forecasting like [15] and [17]. The first verification for the method proposed in this paper has been performed by means of real data of the PJM electricity market in year 2006. The obtained results for some test days and weeks are shown in Table III and compared with the results of [17], which proposes combination of SD and NN techniques for day ahead price forecasting. For the sake of a fair comparison, the same test days and weeks of [17] have been considered in have been directly quoted Table III and the results of from [17]. Besides, the same MAPE definition of this reference Authorized licensed use limited to: IEEE Xplore. Downloaded on March 12, 2009 at 06:58 from IEEE Xplore. Restrictions apply. AMJADY AND KEYNIA: DAY-AHEAD PRICE FORECASTING OF ELECTRICITY MARKETS TABLE III MAPE OF THE PROPOSED METHOD AND SD + NN PLUS THE MAPE IMPROVEMENT FOR THE PJM ELECTRICITY MARKET IN YEAR 2006 313 TABLE V SELECTED FEATURES PLUS THEIR RANKS AND NORMALIZED MI VALUES FOR A TEST DAY OF THE FIRST EXAMINATION (FEBRUARY 10, 2006) TABLE IV ERROR VARIANCE OF THE PROPOSED METHOD AND SD + NN PLUS THE VARIANCE IMPROVEMENT FOR THE PJM ELECTRICITY MARKET IN YEAR 2006 has been used for the results of Table III, denoted by in this paper (21) (22) and are as defined for (20) and indicates the average of actual prices as defined in (22). indicates forecast horizon. For daily and equals to 24 and 168, respectively. For the weekly MAPE, five test days and two test weeks of Table III, daily MAPE and weekly MAPE are used, respectively [17]. The forecasting error is the main concern for power engineers; a lower error indicates a better result. For all test days and weeks of Table III, the proposed method outperforms . Besides, improvement in the MAPE of the proposed method with respect to is shown in the last column of Table III. In addition to the mean error, stability of results is another important factor for the comparison of forecast methods. In Table IV, variance of the prediction errors, as a measure of uncertainty, for the forecast days and weeks is presented. have been quoted from Again, the variance values of [17]. The proposed method also has lower error variances than in all test days and weeks indicating less uncertainty in the predictions of the proposed method. Improvement in the error variance of the proposed method with respect to is shown in the last column of Table IV. where Now, sample results for the feature selection part are presented. For the sake of conciseness, obtained results for only one of the test days (February 10, 2006) are represented in Table V. , , and in Table V indicate price, load, and available generation of hour , respectively. In this Table, selected features (columns 1 and 4) plus their ranks (columns 2 and 5) and normalized MI values (columns 3 and 6) are shown. The reported results have been obtained after the convergence of the value. As seen, out of iterative search procedure with final 600 candidate inputs considered for the PJM electricity market (200 lagged values of price, load, and available generation), only 28 inputs are selected, which indicates high filtering ratio of the proposed feature selection technique. For the other test days of this examination, the final value varies from 24 to 32. Among the 28 selected features of Table V, 11 inputs are from the auto-regression part and 17 inputs are from the exogenous variables. The 17 inputs include 12 and 5 features from the load and available generation, respectively. So, load is a more important price driver than available generation for this test case. Between load and price variables, the selected load features are slightly more than the selected price features, but the price features overall have higher ranks. Similar results have been obtained for the other test days. It is noted that all 24 forecasters of the CNEA use the same set of 28 input features (for February 10, 2006) based on the sliding window technique. The first forecaster of the CNEA uses the input feaor the price of the first tures shown in Table V and predicts hour of the forecast day. Then, the input features of Table V are shifted by one hour ahead and used by the second forecaster of (the price of the second hour of the the CNEA to predict forecaster of the chain of foreforecast day). In general, the casters of the CNEA uses the selected inputs of Table V that are hours ahead. shifted by We considered a large candidate set of input variables including the effects of short-run trend, daily and weekly periodicities so that the maximum information content of the input data is utilized (in other words, no effective input variable is missed). Then, this candidate set is refined by the proposed feature selection technique to remove candidate inputs with low information value. Although the effect of weekly periodicity may Authorized licensed use limited to: IEEE Xplore. Downloaded on March 12, 2009 at 06:58 from IEEE Xplore. Restrictions apply. 314 IEEE TRANSACTIONS ON POWER SYSTEMS, VOL. 24, NO. 1, FEBRUARY 2009 be less than the effect of daily periodicity in the price time series, however it is still useful for the price forecast. For instance, is among the selected inputs, but as seen from Table V, with a lower rank than . This is due to the fact that load demand is an important price driver and it has weekly periodicity behavior [24]. Thus, we can expect that the effect of weekly periodicity is also observed in the price time series. We also included price data of coal and natural gas in the feature selection process of the PJM electricity market. Coal and natural gas have the highest portions among different fuels in the PJM electricity market. We examined different test periods of year 2006; however, prices of coal and natural gas were never selected in competition with the initial candidate inputs including the history of electricity price, load demand and available generation. When the fuel prices have normal changes (like those of year 2006), they may be less effective than the initial candidate inputs. However, sudden changes in the fuel prices and especially fuel price spikes can be effective in the unexpected changes of electricity price and even electricity price spikes may be generated. This is a matter that demands further research especially on the electricity price spike analysis and forecasting. In the next examinations, only the final results of the whole proposed method are evaluated and compared with the other price forecast techniques. The second examination of this paper has been performed on the Spanish electricity market. Obtained results for four weeks corresponding to four seasons of year 2002 are presented in Tables VI and VII. The fourth week of February, May, August, and November are selected for winter, spring, summer, and fall seasons, respectively, which is the test period considered in [2], [16], [18], and [31]. In the second and third columns of Table VI, results of ARIMA and ARIMA plus wavelet transform have been quoted from [16]. The wavelet transform was used to decompose the price signal into less volatile components. The fourth column represents the obtained results from the FNN (quoted from [6]), in which instead of decomposing the price values, its solution space is softly divided into subspaces and each functional relationship of the price is implemented in one subspace. In the fifth column of Table VI, obtained results from an MLP neural network with LM learning algorithm, quoted from [31], are shown. In the sixth column, results of mixed model are represented (quoted from [18]). In this reference, a separate ARIMA model has been presented for each hour of the forecast day while weekdays and weekends have been separately modeled. It is noted that although the idea of individual modeling of the 24 hourly time series of electricity price has been presented in [18], their prediction method is based on the ARIMA time series. However, in the CNEA, the combination of NN and EA has been proposed as the forecast block. Besides, the other parts of our forecast strategy including the MI based feature selection and iterative search procedure for fine-tuning of adjustable parameters are new and not seen in the previous works. Finally, in the last column of Table VI, the obtained results of the proposed method are represented. The weekly MAPE values in Table VI are in terms in (21) with . The proposed method of has slightly more weekly MAPE than the wavelet-ARIMA and FNN in the winter test week and than the mixed model in the TABLE VI WEEKLY MAPE VALUES IN TERMS OF PERCENTAGE (%) FOR FOUR WEEKS OF THE SPANISH ELECTRICITY MARKET IN YEAR 2002 TABLE VII ERROR VARIANCE FOR FOUR WEEKS OF THE SPANISH ELECTRICITY MARKET IN YEAR 2002 spring test week. However, in all other test cases of Table VI, the proposed method has lower weekly MAPE than the other techniques. Moreover, the average of the weekly MAPE values of the proposed method is considerably less than all other techniques (indicated in the last row of Table VI). Improvement in the average MAPE of the proposed price forecast strategy with respect to ARIMA, wavelet-ARIMA, FNN, NN, and mixed model is 46.59%, 34.40%, 29.25%, 40.29%, and 42.79%, respectively. In Table VII, the variance of the prediction errors for the ARIMA [16], wavelet-ARIMA [16], FNN [2], and proposed method for the four test weeks are presented. Although the error variance of the proposed method is slightly more than the other techniques in the winter and spring test weeks, but the average variance value of the method is less than the others. Improvement in the average error variance of the proposed price forecast strategy with respect to ARIMA, wavelet-ARIMA, and FNN is 60.87%, 43.75%, and 33.33%, respectively. For the NN and mixed model, the error variance has not been presented in the respective references. Another important characteristic of the proposed method, which can be seen from Table III, Table IV, Table VI, and Table VII, is its less sensitivity to the electricity market volatility than the other examined price forecast techniques. For instance, volatility of the Spanish electricity market increases in summer and fall seasons as discussed in [2] and [16]. So, the price forecast methods generally encounter more prediction errors in these seasons, which can be seen from Table VI. However, the weekly MAPE of the proposed method has less seasonal variation than the other techniques. Table VII shows that the error variance of the proposed method also has less variations. We also executed the proposed method for all 52 weeks of year 2002 for the Spanish electricity market. The obtained weekly MAPEs and error variances for 52 weeks are close to the results of Tables VI and VII, respectively. For instance, average of weekly MAPEs for 52 weeks is 5.38% versus 5.32% in Table VI (average of the four test weeks). Average of error Authorized licensed use limited to: IEEE Xplore. Downloaded on March 12, 2009 at 06:58 from IEEE Xplore. Restrictions apply. AMJADY AND KEYNIA: DAY-AHEAD PRICE FORECASTING OF ELECTRICITY MARKETS variances for both 52 weeks and four weeks (Table VII) is 0.0036. These results indicate the robustness of the proposed method and its performance in a long run for a complete year. The averages of MAPE values for working days and weekends of year 2002 in the Spanish electricity market are 5.22% and 5.79%, respectively. As seen, the average of MAPE values for working days (5.22%) is slightly less than the average of MAPE values for all days (5.38%) while the average of MAPE values for weekends (5.79%) is slightly more than the average of MAPE values for all days (5.38%). As a comparison, in [19], weighted nearest neighbors technique has been proposed for day-ahead electricity price forecasting and examined on the Spanish electricity market. The best average MAPE value of [19] for the working days of year 2002 is 8.45% compared with our 5.22% average MAPE value for the working days of this year. We also modeled the calendar effect in our price forecast strategy in three different ways. In the first alternative, the partitioning method of [18] is used in which the proposed method separately trains and forecasts the data of working days and weekends. In the second alternative, one calendar indicator ranging from 1 to 7, is used to encode days of week [1]. The calendar indicator is added to the selected inputs of the feature selection technique. In the third alternative, seven extra zero-one input factors indicating day of the week are added to the selected inputs. However, in all of these alternatives, the price forecast accuracy of the proposed method degrades. By partitioning working days and weekends, the available data of the proposed method for working days and weekends decreases. The information content of the calendar indicators, introduced by the second and third alternatives, are included in the load and price information. In addition to day-ahead price forecast, we executed the proposed method for week ahead (168 hours ahead) price forecast. The MAPE values for week ahead price forecast of the proposed method for the two test weeks of Table III are 7.14% and 8.98%, respectively, and for the four test weeks of Table VI are 9.15%, 8.38%, 9.12%, and 10.32, respectively. Although the week-ahead MAPE values of the proposed method are larger than its day-ahead MAPEs, they are comparable with day-ahead MAPEs of the other techniques, which indicate performance of the proposed method for one week forecast horizon. In the last examination, the proposed method is compared with the mixed model and another ARIMA model proposed by Contreras et al. [7] for two other test weeks of the Spanish electricity market. The obtained results are shown in Tables VIII and IX, where the daily mean errors of the mixed model and Contreras ARIMA model (quoted from [18] and [7], respectively) are calculated like the MAPE in (20), i.e., appears in the . For the sake of a fair comdenominator instead of parison, we used the same definition of the daily mean error for the results of the proposed method in Tables VIII and IX. These tables show that in 10 out of 14 test days, the proposed method has less daily mean error than both the mixed model and Contreras ARIMA. Besides, the proposed method has considerably lower average value of daily mean errors than the two other methods. Since the variance data of the other methods are not 315 TABLE VIII DAILY MEAN ERRORS FOR MAY 25–31, 2000 OF THE SPANISH ELECTRICITY MARKET TABLE IX DAILY MEAN ERRORS FOR AUGUST 25–31, 2000 OF THE SPANISH ELECTRICITY MARKET Fig. 6. Hourly prices (dark), price forecasts (green), and absolute value of forecast errors (red) for March 5, 2006 of the PJM electricity market. available, comparison of the variances cannot be accomplished in this case. To also give a graphical view about the forecast accuracy of the proposed method, its results for March 5, 2006 of the PJM electricity market (owning the highest MAPE value in Table III) and the fall’s test week, 2002 of the Spanish electricity market (with the highest weekly MAPE in Table VI) are shown in Figs. 6 and 7, respectively. Although the worst test periods have been purposely selected, however, the forecast curve acceptably follows the real price curve in the both figures. Only some minor deviations in the morning and evening peaks of the test day of Fig. 6 and peaks and valleys of the test week of Fig. 7 are observed. Total set-up time of the proposed method including the MI based feature selection, training of the 24 forecasters of the CNEA and fine-tuning of the adjustable parameters (with Authorized licensed use limited to: IEEE Xplore. Downloaded on March 12, 2009 at 06:58 from IEEE Xplore. Restrictions apply. 316 IEEE TRANSACTIONS ON POWER SYSTEMS, VOL. 24, NO. 1, FEBRUARY 2009 TABLE XI OBTAINED MAPE VALUES FROM THE TWO ADDITIONAL EXPERIMENTS INCLUDING HOURLY LOADS OF THE FORECAST DAY TABLE XII MAPE OF THE PROPOSED PRICE FORECAST STRATEGY WITH AND WITHOUT EA ADJUSTMENT FOR THE PJM ELECTRICITY MARKET IN YEAR 2006 Fig. 7. Hourly prices (dark), price forecasts (green), and absolute value of forecast errors (red) for the fall’s test week of the Spanish electricity market. TABLE X MAPE OF THE PROPOSED METHOD WITH CORRELATION ANALYSIS FOR THE PJM AND SPANISH ELECTRICITY MARKETS the adaptive neighborhood in the iterative search procedure) was about 40 min on a Pentium P4 3.2-GHz personal computer with 1 GB of RAM memory. This set-up time is acceptable within a day-ahead decision making framework. Correlation analysis has been recently used by some researchers for feature selection of price forecasting [6], [15], [26]. However, correlation analysis is a linear feature selection technique while MI is a nonlinear criterion. The electricity price has a nonlinear and multivariate input/output mapping function and so a nonlinear forecast method (CNEA) is proposed to predict it. Using a linear analysis technique for feature selection of a nonlinear signal and combination of it with a nonlinear forecast method is incoherent, which degrades the forecast accuracy of the whole strategy. For a better illustration of this matter, we replaced the feature selection technique of the proposed strategy with correlation analysis (the other parts of the strategy are kept unchanged) and examined it on the same test days and weeks of the PJM and Spanish electricity markets. The obtained results are shown in Table X. From this table, it can be seen that the forecast errors of the proposed strategy with the correlation analysis are higher than the forecast errors of the proposed strategy with the MI technique. In the previous experiments, only lagged values of price, load and available generation are considered as the input features of the CNEA. We perform two additional experiments to evaluate the effect of next day load values on the price forecast accuracy of the proposed method. In the first experiment, load value (actual load) of each hour of the forecast day is added to the selected inputs of the corresponding forecaster of the CNEA. In the second experiment, load of the next hour is added to the candidate inputs. Then, the processes of feature selection, training of the forecasters of the CNEA and fine-tuning of the adjustable and ) are repeated with the extended set of parameters ( the candidate inputs including 401 candidates for the Spanish electricity market. The obtained results from these two experiments for the four test weeks of the Spanish electricity market are shown in Table XI. Comparing the results of these two additional experiments with the results of the original method shown in the last column of Table VI (without including the next day load values) indicates that the MAPE values of the experiment 1 are lower than those of the original method due to considering the information content of the next day hourly loads. The MAPE values of experiment 2 are even lower than those of experiment 1. In the second experiment, the added feature (next hour load) participates in the optimization process of the feature selection. The added feature can be either selected or not for a forecast day based on its information value. Evaluation of the effect of the other noncausal features (such as the values of available generation or congestion probabilities for the forecast day) on the price forecast accuracy needs further study, which we will consider it in our future research. To evaluate the effect of the EA adjustment on the performance of the proposed price forecast strategy, its MAPE values with and without the EA adjustment for the same test days and weeks of Table III are shown in Table XII. As seen from Table XII, the EA adjustment improves the forecast accuracy of the proposed strategy in all test days and weeks with MAPE improvements ranging from 5.66% to 12.80%. Our proposed price forecast method in its original form has not been designed for price spike forecasting. However, we add a preprocessor and postprocessor before and after the proposed Authorized licensed use limited to: IEEE Xplore. Downloaded on March 12, 2009 at 06:58 from IEEE Xplore. Restrictions apply. AMJADY AND KEYNIA: DAY-AHEAD PRICE FORECASTING OF ELECTRICITY MARKETS TABLE XIII MAPE OF THE PROPOSED PRICE FORECAST METHOD IN THREE CASES FOR THE PJM ELECTRICITY MARKET IN YEAR 2006 price forecast method, respectively, to process price spikes [1]. The preprocessor converts prices as follows: if if price spikes is decreased and accordingly a slight decrease in the MAPE of all prices is also observed. If prices greater than 200 , then the $/MWh are considered as the price spikes MAPE values of the price spikes and all prices are decreased by and , , the MAPE values of the price respectively. With spikes and all prices are decreased by and , respectively. So, by adding the preprocessor and postprocessor, accuracy of the proposed price forecast method for prediction of price spikes increases, which can be considered as an additional benefit of the proposed method. (23) VI. CONCLUSION are original price and its processed value, reIn (23), and is the upper limit of the price. If a price is higher spectively. , its processed value is plus times the logathan . The postprocessor rithm (base 10) of the ratio of price to performs the inverse transform as follows: if if 317 (24) and are forecasted price and modified forecasted where price (after post-processor). We performed an extra experiment values of 200 on the PJM electricity market with two $/MWh and 150 $/MWh. The results of the proposed price forecast method in three cases are presented in Table XIII. In the first case, the proposed price forecast method in its original form (without using preprocessor and postprocessor) is examined by the data of the whole year 2006 of the PJM electricity market. The MAPE values of this case for prices greater than 200 $/MWh, for prices greater than 150 $/MWh and for all prices of year 2006 are shown in the first column of Table XIII, respectively. The MAPE values of the Table XIII are calculated based on (20). There are 25 and 73 hourly prices greater than 200 $/MWh and 150 $/MWh in year 2006 of the PJM electricity market, respectively. So, in the Table XIII, , 73 and 8760 (number of hours in year 2006) for ”, “ ”, and “All Prices”, respectively. “ In the second and third cases (shown in the second and third columns of Table XIII), the preprocessor and postprocessor are added to the original price forecast method with and , respectively. In other words, in the second and third cases, prices greater than 200 $/MWh and 150 $/MWh are considered as the price spikes, respectively. The MAPE values of the second case for prices greater than 200 $/MWh (considered as price spikes in this case) and “All Prices” are separately reported in the second column of Table XIII, respectively. Similarly, the MAPE values of the third case for prices greater than 150 $/MWh and all prices are mentioned in the third column of Table XIII, respectively. From Table XIII it can be seen that the MAPE values for price ” and “ ”) are considerably higher spikes (for both “ than the MAPE values for all prices, especially in the first case. This can be expected since a price spike is an abnormal price that is significantly higher than its expected value. However, when we add the preprocessor and postprocessor, the MAPE of Price forecast plays a major role in today’s electricity markets. Companies that trade in electricity markets make extensive use of price forecast techniques either to bid or hedge against volatility. However, price prediction has its own complexities. At the same time, utility companies usually have limited and uncertain information for price prediction. Thus, companies that trade in the electricity markets require efficient and robust price forecast methods. In this paper, a new price forecast strategy is proposed, which is composed of MI based feature selection, CNEA as the prediction part and an iterative search procedure to fine-tune the adjustable parameters. The proposed strategy has been compared with seven of the most recently published price forecast methods. These comparisons reveal the superior forecast capability of the proposed method. Further research is needed to expand the proposed method to the special case of price spike forecasting. REFERENCES [1] M. Shahidehpour, H. Yamin, and Z. Li, Market Operations in Electric Power Systems. New York, Wiley: , 2002. [2] N. Amjady, “Day-ahead price forecasting of electricity markets by a new fuzzy neural network,” IEEE Trans. Power Syst., vol. 21, no. 2, pp. 887–896, May 2006. [3] N. Amjady and M. Hemmati, “Energy price forecasting—problems and proposals for such predictions,” IEEE Power Energy Mag., vol. 4, no. 2, pp. 20–29, Mar./Apr. 2006. [4] O. B. Fosso, A. Gjelsvik, A. Haugstad, M. Birger, and I. Wangensteen, “Generation scheduling in a deregulated system. The Norwegian case,” IEEE Trans. Power Syst., vol. 14, no. 1, pp. 75–81, Feb. 1999. [5] F. J. Nogales, J. Contreras, A. J. Conejo, and R. Espinola, “Forecasting next-day electricity prices by time series models,” IEEE Trans. Power Syst., vol. 17, no. 2, pp. 342–348, May 2002. [6] H. Zareipour, C. A. Canizares, K. Bhattacharya, and J. Thomson, “Application of public-domain market information to forecast Ontario’s wholesale electricity prices,” IEEE Trans. Power Syst., vol. 21, no. 4, pp. 1707–1717, Nov. 2006. [7] J. Contreras, R. Espinola, F. J. Nogales, and A. J. Conejo, “ARIMA models to predict next-day electricity prices,” IEEE Trans. Power Syst., vol. 18, no. 3, pp. 1014–1020, Aug. 2003. [8] R. C. Garcia, J. Contreras, M. V. Akkeren, and J. B. C. Garcia, “A GARCH forecasting model to predict day-ahead electricity prices,” IEEE Trans. Power Syst., vol. 20, no. 2, pp. 867–874, May 2005. [9] L. Zhang, P. B. Luh, and K. Kasiviswanathan, “Energy clearing price prediction and confidence interval estimation with cascaded neural network,” IEEE Trans. Power Syst., vol. 18, no. 1, pp. 99–105, Feb. 2003. [10] J. J. Guo and P. B. Luh, “Improving market clearing price prediction by using a committee machine of neural networks,” IEEE Trans. Power Syst., vol. 19, no. 4, pp. 1867–1876, Nov. 2004. [11] A. M. Gonzalez, A. M. San Roque, and J. G. Gonzalez, “Modeling and forecasting electricity prices with input/output hidden Markov models,” IEEE Trans. Power Syst., vol. 20, no. 1, pp. 13–24, Feb. 2005. Authorized licensed use limited to: IEEE Xplore. Downloaded on March 12, 2009 at 06:58 from IEEE Xplore. Restrictions apply. 318 IEEE TRANSACTIONS ON POWER SYSTEMS, VOL. 24, NO. 1, FEBRUARY 2009 [12] L. Zhang and P. B. Luh, “Neural network-based market clearing price prediction and confidence interval estimation with an improved extended Kalman filter method,” IEEE Trans. Power Syst., vol. 20, no. 1, pp. 59–66, Feb. 2005. [13] C. P. Rodriguez and G. J. Anders, “Energy price forecasting in the Ontario competitive power system market,” IEEE Trans. Power Syst., vol. 19, no. 1, pp. 366–374, Feb. 2004. [14] Y. Y. Hong and C. F. Lee, “A neuro-fuzzy price forecasting approach in deregulated electricity markets,” Elect. Power Syst. Res., vol. 73, pp. 151–157, 2005. [15] G. Li, C. C. Liu, C. Mattson, and J. Lawarree, “Day-ahead electricity price forecasting in a grid environment,” IEEE Trans. Power Syst., vol. 22, no. 1, pp. 266–274, Feb. 2007. [16] A. J. Conejo, M. A. Plazas, R. Espinola, and A. B. Molina, “Day-ahead electricity price forecasting using the wavelet transform and ARIMA models,” IEEE Trans. Power Syst., vol. 20, no. 2, pp. 1035–1042, May 2005. [17] P. Mandal, T. Senjyu, N. Urasaki, T. Funabashi, and A. K. Srivastava, “A novel approach to forecast electricity price for PJM using neural network and similar days method,” IEEE Trans. Power Syst., vol. 22, no. 4, pp. 2058–2065, Nov. 2007. [18] C. Garcia-Martos, J. Rodriguez, and M. J. Sanchez, “Mixed models for short-run forecasting of electricity prices: Application for the Spanish market,” IEEE Trans. Power Syst., vol. 22, no. 2, pp. 544–551, May 2007. [19] A. T. Lora, J. M. R. Santos, A. G. Exposito, J. L. M. Ramos, and J. C. R. Santos, “Electricity market price forecasting based on weighted nearest neighbors techniques,” IEEE Trans. Power Syst., vol. 22, no. 3, pp. 1294–1301, Aug. 2007. [20] X. Liu, A. Krishnan, and A. Mondry, “An entropy-based gene selection method for cancer classification using microarray data,” Int. J. BMC Bioinformat., pp. 1–14, Mar. 2005. [21] N. Kwak and C. H. Choi, “Input feature selection for classification problems,” IEEE Trans. Neural Netw., vol. 13, no. 1, pp. 143–159, Jan. 2002. [22] N. Kwak and C. H. Choi, “Input feature selection by mutual information based on Parzen window,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 24, no. 12, pp. 1667–1671, Dec. 2002. [23] G. D. Tourassi, E. D. Frederick, M. K. Markey, and C. E. Floyd, “Application of the mutual information criterion for feature selection in computer-aided diagnosis,” Int. J. Med. Phys., vol. 28, no. 12, pp. 2394–2402, Dec. 2001. [24] N. Amjady, “Short-term bus load forecasting of power systems by a new hybrid method,” IEEE Trans. Power Syst., vol. 22, no. 1, pp. 333–341, Feb. 2007. [25] A. J. Rocha Reis and A. P. Alves da Silva, “Feature extraction via multiresolution analysis for short-term load forecasting,” IEEE Trans. Power Syst., vol. 20, no. 1, pp. 189–198, Feb. 2005. [26] H. T. Pao, “Forecasting electricity market pricing using artificial neural networks,” Energy Convers. Manage., no. 48, pp. 907–912, 2007. [27] M. T. Hagan and M. B. Mehnaj, “Training feed forward networks with Marquardt algorithm,” IEEE Trans. Neural Netw., vol. 5, no. 6, pp. 989–993, Nov. 1994. [28] G. J. Tsekouras, N. D. Hatziargyriou, and E. N. Dialynas, “An optimized adaptive neural network for annual midterm energy forecasting,” IEEE Trans. Power Syst., vol. 21, no. 1, pp. 385–391, Feb. 2006. [29] PJM Web Site. [Online]. Available: http://www.pjm.com. [30] Market Operator of the Electricity Market of Mainland Spain, OMEL. [Online]. Available: http://www.omel.es/frames/es/index.jsp. [31] J. P. S. Catalão, S. J. P. S. Mariano, V. M. F. Mendes, and L. A. F. M. Ferreira, “Short-term electricity prices forecasting in a competitive market: A neural network approach,” Elect. Power Syst. Res., vol. 77, pp. 1297–1304, Aug. 2007. Nima Amjady was born in Tehran, Iran, on February 24, 1971. He received the B.Sc., M.Sc., and Ph.D. degrees in electrical engineering from Sharif University of Technology, Tehran, in 1992, 1994, and 1997, respectively. At present, he is a Professor with the Electrical Engineering Department, Semnan University, Semnan, Iran. He is also a Consultant with the National Dispatching Department of Iran. His research interests include security assessment of power systems, reliability of power networks, load and price forecasting, artificial intelligence, and its applications to the problems of power systems. Farshid Keynia was born in Kerman, Iran. He received the B.Sc. degree in electrical engineering from S. B. University, Kerman, Iran, in 1996 and the M.Sc. degree in electrical engineering from Semnan University, Semnan, Iran, in 2001. Currently he is pursuing the Ph.D. degree in the Electrical Engineering Department of Semnan University. His research focuses on short-term and midterm price and load forecasting in deregulated electricity markets as well as feature selection and classification algorithms. Authorized licensed use limited to: IEEE Xplore. Downloaded on March 12, 2009 at 06:58 from IEEE Xplore. Restrictions apply.
© Copyright 2026 Paperzz