IEEE TRANSACTIONS ON SMART GRID, VOL. 5, NO. 3, MAY 2014 1475 Game-Theoretic Demand-Side Management With Storage Devices for the Future Smart Grid Hazem M. Soliman and Alberto Leon-Garcia Abstract—We study the demand side management (DSM) problem when customers are equipped with energy storage devices. Two games are discussed: the first is a non-cooperative one played between the residential energy consumers, while the second is a Stackelberg game played between the utility provider and the energy consumers. We introduce a new cost function applicable to the case of users selling back stored energy. The non-cooperative energy consumption game is played between users who schedule their energy use to minimize energy cost. The game is shown to have a unique Nash equilibrium, that is also the global system optimal point. In the Stackelberg game, the utility provider sets the prices to maximize its profit knowing that users will respond by minimizing their cost. We provide existence and uniqueness results for the Stackelberg equilibrium. The Stackelberg game is shown to be the general case of the minimum Peak-to-Average power ratio (PAR) problem. Two algorithms, centralized and distributed, are presented to solve the Stackelberg game. We present results that elucidate the interplay between storage capacity, energy requirements, number of users and system performance measured in total cost and peak-to-average power ratio (PAR). Index Terms—Demand-side management, distributed algorithms, game theory, PAR. I. INTRODUCTION A N interesting feature of smart grids is the possibility of a mutually beneficial relationship, a “win-win” situation, between the users and the utility. Users would like to minimize the cost they pay to the utility, whereas the utility cares not only about what the users pay, but also about when they will consume, i.e., PAR [1]–[3]. In this paper, we show that two factors help to create a win-win situation when storage devices are introduced: a smart cost function, and a suitable game. Our methodology to reach the win-win situation is through Demand-side Management (DSM) [4]. DSM can be used for different applications, such as conservation of energy, efficiency of power delivery, fuel substitution, and residential or commercial load management [5], [6]. Residential load management programs try to reduce home power consumption, as well as shift it temporarily for better utilization and cost. Since utility providers are challenged by high peak hour usage, smart time-varying pricing in residential load management programs helps shift usage away from peak hours to less congested times. Manuscript received May 06, 2013; revised August 21, 2013 and November 29, 2013; accepted January 19, 2014. Date of publication April 02, 2014; date of current version April 17, 2014. Paper no. TSG-00356-2013. The authors are with the School of Electrical and Computer Engineering, University of Toronto (e-mail: [email protected]; [email protected]). Color versions of one or more of the figures in this paper are available online at http://ieeexplore.ieee.org. Digital Object Identifier 10.1109/TSG.2014.2302245 Game theory is seen as a useful mathematical tool to handle DSM problems. In particular, game theory can serve the goals and characteristics of the smart grid as discussed in [2], [3]. The ability of game theory to capture the competition between users, study the possible outcomes and resulting equilibrium as well as their stability are reasons for the wide use of game theory in the smart grid literature. Mohsenian et al. in [4] have proposed a distributed game-theoretic algorithm for scheduling energy usage optimized to minimize energy cost. In [7], a Stackelberg game is introduced to optimize the energy retailers’ pricing decisions. The case when users are equipped with storage devices was studied in [8] using welfare theory to find optimal prices. In [9] another formulation was proposed employing a strategy-proof double auction. However, the papers studying energy storage have not considered the scheduling or PAR problems, nor have they tackled the choice of an appropriate cost function that can meet the requirements of the system. We extend the literature in two directions: 1) we provide a generalized treatment of storage by introducing a novel cost function that can model the requirements when storage is present. Our cost function can be viewed as a generalization of the more common quadratic and linear functions; 2) we provide a general framework for the interaction between users and the utility through a Stackelberg game, where the utility decides its prices taking into consideration the reaction of the customers, and where the users schedule their energy in order to minimize their cost based on the prices given by the utility. We provide distributed algorithms for both interactions, and prove their convergence. We also establish strategy-proof properties, as well as the relation between our framework and the minimum peak-to-average power ratio (PAR) problem. Finally, we present an extensive set of simulation results covering a range of possible scenarios. We find that improved collaboration between the utility and users results in lower user cost and lower system PAR. We also explore the impact of number of users and total amount of storage capacity on system performance. We discover that (under the assumptions of our study) the system performance depends solely on the total storage capacity and not on its allocation among users. This suggests the possibility of establishing a single user-owned storage site that acts on behalf of all users. The paper is organized as follows. We introduce the system model in Section II. The energy cost minimization problem and the energy cost minimization game are discussed in Section III. In Section IV we present the Stackelberg game and two algorithms for solving it. Simulation results are discussed in Section V. Conclusions are presented in Section VI. 1949-3053 © 2014 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information. 1476 IEEE TRANSACTIONS ON SMART GRID, VOL. 5, NO. 3, MAY 2014 II. SYSTEM MODEL A. Power Consumption Model In the smart grid, each user is equipped with a smart meter that can interact with smart appliances and control them [10]. Hence, one of the roles of a smart meter is to serve as an energy consumption scheduler (ECS) that uses price information to schedule the energy consumption of its user. Let denote the set of users. The total number of users is . Let denote the total load of user at time . The total daily schedule for each user is then , and the total load of all users during a time is [4] (1) B. Energy Cost Model Cost functions are pricing tariffs used by utility companies to determine the price at which they sell energy to their customers. Cost functions can be used by utility companies as incentives for users to follow a specific consumption behavior. The utility obtains energy from multiple sources and sells it to the users. The utility first uses energy from cheaper sources like hydro-electric generators before moving to more expensive fuel-based generators during peak-hours [11]–[13]. The cost function determines the prices users have to pay, and the income to the utility. Since we use a game-theoretic framework, a smart cost function is needed in order to minimize the impact of selfish behavior. Let be the cost that users have to pay to the utility for an amount of energy during time . We based our choice of the cost function on several requirements that govern the operation of demand side management. First, the utility provider is responsible for satisfying all the needs of all users, hence the cost function is a function of the total consumption by all users during some time . Moreover, the cost function can vary from time to time, i.e., higher cost during peak times because of the more expensive energy used, and changes in prices charged by the energy providers. Other assumptions for cost functions are: 1) Cost function is an increasing function of demand, i.e., more energy should entail higher cost [4]. (2) 2) Cost function is convex, that is the increase in the price is also increasing. Moreover, we assume that the cost function is strictly convex [4]. (3) . When 3) When the user sells energy back, we have this happens the user is paid for this energy, that is, the user cost function is since the user pays a negative amount to the utility. Thus we have the requirement that, for , . The quadratic cost function, clearly does not satisfy this condition. Alternatively, we could try which does satisfy the negativity condition. However this choice of cost function is not convex. We note that the linear cost function is increasing and convex, but not strictly convex, so it is not guaranteed to have a unique Nash equilibrium. 4) Another valid condition for the cost function is that, at any given time h, the utility always makes profit, i.e., the price at which it buys is always less than the price at which it sells: (4) This condition also prevents excessive buying and selling by the users. 1) Cost Function Rationale: The above assumptions disqualify the quadratic or linear cost functions, so we had to propose a new cost function. The function we chose is a variant of the widely used logarithmic barrier functions, used as a penalty function in interior point methods [14]: (5) where is a pricing coefficient, determined by the utility to give higher prices during peak-hours, is the total load, is a parameter that we introduce to give cost values very close to the values given by a quadratic one, it also serves as the maximum typical value for . The relation between the proposed cost function we use and the quadratic one can be understood from its Taylor expansion. Since , the Taylor series expansion is (6) So we can see that the logarithmic cost function is essentially a sum of linear and quadratic terms, where the parameter is used to govern the the weight of the linear and quadratic terms. Hence, the proposed cost function can be viewed as a modification of the quadratic function to satisfy all the conditions we have imposed. Fig. 1 compares the quadratic function and our logarithmic cost function. This cost function is monotonically increasing and strictly convex, see Appendix A. C. Residential Load Control In the absence of scheduling and storage, each user will use appliances at their preferred times. The absence of coordination among users leads to busy hours of high consumption, resulting in a high PAR [4]. When scheduling is used to optimize energy consumption, users avoid the peak hours to avoid the associated high prices to the extent possible [4]. However, the users have limited flexibility in the scheduling of their appliances and hence they have limited freedom in avoiding the peak hours. A phenomenon that is often associated with selfish optimized scheduling is the appearance of “redundant” peaks [15]. The aggregate behavior of users individually avoiding peak periods results in new “redundant” peaks during the lower price periods [15]. The introduction of energy storage gives users SOLIMAN AND LEON-GARCIA: GAME-THEORETIC DEMAND-SIDE MANAGEMENT WITH STORAGE DEVICES 1477 As in [4], the user is assumed to have a set of shiftable and non-shiftable appliances. Let be the predetermined amount of energy required by appliance belonging to user . A user may require this amount to be provided for the appliance during a certain interval, beginning at and ending at . Then for each user and each appliance , the following conditions must hold (11) where . Each appliance also has a minimum and a maximum power level that it can operate on while working in its scheduled time. Let denote the minimum power level and denote the maximum power level for appliance , then (12) Each storage device has upper and lower limits on the amount of power it can store. Let and be the minimum and maximum charging levels of the storage device belonging to user . The following condition must hold (13) Fig. 1. Proposed logarithmic cost function as compared with the quadratic cost function. greater freedom in scheduling, so they can buy extra energy ahead of time, and consume it later during peak times or even sell it back to the grid. Let be the set of all appliances belonging to user . Each appliance can get power either by buying power directly from the grid (external power), or by taking it from its own stored storage device (internal power). The external power scheduling vector for each appliance is: where the middle term represents the net inward power for user storage device, during all the time intervals up to time . III. PROBLEM FORMULATION A. Energy Cost Minimization Our goal is to find the optimal scheduling of all appliances for all users that minimizes the total energy cost: (7) where is the amount of power scheduled for appliance belonging to user during time , to be bought for this appliance from the grid. Similarly, the internal power scheduling vector for each appliance is denoted by (8) where is the amount of power scheduled for appliance belonging to user during time , to be taken by this appliance from the user storage device. The power consumption schedule for storage for user is: (9) where is the amount of power scheduled by user during time , to be bought and stored. The total load of user during time is then (10) (14) where , is the optimization variable formed by concatenating all the variables into a single vector variable, is a vector specifying all energy being bought from the utility provider during time . and are the lower and upper bounds formed by concatenating all the lower and upper bounds in (12). is a matrix giving the battery net inward flux in (13), and is the battery charging limit vector. is a matrix capturing energy provided for each appliance in (11), and is the vector of the required energy amount for each appliance. The requirements we have imposed on the cost function imply that (14) is a strictly convex optimization problem with a unique optimum solution [14]. B. Energy Consumption Game We use game theory to capture the competition between users. We assume users are selfish and interested only in 1478 IEEE TRANSACTIONS ON SMART GRID, VOL. 5, NO. 3, MAY 2014 optimizing their cost, and that they might deviate from a centralized solution if they find a better schedule with less prices. Moreover, we suppose that users wish to preserve their privacy and are willing to share some information if it leads to lower cost. Game theory is able to capture the competitive behavior of the users and the stability of equilibrium solutions, which is why we chose it as our optimization tool next. The energy consumption game is defined as follows: Energy Consumption Game: • Players: All users in . • Strategies: Each user selects its strategy by scheduling appliance to minimize its own cost. • Payoffs: for user , where The parameters are similar to the ones in (14) but they are now specific to user . Problem (18) is still strictly convex and admits a unique optimal solution. The algorithm basically allows each player to play his best response strategy, announce only his total consumption, and repeat until equilibrium is reached. The distributed algorithm is explained in detail in Algorithm 1. The algorithm progresses by allowing each player to solve his optimization problem, i.e., play his best response strategy. Each step will result in the total cost decreasing or remaining the same. Since the objective function is bounded from below, the iterations will eventually converge to a fixed point, i.e., a NE point. Once users reach this point, they will have no preference to change since the NE is unique, and convergence is achieved. (15) Algorithm 1 Executed by each user where Randomly Initialize other users loads (16) repeat Solve Problem (18). represents the proportion of energy consumed by user , relative to the total energy consumed by all users. This is a proportional price sharing mechanism that divides the total cost among users according to their consumption proportion. This price sharing mechanism has been used in the literature, for example [4]. denotes the consumption schedules for all users except user . Nash Equilibrium (NE) Consider a game played between a set of players. For each player , let denote his strategy space. A set of strategies constitute a NE if (17) Theorem 3.1: The Nash equilibrium of the Energy Consumption Game exists and is unique. See Appendix A for proof. Theorem 3.2: The Nash equilibrium of the Energy Consumption Game is also the global system optimal solution of the energy cost minimization problem (14). See Appendix B for proof. The above game is a non-cooperative game. The structure of the objective function and the alignment of the NE with the system optimal point indicates that the users’ strategies are strategic complements, not substitutes, i.e., they mutually reinforce each other [16]. if changes compared to previous scheduling then Update to the new solution. Broadcast the new total load to all other users. end if if A new update is received then Update end if until No new announcements are received. D. Strategy-Proof Property be the NE of the Energy Consumption Game Let when only a subset of users are truthful. Also let denote the NE when all players are truthful, which is also the system optimal point from Theorem 3.2. Then for any cheating user (19) By dividing both sides by , we get C. Distributed Algorithm for the Energy Consumption Game Since is independent of , it can be dropped from the optimization. This is a significant simplification and the problem to be solved by each user becomes: (18) (20) which contradicts Theorem 3.2. Hence, no user has an incentive to deviate because the total cost is divided among users in proportion to their consumption. Since the NE point is also the unique system optimal solution from Theorems 3.1 and 3.2, any deviation by any user will result in increased cost not only for the other users, but for the user himself. SOLIMAN AND LEON-GARCIA: GAME-THEORETIC DEMAND-SIDE MANAGEMENT WITH STORAGE DEVICES 1479 players, and the objective function is continuous in both variables, the Stackelberg equilibrium is guaranteed to exist from [17] Proposition 3.1. IV. PRICE CONTROL A. Pricing Problem Thus far, we assumed that the utility provider decides pricing apriori based on typical peak hours. Simulation results will show that even though the PAR decreases as some users get their storage devices, after a certain number of storage devices the PAR will start to increase again. The reason is that users equipped with storage devices will tend to buy extra energy before peak hours and sell it back during the peak hours to minimize their cost. Once too many users do this, the former peak consumption is shifted, even though with a less value than before, to a new hour. The new effect that also appears when users can sell back is that they tend to sell mostly during the high price hours, i.e., former peak hours. We call this phenomena “Reverse Peaks,” since the shift of peak consumption is accompanied by a high tendency of users to sell their stored energy back during high price hours. This undesirable effect can be resolved by allowing the utility provider to become a player in the game. When the users shift their consumption to a newly formed peak hour, the utility will respond by adjusting the price again. Repeated play will converge to an equilibrium where users have no interest in shifting their loads and the utility has no interest in changing prices. This interaction can be modeled as a Stackelberg game. Stackelberg equilibrium is what we called a win-win situation, whereby the users minimize their cost, while the utility maximizes its profit, and more importantly, indirectly manages to decrease the PAR, as will be shown later. We assume the utility selects the price from a set of possible prices. We also assume the users’ energy needs are will always be satisfied. Therefore, the utility’s problem involves selecting its best pricing option out of a reasonable set of possible prices. A situation might happen when the set of possible prices is selected in a manner that could lead to arbitrarily high prices. We note that this is also found in the real world when a utility operates as a monopoly. In this case, regulatory mechanisms are activated to counterbalance the tendency to increase prices. Such mechanisms are outside the scope of this paper. C. Equivalence Between Stackelberg Game and Minimum PAR Optimization We note that any cost schedule can be normalized so that the sum of the terms in the vector is 1. Here we show that when the set is defined as a simplex , the solution of (21) also minimizes PAR. To show this consider (22) Our objective function is convex in , and concave (linear) in . For the class of convex-concave functions, the maximum and minimum operations can be exchanged [14], so the problem becomes (23) and the solution is a saddle-point. Consider (23), when is a simplex, the maximization over is solved by letting for the the largest term in the summation, so (23) reduces to (24) Note that (23) is continuous minimax, while (24) is discrete minimax. Since is a monotonic increasing function in its operand , then the function is maximized by maximizing its argument. Hence the solution of (24) is also the solution of (25) is The average energy consumption variable-independent and can be inserted into the optimization to obtain the minimum PAR problem [4]: (26) B. Stackelberg Game A Stackelberg game is a sequential game played between a Leader and a set of Followers. In our game, the utility plays the role of the leader, who sets the prices first, while the users are followers who optimize their usage to minimize the cost they pay [2]. Using Backward Induction, the problem the utility provider needs to solve to find the optimal prices is formulated as follows: (21) is selected from where the cost schedule a compact set . Since the constraint sets are compact for both D. Solving the Stackelberg Problem 1) Interior Point Solution for Minimax: We can use the interior-point algorithm developed in [18] to solve the Stackelberg min-max optimization problem. We don’t provide the algorithm details here due to space limitations. 2) Iterative Entropic Regularization: We propose an algorithm, based on [19], that transforms the continuous min-max problem into a sequence of finite min-max problems using entropic regularization to smooth each finite problem.1 The algorithm involves two stage optimization problems. In the first stage, the users receive the pricing coefficients as determined by the utility, and schedule their energy to minimize their cost. The utility uses the players decisions to find the pricing 1Note that here we remove the negative sign in the objective function and solve it as a min-max problem instead of a max-min problem. 1480 IEEE TRANSACTIONS ON SMART GRID, VOL. 5, NO. 3, MAY 2014 coefficients that maximize its own profit. The utility also needs to take into account the reaction of players to these new pricing coefficients. The utility uses an Iterative Entropic algorithm that approximates the continuous set of players strategies by a discrete set. The main advantage of this algorithm is its distributed nature. The inner optimization problem is the original energy cost minimization in (14). This problem, as shown before, can be solved in a distributed manner using Algorithm 1. The total consumption can then be sent to the provider which is responsible for solving the outer problem, i.e., determine his optimal pricing, thereby preserving the privacy of the users. First, define as then use Algorithm 1 to find the optimum energy schedule corresponding to the current set of prices. In step (4), the set is updated by the obtained energy schedules. Repeat the steps until the convergence criteria as explained in [19] is satisfied. Algorithm 2 Iterative Entropic Algorithm Choose initial and let , Set Choose repeat (1) Find such that (27) (34) where is the optimization set defined by the constraints in (14). We approximate by a finite subset of points in . Define as an approximation for given by: (2) if is then (3) Use Algorithm 1 to find (28) The entropic smoothed version of . (4) Set where (5) (6) Set (29) end if if and then (7) (30) end if until and is some regularization constant. Then, in accordance with the explanation given above, the inner optimization problem to be solved by the users is (31) while the outer optimization to be solved by the utility is and More details about the operation of the algorithm and its convergence proof can be found in [19]. V. SIMULATION RESULTS A. Simulation Scenarios and System Parameters (32) where in the outer minimization problem, we minimize instead of and . The algorithm uses the following terms: (33) The algorithm is summarized in Algorithm 2. First, start with some random energy schedules. At each iteration, the utility finds the optimum set of prices by solving (34). This problem involves minimizing the entropy-smoothed version of up to some accuracy . In step (3), if the difference between and its entropy-smoothed version is below a certain threshold, The system has 5 important parameters: • : number of users equipped with storage devices. • : Each user with a storage device is given a random storage capacity at the beginning. This capacity is then changed by dividing by to study the effect of decreasing system storage capacity. • : This is first time in a day at which users start to make their energy requests. This parameter is important in differentiating the no storage case from the storage one. The time before typically favors storage because it has low prices. • Storage-to-Energy Ratio: the ratio of the total storage available to the total energy requirements. This parameter allows us to study when storage capacity in the system will saturate, as well as to determine whether the system has SOLIMAN AND LEON-GARCIA: GAME-THEORETIC DEMAND-SIDE MANAGEMENT WITH STORAGE DEVICES 1481 an optimum storage capacity that achieves the minimum PAR. We are interested in two performance metrics, total cost and peak-to-average power ratio (PAR). We will study three different cases. First, when there is no storage available in the grid and users can only schedule their use. This is the case studied in [4] using a quadratic cost function. Second, when the users have storage capability and can sell energy back to the grid. Third, when the users have storage capability, but can store only for their own use and can’t sell back. Each storage-equipped user is given a random initial storage capacity which is then varied by varying . We divide the day into 24 periods, 1 hour each. Analysis of cyclic behavior through a duration of multiple days is left for future work. B. The Energy Consumption Game The main points we want to emphasize for the Energy Consumption Game are: • Storage always helps; it outperforms no-storage scheduling in cost and PAR in all cases we have studied. • The no-selling scenario results in lower PAR than the selling scenario. • In the no-selling scenario, there is an optimum number of users equipped with storage that produces the lowest PAR. • In the selling scenario, the storage is shared virtually among all users, i.e., the performance depends on the total storage capacity and not on how it is distributed among users. • Reverse peaks appear, which encourages the shift to Stackelberg games. 1) & : Figs. 2 and 3 shows the system performance versus the number of users equipped with storage for different storage capacities . Note that corresponds to the first (no-storage) scenario. We plot the results when for the selling and no selling scenarios, and when we decrease the storage capacity by a factor of 4. When the users are equipped with storage devices, they have more degrees of freedom in scheduling their appliances, allowing them to decrease their cost. We can see in Fig. 2 that the selling and no selling scenario start close to each other, but soon diverge, implying that the users use selling to decrease their cost. The gap is greater for increasing storage as more degrees of freedom are given to the users. Hence, depriving the users from the option of selling back generally results in higher cost. Fig. 3 shows that the PAR either remains the same or is slightly decreased for the selling scenario. This is due to “reverse” peaks, where users who have storage buy extra energy ahead of time, and sell it back during peak times. Thus, the old peak hours become new peak hours for selling rather than buying. A solution to this problem is to prevent users from selling back (the third scenario we are studying). We observe a larger decrease in PAR for the no-selling scenario. We note that in the selling scenario, each additional user obtaining storage is akin to increasing the total storage capacity that is shared among all users. This is apparent when we compare the two points with & and Fig. 2. Total cost versus Fig. 3. PAR versus for different for different . . & which have the same total storage (2) and are almost the same in total cost (66). However, this not the case in the no-selling scenario. Here users use their storage solely for their own favor, so the point of & has higher total cost and much higher PAR than the point of & , reflecting that users with storage buy less during cheap times, and the other users have less degrees of freedom. Hence we conclude that only the total storage matters when users can sell back. This finding can greatly simplify the problem, as the solution will be the same as when only one user has storage, resulting in significantly fewer variables. We conclude based on our simulations that the storage is virtually shared among users 1482 IEEE TRANSACTIONS ON SMART GRID, VOL. 5, NO. 3, MAY 2014 Fig. 4. Comparison of the total cost for the selling and no selling scenarios when the total amount of storage is a constant. Fig. 5. Comparison of the PAR for the selling and no selling scenarios when the total amount of storage is a constant. when they can sell back, and recommend the use of efficient optimization techniques to make use of this feature. This centralized optimization can be justified from the results of game theoretic competition provided in this paper. We may consider the application of optimization techniques as part of future work. Two parameters control the value of the cost function, the pricing coefficient determined by the utility, and the users’ consumption. The figures show that initially, only a few users have storage and are able to shift their consumption to less expensive times, making use of lower . As more users get storage, they will buy more at these less expensive times. Eventually, the increase in cost due to users’ consumption will make up for the decrease in cost due to , leading to the saturation apparent in the curves. Faster saturation occurs when users have the option to sell due to the “virtual” sharing of storage explained above in the selling case. We can also see that in the no selling case, PAR reaches a minimum before increasing again. This is due to the inability of users to share their storage in the case of no selling. When almost half the users get storage, the PAR is minimum. After that, the users shift their consumption too much leading to an increase in PAR. 2) Energy Requirements vs. Storage Capacity: We now consider the relationship between the total storage capacity in the system and the aggregate energy demand. Obviously, these are the two most important parameters in the system and their relationship leads to a better understanding of system performance as well as savings by avoiding excess storage. In Figs. 4 and 5, we assume the storage to energy ratio is fixed for different number of users, so the combined storage capacity of 2 users is the same as the combined capacity of 10 users. We consider storage to energy ratios between 10% to 100%. In Fig. 4, we see that for the selling scenario (solid lines), the total cost is the same for any number of users, confirming the observation that storage is shared between users. However, in the no-selling scenario, the cost decreases monotonically as more users get storage. In Fig. 5, we see that for the selling scenario, the PAR is constant for any number of users. However, for the no selling scenario, there is an optimum number of users with storage, after which the PAR will increase again. This optimum number varies between 2 when storage capacity is low, to 5 when storage capacity is high. 3) Daily Consumption Profile: In Fig. 6, it can be seen that users in all cases try to avoid the peak hours as much as they can. Storage allows freedom to avoid buying at peak expensive hours. In the selling scenario, they are actually making use of these peak prices to sell energy back to make profit. They achieve lower cost but at the expense of higher PAR. We can see from Fig. 6 that in the selling scenario, the extra capacity is virtually for the whole grid. However, in the no-selling scenario, users keep their energy for themselves. This can be seen in the greater similarity between the curves in the selling scenario (solid lines) than in the no-selling one. C. Stackelberg Game 1) & : In Figs. 7 and 8, we study the effect of increasing the number of users equipped with storage. We can see that the equilibrium in Stackelberg achieves almost uniform pricing and consumption, resulting in very low PAR, except for the case when storage is very low. When prices depend on the user interaction as in a Stackelberg case, the effect of adding extra storage to the system, whether by giving storage to more users or increasing the storage capacity, diminishes quickly after some threshold. At this threshold, prices become almost uniform and users don’t tend to store much energy, resulting in PAR very close to the minimum value of 1. We can also see the matching between the no selling and selling scenarios. This is because whenever storage is present, users use it to decrease SOLIMAN AND LEON-GARCIA: GAME-THEORETIC DEMAND-SIDE MANAGEMENT WITH STORAGE DEVICES Fig. 6. Comparison of different consumption profiles with Fig. 7. Total cost versus . Fig. 8. PAR versus . . using Stackelberg strategy for different using Stackelberg strategy for different 1483 , with , with their cost, but the utility responds by changing its prices, until an equilibrium is reached. 2) Energy Requirements vs. Storage Capacity: In Figs. 9 and 10, we plot the total cost and PAR versus the storage over energy ratio in the Stackelberg framework. We can see that the system reaches the uniform pricing equilibrium after a threshold around 20%. At this point the system has converged to a uniform pricing and the PAR has become very low. A similar simulation was performed without employing the Stackelberg game, and it was found that the total cost of the no selling scenario when the Stackelberg game is employed is 70, Fig. 9. The total cost for the selling and no selling scenarios and its relation with the storage energy ratio using Stackelberg. and when the Stackelberg game is not employed is 61, while the PAR when the Stackelberg game is employed is 1.03, and when the Stackelberg game is not employed is 1.6. For the selling scenario, the total cost when the Stackelberg game is employed is 70, and 50 when the Stackelberg game is not employed, while the PAR when the Stackelberg game is employed is 1.03, and when the Stackelberg game is not employed is 2.1. These results show that the Stackelberg game is highly effective in reducing PAR and they also show how the total cost that results after the game reaches equilibrium is somewhat higher than the cost 1484 IEEE TRANSACTIONS ON SMART GRID, VOL. 5, NO. 3, MAY 2014 Fig. 10. The PAR for the selling and no selling scenarios and its relation with the storage energy ratio using Stackelberg. Fig. 11. Energy consumption profile with strategy. , , using minimax achieved by the Energy Consumption Game (at much higher PAR). 3) Daily Consumption Profile: In Fig. 11 we plot the consumption profile and the optimized prices when the Stackelberg strategy is employed. When only one user has storage, the storage capacity is too low to have uniform pricing. Optimal prices are low at the beginning of the day, before any user has any devices operational, and increases after that. During these cheap times, the user buys energy as much as it can. Later when prices are higher, the other users come into play. Once 2 or more users get storage, the system reaches equilibrium at almost uniform pricing, resulting in almost uniform consumption and very low PAR. 4) Convergence: In Fig. 12 we plot the convergence of the outer iteration, when the utility receives the consumption of each user and determines its optimal prices using Algorithm 2. The algorithm converges in about 25 iterations. VI. CONCLUSIONS We have presented a game-theoretic approach to analyze the interaction between the different users and the utility in the smart grid in the presence of storage. A continuous non-cooperative game was used to model the interaction between the different users and a Stackelberg game was used to model their interaction with the utility. For the first game, we have shown the difference between the selling and no-selling scenarios in terms of cost, PAR and effect of storage. Simulation results show the decrease in total cost and PAR when storage is present. They show that depriving the users from the selling option is beneficial in terms of PAR. They also indicate that sometimes it is better to give storage only to a subset of users, not all of them. Moreover, they show how selling option makes extra storage of benefit to all users, which is not the case with the no-selling scenario. The reverse peak phenomenon motivates a Stackel- Fig. 12. Convergence of outer loop iterations for the iterative entropic algorithm. berg game. We provided two algorithms for solving the Stackelberg game and showed its equivalence with the minimum PAR problem. Simulation results showed the very significant reduction in terms of PAR in the case of Stackelberg game. APPENDIX A. Proof of Theorem 3.1 in (5) as well as the local The proposed cost function in (18) is a composition of an affine function one SOLIMAN AND LEON-GARCIA: GAME-THEORETIC DEMAND-SIDE MANAGEMENT WITH STORAGE DEVICES with a strictly convex func. From [14], this composition produces a strictly tion convex function. From Theorem 1 in [20], the Nash equilibrium of an game exists if it is a concave game defined over a convex compact set. The set specified by the optimization constraints specifies a closed, bounded, and convex set [14]. Since our game is strictly concave, cost is strictly convex, defined over a compact and convex set, then Nash equilibrium exists. Moreover, from Theorem 6 in [20], uniqueness is guaranteed if , where is the Jacobian matrix of the cost functions, i.e., diagonally strictly convex (DSC). In our case, all users are optimizing the same global cost function. It follows then is actually the Hessian of the objective func. In other words, DSC reduces to strict contion vexity when the objective function is the same for all users. We have just proven that the objective cost function is strictly convex, hence the payoff function is strictly concave, then its Hessian is negative and the Nash equilibrium is unique. B. Proof of Theorem 3.2 Let be the optimal solution of (14). Then 1485 [10] P. McDaniel and S. W. Smith, “Security and privacy challenges in the smart grid,” IEEE Security Privacy, vol. 7, no. 3, pp. 75–77, May–Jun. 2009. [11] G. M. Masters, Renewable and Efficient Electric Power Systems. Hoboken, NJ, USA: Wiley-IEEE Press, Jul. 2004. [12] OECD/IEA, “The power to choose: Demand response in liberalised electricity markets,” International Energy Agency, Organisation for Economic Co-operation and Development, 2003. [13] K. Herter, “Residential implementation of critical-peak pricing of electricity,” Energy Policy, vol. 35, no. 4, pp. 2121–2130, April 2007. [14] S. Boyd and L. Vandenberghe, Convex Optimization. Cambridge, U.K.: Cambridge Univ. Press, 2004. [15] S. Kishore and L. Snyder, “Control mechanisms for residential electricity demand in smart-grids,” in Proc. IEEE Int. Conf. Smart Grid Commun. (SmartGridComm), Gaithersburg, MD, USA, Oct. 2010. [16] J. Bulow, J. Geanakoplos, and P. Klemperer, “Multimarket oligopoly: Strategic substitutes and strategic complements,” J. Political Econ., vol. 93, no. 3, pp. 488–511, Jun. 1993. [17] M. Simaan and J. B. Cruz, “On the stackelberg strategy in nonzero-sum games,” J. Optim. Theory Appl., vol. 11, no. 5, pp. 533–555, May 1973. [18] S. Zakovic, C. Pantelides, and B. Rustem, “An interior point algorithm for computing saddle points of constrained continuous minimax,” Ann. Oper. Res., vol. 99, no. 1–4, pp. 59–77, Dec. 2000. [19] R. L. Sheu and J. Y. Lin, “Solving continuous min-max problems by an iterative entropic regularization method,” J. Optim. Theory Appl., vol. 121, no. 3, pp. 597–612, Jun. 2004. [20] J. B. Rosen, “Existence and uniqueness of equilibrium points for concave n-person games,” Econometrica, vol. 33, pp. 347–351, Jul. 1965. (35) Multiplying both sides by Hazem M. Soliman (S’08) was born in Egypt in 1987. He received his B.Sc. and M.Sc. in electronics and electrical communications engineering with first class honors from Cairo University, Egypt, in 2009 and 2011, respectively. In 2011, he joined the Department of Electrical and Computer Engineering at the University of Toronto, Toronto, ON, Canada, where he is currently pursuing his Ph.D. degree in communication networks. His current research interests include multi-cell MIMO, smart grids, game theory, virtual networks, and future networks , we get (36) which is the definition of Nash Equilibrium [2]. Hence the globally optimum solution is also the Nash equilibrium point. REFERENCES [1] S. M. Amin and B. F. Wollenberg, “Toward a smart grid: Power delivery for the 21st century,” IEEE Power Energy Mag., vol. 3, no. 5, pp. 34–41, Oct. 2005. [2] T. Basar and G. J. Olsder, Dynamic Noncooperative Game Theory (SIAM Series in Classics in Applied Mathematics). Philadelphia, PA, USA: Society for Industrial and Applied Mathematics (SIAM), Jan. 1999. [3] W. Saad, Z. Han, H. V. Poor, and T. Basar, “Game-theoretic methods for the smart grid: An overview of microgrid systems, demand-side management, and smart grid communications,” IEEE Signal Process. Mag., vol. 29, no. 5, pp. 86–105, Sep. 2012. [4] A.-H. Mohsenian-Rad, V. W. S. Wong, J. Jatskevich, R. Schober, and A. Leon-Garcia, “Autonomous demand-side management based on game-theoretic energy consumption scheduling for the future smart grid,” IEEE Trans. Smart Grid, vol. 1, no. 3, pp. 320–331, Dec. 2010. [5] B. Ramanathan and V. Vittal, “A framework for evaluation of advanced direct load control with minimum disruption,” IEEE Trans. Power Syst., vol. 23, no. 4, pp. 1681–1688, Nov. 2008. [6] M. A. A. Pedrasa, T. D. Spooner, and I. F. MacGill, “Scheduling of demand side resources using binary particle swarm optimization,” IEEE Trans. Power Syst., vol. 24, no. 3, pp. 1173–1181, Aug. 2009. [7] S. Bu, F. R. Yu, and P. X. Liu, “A game-theoretical decision-making scheme for electricity retailers in the smart grid with demand-side management,” in Proc. IEEE SmartGridComm, Brussels, Belgium, Oct. 2011. [8] N. Li, L. Chen, and S. H. Low, “Optimal demand response based on utility maximization in power networks,” in IEEE Power Energy Soc. Gen. Meet., San Diego, CA, Jul. 2011. [9] W. Saad, Z. Han, H. V. Poor, and T. Basar, “A noncooperative game for double auction-based energy trading between PHEV’s and distribution grids,” in Proc. IEEE SmartGridComm, Brussels, Belgium, Oct. 2011. architectures. Alberto Leon-Garcia (S’74–M’77–SM’97–F’99) received the B.S., M.S., and Ph.D. degrees in electrical engineering from the University of Southern California, Los Angeles, CA, USA, in 1973, 1974, and 1976, respectively. He was founder and CTO of AcceLight Networks, Ottawa, ON, Canada, from 1999 to 2002, which developed an all-optical fabric multiterabit, multiservice core switch. He is currently a Professor in Electrical and Computer Engineering at the University of Toronto, ON, Canada. He holds a Canada Research Chair in Autonomic Service Architecture. He holds several patents and has published extensively in the areas of switch architecture and traffic management. His research team is currently developing a network testbed that will enable at-scale experimentation in new network protocols and distributed applications. He is recognized as an innovator in networking education. In 1986, he led the development of the University of Toronto-Northern Telecom Network Engineering Program. He has also led in 1997 the development of the Master of Engineering in Telecommunications program, and the communications and networking options in the undergraduate computer engineering program. He is the author of the leading textbooks Probability and Random Processes for Electrical Engineering and Communication Networks: Fundamental Concepts and Key Architecture. His current research interests include application-oriented networking and autonomic resources management with a focus on enabling pervasive smart infrastructure. Prof. Leon-Garcia is a Fellow of the Engineering Institute of Canada. He received the 2006 Thomas Eadie Medal from the Royal Society of Canada and the 2010 IEEE Canada A. G. L. McNaughton Gold Medal for his contributions to the area of communications.
© Copyright 2026 Paperzz