Impacts of suppliers` learning behaviour on market equilibrium

Impacts of suppliers’ learning behaviour on market
equilibrium under repeated linear supply-function
bidding
Y.F. Liu, Y.X. Ni and F.F. Wu
Abstract: The paper studies the impacts of learning behaviour of electrical-power suppliers on
electricity-spot-market equilibrium under repeated linear supply-function bidding. In the markets,
the supplier will conduct ‘learning’ to improve his strategic bidding in order to obtain greater profit.
Therefore, it is significant to explore the impacts of such learning behaviour on market equilibria
and market-clearing price (MCP). First the mathematical model for supplier’s optimal bidding is
established. This is then used to solve for market equilibrium. It is shown that supply-function
equilibrium is a Nash equilibrium; and that under certain conditions the overall learning behaviour
will reduce the MCP, which in turn increases consumers’ surplus and decreases suppliers’ profits,
while in some other conditions the results are just the contrary. In either case, the MCP at
equilibrium induced by the overall learning behaviour reflects the true relationship of supply and
demand. Numerical results support the analytical conclusions very well.
1
Introduction
In recent years, competition has been introduced to the
power industry in order to increase social welfare and
improve market efficiency. A variety of restructuring modes
have been proposed in different countries. Among these
modes, the market structure of the power pool (‘poolco’
type) is the most popular one. In poolco, electricity spot
markets, the suppliers (gencos) bid for the next-day hourly
price and generation; then the independent system operator
(ISO) creates the next-day hourly generation schedules to
meet the demand at a minimum cost, and establishes the
corresponding market-clearing price (MCP) for each hour
[1]. Theoretically, in a perfectly competitive market,
suppliers will bid their true marginal production costs to
maximise their profits. However, it is well known that the
power industry is more akin to an oligopoly, where
suppliers can influence the market-clearing price by their
strategic bidding [2–4]. Therefore, each supplier faces the
problem of how to improve its strategic bidding to
maximise its own profit based on its production cost, its
expectation about rivals’ behaviour, and some other
available market information.
Numerous papers [5–17] have addressed the strategicbidding issue in competitive power markets. An optimal
bidding strategy to maximise a bidder’s profit in the
England & Wales power market is presented in [7] under the
assumption of perfect competition. In [8], the optimal
bidding strategy is derived as a function of one’s own cost
and the rivals’ cost distributions. In [9–12], game theory is
r IEE, 2006
IEE Proceedings online no. 20050137
doi:10.1049/ip-gtd:20050137
Paper first received 19th April and in final revised form 5th September 2005
The authors are with the Department of Electrical and Electronic Engineering,
Hong Kong University, Pokfulam Road, Hong Kong
E-mail: [email protected]
44
applied to find Nash equilibria of power markets. Other
methods, such as genetic algorithm [13], Markov decision
process [14], Lagrangian relaxation [15], stochastic optimisation [16] and ordinal optimisation [17], are also used to
solve the optimal-bidding problem.
Recently, supply-function models have been used widely
to investigate the optimal-bidding problem [4, 18–24], since
the power-market rules require the individual supplier to
submit its supply function for representing the willingness of
generating at various electricity prices. The concept of
supply-function equilibrium (SFE) was originally developed
by Klemperer and Meyer [25] as a way to model market
players’ behaviour in a market with uncertain demand. In
[4], the Klemperer–Meyer equation is first used to compute
SFE in the England &Wales spot market. In [18], a linear
version of the supply-function-based model is developed. In
[19], some basic issues about learning through the SFE
approach are discussed. In [20], a two-point boundary
problem is defined using the SEF concept. A generalisation
of Green’s model [18] is presented in [21]. Then, Baldick and
Hogan [22] show that only the affine supply function can be
stable. A conjectural supply-function model is introduced in
[23]. In [24], the existence and uniqueness of linear SFE are
studied.
However, most of the prior work is based on the static
analysis of market equilibrium; little attention has been paid
to market evolution in the situation of repeated bidding. It
is clear that, during repeated bidding, suppliers will conduct
‘learning’ so as to make full use of available information
and then improve their bidding strategies based on knowledge acquired. Apparently, it is significant to explore
whether the suppliers have incentives to learn and the
subsequent impacts of suppliers’ rational learning behaviour
on the evolution of market equilibrium and MCP. The two
issues will be addressed in the paper, with proof. In
addition, [2] and [10] mention a ‘prisoner-dilemma’ issue
induced by supplier’s strategic bidding in power markets,
but the mechanism on why and how it may happen has not
IEE Proc.-Gener. Transm. Distrib., Vol. 153, No. 1, January 2006
been illustrated. This issue will also be explained in this
paper. Computer test results of a six-genco power market
show clearly that analytical results are correct, i.e.
(i) Suppliers have incentive to learn;
(ii) In certain condition, the overall learning behaviour will
reduce the MCP which, in turn, increases consumers’
surplus and decreases suppliers’ profits, while in some other
conditions the results are just the contrary; and
(iii) In either case, the MCP at equilibrium induced by the
overall learning behaviour reflects the true relationship of
supply and demand.
2
Bidding with optimal linear supply function
2.1 Assumptions on demand curve and
production-cost function
Assume that each electrical-power supplier has a cluster of
generators and hence the ISO has no need to consider
suppliers’ operation constraints, such as ramp rate, minimum up and minimum down time, generation capacity
etc., when we explore supplier’s strategic-bidding behaviour.
Also, for simplicity, the network impacts (congestion
and losses) are not considered at this stage. Also, suppose
that the overall demand can be described by a linear
inverse demand curve, and the demand is stationary
for the same hour of any day. Hence for the studied hour
we have:
n
X
qi
ð1Þ
p ¼ e fD ¼ e f
i¼1
where D ¼ total demand; qi ¼ electrical-power output of
supplier i; e, f ¼ positive coefficients publicly known from
historical records, constant here; n ¼ number of suppliers
(generator); and p ¼ MCP. As
Pelectrical-power energy can
not be stored, we have Q ¼ ni¼ 1 qi ¼ D.
Suppose that each supplier has a quadratic productioncost function which takes the form
1
Ci ðqi Þ ¼ ai þ bi qi þ ci q2i ; i ¼ 1; . . . ; n
ð2Þ
2
where ai , bi , ci are parameters of the cost function, generally
nonnegative.
2.2
Determination of MCP
For the studied hour, if all suppliers bid their supply
functions as qi ¼ qi ðpÞ ði ¼ 1; . . . ; nÞ, the ISO will
aggregate these bids, determine the MCP (p) and
individuals’ outputs to satisfy the demand at lowest costs.
It is clear that supplier i’s profit function is given by [see
(2) for Ci ðqi Þ]:
ð3Þ
pi ¼ pqi Ci ðqi Þ
Assume that each supplier is asked to submit a linear supply
function:
p ¼ b0i þ xi qi
ð4Þ
It is proven in [21] that optimal supply-function coefficient
b0i is equal to the true cost-function coefficient bi in (2).
Therefore, in our formulation, b0i is replaced by bi , and only
slope xi in (4) should be optimised. When ISO receives all
the bids, he will solve the following supply–demand-balance
equation for MCP (p) based on (1) and (4):
n
n
X
X
p bi
ep
¼ D
ð5Þ
qi ¼
¼
Q ¼
f
x
i
i¼1
i¼1
Here we assume that p4bi ði ¼ 1; . . . ; nÞ; otherwise the
corresponding supplier will not be dispatched. Solving (5),
IEE Proc.-Gener. Transm. Distrib., Vol. 153, No. 1, January 2006
yields MCP:
n b
P
i
i ¼ 1 xi
p ¼
n 1
P
1þf
i ¼ 1 xi
eþf
ð6Þ
Given the parameters e, f and fbi ; xi gi ¼ 1;...;n , the MCP can
be calculated according to (6). The individual generation qi
and the total supply and demand can be calculated
thereafter without difficulty. It is clear that parameters e,
f, bi are basically stationary and easily estimated from
historical data, while xi is versatile and time varying in the
repeated bidding. It is the decision of xi by an individual
supplier for maximising its profit that exercises market
power and causes spikes of MCP.
2.3
Optimal decision on xi
The slope of supply function xi should be nonnegative. In
this paper we assume xi 40 (if xi ¼ 0, we can assume it is a
very small positive value x). According to microeconomics,
we know that a rational supplier will not submit a supply
i
function below its marginal-cost function MCi ¼ @C
@qi ¼
bi þ ci qi ; therefore we have xi maxðci ; xÞ ði ¼ 1; . . . ; nÞ
and hence (because f 40; xj 40):
!
!
n
n
X
X
1
1
1þf
40; and
1þf
40 ð7Þ
x
x
j¼1 j
j ¼ 1;j6¼i j
Since p4bi ði ¼ 1; . . . ; nÞ, then with (6) and (7), we can
see that
n
X
bj bi
p bi 40 ) e bi þ f
40;
xj
ð8Þ
j ¼ 1;j6¼i
i ¼ 1; . . . ; n
According to (2), (3) and (4), the supplier’s profit function
can be rewritten as [eliminate p with (6)]:
p bi
p bi
1
p bi 2
pi ¼ p
ai bi
ci
2
xi
xi
xi
32
n
P
bj bi
7 6 e bi þ f
xj 7
6
1
j ¼ 1;j6¼i
7
6
!
¼ 6
7 x i 2 c i ai
n
P
5
4
1
xi 1 þ f
þf
x
j ¼ 1;j6¼i j
2
ð9Þ
It is clear from (9) that, if bj6¼i is fixed and xj6¼i can be
estimated from historical data; then, in the nonco-operative
situation, each supplier can determine its optimal xi based
on (9) to maximise its profit. Differentiating pi in (9) with
respect to xi , yields
!2
!
n
n
P
P
bj bi
1
e bi þ f
1þf
xj
xj
j ¼ 1;j6¼i
j ¼ 1;j6¼i
9
>
>
=
8
>
>
<
@pi
¼
@xi
f
þ ðci xi Þ
n
P
>
>
>
>
1
;
: 1þf
xj
j ¼ 1;j6¼i
(
xi 1 þ f
n
P
j¼1
!)3
1
xj
ð10Þ
45
For (10), with conditions (7) and (8), we can conclude that
0
1
B
C
@pi
f
B
C
þ
c
x
sign
¼ signB
C
i
i
n
P
1
@
A
@xi
1þf
j ¼ 1;j6¼i xj
and the supplier i’s optimal decision should be
f
xoipt ¼
þ ci
n
P
1
1þf
j ¼ 1;j6¼i xj
ð11Þ
ð12Þ
Eqns. (11) and (12) are reasonable from microeconomics,
i.e. when the number of supplier
P is infinite ðn ! 1Þ (or the
competition is perfect), then nj¼ 1;j6¼i x1j ! 0 and we have
xoipt ! ci ; when there is only one market supplier (monopoly) or the suppliers conduct a Cournot game (e.g.
duopoly of Cournot game), it is easy to prove [26] that
xoipt ¼ ci þ f . It is well known that a real market will be
less competitive than ‘perfect competition’, but more
competitive than ‘monopoly’; thus a supplier in an
oligopoly market should submit a supply function with
xoipt 2 ½ci ; ci þ f .
2.4
Features of SFE
When all suppliers learn, [24] has proven that there exists
the unique equilibrium, called supply-function equilibrium
(SFE).
Definition 1: A set of linear supply-function coefficients x
¼ ðxi Þi ¼ 1;...;n with xi 4 maxðci ; 0Þ is a Nash-supply-function equilibrium (SFE) if:
xi 2 arg maxfpi ðxi ; xi Þg; 8i ¼ 1; . . . ; n
ð13Þ
xi
where xi ¼ fxj jj ¼ 1; . . . ; n; j ¼
6 ig.
Definition 1 is correct since it is based on the definition of
Nash equilibrium. Thus at linear-supply-function equilibrium, we have [see (11)]:
f
; i ¼ 1; . . . ; n
ð14Þ
xi ¼ ci þ
n
P
1
1þf
j ¼ 1;j6¼i xj
Since (14) satisfies Definition 1, so the set of xi ði ¼
1; . . . ; nÞ constitutes a Nash SFE. This is an important
feature of SFE. From (14), we can see that to find the Nash
equilibrium of a power market under linear-supply-function
bidding is to solve a set of coupled nonlinear equations.
Also we can see that the equilibrium is independent of the
demand characteristic e and the marginal-cost-function
intercepts bi .
3 Impacts of overall learning on the market
equilibrium and market price
3.1
Concept of learning
In power markets, certain market information ðp; qj ; j ¼
1; . . . ; nÞ will be publicised. Supplier i can make estimates of
rivals’ bidding coefficients xj , based on the market
information (history data). Then, in the next round of
46
xi ½k ¼
1þf
Eqn. (11) shows that each supplier i has an incentive
to learn about xj of its rivals so as to get maximum
profit pi .
6 iÞ, we can conclude from (11) that a
Since xj 40 ðj ¼
rational supplier will take
ci xoipt f þ ci
bidding, it will improve its supply-function coefficient xi to
be the best response to the estimated coefficients xj;j6¼i . It is
clear that according to (11), supplier i can make the
following rational adjustment:
f
n
P
1
j ¼ 1;j6¼i xj ½k 1
þ ci
ð15Þ
where k is the time series and [k1] and [k] represented
the same hour of two subsequent days of k 1 and k.
Apparently, each supplier will have an incentive to make
the optimal decision through learning in the market in
order to obtain maximum profit. Here, learning in
the market means that supplier i adjusts its supply-function
coefficient xi according to (15) based on the estimation of
supply-function coefficients xj;j6¼i of others.
3.2
Impacts of rivals’ behaviour
From (15), we can see that each supplier’s profit pi is
also dependent on all the supply-function coefficients.
We can see that, during the repeated bidding, each supplier
will adjust its supply-function coefficient through
learning, which will in turn influence not only its own
profit, but also the others according to (9). Thus we need to
explore the impacts of one’s learning behaviour on
others’ profits and the impacts of overall learning on
6 iÞ
MCP. Differentiating pi in (9) with respect to xj ðj ¼
yields
!
n
X
@pi
1
bk bi
¼ 2 xi ci
e bi þ f
2
@xj
xk
k¼1
n
! e bj þ f P bk bj
ð16Þ
xk
fxi
k¼1
3
n 1
x2j
P
xi 1 þ f
k ¼ 1 xk
It is clear that we have [see (7), (8) and, with the constraint
xj maxðcj ; xÞ ðj ¼ 1; . . . ; nÞ]:
1
xi ci 40
2
!
n
X
bk bi
e bi þ f
40
xk
k¼1
!
fxi
40
x2j
!
n
X
bk bj
e bj þ f
40
xk
k¼1
(
xi
n
X
1
1þf
x
k¼1 k
!
)
þ f 40
Thus a very important observation can be made, i.e.
@pi
40
@xj
j¼
6 i
ð17Þ
Inequality (17) means that, if supplier j increases its
supply-function coefficient xj , its rivals can obtain more
profit, which lays a foundation for the study of interaction
of suppliers’ learning. To show the underlying economic
IEE Proc.-Gener. Transm. Distrib., Vol. 153, No. 1, January 2006
meaning, define supplier i’s residual demand function as
n
n
X
X
p bj
ep
ep
qi ð p Þ ¼
Qdi ¼
f
f
xj
j ¼ 1;j6¼i
j ¼ 1;j6¼i
!
!
n
n
X
X
bj
e
1
1
þ
þ
¼
p
f j ¼ 1;j6¼i xj
f j ¼ 1;j6¼i xj
ð18Þ
Thus, it can be seen that, if any other supplier increases its
6 iÞ, supplier i will face a
supply function coefficient xj ðj ¼
less elastic residual-demand curve, which is apparently
beneficial to supplier i.
3.3
Two dominant scenarios of learning
Two significant issues arising from learning in the markets
are how one’s decision will influence one’s rivals’ profits and
how the overall learning will affect MCP and total power
exchange. Two dominant scenarios are used as examples,
followed by further discussion. In the first case, all suppliers
have an initial guess of ‘highly competitive’ on market
conditions. In this case, supplier i revises its xi through
learning, and the MCP will increase with respect to time
and all suppliers will obtain more profits until market
equilibrium is reached. In the second case, all suppliers have
an initial guess of ‘highly oligopolistic’ on market conditions; then the MCP in equilibrium will decrease with
respect to time and at least some suppliers will obtain less
profit when the market equilibrium is reached, although
their incentives are to increase their profits. The detailed
proof is presented below.
Case 1: ‘Highly competitive’ initial guess
For this case, assume that the supplier’s initial guess is
that the market is ‘highly competitive’; therefore each
supplier will submit a supply function the same as its true
marginal cost function, i.e. xi ½0 ¼ ci ði ¼ 1; . . . ; nÞ.
According to (10), obviously we have
@pi ðx ½0 ¼ ci Þ 40 ði ¼ 1; . . . ; nÞ
@xi i
Since each supplier has the incentive to learn in the market
for maximum profit, we can expect that each supplier will
update its bid according to the rational learning process
(15).
Differentiating xi ½k in (15) with respect to xj ½k 1,
yields
f2
@xi ½k
¼
@xj ½k 1
ðxj ½k 1Þ
j ¼ 1; . . . ; n;
2
n
P
1
1þf
x
½k
j ¼ 1;j6¼i j 1
!2 40;
We know that the upper-bounded, increasing time series
fxi ½kg will finally converge [Here, it will converge to the
point xi ði ¼ 1; . . . ; nÞ, the unique solution of coupled
nonlinear equation (14)]. In the meantime, from (17), we
know that this adjustment will also increase others’ profits
(the profit function moves in the same direction as supply
function coefficients). Therefore, from the initial guess of a
‘highly competitive’ market, the overall learning behaviour
will benefit to all suppliers, i.e.
pi ðx1 ; x2 ; . . . ; xn Þ4pi ðx1 ½0; x2 ½0; . . . ; xn ½0Þ;
xi ½0 ¼ ci ; i ¼ 1; . . . ; n
Without loss of generality, it is easy to prove that, for any
xi ½0 2 fci xi ½0oxi g, inequality (19) still holds.
Case 2: ‘Highly oligopolistic’ initial guess
In this case, we assume that the supplier’s initial guess is
that the market P
is ‘highly oligopolistic’. Differentiating the
industry profit ( nj ¼ 1 pj , the total profit of all suppliers)
with respect to the variable xi at the equilibrium point fxi g,
i
we have [see (10), (14) and (17) and note that @p
@xi ¼ 0 at
xi ¼ xi ]:
n
n
X
X
@pj @pj ðxi ¼ xi Þ ¼
ðx ¼ x Þ 40; i ¼ 1; . . . ; n
@xi
@xi i i
j¼1
j ¼ 1;j6¼i
ð21Þ
Equation (21) implies, that at the equilibrium point fxi g,
the industry profit is not maximised ( from game theory; we
know that the industry profit is maximised at the cooperative-equilibrium point. Here we study the noncooperative equilibrium, and it is reasonable that the industry
profit is not maximised at the equilibrium). Also, from (9),
we know that the profit function pi ði ¼ 1; . . . ; nÞ is twice
continuously differentiable with respect to all coefficients xj
ðj ¼ 1; . . . ; nÞ, which means that the derivative given by
(21) is also continuous. Without loss of generality, we can
always find an upper-bound positive x, such that:
n
X
@pk
0; xi 2 xi oxi ^xi ; i ¼ 1; . . . ; n ð22Þ
@x
i
k¼1
where ^xi ¼ xi þ x ði ¼ 1; . . . ; nÞ.
At the point of f^xi gi ¼ 1;...;n , we have [see (7)]:
f
1þf
We know that xi ½k is a monotonically increasing function
6 iÞ.
of xj ½k 1 ðj ¼
Apparently, from the ‘highly competitive’ initial condition with xj ½0 ¼ cj ðj ¼ 1; . . . ; nÞ and (15), we have
f
f
þ ci ¼
xi ½1 ¼
n
n
P
P
1
1
1þf
1þf
x
c
½0
j ¼ 1;j6¼i j
j ¼ 1;j6¼i j
ði ¼ 1; . . . ; nÞ
It will in turn induce others’ responses with (19), so we
obtain an increasing time series fxi ½kg for any
i ¼ 1; . . . ; n. The series has its boundary condition (12).
IEE Proc.-Gener. Transm. Distrib., Vol. 153, No. 1, January 2006
1
^
x
j ¼ 1;j6¼i j
¼
j¼
6 i
þ ci 4xi ½0 ¼ ci
n
P
1þf
ð19Þ
ð20Þ
f
n
P
1
j ¼ 1;j6¼i xj þ x
x 1 þ f
¼
þ ci ^xi
n
P
j ¼ 1;j6¼i xj
n
P
f
1
1þf
þx
x
j ¼ 1;j6¼i j
1þf
n
P
1
j ¼ 1;j6¼i xj
1
þf
þx
!
n
P
x
1
x
j ¼ 1;j6¼i j
n
P
!
1
1þf
x
j ¼ 1;j6¼i j
ð23Þ
! o0
Therefore, with (14) and (23), it is not difficult to show that
@pi i ¼ 1; . . . ; n
ð24Þ
ðx ¼ ^x ;j ¼ 1;...;nÞ o0;
@xi j j
This is an important feature of case 2.
Now, under the ‘highly oligopolistic’ initial condition
ði ¼ 1; . . . ; nÞ,
we
have
with
xi ½0 ¼ xi þ x
@pi @xi ðxj ¼ xj þx;j ¼ 1;...;nÞ o0. Thus each supplier has the incentive
to decrease its supply-function coefficient, i.e., xi ½1oxi ½0
47
ði ¼ 1; . . . ; nÞ. Furthermore from (19), we know that the
time series fxi ½kgi ¼ 1;...;n is monotonically decreasing. The
series also has its boundary condition defined in (12).
We know that the low-bounded, decreasing time series
fxi ½kgi ¼ 1;...;n will finally converge [here, it will converge to
the equilibrium point xi ði ¼ 1; . . . ; nÞ, the unique solution
of coupled nonlinear (14)].
For the decreasing series fxi ½kgi ¼ 1;...;n , with condition
(22),
nP we knowothat the corresponding total-profit time series
n
j ¼ 1 pj ½k is decreasing too, i.e.
n
X
pj ðx1 ½k; x2 ½k; . . . ; xn ½kÞ
j¼1
o
n
X
pj ðx1 ½k 1; x2 ½k 1; . . . ; xn ½k 1Þ;
ð25Þ
(c) Moreover, the overall learning behaviour will lead to
reasonable market equilibrium. For example, if all suppliers
guess that the market initial competition is more likely
‘highly oligopolistic’ and bid with a large xi , then the
learning behaviour will guide suppliers to reduce the
bidding coefficients xi rationally. At the new equilibrium
point, the MCP will decrease, which reflects the true
relationship between market demand and supply. The
opposite is also true.
(d) The fact of (c) shows that the MCP at the market
equilibrium is actually an important economic signal to
reflect the true relationship of market demand and supply.
That is to say, suppliers’ rational learning and optimal
decisions are not a bad thing, since they can help to reach
an appropriate power-market equilibrium, which shows
clearly the advantages of free markets as ‘an invisible hand’.
j¼1
k ¼ 1; 2; . . .
4
Thus, at the equilibrium point fxi g there must be
n
X
pj ðx1 ; x2 ; . . . ; xn Þo
j¼1
n
X
pj ðx1 ½0; x2 ½0; . . . ; xn ½0Þ
j¼1
ð26Þ
If all suppliers have the same cost function (symmetrical
situation), then, at the equilibrium point fxi g, all supplies
have the same profit ½p1 ðx1 ; x2 , . . . , xn Þ ¼ p2 ðx1 , x2 , . . . ,
xn Þ ¼ . . . ¼ pn ðx1 , x2 , . . . , xn Þ. According to (26), we know
that individual suppliers’ profit will also decrease with
respect to time, i.e.
pi ðx1 ; x2 ; . . . ; xi ; xiþ1 ; . . . ; xn Þopi ðx1 ½0; x2 ½0; . . . ; xn ½0Þ;
xi ½0 ¼ xi þ x;
i ¼ 1; . . . ; n
ð27Þ
It is, indeed, a scenario of ‘prisoner dilemma’, in which each
supplier conducts ‘learning’ for more profit, but the
outcome is just the contrary.
In the unsymmetrical situation (i.e. suppliers’ cost
functions are different), according to (26), we know that
at least one supplier (say supplier l) will obtain less profit,
i.e.
pl x1 ; x2 ; . . . ; xi ; xiþ1 ; . . . ; xn opl ðx1 ½0; x2 ½0; . . . ; xn ½0Þ;
xi ½0 ¼ xi þ x;
i ¼ 1; . . . ; n;
9l 2 f1; . . . ; ng
Numerical test results
The IEEE six-generator 30-bus system is used for testing.
The suppliers’ cost-function coefficients are listed in Table 1.
Suppose that the (hourly-based) inverse demand function in
a spot market is known as (p in $/MWh, D and qi in MW):
p ¼ 50 0:02D ¼ 50 0:02
6
X
ð29Þ
qi
i¼1
Suppose that the market information is complete and the
supplier conducts learning based on (15). ISO receives all
the submitted linear supply functions [as (4)], determines the
MCP according to (5) and the individual generation. We
have simulated and analysed eight cases, where case 1 is a
pure Cournot competition, where all suppliers adopt the
Cournot competition strategy and the corresponding supply
function is p ¼ bi þ ðci þ f Þqi ; and cases 2–7 corresponding to different numbers of learners ( from one to six) with
the remaining suppliers adopting the Cournot strategy;
while in case 8, all suppliers adopt the perfect competition
strategy (being a price taker, the supply function is
p ¼ bi þ ci qi ). Table 2 shows the MCP, each supplier’s
generation output qi and profit pi at the equilibrium for the
studiedPhour as well as MCP and industry output Q and
profit i pi .
ð28Þ
For supplier l, the ‘prisoner dilemma’ still exists. (Moreover,
if the unsymmetry is not too serious, we can still expect that
most suppliers will obtain less profit.)
In both situations of case 2, the series fxi ½kgi ¼ 1;...;n are
all decreasing; thus the total supply will increase and the
MCP will decrease, which is beneficial to the consumers.
We can make further comments on the learning impacts
as follows:
(a) As an extension of cases 1and 2, we can expect that, if
the initial condition is mixed, i.e. some suppliers’ initial
actions are based on some ‘more competitive’ market
estimation, while others are just the contrary, then the
outcome of overall learning is uncertain. Generally, the
former will increase its profit, while the latter will experience
profit loss during the learning.
(b) In any situation, a rational supplier will always have the
willingness to learn, as learning means making optimal
decisions and bring about more profit compared with
nonlearning, no matter whether others are learning or not.
48
Table 1: Cost-function coefficients
Supplier
ai
bi
ci
1
0
2
0.02
2
0
1.75
0.0175
3
0
3
0.0175
4
0
3
0.025
5
0
1
0.0625
6
0
3.25
0.00834
Without loss of generality, let us examine supplier 2’s
profit fluctuation in different cases. In case 1 (all supplies
adopt Cournot strategy, i.e. the initial guess of market
competition is ‘highly oligopolistic’ and all suppliers do not
learn), supplier 2’s profit is $3461.7. In case 2 (only supplier
1 learns), supplier 2’s profit is reduced to $3031.4 for
nonlearning. In case 3, supplier 2 will also learn, and its
IEE Proc.-Gener. Transm. Distrib., Vol. 153, No. 1, January 2006
Table 2: Computation results (q in MW, p in $, p in $/Mwh)
Case 1
q1
319.06
p1
3054
q2
347
p2
3461.7
q3
261.39
2220.5
p3
261.39
q4
2220.5
p4
q5
166.82
1426.2
p5
406.23
q6
3988.5
p6
p
Q
P
14.76
1761.9
i
pi
16371
Case 2
459.89
3370
324.71
3031.4
242.82
1916.2
242.82
1916.2
156.69
1258.2
376.74
3430.5
13.927
1803.7
14923
Case 3
Case 4
432.11
2874.8
416.73
2617.6
487.63
3392.6
3099.5
1596.6
1737.3
1432.8
145.14
343.12
2845.6
13386
5
IEE Proc.-Gener. Transm. Distrib., Vol. 153, No. 1, January 2006
451.33
2759.6
298.19
1516.6
298.19
1516.6
353.4
1730.4
405.12
2080.6
258.44
1085.4
258.44
Case 8
348.43
1214
412.49
1488.8
238.74
712.47
238.74
1085.4
712.47
132.77
162.13
142.9
127.5
903.46
931.76
709.48
507.98
324.58
307.12
302.17
560.18
2279.7
11.954
1902.3
12420
In this paper, we study the impacts of power suppliers’
learning behaviour on electricity spot-market equilibrium
under repeated linear supply-function bidding. In such
markets, each individual supplier will conduct ‘learning’ to
improve its strategic bidding in order to earn more profit.
Therefore, it is significant to explore the impacts of such
learning behaviour on market equilibria and market
clearing prices (MCP). In this paper, we first establish the
mathematical model for a supplier’s optimal bidding, which
is used to solve for the market equilibrium. Then it is shown
that supply-function equilibrium is a Nash equilibrium; and
that, in certain conditions, the overall learning behaviour
will reduce the market clearing price, which in turn increases
consumers’ surplus and decreases suppliers’ profits; while in
some other conditions the results are just the opposite. In
either case, the MCP at equilibrium induced by the overall
learning behaviour reflects the true relationship of supply
397.32
2320.5
Case 7
138.77
12.448
Conclusions
Case 6
986.92
1877.6
profit will increase from $3031.4 to $3392.6, i.e. supplier 2
has the incentive to learn. From case 3 to case 6, when other
suppliers join the learning team one by one, supplier 2’s
profit will decrease successively. In case 7 (when all suppliers
learn), supplier 2’s profit is reduced to $2080.6, which is far
less than the profit of $3461.7 in case 1. Considering other
suppliers’ profit evolution, we can also make similar
observations. Therefore, there exists a ‘prisoner dilemma’,
i.e. although all supplies learn for more profits, the outcome
is that all suppliers obtain less profits if, at the very
beginning, a ‘highly oligopolistic’ guess about the market is
made by all suppliers. Moreover, we can see that the MCP
continuously decreases from case 1 to case 7, which means
that, when the initial estimate of market competition is
‘highly oligopolistic’, suppliers’ rational learning behaviours
will lead to a reduction of xi and hence a lower MCP. When
all suppliers behave as price takers from the guess of ‘perfect
competition’ in case 8, all suppliers’ profits are even less
than that in case 7. Therefore we can expect that, if starting
from case 8, and all suppliers learn rationally, the learning
will lead to more benefits to all suppliers. Finally, the
market will still converge to the equilibrium in case 7 with
MCP larger than case 8, but smaller than cases 1–6.
302.24
1564.3
2546.3
12.974
1851.6
302.24
1564.3
209.97
1079.6
456.11
2833.7
315.88
221.65
401.84
2385
471.66
221.65
1596.6
Case 5
11531
2206.9
11.814
1909.3
11252
2713.8
10.43
685.68
1960.5
8.97
1978.5
2051.6
9405.2
6596.3
and demand. Numerical results support the analytical
conclusions very well.
6
Acknowledgment
The authors acknowledge financial support from the RGC,
Hong Kong Government, Hong Kong.
7
References
1 Shahidehpour, M., and Almoush, M.: ‘Restructured electrical power
systems: operation, trading, and volatility’ (Marcel Dekker Inc., New
York, 2001)
2 Rudkevich, A., Duckworth, M., and Rosen, R.: ‘Modeling electricity
pricing in a deregulated generation industry: the potential for
oligopoly pricing in a Poolco’, Energy J., 1998, 19, (3), pp. 19–48
3 Berry, C.A., and Hobbs, B.F. et al.: ‘Understanding how market
power can arise in network competition: a game theoretic approach’,
Util. Policy, 1999, 8, pp. 139–158
4 Green, R.J., and Newbery, D.M.: ‘Competition in the British
electricity spot market’, J. Political Economy, 1992, 100, (5)
5 David, A.K.: ‘Competitive bididng in electricity supply’, IEEE Proc.
Circuit. Transm. Distrib., 1993, 140, pp. 421–426
6 David, A.K., and Wen, F.S.: ‘Strategic bidding in competitive
electricity markets: a literature survey’. Proc. IEEE PES 2000 Summer
Power Meeting, Seattle, USA, 2000, pp. 2168–2173
7 Gross, G., and Finlay, D.J.: ‘Optimal bidding strategies in competitive
electricity markets’. Proc. 12th Power Syst. Comput. Conf.
(PSCC’96), Dresden, Germany, 1996, pp. 815–823
8 Hao, S.: ‘A study of basic bidding strategy in clearing pricing
auctions’, IEEE Trans. Power Syst., 2000, 15, pp. 975–980
9 Ferrero, R.W., Rivera, J.F., and Shahidehpour, S.M.: ‘Application of
games with incomplete information for pricing electricity in deregulated power pools’, IEEE Trans. Power Syst., 1998, 13, pp. 184–189
10 Guan, X., Ho, Y.-C., and Pepyne, D.L.: ‘Gaming and price spikes
in electric power markets’, IEEE Trans. Power Syst., 2001, 16,
pp. 402–408
11 Park, J.-B., Kim, B. H., Jung, M.-H., and Park, J.-K.: ‘A continuous
strategy game for power transactions analysis in competitive electricity
markets’, IEEE Trans. Power Syst., 2001, 16, pp. 847–855
12 Song, H., Liu, C.-C., and Lawarr!ee, J.: ‘Nash equilibrium bidding
strategies in a bilateral electricity market’, IEEE Trans. Power Syst.,
2002, 17, pp. 73–79
13 Richter, C.W. Jr., Shebl!e, G.B., and Ashlock, D.: ‘Comprehensive
bidding strategies with genetic programming/finite state automata’,
IEEE Trans. Power Syst., 1999, 14, pp. 1207–1212
14 Song, H., Liu, C.-C., Lawarr!ee, J., and Dahlgren, R.W.: ‘Optimal
electricity supply bidding by Markov decision process’, IEEE Trans.
Power Syst., 2000, 15, pp. 618–624
15 Zhang, D., Wang, Y., and Luh, P.B.: ‘Optimization based bidding
strategies in the deregulated market’, IEEE Trans. Power Syst., 2000,
15, pp. 981–986
16 Wen, F., and David, A.K.: ‘Optimal bidding strategies and modeling
of imperfect information among competitive generators’, IEEE Trans.
Power Syst., 2001, 16, pp. 15–21
49
17 Guan, X., Ho, Y.-C., and Lai, F.: ‘An ordinal optimization based
bidding strategy for electric power suppliers in the daily energy
market’, IEEE Trans. Power Syst., 2001, 16, pp. 788–797
18 Green, R.: ‘Increasing competition in the British electricity spot
market’, J. Ind. Economics, 1996, 44, (2)
19 Rudkevich, A.: ‘Supply function equilibrium in power markets:
learning all the way’, TCA technical paper 1299-1702, 22 December
1999
20 Rudkevich, A.: ‘Supply function equilibrium: theory and applications’.
Proc. 36th Hawaii Int. Conf. System Sciences (HICSS’03),
2003
21 Baldick, R., Grant, R., Kahn, E.: ‘Linear supply function equilibrium:
generalizations, applications, and limitations’, University of California
Energy Institute POWER paper PWP-078, www.ucei.berkeley.edu/
PDF/pwp078.pdf, August 2000
50
22 Baldick, R., Hogan, W.: ‘Capacity constrained supply function
equilibrium models of electricity markets: stability, non-decreasing
constraints, and function space iterations’. University of California
Energy Institute POWER paper PWP-089, August 2002
23 Day, C.J., Hobbs, B.F., and Pang, J.-S.: ‘Oligopolistic competition in
power networks: a conjectured supply function approach’, IEEE
Trans. Power Syst., 2002, 17, (3), pp. 597–607
24 Liu, Y., Ni, Y.X., and Wu, F.F.: ‘Existence & uniqueness of linear
supply function equilibrium in electricity markets’. IEEE GM2004,
June 2004, Vol. 1, pp. 249–254
25 Klemperer, P.D., and Meyer, M.A.: ‘Supply function equilibria in
oligopoly under uncertainty’, Econometrica, 1989, 57, (6), pp. 1243–
1277
26 Fudenberg, D., and Tirole, J.: ‘Game theory’ (MIT Press, Cambridge,
Mass., 1991)
IEE Proc.-Gener. Transm. Distrib., Vol. 153, No. 1, January 2006