Power and Delay Optimal Policies for Wireless Systems

Power and Delay Optimal Policies for
Wireless Systems
Satya Kumar V, Anusha Lalitha, Vinod Sharma
Department of Electrical Communications,
Indian Institute of Science, Bangalore
Email: [email protected], [email protected], vinod @ece.iisc.ernet.in
Abstract—In this paper we consider a single discrete time
queue with infinite buffer. The channel may experience fading.
The transmission rate is a linear function of power used for
transmission. In this scenario we explicitly obtain power control
policies which minimize mean power and/or mean delay. There
may also be peak power constraint.
Index Terms—Fading channel, Optimal power policy, Optimal
delay policy.
I. INTRODUCTION
For last few decades Information and communication technology (ICT)/Signal Processing (SP) applications are growing
very rapidly. Consequently in near future power consumption
by these applications is going to be around 15% of the total
world consumption of electricity [1]. Thus ICT infrastructure
is contributing to global warming of the planet. This impact
is magnified by the requirement for expensive diesel fuel for
many locations in developing regions where the regular power
supply may not be available. Given the significant impact that
communications infrastructure has on the environment today,
any reduction, even incremental, in power usage will scale
to important improvements in the overall carbon footprint of
this industry. The primary objective of green communication
is to provide quality of service (QoS) at reduced energy
consumption.
In this paper we would like to address the problem of
reducing power consumption in a wireless communication
system. A cellular system consumes power mainly in the
Base Station (BS). In a BS the power is consumed mainly
in the air conditioner, data processors, power amplifier and
RF transmission. We will focus on reducing power in RF
transmission.
Goldsmith and Varaiya [5] presented an optimal power
allocation policy for a single fading link. They ignored higher
layer performances like queueing delay and emphasized on
physical layer performance. Goyal et al. [6] considered an
optimal average delay policy under the average power constraint and proved existence of an optimal policy and its
structural results and an explicit expression. Chen et al. [4]
discussed an energy efficient scheduling in presence of fading
with individual packet delay constraints when the number of
packets arrivals in a time slot and channel state information
are know before hand (optimal offline scheduling) and also
978-1-4673-0816-8/12/$31.00 ©2012 IEEE
proposed an online scheduling algorithm that performs close
to the offline case. In a multi-user scenario for time varying
channels, Neely [11] proposed an optimal power policy which
allocates power based on current channel state and current
queue backlogs and provides stability for all users under a
peak power constraint. Neely [12] proposed a new innovative
algorithm which optimizes power when there is no knowledge
of arrival rates and channel proababilities and intelligently
drops a small number of packets. Alvi et al. [10] found a
policy which minimizes energy for sending a fixed number of
packets under given delay constraints in presence of fading.
We consider the problem of minimizing the average power
under an average delay constraint when the transmit rate
is a linear function of power. In this set up we provide
explicit optimal power schemes that minimize mean delay
also when there is no fading. We also consider the case when
there is a peak power constraint. For the fading channel we
provide explicit optimal average power policies that stabilize
the queue. Finally we show existence of optimal power policies
when there are average delay constraints.
This paper is organised as follows. In Section II the system
model is described. Power and delay optimal policies under
different scenario are discussed in Sections III and IV. In
Section V, we formulate the problem as a Markov decision
problem (MDP) and show existence and optimal structure of
stationary average cost optimal policy. Section VI concludes
the paper.
II. SYSTEM MODEL
We consider a discrete time, single server queue. A slot is
of duration τ . Let Ak bits/packets arrive in slot k representing
interval [kτ, (k + 1)τ ], k ≥ 0 and can be transmitted after
time (k + 1)τ . The packets on arrival are stored in an infinite
buffer queue till transmission. In slot k, let hk be the channel
gain and Rk bits are allowed to be transmitted. Let qk denote
the queue length in bits at time kτ . Then the {qk } evolves
as (in the following we will assume that the packets can be
fragmented arbitrarily)
+
qk+1 = (qk − Rk )+ + Ak
(1)
where (x) = max(0, x). We will assume {Ak , k ≥ 0} to be
stationary ergodic.
To transmit Rk bits in slot k, the power required Pk depends
on Rk and the channel gain hk . We are concerned about
finding policies which as a function of (qk , hk ) and their
past values, find the Rk (equivalently Pk ) that will satisfy
certain objectives under different constraints. We will ignore
the power consumed in other activities.
Usually this problem is formulated as a MDP. However this
way often one computes the optimal policy numerically but
does not get a closed form expression of the policy and also
not much insight. Thus, in the following we will consider the
special case where Rk is a linear function of Pk and hk . Often
in practical situations the Rk is approximately linear ([14],
[15]). Under this assumption, we will get closed form explicit
optimal solutions for several scenarios of interest. Finally, we
will also use MDP to consider more general scenario.
First we consider the channel without fading. Then we
assume
(2)
Rk = αPk .
Fig. 1.
Sytem Model
if E[A1 ] < Rmax . Then Eπ [q] < ∞ if E[A21 ] < ∞. The
optimal mean power ≤ Pmax . If Eπ [q] = ∞ then it equals
Pmax ; otherwise = E[A]/α.
IV. O PTIMAL PO LICIES : W ITH FADING
for some α > 0 (This is a good approximation of Shannon
formula 12 log(1 + SN R) at low SNR). We will consider
scenarios where we optimize the average transmit power.
There may or may not be a peak power constraint. Also,
we will always require that the queue is stable but will
often be concerned about the delays also. By Little’s law [8]
Eπ [q] = E[A]Eπ [D] where Eπ [q] is the mean queue and
Eπ [D] is the mean delay of a bit under stationarity. Thus
a constraint on mean delay can be written in terms of mean
queue length.
Now we consider a channel with fading. We will assume
that {hk } is stationary ergodic. Let |hk | ≤ h ≤ ∞ with
probability one and Rk = αPk hk . Also assume that p =
P (hk = h) > 0 if h < ∞.
III. O PTIMAL P OLICIES : N O FADING
If there is no h < ∞ with P (h ≤ h) = 1, then there is no
policy that minimizes mean power. But the following policies
are good policies. Take h0 > 0, now P (h ≥ h0 ) > 0. Consider
the policy
�
qk
, for hk ≥ h0 ,
Pk = αhk
(4)
0,
otherwise.
We will obtain explicit policies which minimize
n−1
lim sup
n→∞
1 �
E
Pk = P .
n
(3)
k=0
The proof of the following proposition is simple and can for
example, be obtained using the exchangeable argument in [16].
Proposition 1. The policy Rk = qk minimizes (3) and also
has minimum delay. �
The policy in Proposition 1 is actually Rk+1 = Ak for
k > 1 and the delay of each packet/bit is one slot. Thus the
optimal power (3) equals E[A1 ]/α.
Often in practice there is peak power constraint Pk ≤ Pmax ,
for all k, where Pmax < ∞. Then Rmax = αPmax is the
maximum number of bits that can be transmitted in a slot.
The following policy simultaneously minimizes mean power
(3) as well as mean delay under peak power constraint.
Proposition 2. The policy Rk = min(qk , Rmax ) minimizes
mean power as well as mean delay. �
Although the policy in Proposition 2 minimizes mean power
and delay, the queue may still be unstable and then Eπ [q] =
∞. The queue is stable (has a unique stationary distribution)
Proposition 3. If h < ∞, the policy Rk = qk 1{hk =h} is
mean power optimal. �
Although the policy in Proposition 3 is mean power optimal
and stabilizes the queue, it may not be mean delay optimal.
We will consider such policies later. Also, under this policy
P = E[A]
αp .
All these policies stablize the queue. As h0 increases the
average limiting power decreases but the average delay also
increases.
If there is also a peak power constraint Pk ≤ Pmax , then
under the policy in Proposition 3, (modified to satisfy peak
power constraint : Rk = min(qk , Rmax )1{hk =h} the queue
may be unstable. Let
� ∞
x = max{x : αPmax
ydPh (y) > E[A]}
(5)
x
where Ph is distribution of h. Then if Ph is continuous at
x the optimal policy is to transmit at power Pmax whenever
hk ≥ x and
P = Pmax P [h ≥ x].
(6)
If Ph has a jump at x, then policy (6) is modified as : at x
transmit at power P < Pmax such that mean transmit rate just
exceeds E[A].
V. MEAN DELAY CONSTRAINT
Next we consider the problem when there is fading and we
want to minimize
n−1
lim sup
n→∞
1 �
E
Pk
n
n→∞
k=0
+ E[N ]E[A] < ∞
1 �
E
qk ≤ q
n
(7)
n−1
1�
E[Pk + βqk ]
n
Now from [13], an optimal discounted policy exists which
minimizes
n−1
�
lim sup E[
(Pk + βqk )γ k ]
(11)
n→∞
k=0
for an appropriately chosen q. To solve this problem, we
first use Lagrange multiplier’s ([9]) to convert this to an
unconstrained problem of finding the policy that minimizes
lim sup
n−1
1�
E[N ]E[A] 1
E[ |h ≥ h0 ]
E[Pk + βqk ] ≤
n
α
h
k=0
n−1
n→∞
lim sup
n→∞
subject to a constraint on the mean waiting time. By Little’s
law we can replace this constraint with
lim sup
Let p = P [h ≥ h0 ] and N = inf {k : hk ≥ h0 }. Then since
p > 0, E[N ] < ∞. Also, for this policy
(8)
k=0
where β > 0 is appropriately chosen. In the following
we use MDP to show the existence of an optimal policy
for this problem. For that we assume that {Ak } and {hk }
are independent and identically distributed (i.i.d) sequences,
independent of each other. Then (qk , hk ) will be taken as the
state of the system at time k, where qk ∈ {0, 1, 2, ...} and
hk ∈ R+ .
The following theorem shows the existence of an optimal,
stationary Markov policy in the class of all policies which
depend on the whole history of the system, i.e., the power
used at time k depends on (q0 , h0 ), ..., (qk , hk ).
�
�
Theorem 1: If E h1 |h ≥ h0 < ∞ for some h0 ≥ 0 with
P (h ≥ h0 ) > 0 then there is a stationary Markov policy which
minimizes (8) and at that policy (8) is finite and independent
of initial cost.
Proof: To show the existence of an optimal policy we use
Theorem 3.8 in [13]. We consider condition W in [13]. For
this observe that if the state at a time is (q, h) then the actions
possible are
q
}.
(9)
A(q, h) = {p : p ≤
αh
A(q, h) is compact and the set function (q, h) �→ A(q, h)
is upper semi-continuous. Also if action P is taken, the
cost is c(P, q, h) = P + βq. Thus the cost function is a
continuous function of (P, q, h). Furthermore, the next state
P +
then is (q − αh
) + A where A is a r.v. independent of
(P, q, h) and has the distribution of Ak . Thus the transition
function of (Pk , qk , hk ) �→ P [(qk+1 , hk+1 ) ∈ B|(Pk , qk , hk )]
is weakly continuous. Thus condition W in [13] is satisfied.
Next we show that for some initial condition (q, h), there
is a stationary policy that provides a finite average cost. We
take initial state (0, h). Also, we use the stationary policy
�
qk
if hk ≥ h0
Pk = αhk
(10)
0
otherwise
k=0
for any 0 < γ < 1. Let us denote the optimal cost for a given
γ by vγ (q, h) where (q0 , h0 ) = (q, h).
To show the existence of a policy minimizing (8), we next
need to prove that
�
supγ<1 (vγ (q, h) − inf
vγ (q , h� )) < ∞
�
�
(q ,h )
(12)
for any (q, h) in the state space. It is easy to show that vγ (q, h)
is an increasing function of q. Also, from our system equation
(1) we can see that vγ (0, h) ≡ vγ (0) is independent of h.
Thus,
�
�
�
�
�
vγ (q, h) = min{p + βq + γ vγ (q , h )dP ((q , h )|(q, h))}
p
whereP (.|.) is the transition function of the Markov chain
(qk , hk ) and hence
q
+ βq + γvγ (0).
vγ (q, h) ≤
αh
q
The inequality is obtained by taking the policy p = αh
in step
one. Since the policy (10) gives a finite discounted cost for
any γ > 0, vγ (0) < ∞ and hence
q
+ βq < ∞
(13)
vγ (q, h) − vγ (0) ≤
αh
for any (q, h) and any 0 < γ < 1. Thus we obtain the existence
of an average cost optimal policy for any β > 0. �
To obtain the Markov stationary optimal policy for the
constrained problem, from [9], we need to show that at the
optimal policy obtained in Theorem 1, the limit (8) is attained
and there is a β > 0 at which this optimal policy has the limit
in (7) equal to q. From Theorem E.10 of [7], we know that
the limit in (8) is attained.
�n−1 Also as β increases, at the optimal
policy limn→∞ E[ n1 k=0 qk ] decreases. Thus we can choose
an appropriate β to get q.
VI. CONCLUSIONS
In this paper we have considered the problem of obtaining
power control policies for a discrete time single queue that
minimizes long term average power. We require that the
queue should be stable. The transmission rate is a linear
function of power. For this case explicit optimal policies are
obtained for a channel with or without fading. There may
be peak power constraint also. These policies also minimize
mean delay for the non-fading channel. Finally, for the fading
channel the existence of optimal policies that satisfy mean
delay constraints via Markov decision problem.
In future one can consider these problems for a multiple
access channel, broadcast channel and multihop networks.
R EFERENCES
[1] A. Amanna, “Green Communications Annotated Literature Review and
Research Vision,” ICTAS, Virginia Tech, 2010.
https://filebox.vt.edu/users/aamanna/webpage/Amanna%20%20Green%20Communications%20Literature%20Review%20and
%20Research%20Proposal.pdf
[2] R. A. Berry, R. G. Gallager, “Communication over Fading Channels
with Delay Constraints,” IEEE Transaction on Information Theory, Vol.
48, no. 5, pp. 1135-1149, May 2002.
[3] Bertsekas and R. Gallager, “ Data Networks,” Second Edition, Prentice
Hall.
[4] W. Chen, U. Mitra and M. J. Neely, “Energy-efficient scheduling with
individual delay constraint Over a Fading Channel,” Proc. of the 5th
Int. Symposium on Modeling and Optimization in Mobile, Ad Hoc, and
Wireless Networks, 2007.
[5] A. J. Goldsmith, P. Varaiya, “ Capacity of Fading Channels with Channel
Side Information,” IEEE Transaction on Information Theory, vol. 43, no.
6, 1986-92, Nov 1997.
[6] M. Goyal, A. Kumar and V. Sharma, “Power Constrained and Delay
optimal policies for Scheduling Transmission over a Fading Channel,”
IEEE INFOCOM, pp. 311-320, 2003.
[7] O. Hernandez-Lerma and J. B. Lassere, “Discrete Time Markov Control
Process,” Springer-Verlang, 1996.
[8] J. D. C, Little, “A Proof of the Queuing Formula: L=λW,“ Operations
Research, vol. 9, no. 3, pp. 383-387.
[9] D. J. Ma, A. M. Makowski, A. Shwartz, “Estimation and Optimal
Control for Constrained Markov Chains,” IEEE Conf. on Decision and
Control, Dec 1986.
[10] A. Fu. E. Modiano and J. Tsitsiklis, “Optimal Energy Allocation for
Delay-Constrained Data Transmission over a Time-Varying channel,”
Proc. IEEE INFOCOM, 2003.
[11] M. J. Neely, “Optimal Energy and Delay Tradeoffs for Multi-User
Wireless Downlinks,” IEEE Transactions on Information Theory, vol.
53, no. 9, Sep 2007.
[12] M. J. Neely, “Intelligent Packet Dropping for Optimal Energy- Delay
Tradeoffs in Wireless Downlinks,” IEEE Transactions on Automatic
Control, vol. 54, no. 3, March 2009.
[13] M. Schal, “Average Optimality in Dynamics Programming with General
State Space,” Mathematics of operations Research, vol. 18, no. 1, pp.
163-172, Feb 1993.
[14] V. Sharma, U. Mukherji, V. Joseph and S. Gupta, “Optimal Energy
Management Policies for Energy Harvesting Sensor Nodes,” IEEE
Transactions on Wireless Communications, Vol. 9, no. 4, April 2010.
[15] D. Tse, V. Pramod, “ Fundamentals of Wireless Communication,”
Cambridge University, 2005.
[16] J. Walrand, “Introduction to queueing networks,” Prentice Hall, 1988.