TWO-STAGE NEWSBOY MODEL WITH BACKORDERS AND

TWO-STAGE NEWSBOY MODEL WITH BACKORDERS AND
INITIAL INVENTORY
Ali Cheaitou† , Christian van Delft‡ , Yves Dallery†‡ and Zied Jemai†
†
Laboratoire Génie Industriel, Ecole Centrale Paris, Grande Voie des Vignes,
92295 Chatenay-Malabry Cedex, France, [email protected], [email protected], [email protected]
‡
HEC Paris, 1 Rue de la Libération, 78351 Jouy-en-Josas Cedex, France, [email protected]
Abstract: In this paper, we develop a general two-stage newsboy model. Each period decision
induces specific costs. In addition to the usual decision variables for such models, we consider
that, at the beginning of the decision process, an initial inventory is available and some preliminary fixed orders are to be delivered at each period. The unsatisfied demands during a period
are backlogged to be satisfied in the future. The model is solved by a dynamic programming
approach. We then provide insight regarding this type of two-stage inventory decision process
with the help of numerical examples.
Keyword: Supply Chain Planning; Dynamic Programming; Demand Management; Modelling
Methodologies.
1
Introduction
The single-stage newsboy model has received a lot of attention in operations research and operations management literature. Basically, the numerous versions of this model determine, under
different assumptions, the orders and inventory quantities that optimally satisfy an uncertain
future demand. Given the intrinsic simplicity of this class of inventory model, the optimal
solutions can often be explicitly given under an analytic form.
In many real-life applications, such simple single-stage models do not really apply because several correlated decisions have to be sequentially taken. It is thus quite natural to consider two-period newsboy models to analyze the structure of optimal decisions in such multiperiod decision processes. This class of models is characterized by several features: the structure of demand uncertainty, the structure of the decision process and related costs and the way
unsatisfied demands at a given period are dealt with.
In this paper, we consider style-goods type products with a short life cycle. For this kind of
product, a two-period decision model appears quite naturally. The induced costs are purchasing
costs, inventory holding costs and backorder costs. The demands at the first and second period
are described by independent random variables, with known probability distributions. We assume that at the end of the season, the remaining inventory can be sold to a specific market with
a given salvage value.
In the literature about style-goods production and inventory problem, most of the models
are a newsboy single-period problems (Khouja 1999). However, several two-stage extensions
have been developed. All these papers exploited the two-period horizon in order to improve the
inventory management process facing the demand uncertainty. Such two-period decision processes permit one to adapt the inventory levels to the demand variability. In other words, using a
single period it is ordered only once, at the beginning of the season before information about the
effective demand is available. On the contrary, in a two-period model, after the first order, the
realized demand of the first period can be observed and a second order is made, which clearly
exploits this information. Several authors have considered such models. First, Hillier Lieberman (2001) analyzed a two-period model with uniformly distributed independent demands. Via
a dynamic programming approach, these authors analytically solved this model and proposed
an explicit optimal order-up-to policy. Lau Lau (1997,1998) developed lost sales two-period
models and proposed numerical solutions via dynamic programming. Bradford Sugrue (1990)
proposed another class of model in which the second period demand is correlated to the first
period demand. A bayesian update for the second period’s demand forecast can thus be used
after having observed the value of the first period demand. These authors determined a conditional order-up-to policy for the second period and an optimal order quantity for the first period.
Another important two-period model has been proposed by Fisher Raman (1996). In this paper
the demand of the whole horizon and the demand of the first period are characterized via a joint
probability density function. Furthermore, the order size for the second period is constrained
by a limited amount. Gurnani Tang (1999) considered a two-period model with a first period
demand equal to zero. In their model, the dynamic structure concerns available information for
the sequential decisions: at the end of the first period, exogenous information is collected which
permits one to update the forecast for the second period demand. Choi, et al. (2003) proposed a
quite similar two-stage newsboy model with an update of the forecast of the second-period demand via some market information. Donohue (2000) applies a similar approach for developing
supply contracts.
The contributions of our model are the followings:
• First, the periodic ordering process is quite general in the sense that at each time period
orders can be made for the different subsequent periods, possibly with different costs,
• Second, the periodic selling process is quite general, in the sense that, in addition to the
classical selling process, it is possible, at the beginning of each period, to sell a part of
the available inventory to a parallel market, at a given salvage value,
• Third, the data are dynamic : the selling prices, costs, salvage values and demand probability distributions are period-dependent,
• Fourth, the model includes initial inventory and initially fixed order quantities to be delivered in the different periods.
The remaining part of this paper is structured as follows: the second section describes the model
(namely the complete decision process, the information structure and the costs and profits structure), the third section details the objective function and the dynamic programming approach.
Numerical examples are solved in section four. The last section is dedicated to the conclusion.
2
2.1
Decision process and information structure
Demand processes description
Define D1 and D2 as the demand at the first and the second period respectively. These random
variables are characterized by probability distributions F1 (·) and F2 (·) and probability density
functions f1 (·) and f2 (·).
2.2
Decision process description
First, the state variables of the model are the inventory level at the beginning of each period,
X1 and X2 and the inventory level at the end of each period I1 and I2 (I0 being the given initial
inventory for the problem). The decision variables of the model are as follows. First, we define
Qts as the quantity ordered at the beginning of period t to be received at the beginning of period
s (with t ≤ s and t, s = 1, 2, 3). Then we introduce, for each period, the variable St , the
quantity that is salvaged (to the parallel market) at the beginning of period t (with t = 1, 2, 3).
We will show that the decision variables Q33 and S3 can be optimally chosen directly as explicit
functions of the other variables.
Figure 1 presents the structure of the decision process and demand realization, which is
the following:
the available inventory at the beginning of the first period, before current orders are decided and
demand occurs, is
X1 = I0 + Q01 ,
where, in fact, I0 and Q01 can be considered as data. Then decision variables Q11 , Q12 and S1
are fixed. Then, the demand D1 occurs and the available inventory at the end of the first period
is given by
I1 = X1 + Q11 − S1 − D1 .
The decision structure for the second period is similar, with the initial inventory given by
X2 = I1 + Q02 + Q12 ,
and the final inventory given by the expression
I2 = X2 + Q22 − S2 − D2 .
The terminal decision process is then as follows: after demand occurs in the second period, it
is optimal to order Q33 = −I2 units to satisfy backlogged demand (when I2 < 0) or to sell
S3 = I2 units with a salvage value to eliminate the remaining inventory (when I2 ≥ 0).
2.3
Costs and profits structure and assumptions
In each period, any demand is charged at a price Pt , even if not immediately delivered. The
unit order cost of Qts is cts . In the case of a positive inventory at the end of a period, an
inventory holding cost ht is paid. Unsatisfied orders in period t are backlogged to the next
period, with a penalty bt . We will show that under the assumptions of this paper, it is optimal
that all backlogged orders in a given period be satisfied at the beginning of the next period. The
unit salvage value at the beginning of period t is given by st . It is necessary to introduce some
-D
-S
+Q )
I
I
(s )
X
I
(P
-D
h
b
-S
(c
)
)
+Q +Q
(c
I
I
(s )
X
I
+Q
(P
h
b
-S (s )
I
)
+Q (c
)
+Q (c
)
Figure 1: Decision process
assumptions about the different periodical costs in order to guarantee the coherence and interest
of our model. These assumptions could be classified in three categories:
2.3.1
Type 1 assumptions:
c11 <c22 + b1 , c11 <c12 + b1 , c12 <c33 + b2 and c22 <c33 + b2 .
These constraints aim at avoiding situations with systematic backlogs of demands to the next
period. For example, if the first constraint is not satisfied, the optimal policy will consist of
backlogging the first period demand to the second period and to satisfy this demand with a
second period order (with c22 as unit order cost).
2.3.2
Type 2 assumptions:
s2 <c11 + h1 , s3 <c12 + h2 , s3 <c11 + h1 + h2 and s3 <c22 + h2 .
These constraints aim at avoiding situations where it would be profitable to order at a given
period in order to sell to the parallel market at a salvage price. For example, if the first constraint
is not satisfied, the optimal policy will consist of ordering an infinite Q11 quantity in the first
period and selling it at a salvage price s2 in the second period.
2.3.3
Type 3 assumptions:
s1 <c11 , s2 <c22 , s2 <c12 and s3 <c33 .
These constraints aim at avoiding other situations where it would be profitable to order at a given
period and to sell at the delivery period to the parallel market at the corresponding salvage price.
For example, if the first constraint is not satisfied, the optimal policy will consist of ordering an
infinite quantity in the first period and selling it at a salvage price s1 in the same period.
2.4
Terminal conditions
The optimal value of the decision variables Q33 and S3 can be shown to be explicit functions of
the state variable I2 as follows (Cheaitou, et al.,2005):
if I2 ≤ 0 ⇒ Q∗33 = −I2 and S3∗ = 0,
if I2 ≥ 0 ⇒ Q∗33 = 0 and S3∗ = I2 .
3
3.1
(1)
(2)
The dynamic programming approach
The model
First, we recall the equilibrium equations of the system,
X1 = I0 + Q01
(3)
I1 = X1 + Q11 − D1 − S1
(4)
X2 = I1 + Q02 + Q12 = X1 + Q11 − D1 − S1 + Q02 + Q12
(5)
I2 = X2 + Q22 − D2 − S2
(6)
Introduce Π(I0 , Q01 , Q02 , Q11 , Q12 , Q22 , Q33 , S1 , S2 , S3 ) as the expected profit with respect to
the random variables D1 and D2 . This expected profit Π(·) is formulated as follows,
Π(I0 , Q01 , Q02 , Q11 , Q12 , Q22 , Q33 , S1 , S2 , S3 ) =
Z ∞
D1 f1 (D1 ) dD1 + s1 S1 − c11 Q11 − c12 Q12
P1
0
Z X1 +Q11 −S1
− h1
(X1 + Q11 − S1 − D1 )f1 (D1 ) dD1
0
Z ∞
− b1
(D1 − X1 − Q11 + S1 )f1 (D1 ) dD1
X1 +Q11 −S1
Z ∞
+ P2
D2 f2 (D2 ) dD2 + s2 S2 − c22 Q22
0
Z X2 +Q22 −S2
− h2
(X2 + Q22 − S2 − D2 )f2 (D2 ) dD2
0
Z ∞
− b2
(D2 − X2 − Q22 + S2 )f2 (D2 ) dD2
(7)
X2 +Q22 −S2
X2 +Q22 −S2
Z
+ s3
(X2 + Q22 − S2 − D2 )f2 (D2 ) dD2
Z0
∞
− c33
(D2 − X2 − Q22 + S2 )f2 (D2 ) dD2
X2 +Q22 −S2
3.2
Problem decomposition
Using a dynamic programming approach, this problem can be decomposed into two one-period
subproblems that are the followings. The first subproblem is associated to the second period.
The solution of this problem is optimal value of the second-period decision variables, namely
Q∗22 and S2∗ . These variables are expressed as a function of the state variable X2 and are computed as the solution of the optimization problem
max {Π2 (X2 , ξ2 (X2 ))} ,
ξ2 (X2 )
(8)
where we formally have ξ2 (X2 ) = (Q22 (X2 ), S2 (X2 )).
Then, the second subproblem exploits ξ2∗ (X2 ) in order to find the optimal policy for the
first period, namely ξ1∗ (X1 ) = (Q∗11 (X1 ), Q∗12 (X1 ), S1∗ (X1 )). This optimal policy is obtained as
the solution of the problem
max {Π1 (X1 , ξ1 (X1 )) + ED1 {Π∗2 (X2 , ξ2∗ (X2 ))}} ,
ξ1 (X1 )
(9)
where Π1 (X1 , ξ1 (X1 )) is the expected first period profit function, while the second term is the
expectation, with respect to D1 , of the second period profit function, under the optimal policy
ξ2∗ (X2 ).
3.3
Second-period subproblem.
The objective function of the second period is defined by the following expression:
½ Z ∞
Π2 (X2 , Q22 , S2 ) =
P2
D2 f2 (D2 ) dD2 + s2 S2 − c22 Q22
0
Z X2 +Q22 −S2
−h2
(X2 + Q22 − S2 − D2 )f2 (D2 ) dD2
Z 0∞
−b2
(D2 − X2 − Q22 + S2 )f2 (D2 ) dD2
X2 +Q22 −S2
X2 +Q22 −S2
Z
+s3
(X2 + Q22 − S2 − D2 )f2 (D2 ) dD2
0
Z
¾
∞
−c33
(D2 − X2 − Q22 + S2 )f2 (D2 ) dD2
(10)
X2 +Q22 −S2
This class of model has been analyzed by Cheaitou, et al. (2005). These authors have shown
that the objective function defined by (10) is a concave function of Q22 and S2 . Furthermore,
the following properties have been proven.
∗
Property 1: The optimal values of the two decision variables Q∗22 and S22
can not be
simultaneously positive.
Property 2: If I1 < 0 and X2 < 0, the optimal quantity Q∗22 satisfies
X2 + Q∗22 ≥ 0.
Optimal policy: From Cheaitou, et al. (2005) the second period optimal policy is given
by
½
if X2 <
Y1∗
⇒
Q∗22 = Y1∗ − X2 ,
S2∗ =
0,
½
if
Y1∗
Q∗22 = 0,
S2∗ = 0,
(12)
Q∗22 =
0,
∗
S2 = X2 − Y2∗ .
(13)
Y2∗
≤ X2 ≤
(11)
⇒
and
½
if X2 >
Y2∗
⇒
These conditions amount to
Q∗22 = max (Y1∗ − X2 ; 0) and S2∗ = max (X2 − Y2∗ ; 0)
µ
with
Y1∗
=
F2−1
b2 + c33 − c22
b2 + c33 + h2 − s3
µ
¶
and
Y2∗
=
F2−1
b2 + c33 − s2
b2 + c33 + h2 − s3
Note that it is easily seen that under the assumptions of this paper, one has
Y1∗ < Y2∗ .
(14)
¶
.
(15)
3.4
First period subproblem.
Using the results of the second period subproblem, it is possible to numerically solve the first
period subproblem and compute the optimal values of the decision variables of the first period,
Q11 , Q12 and S1 . The total expected profit function Π(·), under the optimal second-period
policy ξ2∗ (X2 ) becomes:
Π(I0 , Q01 , Q02 , Q11 , Q12 , S1 ) = Π1 (X1 , Q11 , Q12 , S1 ) + ED1 {Π∗2 (X2 , Q∗22 (X2 ), S2∗ (X2 ))} (16)
The optimization problem to solve for this subproblem is then the following:
Π∗ (I0 , Q01 , Q02 ) =
=
max
Q11 ,Q12 ,S1
{Π(I0 , Q01 , Q02 , Q11 , Q12 , S1 )}
½ Z ∞
P1
D1 f1 (D1 ) dD1 + s1 S1 − c11 Q11 − c12 Q12
max
0
Z X1 +Q11 −S1
−h1
(X1 + Q11 − S1 − D1 )f1 (D1 ) dD1
0
Z ∞
−b1
(D1 − X1 − Q11 + S1 )f1 (D1 ) dD1
Q11 ,Q12 ,S1
X1 +Q11 −S1
+ED1 {Π∗2 (X2 , Q∗22 (X2 ), S2∗ (X2 ))}}
(17)
By using the optimal values Q∗22 (X2 ) and S2∗ (X2 ) defined by (14), it is easily seen that the
total expected profit function Π(I0 , Q01 , Q02 , Q11 , Q12 , S1 ) is concave w.r.t. Q11 , Q12 and S1 ,
in such a way that the optimization problem described in equation (17) has a unique maximum.
However, no closed form formula exists for general demand probability distributions and the
problem has to be numerically solved.
4
Numerical examples
In this section we show, via some numerical applications, how our model gives some insight
regarding this type of general two-stage inventory decision process. In the first example we
show the behavior of the first period optimal policy in function of the initial inventory and of
the salvage value s1 . In the second example we show the impact of the variability and of the
difference between the costs on the optimal value of Q12 and the expected optimal value of
Q22 . For the two following examples we will suppose normal distributed demands for the two
periods.
4.1
Example 1
Q11
S1
800
700
600
Q 11, S 1
500
400
300
200
100
0
100
500
900
1300
1700
2100
2500
2900
Initia l inventory I 0
Figure 2: First period optimal policy - low s1 value
In the first example we consider the following parameter values: D1 = N [1000, 200],
D2 = N [1000, 200], h1 = 5, h2 = 5, P1 = 100, P2 = 100, b1 = 25, b2 = 25, c11 = 50,
c12 = 30, c22 = 50, c33 = 50, s2 = 20, s3 = 20, Q01 = 0 and Q02 = 0, with N[µ,σ] a normal
distribution with mean of µ and standard deviation σ. In this example, the optimal policy is
numerically computed as a function of the initial inventory I0 . In the first part (Figure 2) we
have s1 = 20 and in the second part (Figure 3) we have s1 = 29. It is clearly seen, from this
Q11
S1
1800
1600
1400
Q 11, S 1
1200
1000
800
600
400
200
0
100
500
900
1300
1700
2100
2500
2900
Initial inventory I 0
Figure 3: First period optimal policy - high s1 value
numerical example, that the first period optimal policy has the same structure as the second
period, for which the explicit form is known. From figures 2 and 3 we deduce that there are
two thresholds. For the values of I0 between these two thresholds, the optimal values for the
decision variables Q11 and S1 are both equal to zero. Below the low threshold only the optimal
value of Q11 is positive and above the high threshold only the optimal value of S1 is positive.
The higher threshold value depends on s1 (see Figure 2 and Figure 3). When s1 increases the
higher threshold value decreases. It can be seen that the low threshold value depends on c11 ,
while the difference between the two thresholds is proportional to the difference c11 − s1 .
4.2
Example 2
Q11
Q12
Q22
3000
Q 11, Q 12 , Q 22
2500
2000
1500
1000
500
0
23
29
35
41
47
53
59
65
71
The cost c12
Figure 4: Early and late reorder modes: optimal values with low D1 variability
In the second example, we have the following parameter values : D2 = N [1000, 200],
h1 = 5, h2 = 5, P1 = 100, P2 = 100, b1 = 25, b2 = 25, c11 = 50, c22 = 50, c33 = 50, s1 = 20,
s2 = 20, s3 = 20, I0 = 0, Q01 = 0 and Q02 = 0. In this example, the optimal values are
computed as a function of the unit ordering cost c12 . For the first part of this example (Figure 4)
we consider the first period demand with a low variability, namely we have D1 = N [1000, 200].
For the second part (Figure 5) we suppose a higher demand variability, i.e. D1 = N [1000, 400]
It can be seen that the c12 values can be divided into three intervals.
• The first interval corresponds to small c12 values, for which the optimal value of Q12 is
positive and the expected optimal value of Q22 is equal to zero.
Q11
Q12
Q22
3000
Q 11, Q 12 , Q 22
2500
2000
1500
1000
500
0
23
29
35
41
47
53
59
65
71
The cost c12
Figure 5: Early and late reorder modes: optimal values with high D1 variability
• The second region corresponds to mean c12 for which the optimal values of Q12 and Q22
are both positive.
• The third region corresponds to the situation c12 > c22 , where the optimal value of Q12 is
equal to zero.
Another insight can be seen with respect to variability of D1 . When D1 variability increases
several phenomenons occur, see figures 4 and 5:
• The width of the second region (for medium c12 values) increases toward the low c12
values side. In fact Q22 is decided after observing the realization of D1 and therefore this
decision benefits from the realization information. Q12 is fixed before the realization of
D1 . So when the variability of D1 increases it is profitable to wait for realization of the
first period demand and then to use this information to optimize the reorder, and this even
if the cost (c22 ) is high.
• For low c12 values, the optimal value of Q12 increases with demand variability while the
optimal value of Q11 decreases. This corresponds to the different numerical costs values
that we have defined. In our case, for small c12 values, the differences between c11 and
c12 and between c12 and c22 are important, in such a way that for high D1 variability,
it becomes more profitable to order a small Q11 value and a bigger Q12 quantity, and if
necessary reorder a Q22 quantity in period 2. So in this case Q12 increases to face the first
period variability and to satisfy the second period demands.
• For medium c12 values, the optimal Q11 and Q12 values decrease while the expected
optimal Q22 value increases to face the variability of the first period demand and to satisfy
the second period demand.
• For high c12 values, the optimal value of Q12 is equal to zero (c12 > c22 ). When the
variability of D1 increases, the optimal Q11 value increases and the expected optimal Q22
value decreases. This is associated to the fact that c11 is equal to c22 in our example, and
that the holding inventory cost in the first period is very small with respect to the shortage
cost b1 . So in this case, Q11 faces the high D1 variability and satisfies a part of the second
period orders.
5 Conclusion
In this paper, we have analyzed a general two-period newsboy model, including initial inventory,
pre-fixed orders and periodic salvage opportunities. Via a dynamic programming approach, we
characterized the structure of the optimal policy and provided a numerical approach in order to
compute this optimal policy in a general setting. Via the numerical examples, we gave some
insight into the effect of the parameters values on the solution.
6
Biography
Ali Cheaitou studied mechanical engineering at the Lebanese University(Beirut). In 2004 he
prepared a DEA in industrial engineering at Ecole Centrale Paris(France). Since October 2004,
he is PhD student and teaching assistant in the Laboratoire Génie Industriel of the Ecole Centrale Paris. He is mainly interested in production planning and stochastic optimization.
Christian van Delft is an associate professor in the department of Industrial Management
and Logistics, at HEC-Paris School of Management (France), where he teaches and researches
in operations management, operations research and quality control. He has published in IEEE
Transactions on Robotics and Automation, Annals of Operations Research, European Journal
on Operations Research, Optimal Control: Applications and Methods and others.
Yves Dallery is a Professor of Industrial Engineering at /Ecole Centrale Paris/. His research interests are in production management, supply chain management and service operations management, with a special emphasis on modelling and optimization.
Zied JEMAI is an assistant Professor in the department of Industrial Engineering, at Ecole
Central Paris, where he teaches and researches in stochastic models and supply chain management. He has published in IIE Transactions and European Journal of Operational Research.
References
Bradford, J. W. Sugrue, P. K. 1990. A bayesian approach to the two-period style-goods inventory problem with single replenishment and heterogeneous poisson demands, Journal of
the operational research society 41: 211–218.
Cheaitou, A., van Delft, C., Dallery, Y. Jemai, Z. 2005. Generalized newsboy model with initial
inventory and two salvage opportunities. Working paper.
Choi, T.-M., Li, D. Yan, H. 2003. Optimal two-stage ordering policy with bayesian information
updating, Journal of the operational research society 54: 846–859.
Donohue, K. L. 2000. Efficient supply contracts for fashion goods with forecast updating and
two production modes, Management science 46: 1397–1411.
Fisher, M. Raman, A. 1996. Reducing the cost of demand uncertainty through accurate response to early sales, Operations research 44: 87–99.
Gurnani, H. Tang, C. S. 1999. Optimal ordering decisions with uncertain cost and demand
forecast updating, Management science 45: 1456–1462.
Hillier, F. S. Lieberman, G. J. 2001. Introduction to operations research, 5th edn, McGraw-Hill.
Khouja, M. 1999. The single-period (news-vendor) problem: literature review and suggestions
for future research, Omega 27: 537–553.
Lau, A. H.-L. Lau, H.-S. 1998. Decisions models for single-period products with two ordering
opportunities, International journal of production economics 55: 57–70.
Lau, H.-S. Lau, A. H.-L. 1997. A semi analytical solution for a newsboy problem with midperiod replenishment, The journal of the operational research society 48: 1245–1253.