International Journal of Production Economics 187 (2017) 216–228 Contents lists available at ScienceDirect Int. J. Production Economics journal homepage: www.elsevier.com/locate/ijpe Optimality of (s, S) policies in EOQ models with general cost structures a,⁎ b Sandun Perera , Ganesh Janakiraman , Shun-Chen Niu a b MARK b School of Management, University of Michigan–Flint, Flint, MI 48502, USA Naveen Jindal School of Management, The University of Texas at Dallas, Richardson, TX 75080, USA A R T I C L E I N F O A BS T RAC T Keywords: EOQ models (s, S) optimality General ordering/procurement cost structures The classical economic-order-quantity (EOQ) model is at the heart of supply chain optimization and the theory of inventories. We study an extension of the original EOQ model that permits a minimal set of assumptions on the ordering/procurement and holding/backorder costs, and establish necessary and sufficient conditions for the existence of an optimal policy of the (s, S ) type. Our work lends theoretical credibility to the practice of using (s, S ) policies for virtually any cost structure of practical interest. Our proof is constructive and elementary, in the sense that it is based on first principles that do not rely on advanced mathematical machinery. We also prove that an optimal policy, not necessarily of the (s, S ) type, always exists within our general EOQ framework. 1. Introduction The economic-order-quantity (EOQ) model (Harris, 1913)1, along with the Newsvendor model, is at the heart of the practice of supply chain optimization and the theory of inventories. The EOQ model assumes that demand arises continuously and at a constant rate. Several versions of this model of trading off inventory-holding costs (and, in some cases, shortage costs) with procurement/logistics costs (e.g., fixed ordering costs, volume discounts, etc.) are presented in basic texts and courses in Operations Research and Operations Management. The typical analysis of the EOQ model involves assuming a policy with the following structure: Place a constant order quantity every time the net-inventory level decreases to a threshold. The order quantity and the threshold are then optimized. This approach usually leads to square-root formulas or algorithms to determine these quantities (Porteus, 2002; Winston, 2004; Zipkin, 2000). In this paper, we take a step back and investigate the important issue of when/why such a policy structure—commonly referred to as (s, S )—is optimal in the EOQ model. We present an elementary proof for the existence of an optimal policy of the (s, S ) type in the EOQ model under general ordering/ procurement and holding/shortage cost structures. By elementary proof, we mean a rigorous proof from first principles that do not require significant mathematical machinery. This is the main contribu- tion of our work. To shed some light on this contribution relative to the literature, we will begin with a discussion both about our work and on the “EOQ proofs” that exist in the literature. The assumptions in the standard textbook version of the EOQ model are as follows. The ordering cost includes a set-up cost plus a linear variable cost. Inventory holding costs are also linear. Backordering is either not allowed or is allowed with a linear shortage cost. This is the model for which convenient square-root formulas are available for the optimal re-order point, s, and the optimal re-order quantity, S − s . Generalizations to accommodate volume discounts in ordering costs are also presented frequently; in such cases, the optimal values of s and S − s do not take simple square-root forms, but algorithms are presented for computing them. What is striking is that the use of an (s, S ) policy is usually taken as granted; and in fact, formal discussions of whether or not there exists an optimal policy of the (s, S ) type in the EOQ model appeared in the literature only recently (relative to the history of this model). For example, an (s, S )-optimality proof for a particular type of EOQ model without backordering is given in Lippman (1971). In this paper, the ordering cost structure is that of a set-up cost for placing an order plus a set-up cost for every new truck used, where a truck has a finite capacity. Beyer and Sethi (1998) appear2 to be the first authors that attempted an (s, S )-optimality proof for the EOQ model under a more general cost structure. Other extant optimality analyses for the standard EOQ model and a variety of ⁎ Corresponding author. E-mail addresses: [email protected] (S. Perera), [email protected] (G. Janakiraman), [email protected] (S.-C. Niu). 1 We refer the reader to Erlenkotter (1989, 1990) for interesting historical accounts on the EOQ model. 2 Beyer and Sethi state: “Every OR textbook contains a calculus derivation of the EOQ formula. However, such a derivation does not establish formally that the EOQ formula provides a production policy that minimizes the long-run average cost over the class of all admissible policies. Certainly there are many possible ways to prove the formula: But there does not seem to be a publication devoted directly to providing a mathematically rigorous proof of the famous formula. It is also possible that there are a number of papers dealing with complex inventory models, of which the simple lotsize model is a special case.” http://dx.doi.org/10.1016/j.ijpe.2016.09.017 Received 4 July 2016; Received in revised form 5 September 2016; Accepted 21 September 2016 Available online 23 September 2016 0925-5273/ Published by Elsevier B.V. International Journal of Production Economics 187 (2017) 216–228 S. Perera et al. inventory-holding and backordering cost functions are monotone. These assumptions are rather minimal, potentially the weakest reasonable assumptions within the EOQ framework. Hence, any cost structure of practical interest is subsumed in our formulation. Some well-known examples are all-unit discounts, incremental discounts, and truckload discounts. These are discussed, for example, in Altintas et al. (2008), Benton and Park (1996), Chen (2009), Federgruen and Lee (1990), Li et al. (2004, 2012), and Zipkin (2000). Other variants of these cost structures, such as modified all-unit discounts (cf. Chan et al., 2002) and generalized truckload discount (cf. Li et al., 2004), are also covered. The widely-studied multiple set-up costs structures (Alp et al., 2014; Lippman, 1969, 1971) and the interesting quantitydependent fixed costs scheme discussed in Caliskan-Demirag et al. (2012) are also allowed under our ordering-cost assumptions. Moreover, when a supplier imposes stringent constraints on the size of the orders, e.g., batch-ordering3, minimum-order-quantity (MOQ)4 and a combination of both batch-ordering and MOQ (cf. Zhu et al., 2015) restrictions, the cost of ordering the forbidden amounts could be (theoretically) considered to be infinite; our results also apply to these and other scenarios where the ordering costs could be infinite in some regions. Finally, we emphasize that some of the interesting orderingcost functions noted above are not increasing5, e.g., all-unit discounts6, batch-ordering, and MOQ. To the best of our knowledge, an attempt to study (s, S )-optimality for not-necessarily-increasing ordering-cost functions does not exist in the literature. The remainder of the paper is organized as follows. We present the model, assumptions, and the main results formally in Section 2. Section 3 is devoted to proving the main results. Finally, in Section 4, we present some concluding remarks. deterministic extensions can be found in Adelman and Klabjan (2005), Brimberg and Hurley (2006), Hassin and Meggido (1991), and Sun (2004). In contrast, it is interesting to note that multiple proofs of (s, S ) optimality in stochastic models exist and are celebrated (Scarf, 1960; Iglehart, 1963a, 1963b; Veinott, 1966; Zheng, 1991, 1994). These proofs are either for discrete-time (i.e., periodic review) models or for continuous-time models in discrete-space (i.e., demands and order quantities are integer-valued). The proofs in this literature do not readily accommodate the continuous-space, continuous-time features of the EOQ model. The proof proposed by Beyer and Sethi (1998) is based on the method of Quasi-Variational Inequalities (QVI). Their formulation allows a cost structure that is weaker than that in the standard EOQ model; but the QVI approach makes it necessary for the authors to introduce several technical assumptions on the cost functions (e.g., continuity and differentiability). In a subsequent paper, Presman and Sethi (2006) also apply the QVI method to analyze a stochastic version of the EOQ model in which the demand process is the sum of a deterministic process and a compound Poisson process. In this latter paper, the ordering-cost function is restricted to have the standard structure of a fixed set-up cost plus a linear variable cost. Our proof of (s, S )-optimality is entirely different and is built from first principles, which allows us to work with a rather general model with minimal cost assumptions. The proof is based on a lowerbounding approach, and it involves two steps. The first step is to consider a restricted class of policies that order only when net-inventory is non-positive and always raise it to a nonnegative level upon ordering. That this restriction is without loss of optimality requires a proof. To prove this, we develop a coupling that takes any policy that violates the desired property and constructs a new policy that satisfies the property while always having a lower amount of inventory and a lower amount of backorder than the original. Thus, by construction, the latter policy has lower cumulative holding and shortage costs than the former at any time epoch. To complete the proof, the ordering costs accumulated by the two policies also need to be considered. To this end, we develop an inequality that compares the ordering costs for these two policies over two suitably-chosen nested time intervals, and combine it with the other cost comparison above to show that the constructed policy has a long-run average total cost that is no greater than the original. In the second step, we show that the total costs for policies in the restricted class of policies above can be naturally decomposed into those in successive “ordering cycles”, defined as time periods that span two consecutive zero-inventory epochs (i.e., time instants when the net inventory is zero). We then exploit this cost decomposition to show that the long-run average costs of policies in this restricted class are bounded from below by the infimum of the average costs over any of such cycles. This lower bound then yields a simple characterization that an (s, S ) policy is optimal under our cost assumptions if and only if an optimal solution exists for the problem of minimizing the long-run average cost within the class of (s, S ) policies (i.e., whenever the infimum above is attained by an (s, S ) policy). To cover the scenario where the above infimum is not attained, that is, to have a full closure for the (s, S )-optimality problem within the EOQ framework, we also provide a simple sufficient condition on the ordering-cost function (lower semi-continuity) that ensures the existence of an optimal (s, S ) policy. Furthermore, whenever this existence does not prevail, we show that our approach enables us to constructively establish that an optimal policy, not of the (s, S ) type, always exists. Next, we briefly summarize the cost assumptions in our model, and indicate, in particular, the breadth of ordering cost structures that can be accommodated in our formulation. We only assume that the ordering-cost function is uniformly bounded from below by a strictly positive constant for all strictly positive order quantities, and that the 2. Model formulation and the main results In this section, we present our cost assumptions and state our main results. We first introduce the following notation: λ :=the constant demand rate, λ > 0, c(q ):=non-negative cost for ordering q units, q≥0, g(x ):=non-negative holding/shortage cost rate when net-inventory level is x, −∞ < x < ∞, and I0:=net-inventory level at time 0. In the standard EOQ model with backorder, it is assumed that c(·) is composed of a fixed set-up cost and a linear variable cost (i.e., c(q ) = K 1{q >0} + νq , where K > 0 and ν ≥ 0 are constants), and g(x ) is of the form h max(0, x ) + b max(0, − x ) for some positive constants h and b. In contrast, we only need the following minimal assumptions on c(·) and g(·). Assumption 1. The function c(·) satisfies c(0) = 0 and 0 < K ≤ c(q ) for all q > 0 , where c(q) is not necessarily finite-valued. 3 This means that there is a constraint on the order size that stipulates that orders must be integer multiples of a constant B; see Chen (2000) and Li et al. (2004). Such a constraint can be interpreted as saying that the ordering cost, c(·), is finite at all integer multiples of B, but is infinite elsewhere (see Fig. A.1 in Appendix A). 4 This type of order constraint is widely used. We refer the reader to Zhao and Katehakis (2006), where the authors note that: “Minimum Order Quantity (MOQ) is widely used in many industries (e.g., apparel, pharmaceutical, and consumer packaged products). In these industries, customers (e.g., distributors or retailers) typically have two choices, either not to order from their suppliers or to order at least a minimum quantity, namely the MOQ”. Such a cost scheme can be reinterpreted as saying that the cost of ordering q units, c(q), is infinite when q is strictly positive but is less than MOQ, and is finite elsewhere (see Fig. A.2 in Appendix A). The reader is also referred to Hellion et al. (2012), Kiesmüller et al. (2011), and Zhou et al. (2007) for further discussion regarding the importance of the MOQ constraint. 5 Throughout this paper, by “increasing” and “decreasing”, we mean “non-decreasing” and “non-increasing”, respectively. 6 The ordering-cost function for the all-unit discount scheme is illustrated in Fig. A.3 of Appendix A. 217 International Journal of Production Economics 187 (2017) 216–228 S. Perera et al. if the function α(·, ·) has a minimizer. Assumption 2. The function g(·) is increasing on [0, ∞) and decreasing on (−∞, 0], with lim x→±∞ g(x ) = ∞. Our second result provides a simple sufficient condition for the existence of a minimizer for α(·, ·). Now, for 0 ≤ a ≤ b , let C π[a, b] denote the total cost (i.e., the sum of ordering, holding and backordering costs) incurred over the time interval [a,b]7 for a given policy π . Here, a policy π is a specification of an infinite sequence of ordering time instants and order sizes. We will measure the performance of a policy π by its long-run average cost, which is defined as f π C π[0, T ] = lim sup . T T →∞ Theorem 2. If c(·) is lower semi-continuous9, then, α(·, ·) is guaranteed to have a minimizer; hence, there exists an (s, S ) policy that is optimal in Π . Since a minimizer for α(·, ·) may not exist, the remaining question is whether or not there exists a policy in Π (not necessarily of the (s, S ) type) with long-run average cost α*. Our final result answers this question in the affirmative. (1) π Our objective is to minimize f over an admissible policy class, which we specify next. Let tnπ and qnπ respectively be the n-th ordering epoch and the size of the n-th order under a given policy π . Observe that if lim n →∞ tnπ < ∞, then f π = ∞8. Hence, we will restrict our attention to policies that satisfy the condition that lim n →∞ tnπ = ∞; and formally define an admissible policy as follows: Theorem 3. An optimal policy always exists in Π . 3. Proofs of the theorems 3.1. Proof of Theorem 1 The first step in our proof is to restrict consideration from the class of all policies Π to a subset of policies that are more amenable to analysis. In light of Assumption 2, an immediate intuition is that any attempts to maintain the net-inventory level “as close to zero as possible” should result in a reduction in total cost. When backordering is not allowed, this intuition can be formalized as: Order only when the inventory level decreases to zero. Indeed, in that case, it is a wellknown result (Beyer and Sethi, 1998; Brimberg and Hurley, 2006; Lippman, 1971; Porteus, 2002; Zipkin, 2000) that restricting attention to this smaller policy class, which we denote by Π 0p , can be made without loss of generality. When backordering is allowed, it is a natural conjecture that the corresponding class of policies to consider is Π 0, defined as policies that order only when net-inventory is non-positive and always raise the inventory to a non-negative level upon ordering. (Clearly, Π 0p ⊂ Π 0 .) The following lemma rigorously confirms this conjecture10. Definition 1. An admissible policy π is a sequence {(tnπ , qnπ ): n ≥ 1}, where t1π ≥ 0 , tnπ < tnπ+1, lim n →∞tnπ = ∞, and qnπ > 0 for all n. Let Π denote the set of all admissible policies. Our goal is to identify conditions under which an optimal policy exists in Π ; that is, we seek a * policy π * ∈ Π that satisfies f π = infπ ∈Π f π . In addition, of particular interest is the question of whether or not the search for an optimal policy in Π can be limited to policies of the (s, S ) type, where an (s, S ) policy is defined to be one that raises the net-inventory level to S every time it decreases to s. Next, observe that for a given (s, S ) policy with s=x and S=y, where x < y , the long-run average cost α(x, y ) is given by y α (x , y ) = c (y − x ) + ∫ g (z ) x ( y − x )/ λ dz λ . (2) Let α*≔inf{(x, y): x < y} α (x , y ); and note that, in general, α* may not be finite. To have a clean setup, we shall assume that there exists at least one policy π in Π with f π < ∞. Clearly, this assumption can be made without loss of generality; and in particular, it implies that there must exist x 0 < y0 such that c( y0 − x 0 ), the right limit of g(·) at x0, and the left limit of g(·) at y0 are all finite. It then follows from Assumption 2 that α(x 0 , y0 ) < ∞, which ensures the finiteness of α*. In the same spirit, we will further assume that if I0 is positive, then the left limit of g(·) at I0 is finite. We are now ready to state the main results of this paper. Unless explicitly indicated otherwise, we assume henceforth that Assumptions 1 and 2 hold. The first result establishes a lower bound for f π for all π ∈ Π. Lemma 1. It is sufficient to restrict attention to policies in Π 0 ; that is, inf f π = inf f π . π ∈Π π ∈Π 0 Proof. Clearly, it is sufficient to show that for a given π ∈ Π⧹Π 0 , there ∼ exists a corresponding policy π∼ ∈ Π 0 that satisfies f π ≤ f π . π π For n ≥ 1, denote by xn and yn , respectively, the net-inventory levels just before and after the n-th order under a policy π in Π. Note that by definition, qnπ = ynπ − xnπ . Observe that for any n ≥ 1, the vector (xnπ , ynπ ) associated with the n-th order must belong to one of the following three categories: (i) 0 < xnπ < ynπ , (ii) xnπ ≤ 0 and ynπ ≥ 0 with xnπ < ynπ , or (iii) xnπ < ynπ < 0 . Clearly, every order for a π ∈ Π 0 is by definition in Category (ii). In contrast, any π ∈ Π⧹Π 0 must have at least one order in either Category (i) or Category (iii). Let π be an arbitrary policy in Π⧹Π 0 ; and let π∼ be the policy constructed from π according to the following prescription: Theorem 1. The long-run average cost of any policy π ∈ Π is bounded from below by α*. ∼ tnπ = tnπ + It follows immediately from Theorem 1 that if there exists a policy in Π whose long-run average cost exactly equals α*, then that policy is optimal. In particular, if a minimizer (x*, y*) for the function α(·, ·) exists in the set {(x, y ): x < y}, then the (s, S ) policy with s = x* and S = y* is optimal in Π . Thus, the following characterization holds. ∼ tnπ = tnπ + ∼ tnπ = tnπ xnπ λ ynπ λ and ∼ if xnπ > 0; and qnπ = qnπ , ∼ if ynπ < 0; ∼ qnπ = qnπ , otherwise. and qnπ = qnπ , and π For t ≥ 0 , denote by I (t ) the net-inventory level at time t under a policy π in Π (with I π (0) = I0 for all π ∈ Π ). Clearly, the inventory Corollary 1. An optimal policy of the (s, S ) type exists in Π if and only 7 Similar notation for other types of intervals, namely [a, b ), (a, b], and (a,b), will be adopted throughout this paper. 8 Suppose that lim n →∞ tnπ = M < ∞ for a policy π. Then, we have that tnπ ≤ M for all n ≥ 1, and this implies that for every N, the number of orders placed in the time interval [0, M ] exceeds N. For any T > M , let N = T 2 . Then, we have C π[0, T ] ≥ KT 2 . Therefore, C π[0, T ]/T ≥ KT for all T > M . Letting T → ∞ then yields f π is infinite for all π with lim n →∞ tnπ < ∞. 9 A function h: n → [ − ∞, ∞] is called lower semi-continuous at x 0 if for every ε > 0 there exists a neighborhood B ⊂ n of x 0 such that h(x) ≥ h(x 0) − ε for all x ∈ B . The function h is said to be lower semi-continuous on E ⊂ n if it is lower semi-continuous at every point in E. 10 When backordering is allowed, the necessary argument to extend the result by the above-cited authors is new, and not immediate. 218 International Journal of Production Economics 187 (2017) 216–228 S. Perera et al. Fig. 1. Construction of a policy π∼ ∈ Π 0 from a given π ∈ Π⧹Π 0 . trajectory {I π (t ), t ≥ 0} is completely determined by I0 and the sequence {(tnπ , qnπ ), n ≥ 1}. The inventory trajectories under both π and π∼ are illustrated in Fig. 1. It is easily seen from this figure that if ∼ ∼ xnπ > 0 for any n, then xnπ = 0 (see, e.g., epochs t2π and t2π ); and similarly ∼ π∼ π that if yn < 0 for any n, then yn = 0 (see, e.g., epochs t8π and t8π ). ∼ Therefore, π ∈ Π 0 by construction. We now compare the costs under π and π∼. Consider a given time interval [0, T ]. Observe that if an order in π is relocated to arrive at one in π∼, then this corresponding order in π∼ could be either a postponement or an early placement of the original. Consequently, the total number of orders under π∼ in [0, T ] can be larger than that of π over the same interval. To facilitate cost comparison, we will therefore consider the total cost under π for a possibly larger interval [0, T + δ ], where δ is an appropriately-chosen non-negative increment to T that depends on both π and T. Our intent is to pick δ so as to ensure that every order placed by π∼ within [0, T ] originates from an order placed by π within [0, T + δ ]. For this purpose, let τ∼π (T ) be the epoch at which the last order under π∼ in [0, T ] is placed; let τ π (T ) be the epoch at which the “corresponding” order under π (the one with the same order index) is placed; and denote by y π (T ) the net-inventory level under π immediately after the placement of this corresponding order. Then, we will let δ = (y π (T ))− / λ , where (y π (T ))−≔ − min(y π (T ), 0) (the negative part of y π (T )), and claim that the following inequality holds: π∼ π π − C [0, T ] ≤ C [0, T + ( y (T )) / λ] for all T ≥ 0. and (5) yields (3). ∼ We are finally ready to show that f π ≤ f π holds under our π construction. If f = ∞, we obviously have nothing to prove; we will therefore assume that f π is finite. We begin by noting that (3) can be rewritten as: ∼ C π[0, T + ( y π (T ))− / λ] T + ( y π (T ))− / λ C π [0, T ] ≤ . T T + ( y π (T ))− / λ T Upon taking lim sup , (6) yields ∼ fπ (y π (T ))− 1 lim sup . π ≤ 1 + f λ T →∞ T lim sup n →∞ (3) ( y π (Tn ))− = 0. Tn (8) In words, this means that whenever there exists a sequence {y π (Tn ), n ≥ 1} that strictly descends to −∞ as n → ∞, the average growth of the magnitude ( y π (Tn ))− is negligible relative to that of Tn. Since g(·) is assumed to satisfy lim x→−∞ g(x ) = ∞ and f π is finite, it is intuitive that (8) should hold. We next formalize this intuition and complete the proof by establishing (8). First, observe that whenever y π (T ) is negative (e.g., when T = t8π in Fig. 1), we have τ∼π (T ) = τ π (T ) − ( y π (T ))− / λ . This scenario is illustrated in Fig. 2 (T′ in the figure indicates another possible location for T). From this figure, we see that the inventory trajectory under π in the interval [τ∼π (T ), τ π (T )) (with the right endpoint open) is no higher than the diagonal border of the shaded region spanning the same interval. That is, I π (t ) ≤ − λ(t − τ∼π (T )) holds for any t in [τ∼π (T ), τ π (T )). It then easily follows that the shortage cost under π during [τ∼π (T ), τ π (T )) is no less than g( y π (T )/2)(( y π (T ))− /2)(1/λ ) (see corresponding interval markers in Fig. 2). Therefore, (4) π always holds. Obviously, (4) is valid if τ (T ) ≤ T . Suppose on the other hand that T < τ π (T ). Then, since τ∼π (T ) ≤ T by assumption, we neces∼ sarily have y π (T ) < 0 (e.g., when T is located in the interval [t8π , t8π ) in π Fig. 1), implying that I (T ) must not be positive. Hence, we have λ(τ π (T ) − T ) ≤ − y π (T ) = ( y π (T ))−; and this is equivalent to (4). We now prove (3). To facilitate the argument, it will be convenient to split the total cost into two parts and then compare them separately. For 0 ≤ a ≤ b and any given π , let R π [a, b] and H π [a, b] be the cumulative ordering cost and, respectively, the cumulative sum of both holding and shortage costs over the time interval [a, b]. By definition, we have C π[a, b] = R π [a, b] + H π [a, b]. Recall that τ∼π (T ) is the last ordering epoch under π∼ in [0, T ]. It follows that ∼ ∼ R π [0, T ] = R π [0, τ∼π (T )] = R π [0, τ π (T )] ≤ R π [0, T + (y π (T ))− / λ], (7) Clearly, if y π (T ) is uniformly bounded from below for all T, then (7) ∼ implies11 that f π ≤ f π . (Note in particular that if backordering is not allowed, then this conclusion is immediate.) Therefore, to complete the proof, it is sufficient to show that for any given increasing sequence π − π − {Tn}∞ n=1 satisfying lim n →∞Tn = ∞, lim n →∞( y (Tn )) = ∞, and ( y (Tn )) is strictly increasing in n with ( y π (T1))− > 0 (the last requirement is without loss of generality since we are concerned with lim sup ), we have Before proving (3), we will first show that τ π (T ) ≤ T + ( y π (T ))− / λ (6) (5) 11 In a previous version of this paper, the requirement of having a uniform lower bound was introduced. The authors are grateful to two anonymous reviewers of that earlier version for demonstrating that this requirement is not necessary. Their contributions led to a significantly shortened proof of Theorem 1. where the inequality is due to (4). Clearly, we also have ∼ H π [0, T ] ≤ H π [0, T ] ≤ H π [0, T + (y π (T ))− /λ]. Summing this inequality 219 International Journal of Production Economics 187 (2017) 216–228 S. Perera et al. decompose the total cost over a given interval under a policy into those over successive ordering cycles, defined as intervals that span two consecutive zero-inventory epochs; and then bound the total cost in every ordering cycle from below by α* times the length of the same cycle. We begin with an analysis of the total costs in successive ordering cycles. Consider an arbitrary policy π in Π 0. Let u1π ≔max[0, t1π + x1π / λ] and v1π ≔t1π + y1π / λ ; and for i ≥ 2 , let uiπ ≔tiπ + xiπ / λ and viπ ≔tiπ + yiπ / λ . We will formally refer to the closed time interval [uiπ , viπ ], i ≥ 1, as the i-th ordering cycle. Note that, since π ∈ Π 0 , we have xiπ ≤ 0 , yiπ ≥ 0 , and xiπ < yiπ for all i ≥ 1; therefore, these intervals are all well defined (and have positive durations). It is also easily seen that viπ = uiπ+1 holds for all i ≥ 1. These notations are illustrated in Fig. 3. Next, let κiπ ≔c(yiπ − xiπ ) + ∫x yiπ π i g (z ) dz , λ for i ≥ 1. (13) For i ≥ 2 , we will assign (or define) κiπ as the total cost associated with the i-th order, or alternatively with the i-th ordering cycle12. For i=1, Fig. 2. Lower bound for shortage cost over [τ∼π (T ), τ π (T )) . Fig. 3. Illustration of ordering cycles. π − π − π π π − observe that t1π + x1π / λ could be strictly negative so that [u1π , v1π ] may not be a “full” cycle; therefore, we should and will assign δ1π ≔c(y1π − x1π ) + H π [u1π , v1π ] (≤κ1π ) as the total cost associated with the first ordering cycle. Given the above cost assignments, we will next establish a lower bound for the total cost over a time interval. Surprisingly, for a given time epoch T, instead of working with the default choice of interval [0, T ], the key idea turns out to be the consideration of a suitablyenlarged time interval. Details are now developed. Consider the last order in [0, T ] under π . For t ≥ 0 , denote by N π (t ) the total number of orders in [0, t ]; formally, π g( − (y (T )) /2)(y (T )) /(2λ ) ≤ C [τ (T ) − (y (T )) / λ , τ (T )] ≤ C π[0, τ π (T )] ≤ C π[0, T + (y π (T ))− / λ], (9) where the last inequality is again due to (4). Now, for our given π , let {Tn}∞ n=1 be a sequence satisfying the stipulations in conjunction with (8). (An example can be constructed to show that such sequences could exist.) Then, after setting T = Tn in (9) and dividing the resulting inequality by Tn + ( y π (Tn ))− / λ , we see, in light of the assumed finiteness of f π , that for any ϵ > 0 , g( − ( y π (Tn ))− /2)(y π (Tn ))− /(2λ ) ≤ fπ + ϵ Tn + ( y π (Tn ))− / λ N π (t ) ≔max{n ≥ 1: tnπ ≤ t}. (10) holds for all sufficiently large n; and furthermore, g( − (y π (Tn ))− /2) is necessarily finite. Since (y π (Tn ))− is positive, a rearrangement of (10) now yields π − ⎤ Tn 1 ⎡ g( − ( y (Tn )) /2) − 1⎥ ≤ . ⎢ π λ⎣ 2( f + ϵ) ( y π (Tn ))− ⎦ Whenever N π (T ) ≥ 1, denote by vNπ (T ) the right endpoint of the ordering cycle associated with the N π (T )-th order; that is, let vNπ (T )≔v Nππ (T ). Clearly, we have vNπ (T ) = T + I π (T )/ λ . We will next examine the total cost under π over the interval [0, T + (I π (T ))+ / λ], where (I π (T ))+≔max(I π (T ), 0) (the positive part of I π (T )). The motivation behind this enlargement of [0, T ] is that whenever vNπ (T ) overshoots T, which occurs if and only if I π (T ) > 0 , (11) Finally, since lim x→−∞ g(x ) = ∞, we can further assume without loss of generality that the left-hand side of (11) is positive. It follows that 0< ⎡ g( − ( y π (Tn ))− /2) ⎤ ( y π (Tn ))− ≤λ⎢ − 1⎥ ⎣ ⎦ Tn 2( f π + ϵ) Letting n → ∞ in (12) then completes the proof. 12 Note that the ordering cycles are not disjoint; hence, the language here may appear inappropriate. However, the intersection between any two consecutive cycles is a single point. Therefore, the total holding and shortage costs over any time interval are not affected by these intersections. If we adopted disjoint ordering cycles, then the indexing of the successive ordering costs would be complicated by the fact that orders could appear at either end of a cycle (see Fig. 3). Our cost assignment scheme here is intended to overcome this notational complication, by allowing intersections in the definition of ordering cycles. −1 . (12) □ Given the reduction established in Lemma 1, the next step is to analyze the average costs of policies in Π 0. Our approach is to first 220 International Journal of Production Economics 187 (2017) 216–228 S. Perera et al. the entire N π (T )-th ordering cycle would be included in the cost calculation, and this facilitates the development of a “clean” lower bound. It is easily seen that13 C π[0, T ] ≥ H π [T − (I π (T ))− / λ , T ] ≥ g( − (I π (T ))− /2)((I π (T ))− /2)(1/λ ). N π (T ) C π[0, T + (I π (T ))+ / λ] = H π [0, u1π ) + δ1π + ∑ κiπ + H π [min(vNπ (T ), T ), T ], (14) i =2 ∞> where we have assumed without loss of generality that T is sufficiently large so that N π (T ) ≥ 2 . Observe further that, for i ≥ 2 , the average cost incurred in the i-th ordering cycle is given by κiπ /(viπ − uiπ ). Since viπ − uiπ = (yiπ − xiπ )/ λ , it follows from (2) and (13) that κiπ = α(xiπ , yiπ )(viπ − uiπ ) ≥ α*(viπ − uiπ ). Noting that the first two and the last terms on the right-hand side of (14) are non-negative, we then have Consider any (x, y ) with x < y . First, observe from (2) and Assumption 2 that when x > 0 or y < 0 , we have α(x, y ) ≥ α(0, y − x ) or α(x, y ) ≥ α(x − y, 0), respectively. Hence, we can limit our search of (x*, y*) to + 0 ≔{(x , y ): x < y and x ≤ 0 ≤ y}. We will next argue that we can further restrict the search to a closed and bounded subset of + 0 . The idea is to show that α(x, y ) is uniformly bounded from below when y − x is sufficiently large. For a given q > 0 , let (x, y ) be any vector in + 0 such that y − x = q . Then, from (2), we have κiπ ≥ α*(vNπ (T ) − u 2π ) i =2 ⎞ ⎛ I π (T ) = α*⎜T + − u 2π ⎟ . ⎠ ⎝ λ Since π ∈ Π 0 , it is easily seen that if I0 ≤ 0 and that if I0 > 0 (i.e., u1π > 0 ), then (i.e., u1π = 0 ), (15) then u 2π ≤ q1π / λ ; u 2π = u1π + q1π / λ=(I0 + q1π )/ λ . Therefore, y u 2π ≤ (I0 )+ + q1π λ α (x , y ) = . (16) Lemma 2. For any policy π ∈ Π 0 , ⎛ (I0 )+ + q1π ⎞ I π (T ) ⎟⎟ C [0, T + (I (T )) / λ] ≥ α*⎜⎜T + − λ λ ⎠ ⎝ π c (q ) + ∫ g (z ) x q /λ q . (23) Since g(z ) → ∞ as z → ∞, we have that for a given M > 0 , there exists a positive z 0 such that g(z ) > M for all z > z 0 . It follows that, for all y > z 0 , Recall that our aim is to show that f π ≥ α*. In light of this objective, we see that the significance of Lemma 2 is that it uncovers the fact that the successful development of a tight lower bound for the average cost under π hinges on the asymptotic behavior of I π (T )/ T as T → ∞. We now take up this task. Dividing both sides of (17) by T + (I π (T ))+ / λ and using the fact that I π (T ) = (I π (T ))+ − (I π (T ))− lead to ∫z ∫0 0 ∫z y M dz = M (y − z 0 ). 0 y g(z )dz ≥ ∫z y g(z )dz > M (y − z 0 ). 0 ⎛y z ⎞ α (x , y ) > M ⎜ − 0 ⎟ . q⎠ ⎝q Observe that the condition y ≥ − x implies that 2y ≥ y − x = q , or y / q ≥ 1/2 ; therefore, Upon taking lim sup , this yields ⎛1 z ⎞ α (x , y ) > M ⎜ − 0 ⎟ . q⎠ ⎝2 (24) (18) Next, for any ε > 0 , there exists a q0 > 0 such that z 0 / q < ε whenever q > q0 . Hence, from (24), we have α(x, y ) > M (1/2 − ε ) whenever q > q0 and y > z 0 . Note that, when y ≥ − x , the requirement q > max(q0 , 2z 0 ) guarantees that q > q0 and y > z 0 . Therefore, α(x, y ) > M (1/2 − ε ) holds if q > max(q0 , 2z 0 ) and y ≥ − x . The analysis for the other case with −x > y is parallel. Again, from (22), we have Finally, we claim that whenever f π is finite, we have (I π (T ))− = 0. T g(z )dz > Dividing this inequality by q and using (23) then yield ((I0 )+ + q1π )/λ (I π (T ))− / λ − . T T fπ 1 (I π (T ))− ≥ 1 − lim sup . * λ T →∞ T α y This implies that ((I0 )+ + q1π )/ λ 1 C π[0, T + (I π (T ))+ / λ] ( I π (T ))− / λ ≥1− − π + π + T + (I (T )) / λ T + (I (T )) / λ T + (I π (T ))+ / λ α* T →∞ ∫0 g(z )dz (17) holds for all sufficiently large T. lim sup (22) y α (x , y ) ≥ + ≥1− dz y 0 g(z )dz ∫ g(z )dz λ ≥ ∫x + 0 . q q Observe that with q fixed, we have either y ≥ − x or −x > y . We will next examine these two scenarios. Suppose first that y ≥ − x . Clearly, (22) implies that Combining (15) and (16) then yields the following lemma. π (21) 3.2. Proof of Theorem 2 N (T ) ∑ fπ 1 ≥ lim sup g( − (I π (Tn ))− /2). ϵ 2λ n →∞ Since the right-hand side of (21) is unbounded when n → ∞, we have arrived at a contradiction. This establishes (19) (because ϵ is arbitrary), which together with (18) imply f π ≥ α*; and hence the proof of Theorem 1 is complete. π C π[0, T + (I π (T ))+ / λ] ≥ (20) After setting T = Tn in (20), dividing both sides of the resulting inequality by Tn, and then taking lim sup , we obtain (19) The proof of this claim is similar to that for (8). Suppose that there exist a positive ϵ and an increasing sequence {Tn}∞ n=1 with lim n →∞Tn = ∞ that satisfy (I π (Tn ))− / Tn > ϵ for all n. Consider the time interval [T − (I π (T ))− / λ , T ] (which would be a single point when I π (T ) ≥ 0 ), and observe that 0 α (x , y ) ≥ ∫x g(z )dz q . An argument similar to that for the previous case then shows that there exist positive constants X1 and z1 such that α(x, y ) > M (1/2 − ε ) holds if q > max(q1, 2z1) and −x > y . This now allows us to conclude that If u1π = 0 , the interval [0, u1π ) is vacuous; otherwise, this interval does not contain any order. 13 221 International Journal of Production Economics 187 (2017) 216–228 S. Perera et al. scenarios: (i) I0 > 0 , (ii) 0 > I0 ≥ x 0 , and (iii) 0 ≥ x 0 > I0 . For scenario (i), we can always do nothing until inventory hits level 0 (recall that the left limit of g(·) at I0 was assumed to be finite whenever I0 is positive). For scenario (ii), we will simply follow the (x 0 , y0 ) policy and let it run until the net-inventory level hits level 0 for the first time. For scenario (iii), we will immediately place an order of size nl (y0 − x 0 ), where nl is the smallest positive integer such that nl (y0 − x 0 ) + I0 > x 0 . Note that the cost of placing this order is finite due to the sub-additivity of c(·) and the finiteness of c(y0 − x 0 ); moreover, since this order takes the net-inventory level up to a position higher than x0 (but no greater than y 0), we can then duplicate the action prescribed for scenario (ii). It follows that, for all three scenarios, we will be able to reach a zero-inventory epoch in finite time. Since the total cost involved is guaranteed to be finite, this establishes the claim. Next, let ε0 ≔ α0 − α*, which is positive. Now, with α(x, y ) > M (1/2 − ε ) holds if q = y − x > qm≔ max (q0 , 2z 0 , q1, 2z1), regardless of the relative sizes of −x and y. Next, recall that there exists an (x 0 , y0 ) such that α(x 0 , y0 ) < ∞ (see the paragraph following (2)). Without loss of generality, we can further assume that (x 0 , y0 ) is in + 0 . Then, since M and ε are arbitrary, they can be chosen to satisfy M (1/2 − ε ) ≥ α(x 0 , y0 ). Therefore, for any (x, y ) ∈ + 0 such that y − x > qm , we have that α(x, y ) > α(x 0 , y0 ). Since (x 0 , y0 ) ∈ + 0 , we have established that inf α(x, y ) = x<y inf α(x, y ), (x, y )∈+ (25) where + ≔{(x, y ): 0 < y − x ≤ qm and x ≤ 0 ≤ y}. Recall that α(x, y ) is originally defined for x < y . This original domain can be extended to include (0, 0) with the convention that α(0, 0) = ∞. That is, we now consider α(x, y ) on the closed and bounded (i.e., compact) set + = + ∪ (0, 0). Since c(·) is lower semi-continuous, and y − x is continuous on + , it can be easily shown, using the definition of lower semi-continuity, that c(y − x ) is lower semi-continuous on + . Next, since c(y − x ) and 1/(y − x ) are both lower semicontinuous, c(y − x )/(y − x ) is also lower semi-continuous on + . y Observe further that ∫ g(z )dz /(y − x ) is continuous. Therefore, α(x, y ) x is lower semi-continuous on + . Note that from Assumption 1, we have limq→0+ c(q ) ≥ K , implying that lim (x, y)→(0−,0+) α(x , y ) = ∞ = α(0, 0). This means that α(x, y ) is continuous at (0, 0). Therefore, α(x, y ) is lower semi-continuous on + , and hence it attains its minimum on + . However, since (x 0 , y0 ) ∈ + and α(x 0 , y0 ) < ∞ = α(0, 0), the point (0, 0) cannot be a minimizer of α(x, y ) over + . Hence, α(x, y ) attains its minimum on + . Thus, from (25), we conclude that there exists an (x*, y*) ∈ + such that α*≔α(x*, y*) = infx < y α(x, y ). (s0 , S0 ) ≔ (x 0 , y0 ), the definition of infimum implies that there exists a sequence {(sn , Sn ), n ≥ 0} in + such that αn≔α(sn , Sn ) ≤ α* + εn , where εn≔ε0 /2n , for all n ≥ 0 ; moreover, this sequence can be chosen so that the αn 's are decreasing in n. Observe that, since αn ≤ α* + ε0 and Sn − sn ≤ qm for all n ≥ 0 , we have c(Sn − sn ) ≤ c(Sn − sn ) + Sn g(z )dz / λ n ≤ (α* + ε0 )(Sn − sn )/ λ ≤ κ+ , (27) where κ+ ≔α0 qm / λ , a finite constant. Note in addition that since (sn , Sn ) is in + for every n ≥ 0 , we always have sn +1 ≤ Sn . For n ≥ 0 , define wn≔(Sn − sn )/ λ . Since I0 = 0 , the exact length of the period in which policy π * follows the (s0 , S0 ) policy is ζ0≔k 0 w0 . We shall refer to the time interval [0, ζ0] as the 0-th policy segment, or just the 0th segment, of π *; and from epoch ζ0 onward, we will let π * consist of a sequence of contiguous segments, where, for n ≥ 1, the n-th segment is a time interval of length ζn≔kn wn , where kn is a positive integer, in n which the (sn , Sn ) policy is employed. Let Γn≔ ∑ j =0 ζj , for n ≥ 0 ; then, note that Γn is the epoch at which the n-th segment terminates and at which the (n + 1)-th segment begins. To “connect” the (s, S ) policies in successive policy segments, notice however that there exists a complication, namely that for any n ≥ 1, it is possible to have Sn −1 = 0 = sn . If such a scenario should occur, we will let π * place a single order of size Sn − sn−1 at Γn−1, as opposed to the “inadmissible” action of placing two concurrent orders of respective sizes Sn −1 − sn −1 = − sn −1 and Sn − sn = Sn . If, on the other hand, Sn −1 = 0 = sn does not apply, then, at most one order could exist at Γn−1 and it is easily seen that the (sn −1, Sn −1) policy would “automatically” switch into the (sn , Sn ) policy at that epoch without any intervention; this is because Γn−1 is a zero-inventory epoch if and only if Sn −1 = 0 = sn does not hold. To complete our definition of policy π *, what remains is the specification of the kn's. Recall that with our choice of εn's above, the convergence of the αn 's to α* is at least at an exponential rate. The basic intuition behind this last part of our construction, then, is that the lengths of the successive policy segments in π *, which are determined by the kn's, can be made to increase at a sufficient pace so that the longrun average cost of π * actually achieves α*. We will now proceed by setting k 0 = 1; this means that, starting from time 0, the (s0 , S0 ) policy is followed for a time interval of length ζ0 = w0 . Hence, the total cost contribution (see (13)) associated with the ordering cycle spanning [0, Γ0] equals α0 w0 , or α0 Γ0 . Next, at epoch Γ0, observe that if S0 = 0 = s1 holds, then the “combined” order of size S1 − s0 there creates a larger (or merged) ordering cycle of duration w0 + w1. Moreover, since c(·) is sub-additive, the total cost contribution associated with this combined order is no greater than α0 w0 + α1 w1, which, interestingly, precisely equals the total of the cost contributions 3.3. Proof of Theorem 3 In light of (25), we will assume that α* is not attained by any (x, y ) ∈ + , as otherwise this theorem is a consequence of Theorem 1. Under this assumption, we will construct a policy in Π whose long-run average cost exactly equals α*. Specifically, our strategy is to show that there exists a policy π * whose cumulative cost function satisfies, for all sufficiently large T, an inequality of the form * C π [0, T ] ≤ α* + β (T ), T ∫s (26) where β (T ) is a (non-negative) function that converges to 0 as T → ∞. * * Clearly, (26) implies that f π ≤ α* holds. Since f π ≥ α* also holds * (Theorem 1), we will then have the desired result that f π = α*. Before proceeding, we will make the additional assumption that the ordering-cost function is sub-additive, i.e., c(q1 + q2 ) ≤ c(q1) + c(q2 ) for all positive q1 and q2. This assumption is without loss of generality as any cost function that is not sub-additive can be modified to one that is, by splitting an order of size q1 + q2 into two orders of sizes q1 and q2 each and placing these orders consecutively with little time delay to achieve a lower ordering cost14. Our starting point is the vector (x 0 , y0 ) in + with α0 ≔ α(x 0 , y0 ) < ∞, defined in Section 3.2. We will begin by letting π * follow the (x 0 , y0 ) policy for k0 ordering cycles, where k0 is a positive integer. To initialize this (x 0 , y0 ) policy, we will assume I0 = 0 and claim that this assumption can be made without loss of generality. To see the validity of this claim, observe that if I0 = 0 does not hold, then there are three possible 14 For discrete-time inventory models with a constraint on total production capacity, ordering q1 and q2 units concurrently could be infeasible; and this would result in an example of an ordering-cost function that is not sub-additive. However, in our continuous-time EOQ setting, these two orders could be placed “back-to-back”; that is, within very little time of each other. 222 S. Perera et al. International Journal of Production Economics 187 (2017) 216–228 associated with the two “scheduled” orders of sizes S0 − s0 and S1 − s1 at Γ0 if we had not been “forced” into combining these orders. Recall that our goal is to develop an upper bound on the cumulative total cost over time. It follows that for bounding purposes, we can and will conveniently “pretend” that the ordering cost for the single order at Γ0 is equal to c(S0 − s0 ) + c(S1 − s1) if S0 = 0 = s1. With this device, we then have that the total cost associated with the order(s) during the “w0 + w1 interval” that covers Γ0 always equals α0 w0 + α1 w1, regardless of whether or not S0 = 0 = s1; in addition, we will assign c(S0 − s0 ) as a cost increment that belongs to the interval [0, Γ0] and c(S1 − s1) as one that belongs to [Γ0, Γ] 1 . This assignment scheme will also be adopted without further comment for all future segments. Next, let k1 be sufficiently large so that Γ1 − Γ0 = ζ1 ≥ ζ0 = Γ0 . Clearly, if there is no combined order at Γ1, then the increment to the total cost in the interval [Γ0, Γ] 1 would be exactly α1 ζ1; otherwise, there exists an additional cost c(S2 − s2 ) = c(S2 ) at Γ1 that is from the next segment [Γ1, Γ2]. Recall from (27) that c(Sn − sn ) ≤ κ+ for all n ≥ 0 ; hence, we have c(S2 ) ≤ κ+ . It follows that * * lθ (T ) ] + κ+ . C π [0, T ] ≤ C π [0,Γ * C π [0, Γ] 1 ≤ Γθ (T ) and then invoking (31) and (30) now Dividing this inequality by l yield (26) with β (T )≔ε0 Finally, observe that θ (T ) → ∞ as T → ∞; hence, we indeed have β (T ) → 0 as T → ∞. This completes the proof of Theorem 3. 4. Concluding remarks Our definition of the policy class Π in Section 2 only allows deterministic policies. However, in the interest of generality, it is also worthwhile to consider randomized and history-dependent policies. Denote this larger class of policies by Π͠ . We now show that whenever α* is attained by an (x*, y*), then the corresponding (x*, y*) policy is, in fact, also optimal over Π͠ . Somewhat surprisingly, such an extension is an easy consequence of our proof of Theorem 1. For π ∈ Π͠ , define 1 ∑ αj ζj + κ+ j =0 (28) f π = lim sup always holds. Since we also have α1 ≤ α0 = α* + ε0 , α1 ≤ α* + ε1, and ζ1 ≥ ζ0 , it is easily seen that (28) further implies that15 T →∞ E(C π[0, T ]) , T where E denotes expectation17; and let this be our performance measure. Observe that * κ C π [0, Γ] 1 1 ≤ α* + (ε0 + ε1) + + . Γ1 2 Γ1 2κ θ (T ) + 2 + +. Γθ (T ) 2θ (T )+1 (29) E(C π[0, T ]) E(C π[0, T ]) ≥ lim inf T →∞ T T T →∞ π ⎛ ⎞ C [0, T ] ≥ E⎜lim inf ⎟, ⎝ T →∞ ⎠ T f π = lim sup Note that, since ε1 = ε0 /2 , the term (1/2)(ε0 + ε1) can be rewritten as ε0(n + 2)/2n +1 with n=1. Our intent beyond k1 should now be clear. For any n ≥ 2 , we will let kn be sufficiently large so that the length of the n-th segment (i.e., ζn) is no shorter than the total length of all previous segments (i.e., Γn−1). That is, we will require that the kn's be chosen so that Γn − Γn −1 ≥ Γn −1 holds for every n ≥ 1; and this completes our construction of π *. It should also be clear that this construction is designed to ensure that, for all n ≥ 1, we have (32) holds. Next, note that the cost associated with the interval lθ (T ), Γ lθ (T ) + wθ (T )+1] equals αθ (T )+1 wθ (T )+1, and that, in light of (27), this [Γ cost is bounded from above by κ+ . Hence, we have where the second inequality is due to Fatou's Lemma. Next, note that our proof of Theorem 1 directly applies to every realized inventory trajectory under π ; and moreover, the argument remains valid when lim sup in (1) is replaced by lim inf . That is, lim infT →∞C π[0, T ]/ T ≥ α* holds for every sample path. This implies that the right-hand side of * (32) is also bounded below by α*. Since f π = α*, we see that π * is optimal over Π͠ . Finally, we note that under stronger assumptions on the cost structure, simple and direct proofs for our Theorem 1 exist. Since such proofs have been desperately absent in standard textbooks, we believe they are of pedagogical interest. We will articulate two examples, but, in the interest of brevity, delegate the details to the Appendix. The first example is for the special case when c(·) is increasing and backordering is not allowed (i.e., g(x ) = ∞ for all x < 0 ). We have uncovered an extremely short proof. The argument, which complements the extant literature (see, e.g., Beyer and Sethi (1998) and Lippman (1971)), is given in Appendix B. When compared with our full proof of Theorem 1, it can be seen from Appendix B that the shortness of the proof is primarily due to the no-backordering constraint. In the absence of this requirement, it turns out that a direct proof of Theorem 1 that does not rely on the reduction in Lemma 1 also exists if the cost structure is restricted to that of the standard EOQ model; and this is our second example. The argument for this extension of the original EOQ model in Harris (1913) is provided in Appendix C. 15 This follows from the fact that a ≥ b ≥ 0 and y ≥ x > 0 together imply that (ax + by )/(x + y ) ≤ (1/2)(a + b ) . 16 Note that the right-hand-side ratio is no less than αθ (T ) . This inequality then follows from the fact that for any pair of positive x and y, a ≥ (ax + by )/(x + y ) if and only if a ≥ b ≥ 0. 17 Conceptually, this expectation is on the probability space of all inventory trajectories generated by π . * κ C π [0, Γn] n+2 ≤ α* + ε0 n +1 + + . Γn Γn 2 (30) This inequality is an immediate extension of (29), obtained by * * * decomposing C π [0, Γn] into the sum of C π [0, Γn −1] and C π [Γn −1, Γn] and then carrying out a simple induction on n. Consider now an arbitrary T >Γ 0. Let θ (T ) be the index of the last completed segment under π * by time T; and correspondingly, define χ (T )≔max{j ≥ 0: Γθ (T ) + j wθ (T )+1 ≤ T}. Also, denote Γθ (T ) + χ (T )wθ (T )+1 lθ (T ) ≤ T < l Γθ (T ) . Then, by definition, we have Γθ (T ) ≤ Γ Γθ (T ) + wθ (T )+1. by l Since αθ (T )+1 is no greater than αθ (T ) (which, in turn, is no greater than any αn for 0 ≤ n ≤ θ (T )), it immediately follows that16 * * lθ (T )] C π [0, Γ C π [0, Γθ (T )] ≤ l Γθ (T ) Γθ (T ) (31) 223 International Journal of Production Economics 187 (2017) 216–228 S. Perera et al. Appendix A. Well-known examples of ordering-cost functions that are not increasing. Fig. A.1. Ordering-cost function with batch-ordering constraint. Fig. A.2. Ordering-cost function with MOQ constraint. Fig. A.3. Ordering-cost function with all-unit discounts. 224 International Journal of Production Economics 187 (2017) 216–228 S. Perera et al. Appendix B. Proof of Theorem 1 with an increasing c(·) and no backordering Recall that Π 0p denotes the class of policies that place an order only when the inventory level decreases to zero. Clearly, Lemma 1 remains valid after replacing Π 0 by Π 0p . The proof for this special case is well known and much shorter, as we do not have the complications that arise from allowing backorders. Let π be a policy in Π⧹Π 0p . Then, π must place at least an order at some epoch t with a positive I π (t ); clearly, this is equivalent to the existence of π xi > 0 for some i ≥ 1. Let π∼ be the policy constructed from π by setting: ∼ tnπ = tnπ + ∼ tnπ = tnπ xnπ λ and ∼ if xnπ > 0; qnπ , otherwise. and qnπ = qnπ , ∼ qnπ = and An illustration of the inventory trajectories under both π and π∼ is shown in Fig. B.1. It is easily seen from this figure (see the shaded region) that the cumulative sum of holding and shortage costs under π∼ over time is dominated by that under π . Moreover, it is clear that for a given T ≥ 0 , every order placed within [0, T ] under π∼ corresponds to an order placed under π within the same interval (and furthermore π may have additional orders in [0, T ], e.g., the order at epoch t6π in Fig. B.1). It follows that the total ordering cost under π∼ is no greater than that under π within [0, T ]. Taken ∼ ∼ together, these two observations imply that C π [0, T ]/ T ≤ C π[0, T ]/ T for all T ≥ 0 . By letting T → ∞, we then have f π ≤ f π . π p Now, consider a policy π ∈ Π 0 . Let T be a time epoch that is sufficiently large so that N (T ) ≥ 2 ; then, the total cost under π in [0, T ] can be written as: N π (T )−1 C π[0, T ] = C π[0, t1π ) + ∑ C π[tiπ , tiπ+1) + C π[tNπ (T ), T ] . (B.1) i =1 This cost decomposition is illustrated in Fig. B.2. Now, it is immediate that C π[tiπ , tiπ+1) = α(0, yiπ )(tiπ+1 − tiπ ) ≥ α*(tiπ+1 − tiπ ). (B.2) Moreover, since c(·) and g(·) are increasing functions, we see from the shaded area in Fig. B.2 that if T − similar to tNπ (T ) , we have C π[tNπ (T ), T ]≥α(0, yNπ(T ) − I π (T ))(T − tNπ (T )) and hence C π[tNπ (T ), T ] ≥ α* (T − tNπ (T )). Notice in addition that this lower bound continues to hold even if T − that tNπ (T ) is positive, then, with yNπ(T ) defined (B.3) π = 0 . Since C [0, t1π ) ≥ 0 and Fig. B.1. Construction of a policy π∼ from a given π ∈ Π⧹Π 0p . Fig. B.2. Cost decomposition over [0, T ] for π ∈ Π 0p . 225 tNπ (T ) t1π = I0 / λ , we then conclude from (B.1)–(B.3) International Journal of Production Economics 187 (2017) 216–228 S. Perera et al. C π[0, T ] ≥ α* (T − I0 / λ ). (B.4) Dividing both sides of (B.4) by T and then letting T → ∞ now yield f π ≥ α*, completing the proof for this special case of Theorem 1. Finally, we note that our proof in this section is similar in spirit to the approach in Lippman (1971). Lippman studies a single-product, singlelocation model without backordering. The holding cost is linear, and the ordering-cost structure allows multiple set-up costs. Under these specific cost assumptions, Lippman proves (s, S )-optimality using a lower-bounding argument. Since the multiple set-up cost function is increasing and left continuous, (B.4) and our Theorem 2 also imply that an optimal (s, S ) policy exists in Lippman's model. Appendix C. A simple proof of the standard EOQ formula with backorder We first recall that the cost assumptions in the standard EOQ model with backorder are: c(q ) = K 1{q >0}+ νq and g(x ) = h max(0, x ) + b max(0,−x ) for some constants K > 0 , ν ≥ 0 , h > 0 , and b > 0 . Now, for a given policy π , let Q π [0, T ] and ηπ [0, T ] be the cumulative order quantities and, respectively, the total number of orders placed under π in the time interval [0, T ]. Then, ν Q π [0, T ] + Kηπ [0, T ] + H π [0, T ] T T →∞ ν Q π [0, T ] + Kηπ [0, T ] + H π [0, T ] ≥ lim inf T →∞ T Kηπ [0, T ] + H π [0, T ] Q π [0, T ] ≥ ν lim inf + lim inf . T →∞ T →∞ T T f π = lim sup (C.1) Next, we claim that we can restrict attention to policies that satisfy lim inf T →∞ Q π [0, T ] ≥ λ. T (C.2) The intuition behind this claim is obvious: If the long-run rate of procurement is less than the demand rate, then the amount of backorder will diverge to infinity. This is formalized in the following lemma. Lemma 3. The long-run average cost f π for any policy π that violates inequality (C.2) is infinite. Proof. Suppose lim infT →∞Q π [0, T ]/ T < λ for a given policy π . Then, by the definition of lim inf , there exist an ε ∈ (0, λ ), an n 0 > 0 , and an increasing π π π sequence {Tn}∞ n=1 with lim n →∞Tn = ∞ such that Q [0, Tn ]/ Tn < λ − ε for all n ≥ n 0 . Observe that I (Tn ) = I0 + Q [0, Tn ] − λTn ; and this, together with π π the previous inequality, implies that I (Tn ) < I0 − εTn for all n ≥ n 0 . (Hence, I (Tn ) diverges to −∞ as n → ∞.) Suppose further that n (≥n 0 ) is sufficiently large so that I0 − εTn is strictly negative and Tnl ≔Tn − (εTn − I0 )/ λ = (1− ε / λ )Tn + I0 / λ is non-negative. The time epochs Tnl and Tn are illustrated in Fig. C.1. From this figure, we also see that (εTn − I0 )/ λ is the amount of time necessary for the net-inventory level to drop down to I0 − εTn , assuming that it starts from level 0 at time Tnl and no future orders are placed. Now, consider the time interval (Tnl, Tn]. Clearly, the cumulative shortage cost in [0, Tn] is no less than that within the subinterval (Tnl, Tn]. Moreover, the latter cost is, in turn, greater than b(1/2)(εTn − I0 )((εTn − I0 )/ λ ), which is the total shortage cost in (Tnl, Tn] if the inventory level under π were zero at epoch Tnl and no future orders were placed. (The diagonal border of the shaded area in Fig. C.1 always lies entirely above the inventory trajectory under π in (Tnl, Tn].) It follows that C π[0, Tn]/ Tn > b(1/2)(ε − I0 / Tn )((εTn − I0 )/ λ ). Finally, letting n → ∞ in this inequality yields that f π = ∞. □ From Lemma 3, we have that the first limit in (C.1) is no less than νλ . We now develop a lower bound for the second limit in (C.1). Observe that this limit can be interpreted as the objective function in a standard EOQ model with ν = 0 . Denote by α0(x, y ) the function (2) for this special case. It is easily shown that α0(x, y ) attains its minimum at s0*= − 2λKh /[b(b + h )] and S0*= 2λKb /[h(b + h )] . We will next show that, for any π ∈ Π (not necessarily limited to Π 0), Fig. C.1. Lower bound for backordering cost over [0, Tn]. 226 International Journal of Production Economics 187 (2017) 216–228 S. Perera et al. Fig. C.2. Cost decomposition over [0, T ]. lim inf T →∞ Kηπ [0, T ] + H π [0, T ] ≥ α0*≔α0(s0*, S0*). T (C.3) The argument relies on a lower bound that is similar in spirit to (B.4). For a given policy π , denote by C0π the cumulative cost function under the assumption that ν = 0 . Then, (B.1) again holds18 with C0π replacing C π ; and this is illustrated in Fig. C.2. It is easily seen from Fig. C.2 and (2) that, for 1 ≤ i ≤ N π (T ) − 1, C0π[tiπ , tiπ+1) = K + ∫x yiπ π i +1 g (z ) dz = α0(xiπ+1, yiπ )(tiπ+1 − tiπ ) ≥ α0*(tiπ+1 − tiπ ). λ Now, consider the particular scenario in which both t1π and T − tNπ (T ) are positive. Then, similar reasoning also shows that C0π[tNπ (T ), T ] = α0(I π (T ), yNπ (T ) )(T − tNπ (T )) ≥ α0*(T − tNπ (T )); and that C0π[0, t1π ) = α0(x1π , I0 )(t1π −0) − K ≥ α0* t1π − K . Consequently, from the modified (B.1), we have Kηπ [0, T ] + H π [0, T ] = C0π[0, T ] ≥ α0* T + γ , (C.4) may not be positive, there are three other scenarios. It is easily seen that for all scenarios, the constant γ where γ = − K . Since and/or T − equals −K , 0, or K. Dividing both sides of (C.4) by T and letting T → ∞ now yield (C.3). * From (C.1)–(C.3), we now have f π ≥ νλ + α0*. It is easily verified that f π0 = νλ + α0*, where π0* denotes the (s0*, S0*) policy. Therefore, π0* is optimal over Π . t1π tNπ (T ) Federgruen, A., Lee, C., 1990. The dynamic lot size model with quantity discount. Nav. Res. Logist. 37 (5), 707–713. Harris, F.W., 1913. How many parts to make at once. Fact. Mag. Manag. 10 (2), 135–136, 152. Hassin, R., Meggido, N., 1991. Exact computation of optimal inventory policies over an unbounded horizon. Math. Oper. Res. 16 (3), 534–546. Hellion, B., Mangione, F., Penz, B., 2012. A polynomial time algorithm to solve the single-item capacitated lot sizing problem withminimum order quantities and concave costs. Eur. J. Oper. Res. 222 (1), 10–16. Iglehart, D., 1963a. Optimality of (s, S) policies in the infinite-horizon dynamic inventory problem. Management Sci 9 (2), 259–267. Iglehart, D., 1963b. Dynamic programming and stationary analysis of inventory problems (Chapter 1). In: Scarf, H., Gilford, D., Shelly, M. (Eds.), Multistage Inventory Models and Techniques. Stanford University Press, Stanford, CA. Kiesmüller, G.P., de Kok, A.G., Dabia, S., 2011. single item inventory control under periodic review and a minimum order quantity. Int. J. Prod. Econ. 133 (1), 280–285. Li, C., Hsu, V.N., Xiao, W., 2004. Dynamic lot sizing with batch ordering and truckload discounts. Oper. Res. 52 (4), 639–654. Li, C., Ou, J., Hsu, V.N., 2012. Dynamic lot sizing with all-units discount and resales. Nav. Res. Logist. 59 (3–4), 230–243. Lippman, S.A., 1969. Optimal inventory policy with multiple set-up costs. Manag. Sci. 16 (1), 118–138. Lippman, S.A., 1971. Economic order quantities and multiple set-up costs. Manag. Sci. 18 (1), 39–47. Porteus, E.L., 2002. Foundations of Stochastic Inventory Theory. Stanford University Press, Stanford, CA. Presman, E., Sethi, S.P., 2006. Inventory models with continuous and Poisson demands and discounted and average costs. Prod. Oper. Manag. 15 (2), 279–293. References Adelman, D., Klabjan, D., 2005. Duality and existence of optimal policies in generalized joint replenishment. Math. Oper. Res. 30 (1), 28–50. Alp, O., Huh, W.T., Tan, T., 2014. Inventory control with multiple setup costs. Manuf. Serv. Oper. Manag. 16 (1), 89–103. Altintas, N., Erhun, F., Tayur, S., 2008. Quantity discounts under demand uncertainty. Manag. Sci. 54 (4), 777–792. Benton, W.C., Park, S., 1996. A classification of literature on determining the lot size under quantity discounts. Eur. J. Oper. Res. 92 (2), 219–238. Beyer, D., Sethi, S.P., 1998. A proof of the EOQ formula using quasi-variational inequalities. Int. J. Syst. Sci. 29 (11), 1295–1299. Brimberg, J., Hurley, W.J., 2006. A note on the assumption of constant order size in the basic EOQ model. Prod. Oper. Manag. 15 (1), 171–172. Caliskan-Demirag, O., Chen, Y., Yang, Y., 2012. Ordering policies for periodic-review inventory systems with quantity-dependent fixed costs. Oper. Res. 60 (4), 785–796. Chan, L.M.A., Muriel, A., Shen, Z., Simchi-Levi, D., 2002. On the effectiveness of zeroinventory-ordering policies for the economic lot-sizing model with a class of piecewise linear cost structures. Oper. Res. 50 (6), 1058–1067. Chen, F., 2000. Optimal policies for multi-echelon inventory problems with batch ordering. Oper. Res. 48 (3), 376–389. Chen, X., 2009. Inventory centralization games with price-dependent demand and quantity discount. Oper. Res. 57 (6), 1394–1406. Erlenkotter, D., 1989. An early classic misplaced: Ford W. Harris's economic order quantity model of 1915. Manag. Sci. 35 (7), 898–900. Erlenkotter, D., 1990. Ford Whitman Harris and the economic order quantity model. Oper. Res. 38 (6), 937–946. 18 If there is an order at time zero, i.e., if t1π = 0 , then C0π[0, t1π ) = 0 . 227 International Journal of Production Economics 187 (2017) 216–228 S. Perera et al. (2), 257–270. Zheng, Y.-S., 1991. A simple proof of the optimality of (s, S) policies in infinite-horizon inventory systems. J. Appl. Probab. 28 (4), 802–810. Zheng, Y.-S., 1994. Optimal control policy for stochastic inventory systems with Markovian discount opportunities. Oper. Res. 42 (4), 721–738. Zhou, B., Zhao, Y., Katehakis, M.N., 2007. Effective control policies for stochastic inventory systems with a minimum order quantity and linear costs. In. J. Prod. Econ. 106 (2), 523–531. Zhu, H., Liu, X., Chen, Y., 2015. Effective inventory control policies with a minimum order quantity and batch ordering. Int. J. Prod. Econ. 168 (1), 21–30. Zipkin, P., 2000. Foundations of Inventory Management. McGraw-Hill, Boston, MA. Scarf, H., 1960. The optimality of (S, s) policies in dynamic inventory problems. In: Arrow, K.J., Karlin, S., Suppes, P. (Eds), Proceedings of the First Stanford Symposium on Mathematical Methods in the Social Sciences. Stanford University Press, Stanford, California, pp. 196–202. Sun, D., 2004. Existence and properties of optimal production and inventory policies. Math. Oper. Res. 29 (4), 923–934. Veinott, A.F., 1966. On the optimality of (s, S) inventory policies: new conditions and a new proof. SIAM J. Appl. Math. 14 (5), 1067–1083. Winston, W.L., 2004. Operations Research: Applications and Algorithms 4th ed.. Thomson Brooks/Cole, Belmont, CA. Zhao, Y., Katehakis, M.N., 2006. On the structure of optimal ordering policies for stochastic inventory systems with minimum order quantity. Probab. Eng. Inf. Sci. 20 228
© Copyright 2026 Paperzz