Optimal and Asymptotically Optimal Policies for Assemble-to-Order N- and W-Systems Lijian Lu,1 Jing-Sheng Song,2 Hanqin Zhang3 1 Graduate School of Business, Columbia University, New York, 10027 New York 2 Fuqua School of Business, Duke University, Durham, 27708 North Carolina 3 NUS Business School, National University of Singapore, Singapore Received 25 October 2014; revised 30 November 2015; accepted 1 December 2015 DOI 10.1002/nav.21671 Published online 5 January 2016 in Wiley Online Library (wileyonlinelibrary.com). Abstract: We consider two specially structured assemble-to-order (ATO) systems—the N- and W-systems—under continuous review, stochastic demand, and nonidentical component replenishment leadtimes. Using a hybrid approach that combines samplepath analysis, linear programming, and the tower property of conditional expectation, we characterize the optimal component replenishment policy and common-component allocation rule, present comparative statics of the optimal policy parameters, and show that some commonly used heuristic policies can lead to significant optimality loss. The optimality results require certain symmetry in the cost parameters. In the absence of this symmetry, we show that, for systems with high demand volume, the asymptotically optimal policy has essentially the same structure; otherwise, the optimal policies have no clear structure. For these latter systems, we develop heuristic policies and show their effectiveness. © 2016 Wiley Periodicals, Inc. Naval Research Logistics 62: 617–645, 2015 Keywords: inventory control; assemble-to-order system; asymptotic analysis 1. INTRODUCTION This article concerns optimal control policies for two specially structured, continuous-review, assemble-to-order (ATO) systems as shown in Fig. 1. In these systems, components are kept in stock, but final products are assembled only after customer orders are realized. Customer orders arrive randomly. The components are procured from outside suppliers with positive leadtimes. The assembly times of the products are negligible relative to the component replenishment leadtimes. The ATO approach allows manufacturers to only build products that customers desire and hence to eliminate finished-product inventories, achieving great flexibility at low cost. Well-known examples include Dell Computer’s online service and the “build-to-order” practices of BMW, Toyota, and GM. Incidentally, the same system structure also applies to mail-order and online retailers. In such businesses, customers may order different, but possibly overlapping, subsets of items that are kept in stock and Correspondence to: Lijian Lu ([email protected]) © 2016 Wiley Periodicals, Inc. customer satisfaction is based on the fulfillment of the entire order. Despite the widespread use of ATO systems, the structure of the optimal control policy remains largely unknown. The difficulty arises because the system topology comprises both the assembly structure (for each product) and the distribution structure (for each common component). While the form of an optimal policy for an assembly system with a single final product has been identified [28], for systems with multiple final products this is still an open problem. As stated in [34], control of an ATO system consists of two decisions: inventory replenishment and inventory allocation. Until now, only simple heuristic policies have been implemented. Correspondingly, the academic literature almost exclusively focuses on these simple policies, such as independent base-stock (IBS) replenishment policies (i.e., each component follows a standard base-stock policy) and the first-come-first-served (FCFS) component allocation rule. This article addresses the structure of an optimal policy for two basic ATO systems—the N- and W-systems (see Fig. 1)—in the following setup: demands are generated by 618 Naval Research Logistics, Vol. 62 (2015) Figure 1. Illustration of ATO systems. a stationary process with independent increments. Unsatisfied demand is fully backlogged and replenishment orders for components face a constant component specific leadtime. Our objective is to minimize the expected long-run average system cost, under linear holding and procurement costs for the components and linear backlogging costs for the final products. For conciseness and ease of presentation, the main body of the article focuses on the N-system and Appendix presents the extension to the W-system. In the N-system, product-1 requires one unit of component1 only, while product-12 requires a unit of component-1 and one of component-2. In the following, we often refer to component-1 as the common component and component2 the product-specific component. In the context of computers, for example, component-1 may be a hard disk and component-2 may be a memory card. The hard disk may be sold directly to a customer as a portable hard disk drive (product-1) or may be assembled together with the memory card into a laptop (product-12). Similarly, component-1 may be a LCD monitor, which may either be sold directly to a customer (product-1) or be assembled with a CPU (component2) into a desktop (product-12). In the context of automobiles, component-1 may be an engine and component-2 may be the body of the car. The engine may either be sold separately as product-1 or be assembled with the car body into a car (product-12). Clearly, the N-system possesses the basic elements of an ATO system—with both assembly and distribution structures. Thus, it serves as an important building block for general multiproduct, multicomponent ATO systems. We now summarize our main findings. These depend on the cost structure of the system. 1.1. Cost Symmetry We first study a system in which the cost parameters satisfy a certain symmetry condition: The unit backorder cost rate for product-1 equals the sum of the unit backorder cost rate Naval Research Logistics DOI 10.1002/nav for product-12 and the unit holding cost rate of component-2. Therefore, when component-2 is available, there is no immediate cost difference for allocation of a unit of component-1 among different product-demands. This cost structure is plausible in practice if product-1 serves an external market that enjoys a higher profit margin or faces a higher shortage cost, while product-12 serves internal production, as an intermediate product (or a subassembly). For instance, in the computer example mentioned above, product-1 may be a high-end portable hard disk competing with several substitutes in the market, while product-12 is used to assemble a laptop in a medium price range. In the automobile example, the engine may be used to satisfy an emergency supply contract for a spare part that incurs a high delay penalty, while product-12 is used to assemble a car which has a smaller penalty cost. For this system, we obtain the following full characterization of the optimal policy: a. Allocation Rule: For any given replenishment policy, a no-holdback (NHB) allocation rule is optimal. A NHB rule allocates a unit of a component to an outstanding product demand if and only if this allocation can result in the fulfillment of that demand. Note that there are multiple NHB rules; under the above symmetry condition, any NHB rule is optimal. b. Replenishment Policy: Assuming this optimal allocation rule is followed, the optimal replenishment policy is a coordinated base-stock (CBS) policy. Under a CBS policy, i. The component with the longer leadtime uses a standard base-stock policy, acting on inventory position = inventory in stock and on order minus backlogs. ii. The component with the shorter leadtime adopts a state-dependent base-stock policy. Remarkably, the base stock level depends only on a one-dimensional state variable, namely, the aggregate demand for one of the two products during the most recent time interval of a duration given by the difference between the components’ leadtimes. We also provide a simple procedure to determine the parameters of the CBS policy. In addition, we show that, in general, a higher demand rate for either product increases the optimal inventory levels for both components. This conforms with conventional wisdom. However, counter-intuitively, when the product specific component-2 has a shorter leadtime, a higher product-1 demand rate lowers the optimal component-2 inventory. The above findings are significant. The structure of the optimal policy is remarkably simple, given the fact that a basic representation of the problem as a Markov Decision Lu, Song, and Zhang: Optimal and Asymptotically Optimal Policies process requires a complex, multidimensional state description, including vectors of outstanding orders and the times at which they are placed, along with inventory levels for the two components, and backlog sizes for the two final products. Our results show that, the combination of a simple allocation rule (NHB) and the above stated replenishment policy (CBS) is truly optimal. The CBS policy synchronizes the component inventory levels. (Clearly, when the component leadtimes are identical, the CBS policy reduces to an IBS policy.) This policy is closely related to the well-known balanced-base-stock (BBS) policy, proved to be optimal by [28] for the singleproduct assembly system that consists of product-12 only. It is noteworthy that a similar policy is optimal with the addition of a second final product, that is, product-1. With the form of optimal policy identified, we are able to investigate the effectiveness of the commonly used IBS policies. Our numerical experiments indicate that the optimality loss incurred using the best IBS policy can be significant. The IBS policy performs well only when the two component leadtimes are similar. (Recall that IBS is optimal when the leadtimes are identical). At the same time, we make a methodological contribution. Our optimality proof introduces a novel, hybrid approach for identifying an optimal policy. The standard approach for characterizing the structure of an optimal policy is to formulate the problem as a dynamic program or Markov Decision process. However, as mentioned before, the complexity of the system state and the nonexponential feature of the leadtime for each component precludes this approach. Our approach consists of two steps. The first step establishes the optimality of a NHB allocation policy under an arbitrary replenishment policy. To this end, we derive a sample-path lower bound of the system cost and construct a NHB allocation rule to achieve the lower bound via a linear programming model. Thereafter, a NHB allocation policy can be used without loss of optimality. The second step is to find the optimal replenishment policy to minimize the expected cost lower bound obtained from the first step, and show that a CBS policy achieves this expected lower bound. We accomplish this by invoking the tower property of conditional expectations—that is, the informed expectation on average equals the uninformed expectation; see Eq. (25)— for the expected cost that arises in any finite time interval. In particular, the tower property enables us to sequentially find the optimal replenishment policy for the component with the longer leadtime and then that for the component with the shorter leadtime by leveraging the realized product demand information during the time difference between the two component leadtimes. 1.2. Cost Asymmetry Next, we consider systems with non-symmetric costs. We obtain the following results: 619 a. Asymptotically Optimal Policy: Under the highvolume demand regime (demand rates for both product-1 and product-12 are significantly high), we use heavy-traffic limiting arguments to identify the asymptotically optimal policy by allocating components at discrete review points separated by an infinitesimal interval. Under this asymptotic regime, we show that the optimal replenishment policy is a CBS policy, and the optimal allocation policy is the periodic-review NHB policy with a priority based backorder clearing (PBC) rule. Thus, interestingly, the form of the asymptotically optimal policy is essentially the same as that of the optimal policy in the symmetric-cost case. b. Bounds: When demand is in the small to medium range, there is no clear structure for the optimal policy and we construct two systems with symmetric costs whose long-run average costs form a lower bound and an upper bound on the long-run average cost of the original system. This allows us to exploit the optimal policies of these bounding systems based on our results for the symmetric-cost case to devise heuristic policies for the original system. In addition, we use the cost lower bound as a benchmark to evaluate the performance of the heuristics. c. Heuristics: We develop various heuristic policies based on our results for the symmetric-cost case (as applied to the bounding systems) and the asymptotic analysis for high-volume demand systems with asymmetric costs. Numerical results show that, in general, when cost parameters are asymmetric, the most effective heuristic policy consists of a CBS replenishment policy combined with a rationing allocation policy. Under a rationing allocation policy, one of the two final products is treated with a higher priority than the other. Any unit of the common component is used to fill an outstanding demand with high priority if the allocation results in an immediate fulfilment. A unit of the common component is allocated to fill an outstanding demand with low priority only when the common component inventory level exceeds a threshold value characterized by a piecewise linear function of the component2 inventory. (For the special case of ample supply of component-2, the system reduces to a single-item inventory system with two demand classes. For such systems, it has been shown that a simple base-stock policy is optimal for replenishment and a rationing policy is optimal for inventory allocation among the two products; see [34]). Finally, we show that our techniques to derive cost bounds can be extended to W-systems. Moreover, under specific cost Naval Research Logistics DOI 10.1002/nav 620 Naval Research Logistics, Vol. 62 (2015) and leadtime conditions, we specify a policy (a CBS replenishment policy combined with a NHB allocation rule) whose cost equals the lower bound, and thus, is optimal. The rest of the article is organized as follows. Section 2 reviews the relevant literature. Section 3.1 introduces the basic notation and model formulation, Section 3.2 outlines our basic methodology for the optimality proof. Section 4 introduces our sample path methods and characterizes the optimal common-component allocation rule. Section 5 focuses on the form of the optimal replenishment policy and its comparative statics under symmetric cost. Section 6 analyzes the system with asymmetric costs. In Section 7, we develop bounds and heuristics based on the results for systems with symmetric and asymmetric costs. We also investigate the effectiveness of the IBS policy for systems with symmetric cost. Finally, Section 8 summarizes the article. Appendix contains the extensions to the W-system and the online companion contains all proofs. 2. LITERATURE REVIEW As mentioned above, the majority of studies in the ATO literature focus on specific classes of policies. In particular, for continuous-review models, most papers assume the FCFS allocation rule, and develop tools to evaluate and optimize the IBS replenishment policies. See, for example, [17–19, 30–33, 37, 42]. For periodic-review models, because the demand during each period is batched and filled at the end of the period, different allocations rules have been considered. These include a fixed priority rule [41], the FCFS rule [12], and a fair-share rule [3]. [4] study a product-based allocation rule that makes optimal or near-optimal allocation decisions within each period. Similarly, [14] study optimal allocation decisions within each period using the idea of “multi-matching,” in the sense that multiple components must be matched with multiple products. However, all these studies apply FCFS to demand between periods. In other words, these alternative allocation rules, if implemented in a continuous-review environment, reduce to FCFS. A second allocation policy considered in the literature is the NHB allocation rule, which was first described and analyzed by [35], followed by [8, 20]. (It is worth mentioning that both FCFS and NHB are commonly seen in practice; [15] describe an example of NHB at Dell and [40] describe an example of FCFS at Amazon.com.). The NHB rule describes a class of allocation rules, which allocates a component to a demand only if such an allocation will result in the fulfillment of the demand. When there are multiple types of backorders waiting for the same common component, however, how to allocate that component among different backorders (i.e., the backorder clearing rule) still needs to be specified. [35] assume the FCFS backorder Naval Research Logistics DOI 10.1002/nav clearing rule, and, therefore, they call the allocation policy a modified FIFO rule. [16] show that, for the N-system and W-system (and their generations) under the IBS policy and symmetric cost, the NHB rule is sample-path optimal among all possible allocation rules, regardless of the types of backorder clearing rule. However, when the costs are asymmetric, PBC outperforms FCFS. [8] provide extensive discussions on effectiveness of various backorder clearing rules, including PBC and priority with reservation (PR). For systems with asymmetric costs, we employ NHB, PBC, and PR allocation rules to construct heuristic policies. Different from [8, 20], however, we use the CBS policy for replenishment, instead of the IBS policy. We also show the impact of alternative component allocation rules, together with a CBS replenishment policy, on the system cost. Only very few papers discuss the form of the optimal control policy for ATO systems. [5] consider the special case of a single end-product and multiple demand classes. They assume the supply process for each component is a singleserver queue with exponentially distributed processing times. They show that under Markovian assumptions on demand and production, the optimal replenishment policy is a statedependent base-stock policy, and the optimal allocation rule is a multilevel rationing policy that depends on the inventory levels of all other components. [9] extend the above results to nested systems and [21] extend them to general M-systems, assuming at most one job in the supply system at any time. We consider a very different component supply process, which is an infinite-server queue with constant processing times. For the W-system with identical deterministic component leadtimes (a special case of the supply system that we consider), [8] show that when a symmetric cost condition or a “balanced capacity” condition is satisfied, the optimal replenishment policy is a simple base-stock (i.e., IBS) policy, and the optimal allocation policy is a specific NHB policy, which is state-dependent. For the same system, that is, when the leadtimes are identical, [26] show that an IBS plus an allocation rule with a “target” backlog level (which is similar to the above rationing policy) becomes asymptotically optimal when the leadtimes go to infinity. In most ATO systems, leadtimes are not identical. Very little is known about the structure of an optimal policy under nonidentical leadtimes, both for N- and W-systems. [20] show, for a W-system with symmetric cost, that under an IBS replenishment policy, the common component is optimally allocated with a NHB allocation rule. Using a different proof technique, we generalize the result for arbitrary replenishment policies. This extension is important because, as mentioned, the optimal replenishment policy is not, in general, IBS. [25] independently derive a lower bound for the optimal cost in a W-system, by solving a stochastic program. We derive a lower bound for the W-system as well, based on the Lu, Song, and Zhang: Optimal and Asymptotically Optimal Policies approach described above. Our lower bound is much easier to compute. In addition, they present conditions (called verification lemma) that the optimal policy should satisfy to achieve the lower bound under symmetric cost and leadtime conditions whereby L2 = L3 > L1 , see Fig. 1. However, these conditions involve a set of complex equations and require solving a stochastic program, which is computationally challenging. Different from their approach, we explicitly characterize the optimal policies (a CBS replenishment policy together with a NHB allocation rule) with an intuitive and easy-to-implement ordering policy. This characterization also enables us to conduct various comparative statics analyses, which would be challenging with Reiman and Wang’s verification lemma. Moreover, we show that our policy is asymptotically optimal in the high-demand region. This, in turn, helps us construct effective heuristic policies. The concept of a CBS policy is closely related to the BBS policy, shown by [28] to be the optimal replenishment policy for a single product assembly system. [6] provide an alternative proof using a lower bound argument. Various authors have studied a single-item system with multiple demand classes. Assuming periodic review and leadtime equals the review cycle, and assuming all previous backorders are cleared at each review point, [36] shows that a base-stock policy is optimal for replenishment and a rationing policy is optimal for stock allocation. In continuous review systems, assuming a make-to-stock production facility and exponentially distributed production time, [10] shows that a base-stock policy is optimal for the production decision and a rationing policy is optimal for stock allocation for lost sales systems. For backlogging systems with two demand classes, [11] shows that the optimal production policy is still base–stock-type and that the optimal stock allocation policy follows a monotone switching curve. [38] extend [11] to backlogging systems with multiple demand classes and further show that the optimal allocation is a rationing policy. Assuming a constant leadtime and economies of scale in replenishment, [1, 7] propose a (Q, r) policy for replenishment and PR for stock allocation. In their concluding remarks, [7] also suggest using the same policy for managing the common component inventory in the ATO environment. We also study a heuristic that employs the same allocation policy. The asymptotic studies by [22–24] on high-volume demand ATO systems consider different settings from ours: Each component is produced by a production facility with finite capacity and the production times are random. The decision variables include the component production capacities, assembled product production sequence, and product prices. Once a component’s production capacity is chosen, the production facility produces the component at full capacity, so there is no inventory decision. Nevertheless, we adopt a similar approach as these authors (i.e., restricting allocation decisions to discrete points) for our system with constant 621 leadtimes and asymmetric costs. We derive asymptotically optimal replenishment and allocation policies under the highvolume demand regime. Recently, [39] consider a system similar to that in [23] and derive asymptotically optimal continuous-review component allocation policies. 3. MODEL AND PRELIMINARIES In this section, we first present the model setting. We then provide an outline of our approach to establishing the optimality results. 3.1. Model Setting For convenience, we introduce the notation for a general continuous-review ATO system so that both N- and W-systems can be specified as special cases. The system has multiple components and multiple products. Each product uses several components. Components are indexed by i ∈ I = {1, 2, · · · , m}, and products are indexed by K, a subset of I, if it requires one unit of each component in K and none in I \ K. Let K be the set of all products. We use subscripts i to index the quantities related to component i, and superscripts K to index the quantities related to product K. For each component i, let Ki be the set of products that require component i. For any t ≥ 0, i ∈ I and K ∈ K, denote DK (t) = cumulative demand of product K in (0, t], Di (t) = DK (t) = cumulative demand for K∈Ki component i in (0, t], For any process {X (t), t ≥ 0}, denote X (t − a, t] = X (t) − X (t − a), 0 ≤ a < t, X (t − a, t] = X (0, t], 0 < t < a, X (t) = X (0), t < 0. We assume that the demand process (DK (t), K ∈ K), t ≥ 0 is stationary and has independent increments. This implies that the component-i demand process {Di (t), t ≥ 0} is also stationary and has independent increments. We assume continuous demand. A similar analysis can be carried through to the discrete demand case with the derivative operation replaced by the difference operation. For each component i, let L i be the replenishment leadtime for component-i, a positive constant. Denote L = min {Li : i ∈ I} and i = Li − L. Let Di (t) = Di (t − Li , t] = leadtime demand of component i (ending at time t), Naval Research Logistics DOI 10.1002/nav 622 Naval Research Logistics, Vol. 62 (2015) Di (t) = Di (t − i , t], Di = a generic random variable having the same distribution as Di (L) with pdf ψi (·), cdf i (·)and ccdf i (·), Di = a generic random variable having the same distribution as Di (i ). For each product K, let D K (t) = DK (t − L, t], B K (t, π ) = B K (0) + DK (t) − AK (t), I Pi (t, π ) = I Pi (0) + Oi (t) − Di (t), where I Pi (0) is the initial inventory position. Without loss of generality, we assume I Pi (0) = Ii (0) and B K (0) = 0. The net inventory of component i can be calculated from the state variables as follows: Ii (t, π ) − B K (t, π ) = Ni (t, π ), i ∈ I. (2) K∈Ki D K = a generic random variable having the same distribution as DK (L) ψ(·) = the joint pdf of(D K , K ∈ K). We now describe the state of the system and the control policies. Let Ii (0) and B K (0) be the initial on-hand inventory for component-i and initial backorders for product-K, respectively. For any t ≥ 0, define Oi (t) = cumulative replenishment orders in[0, t], with Oi (0) = 0, AK (t) = total product K demands satisfied in [0, t], with AK (0) = 0, Ft = DK (s), 0 ≤ s ≤ t, K ∈ K . 1 (Admissible Policy). A policy π = DEFINITION (Oi (t), AK (t)), t ≥ 0 is called admissible if it satisfies, for any t > 0, (i) Oi (t) and AK (t)are jointly determined by the system’s available information until time t, Ft , K (ii) A (t) ≤ Ii (0) + Oi (t − Li ), K∈Ki K (iii) A (t) ≤ B K (0) + DK (t). The set of all admissible policies is denoted as A. Any process that satisfies the property in Definition 1 is called Ft -adaptive. We call {(Oi (t), i ∈ I), t ≥ 0} a replenishment policy or process, and {(AK (t), K ∈ K), t ≥ 0} an allocation rule or process. For any π = {(Oi (t), AK (t)), t ≥ 0} ∈ A, let Ii (t, π ) = on − hand inventory of component i at time t, B K (t, π ) = backorders for product K at time t, I Pi (t, π ) = inventory position of component i, Let hi be the unit holding cost rate of component-i and bK be the unit backorder cost rate of product-K, the total inventory cost at time t is C(t, π ) = hi · Ii (t, π ) + bK · B K (t, π ), (3) i Our objective is to find an admissible π ∈ A to minimize the long-run average expected total inventory cost T 1 lim sup E C(t, π )dt . (4) T →∞ T 0 Let C ∗ be the optimal system cost. Our optimization problem can be stated as follows: T 1 (P1) C ∗ = min lim sup E C(t, π )dt . π ∈A T →∞ T 0 A policy π ∗ that achieves the above minimum is an optimal policy. The focus of this article is to study two special but important building blocks of the ATO system. In particular, in Section 3-5, we study the N-system, namely, I = {1, 2} and K = {{1} , {1, 2}}. In Appendix, we show that our analysis is applicable to the W-system, that is, I = {1, 2, 3} and K = {{1, 2} , {1, 3}}. For any real number x, we denote x + = max {x, 0} and − x = max {−x, 0}. 3.2. The system dynamics are: Ii (t, π ) = Ii (0) + Oi (t − Li ) − K∈Ki Naval Research Logistics DOI 10.1002/nav AK (t), Sketch of the Main Approach Because our approach to the optimality proof is nonstandard and takes several steps, to facilitate understanding of the development in the rest of the article, we now provide a sketch of the main idea by using the N-system. Let C(t) denote a sample-path lower bound, that is, after ordering at time t, Ni (t, π ) = I Pi (t − Li , π) − Di (t) = net inventory of component i at time t. (1) K minC(t, π ) ≥ C(t) (P2) π ∈A Note that and T 0 C(t, π )dt ≥ T 1 C ∗ ≥ lim sup E T T →∞ 0 for all t ≥ 0. C(t)dt for all π ∈ A, T > 0 0 T C(t)dt ≡ C. Lu, Song, and Zhang: Optimal and Asymptotically Optimal Policies Thus, a sample-path lower bound generates a lower bound on the expected cost, which, in turn, yields a lower bound on the optimal cost. If the expected lower bound E[C(t)] is attainable by a policy in A (this attainability and (3) imply that C(t) is nonnegative with probability one), in view of E T C(t)dt = 0 STEP 3: Assuming cost symmetry and taking the expected value of the above sample-path lower bound, we obtain a lower bound on the expected system cost: E[C(t, π )] ≥ E[ϕ(N1 (t, π ), N2 (t, π ))] = E[ϕ(I P1 (t − L1 , π ) − D1 (t), T I P2 (t − L2 , π ) − D2 (t))], E[C(t)]dt 0 (by Fubini’s theorem, see p. 518 of [13]), then this policy must be optimal. Recall that N i is the component net inventory defined in (1). To characterize the optimal policy, our approach is to first develop a conditional sample-path lower bound and identify conditions under which a structured allocation policy achieves this conditional lower bound for all t. Given the current inventory status (N1 (t, ·) and N2 (t, ·)), this step essentially finds an optimal allocation rule. Next, we restrict our attention to this optimal allocation policy and proceed to characterize the structure of a replenishment policy that achieves the expected lower bound E[C(t)] = E{E[C(t)|Ni (t, ·), i = 1, 2]} for all t. More specifically, we take the following steps: STEP 1: For any given policy π ∈ A, we analyze how it affects the cost C(·, π) on each sample path. This involves state variable decomposition and reexpressing the cost function and system dynamics; see Section 4.1. STEP 2: For any given t, applying the results in Step 1 and conditioning on the net component inventory Ni (t, ·) = ui , i = 1, 2, we obtain a linear program which yields a conditioned sample-path lower bound ϕ(u1 , u2 ) = min{C(t, π) : π ∈ Â(t, u1 , u2 )}, where Â(t, u1 , u2 ) = {π : Ni (t, π) = ui , i = 1, 2} . Thus, for any given t, C(t, π ) ≥ ϕ(N1 (t, π), N2 (t, π)), for all π ∈ A We then show that under cost symmetry (Condition 1 defined later), the NHB allocation rules can achieve this (conditioned) lower bound for all t, and are, thus, optimal. However, without the cost symmetry, we are not able to identify any allocation policy of simple structure to attain the lower bound for all t. This is accomplished in Section 4.2. 623 t ≥ 0, (5) where the last equality follows from (1). Using conditional expectations and the tower property of the conditional expectations, we identify the structure of the replenishment policy [in terms of I P1 (t, ·) and I P2 (t, ·)] that minimizes (5) over all π ∈ A. This completes the characterization of the optimal policy π ∗ that minimizes the expected cost (4), under cost symmetry; see Section 5.1. STEP 4: Without cost symmetry, we construct two bounding systems with symmetric costs and then use the optimal policies for these systems to suggest several heuristic policies for the original problem. 4. N-SYSTEM: SAMPLE PATH RESULTS 4.1. Sample-Path State Decomposition For any π ∈ A, without loss of generality, we assume that replenished inventory units will be used according to the sequence of their arrivals, that is, according to the firstavailable-first-use rule. Also, the demands for each product will be satisfied on a FCFS basis. This assumption is supported by an observation made by [20]. These authors show that any allocation rule can be modified in the above described fashion without changing the number of backorders of each product or the on-hand inventory of each component. With this assumption, the product-specific component-2 is assigned (or committed) to the product-12 demands on the FCFS basis. Similar to [20], we decompose the on-hand inventory and backorders at time t as follows: • Y2 (t, π ): on-hand inventory of product-specific component-2 not yet assigned to a demand of product12; • B212 (t, π ): backorders of product-12 due to missing component-2, including the backorders missing both components 1 and 2; • B112 (t, π ): backorders of product-12 due to missing only the common component-1, either because component-1 is missing or because it is committed to product-1 according to π . Note that I2 (t, π ) = Y2 (t, π ) + B112 (t, π ), (6) Naval Research Logistics DOI 10.1002/nav 624 Naval Research Logistics, Vol. 62 (2015) B 12 (t, π) = B112 (t, π) + B212 (t, π), (7) Y2 (t, π) × B212 (t, π) = 0. (8) + − It is easy to verify that v1 + v2 = (u− 1 − u2 ) at optimality. Stating the optimal solution of the linear program (P3) in the original problem data, we obtain: Thus, (3) can be reexpressed as C(t, π ) = h1 I1 (t, π) + h2 Y2 (t, π) + b1 B 1 (t, π) (1) + b12 B212 (t, π) + (h2 + b12 )B112 (t, π), (9) and (2) becomes i. If b1 < b12 + h2 , then I1 (t, π ) − B (t, π ) − 1 PROPOSITION 1: If there exists an admissible policy π̂ such that the conditional sample-path lower bound given by the linear program (P3) can be achieved, it has the following sample-path properties. B112 (t, π) − B212 (t, π) = N1 (t, π ), (10) Y2 (t, π ) − B212 (t, π ) = N2 (t, π). (11) Combining (6)–(8) and (10)–(11) yields + Y2 (t, π ) = [N2 (t, π)] , B212 (t, π) + B 1 (t, π̂ ) = ([N1 (t, π̂ )]− − [N2 (t, π̂ )]− ) , (15) B112 (t, π̂ ) = 0, (16) C(t, π̂ ) = h1 N1 (t, π̂ ) + h2 N2 (t, π̂ ) − = [N2 (t, π)] , (12) I1 (t, π ) = B 1 (t, π) + B112 (t, π) + N1 (t, π) + [N2 (t, π )]− . (13) Furthermore, from the nonnegativity of I1 (t, π) and (13), we obtain + B 1 (t, π ) + B112 (t, π) ≥ ([N1 (t, π)]− − [N2 (t, π)]− ) . (14) + (h1 + h2 + b12 )[N2 (t, π̂ )]− + (h1 + b1 )([N1 (t, π̂ )]− − [N2 (t, π̂ )]− )+ . ii. If b1 = b12 + h2 , then B 1 (t, π̂ ) + B112 (t, π̂ ) + = ([N1 (t, π̂ )]− − [N2 (t, π̂ )]− ) , (17) Thus, a sample-path lower bound can be generated by (P2 ) C(t) = min C(t, π) subject to (12)– (14). π ∈A 4.2. Sample-Path Lower Bound: A Linear-Programming Approach C(t, π̂ ) = h1 N1 (t, π̂ ) + h2 N2 (t, π̂ ) + (h1 + h2 + b12 )([N1 (t, π̂ )]− ∨ [N2 (t, π̂ )]− ). (18) iii. If b1 > b12 + h2 , then C(t, π̂ ) equals (18) and For any fixed t, let B 1 (t, π̂ ) =0, (v1 , v2 , v3 , v4 , v5 ) = (B 1 + (t, π ), B112 (t, π), I1 (t, π), Y2 (t, π), B212 (t, π )). Conditioning on (N1 (t, π), N2 (t, π)) = (u1 , u2 ), the system equations with these two variables fixed can be expressed as a polyhedron and Problem (P2 ) can be expressed as the following linear program: ϕ(u1 , u2 ) = (P3) = ⎧ ⎪ min ⎪ ⎪ ⎪ ⎪ ⎪ ⎨s.t. ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎩ min {C(t, π)} π ∈Â(t,u1 ,u2 ) b1 v1 + (h2 + b12 )v2 + h1 v3 + h2 v4 + b12 v5 v3 − v1 − v2 − v5 = u1 , v4 = u+ 2, − v 5 = u2 , vi ≥ 0, i = 1, · · · , 5. Naval Research Logistics DOI 10.1002/nav B112 (t, π̂ ) = ([N1 (t, π̂ )]− − [N2 (t, π̂ )]− ) . iv. A necessary condition for an admissible π̂ ∈ A to be sample-path optimal is (17). Proposition 1 (iv) indicates that, at sample-path optimality, the lower bound in (14) is tight. Thus, from (12) and (17), if a sample-path optimal admissible π̂ exists, it will yield the lowest possible total number of backorders under π̂ for every sample path: Bmin (t, π̂ ) = B 1 (t, π̂ ) + B112 (t, π̂ ) + B212 (t, π̂ ) = [N1 (t, π̂ )]− ∨ [N2 (t, π̂ )]− . (19) Next, we identify the type of policy that leads to (19). Lu, Song, and Zhang: Optimal and Asymptotically Optimal Policies 4.3. NO-Holdback Policies The following definition is a generalization of the NHB allocation policy introduced by [20]. 2: We say that an admissible policy π = DEFINITION (Oi (t), AK (t)), t ≥ 0 ∈ A is NHB if the following conditions are satisfied for all t ≥ 0, B K (t, π ) × min {Ii (t, π), i ∈ K} = 0, K = 1, 12. (20) When K π is NHB, we also call the allocation rule A (t), t ≥ 0 an NHB allocation rule. Thus, under an NHB policy, a demand is backordered if and only if there is no on-hand inventory of at least one of its components. Obviously, for any given replenishment policy {Oi (t), t ≥ 0}, (20) prescribes an allocation rule at t (even though it leaves some degree of freedom as for how exactly the backorders should be satisfied), which justifies the last statement of the definition. It is straightforward but tedious to prove the following result: PROPOSITION 2: For any Ft -adaptive replenishment policy {(O1 (t), O2 (t)), t ≥ 0}, there exists a Ft -adaptive allocation process {(A1 (t), A12 (t)), t ≥ 0} such that {(Oi (t), AK (t)), t ≥ 0} is NHB. REMARK 1: The above definition of the NHB policy can be generalized to the general ATO system defined in Section 3. Proposition 2 also holds. Let N be the set of all NHB policies. We have LEMMA 1: For any π ∈ N , I1 (t, π ) × (B 1 (t, π) + B112 (t, π)) = 0, policy π̂ . This result relies on the following cost symmetry condition: CONDITION 1: The cost parameters in the N-system satisfy b1 = b12 + h2 = c. This condition indicates that, backlogging a product-1 demand incurs the same cost as backlogging a product-12 demand while holding one unit of component-2 in inventory. Therefore, when component-2 is available, there is no immediate cost difference for allocation of a unit of component-1 among different product-demands. We can either use this unit to satisfy a product-1 demand (and hence backlog a unit of product-12, incurring cost b12 + h2 ), or use it to satisfy a product-12 demand (and hence backlog a product-1, incurring cost b1 ). This cost structure is reasonable in certain situations in practice as discussed in the Introduction. 4.3.2. Asymmetric Costs The next question is, is the sample-path cost lower bound given by Proposition 1 achievable when Condition 1 does not hold? Consider, for example, b1 < b12 + h2 . This condition implies that, to reduce cost, whenever component-2 inventory is available, we should give product-12 higher priority when allocating component-1. This is exactly what the solution (16) in Proposition 1 (i) indicates: under a sample-path optimal policy π, we should never backlog product-12 only because of a shortage of component-1. As a result, Proposition 1 (i) implies that I1 (t, π ) ≥ Y2 (t, π ) for any t. (Otherwise, if there exists t 0 such that I1 (t0 , π ) < Y2 (t0 , π ), on the event when at least I1 (t0 , π ) + 1 units of demand of product-12 have arrived and nothing else has changed, there would be at least one unit of backlog of product-12 due to stockout of component-1). Plugging (15) and (16) into (13), it follows from (11) that + ([N1 (t, π )]− − [N2 (t, π )]− ) + N1 (t, π ) + [N2 (t, π )]− + I1 (t, π ) = (N1 (t, π ) + [N2 (t, π)]− ) , + B 1 (t, π ) + B112 (t, π) = ([N1 (t, π)]− − [N2 (t, π)]− ) . Lemma 1 shows that the NHB policies satisfy the necessary condition (17), and hence, they lead to the lowest total number of backorders (19) on every sample path. These results are extensions of those in [16]. While these authors derive the total number of backorders under any given IBS replenishment policy and NHB allocation rule, our results apply to any Ft -adaptive policy that is NHB. In addition, we obtain an exact expression for the on-hand inventory of the common component as well. 4.3.1. 625 Symmetric Costs Lemma 1 also shows that the sample-path lower bound in Proposition 1 (ii) is achievable by an NHB admissible ≥ [N2 (t, π )]+ . This, in turn, implies either (a) [N1 (t, π )]− ≥ [N2 (t, π )]− and [N1 (t, π )]+ ≥ [N2 (t, π )]+ , or (b) N1 (t, π ) ≥ [N2 (t, π )]+ − [N2 (t, π )]− for any t. However, recall from (1) that Ni (t, π ) (before allocation) is controllable only through I Pi (t − Li , π ) by replenishment orders before time t − Li . If D1 (t − L1 , t] turns out to be big, there is no way we can adjust N 1 to guarantee (a) or (b). Therefore there does not exist a Ft -adaptive allocation policy such that (15) and (16) hold. Consequently, we conclude that, in this case, there does not exists a sample-path myopic allocation policy such that the sample-path cost lower bound is achievable. Similarly, we can argue that no matter which kind of replenishment policy and allocation rule are used, we cannot always keep B 1 (t, π ) = 0. The reason is the same as the Naval Research Logistics DOI 10.1002/nav 626 Naval Research Logistics, Vol. 62 (2015) above: we cannot control D 1 (t − L1 , t]. Hence, the solution in Proposition 1 (iii) is not implementable. To summarize, we have: THEOREM 1 (Lower Bound’s Attainability) Assume that the component net inventory is given for any time t. i. Under Condition 1, there exists an NHB policy such that the conditional sample-path lower bound given by the linear program [P3; see Proposition 1 (ii)] can be achieved. ii. When Condition 1 does not hold, no policy can achieve the conditional sample-path lower bound given by the linear program [P3; see Proposition 1 (i) and (iii)]. 5. SYMMETRIC COSTS: EXACT OPTIMAL POLICY In this section we assume Condition 1 holds. We first characterize the form of the optimal policy and then discuss the properties of the optimal policy. 5.1. Form of Optimal Policy Lemma 1 shows that the conditional sample path lower bound of Proposition 1 (ii) is achievable by an NHB policy π̂ . Because the NHB allocation rules depend on the system state at time t only (after receiving the replenishment shipment), the next step is to see whether we can identify a type of replenishment policy that can minimize E[ϕ(I P1 (t −L1 , π )−D1 (t), I P2 (t −L2 , π)−D2 (t))] among all π̂ ∈ N (see (5)). Note that the best an admissible replenishment policy can do to influence the net inventory of component i at t is through an order decision at t −Li so to increase the inventory position I Pi (t − Li , π ), using the latest possible information Ft−Li . The problem then becomes how to synchronize the replenishment decisions of both components. For ease of presentation, we divide our analysis into two cases: (SL) Product-specific component with longer leadtime: L2 = L + ≥ L1 = L; (CL) Common component with longer leadtime: L1 = L + > L2 = L. A special case of (SL) is (IL) Identical leadtimes: = 0, L1 = L2 = L. First we look at SL case, L2 ≥ L1 . Let G(y1 , y2 ) = h1 E(y1 − D1 ) + h2 E(y2 − D2 ) + (c + h1 )E((D1 − y1 )+ ∨ (D2 − y2 )+ ), (21) Naval Research Logistics DOI 10.1002/nav which is the expected inventory (i.e., holding and backorder) cost at time t, given the inventory position of component-i at t − Li is yi , i = 1, 2. Note that, because the Di have densities, G is C 2 , twice continuously differentiable. LEMMA 2: G(y1 , y2 ) is L convex. That is, (i) it is jointly convex and submodular, and (ii) its Hessian matrix is a diagonally dominant M-matrix. Note that after the order decision for component-2 is made at t − L2 , by the time we make an order decision on component-1 at t − L1 , some random demand has occurred in the time interval (t − L2 , t − L1 ] = (t − L − , t − L], which should be taken into account. Observe that I P2 (t − L − , π ) − D2 (t) = I P2 (t − L − , π ) − D2 (t − L) − D2 (t − L, t]. Because Ft−L− ⊆ Ft−L , by the tower property of conditional expectations and Proposition 1 (ii), E[C(t, π )|Ft−L− ] = E {E[C(t, π )|Ft−L ]|Ft−L− } ≥ E{E[ϕ(I P1 (t − L, π ) − D1 (t), I P2 (t − L − , π ) − D2 (t))|Ft−L ]|Ft−L− } = E[G(I P1 (t − L, π ), I P2 (t − L − , π ) − D2 (t − L))|Ft−L− ] = E{E[G(I P1 (t − L, π ), I P2 (t − L − , π ) − D2 (t − L))|Ft−L ]|Ft−L− }. (2) (22) Our objective is to find π ∈ A to minimize (22). Noting that I P1 (t − L, π ), I P2 (t − L − , π ) and D2 (t − L) are measurable to Ft−L , we can first find y1 to minimize G(y1 , I P2 (t − L − , π ) − D2 (t − L)). (23) Denote this minimizer by s̃1 (I P2 (t − L − , π ) − D2 (t − L)). Because I P2 (t − L − , π ) is measurable to Ft−L− and D2 (t − L) is independent of Ft−L− , we can find y2 to minimize E[G(s̃1 (y2 − D2 (t − L)), y2 − D2 (t − L))]. (24) As each step of (22)–(24), the construction reaches the lower bound. A more rigorous proof is given in the Appendix. This analysis establishes the optimality of a CBS replenishment policy of the form (s̃1 (·), s̃2 ), where s̃2 is a constant and s̃1 (·) is a real function. The policy works as follows: the product-specific component-2 follows a standard base-stock policy with base-stock level s̃2 . The common component-1 follows a state-dependent base-stock policy with base-stock level s̃1 (·) that depends on the demand realization in the last Lu, Song, and Zhang: Optimal and Asymptotically Optimal Policies time periods. That is, for any time t > 0, keep the component2 inventory position at s̃2 and the component-1 inventory position at s̃1 (s̃2 − D2 (t − , t]). ∗ , and s1 (y) be determined by More specifically, let s1∗ , s12 c c ∗ = , Pr(D 1 + D 12 ≤ s12 )= , Pr(D ≤ c + h1 c + h1 (25) h1 1 12 1 s1 (y) = min x : Pr(D + D ≥ x, D ≥ x − y) = . c + h1 (26) 1 s1∗ ) As for any given y, Pr(D 1 + D 12 ≥ x, D 1 ≥ x − y) is decreasing in x, s1 (y) is well-defined. Let ⎧ ∗ ⎪ ⎨y + s1 , y ≤ 0, ∗ s̃1 (y) = s1 (y), (27) 0 < y ≤ s12 , ⎪ ⎩ ∗ ∗ s12 , y > s12 , G̃(y) = E[G(s̃1 (y − D2 ), y − D2 )], s̃2 = argminy G̃(y). (28) (29) ∗ ]. We have Clearly, y ≤ s̃1 (y) ≤ y + s1∗ for any y ∈ (0, s12 LEMMA 3: Assume SL. (i) For any fixed y, s̃1 (y) given by (27) minimizes G(·, y). Moreover, s̃1 (y) is increasing and continuous, and 0 ≤ d s̃1 (y)/dy ≤ 1. (ii) G̃(y) defined by (28) is convex. Thus, its minimal point s̃2 is unique. LEMMA 4: Assume SL. Suppose that the initial system satisfies I P2 (0) = s̃2 and I P1 (0) = s̃1 (s̃2 ). Then, under the admissible policy π̃ determined by the CBS replenishment policy (s̃1 (·), s̃2 ) and the NHB allocation rule, the system is fully coordinated in the following sense : for any t ≥ 0, which is determined by Pr(D 1 + D 12 ≥ s̃1 (s̃2 ), D 1 ≥ s̃1 (s̃2 ) − s̃2 ) = Pr(D 1 ≤ s̃1 (s̃2 ) − s̃2 , D 12 ≥ s̃2 ) = G̃(y) = minG(y1 , y). h2 . c + h1 h2 , c + h1 s2 (y) = min x : Pr(D1 (L1 ) ≤ y − x, h2 D 12 ≥ x) = . (32) c + h1 Pr(D1 (L1 ) ≤ s2∗ ) = Let s̆2 (y) = y − s2∗ , y ≤ s2∗ , y > s2∗ , s2 (y), G1 (y1 , y2 ) = EG(y1 − D 1 , y2 ), Ğ(y) = E[G (y − D2 , s̆2 (y − D2 ))], 1 y1 s̆1 = argminy Ğ(y). Hence, we have (s̃1 (s̃2 − D2 (t − , t]), s̃2 ) = (s̃1 (s̃2 ), s̃2 ) = argminG(y1 , y2 ), y1 ,y2 (31) Obviously, a CBS policy requires more information than an IBS policy. It requires recording the demand for product12 in the last interval (t − , t]. Because component-1 serves product-1 and product-12, at any time t, the inventory of component-1 will be updated in such a way that its available net inventory will be larger than the available amount of component-2 at time t + L1 , that is, s̃1 (s̃2 − D2 (t − , t]) ≥ s̃2 − D2 (t − , t] (see Lemma 3). When the leadtimes are identical, a CBS policy reduces to an IBS policy. In particular, when the net inventory position of component-2, ∗ s̃2 −D2 (t −, t], is higher than s12 , we order component-1 up ∗ to s12 , which serves both products. When component-2 will definitely be in shortage (s̃2 − D2 (t − , t] ≤ 0), we raise the inventory position of component-1 higher than component2’s, with amount s1∗ , to serve the demand for product-1. When component-2 is not definitely in shortage but there is not a sufficiently large inventory of it, we order component-1 in a way that balances between these two extremes. For the CL case, following a similar line of analysis as in Theorem 2, we can show that a similar type of policy is optimal. Define Based on the above two lemmas, we can prove REMARK 2: In the case of IL, (28) degenerates into h1 , (30) c + h1 Denote s̄1 = s̃1 (s̃2 ), s̄2 = s̃2 as the solution of (30)–(31), the IBS policy with base-stock levels (s̄1 , s̄2 ) and an NHB allocation rule are optimal among all admissible policies A. The minimum long-run average inventory cost is C ∗ = G(s̄1 , s̄2 ). I P1 (t, π̃ ) = s̃1 (s̃2 − D2 (t − , t]), I P2 (t, π̃) = s̃2 . THEOREM 2 (Optimal Policy SL). Assume Condition 1. When the common component has a shorter leadtime (L1 < L2 ), the optimal admissible policy constitutes a CBS replenishment policy with parameters (s̃1 (·), s̃2 ) and an NHB allocation rule. The minimum long-run average inventory cost is C ∗ = G̃(s̃2 ). 627 (33) (34) (35) (36) It is not difficult to see that s̆2 (y) ≤ y − s2∗ for any y. Different from Theorem 2, in order to make the minimizer (see Naval Research Logistics DOI 10.1002/nav 628 Naval Research Logistics, Vol. 62 (2015) (33), s̆2 (·)) an implementable policy, we need to filter out the demand information for product-1 in the last interval (t − , t], i.e., D 1 (see (34)). To make this possible, we need to assume independent product demands. Parallel to the SL case, we have the following results. LEMMA 5: Assume CL. (i) G1 (y1 , y2 ) is L convex. (ii) For any fixed y, s̆2 (y) given by (33) minimizes G1 (y, ·). In addition, s̆2 (y) is continuous, increasing, and 0 ≤ d s̆2 (y)/dy ≤ 1. (iii) Ğ(y) defined by (35) is convex. Thus, its minimal point s̆1 is unique. LEMMA 6: Assume CL. Suppose that the initial system satisfies I P1 (0) = s̆1 and I P2 (0) = s̆2 (s̆1 ). Under the admissible policy π̆ given by the CBS replenishment policy (s̆1 , s̆2 (·)) and the NHB allocation rule, the system is fully coordinated in the following sense: for any t ≥ 0, I ˘P 1 (t, π̆ ) = s̆1 , I ˘P 2 (t, π̆) = s̆2 (s̆1 − D2 (t − , t]). THEOREM 3 (Optimal Policy CL) Assume Condition 1. When the common component has a longer 1leadtime (L 1> L ), and the product demand processes D (t), t ≥ 0 and 2 12 D (t), t ≥ 0 are independent, the optimal admissible policy constitutes an NHB allocation rule and a CBS replenishment policy with parameters (s̆1 , s̆2 (·)) (namely, for any time t > 0, keep the component-1 inventory position at s̆1 and the component-2 inventory position at s̆2 (s̆1 −D2 (t −, t])). The minimum long-run average inventory cost is C ∗ = Ğ(s̆1 ). It is worth mentioning that the policy requires only recording the demand for product-12 in the last interval (t − , t]. Because component-2 only serves product-12, at any time t, the inventory of component-2 will be updated in such a way that its available net inventory will be less than the remaining inventory of component-1 after product-12’s depletion during (t − , t]. That is, s̆2 (s̆1 − D2 (t − , t]) ≤ s̆1 − D2 (t − , t] (see Lemma 5). In particular, this CBS policy implies that I P2 (t) and D1 (t − , t] are independent for any t. REMARK 3: The optimal CBS policies in Theorems 2, 2 reduce to the optimal BBS policy for the corresponding assembly system when D1 (t) ≡ 0. The N-system is degenerated into a standard single-item inventory model consisting of component-1 if D12 (t) ≡ 0, and the optimal CBS policy reduces to the standard base-stock policy. 5.2. Properties of the Optimal Replenishment Policy We next investigate how the optimal policy parameters change in response to changes in demand andleadtimes. We assume demand processes D1 (t), t ≥ 0 12that the product and D (t), t ≥ 0 are independent. In particular, we show Naval Research Logistics DOI 10.1002/nav how the demand parameters and leadtime impact the optimal policies, that is, (a) (s̄1 , s̄2 ) under IL (Remark 2); (b) (s̆1 , s̆2 (·)) under CL (Theorem 2); and (c) (s̃1 (·), s̃2 ) under SL (Theorem 2). DEFINITION 3: A stochastic process Ŵ(t), t ≥ 0 is said to be stochastically larger than another stochastic process {W(t), t ≥ 0} if for any fixed t and x, Pr(W(t) > x) ≤ Pr(Ŵ(t) > x). Regarding the effect of demand variability, we have PROPOSITION 3: Assume Condition 1. (i) When the demand process for product-12 becomes stochastically larger, the optimal base-stock levels under IL and the CBS-levels under both SL and CL increase. That is, s̄1 , s̄2 , s̆1 , s̃2 , s̆2 (y) and s̃1 (y) increase for any fixed y; (ii) When the demand process for product-1 becomes stochastically larger, component-1’s base-stock levels under IL and CL, and CBS level under SL, that is, s̄1 , s̆1 and s̃1 (y) for any fixed y, respectively, increase, while the CBS level for component-2 under CL, s̆2 (y), decreases for any fixed y. The above proposition shows that we should keep more inventory of both components when the demand process for product-12 is stochastically larger, which is intuitive. Similarly, a stochastically larger demand process for product-1 pushes up the optimal inventory position of component-1. However, less intuitively, when component-1 has a longer leadtime (CL), a stochastically larger demand process for product-1 actually pulls down the optimal inventory position of component-2. This can be explained as follows. Recall that one unit of component-2 needs to match with one unit of component-1 to fulfill one unit of product-12. But a larger demand process product-1 will consume more of component1, leaving less component-1 for product-12, so it is optimal to lower component-2’s inventory to save the inventory holding cost without changing the backorder cost. The next result reveals the impact of the leadtimes and the leadtime difference. PROPOSITION 4: Assume Condition 1. (i) When increases, s̆1 and s̃2 increase, and there is no effect on s̃1 (y) and s̆2 (y) for any fixed y; (ii) When L increases, s̄1 and s̃1 (y) increase for any fixed y. According to the above proposition, a bigger leadtime difference will always push up the optimal inventory positions for both components. This is because a larger implies a larger leadtime demand for the component with a longer leadtime and bigger demand realization in the time interval of length . The latter fact affects the component that has a shorter leadtime indirectly. However, does not affect Lu, Song, and Zhang: Optimal and Asymptotically Optimal Policies 6. 629 ASYMMETRIC COSTS: ASYMPTOTICALLY OPTIMAL POLICY In this section, we consider systems with asymmetric costs. That is, Condition 1 may not hold. Recall that in this case there is no simple exact optimal policy, so our goal is to characterize an asymptotically optimal policy as the demand volume grows large. Following the notation and procedures in [2, 22, 23], we consider a sequence of the N-systems, indexed by n, in which all problem parameters remain the same as before, but the product demand rates are linear in n. All processes in the nth system are superscripted by n. In particular, the product-K demand rate in the nth system is EDK,n (1) = nλK , K ∈ {1, 12} . Figure 2. Effect of L on s̆2 (y). DEFINITION 4: An admissible policy π̂ is asymptotically optimal if for any T > 0 and π ∈ An the CBS levels s̃1 (·) and s̆2 (·). Thus, to serve the additional demand resulting from the increased , it is optimal to increase the inventory of the component that has a longer leadtime. When L increases, the optimal common-component inventory increases under IL and SL, but the optimal inventory level for the product-specific component can increase or decrease. The reason is as follows: a larger L increases leadtime demand for both products. Conversely, it seems reasonable to raise the inventory of component-2 to serve the increased demand of product-12. We call this the first effect. Conversely, it may be wise to lower the inventory of component-2 because, due to increased demand for product1, fewer of component-1 may be left for product-12. We call this the second effect. Whether L pulls down or pushes up the optimal inventory of component-2 depends on which of the two effects dominates, as shown in Fig. 2. When inventory of component-1 is low, the second effect dominates as a result, a larger L reduces the inventory of component-2. On the contrary, when the inventory of component-1 is high, the first effect dominates, and a larger L pushes up the inventory of component-2. The results in this section are summarized in Table 1, in which each row reports the impact of changing the corresponding variable unilaterally. We note that a stochastically larger demand implies a larger mean demand. It may imply a higher demand variability in some cases but not necessary for all case. It would be interesting to investigate the effects of demand variability with mean demands fixed. These effects, however, are much more complex to characterize, because they involve the relative magnitudes of the demand, cost, and leadtime parameters. This is even true for a single-item inventory system, in which the relative magnitude of cost parameters matters; see, for example, [29]. Therefore, we leave such a study to future research. 1 lim sup √ E n n→∞ 0 T ⎛ 2 ⎝ hi · Iin (t, π̂ ) + i=1 1 ≤ lim sup √ E n n→∞ 0 T ⎞ b ·B K K,n (t, π̂ )⎠ dt K∈{1,12} ⎛ 2 ⎝ hi · Iin (t, π ) + i=1 ⎞ b ·B K K,n (t, π )⎠ dt. K∈{1,12} From the analysis in Section 5, we know that the component allocation and replenishment actions become more and more frequent as the demand rates increase. To detect these actions, we first equally partition time interval [0, ∞) into subintervals such that in each subinterval there is only one component allocation decision. Thus, similar to Proposition 1, we can again use the linear-programming approach to develop a lower bound on the system cost and identify an allocation rule to achieve this lower bound. Then, following the procedure described in Section 3.2, by leveraging the tower property of conditional expectations we can find the optimal component replenishment policy. n For any π = (Oi (t), AK,n (t)), t ≥ 0 ∈ An, K,n we first modify the allocation process A (t), t ≥ 0 into a discrete-review policy AK,n such that (t), t ≥ 0 n π = (Oin (t), AK,n (t)), t ≥ 0 ∈ A . The modified pol icy allocates components for products at review points l n , 2l n , 3l n , · · · with ⎛ ⎞2/3 1 ⎟ ⎜ ln = ⎝ ⎠ n (λ1 )2 + (λ12 )2 , and does not allocate any component at all other times. Consider any K ∈ {1, 12}. Initially, set AK,n (0) = 0 and K,n B (0) = 0. Then, the process is defined recursively by n K,n ( l n ) − BK,n ( l n ), AK,n ( l ) = D (37) Naval Research Logistics DOI 10.1002/nav 630 Naval Research Logistics, Vol. 62 (2015) Table 1. Summary of comparative statics on optimal replenishment policy IL (L1 = L2 = L) Demand-1 Demand-12 Leadtime L SL (L1 = L, L2 = L + ) CL (L1 = L + , L2 = L) Component-1 s̄1 Component-2 s̄2 Comp-1 s̃1 (·) Comp-2 s̃2 Comp-1 s̆1 Comp-2 s̆2 (·) – ? ? – – ? ? ? ? – Note: , , –, and ? stands for increasing, decreasing, unaffected, and unknown or nonuniversal, respectively. where BK,n ( l n ) solves Let min b1 B 1 + b12 B 12 + h1 I1 + h2 I2 (38) B K ≥0 s.t. I1 = In1 (0)+On1 ( ln −L1 ) − D1,n ( ln ) − D12,n ( ln )+B1 +B12 ≥ 0, (39) I2 = I2n (0) + O2n ( l n − L2 ) − D12,n ( l n ) + B 12 ≥ 0, (40) AK,n (( − 1)l n ) + B K ≤ DK,n ( l n ). (41) K,n n For t ∈ [ l n , ( + 1)l n ), define AK,n (t) = A ( l ). Then we know n π = (Oin (t), AK,n (t)), t ≥ 0 ∈ A . (42) For ease of exposition, we assume that the variances and covariance of D1,n ( n1 ) and D12,n ( n1 ) are independent of n, and denote 1 2 1,n 1 12 2 12,n 1 , (σ ) = V ar D , (σ ) = V ar D n n 1 1 2 (σ 1,12 ) = Cov D1,n − λ1 , D12,n − λ12 . n n 2+ε < We also assume (in this subsection) that E[DK,n ( n1 )] ∞, K ∈ {1, 12} for ε = 3. Let (1 , 12 ) be a bivariate normal random vector with zero-mean and covariance matrix = (ij ) where 2 11 = (σ 1 ) L, 2 22 = (σ 12 ) L, 2 12 = 21 = (σ 1,12 ) L. (43) K Also, let be a normal variable with zero-mean and vari2 ance (σ K ) . Denote 1 = 1 + 12 and 2 = 12 . The following is an analogous to (21): F (y1 , y2 ) = h1 E(y1 − 1 ) + h2 E(y2 − 2 ) + (h1 + h2 + b12 )E(2 − y2 )+ + + (h1 + b1 )E(1 − y1 − (2 − y2 )+ ) . Naval Research Logistics DOI 10.1002/nav θ1 (y) = min x : Pr(1 + 12 ≥ x, h1 1 ≥ x − y) = , h1 + b 1 (44) F̃ (y) = E[F (θ1 (y − 12 ), y − 12 )], θ̃2 = argminF̃ (y), (45) y θ2 (y) = min{x : h2 − (h1 + h2 + b12 )Pr(12 > x) + (h1 + b1 )Pr(1 ≤ y − x, 12 ≥ x) = 0}, F 1 (y1 , y2 ) = EF (y1 − 1 , y2 ), F̆ (y) = E[F 1 (y − 12 , θ2 (y − 12 ))], θ̆1 = argminF̆ (y). (46) y We have THEOREM 4 (Asymptotically Optimal Policy) Assume that b1 ≤ h2 + b12 . (i) (SL) When the common component has a shorter leadtime, the CBS replenishment policy √ I P1n (t) = (λ1 + λ12 )nL1 + nθ1 D12,n (t − , t] − λ12 n θ̃2 − , √ n √ I P2n (t) = λ12 nL2 + nθ̃2 , and the discrete-review NHB (myopic) allocation policy with the PBC rule defined by (37)–(41) are asymptotically optimal in An . (ii) (CL) When the common component has a longer 1,n leadtime and the product demand processes D (t), t ≥ 0 and D12,n (t), t ≥ 0 are independent, the CBS replenishment policy I P1n (t) = (λ1 + λ12 )nL1 + √ nθ̆1 , Lu, Song, and Zhang: Optimal and Asymptotically Optimal Policies √ I P2n (t) = λ12 nL2 + nθ2 D12,n (t − , t] − λ12 n θ̆1 − , √ n and the discrete-review NHB (myopic) policy with the PBC rule (37)–(41) are asymptotically optimal in An . (ii) (CL) When the common component has a longer 1,n leadtime and the product demand processes D (t), t ≥ 0 and D12,n (t), t ≥ 0 are independent, the CBS replenishment policy and a discrete-review NHB (myopic) policy with the PBC rule defined by (37)–(41) are asymptotically optimal in An . The above theorem shows that when costs are asymmetric and demand volume is large, a CBS-policy—a policy type identified in Section 5—continues to be optimal. For example, under SL, it is optimal to use a base-stock policy to control the product–specific component-2, and use a realtime CBS-policy, which is updated according to the latest demand information, to control the common component. The optimal policy of each component has two parts: the mean of leadtime demand for the component and a safety–stock which hedges against the demand volatility. This form of policy is intuitive and commonly seen in practice. Next, we look at the case b1 ≥ h2 + b12 . Define H (y1 , y2 ) = h1 E(y1 − 1 ) + h2 E(y2 − 2 ) 631 √ I P1n (t) = (λ1 + λ12 )nL1 + nη̆1 , √ I P2n (t) = λ12 nL2 + nη2 D12,n (t − , t] − λ12 n η̆1 − , √ n and a discrete-review NHB (myopic) policy with the PBC rule (37)–(41) are asymptotically optimal in An . We have, thus, shown that in the high-volume demand regime, the structure of the asymptotically optimal policy is essentially the same as that of the exact optimal policy for the symmetric cost case. + (h1 + h2 + b12 )E[(1 − y1 )+ ∨ (2 − y2 )+ ], η1 (y) = min x : Pr(1 + 12 ≥ x, 7. In this section, we first examine the effectiveness of the commonly used IBS policies for the symmetric cost case. Then, based on the optimal and asymptotically optimal policies given in Section 5 and 6, we develop several heuristic policies for the asymmetric cost case. To compare the effectiveness of these heuristic policies, we also construct bounds on the system cost. 1 ≥ x − y) = HEURISTICS h1 , h1 + h2 + b12 H̃ (y) = E[H (η1 (y − 12 ), y − 12 )], η̃2 = argminH̃ (y), y η2 (y) = min{x : h2 − (h1 + h2 + b12 ) Pr(1 ≤ y − x, 12 ≥ x) = 0}, 7.1. H (y1 , y2 ) = EH (y1 − , y2 ), 1 1 H̆ (y) = E[H 1 (y − 12 , η2 (y − 12 ))], η̆1 = argminH̆ (y). y (47) Then, we can show THEOREM 5 (Asymptotically Optimal Policy) Assume that b1 ≥ h2 + b12 . (i) (SL) When the common component has a shorter leadtime, the CBS replenishment policy I P1n (t) = (λ1 + λ12 )nL1 √ D12,n (t − , t] − λ12 n + nη1 η̃2 − , √ n √ I P2n (t) = λ12 nL2 + nη̃2 , Effectiveness of Independent Base-stock Policies for Symmetric Costs In this subsection, we assume Condition 1 holds and the NHB allocation rule is followed. In this case, with the form of the optimal replenishment policy known to be a CBS policy, it is interesting to learn under what situations the widely used IBS policy can perform well. First, we describe how to compute the optimal IBS levels for the CL (L1 > L2 ) case. The case of SL (L1 < L2 ) can be done analogously. Similar to Eq. (21), under the IBS policy with base-stock levels (y1 , y2 ), the long-run average system cost denoted by C bs (y1 , y2 ) is C bs (y1 , y2 ) = Eϕ(y1 − D1 (t), y2 − D2 (t)) = h1 · E(y1 − D1 − D1 ) + h2 · E(y2 − D2 ) + (c + h1 ) · E((D1 + D1 − y1 )+ ∨ (D2 − y2 )+ ). Naval Research Logistics DOI 10.1002/nav 632 Naval Research Logistics, Vol. 62 (2015) 7.2. Heuristic Policies for Asymmetric Costs In this subsection, we assume asymmetric costs. In this case, the form of the exact optimal policy is unknown, so we develop several heuristic policies and examine their effectiveness. Before presenting the heuristic policies, we first establish upper and lower bounds for the system cost, which will be used to assess the performance of the heuristic policies. To get these bounds, we construct two bounding systems that differ from the original one only in their cost structures. For convenience, denote the original system by S = S(b1 , b12 , h1 , h2 ). Define two systems S = S(b1 , b12 , h1 , h2 ) and S = S(b̄1 , b̄12 , h̄1 , h̄2 ), where h1 = h1 , Figure 3. Optimality loss of IBS policies (b1 = 4). This is Eq. (21) with D1 replaced by D1 + D1 . As a result, the optimal value of (y1 , y2 ) can be calculated as in Eqs. (30) and (31). The corresponding optimal long-run average cost is denoted by C ∗bs . Next, we report the results of a numerical study. The effectiveness of the IBS policy is measured by err = 100 × C ∗bs − C ∗ . C∗ In this study, we assume that the demand processes of product-1 and product-12 are Poisson with rates λ1 and λ12 , and keep h1 = 1, h2 = 1, b12 + h2 = b1 and (λ1 + λ12 ) × (L + ) = 60. We vary the other parameters as follows: b1 ∈ {2, 3, . . . , 10} , λ12 /λ1 ∈ {0.1, 0.5, 1, 2, . . . , 10} , /L ∈ {0.1, 0.6, . . . , 4.6, 5} . Thus, there are a total of 9×12×11 = 1188 scenarios. These results show that the IBS policy can perform very poorly. In the 1188 scenarios, compared with the CBS policy, the IBS policy results in an average 7.65% loss of optimality and a maximum loss of 33.87%. Fig. 3 demonstrates how the optimality loss of the IBS policy varies with system parameters such as leadtimes and demand rates. We observe that, the IBS policy performs fairly well when leadtimes are similar, that is, L, or when product-1 demand dominates, that is, λ12 λ1 . For example, the average optimality loss is only 1.57% when /L = 0.1 and is about 1.84% when λ12 /λ1 = 0.1. However, the optimality loss can be significant when the component leadtimes differ substantially or when λ12 λ1 . For instance, the average optimality loss is approximately 10.97% when /L = 5. Naval Research Logistics DOI 10.1002/nav h2 = b1 ∧ h2 , b1 = b12 + h2 = b1 ∧ (b12 + h2 ), h̄1 = h1 , h̄2 = h2 , b̄1 = b̄12 + h̄2 = b1 ∨ (b12 + h2 ). Clearly, both S and S satisfy Condition 1. From §5, the optimal policies for these systems are characterized by Theorems 2–3, with c = b1 ∧(b12 +h2 ) and c = b1 ∨(b12 +h2 ), respectively. Let the resulting minimum long-run average costs of S and S be C ∗ and C̄ ∗ . We have PROPOSITION 5: If b1 = b12 + h2 , the minimum longrun average cost C ∗ is bounded by those of systems S and S. That is, C ∗ ≤ C ∗ ≤ C̄ ∗ . In the following, we present five heuristic policies that are simple and easy to implement. The first three are constructed from the optimal policies for symmetric costs, and the other two are generated by the asymptotically optimal policies for asymmetric costs. Heuristic CN: CBS Replenishment Policy + NHB Allocation Rule. For the component replenishment, this heuristic adapts the CBS policy characterized by Theorems 2–3 for symmetric costs with the modified c = (b1 + b12 + h2 )/2 in place of (21). This choice of c reflects the desire to balance the backorder cost of product-1 and the sum of the product-12 backorder cost and the holding cost of the product-specific component. For component allocation, note that for any given Ft -adaptive replenishment policy (which includes the CBS policy), there are two ways to reduce the backorder costs through a NHB allocation rule, which is also Ft -adaptive and satisfies Definition 1 (ii–iii): (a) reduce the total number of backorders or (b) redistribute backorders among different products such that the product with higher penalty cost has a small number of backorders. It is shown by [20] that any NHB allocation rule would achieve (a) if an IBS replenishment policy is adopted. Furthermore, when there are backorders of both products, a PBC policy can achieve (b). For this reason, this heuristic uses the combination of NHB and PBC for component allocation. Lu, Song, and Zhang: Optimal and Asymptotically Optimal Policies Heuristic IS: IBS Replenishment Policy + Static Rationing Allocation Rule. This heuristic uses the simpler IBS replenishment policy with parameters (sIS1 , sIS2 ) given by §7.1. The allocation rule is a rationing policy with a “static” (i.e., fixed) reservation level r for the common component. Consider b1 < b12 + h2 , for example, the common component-1 goes to product-1 only when its on-hand inventory is higher than r; otherwise, the on-hand inventory of component-1 is reserved for product-12. Now we demonstrate how to determine r. By an analysis for M/M/1 make-to-stock queue, [8] (see their paper, P858, the right column, Section 3.3.3 Priority with Reservation Level) prove that when the holding cost is h and penalty cost is b, the optimal reservation + level is ln(h)−ln(b) , where ρ is a ratio between two product ln(ρ) demand rates. For N-system, when a common component is available but being held for satisfying the future product-12 demand while keeping product-1 demand backordered, the actual holding cost for the common component is h1 + b1 . Similarly, the backorder cost for product-12 is h1 + h2 + b12 . Thus, the level r developed by [8] should be modified as r= ln(h1 + b1 ) − ln(h1 + h2 + b12 ) ln(ρ) λ12 where ρ = 1 . λ + λ12 + , (48) Heuristic CD: CBS Replenishment Policy + Dynamic Rationing Allocation Rule. The static reservation level r in Heuristic IS may be too high when the system states such as Y2 (t) becomes very low. This is because when there are not many units of product-specific component available, it is not economical to reserve many common components to wait for the future product-12 demand arrivals while back ordering product-1. For this reason, this heuristic dynamically adjusts the reservation level r̃(t) according to the inventory level of the product-specific component. As we argued in Section 4.3, one natural choice is to choose the available on-hand inventory of component-2 (Y2 (t)) as the level. However, this choice is not optimal for symmetric costs. To overcome this issue, we assign an asymmetry weight to Y2 (t) and use r and ρ defined in (48) to set reservation level as follows r̃(t) = min {αY2 (t), r} , where α = (b12 + h2 − b1 ) · ρ · b + h2 . + h2 + b 1 12 b12 For component replenishment, this heuristic uses the CBS policy with the same parameters as in CN. It is not difficult to see that r = 0 and r̃(t) = 0 when b1 = b12 + h2 . Therefore, the allocation rules of all three heuristics so far are identical to the optimal policy under symmetric costs. 633 Heuristic AS: Asymptotically Optimal Replenishment Policy + Static Rationing Allocation Rule. This heuristic adopts the asymptotically optimal replenishment policy given by Theorems 4–5 and the same allocation rule as in Heuristic IS. Heuristic AD: Asymptotic Optimal Replenishment Policy + Dynamic Rationing Allocation Rule. This heuristic employs the replenishment policy as in Heuristic AS and and the allocation policy as in Heuristic CD. Table 2 below summarizes the allocation and replenishment policies of these five heuristics. We have conducted a numerical experiment to test the performance of these heuristic policies, focusing on the impact of cost asymmetry and product demand ratio between two products. The effectiveness of a heuristic k is measured by Ckh = Ckh − C ∗ × 100, C∗ k ∈ {AS, AD, CN, CD, IS} . Figure 4 reports some results. Here, we assume Poisson demand processes with demand rates λ1 = 3 and λ12 = 3 (ratio = λ1 /λ12 = 1), fix h1 = 1, h2 = 1, L = 4, = 1 and vary b1 /(b12 + h2 ) ∈ {0.1, 0.2, . . . , 1, 2, . . . , 10}. From these numerical results, we observe that when cost asymmetry is strong, the performance of both CN and IS are dominated by the other three. This is not surprising, because IS uses IBS instead of CBS, and CN does not use rationing mechanism. For this range of cost parameters, among all the heuristics, AD is most promising, because it takes into account the latest information in both replenishment and allocation decisions. Conversely, when cost asymmetry is not strong, the heuristics CN and CD perform better or reasonably well. We have also tested the robustness of these observations by varying demand rates, e.g., letting λ1 = 1 and λ12 = 5 (ratio = λ1 /λ12 = 0.2) or λ1 = 5 and λ12 = 1 (ratio = λ1 /λ12 = 5). These additional tests confirm the above qualitative findings. For space concerns, we do not report these results here. 8. CONCLUDING REMARKS In this article, we have presented the characterization of the optimal control policy for ATO N- and W-systems with nonidentical leadtimes and general cost structures. Our main contributions can be summarized as follows: • Joint Optimal Inventory Replenishment and Component Allocation Policy. We show that NHB allocation rule together with a CBS replenishment policy are optimal control policies when costs are symmetric. When the cost symmetry is violated, we establish a Naval Research Logistics DOI 10.1002/nav 634 Naval Research Logistics, Vol. 62 (2015) Table 2. Heuristics of allocation and replenishment policies Replenishment Allocation CN IS CD AS AD CBS NHB-PBC IBS rationing r CBS rationing r̃(t) Asymptotic CBS rationing r Asymptotic CBS rationing r̃(t) Figure 4. Comparison of AS, AD, CN, CD, and IS. cost lower bound and show that our NHB + CBS policy continues to be asymptotically optimal in the highdemand regime. Our policy has an explicit characterization which facilitates implementation: The CBS policy updates components’ inventory using real-time information that is closely related to the Rosling’s well-known BBS policy. The explicit characterization of the optimal policy also enables sensitivity analysis, which would not be easy to do otherwise. • Practical Guidance. We suggest an efficient heuristic which uses a CBS replenishment policy and a rationing allocation policy whose rationing level is “dynamically” adjusted by the inventory status. (The rationing policy reduces to a NHB rule when costs are symmetric.) Our numerical study shows that the heuristic outperforms other policies widely adopted in the literature, such as the IBS replenishment policy (e.g., [20]) and rationing allocation rules with constant level (e.g., [5, 10, 11]). We find that the IBS policy can lead to significant optimality loss when the component leadtimes are very different. These findings provide a useful guidance for practitioners in determining control policies of various ATO systems. • Methodology. The characterization of optimal control policies for ATO systems has long been considered a challenging problem. The breakthrough in this research is due to a hybrid approach that combines Naval Research Logistics DOI 10.1002/nav sample-path analysis, a linear programming lower bound, and the tower property of conditional expectations. This is very different from the traditional dynamic programming technique. We hope our approach can inspire other efforts in related studies. APPENDIX: W-SYSTEM In this appendix, we show that our methodology for the N-system is applicable to establish the optimality results for the W-system. Here, I = {1, 2, 3} and K = {{1, 2} , {1, 3}} . Similar to the the analysis in Section 4.1, for any admissible policy π , let • Yi (t, π ): On-hand inventory of product-specific component i not yet assigned to a demand of product-1i, i = 2, 3; • Bi1i (t, π ): Backorders of product-1i due to missing product-specific component i, including backorders missing both components i and 1, i = 2, 3; • B11i (t, π ): Backorders of product- {1i} due to missing only common component-1, either component-1 is missing or it is committed to other products by policy π . We have that for i = 2, 3, Ii (t, π ) = Yi (t, π ) + B11i (t, π ), (49) B 1i (t, π ) = B11i (t, π ) + Bi1i (t, π ), (50) Yi (t, π ) × Bi1i (t, π ) = 0. (51) Lu, Song, and Zhang: Optimal and Asymptotically Optimal Policies Similar to (9), it follows from (49)–(51) that (3) can be written as 635 Similar to the N-system, we pay special attention to the following condition: C(t, π ) = h3 · Y3 (t, π ) + h2 · Y2 (t, π ) + b13 · B313 (t, π ) + b12 · B212 (t, π ) + h1 · I1 (t, π ) + (h3 + b13 ) · B113 (t, π ) + (h2 + b12 ) · B112 (t, π ). (52) Analogous to Theorem 1, we have Moreover, similar to (10)–(11), by (49)–(51), (2) can be expressed as I1 (t, π ) − B112 (t, π ) − B113 (t, π ) − B212 (t, π ) − B313 (t, π ) = N1 (t, π ), (53) Yi (t, π ) − Bi1i (t, π ) = Ni (t, π ), i = 2, 3. PROPOSITION 6: If a sample-path optimal admissible policy π̂ exists, it would yield the following sample-path properties. i. Under Condition 2, there exists an NHB policy (see Remark 1) such that the conditional sample-path lower bound ϕ((N1 (t, π ), N2 (t, π ), N3 (t, π )) given by (58) in Proposition 6 can be achieved. ii. When Condition 2 does not hold, no allocation rule can achieve the conditional sample-path lower bound ϕ((N1 (t, π ), N2 (t, π ), N3 (t, π )) given by (58) in Proposition 6 [see Proposition 6 (i) and (iii)]. Next, we investigate the form of the optimal replenishment policy. Without loss of generality, we assume L2 ≥ L3 . We consider three different scenarios: (CS) Common component has the shortest leadtime: L2 ≥ L3 ≥ L1 ; (CL) Common component has the longest leadtime: L1 > L2 ≥ L3 ; (CM) Common component has the intermediate leadtime: L2 ≥ L1 ≥ L3 . Define G(y1 , y2 , y3 ) = h1 · E(y1 − D1 ) + h2 · E(y2 − D2 ) + h3 · E(y3 − D3 ) i. If b13 + h3 < b12 + h2 , then − B113 (t, π̂ ) = (N1 (t, π̂ ) + [N2 (t, π̂ )]− + [N3 (t, π̂ )]− ) , (55) B112 (t, π̂ ) = 0, C(t, π̂ ) = h1 N1 (t, π̂ ) + h2 N2 (t, π̂ ) + h3 N3 (t, π̂ ) + (h1 + b THEOREM 6: [Sample-Path Lower Bound Attainability (W)] Assume that the component net inventory Ni (t, ·) is given for any time t. (54) Similar to Section 4.2, we use a linear programming approach to obtain a conditional sample-path cost lower bound. In particualar, we formulate a linear-program in which B112 (t, π ), B113 (t, π ), I1 (t, π ), Y2 (t, π ), B212 (t, π ), Y3 (t, π ) and B313 (t, π ) are the decision variables, C(t, π ) given by (52) is the objective function for minimization, and (49)–(51) and (53)–(54) are the constraints, assuming Ni (t, π ) are fixed. Then, the minimum of the objective function, denote by ϕ((N1 (t, π ), N2 (t, π ), N3 (t, π )) is a lower bound of C(t, π ). Solving this linear-program leads to the following results that are similar to Proposition 1: 13 CONDITION 2: The cost parameters in the W-system satisfy b13 + h3 = + h2 = d. b12 + (c + h1 ) · E[(y1 − D1 )− ∨ ((y2 − D2 )− + (y3 − D3 )− )]. (61) Similar to Lemma 2, we have − + h3 )[N3 (t, π̂ )] LEMMA 7: G(y1 , y2 , y3 ) is jointly convex with the unique minimal point (s̄1 , s̄2 , s̄3 ). Furthermore, s̄2 + s̄3 ≥ s̄1 . + (h1 + h2 + b12 )[N2 (t, π̂ )]− + (h1 + b13 + h3 )(N1 (t, π̂ ) + [N2 (t, π̂ )]− + [N3 (t, π̂ )]− )− . (56) s̆1 (y2 , y3 ) = argminy1 G(y1 , y2 , y3 ), ii. If b13 + h3 = b12 + h2 , then + [N3 (t, π̂ )]− )− , (57) + (h1 + b − + h3 )[[N1 (t, π̂ )] ∨ ([N2 (t, π̂ )] + [N3 (t, π̂ )]− )]. iii. If b13 + h3 > b12 (58) + h2 , then B113 (t, π̂ ) = 0, (59) − B112 (t, π̂ ) = (N1 (t, π̂ ) + [N2 (t, π̂ )]− + [N3 (t, π̂ )]− ) , h1 s1∗ = max y : Pr(D2 + D3 ≥ y) ≥ , c + h1 vi (x) = max y : Pr(D2 + D3 ≥ y, Di ≥ y − x) ≥ i = 2, 3, s1 (y2 , y3 ) = max y : Pr(D2 + D3 ≥ y, D3 ≥ y − y2 , h1 D 2 ≥ y − y3 ) ≥ . c + h1 C(t, π̂ ) = h1 N1 (t, π̂ ) + h2 N2 (t, π̂ ) + h3 N3 (t, π̂ ) + (h1 + b13 + h3 )[N3 (t, π̂ )]− + (h1 + b12 + h2 )[N2 (t, π̂ )]− + (h1 + h2 + b12 )(N1 (t, π̂ ) + [N2 (t, π̂ )]− + [N3 (t, π̂ )]− )− . (s̆2 , s̆3 ) = argmin{y2 ,y3 } Ğ(y2 , y3 ), (64) As G(y1 , y2 , y3 ) is jointly convex, s̆1 (y2 , y3 ) and s̆i , i = 2, 3 are unique and well-defined. To characterize the properties of s̆1 (y2 , y3 ), define s1∗ , vi (x), i = 2, 3, and s1 (y2 , y3 ) by C(t, π̂ ) = h1 N1 (t, π̂ ) + h2 N2 (t, π̂ ) + h3 N3 (t, π̂ ) − (62) Ğ(y2 , y3 ) = EG(s̆1 (y2 − D2 , y3 − D3 ), y2 − D2 , y3 − D3 ), (63) B113 (t, π̂ ) + B112 (t, π̂ ) = (N1 (t, π̂ ) + [N2 (t, π̂ )]− 13 We first consider the CS case. Let (60) iv. A necessary condition for an allocation rule π̂ to be optimal is (57). (65) h1 , c + h1 (66) (67) Divide the two dimensional space into the following three regions (see Fig. 5), 1 = (y2 , y3 ) : y3 ≥ s1∗ , y2 ≥ s1∗ , Naval Research Logistics DOI 10.1002/nav 636 Naval Research Logistics, Vol. 62 (2015) REMARK 4: For the identical leadtime case (L1 = L2 = L3 ), from Theorem 7, we have that an optimal admissible policy is given by the basestock policy with stock level (s̄1 , s̄2 , s̄3 ) and an NHB allocation rule. The minimum long-run average inventory cost is given by C ∗ = G(s̄1 , s̄2 , s̄3 ). For the cases CL and CM, we cannot get the optimal long-run average cost, however, through our approach (linear-programming and the tower properties of conditional expectations), a lower bound is established. Basically, it tells us that any replenishment policy that would achieve the lower bound only uses the last demand information over an interval whose length is the gap between leadtimes, that is, max {Li , i = 1, 2, 3} − L. Denote {o(1), o(2), o(3)} as a permutation of 1, 2, 3 such that L = Lo(1) ≤ Lo(2) ≤ Lo(3) . Let Ĉ be the minimum value of the following multistage stochastic programming (SP) problem Ĉ = minE{minE[minE(ϕ(y1 − D1 (Lo(3) ), y2 yo(3) yo(2) yo(1) − D2 (Lo(3) ), y3 − D3 (Lo(3) )) |FLo(3) −Lo(1) )|FLo(3) −Lo(2) ]}. ⎧ ⎪ miny1 E[miny2 E(Ĝ(y1 − D1 (0, L1 − L3 ], ⎪ ⎪ ⎪ ⎨ y − D (L − L , L − L ])|F 2 3 1 2 1 3 L1 −L2 )], CLCase, = ⎪ E[min E( Ĝ(y − D (L − L1 , L2 − L3 ], min ⎪ y2 y1 1 2 2 ⎪ ⎪ ⎩ y − D (0, L − L ])|F )], CM Case, 2 3 2 3 L2 −L1 Figure 5. Optimal replenishment policy for common component s̆1 (y2 , y3 ). 2 = {(y2 , y3 ) : y2 ≤ v2 (y3 ), y3 ≤ v3 (y2 )} , 3 = − 1 − 2 . (69) where Ĝ(x, y) = minz G(x, y, z). Then, s̆1 (y2 , y3 ) can be characterized by ⎧ ⎪ s1∗ , ⎪ ⎪ ⎪ ⎨s (y , y ), 1 2 3 s̆1 (y2 , y3 ) = ⎪v2 (y3 ), ⎪ ⎪ ⎪ ⎩v (y ), 3 2 THEOREM 8 (Lower Bound for CL and CM) Assume Condition 2. The minimum long-run average cost C ∗ under both CL and CM is bounded from below by Ĉ, namely, Ĉ ≤ C ∗ . (y2 , y3 ) ∈ 1 , (y2 , y3 ) ∈ 2 , (y2 , y3 ) ∈ 3 and y2 ≥ y3 , (y2 , y3 ) ∈ 3 and y2 ≤ y3 . (68) Moreover, LEMMA 8: For any given y2 and y3 , s̆1 (y2 , y3 ) defined by (68) satisfies dvi (x)/dx ≤ 1, ∂s1 (y2 , y3 )/∂yi ≤ 1, i = 2, 3, and s̆1 (y2 , y3 ) is continuous and increasing in yi , i = 2, 3. We now study the case of asymmetric cost structure: b12 + h2 = b13 + h3 . Similar to the N-system, we first construct bounding systems for the original system, so that each of the bounding systems has a symmetric cost structure. For simplicity, we use the same notation as in the N-system, with the understanding that we are discussing the W-system here. The bounding systems differ from the original one only in their cost structures. Let S = S(b12 , b13 , h1 , h2 , h3 ) denote the original system. Define two systems S = S(b12 , b13 , h1 , h2 , h3 ) and S = S(b̄12 , b̄13 , h̄1 , h̄2 , h̄3 ), where h1 = h1 , h2 = h2 ∧ h3 , h3 = h2 ∧ h3 , b13 + h3 = b12 + h2 We say a replenishment policy for the W-system is a CBS policy with parameters (s̆1 (·, ·), s̆2 , s̆3 ) if the policy works as follows: at any time t, maintain the inventory positions of component-2 and component-3 at s̆2 and s̆3 respectively, while keeping the inventory position of component-1 at = (b13 + h3 ) ∧ (b12 + h2 ) (= d), h̄1 = h1 , h̄2 = h2 ∨ h3 , h̄3 = h2 ∨ h3 , b̄13 + h̄3 = b̄12 + h̄2 = (b13 + h3 ) ∨ (b12 + h2 ) (= d̄). s̆1 (s̆2 − D2 (t − 2 , t], s̆3 − D3 (t − 3 , t]). We have LEMMA 9: Assume I P1 (0) = s̆1 (s̆2 − D2 , s̆3 − D3 ), I P2 (0) = s̆2 , I P3 (0) = s̆2 , that is, the system starts with zero inventory and backorders. Then, system is coordinated when π̆ is used, that is, for any t ≥ 0, Let the resulting minimum long-run average costs of systems S and S be C ∗ and C̄ ∗ respectively. We have PROPOSITION 7: If b13 + h3 = b12 + h2 , the minimum long-run average cost C ∗ is bounded from below by C ∗ and from above by C̄ ∗ , that is, C ∗ ≤ C ∗ ≤ C̄ ∗ . APPENDIX: PROOFS IP1 (t, π ) = s̆1 (s̆2 − D2 (t), s̆3 − D3 (t)), IP2 (t, π ) = s̆2 , IP3 (t, π ) = s̆3 . THEOREM 7 (Optimal Admissible Policy CS(W)) Assume Condition 2. When the common component-1 has the shortest lead time, an optimal admissible policy constitutes the CBS replenishment policy (s̆1 (·, ·), s̆2 , s̆3 ) and an NHB allocation rule in N . The minimum long-run average cost is given by C ∗ = Ğ(s̆2 , s̆3 ). CBS replenishment policy s̆1 (·, ·) is graphically characterized in Fig. 5. Naval Research Logistics DOI 10.1002/nav PROOF OF PROPOSITION 2: For Ft -adaptive replenish policy {(O1 (t), O2 (t)), t ≥ 0}, we define ⎧ 1 ⎪ ⎨D (t), if D1 (t) + (I2 (0) + O2 (t − L2 )) A (t) = ∧D12 (t) ≤ I1 (0) + O1 (t − L1 ), ⎪ ⎩ 1 1 D (τ (t)) + Ã (t, t), otherwise; 1 (E-1) Lu, Song, and Zhang: Optimal and Asymptotically Optimal Policies ⎧ 12 ⎪ ⎪ ⎪(I2 (0) + O2 (t − L2 )) ∧ D (t), ⎪ ⎨ if D1 (t) + (I (0) + O (t − L )) ∧ D12 (t) 2 2 2 A12 (t) = ⎪ (0) + O (t − L ≤ I ), ⎪ 1 1 1 ⎪ ⎪ ⎩(I (0) + O (τ (t) − L )) ∧ D12 (τ (t)) + Ã12 (t, t), otherwise, 2 2 2 (E-2) 637 2 = {(D 1 , D 12 ) : D 1 ≤ y1 − y2 , D 12 ≥ y2 }, 3 = {(D 1 , D 12 ) : D 1 ≥ y1 − y2 , D 1 + D 12 ≥ y1 }. Noted that when y1 < y2 , 1 , 2 and 3 can be simplified as {(D 1 , D 12 ) : D 1 + D 12 ≤ y1 }, ∅, and {(D 1 , D 12 ) : D 1 + D 12 ≥ y1 }, respectively. First, by (21), we have where − − (y1 − D 1 − D 12 ) ∨ (y2 − D 12 ) ⎧ ⎪ (D 1 , D 12 ) ∈ 1 , ⎨0, 12 = D − y2 , (D 1 , D 12 ) ∈ 2 , ⎪ ⎩ 1 12 D + D − y1 , (D 1 , D 12 ) ∈ 3 . τ (t) = sup{s ≤ t : D (s) + (I2 (0) + O2 (s − L2 )) ∧ D (s) 1 12 ≤ I1 (0) + O1 (s − L1 )} and {(Ã1 (t, s), Ã12 (t, s)), s ∈ [τ (t), t]} has the following properties: Thus, we have P1 : Ã1 (t, s) and Ã12 (t, s) are increasing in s ∈ [τ (t), t], P2 : Ã1 (t, s) + Ã12 (t, s) = O1 (s − L1 ) − O1 (τ (t) − L1 ), P3 : Ã1 (t, s) ≤ D1 (τ (t), s], + P4 : Ã12 (t, s) ≤ D12 (τ (t), s] + (D12 (τ (t)) − I2 (0) − O2 (τ (t) − L2 )) , + ⎧ ⎪h1 − (c + h1 ) · Pr(D 1 + D 12 ≥ y1 , ∂G(y1 , y2 ) ⎨ = D 1 ≥ y1 − y2 ), y1 ≥ y2 , ⎪ ∂y1 ⎩ h1 − (c + h1 ) · Pr(D 1 + D 12 ≥ y1 ), y1 < y2 . Hence, P5 : Ã12 (t, s) ≤ (I2 (0) + O2 (τ (t) − L2 ) − D12 (τ (t))) + O2 (s − L2 ) − O2 (τ (t) − L2 ). Noting that from the definition of τ (t) we know that for any s ∈ (τ (t), t], D1 (s) + (I2 (0) + O2 (s − L2 )) ∧ D12 (s) > I1 (0) + O1 (s − L1 ). This implies that the existence of {(Ã1 (t, s), Ã12 (t, s)), s ∈ [τ (t), t]} with properties (P1)–(P5). Then by (2), ∂G(y1 , y2 ) = h1 − (c + h1 ) · Pr(D 1 + D 12 ≥ y1 , D 1 ≥ y1 − y2 ). ∂y1 (E-4) Similarly, we can show ∂G(y1 , y2 ) = h2 − (c + h1 ) · Pr(D 1 ≤ y1 − y2 , D 12 ≥ y2 ). ∂y2 (E-5) Taking further derivatives, we get that I1 (t) = I1 (0) + O1 (t − L1 ) − A1 (t) − A12 (t), ∞ ∂ 2 G(y1 , y2 ) = (c + h1 ) ψ(y1 − y2 , ξ2 )dξ2 2 ∂y1 y2 y2 + ψ(y1 − ξ2 , ξ2 )dξ2 , I2 (t) = I2 (0) + O2 (t − L2 ) − A12 (t), B 1 (t) = D 1 (t) − A1 (t), B 12 (t) = D 12 (t) − A12 (t). These imply 0 ∂ 2 G(y The Ft -adaptiveness of {(A1 (t), A12 (t)), t ≥ 0} directly follows from its definition (E-1)–(E-2). Hence, the proposition is proved. PROOF OF LEMMA 1: Denote v and u as defined in Section 3.1. Then (20) can be expressed as (E-3) (v2 + v5 ) × min{v3 , v2 + v4 } ≥ min{v2 × v3 , (v2 )2 } > 0. This contradicts (E-3). Hence we have v2 = 0. Note that v3 − v1 − v2 − v5 = − u1 and v5 = u− 2 mean v3 − (v1 + v2 ) = u1 + u2 . This, together with (a), implies (b) and (c). (E-7) (E-8) y2 E-7 implies G(y1 , y2 ) is submodular, proving the second part of (i). Thus, the minimal point exists. It is trivial to see ∂ 2 G(y1 , y2 ) ∂ 2 G(y1 , y2 ) , i = 1, 2. ≤ ∂y1 ∂y2 ∂yi2 + We aim to prove (a) v3 × (v1 + v2 ) = 0; (b) v3 = (u1 + u− 2 ) ; and (c) − v1 + v2 = (u1 + u− 2) . We first prove (a). This can be done by proving that if v3 > 0, then v1 = v2 = 0. From (E-3), we have v1 = 0. If v2 > 0, then (E-6) ∞ 1 , y2 ) = −(c + h1 ) ψ(y1 − y2 , ξ2 )dξ2 , ∂y1 ∂y2 y2 y1 −y2 ∂ 2 G(y1 , y2 ) = (c + h1 ) ψ(ξ1 , y2 )dξ1 ∂y22 0 ∞ + ψ(y1 − y2 , ξ2 )dξ2 . B 1 (t) × I1 (t) = 0, B 12 (t) × min{I1 (t), I2 (t)} = 0. v1 × v3 = 0, (v2 + v5 ) × min{v3 , v2 + v4 } = 0. (E-9) Thus, the Hessian matrix is diagonal dominant, proving (ii). By the contraction theorem, the minimal point (s̄1 , s̄2 ) is unique. Combined with (E-6), and (E-8)–(E-9) we know that the Hessian matrix is positive definite, and hence G is jointly convex, proving the first part of (i). PROOF OF LEMMA 3: First we prove (i). We consider the following three cases. ∗ (see (25)). By definition, s̃ (y ) = s ∗ < y . Thus, by Case 1: y2 > s12 1 2 2 12 (E-4), we have PROOF OF LEMMA 2: Let 1 = {(D 1 , D 12 ) : D 1 + D 12 ≤ y1 , D 12 ≤ y2 }, ∂G(y1 , y2 ) ∂y1 ∗ y1 =s12 ∗ ∗ = h1 − (c + h1 ) · Pr(D 1 + D 12 ≥ s12 , D 1 ≥ s12 − y2 ) Naval Research Logistics DOI 10.1002/nav 638 Naval Research Logistics, Vol. 62 (2015) ∗ = h1 − (c + h1 ) · Pr(D 1 + D 12 ≥ s12 ) By the definition of G̃(·) given by (28) and noting the independence of D2 (t − L) and Ft−L− , by (E-15)–(E-16), taking a further expectation yields = 0. ∗ minimizes G(y , y ) when y > s ∗ . Hence, s̃1 (y2 ) = s12 1 2 2 12 ∗ . By definition, it is not difficult to show s (y ) ≥ y . Case 2: 0 < y2 ≤ s12 1 2 2 Thus, by the first order condition (E-4), s1 (y2 ), which is the same as s̃1 (y2 ) given by (27), minimizes G(y1 , y2 ). Case 3: y2 ≤ 0. By (E-4) again, ∂G(y1 , y2 ) ∂y1 y1 =y2 +s1∗ = h1 − (c + h1 ) · Pr(D 1 + D 12 ≥ y2 + s1∗ , D 1 ≥ s1∗ ) = h1 − (c + h1 ) · Pr(D 1 ≥ s1∗ ) = 0. This implies that for given y2 , s̃1 (y2 ) = y2 + s1∗ (see (25)) minimizes G(y1 , y2 ). ∗ ) = s∗ . The continuity of s̃1 (·) follows directly from s̃1 (0) = s1∗ and s̃1 (s12 12 ∗ ), Using (E-6)–(E-7), we have that for y2 ∈ (0, s12 ds̃1 (y2 ) =− dy2 ! ∂ 2 G(y ∂ 2 G(y 1 , y2 ) 1 , y2 ) ∂y2 ∂y1 ∂y12 y1 =s1 (y2 ) ∞ ψ(s (y ) − y , ξ )dξ 1 2 2 2 2 y2 y2 = ∞ ∈ [0, 1]. y2 ψ(s1 (y2 ) − y2 , ξ2 )dξ2 + 0 ψ(s1 (y2 ) − ξ2 , ξ2 )dξ2 (E-10) ds̃1 (y2 ) = 1, dy2 ds̃1 (y2 ) ∗ If y2 > s12 , then = 0. dy2 (E-11) (E-12) Combining (E-10)–(E-12) yields the derivative of s̃1 (y2 ) belongs to the interval [0, 1]. Finally we prove (ii). Note that for any jointly convex function g(x1 , x2 ), g̃(x2 ) = minx1 g(x1 , x2 ) is convex in x 2 . (See p. 54 of [27]). Therefore, for any realization of D2 , G(s̃1 (y − D2 ), y − D2 ) is convex in y. Because expectations preserve convexity, G̃(·) is convex. PROOF OF LEMMA 4: As component-2 uses a base-stock policy s̃2 , the second equality is trivial. To show that the first equality holds, it suffices to show that for any τ and t with 0 < τ < t, s̃1 (s̃2 − D2 (t − , t]) ≥ s̃1 (s̃2 − D2 (τ − , τ ]) − D2 (τ , t]. This directly follows from Lemma 3, 0 ≤ ds̃1 (y2 )/dy2 ≤ 1. (E-13) For simplicity, for any admissible policy π ∈ A, in the following we write IPi = IPi (t − Li , π ), i ∈ I. (E-14) PROOF OF THEOREM 2: For any π ∈ A, by (22), we have ≥ E[ϕ(IP1 − D1 (t), IP2 − D2 (t))|Ft−L− ] (E-15) As IP1 , IP2 and D2 (t − L) are adapted to Ft−L , by Lemma 3, we obtain E[G(IP1 , IP2 − D2 (t − L))|Ft−L ] ≥ G(s̃1 (IP2 − D2 (t − L)), IP2 − D2 (t − L)). Naval Research Logistics DOI 10.1002/nav (E-17) The last two inequalities in (E-17) will be equalities, if for all t ≥ 0, IP1 = s̃1 (s̃2 − D2 (t − L)), IP2 = s̃2 . By Lemma 4, these two equations hold when the CBS policy (s̃1 (s̃2 − D2 (t)), s̃2 ) is used for component replenishment. By Theorem 1 (i), Proposition 1 (ii), and Proposition 2, if we employ the CBS policy (s̃1 (s̃2 − D2 (t)), s̃2 ) and an NHB allocation rule that just follows it, then the first inequality in (E-17) will be an equality. Hence this policy gives us EC(t, π ) = G̃(s̃2 ). The theorem is thus proved. PROOF OF LEMMA 5: First we prove (i). Noting that G1 (y1 , y2 ) differs from G(y1 , y2 ) only by replacing D1 with D 1 + D 1 , similar to (E-4) and (E-5), we have ∂G1 (y1 , y2 ) = h1 − (c + h1 ) · Pr(D 1 + D 1 + D 12 ≥ y1 , ∂y1 D 1 + D 1 ≥ y1 − y2 ), (E-18) (E-19) The result can be argued in the same way as in Lemma 2, so we omit the details here. Next, we prove (ii) We consider two cases. Case 1: y1 ≤ s2∗ (see (32)). By definition, s̆2 (y1 ) = y1 − s2∗ ≤ 0. This implies that Pr(D 1 + D 1 ≤ y1 − s̆2 (y1 ), D 12 ≥ s̆2 (y1 )) = Pr(D 1 + D 1 ≤ s2∗ , D 12 ≥ 0) = h2 , c + h1 i.e., s̆2 (y1 ) satisfies the first order condition given by (E-19). Hence, s̆2 (y1 ) minimizes G1 (y1 , ·). Case 2: y1 > s2∗ . It can be verified that, in this case, s2 (y1 ) > 0. Thus, by the first order condition given by (E-19), s2 (y1 ), which is same as s̆2 (y1 ) given by (33), minimizes G1 (y1 , ·). The continuity of s̆2 (y1 ) directly follows from s2 (s2∗ ) = 0. Noting that for y1 > s2∗ , ds̆2 (y1 ) ∂ 2 G1 (y1 , y2 ) =− dy1 ∂y2 ∂y1 ∞ ! ∂ 2 G1 (y1 , y2 ) ∂y22 y2 =s2 (y1 ) s2 (y1 ) ψ(y1 − s2 (y1 ), ξ2 )dξ2 = y −s (y ) , ∞ 1 2 1 ψ(ξ1 , s2 (y1 ))dξ1 + s2 (y1 ) ψ(y1 − s2 (y1 ), ξ2 )dξ2 0 we have 0 ≤ ds̆2 (y1 )/dy1 ≤ 1. If y1 < s2∗ , clearly, ds̆2 (y1 )/dy1 = 1. Consequently, the derivative of s̆2 (y1 ) always belongs to the interval [0, 1]. The proof of (iii) is similar to that in Lemma 3 (ii). E[C(t, π )|Ft−L− ] = E{E[G(IP1 , IP2 − D2 (t − L))|Ft−L ]|Ft−L− }. ≥ G̃(IP2 ) ≥ G̃(s̃2 ). ∂G1 (y1 , y2 ) = h2 − (c + h1 ) · Pr(D 1 ≤ y1 − y2 , D 12 ≥ y2 ). ∂y2 It is straightforward to see that If y2 < 0, then E[C(t, π )|Ft−L− ] ≥ E[ϕ(IP1 − D1 (t), IP2 − D2 (t))|Ft−L− ] (E-16) PROOF OF LEMMA 6: The proof is similar to that for Lemma 4. Similar to (E-13), we need to show that for any t and τ with t > τ , s̆2 (s̆1 − D2 (t − , t]) ≥ s̆2 (s̆1 − D2 (τ − , τ ]) − D2 (τ , t]. This follows directly from 0 ≤ ds̆2 (y1 )/dy1 ≤ 1 by Lemma 5. (E-20) Lu, Song, and Zhang: Optimal and Asymptotically Optimal Policies 639 PROOF OF THEOREM 3: The proof is similar to that of Theorem 2. Here, we omit the details and sketch only what is unique to this proof. 12 be the system’s information given by F 12 Let Ft−L t−L− and D (s) for s ∈ (t − L − , t − L]. Then D1 (t − L − , t] and D12 (t − L, t] are 12 . Noting that independent of Ft−L where the equality is given by the fact that both sides equal to h1 /(c + h1 ) by (30). Similarly, by (31) and (E-23), we get D1 (t) = D1 (t − L, t] + D 1 (t − L) + D2 (t − L), From (E-24) we have either z̄1 ≥ s̄1 or z̄1 − z̄2 ≥ s̄1 − s̄2 , and from (E-25) either z̄2 ≥ s̄2 or z̄1 − z̄2 ≤ s̄1 − s̄2 . Thus, by checking all four combinations Pr(D 1 ≤ s̄1 − s̄2 , Z 12 ≥ s̄2 ) ≥ Pr(D 1 ≤ s̄1 − s̄2 , D 2 ≥ s̄2 ) = Pr(D 1 ≤ z̄1 − z̄2 , Z 12 ≥ z̄2 ). 12 , we have that for π ∈ A, and IP1 and D2 (t − L) are adapted to Ft−L 12 E[ϕ(IP1 − D1 (t), IP2 − D2 (t))|Ft−L ] = E[ϕ(IP1 − D2 (t − L) − D 1 (t 12 IP2 − D2 (t))|Ft−L ] 12 (by Ft−L ⊆ 1 • • • • − L) − D1 (t − L, t], Ft−L ) 12 ] = E[G(IP1 − D2 (t − L) − D (t − L), IP2 )|Ft−L = G1 (IP1 − D2 (t − L), IP2 ) ≥ G1 (IP1 − D2 (t − L), s̆2 (IP1 − D2 (t − L))). By the definition of Ğ(·) given by (35) and noting that D2 (t − L) has the same distribution as D2 , taking further expectation yields 12 ]|Ft−L− } ≥ Ğ(IP1 ) ≥ Ğ(s̆1 ), E{E[ϕ(IP1 − D1 (t), IP2 − D2 (t))|Ft−L where the last inequality follows from (36) and Lemma 5 (ii). Hence, we have z̄1 ≥ s̄1 z̄1 ≥ s̄1 z̄1 − z̄2 z̄1 − z̄2 (E-25) and z̄2 ≥ s̄2 ; and z̄1 − z̄2 ≤ s̄1 − s̄2 ; ≥ s̄1 − s̄2 and z̄2 ≥ s̄2 ; ≥ s̄1 − s̄2 and z̄1 − z̄2 ≤ s̄1 − s̄2 , and using (E-25) for the last combination yields that z̄1 ≥ s̄1 and z̄2 ≥ s̄2 . Under CL, it is not difficult to show that s̆2 (y1 ) ≤ z̆2 (y1 ) for any given y1 from (33) and (32), because Pr(D 1 + D 1 ≤ y1 − y2 , Z 12 ≥ y2 ) ≥ Pr(D 1 + D 1 ≤ y1 − y2 , D 12 ≥ y2 ) and both sides are decreasing in y2 . To show s̆1 ≤ z̆1 , we point out that from the equations (35) and (36) (see (E-18)), s̆1 is the solution of Pr(D 1 ≥ y1 − D 1 − D 12 − s̆2 (y1 − D 12 ), D 1 + D 12 ≥ y1 − D1 ) = h1 . c + h1 (E-26) From Lemma 5, y1 − s̆2 (y1 ) is increasing with y1 . Thus, Pr(y1 − Z 12 − s̆2 (y1 − Z 12 ) > x) EC(t, π ) ≥ E[ϕ(IP1 − D1 (t), IP2 − D2 (t))] ≤ Pr(y1 − D 12 − s̆2 (y1 − D 12 ) > x). 12 = E{E[E(ϕ(IP1 − D1 (t), IP2 − D2 (t))|Ft−L )|Ft−L− ]} ≥ Ğ(s̆1 ). (E-21) Because s̆2 (y1 ) ≤ z̆2 (y1 ) for any y1 , we have Pr(y1 − Z 12 − z̆2 (y1 − Z 12 ) > x) The last inequality will be an equality if for all t ≥ 0, ≤ Pr(y1 − D 12 − s̆2 (y1 − D 12 ) > x). IP1 = s̆1 , IP2 = s̆2 (s̆1 − D2 (t − L)). Therefore, By Theorem 1 (i), Proposition 1 (ii), and Proposition 2, if we use the component replenishment policy (s̆1 , s̆2 (s̆1 − D2 (t))) and an NHB allocation rule that just follows it, the first inequality in (E-21) will be an equality. The theorem thus follows from Lemma 6 immediately. PROOF OF PROPOSITION 3: (i) We consider two N-systems—the original system and a new system. We use Z(t) for the new demand process, Z for the counterpart of D, and z for the counterpart of s. Both systems are identical except that in the new system the product-12 demand process {Z 12 (t), t ≥ 0} is stochastically larger than {D12 (t), t ≥ 0}. First, we have that for any x, Pr(D1 (0, ] + D12 (0, ] > x) ≤ Pr(D1 (0, ] + Z 12 (0, ] > x), (E-22) Pr(D (0, ] > x) ≤ Pr(Z (0, ] > x), Pr(D 12 12 12 > x) ≤ Pr(Z 12 > x). (E-23) Under IL, from (30) and (E-23), by the independence of D1 and D12 , and the independence of D1 and Z 12 , we have Pr(D 1 ≥ s̄1 − s̄2 , D 1 + Z 12 ≥ s̄1 ) ≥ Pr(D 1 ≥ s̄1 − s̄2 , D 1 + D 12 ≥ s̄1 ) = Pr(D 1 ≥ z̄1 − z̄2 , D 1 + Z 12 ≥ z̄1 ), (E-24) Pr(D 1 ≥ s̆1 − D 1 − Z 12 − z̆2 (s̆1 − Z 12 ), D 1 + Z 12 ≥ s̆1 − Z1 ) ≥ Pr(D 1 ≥ s̆1 − D1 − s̆2 (s̆1 − D 12 ), D 1 + D 12 ≥ s̆1 − D1 ) = h1 . c + h1 Thus, s̆1 ≤ z̆1 as the left-hand side of (E-26) is decreasing in y1 . Finally, the case for SL can be proved in a similar way. (ii) The proof is similar to the above case with some slight differences. We consider two N-systems, namely, the original one and the new one. We still use Z(t) for the new demand process, Z for the counterpart of D, and z for the counterpart of s. The two systems are identical except that the product-1 demand process in the new system is changed to {Z 1 (t), t ≥ 0}. The process {Z 1 (t), t ≥ 0} is stochastically larger than {D1 (t), t ≥ 0}. First, we have that for any x, Pr(D1 (0, ] + D12 (0, ] > x) ≤ Pr(Z 1 (0, ] + D12 (0, ] > x), (E-27) Pr(D 1 > x) ≤ Pr(Z 1 > x). (E-28) Under IL, adding (30) and (31), we have Pr(D 1 + D 12 ≤ s̄1 , D 12 ≤ s̄2 ) = 1 − h1 + h 2 . c + h1 (E-29) Naval Research Logistics DOI 10.1002/nav 640 Naval Research Logistics, Vol. 62 (2015) ≥ Pr(D 1 + D 12 ≤ s̆1 − Z1 , D 12 ≤ z̆2 (s̆1 − Z 12 )). Similar to (E-24)–(E-25), by (30) and (E-29), we have Pr(Z 1 ≥ s̄1 − s̄2 , Z 1 + D 12 ≥ s̄1 ) ≥ Pr(Z 1 ≥ z̄1 − z̄2 , Z 1 + D 12 ≥ z̄1 ), (E-30) Pr(Z 1 + D 12 ≤ s̄1 , D 12 ≤ s̄2 ) ≤ Pr(Z 1 + D 12 ≤ z̄1 , D 12 ≤ z̄2 ). (E-31) From (E-30), we have either z̄1 ≥ s̄1 or z̄1 − z̄2 ≥ s̄1 − s̄2 , and from (E-31), either z̄1 ≥ s̄1 or z̄2 ≥ s̄2 . Assuming z̄1 < s̄1 , then it must be z̄1 − z̄2 ≥ s̄1 − s̄2 and z̄2 ≥ s̄2 ; This is contradictory, as under this assumption, we have 0 ≤ z̄2 − s̄2 ≤ z̄1 − s̄1 < 0. Thus, we have s̄1 ≤ z̄1 . Similar to (i), one can easily show that under CL, s̆2 (y1 ) ≥ z̆2 (y1 ) for any y1 , and under SL, s̃1 (y2 ) ≤ z̃1 (y2 ) for any y2 . To show under CL, s̆1 ≤ z̆1 , we note that (E-26) is equivalent to Pr(D 1 + D 12 ≤ y1 − D1 , D 12 ≤ s̆2 (y1 − D 12 )) = 1 − h1 + h 2 . c + h1 (E-32) This is because for any realization of D 12 = δ, from (32), we have Pr(D 12 ≥ s̆2 (y1 − δ), D 1 + D 1 ≤ y1 − δ − s̆2 (y1 − δ)) = h2 . c + h1 Thus, Pr(D 12 ≥ s̆2 (y1 − D1 ), D + D ≤ y1 − D 1 1 12 h2 − s̆2 (y1 − D )) = . c + h1 12 (E-33) By (E-32), we know that h1 + h 2 , c + h1 h1 + h 2 ≤ z̆2 (z̆1 − Z 12 )) = 1 − . c + h1 Pr(D 1 + D 12 ≤ s̆1 − D1 , D 12 ≤ s̆2 (s̆1 − D 12 )) = 1 − Pr(D 1 + D 12 ≤ z̆1 − Z1 , D 12 Hence, s̆1 ≤ z̆1 directly follows from (E-33) and the monotonicity of s̆2 (y1 ). Similarly, we can prove SL case. (ii) We consider two N-systems, namely, the original one and the new one. The two systems are identical except that the shorter leadtime in the new system is changed to L z from L with L < Lz . Then Pr(D 1 ≥ x) ≤ Pr(Z 1 ≥ x) and Pr(D 12 ≥ x) ≤ Pr(Z 12 ≥ x). The remaining part of the proof is similar to the proof of Proposition 3 (ii). The details are therefore omitted. PROOF OF PROPOSITION 5: First, we establish the lower bound. From Proposition 1 and the definition of φ(u1 , u2 ) in (P3), letting c = b1 ∧ (b12 + h2 ), we have − ϕ(u1 , u2 ) ≥ h1 u1 + h2 u2 + (h1 + c)(u− 1 ∨ u2 ). From Theorem 1, therefore, we have, This, together with (E-26), yields the above equation (E-32). Because Pr(z̆2 (y1 − D 12 ) ≥ x) ≤ Pr(s̆2 (y1 − D 12 ) ≥ x), C(t, π ) ≥ ϕ(IP1 − D1 (t), IP2 − D2 (t)) ≥ h1 · (IP1 − D1 (t)) + h2 · (IP2 − D2 (t)) + (h1 + b1 ∧ (h2 + b12 )) we have · ((IP1 − D1 (t))− ∨ (IP2 − D2 (t))− ) Pr(Z 1 + D 12 ≤ s̆1 − Z1 , D 12 ≤ z̆2 (s̆1 − D 12 )) ≤ Pr(D 1 + D 12 ≤ s̆1 − D1 , D 12 ≤ s̆2 (s̆1 − D 12 )) =1− h1 + h 2 c + h1 = Pr(Z 1 + D 12 ≤ z̆1 − Z1 , D 12 ≤ z̆2 (z̆1 − D 12 )), where the last two equations follow from (E-32). Because the left-hand side of (E-32) is increasing in y1 , we obtain s̆1 ≤ z̆1 . PROOF OF PROPOSITION 4: (i) We consider two N-systems, namely, the original one and the new one. We use Z(t) for the new demand process, Z for the counterpart of D, and z for the counterpart of s. The two systems are identical except that the leadtime difference in the new system is changed to z from with < z . From (27), it is trivial to see that s̃1 (·) has nothing to do with , and s̆2 (y) ≥ z̆2 (y). Thus, we need to prove that under CL, s̆1 ≤ z̆1 , and under SL, s̃2 ≤ z̃2 . First, we consider the CL case. As < z , then Pr(D + D ≤ s̆1 − D1 , D ≥ Pr(D + D 1 12 12 ≤ s̆1 − Z1 , D ≤ s̆2 (s̆1 − Z )) Naval Research Logistics DOI 10.1002/nav − By (a + b− ) + b− = a − ∨ b− with any real numbers a and b, C(t, π ) = h1 · (IP1 − D1 (t)) + h2 · (IP2 − D2 (t)) + (h1 + h2 + b12 ) − · (IP2 − D2 (t))− + h1 · ((IP1 − D1 (t)) + (IP2 − D2 (t))− ) − · (IP2 − D2 (t))− + h1 · ((IP1 − D1 (t)) + (IP2 − D2 (t))− ) + c̄ · (B112 (t, π ) + B 1 (t, π )) = h1 · (IP1 − D1 (t)) + h2 · (IP2 − D2 (t)) + (h1 + h2 + b12 ) · (IP2 − D2 (t))− + (h1 + c̄) · ((IP1 − D1 (t)) ≤ s̆2 (s̆1 − D )) 12 12 − B112 (t, π ) + B 1 (t, π ) = ((IP1 − D1 (t)) + (IP2 − D2 (t))− ) . ≤ h1 · (IP1 − D1 (t)) + h2 · (IP2 − D2 (t)) + (h1 + h2 + b12 ) Recall that s̆2 (y1 ) is increasing with y1 . Noting that (D 1 , D 12 ) and (Z 1 , Z 12 ) have the same distribution, (D 1 , D 12 ) and D1 are independent, and (Z 1 , Z 12 ) and Z1 are independent, we have that 12 for any feasible π ∈ A. We know that the last expression above is just the inventory cost at time t of another system which is identical with the original one except that both the common component backorder cost and the sum of the product-12 backorder cost and the component-2 holding cost are changed into b1 ∧ (h2 + b12 ). Hence, the lower bound is obtained. Next, we move to the upper bound. Here, we consider the CL case; the others can be treated similarly. Let c̄ = b1 ∨ (b12 + h2 ) in (21), and we have h2 + b12 ≤ c̄. For any π ∈ A, + (h2 + b12 ) · B112 (t, π ) + b1 · B 1 (t, π ) Pr(D 12 ≥ x) ≤ Pr(Z 12 ≥ x). 1 (E-34) 12 + (IP2 − D2 (t))− )− Lu, Song, and Zhang: Optimal and Asymptotically Optimal Policies ≤ h1 · (IP1 − D1 (t)) + h2 · (IP2 − D2 (t)) X2n (t − L2 ) = I2n (0) + O2n (t − L2 ) − D12,n (t − L2 ) − λ12 nL2 . (E-44) + (h1 + c̄) · ((IP1 − D1 (t))− ∨ (IP2 − D2 (t))− ). Then, similar to the proof for the lower bound, we get C∗ ≤ (E-35) For any fixed t, we know that C̄ ∗ . PROOF OF THEOREM 4: Going along the same lines of Plambeck and Ward (2006) (see Proposition 4.2 on Page 462), for any π = {(Oin (t), AK,n (t)), t ≥ 0} ∈ An and T > 0, in view of (38)–(41), ⎛ ⎞ T 2 1 n K K,n ⎝ hi · Ii (t, π ) + b · B (t, π )⎠ dt lim sup √ E n n→∞ 0 i=1 K=1,12 ⎛ ⎞ T 2 1 n K K,n ⎝ hi · Ii (t, π ) + b · B (t, π )⎠ dt, ≥ lim sup √ E n n→∞ 0 ((D1,n (t − L, t] − λ1 nL), (D12,n (t − L, t] − λ12 nL)) and (X1n (t − L), X2n (t − L2 ) − (D12,n (t − L − , t − L] − λ12 n)) are independent. As {(D1,n (t), D12,n (t)), t ≥ 0} is stationary and has inde2+ pendent increments, and E(DK,n ( n1 )) < ∞, we have that for any t ≥ L, K=1,12 i=1 (E-36) where π is defined in (42). To get the analytic expression of Iin (t, π ) and B K,n (t, π ), let B̂K,n ( l n ) be the solution of (38) only with constraints (39)–(40). Then we have ⎛ ⎞ T 2 1 n K K,n ⎝ lim sup √ E hi · Ii (t, π ) + b · B (t, π )⎠ dt n n→∞ 0 i=1 K=1,12 ⎛ ⎞ T 2 1 n K K,n ⎝ hi · Iˆi (t) + b · B̂ (t)⎠ dt, ≥ lim sup √ E n n→∞ 0 i=1 K=1,12 converges in distribution to a normal variable 12 (t − L) with zero mean 2 and variance ·(σ 12 ) . By the Skorohod embedding theorem, we can assume that with probability one, n n Iˆ1 (t) = N̂1n (t) + B̂1,n (t) + B̂12,n (t), Iˆ2 (t) = N̂2n (t) + B̂12,n (t), N̂1n (t) = I1n (0) + O1n (t − L1 ) − D1n (t), If ≤ + h2 , it directly follows from Proposition 1 (i) that the solution of (38) with constraints (39)–(40) is given by + − + n n Iˆ1 (t) = (N̂1n (t) + (N̂2n (t)) ) , Iˆ2 (t) = (N̂2n (t)) , B̂1,n (t) 2 = (N̂1n (t) + (N̂2n (t)) n hi · Iˆi (t) + i=1 ) , B̂12,n (t) = (E-38) − (N̂2n (t)) , (E-39) bK · B̂K,n (t) = h1 · N̂1n (t) + h2 · N̂2n (t) + (h1 + h2 + b12 ) · (N̂2n (t)) − − + (h1 + b1 ) · (N̂1n (t) + (N̂2n (t)) ) . (E-40) First, we consider the case L1 ≤ L2 . In view of the proof of Theorem 2, we write = I1n (0) + O1n (t 1 12 − L) − D − (λ + λ )nL − (D − (D 12,n (t) − D 12,n 1,n 1,n (E-45) (t − L) − D (t) − D 1,n 12,n < converge to (1 (t), 12 (t), 12 (t − L)). In view of E(DK,n ( n1 )) ∞, K = 1, 12 for = 3, thus the random variable (vector) given in (E-45) also converges to (1 (t), 12 (t), 12 (t − L)) in the sense of L1 . Define X1∞ (t − L) = liminf n→∞ X1n (t − L) X n (t − L2 ) , X2∞ (t − L2 ) = liminf 2 √ . √ n→∞ n n (E-46) Thus, by (E-40), in view of L1 -convergence of the random vector given by (E-45), K=1,12 − N̂1n (t) D1,n (t − L, t] − λ1 nL D12,n (t − L, t] − λ12 nL , , √ √ n n D12,n (t − L − , t − L] − λ12 n √ n 2+ b12 − − D12,n (t − L − , t − L] − λ12 n √ n where N̂2n (t) = I2n (0) + O2n (t − L2 ) − D2n (t). D1,n (t − L, t] − λ1 nL D12,n (t − L, t] − λ12 nL , √ √ n n converges in distribution to a bivariate normal variable (1 (t), 12 (t)) with zero mean and covariance matrix defined in (43), and (E-37) b1 641 ⎞ ⎞⎞ ⎛ ⎛ ⎛ 2 K,n n (t) Iˆi K B̂ (t) n ⎠ n ⎝ ⎝ ⎝ Ft−L Ft−L−⎠⎠ lim sup E E E hi · √ + b · √ n n n→∞ i=1 ≥ − (t (t − L) (t − L) − λ nL) K=1,12 E(E(E(F (X1∞ (t 12 − L), X2∞ (t − L2 ) ∞ ∞ − L))|Ft−L )|Ft−L− )). (E-47) 1 (t − L) − λ nL), 12 Similar to (E-16), by (44), we have (E-41) N̂2n (t) = I2n (0) + O2n (t − L2 ) − D12,n (t − L2 ) − λ12 nL2 ≥ F (θ1 (X2∞ (t − L2 ) − 12 (t − L)), X2∞ (t − L2 ) − 12 (t − L)). (E-48) − (D12,n (t) − D12,n (t − L) − λ12 nL) − (D12,n (t − L) − D12,n (t − L2 ) − λ12 n(L2 − L)). ∞ E(F (X1∞ (t − L), X2∞ (t − L2 ) − 12 (t − L))|Ft−L ) (E-42) Similar to (E-17), by (44)–(45) and (E-48), we have Let E(E(F (X1∞ (t − L), X2∞ (t − L2 ) X1n (t − L) = I1n (0) + O1n (t − L) − D1,n (t − L) − D12,n (t − L) − (λ1 + λ12 )nL, (E-43) ∞ ∞ − 12 (t − L))|Ft−L )|Ft−L− ) ≥ F̃ (θ̃2 ). (E-49) Naval Research Logistics DOI 10.1002/nav 642 Naval Research Logistics, Vol. 62 (2015) Therefore, in view of (E-37)–(E-40) and (E-47)–(E-49), we know that − (D1,n (t − L) − D1,n (t − L1 ) − λ1 n) ⎞ ⎛ T 2 1 hi · Iin (t, π ) + bK · B K,n (t, π )⎠dt ≥ F̃ (θ̃2 ). lim sup √ E ⎝ n 0 n→∞ − (D12,n (t − L) − D12,n (t − L1 ) − λ12 n) i=1 N̂2n (t) = I2n (0) + O2n (t − L) − D12,n (t − L) − λ12 nL K=1,12 − (D12,n (t) − D12,n (t − L) − λ12 nL). (E-50) Similar to (E-43)–(E-44), let By again by (44)–(45), the inequality of (E-49) would be equality if X1n (t − L1 ) = I1n (0) + O1n (t − L1 ) − D1,n (t − L1 ) X1∞ (t − L) = θ1 (θ̃2 − 12 (t − L)), X2∞ (t − L2 ) = θ̃2 . − D12,n (t − L1 ) − (λ1 + λ12 )nL1 , Hence, by the definition of X1∞ (t − L) and X2∞ (t − L2 ) given by (E-46), we know that if √ D12,n (t − L − , t − L] n · θ1 θ̃2 − √ n √ X2n (t − L2 ) = nθ̃2 , X1n (t − L) = For any fixed t, we also know that and This implies that (E-47) becomes equality with (X1n (t−L), X2n (t−L2 )) given by (E-51). Consequently, the inequality of (E-50) would become equality if (X1n (t − L), X2n (t − L2 )) is chosen according to (E-51). It follows from (E-43)–(E-44) that the inequality of (E-50) would become equality if O1n (t − L1 ) = D1,n (t − L1 ) + D12,n (t − L1 ) + (λ1 + λ12 )nL1 √ D12,n (t − L1 − , t − L1 ] − λ12 n , + n · θ1 θ̃2 − √ n (E-52) √ O1n (t − L2 ) = D12,n (t − L2 ) + λ12 nL2 + nθ̃2 . (E-53) Therefore, to finish the proof of (i), in view of (E-36)–(E-37), it suffices to prove that for the replenishment process given by (E-52)–(E-53), the solution of (38) only with constraints (39)–(40) should be asymptotic same as BK,n ( l n ) (see below (E-54)). That is, in high-volume, the constraint (41) 2+ does not matter. By E[DK,n ( n1 )] < ∞, K ∈ {1, 12} for = 3, we know that, by Ata and Kumar (2006), for any finite constant α > 0, there exists a constant β > 0 such that " # max max DK,n (( + 1)l n ) − DK,n ( l n ) − λK nl n < αn1/3 =0,···l n K=1,12 ≥ 1 − βn−1/6 . n (t) and B̂K,n (t) given by (E-38)–(E-39) are the Lipschitz conNote that Iˆi tinuous in (N̂1n (t), N̂2n (t)). Hence, following the proof of Proposition 4.1 in Plambeck and Ward (2006), we can show that there exists a constant β > 0 such that $ % Pr(B̂K,n ( l n ) = BK,n ( l n ), = 1, · · · , 1/l n ) ≥ 1 − βn−1/6 . (E-54) With (E-54) in hand, we obtain that the replenishment policy defined by (E-52)–(E-53), and the discrete-review allocation rule defined by (38)–(41) are asymptotically optimal in An . Next, consider (ii), L1 > L2 . We write (E-41)–(E-42) as N̂1n (t) = I1n (0) + O1n (t − L1 ) − D1,n (t − L1 ) − D12,n (t − L1 ) − (λ1 + λ12 )nL1 − (D1,n (t) − D1,n (t − L) − λ1 nL) − (D12,n (t) − D12,n (t − L) − λ12 nL) Naval Research Logistics DOI 10.1002/nav ((D1,n (t − L, t] − λ1 nL), (D12,n (t − L, t] − λ12 nL)) (E-51) then n X1 (t − L) X2n (t − L2 ) , converges to (θ1 (θ̃2 − 12 (t − L)), θ̃2 ) in L1 . √ √ n n Pr X2n (t − L2 ) = I2n (0) + O2n (t − L) − D12,n (t − L) − λ12 nL. and (X1n (t − L1 ) − (D1,n (t − L1 , t − L] − λ1 n) − (D12,n (t − L1 , t − L] − λ12 n), X2n (t − L)) are independent. Then as in the proof for L1 ≤ L2 , we can prove (ii). PROOF OF THEOREM 5: In the proof of Theorem 4, we use Proposition 1 (i) to get the solution of (38) with constraints (39)–(40) (see (E-38)– (E-40)). Similarly we first use Proposition 1 (iii) to get the solution of (38) with constraints (39)–(40) for the case b1 ≥ h2 +b12 , then follow the exactly same procedure to get the theorem. PROOF OF LEMMA 7: First note that − − a4 u1 + a5 u2 + a3 u3 + (a1 + a3 + a4 )(u− 3 ∨ [u1 + u2 ]) − − is jointly convex over (u1 , u2 , u3 ) because u− 1 + u2 and u3 are both jointly − − − convex over (u1 , u2 , u3 ), and thus, (u1 + u2 ) ∨ u3 is also jointly convex over (u1 , u2 , u3 ). Consequently, G(y1 , y2 , y3 ) defined by h1 · E(y1 − D1 ) + h2 · E(y2 − D2 ) + h3 · E(y3 − D3 ) + (c + h1 ) · E[[y1 − D1 ]− ∨ ([y2 − D2 ]− + [y3 − D3 ]− )] is also jointly convex over (y1 , y2 , y3 ). To show that the optimal point (s̄1 , s̄2 , s̄3 ) satisfies s̄2 + s̄3 ≥ s̄1 . Assume, otherwise, s̄2 + s̄3 < s̄1 is optimal. For any demand realization, we have s̄1 − D2 − D3 > s̄2 + s̄3 − D2 − D3 , thus, (s̄1 − D2 − D3 )− ≤ (s̄2 + s̄3 − D2 − D3 )− ≤ (s̄3 − D3 )− + (s̄2 − D2 )− . Consequently, G(s̄1 , s̄2 , s̄3 ) = h1 s̄1 + h2 s̄2 + h3 s̄3 − 2ED2 − 2ED3 + (c + h1 )E[(s̄2 − D2 )− + (s̄3 − D3 )− ]. This means that we can get a lower value by decreasing s̄1 to s̄2 + s̄3 , which contradicts the optimality of (s̄1 , s̄2 , s̄3 ). PROOF OF LEMMA 8: As G(y1 , y2 , y3 ) is convex in y1 for any (y2 , y3 ), the first order condition is guaranteed to find the minimal point. Taking derivatives, we get ∂G(y1 , y2 , y3 ) = h1 − (c + h1 ) · Pr(D2 + D3 ≥ y1 , ∂y1 D3 ≥ y1 − y2 , D2 ≥ y1 − y3 ). (E-55) Lu, Song, and Zhang: Optimal and Asymptotically Optimal Policies We need to consider the following three cases. Case I: (y2 , y3 ) ∈ 1 . In this case, y2 , y3 ≥ s1∗ , thus, we have ∂G(y1 , y2 , y3 ) ∂y1 y1 =s1∗ = h1 − (c + h1 ) · Pr(D2 + D3 ≥ s1∗ ) = 0, where the first equality follows from the nonnegativity of the demand, and the last equality follows from definition of s1∗ in (65). Thus, s̆1 (y2 , y3 ) = s1∗ . In this region, it is trivial that s̆1 (y2 , y3 ) is increasing and continuous. Case II: (y2 , y3 ) ∈ 2 . 2 is partitioned into two regions: (R1 ) y3 ≤ y2 ≤ v2 (y3 ), y3 < s1∗ and (R2 ) y2 ≤ y3 ≤ v3 (y2 ), y2 < s1∗ . We only consider region (R1 ), the region (R2 ) can be proved similarly. Because y2 ≤ v2 (y3 ) (defined in (66)), we have ∂G(y1 , y2 , y3 ) ∂y1 y1 =y2 = Pr(D2 + D3 ≥ y2 , D2 ≥ y2 − y3 ) ≥ h1 , c + h1 where the inequality is because the left-hand-side is decreasing in y2 and y2 ≤ v2 (y3 ). By the definition of s1 (y2 , y3 ) in (67), we have s1 (y2 , y3 ) ≥ y2 . Thus, by (E-55), s̆1 (y2 , y3 ) = s1 (y2 , y3 ) minimizes G(·, y2 , y3 ) for any y2 and y3 . To prove the other parts of the lemma, note that Pr(D2 + D3 ≥ y1 , D3 ≥ y1 − y2 , D2 ≥ y1 − y3 ) ⎧ ⎪ ⎨Pr(D3 ≥ y3 , D2 ≥ y1 − y3 ) + Pr(D2 + D3 ≥ y1 , = y1 − y2 ≤ D3 < y3 ), if y1 ≤ y2 + y3 , ⎪ ⎩ Pr(D2 ≥ y1 − y3 , D3 ≥ y1 − y2 ), if y1 < y2 + y3 . 643 PROOF OF LEMMA 9: As under the coordinated base-stock policy (s̆1 (·, ·), s̆2 , s̆3 ), the component-2 and component-3 use a base-stock policy with levels s̆2 and s̆3 , respectively, the first two equalities of (4) are trivial. We only need to show that the third one holds. To show this, we first show that for any τ and t with 0 ≤ τ < t, s̆1 (s̆2 − D2 (t − 2 , t], s̆3 − D3 (t − 3 , t]) ≥ s̆1 (s̆2 − D2 (τ − 2 , t], s̆3 − D3 (τ − 3 , t]) ≥ s̆1 (s̆2 − D2 (τ − 2 , τ ], s̆3 − D3 (τ − 3 , t]) − D3 (τ , t] ≥ s̆1 (s̆2 − D2 (τ − 2 , τ ], s̆3 − D3 (τ − 3 , τ ]) − D2 (τ , t] − D3 (τ , t], where the first inequality is because ∂ s̆1 (y2 , y3 )/∂yi ≥ 0, i = 2, 3 by Lemma 8, while the second and third inequalities are because ∂ s̆1 (y2 , y3 )/∂yi ≤ 1, i = 2, 3 by Lemma 8 and the fact that Di (τ − i , t] = Di (τ − i , τ ] + Di (τ , t]. Thus, under the above CBS policy, for any τ ≥ 0 such that IP1 (τ ) = s̆1 (s̆2 − D2 (τ ), s̆3 − D3 (τ )), for any t > τ such that no order for component-1 is placed during (τ , t), we have IP− 1 (t) = IP1 (τ ) − D1 (τ , t] = s̆1 (s̆2 − D2 (τ − 2 , τ ], s̆3 − D3 (τ − 3 , τ ]) − D2 (τ , t] − D3 (τ , t] ≤ s̆1 (s̆2 − D2 (t − 2 , t], s̆3 − D3 (t − 3 , t]). As a result, IP1 (t) = s̆1 (s̆2 − D2 (t), s̆3 − D3 (t)) is achievable. Thus, taking a further derivatives in (E-55), we get that if y1 ≤ y2 + y3 , ∞ ∂ 2 G(y1 , y2 , y3 ) = (c + h ) ψ(ξ1 , y1 − y3 )dξ1 1 ∂y12 y3 ∞ y3 + ψ(y1 − y2 , ξ2 )dξ2 + ψ(ξ1 , y1 − ξ1 )dξ1 , y2 ∂ 2 G(y1 , y2 , y3 ) = −(c + h1 ) ∂y1 ∂y3 ∂ 2 G(y1 , y2 , y3 ) = −(c + h1 ) ∂y1 ∂y2 y1 −y2 ∞ ∞ As IPi − Di (t − L) and IP1 are adapted to Ft−L , by equations (62), we have ψ(y1 − y2 , ξ2 )dξ2 . y2 E[G(IP1 , IP2 − D2 (t − L), IP3 − D3 (t − L))|Ft−L ] By the Implicit Function Theorem, we have that if y1 ≤ y2 + y3 , ∂s1 (y2 , y3 ) =− ∂yi ∂ 2 G(y 1 , y2 , y3 ) ∂y1 ∂yi ! ∂ 2 G(y 1 , y2 , y3 ) ∂y12 ∈ [0, 1], i = 2, 3. Along the same lines, we can show that if y1 > y2 +y3 , the above relationship still holds. Case III: (y2 , y3 ) ∈ 3 . We only prove the case for y2 > v2 (y3 ) and y3 < s1∗ (see Figure 5), the other case y3 > v3 (y2 ) and y2 < s1∗ can be similarly proved. It is not difficult to see that v2 (y3 ) ≥ y3 when y3 < s1∗ . Thus, by (E-55) and (66), we obtain ∂G(y1 , y2 , y3 ) ∂y1 y1 =v2 (y3 ) E[C(t, π )|Ft−L2 ] = E{E[G(IP1 , IP2 − D2 (t − L), IP3 − D3 (t − L))|Ft−L ]|Ft−L2 }. ψ(ξ1 , y1 − y3 )dξ1 , y3 PROOF OF THEOREM 7: In view of Theorem 6, we can assume that an NHB policy π ∈ N is used. By the tower properties of conditional expectations, we have = h1 − (c + h1 ) · Pr(D2 + D3 ≥ v2 (y3 ), D2 ≥ v2 (y3 ) − y3 ) = 0. Thus, s̆1 (y2 , y3 ) = v2 (y3 ) minimizes G(·, y2 , y3 ). Applying a similar methodology as in Case II, v2 (y3 ) can be shown to be increasing and 0 ≤ dv2 (y3 )/dy3 ≤ 1. The details are omitted. The global continuity of s̆1 (y2 , y3 ) can be easily verified by checking the boundary of each region. ≥ G(s̆1 (IP2 − D2 (t − L), IP3 − D3 (t − L)), IP2 − D2 (t − L), IP3 − D3 (t − L)). Taking a further expectation, by the fact that IPi is adapted to Ft−L and Di (t − L) is independent to Ft−L2 , together with equations (63) and (64), we have E{E[G(IP1 , IP2 − D2 (t − L), IP3 − D3 (t − L))|Ft−L ]|Ft−L2 } ≥ Ğ(IP2 , IP3 ) ≥ Ğ(s̆2 , s̆3 ). As a result, EC(t, π ) ≥ Ğ(s̆2 , s̆3 ). The inequality will be equality if and only if for any t ≥ 0, IP1 (t − L, π ) = s̆1 (s̆2 − D2 (t − L), s̆3 − D3 (t − L)), IP2 (t − L − , π ) = s̆2 , IP3 (t − L − , π ) = s̆3 , this follows immediately from Lemma 9. PROOF OF THEOREM 8: The proof is similar to that of Theorem 7. We only present the proof for the case CL, the case CM can be done in the Naval Research Logistics DOI 10.1002/nav 644 Naval Research Logistics, Vol. 62 (2015) same way. According to Theorem 6, we can assume that an NHB admissible policy π is used. By Proposition 6, for any π ∈ N , we have C(t, π ) = Eϕ(IP1 − D1 (t), IP2 − D2 (t), IP3 − D3 (t)). (E-56) Next, we move to the upper bound. Similar to (E-35) in the proof of Proposition 5, we have C(t, π ) = h1 · (IP1 − D1 (t)) + h2 · (IP2 − D2 (t)) + h3 · (IP3 − D3 (t)) + (h1 + h3 + b13 ) · (IP3 − D3 (t))− + (h1 + h2 + b12 ) Let · (IP2 − D2 (t))− + h1 · ((IP1 − D1 (t)) + (IP2 − D2 (t))− Ĝ1 (x, y) = EĜ(x − D1 (t − L1 , t − L3 ], y − D2 (t − L3 )), Ĝ2 (x) = minĜ1 (x, y), (E-57) + (IP3 − D3 (t))− )− + (h3 + b13 ) · B113 (t, π ) + (h2 + b12 ) (E-58) · B112 (t, π ) y Ĉ = minEĜ2 (x − D1 (t − L1 , t − L2 ]), ≤ h1 · (IP1 − D1 (t)) + h2 · (IP2 − D2 (t)) + h3 · (IP3 − D3 (t)) (E-59) x As L3 ≤ L2 ≤ L1 , we have Ft−L1 ⊆ Ft−L2 ⊆ Ft−L3 , by the tower property of conditional expectations, we have + (h1 + h3 + b13 ) · (IP3 − D3 (t))− + (h1 + h2 + b12 ) · (IP2 − D2 (t))− + h1 · ((IP1 − D1 (t)) + (IP3 − D3 (t))− + (IP2 − D2 (t))− )− + d̄ · (B113 (t, π ) + B112 (t, π )) = h1 · (IP1 − D1 (t)) + h2 · (IP2 − D2 (t)) + h3 · (IP3 − D3 (t)) E[C(t, π )|Ft−L1 ] = E{E[E(ϕ(IP1 − D1 (t), IP2 − D2 (t), + (h1 + h3 + b13 ) · (IP3 − D3 (t))− + (h1 + h2 + b12 ) IP3 − D3 (t))|Ft−L3 )|Ft−L2 ]|Ft−L1 }. · (IP2 − D2 (t))− + (h1 + d̄) · ((IP1 − D1 (t)) As IP3 , IP2 − D2 (t − L3 ) and IP1 − D1 (t − L3 ) are adapted to Ft−L3 , and D3 (t), D2 (t) and D1 (t) are independent of Ft−L3 , we have + (IP2 − D2 (t))− + (IP3 − D3 (t))− )− ≤ h1 · (IP1 − D1 (t)) + h̄2 · (IP2 − D2 (t)) + h̄3 · (IP3 − D3 (t)) + (h1 + d̄) · ((IP1 − D1 (t))− ∨ [(IP2 − D2 (t))− E(ϕ(IP1 − D1 (t), IP2 − D2 (t), IP3 − D3 (t))|Ft−L3 ) + (IP3 − D3 (t))− ]). = G(IP1 − D1 (t − L3 ), IP2 − D2 (t − L3 ), IP3 ) ≥ minG(IP1 − D1 (t − L3 ), IP2 − D2 (t − L3 ), y3 ) Then, similar to the proof for the lower bound, we get C ∗ ≤ C̄ ∗ . y3 = Ĝ(IP1 − D1 (t − L3 ), IP2 − D2 (t − L3 )), in which the first equation is from the equation (61) and the fact that Di (t) is a stationary process, and the last equation is from the definition of Ĝ(·). Taking a further expectation, conditioning on Ft−L2 , yields E[Ĝ(IP1 − D1 (t − L3 ), IP2 − D2 (t − L3 ))|Ft−L2 ] = Ĝ1 (IP1 − D1 (t − L1 , t − L2 ], IP2 ) ACKNOWLEDGMENT We would like to thank the Associate Editor and three anonymous referees for their valuable suggestions and corrections. We would also like to thank Awi Federgruen and Paul Zipkin for their helpful comments. This research was supported in part by the National Natural Science Foundation of China under award 70328001 and 70731003. ≥ minĜ1 (IP1 − D1 (t − L1 , t − L2 ], y2 ) y2 REFERENCES = Ĝ2 (IP1 − D1 (t − L1 , t − L2 ]), where the first equation is because IP2 and IP1 − D1 (t − L1 , t − L2 ] are adapted to Ft−L2 , D2 (t − L3 ) and D1 (t − L2 , t − L3 ] are independent of Ft−L2 , and (E-57). The last equation is from (E-58). Similarly, we get E[Ĝ2 (IP1 − D1 (t − L1 , t − L2 ])|Ft−L1 ] ≥ Ĉ. Thus, we have that for t ≥ 0 and π , C(t, π ) = E[E(C(t, π )|Ft−L1 )] ≥ Ĉ. From the above, it is easily to see that Ĉ is exactly the same as (69). PROOF OF PROPOSITION 7: Similar to (E-34) in the proof of Proposition 5, we have that for any feasible π ∈ A, C(t, π ) ≥ h1 · (IP1 − D1 (t)) + h2 · (IP2 − D2 (t)) + h3 · (IP3 − D3 (t)) + (h1 + d) · ((IP1 − D1 (t))− ∨ [(IP2 − D2 (t))− + (IP3 − D3 (t))− ]). The last expression above is just the inventory cost at time t of another system which is identical with symmetric costs with b12 + h2 = b13 + h3 = d. Hence, similar to Theorem 8, the lower bound is obtained. Naval Research Logistics DOI 10.1002/nav [1] H. Arslan, S. Graves, and T. Roemer, A single-product inventory model for multiple demand classes, Manag Sci 53 (2007), 1486–1500. [2] B. Ata and S. Kumar, Heavy traffic analysis of open processing networks with complete resource pooling: Asymptotic optimality of discrete review policies, Ann Appl Probab 15 (2005), 331–391. [3] N. Agrawal and M. Cohen, Optimal material control and performance evaluation in an assembly environment with component commonality, Nav Res Logist 48 (2001), 409–429. [4] Y. Akcay and S. H. Xu, Joint inventory replenishment and component allocation optimization in an assemble-to-order system, Manag Sci 50 (2004), 99–116. [5] S. Benjaafar and M. Elhafsi, Production and inventory control of a single product assemble-to-order system with multiple customer classes, Manag Sci 52 (2006), 1896–1912. [6] F. Chen and Y.S. Zheng, Lower bounds for multi-echelon stochastic inventory systems, Manag Sci 40 (1994), 1426–1443. [7] V. Deshpande, M.A. Cohen, and K. Donohue, A threshold inventory rationing policy for service-differentiated demand classes, Manag Sci 49 (2003), 683–703. [8] M. Dogru, M. Reiman, and Q. Wang, A stochastic programming based inventory policy for assemble-to-order systems with application to the W model, Oper Res 58 (2010), 849–864. Lu, Song, and Zhang: Optimal and Asymptotically Optimal Policies [9] M. El Hafsi, H. Camus, and E. Craye, Optimal control of a nested-multiple-product assemble-to-order system, Int J Prod Res 46 (2008), 5367–5392. [10] A. Ha, Inventory rationing in a make-to-stock production system with several demand classes and lost sales, Manag Sci 43 (1997a), 1093–1103. [11] A. Ha, Stock rationing policy for a make-to-stock production system with two priority classes and backordering, Nav Res Logist 43 (1997b), 458–472. [12] W.H. Hausman, H.L. Lee, and A.X. Zhang, Joint demand fulfillment probability in a multi-item inventory system with independent order-up-to policies, Eur J Oper Res 109 (1998), 646–659. [13] D.P. Heyman and M.J. Sobel, Stochastic models in operations research, Vol. 2, Dover Publications, Mineola, NY, 1984. [14] K. Huang and T. de Kok, Cost minimization schemes of component allocation in an assemble-to-order system, Naval Research Logistics 62 (2015), 158–169. [15] R. Kapuscinski, R. Zhang, P. Carbonneau, R. Moore, and B. Reeves, Inventory decisions in Dell’s supply chain, Interfaces 34 (2004), 191–205. [16] L. Lu, J. Song, and H. Zhang, Optimal and near-optimal policies for assemble-to-order systems, Fuqua School of Business, Duke University, 2010a. [17] Y. Lu and J. Song, Order-based cost optimization in assembleto-order systems, Oper Res 53 (2005), 151–169. [18] Y. Lu, J. Song, and D.D. Yao, Order fill rate, lead-time variability and advance demand information in an assemble-to-order system, Oper Res 51 (2003), 292–308. [19] Y. Lu, J. Song, and D.D. Yao, Backorder minimization in multiproduct assemble-to-order systems, IIE Trans 37 (2005), 763–774. [20] Y. Lu, J. Song, and Y. Zhao, No-holdback allocation rules for continuous-time assemble-to-order systems, Oper Res 58 (2010b), 691–705. [21] E. Nadar, M. Akan, and A. Scheller-Wolf, Technical note: Optimal structural results for assemble-to-order generalized M-systems, Oper Res 62 (2014), 571–579. [22] E. Plambeck, Asymptotically optimal control for an assemble-to-order system with capacitated component production and fixed transport costs, Oper Res 56 (2008), 1158– 1171. [23] E. Plambeck and A. Ward, Optimal control of a highvolume assemble-to-order system, Math Oper Res 31 (2006), 453–471. [24] E. Plambeck and A. Ward, Note: A separation principle for a class of assemble-to-order systems with expediting, Oper Res 55 (2007), 603–609. [25] M. Reiman and Q. Wang, A stochatic program based lower bound for assemble-to-order systems, Oper Res Lett 40 (2012), 89–95. 645 [26] M. Reiman and Q. Wang, Asymptotically optimal inventory control for assemble-to-order systems with identical lead times, Oper Res 63 (2015), 716–732. [27] R. Rockafellar, Convex analysis, Princeton University Press, Princeton, NY, 1968. [28] K. Rosling, Optimal inventory policies for assembly systems under random demands, Oper Res 37 (1989), 565–579. [29] J. Song, The effect of lead time uncertainty in a simple stochastic inventory model, Manag Sci 40 (1994), 603–612. [30] J. Song, On the order fill rate in a multi-item, base-stock inventory system, Oper Res 46 (1998), 831–845. [31] J. Song, Order-based backorders and their implications in multi-item inventory systems, Manag Sci 48 (2002), 499–516. [32] J. Song, S. Xu, and B. Liu, Order-fulfillment performance measures in an assemble-to-order system with stochastic lead times, Oper Res 47 (1999), 131–149. [33] J. Song and D.D. Yao, Performance analysis and optimization in assemble-to-order systems with random lead-times, Oper Res 50 (2002), 889–903. [34] J. Song and P. Zipkin, “Supply chain operations: Assemble-toorder systems,” in: T. de Kok and S. Graves (Editors), Handbooks in operations research and management science, Chapter 11, Vol. 11: Supply Chain Management, North-Holland Publisher, 2003. [35] J. Song and Y. Zhao, The value of component commonality in a dynamic inventory system with lead times, Manuf Serv Oper Manag 11 (2009), 493–508. [36] D. Topkis, Optimal ordering and rationing policies in a nonstationary dynamic inventory model with n demand classes, Manag Sci 15 (1968), 160–176. [37] W. van Jaarsveld and A. Scheller-Wolf, Optimization of industrial-scale assemble-to-order systems, Journal of Computing 27 (2015), 544–560. [38] F. de Vericourt, F. Karaesmen, and Y. Dallery, Optimal stock allocation for a capacitated supply system, Management Sci 48 (2002), 1486–1501. [39] H. Wan and Q. Wang, Asymptotically-optimal component allocation for assemble-to-order production inventory systems, Oper Res Lett 43 (2015), 304–310. [40] P. Xu, R. Allgor, and S. Graves, Benefits of reevaluating realtime order fulfillment decisions, Manuf Serv Oper Manag 11 (2009), 340–355. [41] A. Zhang, Demand fulfillment rates in an assemble-to-order system with multiple products and dependent demands, Prod Oper Manag 6 (1997), 309–323. [42] Y. Zhao and D. Simchi-Levi, Performance analysis and evaluation of assemble-to-order systems with stochastic sequential lead times, Oper Res 54 (2006), 706–724. Naval Research Logistics DOI 10.1002/nav
© Copyright 2026 Paperzz